Encoding Standard

E48501

WHATWG specification character encoding specification web standard

The Encoding Standard is a WHATWG specification that defines how text is encoded and decoded on the web to ensure consistent character handling across browsers and platforms.

Try in SPARQL Jump to: Surface forms Disambiguation Statements Elicitation Referenced by

All labels observed (1)

Label	Occurrences
Encoding Standard canonical	5

How this entity was disambiguated

This entity first appeared as the object of triple T380424 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07

Target entity: Encoding Standard
Context triple: [WHATWG, develops, Encoding Standard]

A. Unicode Technical Standard #10
Unicode Technical Standard #10 is the specification that defines the Unicode Collation Algorithm, providing a standardized method for comparing and sorting Unicode text across languages and platforms.
B. ISO/IEC 10646
ISO/IEC 10646 is an international standard that defines the Universal Coded Character Set (UCS), a comprehensive repertoire of characters used worldwide and closely aligned with the Unicode Standard.
C. ASCII
ASCII is a widely used character encoding standard that represents text in computers and other devices using 7-bit numerical codes for letters, digits, punctuation, and control characters.
D. Unicode
Unicode is a universal character encoding standard that assigns unique code points to virtually all written scripts, symbols, and emojis used in modern computing.
E. Unicode Consortium
The Unicode Consortium is a non-profit organization that standardizes the representation of text and symbols in digital systems worldwide through the Unicode Standard.
F. None of above. chosen
G. Unsure - the case is ambiguous/there is not enough information to decide.

NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07

Target entity: Encoding Standard
Target entity description: The Encoding Standard is a WHATWG specification that defines how text is encoded and decoded on the web to ensure consistent character handling across browsers and platforms.

A. Unicode Technical Standard #10
Unicode Technical Standard #10 is the specification that defines the Unicode Collation Algorithm, providing a standardized method for comparing and sorting Unicode text across languages and platforms.
B. ISO/IEC 10646
ISO/IEC 10646 is an international standard that defines the Universal Coded Character Set (UCS), a comprehensive repertoire of characters used worldwide and closely aligned with the Unicode Standard.
C. ASCII
ASCII is a widely used character encoding standard that represents text in computers and other devices using 7-bit numerical codes for letters, digits, punctuation, and control characters.
D. Unicode
Unicode is a universal character encoding standard that assigns unique code points to virtually all written scripts, symbols, and emojis used in modern computing.
E. Unicode Consortium
The Unicode Consortium is a non-profit organization that standardizes the representation of text and symbols in digital systems worldwide through the Unicode Standard.
F. None of above. chosen

Statements (53)

Predicate	Object
instanceOf	WHATWG specification ⓘ character encoding specification ⓘ web standard ⓘ
aimsTo	ensure consistent character handling across browsers ⓘ ensure consistent character handling across platforms ⓘ standardize legacy encodings behavior ⓘ
defines	error handling for decoders ⓘ error handling for encoders ⓘ how text is decoded on the web ⓘ how text is encoded on the web ⓘ the algorithm for BOM sniffing ⓘ the algorithm for decoding a byte stream ⓘ the algorithm for encoding a string ⓘ the algorithm for getting an encoding from a label ⓘ the algorithm for handling invalid byte sequences ⓘ the concept of ASCII-compatible encodings ⓘ the concept of BOM sniffing ⓘ the concept of Big5 encoding ⓘ the concept of EUC-KR encoding ⓘ the concept of GBK encoding ⓘ the concept of ISO-2022-JP encoding ⓘ the concept of ISO-8859-15 encoding ⓘ the concept of ISO-8859-2 encoding ⓘ the concept of Shift_JIS encoding ⓘ the concept of UTF-16 encodings ⓘ the concept of UTF-16BE encoding ⓘ the concept of UTF-16LE encoding ⓘ the concept of UTF-8 encoding ⓘ the concept of decoders ⓘ the concept of encoders ⓘ the concept of encoding labels ⓘ the concept of legacy single-byte encodings ⓘ the concept of replacement character handling ⓘ the concept of replacement encoding ⓘ the concept of windows-1251 encoding ⓘ the concept of windows-1252 encoding ⓘ the concept of x-user-defined encoding ⓘ the encoding index data structures ⓘ the labels used to identify encodings ⓘ the mapping between code points and bytes for supported encodings ⓘ the set of encodings used on the web ⓘ
maintainedBy	WHATWG ⓘ
publisher	WHATWG ⓘ
relatedTo	HTML Living Standard ⓘ surface form: HTML Standard URL Standard ⓘ Unicode ⓘ surface form: Unicode Standard
scope	HTML user agents ⓘ web browsers ⓘ web platforms ⓘ
status	Living Standard ⓘ
title	Encoding Standard self-link ⓘ
url	https://encoding.spec.whatwg.org/ ⓘ
uses	Unicode code points ⓘ

How these facts were elicited

Referenced by (5)

Full triples — surface form annotated when it differs from this entity's canonical label.

WHATWG → develops → Encoding Standard ⓘ

Web Hypertext Application Technology Working Group → developsStandard → Encoding Standard ⓘ

Encoding Standard → title → Encoding Standard self-link ⓘ

Streams Standard → relatedSpecification → Encoding Standard ⓘ

URL Standard → relatedTo → Encoding Standard ⓘ

All labels observed (1)

How this entity was disambiguated Show

Statements (53)

How these facts were elicited Show

Referenced by (5)

How this entity was disambiguated

How these facts were elicited