The Unicode Standard

E568049

character encoding standard international standard

The Unicode Standard is a universal character encoding system that assigns unique code points to text and symbols from virtually all writing systems, enabling consistent digital representation and interchange of written language worldwide.

Try in SPARQL Jump to: Surface forms Disambiguation Statements Elicitation Referenced by

All labels observed (7)

Label	Occurrences
The Unicode Standard canonical	2
The Unicode Standard, Version 3.0 (book)	1
The Unicode Standard, Version 4.1.0	1
Unicode Standard is revised	1
Unicode Standard, Section on Private-Use Characters and Planes	1
Unicode standard	1
the core Unicode Standard	1

How this entity was disambiguated

This entity first appeared as the object of triple T6096376 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07

Target entity: The Unicode Standard
Context triple: [Grantha (Unicode block), definedIn, The Unicode Standard]

A. Unicode Technical Standard #10
Unicode Technical Standard #10 is the specification that defines the Unicode Collation Algorithm, providing a standardized method for comparing and sorting Unicode text across languages and platforms.
B. Unicode Standard Annexes
Unicode Standard Annexes are supplementary technical reports that define detailed specifications, algorithms, and guidelines extending and clarifying the core Unicode Standard.
C. Unicode Technical Standard #35
Unicode Technical Standard #35 is a Unicode Consortium specification that defines the Locale Data Markup Language (LDML) and related mechanisms for internationalization, including formatting of dates, times, numbers, and other locale-sensitive data.
D. Unicode Consortium
The Unicode Consortium is a non-profit organization that standardizes the representation of text and symbols in digital systems worldwide through the Unicode Standard.
E. ISO/IEC 10646
ISO/IEC 10646 is an international standard that defines the Universal Coded Character Set (UCS), a comprehensive repertoire of characters used worldwide and closely aligned with the Unicode Standard.
F. None of above. chosen
G. Unsure - the case is ambiguous/there is not enough information to decide.

NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07

Target entity: The Unicode Standard
Target entity description: The Unicode Standard is a universal character encoding system that assigns unique code points to text and symbols from virtually all writing systems, enabling consistent digital representation and interchange of written language worldwide.

A. Unicode Technical Standard #10
Unicode Technical Standard #10 is the specification that defines the Unicode Collation Algorithm, providing a standardized method for comparing and sorting Unicode text across languages and platforms.
B. Unicode Standard Annexes
Unicode Standard Annexes are supplementary technical reports that define detailed specifications, algorithms, and guidelines extending and clarifying the core Unicode Standard.
C. Unicode Technical Standard #35
Unicode Technical Standard #35 is a Unicode Consortium specification that defines the Locale Data Markup Language (LDML) and related mechanisms for internationalization, including formatting of dates, times, numbers, and other locale-sensitive data.
D. Unicode Consortium
The Unicode Consortium is a non-profit organization that standardizes the representation of text and symbols in digital systems worldwide through the Unicode Standard.
E. ISO/IEC 10646
ISO/IEC 10646 is an international standard that defines the Universal Coded Character Set (UCS), a comprehensive repertoire of characters used worldwide and closely aligned with the Unicode Standard.
F. None of above. chosen

Statements (65)

Predicate	Object
instanceOf	character encoding standard ⓘ international standard ⓘ
alignedWith	ISO/IEC 10646 NERFINISHED ⓘ
appliesTo	data interchange ⓘ databases ⓘ digital text processing ⓘ operating systems ⓘ programming languages ⓘ virtually all writing systems ⓘ web technologies ⓘ
covers	compatibility characters ⓘ emoji ⓘ historic scripts ⓘ mathematical notation ⓘ modern scripts ⓘ musical notation symbols ⓘ punctuation ⓘ symbols ⓘ technical symbols ⓘ
defines	Unicode bidirectional algorithm NERFINISHED ⓘ Unicode blocks NERFINISHED ⓘ Unicode character properties ⓘ Unicode code points ⓘ Unicode collation algorithm NERFINISHED ⓘ Unicode normalization forms NERFINISHED ⓘ Unicode planes ⓘ Unicode scalar values NERFINISHED ⓘ canonical decomposition ⓘ case mapping rules ⓘ combining characters ⓘ compatibility decomposition ⓘ emoji properties ⓘ general category property ⓘ grapheme cluster ⓘ line breaking properties ⓘ numeric values for characters ⓘ script property ⓘ surrogate pairs ⓘ
documentType	technical standard ⓘ
enables	internationalization ⓘ localization ⓘ multilingual computing ⓘ
firstPlaneName	Basic Multilingual Plane NERFINISHED ⓘ
firstPlaneRange	0x0000–0xFFFF ⓘ
firstPublished	1991 ⓘ
fullName	The Unicode Standard NERFINISHED ⓘ
goal	consistent text interchange ⓘ interoperable text representation ⓘ universal character encoding ⓘ
hasSupplement	Unicode Standard Annexes NERFINISHED ⓘ Unicode Technical Reports NERFINISHED ⓘ Unicode Technical Standards NERFINISHED ⓘ
latestVersionPublisher	Unicode Consortium NERFINISHED ⓘ
maintainedBy	Unicode Consortium NERFINISHED ⓘ
maximumCodePoints	1114112 ⓘ
organizesCodeSpaceInto	17 planes ⓘ
primaryEncodingForms	UTF-16 NERFINISHED ⓘ UTF-32 ⓘ UTF-8 NERFINISHED ⓘ
relatedStandard	ISO/IEC 10646 NERFINISHED ⓘ
replaces	legacy character sets ⓘ
shortName	Unicode NERFINISHED ⓘ
usesCodeSpace	0x0000–0x10FFFF ⓘ
versioningScheme	major.minor.update ⓘ
website	https://www.unicode.org/standard/standard.html ⓘ

How these facts were elicited

Referenced by (8)

Full triples — surface form annotated when it differs from this entity's canonical label.

Mark → encodedIn → The Unicode Standard ⓘ

this entity surface form: Unicode standard

Grantha (U+11300–U+1137F) → definedIn → The Unicode Standard ⓘ

subject surface form: Grantha (Unicode block)

Unicode Standard Annexes → clarifies → The Unicode Standard ⓘ

this entity surface form: the core Unicode Standard

Unicode Standard Annexes → updatedWhen → The Unicode Standard ⓘ

this entity surface form: Unicode Standard is revised

Supplementary Private Use Area-A → documentation → The Unicode Standard ⓘ

this entity surface form: Unicode Standard, Section on Private-Use Characters and Planes

UCD → partOf → The Unicode Standard ⓘ

Unicode 4.1 → documentedIn → The Unicode Standard ⓘ

this entity surface form: The Unicode Standard, Version 4.1.0

Unicode 3.0 → documentedIn → The Unicode Standard ⓘ

this entity surface form: The Unicode Standard, Version 3.0 (book)

All labels observed (7)

How this entity was disambiguated Show

Statements (65)

How these facts were elicited Show

Referenced by (8)

How this entity was disambiguated

How these facts were elicited