Unicode 15.0
E71247
Unicode 15.0 is a version of the Unicode Standard that expanded the global character set with additional scripts, symbols, and emoji to improve digital text representation across diverse languages.
All labels observed (3)
| Label | Occurrences |
|---|---|
| Unicode 14.0 | 1 |
| Unicode 15.0 canonical | 1 |
| Unicode Standard versions | 1 |
How this entity was disambiguated
This entity first appeared as the object of triple T569238 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.
Target entity: Unicode 15.0 Context triple: [Kawi script, UnicodeStandardVersionAdded, Unicode 15.0]
-
A.
Unicode Technical Standard #10
Unicode Technical Standard #10 is the specification that defines the Unicode Collation Algorithm, providing a standardized method for comparing and sorting Unicode text across languages and platforms.
-
B.
Unicode Technical Report #29
Unicode Technical Report #29 is the specification that defines how to determine and segment user-perceived text elements (grapheme clusters), words, and sentences in Unicode text.
-
C.
Unicode Consortium
The Unicode Consortium is a non-profit organization that standardizes the representation of text and symbols in digital systems worldwide through the Unicode Standard.
-
D.
Unicode Character Database
The Unicode Character Database is a comprehensive collection of machine-readable data files that define the properties, classifications, and behaviors of every character encoded in the Unicode Standard.
-
E.
Unicode
Unicode is a universal character encoding standard that assigns unique code points to virtually all written scripts, symbols, and emojis used in modern computing.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Target entity: Unicode 15.0 Target entity description: Unicode 15.0 is a version of the Unicode Standard that expanded the global character set with additional scripts, symbols, and emoji to improve digital text representation across diverse languages.
-
A.
Unicode Technical Standard #10
Unicode Technical Standard #10 is the specification that defines the Unicode Collation Algorithm, providing a standardized method for comparing and sorting Unicode text across languages and platforms.
-
B.
Unicode Technical Report #29
Unicode Technical Report #29 is the specification that defines how to determine and segment user-perceived text elements (grapheme clusters), words, and sentences in Unicode text.
-
C.
Unicode Consortium
The Unicode Consortium is a non-profit organization that standardizes the representation of text and symbols in digital systems worldwide through the Unicode Standard.
-
D.
Unicode Character Database
The Unicode Character Database is a comprehensive collection of machine-readable data files that define the properties, classifications, and behaviors of every character encoded in the Unicode Standard.
-
E.
Unicode
Unicode is a universal character encoding standard that assigns unique code points to virtually all written scripts, symbols, and emojis used in modern computing.
- F. None of above. chosen
Statements (56)
| Predicate | Object |
|---|---|
| instanceOf | version of the Unicode Standard ⓘ |
| addsCharactersCount | 4488 ⓘ |
| addsEmoji |
donkey
ⓘ
flute ⓘ folding hand fan ⓘ ginger ⓘ goose ⓘ grey heart ⓘ hair pick ⓘ hyacinth ⓘ jellyfish ⓘ khanda ⓘ light blue heart ⓘ maracas ⓘ moose ⓘ pea pod ⓘ pink heart ⓘ shaking face ⓘ wing ⓘ wireless ⓘ |
| addsEmojiCharactersCount | 20 ⓘ |
| addsEmojiSequencesCount | 11 ⓘ |
| addsEmojiZwjSequencesCount | 0 ⓘ |
| addsScript |
Kawi
ⓘ
Mundari ⓘ
surface form:
Nag Mundari
|
| addsScriptsCount | 2 ⓘ |
| aimsTo | support digital text representation across diverse languages ⓘ |
| defines |
bidirectional behavior
ⓘ
character properties ⓘ code points ⓘ collation data ⓘ line breaking rules ⓘ normalization forms ⓘ |
| documentedIn |
Unicode Character Database
ⓘ
Unicode Standard Annexes ⓘ |
| followedBy | Unicode 15.1 ⓘ |
| follows |
Unicode 15.0
self-linksurface differs
ⓘ
surface form:
Unicode 14.0
|
| hasPrimaryFormat |
PDF
ⓘ
online documentation ⓘ |
| includes |
Arabic script characters
ⓘ
Cyrillic script characters ⓘ Han script characters ⓘ Latin script characters ⓘ currency symbols ⓘ emoji ⓘ mathematical symbols ⓘ punctuation ⓘ symbols ⓘ |
| partOf |
Unicode
ⓘ
surface form:
Unicode Standard
|
| publisher | Unicode Consortium ⓘ |
| releaseDate | 2022-09-13 ⓘ |
| releaseYear | 2022 ⓘ |
| standardizes | character encoding ⓘ |
| status | published ⓘ |
| totalCharactersCount | 149186 ⓘ |
| versionNumber | 15.0 ⓘ |
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10. # Requirements - If you don't know the subject at all, return an empty list. - If the subject is not a named entity, return an empty list. - Include at least one triple where predicate is "instanceOf". - Do not get too wordy. - Separate several objects into multiple triples with one object.
Subject: Unicode 15.0 Description of subject: Unicode 15.0 is a version of the Unicode Standard that expanded the global character set with additional scripts, symbols, and emoji to improve digital text representation across diverse languages.
Referenced by (3)
Full triples — surface form annotated when it differs from this entity's canonical label.