Unicode Character Database
E26667
The Unicode Character Database is a comprehensive collection of machine-readable data files that define the properties, classifications, and behaviors of every character encoded in the Unicode Standard.
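Many language runtimes ship data derived from these files. As a minimal sketch, assuming Python's standard `unicodedata` module (built from one UCD release, which varies with the interpreter), a few of the properties can be looked up per character. The helper name `describe` is hypothetical and exists only for this illustration.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Return a few UCD-derived properties for a single character.

    `describe` is a hypothetical helper for illustration only.
    """
    return {
        "code_point": f"U+{ord(ch):04X}",
        # Some code points have no name; fall back to a placeholder.
        "name": unicodedata.name(ch, "<unnamed>"),
        "General_Category": unicodedata.category(ch),
        "Bidi_Class": unicodedata.bidirectional(ch),
        "Canonical_Combining_Class": unicodedata.combining(ch),
        "East_Asian_Width": unicodedata.east_asian_width(ch),
        "Decomposition_Mapping": unicodedata.decomposition(ch),
    }


if __name__ == "__main__":
    # The bundled UCD version depends on the Python release in use.
    print("UCD version in this interpreter:", unicodedata.unidata_version)
    for ch in ("A", "\u00e9", "\u0301"):
        print(describe(ch))
```

The values returned reflect whichever Unicode version the interpreter was built against, so newly encoded characters may be reported as unassigned on older runtimes.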
Observed surface forms (3)
| Surface form | Occurrences |
|---|---|
| UAX #44 | 1 |
| Unicode character classifications | 1 |
| Unified Canadian Aboriginal Syllabics | 1 |
Statements (68)
| Predicate | Object |
|---|---|
| instanceOf | machine-readable data collection; technical standard component |
| accessURL | https://www.unicode.org/Public/UCD/ |
| covers | every encoded Unicode character |
| dataFormat | machine-readable files; plain text files |
| defines | Unicode character behaviors; Unicode Character Database (self-link, surface form: Unicode character classifications); Unicode character properties |
| definesProperty | Bidi_Class; Block; Canonical_Combining_Class; Case_Folding; Decomposition_Mapping; East_Asian_Width; General_Category; Grapheme_Cluster_Break; Line_Break; Lowercase_Mapping; Normalization_Quick_Check; Numeric_Value; Script; Sentence_Break; Titlecase_Mapping; Uppercase_Mapping; Word_Break |
| documentedIn | Unicode Standard Annexes (surface form: Unicode Standard Annex #44) |
| hasAbbreviation | UCD |
| includesFile | ArabicShaping.txt; BidiMirroring.txt; Blocks.txt; CaseFolding.txt; DerivedAge.txt; DerivedCoreProperties.txt; DerivedNormalizationProps.txt; EastAsianWidth.txt; HangulSyllableType.txt; IndicPositionalCategory.txt; IndicSyllabicCategory.txt; Jamo.txt; LineBreak.txt; NameAliases.txt; NormalizationProps.txt; PropList.txt; Scripts.txt; SpecialCasing.txt; UnicodeData.txt |
| introducedBy | Unicode Technical Committee (surface form: Unicode Consortium technical committee) |
| license | Unicode License |
| maintainedBy | Unicode Consortium |
| partOf | Unicode (surface form: Unicode Standard) |
| primaryAudience | font and rendering engine developers; implementers of Unicode; software developers |
| provides | default property values for characters; stability guarantees for properties |
| scope | all assigned Unicode code points; some unassigned code points with default properties |
| UAXNumber | Unicode Character Database (self-link, surface form: UAX #44) |
| updatedWhen | new Unicode version is released |
| usedFor | bidirectional text algorithms; case conversion; collation; line breaking algorithms; normalization; text processing; text rendering |
| versionedWith | Unicode 15.0 (surface form: Unicode Standard versions) |
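To illustrate the plain-text format named in the dataFormat and includesFile rows, here is a minimal sketch that parses one record of UnicodeData.txt, a semicolon-separated file with 15 fields per line. The embedded sample is the record for U+0041. `UnicodeDataRecord`, `parse_record`, and `SAMPLE_LINE` are hypothetical names, only a subset of the fields is kept, and range records (names ending in `First>` or `Last>`) are not handled.

```python
from dataclasses import dataclass
from typing import Optional

# One record from UnicodeData.txt (U+0041), embedded to avoid network access.
SAMPLE_LINE = "0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;"


@dataclass(frozen=True)
class UnicodeDataRecord:
    """Hypothetical container for a subset of UnicodeData.txt fields."""
    code_point: int
    name: str
    general_category: str
    canonical_combining_class: int
    bidi_class: str
    decomposition: str
    bidi_mirrored: bool
    simple_uppercase: Optional[int]
    simple_lowercase: Optional[int]
    simple_titlecase: Optional[int]


def _hex_or_none(field: str) -> Optional[int]:
    """Parse a hexadecimal code point, or return None for an empty field."""
    return int(field, 16) if field else None


def parse_record(line: str) -> UnicodeDataRecord:
    """Split one UnicodeData.txt line into typed fields."""
    fields = line.rstrip("\n").split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return UnicodeDataRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        canonical_combining_class=int(fields[3]),
        bidi_class=fields[4],
        decomposition=fields[5],
        bidi_mirrored=(fields[9] == "Y"),
        simple_uppercase=_hex_or_none(fields[12]),
        simple_lowercase=_hex_or_none(fields[13]),
        simple_titlecase=_hex_or_none(fields[14]),
    )


if __name__ == "__main__":
    print(parse_record(SAMPLE_LINE))
```

A real loader would read the full file for a specific Unicode version from the directory given under accessURL and would also need to expand range records; this sketch covers only the single-line case.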
Referenced by (7)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form: UAX #44
this entity surface form: Unicode character classifications
this entity surface form: Unified Canadian Aboriginal Syllabics