Unicode Character Database

E26667

The Unicode Character Database is a comprehensive collection of machine-readable data files that define the properties, classifications, and behaviors of every character encoded in the Unicode Standard.

Jump to: Surface forms Statements Referenced by

Observed surface forms (3)


Statements (68)

Predicate Object
instanceOf machine-readable data collection
technical standard component
accessURL https://www.unicode.org/Public/UCD/
covers every encoded Unicode character
dataFormat machine-readable files
plain text files
defines Unicode character behaviors
Unicode Character Database self-linksurface differs
surface form: Unicode character classifications

Unicode character properties
definesProperty Bidi_Class
Block
Canonical_Combining_Class
Case_Folding
Decomposition_Mapping
East_Asian_Width
General_Category
Grapheme_Cluster_Break
Line_Break
Lowercase_Mapping
Normalization_Quick_Check
Numeric_Value
Script
Sentence_Break
Titlecase_Mapping
Uppercase_Mapping
Word_Break
documentedIn Unicode Standard Annexes
surface form: Unicode Standard Annex #44
hasAbbreviation UCD
includesFile ArabicShaping.txt
BidiMirroring.txt
Blocks.txt
CaseFolding.txt
DerivedAge.txt
DerivedCoreProperties.txt
DerivedNormalizationProps.txt
EastAsianWidth.txt
HangulSyllableType.txt
IndicPositionalCategory.txt
IndicSyllabicCategory.txt
Jamo.txt
LineBreak.txt
NameAliases.txt
NormalizationProps.txt
PropList.txt
Scripts.txt
SpecialCasing.txt
UnicodeData.txt
introducedBy Unicode Technical Committee
surface form: Unicode Consortium technical committee
license Unicode License
maintainedBy Unicode Consortium
partOf Unicode
surface form: Unicode Standard
primaryAudience font and rendering engine developers
implementers of Unicode
software developers
provides default property values for characters
stability guarantees for properties
scope all assigned Unicode code points
some unassigned code points with default properties
UAXNumber Unicode Character Database self-linksurface differs
surface form: UAX #44
updatedWhen new Unicode version is released
usedFor bidirectional text algorithms
case conversion
collation
line breaking algorithms
normalization
text processing
text rendering
versionedWith Unicode 15.0
surface form: Unicode Standard versions

Referenced by (7)

Full triples — surface form annotated when it differs from this entity's canonical label.

Unicode Character Database UAXNumber Unicode Character Database self-linksurface differs
this entity surface form: UAX #44
Unicode Technical Report #29 compatibleWith Unicode Character Database
Unicode Character Database defines Unicode Character Database self-linksurface differs
this entity surface form: Unicode character classifications
Unicode 15.0 documentedIn Unicode Character Database
Canadian Aboriginal syllabics hasUnicodeBlock Unicode Character Database
this entity surface form: Unified Canadian Aboriginal Syllabics
Unicode includesDatabase Unicode Character Database
Unicode Consortium maintainsStandard Unicode Character Database