Unicode

E3674

Unicode is a universal character encoding standard that assigns unique code points to virtually all written scripts, symbols, and emojis used in modern computing.


Statements (60)
Predicate Object
instanceOf character encoding standard
international standard
abbreviation Unicode
alignedWith ISO/IEC 10646
basicMultilingualPlaneRange U+0000 to U+FFFF
codeSpaceRange U+0000 to U+10FFFF
compatibleWith ISO/IEC 10646
defines bidirectional text behavior
character properties
code points
collation rules
encoding forms
grapheme cluster boundaries
line breaking rules
normalization forms
developedBy Unicode Consortium
documentationFormat multi-volume standard text and data files
firstPublished 1991
fullName The Unicode Standard
goal interoperability across platforms and languages
universal character set
hasEncodingForm UTF-16
UTF-32
UTF-8
hasGoverningBody Unicode Consortium
hasTechnicalReport Unicode Technical Report #29
hasTechnicalStandard Unicode Technical Standard #10
includesDatabase Unicode Character Database
initialVersion Unicode 1.0
latestVersion Unicode 15.1
license freely available standard
maintainedBy Unicode Consortium
mostCommonEncodingOnWeb UTF-8
numberOfPlanes 17
organizesInto planes
provides Unicode Scalar Values
replaces many legacy character encodings
supplementaryPlanesRange U+10000 to U+10FFFF
supports Arabic script
Chinese characters
Cyrillic script
Devanagari script
Greek script
Hebrew script
Japanese scripts
Korean Hangul
Latin script
currency symbols
emoji
historic scripts
mathematical symbols
musical notation symbols
punctuation
technical symbols
usedIn databases
modern operating systems
modern programming languages
web technologies
usesBitWidth 21-bit code space
versioningScheme major.minor

Referenced by (55)
Subject (surface form when different) Predicate
Basic Multilingual Plane ("Unicode Standard")
Devanagari Extended-A ("Unicode Standard")
Unicode 15.0 ("Unicode Standard")
Unicode Character Database ("Unicode Standard")
Unicode Technical Report #29 ("Unicode Standard")
partOf
Adobe InDesign
ICQ
Java
Perl
XML
supports
Georgian Supplement ("Unicode Standard")
Latin Extended-B
Latin-1 Supplement
assignedInStandard
Amharic
Kannada
Marathi language
hasDigitalSupport
Hangul
Sundanese script
encodingStandard
Sinhala script
Thaana script
hasEncodingStandard
ISO/IEC 10646 ("Unicode Standard")
Unicode Technical Standard #10 ("Unicode Standard")
relatedStandard
Cyrillic Extended-B
Devanagari Extended-A
standard
KORMARC
XML
usesCharacterEncoding
Unicode
abbreviation
ISO/IEC 10646 ("Unicode Standard")
alignedWith
Unicode Technical Standard #10 ("Unicode text")
appliesTo
MathML
compatibleWith
Unicode Scalar Values ("Unicode Standard")
definedBy
Lao script
digitalEncodingStandard
Latin alphabet
encodedIn
Grantha script
encodedInStandard
XHTML
encoding
Unicode ("The Unicode Standard")
fullName
Icelandic
hasDigitalEncoding
Malayalam script
hasDigitalEncodingStandard
Baybayin
hasModernEncoding
ASCII
influenced
Unicode ("Unicode 1.0")
initialVersion
Basic Latin ("Unicode 1.0")
introducedInStandard
UTF-32 ("Unicode Standard")
isDefinedBy
ASCII
isSubsetOf
Unicode Consortium ("Unicode Standard")
maintainsStandard
Unicode Consortium ("Unicode Emoji List")
oversees
Unicode Consortium ("Unicode Standard")
primaryProduct
Cyrillic script ("Unicode Standard")
recognizedBy
Encoding Standard ("Unicode Standard")
relatedTo
Odia script ("Unicode Standard")
standardizedIn
Unicode Technical Standard #10
subject
ISO/IEC 8859-1
supersededInPracticeBy
Terminal (macOS)
supportsFeature
Nuskhuri
unicodeStandard
Unicode Technical Report #29 ("Unicode Standard")
updatedWithEachVersionOf

Please wait…