Unicode Scalar Values

E23920

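Unicode scalar values are all Unicode code points except the surrogate code points, that is, the ranges U+0000–U+D7FF and U+E000–U+10FFFF (1,112,064 values in total). Each scalar value maps to at most one abstract character, and every scalar value can be represented in UTF-8, UTF-16, and UTF-32.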
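As a minimal sketch of the definition, the check below treats an integer as a scalar value when it lies in the code point range and outside the surrogate block. The function and constant names are illustrative assumptions, not part of any standard API.

```python
# Boundaries taken from the ranges stated on this page.
MAX_CODE_POINT = 0x10FFFF   # last Unicode code point
SURROGATE_FIRST = 0xD800    # first surrogate code point
SURROGATE_LAST = 0xDFFF     # last surrogate code point


def is_scalar_value(cp: int) -> bool:
    """Return True if cp is a code point that is not a surrogate."""
    in_code_space = 0 <= cp <= MAX_CODE_POINT
    is_surrogate = SURROGATE_FIRST <= cp <= SURROGATE_LAST
    return in_code_space and not is_surrogate
```

A check of this kind is what an API might apply before accepting an integer as a character, for example rejecting 0xD800 while accepting 0xE000.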


Observed surface forms (1)

Surface form Occurrences
Unicode code points 1

Statements (48)

Predicate Object
instanceOf Unicode concept
set of Unicode code points
areBasisFor Unicode string models in many languages
areDefinedIn Unicode Core Specification
Unicode Standard Annexes
areDisjointWith Unicode surrogate code points
areDomainOf Unicode character properties
areEncodedAs 1 or 2 code units in UTF-16
1 to 4 code units in UTF-8
areInterpretedAs 32-bit integer values in UTF-32
areSubsetOf Unicode Scalar Values (self-link; surface differs)
surface form: Unicode code points
areSupersetOf Unicode assigned characters
Unicode noncharacters
Unicode reserved code points
areUsedTo avoid surrogate misuse in text processing
define character boundaries in some languages
define valid Unicode strings
specify allowed code points in APIs
cardinality 1,112,064
conceptualizedAs valid Unicode code point excluding surrogates
definedBy Unicode
surface form: Unicode Standard
endRange U+10FFFF
excludeRange U+D800–U+DFFF
excludes Unicode surrogate code points
haveAbbreviation Unicode scalars
scalar values
haveProperty each value maps to at most one abstract character
no value is a surrogate code point
includeRange U+0000–U+D7FF
U+E000–U+10FFFF
includes all Unicode code points except surrogates
relatedTo Unicode code points
Unicode scalar type
startRange U+0000
usedBy UTF-16
UTF-32
UTF-8
usedIn Unicode character properties
Unicode collation
Unicode encoding forms
Unicode normalization
Unicode text processing
usedInProgrammingLanguage Dart
Julia
Kotlin
Rust
Swift
usedTo identify abstract characters
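The cardinality and areEncodedAs statements above can be cross-checked with a short sketch. It assumes the two includeRange intervals are inclusive and uses Python's built-in codecs to count code units; the helper names `scalar_value_count` and `code_units` are hypothetical.

```python
# Inclusive ranges from the includeRange statements.
SCALAR_RANGES = [(0x0000, 0xD7FF), (0xE000, 0x10FFFF)]


def scalar_value_count() -> int:
    """Count the scalar values by summing the sizes of both ranges."""
    return sum(hi - lo + 1 for lo, hi in SCALAR_RANGES)


def code_units(cp: int) -> dict:
    """Return the number of code units needed for cp in each encoding form."""
    if not any(lo <= cp <= hi for lo, hi in SCALAR_RANGES):
        raise ValueError(f"U+{cp:04X} is not a Unicode scalar value")
    ch = chr(cp)
    return {
        "utf8": len(ch.encode("utf-8")),            # 8-bit code units
        "utf16": len(ch.encode("utf-16-le")) // 2,  # 16-bit code units, no BOM
        "utf32": len(ch.encode("utf-32-le")) // 4,  # 32-bit code units, no BOM
    }


if __name__ == "__main__":
    print(scalar_value_count())
    for cp in (0x41, 0x20AC, 0x1F600):
        print(f"U+{cp:04X}", code_units(cp))
```

The little-endian codec variants are chosen only because they emit no byte order mark, so the byte length divides evenly into code units.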

Referenced by (2)

Full triples — surface form annotated when it differs from this entity's canonical label.

Unicode Scalar Values areSubsetOf Unicode Scalar Values (self-link; surface differs)
this entity surface form: Unicode code points
Unicode provides Unicode Scalar Values