Unicode Scalar Values
E23920
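Unicode scalar values are all Unicode code points except the surrogate code points (U+D800–U+DFFF), that is, the ranges U+0000–U+D7FF and U+E000–U+10FFFF (1,112,064 values in total). They are the values that Unicode encoding forms represent, and each one identifies at most one abstract character in the Unicode Standard.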
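The definition reduces to a range check plus a subtraction. The following is a minimal sketch, not part of the source entry: the function and constant names are hypothetical, and the range boundaries are the ones listed in the statements for this entity.

```python
# Illustrative names only; the boundaries come from the Unicode code space
# (U+0000..U+10FFFF) and the surrogate block (U+D800..U+DFFF).
MAX_CODE_POINT = 0x10FFFF
SURROGATE_FIRST = 0xD800
SURROGATE_LAST = 0xDFFF


def is_scalar_value(cp: int) -> bool:
    """Return True if cp is a code point outside the surrogate range."""
    in_code_space = 0 <= cp <= MAX_CODE_POINT
    is_surrogate = SURROGATE_FIRST <= cp <= SURROGATE_LAST
    return in_code_space and not is_surrogate


def scalar_value_count() -> int:
    """Count scalar values: all code points minus the surrogate code points."""
    total_code_points = MAX_CODE_POINT + 1                 # 1,114,112
    surrogate_count = SURROGATE_LAST - SURROGATE_FIRST + 1  # 2,048
    return total_code_points - surrogate_count
```

Under these assumptions, `scalar_value_count()` reproduces the cardinality of 1112064 recorded for this entity.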
Observed surface forms (1)
| Surface form | Occurrences |
|---|---|
| Unicode code points | 1 |
Statements (48)
| Predicate | Object |
|---|---|
| instanceOf | Unicode concept; set of Unicode code points |
| areBasisFor | Unicode string models in many languages |
| areDefinedIn | Unicode Core Specification; Unicode Standard Annexes |
| areDisjointWith | Unicode surrogate code points |
| areDomainOf | Unicode character properties |
| areEncodedAs | 1 or 2 code units in UTF-16; 1 to 4 code units in UTF-8 |
| areInterpretedAs | 32-bit integer values in UTF-32 |
| areSubsetOf | Unicode Scalar Values (self-link; surface form: Unicode code points) |
| areSupersetOf | Unicode assigned characters; Unicode noncharacters; Unicode reserved code points |
| areUsedTo | avoid surrogate misuse in text processing; define character boundaries in some languages; define valid Unicode strings; specify allowed code points in APIs |
| cardinality | 1112064 |
| conceptualizedAs | valid Unicode code point excluding surrogates |
| definedBy | Unicode (surface form: Unicode Standard) |
| endRange | U+10FFFF |
| excludeRange | U+D800–U+DFFF |
| excludes | Unicode surrogate code points |
| haveAbbreviation | Unicode scalars; scalar values |
| haveProperty | each value maps to at most one abstract character; no value is a surrogate code point |
| includeRange | U+0000–U+D7FF; U+E000–U+10FFFF |
| includes | all Unicode code points except surrogates |
| relatedTo | Unicode code points; Unicode scalar type |
| startRange | U+0000 |
| usedBy | UTF-16; UTF-32; UTF-8 |
| usedIn | Unicode character properties; Unicode collation; Unicode encoding forms; Unicode normalization; Unicode text processing |
| usedInProgrammingLanguage | Dart; Julia; Kotlin; Rust; Swift |
| usedTo | identify abstract characters |
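The areEncodedAs and areInterpretedAs statements can be illustrated with a short sketch that computes how many code units a scalar value occupies in each encoding form. The helper names are hypothetical and not part of this entry. The thresholds are the standard UTF-8 and UTF-16 boundaries, and surrogate code points are rejected because they are not scalar values.

```python
def _require_scalar(cp: int) -> None:
    """Raise ValueError unless cp is a Unicode scalar value (hypothetical helper)."""
    if not (0 <= cp <= 0x10FFFF) or 0xD800 <= cp <= 0xDFFF:
        raise ValueError(f"not a Unicode scalar value: {cp:#x}")


def utf8_code_units(cp: int) -> int:
    """Number of 8-bit code units (1 to 4) used for cp in UTF-8."""
    _require_scalar(cp)
    if cp <= 0x7F:
        return 1
    if cp <= 0x7FF:
        return 2
    if cp <= 0xFFFF:
        return 3
    return 4


def utf16_code_units(cp: int) -> int:
    """Number of 16-bit code units (1 or 2) used for cp in UTF-16."""
    _require_scalar(cp)
    # Values above U+FFFF are encoded as a surrogate pair (two code units).
    return 1 if cp <= 0xFFFF else 2


def utf32_code_unit(cp: int) -> int:
    """In UTF-32 the single 32-bit code unit is the scalar value itself."""
    _require_scalar(cp)
    return cp
```

The results can be cross-checked against Python's built-in codecs, for example by comparing `utf8_code_units(0x20AC)` with `len(chr(0x20AC).encode("utf-8"))`.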
Referenced by (2)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form: Unicode code points