UTF-32
E23921
UTF-32 is a fixed-length Unicode character encoding that represents each code point using 32 bits, providing simple indexing at the cost of higher memory usage.
All labels observed (2)
| Label | Occurrences |
|---|---|
| UTF-32 canonical | 4 |
| UTF-32 (conceptually, as 32-bit form) | 1 |
Statements (47)
| Predicate | Object |
|---|---|
| instanceOf |
Unicode transformation format
ⓘ
character encoding ⓘ fixed-length encoding ⓘ |
| BOMCodeUnit | 0x0000FEFF ⓘ |
| codeUnitSize | 4 bytes ⓘ |
| doesNotEncode | noncharacters outside Unicode range ⓘ |
| encodes | Unicode code points ⓘ |
| hasDisadvantage |
increased bandwidth usage
ⓘ
larger cache footprint ⓘ |
| hasEndianness |
big-endian
ⓘ
little-endian ⓘ |
| hasProperty |
direct mapping between code point and code unit
ⓘ
fixed-length code units ⓘ high memory usage per character ⓘ no surrogate pairs ⓘ simple indexing by code point ⓘ |
| hasVariant |
UTF-32BE
ⓘ
UTF-32LE ⓘ |
| introducedTo | provide simple mapping from code point index to memory offset ⓘ |
| isAlternativeTo |
UTF-1
ⓘ
UTF-16 ⓘ UTF-7 ⓘ UTF-8 ⓘ |
| isCommonlyUsedIn |
some programming language runtimes
ⓘ
some text processing libraries ⓘ |
| isCompatibleWith | Unicode scalar values ⓘ |
| isDefinedBy |
Unicode
ⓘ
surface form:
Unicode Standard
|
| isLessEfficientThan |
UTF-16 for storage
ⓘ
UTF-8 for storage ⓘ |
| isPartOf |
Unicode Standard Annexes
ⓘ
surface form:
Unicode Standard encodings
|
| isRarelyUsedFor |
file storage
ⓘ
web content ⓘ |
| isRelatedTo | UCS-4 ⓘ |
| isStandardizedBy |
ISO/IEC 10646
ⓘ
Unicode Consortium ⓘ |
| isUsedFor |
APIs requiring constant-time indexing
ⓘ
internal string representation ⓘ |
| mayUse | byte order mark ⓘ |
| supportsCodeSpace | U+000000 to U+10FFFF ⓘ |
| supportsPlane |
Basic Multilingual Plane
ⓘ
Supplementary Ideographic Plane ⓘ Supplementary Multilingual Plane ⓘ Supplementary Private Use Area-A ⓘ Supplementary Private Use Area-B ⓘ Supplementary Special-purpose Plane ⓘ |
| usesBitWidth | 32 bits per code point ⓘ |
| wasPreviouslyCalled | UCS-4 ⓘ |
Referenced by (5)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form:
UTF-32 (conceptually, as 32-bit form)