UTF-32
E23921
UTF-32 is a fixed-length Unicode character encoding that represents each code point using 32 bits, providing simple indexing at the cost of higher memory usage.
Statements (47)
| Predicate | Object |
|---|---|
| instanceOf |
Unicode transformation format
→
character encoding → fixed-length encoding → |
| BOMCodeUnit |
0x0000FEFF
→
|
| codeUnitSize |
4 bytes
→
|
| doesNotEncode |
noncharacters outside Unicode range
→
|
| encodes |
Unicode code points
→
|
| hasDisadvantage |
increased bandwidth usage
→
larger cache footprint → |
| hasEndianness |
big-endian
→
little-endian → |
| hasProperty |
direct mapping between code point and code unit
→
fixed-length code units → high memory usage per character → no surrogate pairs → simple indexing by code point → |
| hasVariant |
UTF-32BE
→
UTF-32LE → |
| introducedTo |
provide simple mapping from code point index to memory offset
→
|
| isAlternativeTo |
UTF-1
→
UTF-16 → UTF-7 → UTF-8 → |
| isCommonlyUsedIn |
some programming language runtimes
→
some text processing libraries → |
| isCompatibleWith |
Unicode scalar values
→
|
| isDefinedBy |
Unicode Standard
→
|
| isLessEfficientThan |
UTF-16 for storage
→
UTF-8 for storage → |
| isPartOf |
Unicode Standard encodings
→
|
| isRarelyUsedFor |
file storage
→
web content → |
| isRelatedTo |
UCS-4
→
|
| isStandardizedBy |
ISO/IEC 10646
→
Unicode Consortium → |
| isUsedFor |
APIs requiring constant-time indexing
→
internal string representation → |
| mayUse |
byte order mark
→
|
| supportsCodeSpace |
U+000000 to U+10FFFF
→
|
| supportsPlane |
Basic Multilingual Plane
→
Supplementary Ideographic Plane → Supplementary Multilingual Plane → Supplementary Private Use Area-A → Supplementary Private Use Area-B → Supplementary Special-purpose Plane → |
| usesBitWidth |
32 bits per code point
→
|
| wasPreviouslyCalled |
UCS-4
→
|
Referenced by (3)
| Subject (surface form when different) | Predicate |
|---|---|
|
Unicode
→
|
hasEncodingForm |
|
Unicode Scalar Values
→
|
usedBy |
|
ISO/IEC 10646
→
|
usesEncodingForm |