RFC 3629
E138249
RFC 3629 is the Internet standard that defines UTF-8, a byte-oriented character encoding for Unicode (ISO/IEC 10646) characters.
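As a rough illustration of the encoding this entry describes, the sketch below maps a single code point to its 1 to 4 octet UTF-8 form. The function name `encode_code_point` is hypothetical and chosen only for this example; the bit layout and the rejection of surrogates and of values above U+10FFFF follow the constraints recorded in the statements for this entity.

```python
def encode_code_point(cp: int) -> bytes:
    """Encode one Unicode scalar value as UTF-8 (illustrative helper, not a library API)."""
    if cp < 0 or cp > 0x10FFFF:
        raise ValueError("code point outside U+0000..U+10FFFF")
    if 0xD800 <= cp <= 0xDFFF:
        raise ValueError("UTF-16 surrogates must not be encoded in UTF-8")
    if cp <= 0x7F:
        # 1 octet: 0xxxxxxx (identical to ASCII)
        return bytes([cp])
    if cp <= 0x7FF:
        # 2 octets: 110xxxxx 10xxxxxx
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    if cp <= 0xFFFF:
        # 3 octets: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([
            0xE0 | (cp >> 12),
            0x80 | ((cp >> 6) & 0x3F),
            0x80 | (cp & 0x3F),
        ])
    # 4 octets: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([
        0xF0 | (cp >> 18),
        0x80 | ((cp >> 12) & 0x3F),
        0x80 | ((cp >> 6) & 0x3F),
        0x80 | (cp & 0x3F),
    ])
```

For valid scalar values the result should agree with Python's built-in `chr(cp).encode("utf-8")`, which is a convenient cross-check.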
All labels observed (1)
| Label | Occurrences |
|---|---|
| RFC 3629 (canonical) | 3 |
Statements (49)
| Predicate | Object |
|---|---|
| instanceOf | Internet standard; Request for Comments |
| addresses | invalid byte sequences in UTF-8; overlong UTF-8 sequences; security considerations for UTF-8 |
| appliesTo | Internet protocols; textual data interchange |
| area | Applications Area |
| category | Character encoding standard |
| defines | UTF-8; UTF-8 character encoding; encoding of Unicode characters in 8-bit octets; transformation format of ISO/IEC 10646 |
| definesCodeUnitSequenceLength | 1 to 4 octets |
| definesEncodingUnit | 8-bit byte; octet |
| definesMaximumCodePoint | U+10FFFF |
| definesProperty | backward compatibility with ASCII; self-synchronizing property of UTF-8 |
| documentType | Standards Track RFC |
| encodingType | byte-oriented encoding; variable-length encoding |
| ensures | ASCII characters have same byte values in UTF-8 |
| hasAbbreviation | RFC 3629 |
| language | English |
| obsoletes | RFC 2279 |
| organization | Internet Society |
| prohibits | encoding of UTF-16 surrogates in UTF-8 |
| publishedBy | Internet Engineering Task Force (surface form: IETF); Internet Engineering Task Force |
| relatedStandard | ISO/IEC 10646; Unicode Standard |
| relatesTo | ISO/IEC 10646; Unicode |
| restricts | UTF-8 code sequences to 4 bytes |
| RFCNumber | 3629 |
| specifies | encoding of Unicode scalar values; ill-formed UTF-8 byte sequences; rules for surrogate code points in UTF-8; well-formed UTF-8 byte sequences |
| standardizes | UTF-8 |
| status | Internet Standard; Standards Track RFC |
| stream | IETF Stream |
| title | UTF-8, a transformation format of ISO 10646 |
| topic | character encoding; internationalization; text representation in computers |
| updates | UTF-8 specification |
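The well-formedness constraints recorded in the statements above (sequences of 1 to 4 octets, no overlong forms, no surrogates, nothing above U+10FFFF) can be sketched as a byte-range check. This is a minimal sketch, not a reference implementation: the name `is_well_formed_utf8` is hypothetical, and the per-lead-byte ranges are the ones given by the UTF-8 syntax in RFC 3629, section 4.

```python
def is_well_formed_utf8(data: bytes) -> bool:
    """Return True if data is a well-formed UTF-8 byte sequence (illustrative helper)."""
    i, n = 0, len(data)
    while i < n:
        b0 = data[i]
        if b0 <= 0x7F:
            # Single-octet sequence, same value as ASCII.
            i += 1
            continue
        if 0xC2 <= b0 <= 0xDF:
            # 0xC0 and 0xC1 are excluded: they could only start overlong forms.
            length, lo, hi = 2, 0x80, 0xBF
        elif 0xE0 <= b0 <= 0xEF:
            length = 3
            lo = 0xA0 if b0 == 0xE0 else 0x80  # reject overlong 3-octet forms
            hi = 0x9F if b0 == 0xED else 0xBF  # reject surrogates U+D800..U+DFFF
        elif 0xF0 <= b0 <= 0xF4:
            length = 4
            lo = 0x90 if b0 == 0xF0 else 0x80  # reject overlong 4-octet forms
            hi = 0x8F if b0 == 0xF4 else 0xBF  # reject values above U+10FFFF
        else:
            # Continuation octets (0x80..0xBF), 0xC0, 0xC1 and 0xF5..0xFF
            # can never begin a sequence.
            return False
        if i + length > n:
            return False  # sequence is truncated
        if not lo <= data[i + 1] <= hi:
            return False
        for k in range(i + 2, i + length):
            if not 0x80 <= data[k] <= 0xBF:
                return False
        i += length
    return True
```

Restricting the second octet by lead byte is what rules out overlong sequences and surrogates without decoding to a code point first; a decoder that computes the code point and range-checks it afterwards would be an equally valid design.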
Referenced by (3)
Full triples — surface form annotated when it differs from this entity's canonical label.