RFC 3629

E138249

RFC 3629 is the Internet standard that defines the UTF-8 character encoding for representing Unicode/ISO/IEC 10646 characters in a byte-oriented format.

Try in SPARQL Jump to: Surface forms Statements Referenced by

All labels observed (1)

Label Occurrences
RFC 3629 canonical 3

Statements (49)

Predicate Object
instanceOf Internet standard
Request for Comments
addresses invalid byte sequences in UTF-8
overlong UTF-8 sequences
security considerations for UTF-8
appliesTo Internet protocols
textual data interchange
area Applications Area
category Character encoding standard
defines UTF-8
UTF-8 character encoding
encoding of Unicode characters in 8-bit octets
transformation format of ISO/IEC 10646
definesCodeUnitSequenceLength 1 to 4 octets
definesEncodingUnit 8-bit byte
octet
definesMaximumCodePoint U+10FFFF
definesProperty backward compatibility with ASCII
self-synchronizing property of UTF-8
documentType Standards Track RFC
encodingType byte-oriented encoding
variable-length encoding
ensures ASCII characters have same byte values in UTF-8
hasAbbreviation RFC 3629
language English
obsoletes RFC 2279
organization Internet Society
prohibits encoding of UTF-16 surrogates in UTF-8
publishedBy Internet Engineering Task Force
surface form: IETF

Internet Engineering Task Force
relatedStandard ISO/IEC 10646
Unicode Standard
relatesTo ISO/IEC 10646
Unicode
restricts UTF-8 code sequences to 4 bytes
RFCNumber 3629
specifies encoding of Unicode scalar values
ill-formed UTF-8 byte sequences
rules for surrogate code points in UTF-8
well-formed UTF-8 byte sequences
standardizes UTF-8
status Internet Standard
Standards Track RFC
stream IETF Stream
title UTF-8, a transformation format of ISO 10646
topic character encoding
internationalization
text representation in computers
updates UTF-8 specification

Referenced by (3)

Full triples — surface form annotated when it differs from this entity's canonical label.

ISO/IEC 10646 relatedStandard RFC 3629
UTF-7 deprecatedIn RFC 3629
UTF-8 standardizedIn RFC 3629