ISO/IEC 8859-1
E29335
ISO/IEC 8859-1 is an 8-bit single-byte character encoding standard that covers Western European languages and was widely used before the adoption of Unicode.
All labels observed (6)
| Label | Occurrences |
|---|---|
| ISO/IEC 8859-1 canonical | 5 |
| ISO 8859-1 | 2 |
| ISO Latin 1 | 1 |
| ISO-8859-1 | 1 |
| Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1 | 1 |
| Latin-1 | 1 |
How this entity was disambiguated
This entity first appeared as the object of triple T228293 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.
Target entity: ISO/IEC 8859-1 Context triple: [Latin-1 Supplement, encodingCompatibility, ISO/IEC 8859-1]
-
A.
Latin-1 Supplement
Latin-1 Supplement is a Unicode block that extends the basic Latin script with additional characters, including accented letters and symbols used in many Western European languages.
-
B.
Latin Extended-B
Latin Extended-B is a Unicode block that adds additional Latin characters used for various historical, phonetic, and minority language orthographies beyond the basic Latin set.
-
C.
Cyrillic Extended-B
Cyrillic Extended-B is a Unicode block that contains additional Cyrillic characters used for writing various minority and historic languages that employ the Cyrillic script.
-
D.
ISO/IEC 10646
ISO/IEC 10646 is an international standard that defines the Universal Coded Character Set (UCS), a comprehensive repertoire of characters used worldwide and closely aligned with the Unicode Standard.
-
E.
ASCII
ASCII is a widely used character encoding standard that represents text in computers and other devices using 7-bit numerical codes for letters, digits, punctuation, and control characters.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Target entity: ISO/IEC 8859-1 Target entity description: ISO/IEC 8859-1 is an 8-bit single-byte character encoding standard that covers Western European languages and was widely used before the adoption of Unicode.
-
A.
Latin-1 Supplement
Latin-1 Supplement is a Unicode block that extends the basic Latin script with additional characters, including accented letters and symbols used in many Western European languages.
-
B.
Latin Extended-B
Latin Extended-B is a Unicode block that adds additional Latin characters used for various historical, phonetic, and minority language orthographies beyond the basic Latin set.
-
C.
Cyrillic Extended-B
Cyrillic Extended-B is a Unicode block that contains additional Cyrillic characters used for writing various minority and historic languages that employ the Cyrillic script.
-
D.
ISO/IEC 10646
ISO/IEC 10646 is an international standard that defines the Universal Coded Character Set (UCS), a comprehensive repertoire of characters used worldwide and closely aligned with the Unicode Standard.
-
E.
ASCII
ASCII is a widely used character encoding standard that represents text in computers and other devices using 7-bit numerical codes for letters, digits, punctuation, and control characters.
- F. None of above. chosen
Statements (55)
| Predicate | Object |
|---|---|
| instanceOf |
8-bit single-byte character set
ⓘ
ISO/IEC standard ⓘ character encoding ⓘ |
| alsoKnownAs |
ISO/IEC 8859-1
ⓘ
surface form:
ISO Latin 1
ISO/IEC 8859-1 ⓘ
surface form:
Latin-1
|
| bitWidth | 8 bits ⓘ |
| category | single-byte encoding ⓘ |
| codePointsCount | 256 ⓘ |
| codeUnitSize | 1 byte ⓘ |
| controlCharactersRange |
0–31
ⓘ
127 ⓘ |
| coversLanguageGroup | Western European languages ⓘ |
| designedFor | Western European text processing systems ⓘ |
| doesNotIncludeCharacter | € ⓘ |
| extendedCharactersRange | 160–255 ⓘ |
| fullName |
ISO/IEC 8859-1
self-linksurface differs
ⓘ
surface form:
Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1
|
| graphicCharactersCount | 191 ⓘ |
| includesCharacter |
Á
ⓘ
Å ⓘ Æ ⓘ Ç ⓘ É ⓘ Í ⓘ Ñ ⓘ Ó ⓘ Ø ⓘ Ú ⓘ ß ⓘ |
| includesScript | Latin script ⓘ |
| maintainedBy |
International Electrotechnical Commission
ⓘ
International Organization for Standardization ⓘ |
| mimeCharsetName | ISO-8859-1 ⓘ |
| partOf | ISO/IEC 8859 ⓘ |
| primaryUseEra | pre-Unicode era ⓘ |
| printableCharactersRange | 32–126 ⓘ |
| registeredWith | IANA ⓘ |
| relatedStandard |
ISO/IEC 8859-15
ⓘ
Windows-1252 ⓘ |
| standardNumber | ISO/IEC 8859-1 self-link ⓘ |
| supersededInPracticeBy | Unicode ⓘ |
| supportsLanguage |
Danish language
ⓘ
surface form:
Danish
Dutch ⓘ English ⓘ Finnish ⓘ French ⓘ German ⓘ Icelandic ⓘ Italian ⓘ Norwegian language ⓘ
surface form:
Norwegian
Portuguese ⓘ Spanish ⓘ Swedish language ⓘ
surface form:
Swedish
|
| usedAs |
basis for many Western European encodings
ⓘ
default character set in early World Wide Web ⓘ default encoding in early HTML specifications ⓘ |
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10. # Requirements - If you don't know the subject at all, return an empty list. - If the subject is not a named entity, return an empty list. - Include at least one triple where predicate is "instanceOf". - Do not get too wordy. - Separate several objects into multiple triples with one object.
Subject: ISO/IEC 8859-1 Description of subject: ISO/IEC 8859-1 is an 8-bit single-byte character encoding standard that covers Western European languages and was widely used before the adoption of Unicode.
Referenced by (11)
Full triples — surface form annotated when it differs from this entity's canonical label.