Windows-1252
E157300
Windows-1252 is a character encoding used primarily on Microsoft Windows systems that extends ISO 8859-1 with additional printable characters, including typographic punctuation and symbols.
All labels observed (2)
| Label | Occurrences |
|---|---|
| Windows-1252 canonical | 2 |
| CP1252 | 1 |
How this entity was disambiguated
This entity first appeared as the object of triple T1380989 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.
Target entity: Windows-1252 Context triple: [ISO/IEC 8859-1, relatedStandard, Windows-1252]
-
A.
ISO/IEC 8859-1
ISO/IEC 8859-1 is an 8-bit single-byte character encoding standard that covers Western European languages and was widely used before the adoption of Unicode.
-
B.
ISO/IEC 8859
ISO/IEC 8859 is a family of 8-bit character encoding standards that define various single-byte coded character sets for different languages and scripts, widely used before the adoption of Unicode.
-
C.
Latin-1 Supplement
Latin-1 Supplement is a Unicode block that extends the basic Latin script with additional characters, including accented letters and symbols used in many Western European languages.
-
D.
Cyrillic Extended-C
Cyrillic Extended-C is a Unicode block that adds additional Cyrillic characters used for specialized, historic, or lesser-used orthographies beyond those covered in the core Cyrillic blocks.
-
E.
ASCII
ASCII is a widely used character encoding standard that represents text in computers and other devices using 7-bit numerical codes for letters, digits, punctuation, and control characters.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Target entity: Windows-1252 Target entity description: Windows-1252 is a character encoding used primarily on Microsoft Windows systems that extends ISO 8859-1 with additional printable characters, including typographic punctuation and symbols.
-
A.
ISO/IEC 8859-1
ISO/IEC 8859-1 is an 8-bit single-byte character encoding standard that covers Western European languages and was widely used before the adoption of Unicode.
-
B.
ISO/IEC 8859
ISO/IEC 8859 is a family of 8-bit character encoding standards that define various single-byte coded character sets for different languages and scripts, widely used before the adoption of Unicode.
-
C.
Latin-1 Supplement
Latin-1 Supplement is a Unicode block that extends the basic Latin script with additional characters, including accented letters and symbols used in many Western European languages.
-
D.
Cyrillic Extended-C
Cyrillic Extended-C is a Unicode block that adds additional Cyrillic characters used for specialized, historic, or lesser-used orthographies beyond those covered in the core Cyrillic blocks.
-
E.
ASCII
ASCII is a widely used character encoding standard that represents text in computers and other devices using 7-bit numerical codes for letters, digits, punctuation, and control characters.
- F. None of above. chosen
Statements (64)
| Predicate | Object |
|---|---|
| instanceOf |
Windows code page
ⓘ
character encoding ⓘ single-byte character encoding ⓘ |
| alsoKnownAs |
ANSI code page
ⓘ
Windows-1252 ⓘ
surface form:
CP1252
Windows Western European ⓘ |
| basedOn |
ISO/IEC 8859-1
ⓘ
surface form:
ISO 8859-1
|
| byteSize | 8-bit ⓘ |
| characterRepertoireSize | 256 code points ⓘ |
| codePageNumber | 1252 ⓘ |
| commonMislabelingAs |
ISO/IEC 8859-1
ⓘ
surface form:
ISO-8859-1
latin1 ⓘ |
| compatibleWith | ASCII for code points 0–127 ⓘ |
| defaultEncodingFor | many legacy Western-language Windows applications ⓘ |
| developer | Microsoft ⓘ |
| differsFrom | ISO 8859-1 in code points 0x80–0x9F ⓘ |
| extends |
ISO/IEC 8859-1
ⓘ
surface form:
ISO 8859-1
|
| IANACharsetName | windows-1252 ⓘ |
| includesCharacter |
Œ (Latin capital ligature OE)
ⓘ
œ (Latin small ligature oe) ⓘ Š (Latin capital letter S with caron) ⓘ š (Latin small letter s with caron) ⓘ Ÿ (Latin capital letter Y with diaeresis) ⓘ Ž (Latin capital letter Z with caron) ⓘ ž (Latin small letter z with caron) ⓘ ƒ (Latin small letter f with hook) ⓘ ˆ (modifier letter circumflex accent) ⓘ ˜ (small tilde) ⓘ – (en dash) ⓘ — (em dash) ⓘ ‘ (left single quotation mark) ⓘ ’ (right single quotation mark) ⓘ ‚ (single low-9 quotation mark) ⓘ “ (left double quotation mark) ⓘ ” (right double quotation mark) ⓘ „ (double low-9 quotation mark) ⓘ † (dagger) ⓘ ‡ (double dagger) ⓘ • (bullet) ⓘ … (horizontal ellipsis) ⓘ ‰ (per mille sign) ⓘ ‹ (single left-pointing angle quotation mark) ⓘ › (single right-pointing angle quotation mark) ⓘ € (Euro sign) ⓘ ™ (trade mark sign) ⓘ |
| includesCharacterType |
control characters
ⓘ
currency symbols ⓘ diacritics ⓘ mathematical symbols ⓘ typographic punctuation ⓘ |
| introducedFor | supporting additional punctuation and symbols beyond ISO 8859-1 ⓘ |
| ISO8859-1Range0x80-0x9F | control characters in ISO 8859-1 ⓘ |
| mapsRangeToPrintableCharacters | 0x80–0x9F ⓘ |
| MIMEName | Windows-1252 self-link ⓘ |
| primaryPlatform |
Windows
ⓘ
surface form:
Microsoft Windows
|
| region |
Americas
ⓘ
Western Europe ⓘ |
| script | Latin ⓘ |
| supersededBy | UTF-8 in modern applications ⓘ |
| usedFor |
HTML content on older websites
ⓘ
text files on Windows in Western locales ⓘ |
| usesCodePointRange | U+0000–U+00FF ⓘ |
| usesCodeUnitRange | 0–255 ⓘ |
| writingSystem | Western European languages ⓘ |
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10. # Requirements - If you don't know the subject at all, return an empty list. - If the subject is not a named entity, return an empty list. - Include at least one triple where predicate is "instanceOf". - Do not get too wordy. - Separate several objects into multiple triples with one object.
Subject: Windows-1252 Description of subject: Windows-1252 is a character encoding used primarily on Microsoft Windows systems that extends ISO 8859-1 with additional printable characters, including typographic punctuation and symbols.
Referenced by (3)
Full triples — surface form annotated when it differs from this entity's canonical label.