CJK Unified Ideographs Extension B

E775981

CJK Unified Ideographs Extension B is a large Unicode block that encodes an extensive set of rare and historical Chinese characters used across East Asian languages, including those found in systems like Chữ Nôm.

All labels observed (1)

Label Occurrences
CJK Unified Ideographs Extension B canonical 2

Statements (45)

Predicate Object
instanceOf Unicode block
belongsTo CJK Unified Ideographs repertoire
blockName CJK Unified Ideographs Extension B
characterType logographic
codePointRangeEnd U+2A6DF
codePointRangeStart U+20000
contains CJK ideographs
historic Chinese characters
rare Chinese characters
coverage rare and historic Han characters
encodingStandard ISO/IEC 10646 NERFINISHED
Unicode NERFINISHED
hasProperty supplementary block
unified ideographs
introducedInVersion Unicode 3.1 NERFINISHED
plane Supplementary Ideographic Plane NERFINISHED
planeNumber Plane 2
relatedBlock CJK Unified Ideographs
CJK Unified Ideographs Extension A NERFINISHED
CJK Unified Ideographs Extension C NERFINISHED
CJK Unified Ideographs Extension D NERFINISHED
CJK Unified Ideographs Extension E NERFINISHED
CJK Unified Ideographs Extension F NERFINISHED
CJK Unified Ideographs Extension G NERFINISHED
script Han
scriptDirection left-to-right
top-to-bottom
standardizedBy Unicode Consortium NERFINISHED
usageRegion China NERFINISHED
Japan NERFINISHED
Korea NERFINISHED
Vietnam NERFINISHED
usedFor historical documents
philological research
rare personal names
usedInLanguageFamily Austroasiatic languages NERFINISHED
Japonic languages
Koreanic languages
Sino-Tibetan languages NERFINISHED
usesInSystem Chữ Nôm NERFINISHED
writingSystem Chinese
Japanese
Korean
Vietnamese NERFINISHED
yearIntroduced 2001

How these facts were elicited

The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.

Instruction
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10.

# Requirements
- If you don't know the subject at all, return an empty list.
- If the subject is not a named entity, return an empty list.
- Include at least one triple where predicate is "instanceOf".
- Do not get too wordy.
- Separate several objects into multiple triples with one object.
Input
Subject: CJK Unified Ideographs Extension B
Description of subject: CJK Unified Ideographs Extension B is a large Unicode block that encodes an extensive set of rare and historical Chinese characters used across East Asian languages, including those found in systems like Chữ Nôm.

Referenced by (2)

Full triples — surface form annotated when it differs from this entity's canonical label.

Chữ Nôm UnicodeBlock CJK Unified Ideographs Extension B
CJK Unified Ideographs hasUnicodeBlock CJK Unified Ideographs Extension B