CJK Unified Ideographs
E412588
CJK Unified Ideographs is a standardized set of Chinese, Japanese, and Korean logographic characters encoded in Unicode to unify and represent Han-based writing systems across East Asia.
All labels observed (5)
| Label | Occurrences |
|---|---|
| CJK Unified Ideographs canonical | 3 |
| CJK | 1 |
| CJK Unified Han | 1 |
| CJK Unified Ideographs (basic set) | 1 |
| CJK Unified Ideographs Extension B | 1 |
Statements (49)
| Predicate | Object |
|---|---|
| instanceOf |
CJK character set
ⓘ
Han character repertoire ⓘ Unicode block collection ⓘ |
| alsoKnownAs |
CJK Unified Ideographs
ⓘ
surface form:
CJK Unified Han
URO ⓘ |
| coversLanguage |
Chinese
ⓘ
Japanese ⓘ Korean ⓘ Vietnamese ⓘ |
| designGoal | unification of Han characters across CJK languages ⓘ |
| encodingModel | logographic ⓘ |
| excludes |
most non-Han scripts used in CJK regions
ⓘ
purely stylistic glyph variants ⓘ |
| hasBidirectionalClass | L ⓘ |
| hasCategory | Letter_Other in Unicode general category for most characters ⓘ |
| hasCombiningClass | 0 for most characters ⓘ |
| hasProperty |
includes Japanese Kanji
ⓘ
includes Korean Hanja ⓘ includes Vietnamese Chữ Hán and Chữ Nôm characters ⓘ includes simplified Chinese characters ⓘ includes traditional Chinese characters ⓘ organized by radical-stroke order in charts ⓘ supports round-trip mapping to legacy East Asian encodings ⓘ unified code points for historically variant Han characters ⓘ |
| hasScriptProperty | Han ⓘ |
| hasSubset |
CJK Unified Ideographs basic block
ⓘ
CJK Unified Ideographs extensions ⓘ |
| hasUnicodeBlock |
CJK Compatibility Ideographs
ⓘ
CJK Compatibility Ideographs Supplement ⓘ CJK Unified Ideographs self-link ⓘ CJK Unified Ideographs Extension A ⓘ CJK Unified Ideographs Extension B ⓘ CJK Unified Ideographs Extension C ⓘ CJK Unified Ideographs Extension D ⓘ CJK Unified Ideographs Extension E ⓘ CJK Unified Ideographs Extension F ⓘ CJK Unified Ideographs Extension G ⓘ CJK Unified Ideographs Extension H ⓘ CJK Unified Ideographs extensions ⓘ
surface form:
CJK Unified Ideographs Extension I
|
| partOf | Unicode Standard ⓘ |
| relatedStandard | ISO/IEC 10646 ⓘ |
| standardizedBy |
ISO/IEC JTC 1/SC 2
ⓘ
Unicode Consortium ⓘ |
| unificationPrinciple | same abstract character unified despite glyph differences ⓘ |
| usedIn |
digital typography for CJK scripts
ⓘ
internationalization of software ⓘ text processing for East Asian languages ⓘ web standards and HTML ⓘ |
| usesScript | Han script ⓘ |
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
Instruction
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10. # Requirements - If you don't know the subject at all, return an empty list. - If the subject is not a named entity, return an empty list. - Include at least one triple where predicate is "instanceOf". - Do not get too wordy. - Separate several objects into multiple triples with one object.
Input
Subject: CJK Unified Ideographs Description of subject: CJK Unified Ideographs is a standardized set of Chinese, Japanese, and Korean logographic characters encoded in Unicode to unify and represent Han-based writing systems across East Asia.
Referenced by (7)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form:
CJK Unified Ideographs (basic set)
this entity surface form:
CJK Unified Ideographs Extension B
this entity surface form:
CJK
this entity surface form:
CJK Unified Han