CJK Unified Ideographs

E412588

CJK Unified Ideographs is a standardized set of Chinese, Japanese, and Korean logographic characters encoded in Unicode to unify and represent Han-based writing systems across East Asia.

Try in SPARQL Jump to: Surface forms Statements Referenced by

All labels observed (5)

Statements (49)

Predicate Object
instanceOf CJK character set
Han character repertoire
Unicode block collection
alsoKnownAs CJK Unified Ideographs
surface form: CJK Unified Han

URO
coversLanguage Chinese
Japanese
Korean
Vietnamese
designGoal unification of Han characters across CJK languages
encodingModel logographic
excludes most non-Han scripts used in CJK regions
purely stylistic glyph variants
hasBidirectionalClass L
hasCategory Letter_Other in Unicode general category for most characters
hasCombiningClass 0 for most characters
hasProperty includes Japanese Kanji
includes Korean Hanja
includes Vietnamese Chữ Hán and Chữ Nôm characters
includes simplified Chinese characters
includes traditional Chinese characters
organized by radical-stroke order in charts
supports round-trip mapping to legacy East Asian encodings
unified code points for historically variant Han characters
hasScriptProperty Han
hasSubset CJK Unified Ideographs basic block
CJK Unified Ideographs extensions
hasUnicodeBlock CJK Compatibility Ideographs
CJK Compatibility Ideographs Supplement
CJK Unified Ideographs self-link
CJK Unified Ideographs Extension A
CJK Unified Ideographs Extension B
CJK Unified Ideographs Extension C
CJK Unified Ideographs Extension D
CJK Unified Ideographs Extension E
CJK Unified Ideographs Extension F
CJK Unified Ideographs Extension G
CJK Unified Ideographs Extension H
CJK Unified Ideographs extensions
surface form: CJK Unified Ideographs Extension I
partOf Unicode Standard
relatedStandard ISO/IEC 10646
standardizedBy ISO/IEC JTC 1/SC 2
Unicode Consortium
unificationPrinciple same abstract character unified despite glyph differences
usedIn digital typography for CJK scripts
internationalization of software
text processing for East Asian languages
web standards and HTML
usesScript Han script

How these facts were elicited

The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.

Instruction
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10.

# Requirements
- If you don't know the subject at all, return an empty list.
- If the subject is not a named entity, return an empty list.
- Include at least one triple where predicate is "instanceOf".
- Do not get too wordy.
- Separate several objects into multiple triples with one object.
Input
Subject: CJK Unified Ideographs
Description of subject: CJK Unified Ideographs is a standardized set of Chinese, Japanese, and Korean logographic characters encoded in Unicode to unify and represent Han-based writing systems across East Asia.

Referenced by (7)

Full triples — surface form annotated when it differs from this entity's canonical label.

block CJK Unified Ideographs
Plane 0 contains CJK Unified Ideographs
this entity surface form: CJK Unified Ideographs (basic set)
Supplementary Ideographic Plane contains CJK Unified Ideographs
this entity surface form: CJK Unified Ideographs Extension B
Unicode 4.1 refinesExistingScripts CJK Unified Ideographs
this entity surface form: CJK
Chữ Nôm UnicodeBlock CJK Unified Ideographs
CJK Unified Ideographs alsoKnownAs CJK Unified Ideographs
this entity surface form: CJK Unified Han
CJK Unified Ideographs hasUnicodeBlock CJK Unified Ideographs self-link