UCD
E142321
UCD is the standard Unicode Character Database that defines the properties and metadata for every character in the Unicode Standard.
All labels observed (1)
| Label | Occurrences |
|---|---|
| UCD canonical | 1 |
How this entity was disambiguated
This entity first appeared as the object of triple T1241530 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: UCD
Context triple: [Unicode Character Database, hasAbbreviation, UCD]
- A. UCD: UCD is a major public research university in Davis, California, known for its strengths in agriculture, veterinary medicine, environmental science, and engineering.
- B. Dublin City University: Dublin City University is a modern public research university in Dublin, Ireland, known for its strong focus on innovation, industry partnerships, and career-oriented education.
- C. University College Dublin: University College Dublin is a major public research university in Ireland, renowned for its wide range of academic programs and significant contributions to scholarship and innovation.
- D. Kilkenny College: Kilkenny College is a historic Irish secondary school in County Kilkenny, noted for educating prominent figures such as philosopher George Berkeley.
- E. Technological University Dublin: Technological University Dublin is a major Irish technological university formed from the merger of several institutes of technology, offering a wide range of career-focused programs and applied research across multiple campuses in Dublin.
- F. None of above. (chosen)
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: UCD
Target entity description: UCD is the standard Unicode Character Database that defines the properties and metadata for every character in the Unicode Standard.
- A. UCD: UCD is a major public research university in Davis, California, known for its strengths in agriculture, veterinary medicine, environmental science, and engineering.
- B. Dublin City University: Dublin City University is a modern public research university in Dublin, Ireland, known for its strong focus on innovation, industry partnerships, and career-oriented education.
- C. University College Dublin: University College Dublin is a major public research university in Ireland, renowned for its wide range of academic programs and significant contributions to scholarship and innovation.
- D. Kilkenny College: Kilkenny College is a historic Irish secondary school in County Kilkenny, noted for educating prominent figures such as philosopher George Berkeley.
- E. Technological University Dublin: Technological University Dublin is a major Irish technological university formed from the merger of several institutes of technology, offering a wide range of career-focused programs and applied research across multiple campuses in Dublin.
- F. None of above. (chosen)
Statements (52)
| Predicate | Object |
|---|---|
| instanceOf | Unicode standard component, character database |
| abbreviation | UCD |
| alsoKnownAs | Unicode Character Database (surface form: Unicode character property database) |
| contains | Blocks.txt, CaseFolding.txt, DerivedCoreProperties.txt, EastAsianWidth.txt, GraphemeBreakProperty.txt, LineBreak.txt, NameAliases.txt, NormalizationProps files, PropList.txt, PropertyAliases.txt, PropertyValueAliases.txt, Scripts.txt, SentenceBreakProperty.txt, SpecialCasing.txt, Unicode Character Database (surface form: UnicodeData.txt), WordBreakProperty.txt |
| definedBy | Unicode Consortium |
| defines | East Asian width properties, bidirectional class property, block property, canonical combining classes, case mapping properties, combining class property, decomposition mappings, general category property, grapheme break properties, line breaking properties, normalization properties, numeric value property, script property, sentence break properties, word break properties |
| distributionFormat | machine-readable data files, plain text files |
| documentation | Unicode Standard Annexes (surface form: Unicode Standard Annex #44) |
| fullName | Unicode Character Database |
| partOf | The Unicode Standard |
| provides | character metadata, character properties |
| scope | all encoded Unicode characters |
| updatedWithEach | new version of the Unicode Standard |
| usedFor | character classification, collation, implementing Unicode support in software, normalization, regular expression engines, rendering, text processing |
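Several of the properties listed under "defines" can be inspected directly from software that bundles UCD data. The following is a minimal sketch, assuming Python's standard `unicodedata` module as the access layer; the helper name `describe` and the sample characters are illustrative choices and do not come from the statements above.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Collect a few UCD-derived properties for a single character."""
    return {
        "name": unicodedata.name(ch, "<unnamed>"),
        "general_category": unicodedata.category(ch),
        "bidirectional_class": unicodedata.bidirectional(ch),
        "canonical_combining_class": unicodedata.combining(ch),
        "east_asian_width": unicodedata.east_asian_width(ch),
        "decomposition": unicodedata.decomposition(ch),
        # numeric() raises ValueError without a default, so pass None.
        "numeric_value": unicodedata.numeric(ch, None),
    }


if __name__ == "__main__":
    # Version of the UCD data bundled with this Python build.
    print("UCD version:", unicodedata.unidata_version)

    # Sample characters: a letter, a precomposed letter, a fraction,
    # and a combining mark (hypothetical picks for illustration).
    for ch in ("A", "\u00e9", "\u00bd", "\u0301"):
        print(f"U+{ord(ch):04X}", describe(ch))

    # Normalization is driven by the decomposition mappings in the UCD.
    decomposed = unicodedata.normalize("NFD", "\u00e9")
    print([f"U+{ord(c):04X}" for c in decomposed])
```

Which version of the UCD is exposed depends on the Python build, so property values for recently added characters may differ between installations.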
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
Instruction
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10.
# Requirements
- If you don't know the subject at all, return an empty list.
- If the subject is not a named entity, return an empty list.
- Include at least one triple where predicate is "instanceOf".
- Do not get too wordy.
- Separate several objects into multiple triples with one object.
Input
Subject: UCD
Description of subject: UCD is the standard Unicode Character Database that defines the properties and metadata for every character in the Unicode Standard.
Referenced by (1)
Full triples — surface form annotated when it differs from this entity's canonical label.