Triple

T16760735
Position Surface form Disambiguated ID Type / Status
Subject Tigré E407334 entity
Predicate belongsToGroup P12263 FINISHED
Object North Ethiosemitic languages
The North Ethiosemitic languages are a branch of the Semitic language family spoken primarily in Eritrea and northern Ethiopia, including languages such as Tigré and Geʽez.
E1232400 NE FINISHED

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: North Ethiosemitic languages | Statement: [Tigré, belongsToGroup, North Ethiosemitic languages]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: North Ethiosemitic languages
Context triple: [Tigré, belongsToGroup, North Ethiosemitic languages]
  • A. South Semitic languages
    South Semitic languages are a branch of the Semitic language family spoken primarily in the southern Arabian Peninsula and the Horn of Africa, including languages such as Arabic’s ancient South Arabian relatives and the Modern South Arabian and Ethiopian Semitic languages.
  • B. Central Cushitic languages
    The Central Cushitic languages are a small branch of the Cushitic family of Afroasiatic languages spoken primarily in northwestern Ethiopia.
  • C. Afroasiatic languages
    Afroasiatic languages are a major language family of Africa and parts of Western Asia that includes languages such as Arabic, Hebrew, Amharic, and Hausa.
  • D. Solomonic languages
    Solomonic languages are a subgroup of Oceanic languages spoken primarily in the Solomon Islands, characterized by shared phonological and grammatical features distinct within the broader Austronesian family.
  • E. Central Semitic languages
    Central Semitic languages are a major branch of the Semitic language family that includes Arabic and several closely related languages of the Middle East.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: North Ethiosemitic languages
Triple: [Tigré, belongsToGroup, North Ethiosemitic languages]
Generated description
The North Ethiosemitic languages are a branch of the Semitic language family spoken primarily in Eritrea and northern Ethiopia, including languages such as Tigré and Geʽez.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: North Ethiosemitic languages
Target entity description: The North Ethiosemitic languages are a branch of the Semitic language family spoken primarily in Eritrea and northern Ethiopia, including languages such as Tigré and Geʽez.
  • A. South Semitic languages
    South Semitic languages are a branch of the Semitic language family spoken primarily in the southern Arabian Peninsula and the Horn of Africa, including languages such as Arabic’s ancient South Arabian relatives and the Modern South Arabian and Ethiopian Semitic languages.
  • B. Central Cushitic languages
    The Central Cushitic languages are a small branch of the Cushitic family of Afroasiatic languages spoken primarily in northwestern Ethiopia.
  • C. Afroasiatic languages
    Afroasiatic languages are a major language family of Africa and parts of Western Asia that includes languages such as Arabic, Hebrew, Amharic, and Hausa.
  • D. Solomonic languages
    Solomonic languages are a subgroup of Oceanic languages spoken primarily in the Solomon Islands, characterized by shared phonological and grammatical features distinct within the broader Austronesian family.
  • E. Central Semitic languages
    Central Semitic languages are a major branch of the Semitic language family that includes Arabic and several closely related languages of the Middle East.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d8839174188190909f190097207065 completed April 10, 2026, 4:58 a.m.
NER Named-entity recognition batch_69e3abec638c81909d71ff452a4123c9 completed April 18, 2026, 4:06 p.m.
NED1 Entity disambiguation (via context triple) batch_6a00a52d077081908080c61da67e0032 completed May 10, 2026, 3:33 p.m.
NEDg Description generation batch_6a00a685753881908c3fef10823ce569 completed May 10, 2026, 3:38 p.m.
NED2 Entity disambiguation (via description) batch_6a00a7174f5c8190891ddd180c50aee3 completed May 10, 2026, 3:41 p.m.
Created at: April 10, 2026, 5:21 a.m.