Triple
T15291063
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Eastern Trans-Fly languages |
E365527
|
entity |
| Predicate | likelyRelatedTo |
P37
|
FINISHED |
| Object |
Pahoturi languages
The Pahoturi languages are a small family of Papuan languages spoken in the Pahoturi River region of southern Papua New Guinea, known for their typological diversity and relative underdocumentation.
|
E1147923
|
NE FINISHED |
How this triple was built (5 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Pahoturi languages | Statement: [Eastern Trans-Fly languages, likelyRelatedTo, Pahoturi languages]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Pahoturi languages Context triple: [Eastern Trans-Fly languages, likelyRelatedTo, Pahoturi languages]
-
A.
Chimakuan languages
The Chimakuan languages are a small family of now-extinct Indigenous languages once spoken in the Pacific Northwest of North America, particularly on the Olympic Peninsula in Washington State.
-
B.
Nunusaku languages
The Nunusaku languages are a subgroup of Austronesian languages spoken in the central Maluku region of eastern Indonesia, characterized by shared phonological and grammatical features that distinguish them from neighboring language groups.
-
C.
Thagichu languages
The Thagichu languages are a subgroup of closely related Bantu languages of central Kenya that include Kikuyu and its nearest linguistic relatives.
-
D.
Tani languages
The Tani languages are a subgroup of the Sino-Tibetan language family spoken primarily in Arunachal Pradesh and adjoining regions of Northeast India by various indigenous communities.
-
E.
Thura-Yura languages
Thura-Yura languages are a group of closely related Australian Aboriginal languages traditionally spoken in parts of South Australia.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg
Description generation
gpt-5.1
Instruction
Generate a one-sentence description of the target entity. You are given a context triple in the form (subject, predicate, object), where the object is the target entity. # Instructions Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. Avoid repeating the information from the triple, unless really essential. # Response Format Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Pahoturi languages Triple: [Eastern Trans-Fly languages, likelyRelatedTo, Pahoturi languages]
Generated description
The Pahoturi languages are a small family of Papuan languages spoken in the Pahoturi River region of southern Papua New Guinea, known for their typological diversity and relative underdocumentation.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Pahoturi languages Target entity description: The Pahoturi languages are a small family of Papuan languages spoken in the Pahoturi River region of southern Papua New Guinea, known for their typological diversity and relative underdocumentation.
-
A.
Chimakuan languages
The Chimakuan languages are a small family of now-extinct Indigenous languages once spoken in the Pacific Northwest of North America, particularly on the Olympic Peninsula in Washington State.
-
B.
Nunusaku languages
The Nunusaku languages are a subgroup of Austronesian languages spoken in the central Maluku region of eastern Indonesia, characterized by shared phonological and grammatical features that distinguish them from neighboring language groups.
-
C.
Thagichu languages
The Thagichu languages are a subgroup of closely related Bantu languages of central Kenya that include Kikuyu and its nearest linguistic relatives.
-
D.
Tani languages
The Tani languages are a subgroup of the Sino-Tibetan language family spoken primarily in Arunachal Pradesh and adjoining regions of Northeast India by various indigenous communities.
-
E.
Thura-Yura languages
Thura-Yura languages are a group of closely related Australian Aboriginal languages traditionally spoken in parts of South Australia.
- F. None of above. chosen
PD
Predicate disambiguation
gpt-5-mini-2025-08-07
Target predicate: likelyRelatedTo Context triple: [Eastern Trans-Fly languages, likelyRelatedTo, Pahoturi languages]
-
A.
moreCloselyRelatedTo
Indicates that one entity has a stronger or closer relationship, connection, or similarity to a second entity than to some other reference entity.
-
B.
relatedTo
chosen
Indicates a general, non-specific relationship or association exists between two entities.
-
C.
relatedToTerm
Indicates a general, non-specific relationship or association between one term and another.
-
D.
commonlyLinkedTo
Indicates that one entity is frequently or typically associated, connected, or co-occurring with another entity.
-
E.
relatedType
Indicates that one entity is connected to another through a specified type or category of relationship.
- F. None of above.
Provenance (6 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69d85a103d9081908c1ea6c4c73ac8e3 |
completed | April 10, 2026, 2:01 a.m. |
| NER | Named-entity recognition | batch_69e03680b60c8190a3ea54a9d34c8105 |
completed | April 16, 2026, 1:08 a.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69feef7d4da4819080f101c3a525ea11 |
completed | May 9, 2026, 8:25 a.m. |
| NEDg | Description generation | batch_69fef050d1588190942e1d3f3083607a |
completed | May 9, 2026, 8:29 a.m. |
| NED2 | Entity disambiguation (via description) | batch_69fef19d55588190b816d4c37f3a7010 |
completed | May 9, 2026, 8:34 a.m. |
| PD | Predicate disambiguation | batch_69deca90739081909bd1b797cdb8af2b |
completed | April 14, 2026, 11:15 p.m. |
Created at: April 10, 2026, 3:15 a.m.