Triple

T15291063
Position | Surface form | Disambiguated ID | Type / Status
Subject | Eastern Trans-Fly languages | E365527 | entity
Predicate | likelyRelatedTo | P37 | FINISHED
Object | Pahoturi languages | E1147923 | NE FINISHED
Object description: The Pahoturi languages are a small family of Papuan languages spoken in the Pahoturi River region of southern Papua New Guinea, known for their typological diversity and relative underdocumentation.

How this triple was built (5 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick marked "chosen"), and the generated description. The batch and timestamp for each step are listed in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Pahoturi languages | Statement: [Eastern Trans-Fly languages, likelyRelatedTo, Pahoturi languages]
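A minimal sketch of how a response to this instruction could be checked, assuming the model returns the requested JSON object as plain text. The helper name `parse_ner_response` and the sample response string are hypothetical; the sample is an illustration, not the model output recorded for this triple.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Validate a NER response and return its is_ne flag.

    Hypothetical helper: it checks only the two fields named in the
    instruction (`phrase` and `is_ne`).
    """
    data = json.loads(raw)
    if data.get("phrase") != expected_phrase:
        raise ValueError("response is about a different phrase")
    is_ne = data.get("is_ne")
    if not isinstance(is_ne, bool):
        raise ValueError("is_ne must be a JSON boolean")
    return is_ne


# Illustrative response only; the actual model output is not shown on this page.
sample = '{"phrase": "Pahoturi languages", "is_ne": true}'
result = parse_ner_response(sample, "Pahoturi languages")
```

A response with `is_ne` set to true would be consistent with the "NE" type recorded for the object in the triple table.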
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Pahoturi languages
Context triple: [Eastern Trans-Fly languages, likelyRelatedTo, Pahoturi languages]
  • A. Chimakuan languages
    The Chimakuan languages are a small family of now-extinct Indigenous languages once spoken in the Pacific Northwest of North America, particularly on the Olympic Peninsula in Washington State.
  • B. Nunusaku languages
    The Nunusaku languages are a subgroup of Austronesian languages spoken in the central Maluku region of eastern Indonesia, characterized by shared phonological and grammatical features that distinguish them from neighboring language groups.
  • C. Thagichu languages
    The Thagichu languages are a subgroup of closely related Bantu languages of central Kenya that include Kikuyu and its nearest linguistic relatives.
  • D. Tani languages
    The Tani languages are a subgroup of the Sino-Tibetan language family spoken primarily in Arunachal Pradesh and adjoining regions of Northeast India by various indigenous communities.
  • E. Thura-Yura languages
    Thura-Yura languages are a group of closely related Australian Aboriginal languages traditionally spoken in parts of South Australia.
  • F. None of above. (chosen)
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Pahoturi languages
Triple: [Eastern Trans-Fly languages, likelyRelatedTo, Pahoturi languages]
Generated description
The Pahoturi languages are a small family of Papuan languages spoken in the Pahoturi River region of southern Papua New Guinea, known for their typological diversity and relative underdocumentation.
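The Response Format asks for a sentence prefixed with "Description:", while the generated description is displayed without that prefix. A minimal sketch of the stripping step, assuming the raw response carried the prefix exactly as requested; the helper name `extract_description` is hypothetical and the raw string below is reconstructed for illustration.

```python
def extract_description(response: str) -> str:
    """Strip the 'Description:' prefix required by the response format."""
    prefix = "Description:"
    text = response.strip()
    if not text.startswith(prefix):
        raise ValueError("response does not follow the 'Description: ...' format")
    return text[len(prefix):].strip()


# Reconstructed raw response (assumption): the prefix plus the displayed sentence.
sentence = (
    "The Pahoturi languages are a small family of Papuan languages spoken in "
    "the Pahoturi River region of southern Papua New Guinea, known for their "
    "typological diversity and relative underdocumentation."
)
description = extract_description("Description: " + sentence)
```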
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Pahoturi languages
Target entity description: The Pahoturi languages are a small family of Papuan languages spoken in the Pahoturi River region of southern Papua New Guinea, known for their typological diversity and relative underdocumentation.
  • A. Chimakuan languages
    The Chimakuan languages are a small family of now-extinct Indigenous languages once spoken in the Pacific Northwest of North America, particularly on the Olympic Peninsula in Washington State.
  • B. Nunusaku languages
    The Nunusaku languages are a subgroup of Austronesian languages spoken in the central Maluku region of eastern Indonesia, characterized by shared phonological and grammatical features that distinguish them from neighboring language groups.
  • C. Thagichu languages
    The Thagichu languages are a subgroup of closely related Bantu languages of central Kenya that include Kikuyu and its nearest linguistic relatives.
  • D. Tani languages
    The Tani languages are a subgroup of the Sino-Tibetan language family spoken primarily in Arunachal Pradesh and adjoining regions of Northeast India by various indigenous communities.
  • E. Thura-Yura languages
    Thura-Yura languages are a group of closely related Australian Aboriginal languages traditionally spoken in parts of South Australia.
  • F. None of above. (chosen)
PD Predicate disambiguation gpt-5-mini-2025-08-07
Target predicate: likelyRelatedTo
Context triple: [Eastern Trans-Fly languages, likelyRelatedTo, Pahoturi languages]
  • A. moreCloselyRelatedTo
    Indicates that one entity has a stronger or closer relationship, connection, or similarity to a second entity than to some other reference entity.
  • B. relatedTo (chosen)
    Indicates a general, non-specific relationship or association exists between two entities.
  • C. relatedToTerm
    Indicates a general, non-specific relationship or association between one term and another.
  • D. commonlyLinkedTo
    Indicates that one entity is frequently or typically associated, connected, or co-occurring with another entity.
  • E. relatedType
    Indicates that one entity is connected to another through a specified type or category of relationship.
  • F. None of above.
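The disambiguation steps all share the same multiple-choice shape: lettered candidates plus a "None of above" fallback. A minimal sketch of mapping a chosen letter back to a candidate, using the predicate options listed above. The helper name `resolve_choice` is hypothetical, and what the pipeline does after "None of above" is picked is not stated here, so the sketch simply returns `None` in that case.

```python
from typing import Dict, Optional

NONE_OF_ABOVE = "None of above."


def resolve_choice(options: Dict[str, str], letter: str) -> Optional[str]:
    """Return the candidate label for a chosen letter, or None for the fallback."""
    label = options[letter]  # raises KeyError for a letter that was not offered
    return None if label == NONE_OF_ABOVE else label


# Options as shown in the predicate disambiguation step.
pd_options = {
    "A": "moreCloselyRelatedTo",
    "B": "relatedTo",
    "C": "relatedToTerm",
    "D": "commonlyLinkedTo",
    "E": "relatedType",
    "F": NONE_OF_ABOVE,
}

chosen = resolve_choice(pd_options, "B")    # the pick recorded for this triple
fallback = resolve_choice(pd_options, "F")  # what the fallback would yield
```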

Provenance (6 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level: stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in chronological order, but the predicate and elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69d85a103d9081908c1ea6c4c73ac8e3 | completed | April 10, 2026, 2:01 a.m.
NER | Named-entity recognition | batch_69e03680b60c8190a3ea54a9d34c8105 | completed | April 16, 2026, 1:08 a.m.
NED1 | Entity disambiguation (via context triple) | batch_69feef7d4da4819080f101c3a525ea11 | completed | May 9, 2026, 8:25 a.m.
NEDg | Description generation | batch_69fef050d1588190942e1d3f3083607a | completed | May 9, 2026, 8:29 a.m.
NED2 | Entity disambiguation (via description) | batch_69fef19d55588190b816d4c37f3a7010 | completed | May 9, 2026, 8:34 a.m.
PD | Predicate disambiguation | batch_69deca90739081909bd1b797cdb8af2b | completed | April 14, 2026, 11:15 p.m.
Created at: April 10, 2026, 3:15 a.m.
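The table lists batches in pipeline order, not in the order they ran. A minimal sketch that sorts the batch timestamps to show the actual run order, assuming English month names and the "a.m."/"p.m." style used above; the helper name `parse_when` is hypothetical.

```python
from datetime import datetime


def parse_when(text: str) -> datetime:
    """Parse a timestamp such as 'April 10, 2026, 2:01 a.m.'.

    Assumes English month names; 'a.m.'/'p.m.' are normalized to AM/PM
    because strptime's %p does not accept the dotted form.
    """
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# (step, when) pairs copied from the Provenance table, in pipeline order.
batches = [
    ("creating", "April 10, 2026, 2:01 a.m."),
    ("NER", "April 16, 2026, 1:08 a.m."),
    ("NED1", "May 9, 2026, 8:25 a.m."),
    ("NEDg", "May 9, 2026, 8:29 a.m."),
    ("NED2", "May 9, 2026, 8:34 a.m."),
    ("PD", "April 14, 2026, 11:15 p.m."),
]

by_time = sorted(batches, key=lambda row: parse_when(row[1]))
run_order = [step for step, _ in by_time]
```

Sorted this way, the predicate batch (PD) falls between the elicitation batch and the NER batch, while the object chain keeps its NER, NED1, NEDg, NED2 order.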