Triple

T7208312
Position Surface form Disambiguated ID Type / Status
Subject Tetun language E148730 entity
Predicate hasStandardizingBody P1251 FINISHED
Object Instituto Nacional de Linguística (Timor-Leste) E173798 NE FINISHED

How this triple was built (3 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Instituto Nacional de Linguística (Timor-Leste) | Statement: [Tetun language, hasStandardizingBody, Instituto Nacional de Linguística (Timor-Leste)]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Instituto Nacional de Linguística (Timor-Leste)
Context triple: [Tetun language, hasStandardizingBody, Instituto Nacional de Linguística (Timor-Leste)]
  • A. Instituto Nacional de Linguística (Timor-Leste) chosen
    The Instituto Nacional de Linguística (Timor-Leste) is a national research and standardization institute responsible for the study, development, and codification of the languages of Timor-Leste, particularly Tetum.
  • B. Institute of National Language
    The Institute of National Language was the former Philippine government body responsible for developing and promoting a national language before being succeeded by the Komisyon sa Wikang Filipino.
  • C. Institute of National Language
    The Institute of National Language is a Cambodian academic body dedicated to the study, preservation, and standardization of the Khmer language under the Royal Academy of Cambodia.
  • D. Instituto Nacional de Lenguas Indígenas
    The Instituto Nacional de Lenguas Indígenas is a Mexican government institution dedicated to the protection, promotion, and development of the country’s indigenous languages and the linguistic rights of their speakers.
  • E. Badan Bahasa
    Badan Bahasa is Indonesia’s official government body responsible for developing, standardizing, and promoting the Indonesian language.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
PD Predicate disambiguation gpt-5-mini-2025-08-07
Target predicate: hasStandardizingBody
Context triple: [Tetun language, hasStandardizingBody, Instituto Nacional de Linguística (Timor-Leste)]
  • A. hasStandardizationBody chosen
    Indicates that an entity is associated with, governed by, or defined by a specific standards-setting organization or authority.
  • B. hasStandardizedDimensions
    Indicates that an entity conforms to a predefined, uniform set of measurements or size specifications.
  • C. isStandardizationTargetOf
    Indicates that an entity is the object or subject being standardized within a standardization process, activity, or initiative.
  • D. hasStandardizationLevel
    Indicates the degree or extent to which something conforms to an established standard or set of standardized criteria.
  • E. hasStandardizationEvent
    Indicates that an entity is associated with a formal standardization process or event that defines, approves, or harmonizes its specifications or practices.
  • F. None of above.

Provenance (4 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c687e8cf188190b5f3ecffd681f04e completed March 27, 2026, 1:36 p.m.
NER Named-entity recognition batch_69c6e96ae4dc8190b0b9e064ff968c10 completed March 27, 2026, 8:32 p.m.
NED1 Entity disambiguation (via context triple) batch_69c7bfc4e6688190ad9e0d31505e65af completed March 28, 2026, 11:47 a.m.
PD Predicate disambiguation batch_69c6e75f84e481909e7866186ae80cff completed March 27, 2026, 8:23 p.m.
Created at: March 27, 2026, 2:52 p.m.