Triple

T11364932
Position Surface form Disambiguated ID Type / Status
Subject Eastern South Asia E269177 entity
Predicate hasMajorLanguageFamily P1047 FINISHED
Object Indo-Aryan languages E7769 NE FINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Indo-Aryan languages | Statement: [Eastern South Asia, hasMajorLanguageFamily, Indo-Aryan languages]

Disambiguation candidates (1 decision)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Indo-Aryan languages
Context triple: [Eastern South Asia, hasMajorLanguageFamily, Indo-Aryan languages]
  • A. Indo-Aryan languages chosen
    Indo-Aryan languages are a major branch of the Indo-European family spoken primarily in the Indian subcontinent, including languages such as Hindi, Bengali, Punjabi, and Marathi.
  • B. Insular Indo-Aryan languages
    Insular Indo-Aryan languages are a small branch of the Indo-Aryan language family comprising the Indo-Aryan languages historically spoken on islands of the Indian Ocean, most notably Dhivehi in the Maldives and Sri Lanka.
  • C. Eastern Indo-Aryan languages
    Eastern Indo-Aryan languages are a branch of the Indo-Aryan language family spoken mainly in eastern India, Bangladesh, and Nepal, including major languages such as Bengali, Assamese, and Odia.
  • D. Northern Indo-Aryan languages
    Northern Indo-Aryan languages are a subgroup of the Indo-Aryan branch of the Indo-European language family spoken primarily in the northern regions of the Indian subcontinent, including languages such as Dogri, Kashmiri, and Punjabi.
  • E. Northwestern Indo-Aryan languages
    Northwestern Indo-Aryan languages are a subgroup of the Indo-Aryan branch of the Indo-European language family spoken primarily in northwestern parts of the Indian subcontinent, including languages such as Sindhi, Punjabi, and Lahnda.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

Stage Batch ID Job type Status
creating batch_69d6aacca1048190b39dbbc2174616fa elicitation completed
NER batch_69d7ea4589908190948a8225768e1eec ner completed
NED1 batch_69e55667d4908190b6290135eba41e54 ned_source_triple completed
Created at: April 8, 2026, 9:33 p.m.