Triple
T18669196
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Danny Denzongpa |
E456423
|
entity |
| Predicate | ethnicGroup |
P194
|
FINISHED |
| Object | Bhutia |
—
|
NE NERFINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Bhutia | Statement: [Danny Denzongpa, ethnicGroup, Bhutia]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Bhutia Context triple: [Danny Denzongpa, ethnicGroup, Bhutia]
-
A.
Bhutia
chosen
Bhutia is a Sino-Tibetan language spoken primarily by the Bhutia community in the Himalayan regions of India, especially in Sikkim and parts of West Bengal.
-
B.
Khamti
Khamti is a Tai language variety spoken primarily by the Khamti people in parts of northeastern India and northern Myanmar.
-
C.
Mishmi
Mishmi are an indigenous tribal community of the eastern Himalayas, primarily inhabiting the remote, mountainous regions of Arunachal Pradesh in Northeast India.
-
D.
Dholuo
Dholuo is a Nilotic language spoken primarily by the Luo people of western Kenya and parts of Tanzania.
-
E.
Monpa people
The Monpa people are an indigenous ethnic group of the eastern Himalayas, primarily inhabiting Arunachal Pradesh in India and parts of Tibet, known for their Tibetan Buddhist traditions and distinct language and culture.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (2 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69d8d38f72b4819090a935175d9ca8af |
completed | April 10, 2026, 10:40 a.m. |
| NER | Named-entity recognition | batch_69e556b0502881909ea05f2746163746 |
completed | April 19, 2026, 10:26 p.m. |
Created at: April 10, 2026, 11:48 a.m.