Triple

T16610576
Position | Surface form | Disambiguated ID | Type / Status
Subject | Anglo-Indians | E403554 | entity
Predicate | typicalSurnamesInclude | P15990 | FINISHED
Object | Smith | E30542 | NE FINISHED
Object description: Smith is a common English-language surname widely used in many countries and communities, including among Anglo-Indians.

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick marked "chosen"), and the generated description. The batch and timestamp of each step are in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Smith | Statement: [Anglo-Indians, typicalSurnamesInclude, Smith]
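The instruction above asks for a JSON object with two fields, `phrase` and `is_ne`. The raw model reply is not recorded on this page, so the following is only a minimal sketch of how such a reply could be validated. The names `NerResult` and `parse_ner_response` are hypothetical, and the example reply is illustrative rather than the actual output.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class NerResult:
    """Hypothetical container for the two fields the NER instruction requests."""
    phrase: str
    is_ne: bool


def parse_ner_response(raw: str) -> NerResult:
    """Parse a JSON reply and check that both requested fields have the right type."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    return NerResult(phrase=phrase, is_ne=is_ne)


# Illustrative reply only; the recorded model output is not shown on this page.
example = parse_ner_response('{"phrase": "Smith", "is_ne": true}')
```

Rejecting a non-boolean `is_ne` (for example the string "yes") keeps a loosely formatted reply from being silently treated as a positive classification.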
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Smith
Context triple: [Anglo-Indians, typicalSurnamesInclude, Smith]
  • A. John
    John is the given name of John Thomas Romney Robinson, a 19th-century Irish astronomer and physicist known for his work on stellar magnitudes and the Robinson anemometer.
  • B. John
    John is the given name of John Purdue, the 19th-century American industrialist and primary benefactor of Purdue University.
  • C. John
    John is the given name of John Egerton, 2nd Duke of Bridgewater, an 18th-century English nobleman known for pioneering canal construction during the Industrial Revolution.
  • D. John
    John of Cornwall was a 12th-century Cornish scholar and theologian known for his Latin writings and contributions to medieval intellectual life.
  • E. John
    John Crichton-Stuart, 3rd Marquess of Bute, was a 19th-century Scottish aristocrat and industrialist best known for transforming Cardiff into a major coal-exporting port and for his extensive architectural patronage.
  • F. None of above. (chosen)
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Smith
Triple: [Anglo-Indians, typicalSurnamesInclude, Smith]
Generated description
Smith is a common English-language surname widely used in many countries and communities, including among Anglo-Indians.
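The response format above requires the reply to be a single line starting with "Description:". A minimal sketch of stripping that prefix follows, assuming the raw reply follows the format exactly. The function name `extract_description` is hypothetical, and the raw reply string is reconstructed from the generated description shown here, not copied from a log.

```python
import re

# Matches "Description: <text>", tolerating surrounding whitespace.
_DESCRIPTION_RE = re.compile(r"^\s*Description:\s*(.+?)\s*$", re.DOTALL)


def extract_description(reply: str) -> str:
    """Return the sentence that follows the required 'Description:' prefix."""
    match = _DESCRIPTION_RE.match(reply)
    if match is None:
        raise ValueError("reply does not follow the 'Description: ...' format")
    return match.group(1)


# Assumed raw reply, rebuilt from the generated description on this page.
raw_reply = (
    "Description: Smith is a common English-language surname widely used in "
    "many countries and communities, including among Anglo-Indians."
)
description = extract_description(raw_reply)
```

Raising on a missing prefix, rather than returning the reply unchanged, makes format drift visible before the description is passed on to the NED2 step.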
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Smith
Target entity description: Smith is a common English-language surname widely used in many countries and communities, including among Anglo-Indians.
  • A. Smith (chosen)
    Smith is a common English surname borne by numerous notable individuals across diverse fields such as politics, arts, sports, and academia.
  • B. John
    John I, Count of Holland, was a medieval nobleman who ruled the County of Holland at the turn of the 14th century.
  • C. John
    John Brabourne was a British film and television producer and peer, known for producing works such as the 1979 adaptation of "Murder on the Orient Express."
  • D. John
    John is the given name of John Bowen, a British novelist and playwright known for his crime and speculative fiction.
  • E. John
    John is the given name of John Boyle O'Reilly, a 19th-century Irish-born poet, journalist, and civil rights activist who became influential in the United States.
  • F. None of above.

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level: stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in chronological order, but predicate and elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69d883880d0c81908b5fcd454e767b60 | completed | April 10, 2026, 4:58 a.m.
NER | Named-entity recognition | batch_69e3609572508190a5d7e6c3e0a8cf95 | completed | April 18, 2026, 10:44 a.m.
NED1 | Entity disambiguation (via context triple) | batch_6a007dad10ec8190b41d82b38fcd4dae | completed | May 10, 2026, 12:44 p.m.
NEDg | Description generation | batch_6a007e9ae6c881909c78906e59b08d49 | completed | May 10, 2026, 12:48 p.m.
NED2 | Entity disambiguation (via description) | batch_6a007f3bf6e081908554238d069d9abc | completed | May 10, 2026, 12:51 p.m.
Created at: April 10, 2026, 5:17 a.m.
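The note above the table says the object chain (NER → NED1 → NEDg → NED2) reads in order by batch timestamp. A minimal sketch of checking that claim against the four timestamps listed follows. The helper `parse_when` is hypothetical, and the parsing assumes an English (or default C) locale for month names and AM/PM markers.

```python
from datetime import datetime


def parse_when(text: str) -> datetime:
    """Parse a timestamp written like 'May 10, 2026, 12:44 p.m.'."""
    # strptime's %p expects AM/PM, so normalize the dotted lowercase form first.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# Object-chain rows copied from the Provenance table.
OBJECT_CHAIN = [
    ("NER", "April 18, 2026, 10:44 a.m."),
    ("NED1", "May 10, 2026, 12:44 p.m."),
    ("NEDg", "May 10, 2026, 12:48 p.m."),
    ("NED2", "May 10, 2026, 12:51 p.m."),
]

times = [parse_when(when) for _, when in OBJECT_CHAIN]

# True when each step's batch ran no earlier than the previous step's batch.
chain_in_order = all(a <= b for a, b in zip(times, times[1:]))
```

The elicitation batch is left out of the check on purpose, since the table note says predicate and elicitation batches may belong to a different wave.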