Triple

T16610572
Position Surface form Disambiguated ID Type / Status
Subject Anglo-Indians E403554 entity
Predicate typicalSurnamesInclude P15990 FINISHED
Object D'Souza
D'Souza is a common surname of Portuguese origin frequently associated with Anglo-Indian and Goan Catholic communities.
E1222938 NE FINISHED

How this triple was built (5 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: D'Souza | Statement: [Anglo-Indians, typicalSurnamesInclude, D'Souza]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: D'Souza
Context triple: [Anglo-Indians, typicalSurnamesInclude, D'Souza]
  • A. Dinesh D’Souza
    Dinesh D’Souza is a conservative political commentator, author, and filmmaker known for his controversial critiques of liberalism and American politics.
  • B. Michael Allen Jones
    Michael Allen Jones is an American rapper better known by his stage name Mike Jones, who gained mainstream popularity in the mid-2000s with hits like "Still Tippin'" and "Back Then."
  • C. Alex Jones
    Alex Jones is a controversial American radio host and conspiracy theorist best known for founding the website InfoWars and promoting numerous debunked claims.
  • D. Larry Klayman
    Larry Klayman is an American lawyer and conservative activist best known as the founder of the watchdog organization Judicial Watch.
  • E. Kevin Dobson
    Kevin Dobson was an American actor best known for his television roles in series such as Kojak and Knots Landing.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: D'Souza
Triple: [Anglo-Indians, typicalSurnamesInclude, D'Souza]
Generated description
D'Souza is a common surname of Portuguese origin frequently associated with Anglo-Indian and Goan Catholic communities.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: D'Souza
Target entity description: D'Souza is a common surname of Portuguese origin frequently associated with Anglo-Indian and Goan Catholic communities.
  • A. Dinesh D’Souza
    Dinesh D’Souza is a conservative political commentator, author, and filmmaker known for his controversial critiques of liberalism and American politics.
  • B. Michael Allen Jones
    Michael Allen Jones is an American rapper better known by his stage name Mike Jones, who gained mainstream popularity in the mid-2000s with hits like "Still Tippin'" and "Back Then."
  • C. Alex Jones
    Alex Jones is a controversial American radio host and conspiracy theorist best known for founding the website InfoWars and promoting numerous debunked claims.
  • D. Larry Klayman
    Larry Klayman is an American lawyer and conservative activist best known as the founder of the watchdog organization Judicial Watch.
  • E. Kevin Dobson
    Kevin Dobson was an American actor best known for his television roles in series such as Kojak and Knots Landing.
  • F. None of above. chosen
PD Predicate disambiguation gpt-5-mini-2025-08-07
Target predicate: typicalSurnamesInclude
Context triple: [Anglo-Indians, typicalSurnamesInclude, D'Souza]
  • A. isAmongMostCommonSurnamesIn
    Indicates that a surname ranks within the group of most frequently occurring surnames in a specified region or population.
  • B. usedAsSurnameInCountry chosen
    Indicates that a particular name functions as a family surname within the specified country.
  • C. familyNameSuffix
    Indicates that one entity is the suffix portion (such as “Jr.” or “III”) of another entity’s family name.
  • D. languageOfSurnameVariants
    Indicates the language in which particular surname variants are used or originate.
  • E. familyNameIn
    Indicates that an entity has a specified family name (surname) in a particular language or cultural context.
  • F. None of above.

Provenance (6 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d883880d0c81908b5fcd454e767b60 completed April 10, 2026, 4:58 a.m.
NER Named-entity recognition batch_69e3609572508190a5d7e6c3e0a8cf95 completed April 18, 2026, 10:44 a.m.
NED1 Entity disambiguation (via context triple) batch_6a0075aca5c0819092637e0d83ce8ac0 completed May 10, 2026, 12:10 p.m.
NEDg Description generation batch_6a00783fcde08190963ce80fcf4aac90 completed May 10, 2026, 12:21 p.m.
NED2 Entity disambiguation (via description) batch_6a0078a78ee08190887e93c08edbaead completed May 10, 2026, 12:23 p.m.
PD Predicate disambiguation batch_69e296aabc508190b3836a91b49113ad completed April 17, 2026, 8:23 p.m.
Created at: April 10, 2026, 5:17 a.m.