Triple

T13887860
Position Surface form Disambiguated ID Type / Status
Subject Nicobar tree shrew E333891 entity
Predicate biogeographicRealm P2178 FINISHED
Object Indomalayan realm E135956 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Indomalayan realm | Statement: [Nicobar tree shrew, biogeographicRealm, Indomalayan realm]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Indomalayan realm
Context triple: [Nicobar tree shrew, biogeographicRealm, Indomalayan realm]
  • A. Indomalayan realm chosen
    The Indomalayan realm is a major biogeographic region of South and Southeast Asia characterized by tropical and subtropical forests with high biodiversity and many endemic species.
  • B. South and Southeast Asia
    South and Southeast Asia is a geographically and culturally diverse region spanning the Indian subcontinent and the area east of it through the Malay Archipelago, known for its tropical climates, rich biodiversity, and dense human populations.
  • C. Sundaland
    Sundaland is a biogeographical region of Southeast Asia comprising the Malay Peninsula and the western Indonesian islands, known for its high biodiversity and past exposure as a contiguous landmass during periods of low sea level.
  • D. Australasian realm
    The Australasian realm is a major biogeographic region encompassing Australia, New Guinea, New Zealand, and surrounding islands, characterized by highly distinctive and often endemic flora and fauna.
  • E. South Asia
    South Asia is a culturally and linguistically diverse region of the Asian continent that includes countries such as India, Pakistan, Bangladesh, Nepal, Sri Lanka, Bhutan, and the Maldives.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d81c5dd2d48190b7a5fc1e009de936 completed April 9, 2026, 9:38 p.m.
NER Named-entity recognition batch_69de23a281e481908a6184bcd7f59c03 completed April 14, 2026, 11:23 a.m.
NED1 Entity disambiguation (via context triple) batch_69f7c718140c8190a625da87231ee814 completed May 3, 2026, 10:07 p.m.
Created at: April 9, 2026, 10:15 p.m.