Triple

T4963105
Position | Surface form | Disambiguated ID | Type / Status
Subject | Serengeti National Park | E111455 | entity
Predicate | approximateNumberOfBirdSpecies | P6211 | FINISHED
Object | over 500 | LITERAL | FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch and timestamp of each step are listed in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: over 500 | Statement: [Serengeti National Park, approximateNumberOfBirdSpecies, over 500]
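The instruction above asks for a JSON object with `phrase` and `is_ne`. As a minimal sketch of how such a reply could be checked before it is stored, the snippet below validates both fields. The function name `parse_ner_response` and the sample reply are hypothetical; this page does not record the model's actual output.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Validate a NER reply and return its is_ne flag (hypothetical helper)."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("reply is not a JSON object")

    phrase = data.get("phrase")
    is_ne = data.get("is_ne")

    # The reply must echo the phrase that was sent for classification.
    if not isinstance(phrase, str) or phrase != expected_phrase:
        raise ValueError(f"unexpected phrase: {phrase!r}")
    # is_ne must be a real JSON boolean, not a string such as "false".
    if not isinstance(is_ne, bool):
        raise ValueError(f"is_ne is not a boolean: {is_ne!r}")
    return is_ne


# Assumed reply, for illustration only.
example_reply = '{"phrase": "over 500", "is_ne": false}'
is_named_entity = parse_ner_response(example_reply, "over 500")
```

Rejecting a non-boolean `is_ne` is a design choice in this sketch: it keeps a quoted `"false"` from being treated as truthy downstream.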
PD Predicate disambiguation gpt-5-mini-2025-08-07
Target predicate: approximateNumberOfBirdSpecies
Context triple: [Serengeti National Park, approximateNumberOfBirdSpecies, over 500]
  • A. isMostNumerousBirdSpecies
    Indicates that the subject bird species has the largest population size compared to all other bird species in the relevant context.
  • B. birdDiversity
    Indicates the variety and richness of different bird species present within a given area, community, or dataset.
  • C. numberOfSpecies (chosen)
    Indicates the count of distinct species associated with a given entity or context.
  • D. proportionOfAllBirdSpecies
    Indicates the fraction or percentage that a given subset of bird species represents out of all known bird species.
  • E. hasBirdSpecies
    Indicates that there exists a relationship in which a subject possesses, contains, or is associated with a particular bird species.
  • F. None of above.
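
The lettered layout above can be sketched as follows: candidates receive consecutive letters, a final "None of above." option is appended, and the returned letter is mapped back to a predicate name. The helpers `label_options` and `resolve_choice` are hypothetical names, and the sketch assumes the model replies with a single letter, which this page does not confirm.

```python
from string import ascii_uppercase
from typing import Optional


def label_options(candidates: list[str]) -> dict[str, Optional[str]]:
    """Assign A, B, C, ... to the candidates, then one extra letter for 'none'."""
    if len(candidates) >= len(ascii_uppercase):
        raise ValueError("too many candidates for single-letter labels")
    labelled: dict[str, Optional[str]] = {
        ascii_uppercase[i]: name for i, name in enumerate(candidates)
    }
    # The last letter stands for "None of above." and maps to None.
    labelled[ascii_uppercase[len(candidates)]] = None
    return labelled


def resolve_choice(options: dict[str, Optional[str]], letter: str) -> Optional[str]:
    """Map the picked letter back to a predicate name (None means no match)."""
    key = letter.strip().upper()
    if key not in options:
        raise ValueError(f"unknown option: {letter!r}")
    return options[key]


candidates = [
    "isMostNumerousBirdSpecies",
    "birdDiversity",
    "numberOfSpecies",
    "proportionOfAllBirdSpecies",
    "hasBirdSpecies",
]
options = label_options(candidates)
picked = resolve_choice(options, "C")
```

Mapping the final letter to `None` lets a caller tell "no suitable candidate" apart from a malformed reply, which raises instead.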

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69bd4419393c819086319a6fe4bf8542 | completed | March 20, 2026, 12:56 p.m.
NER | Named-entity recognition | batch_69bd72e49b048190bac55d9e7a6f7963 | completed | March 20, 2026, 4:16 p.m.
PD | Predicate disambiguation | batch_69bd71447fe88190bb62c5e8753da7a7 | completed | March 20, 2026, 4:09 p.m.
Created at: March 20, 2026, 1:32 p.m.
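
Because timestamps are batch-level, the order of rows in the Provenance table need not match the order in which the batches ran. The sketch below parses the "When" values and sorts the steps chronologically. `parse_when` is a hypothetical helper, and the format string assumes English month names and the "a.m." / "p.m." spelling used on this page.

```python
from datetime import datetime


def parse_when(text: str) -> datetime:
    """Parse a value such as 'March 20, 2026, 4:16 p.m.' (hypothetical helper)."""
    # strptime's %p expects AM/PM, so normalise the dotted spelling first.
    normalised = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalised, "%B %d, %Y, %I:%M %p")


# (step, when) pairs copied from the Provenance table, in table order.
rows = [
    ("creating", "March 20, 2026, 12:56 p.m."),
    ("NER", "March 20, 2026, 4:16 p.m."),
    ("PD", "March 20, 2026, 4:09 p.m."),
]

# Steps ordered by when their batch ran.
chronological = [step for step, _ in sorted(rows, key=lambda r: parse_when(r[1]))]
```

For this triple the chronological order is creating, PD, NER, so the predicate batch ran before the NER batch even though the table lists NER first. Values without a minutes part (for example "4 p.m.") would need extra handling that this sketch does not include.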