Triple

T16768104
Position Surface form Disambiguated ID Type / Status
Subject Flora Brasiliensis E407520 entity
Predicate digitalAccessProvider P57 FINISHED
Object Biodiversity Heritage Library
The Biodiversity Heritage Library is a large open-access digital library that provides free online access to historical and contemporary literature on biodiversity and natural history from institutions around the world.
E1232188 NE FINISHED

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Biodiversity Heritage Library | Statement: [Flora Brasiliensis, digitalAccessProvider, Biodiversity Heritage Library]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Biodiversity Heritage Library
Context triple: [Flora Brasiliensis, digitalAccessProvider, Biodiversity Heritage Library]
  • A. Global Biodiversity Information Facility
    The Global Biodiversity Information Facility (GBIF) is an international open-data infrastructure that provides free access to biodiversity occurrence records from institutions and citizen-science projects worldwide.
  • B. HathiTrust Digital Library
    HathiTrust Digital Library is a large-scale collaborative digital repository of scanned books and other materials from academic and research institutions worldwide.
  • C. Index Herbariorum
    Index Herbariorum is an international directory and reference resource that catalogs the world’s herbaria and their associated staff, providing standardized acronyms and institutional information for botanical research.
  • D. NHM Commons
    NHM Commons is a public gathering and exhibition space associated with the Natural History Museums of Los Angeles County, designed to host community programs, events, and educational experiences.
  • E. Central National Herbarium
    The Central National Herbarium is a major Indian repository of preserved plant specimens and taxonomic research, serving as a key reference center for the country's botanical diversity.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Biodiversity Heritage Library
Triple: [Flora Brasiliensis, digitalAccessProvider, Biodiversity Heritage Library]
Generated description
The Biodiversity Heritage Library is a large open-access digital library that provides free online access to historical and contemporary literature on biodiversity and natural history from institutions around the world.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Biodiversity Heritage Library
Target entity description: The Biodiversity Heritage Library is a large open-access digital library that provides free online access to historical and contemporary literature on biodiversity and natural history from institutions around the world.
  • A. Global Biodiversity Information Facility
    The Global Biodiversity Information Facility (GBIF) is an international open-data infrastructure that provides free access to biodiversity occurrence records from institutions and citizen-science projects worldwide.
  • B. HathiTrust Digital Library
    HathiTrust Digital Library is a large-scale collaborative digital repository of scanned books and other materials from academic and research institutions worldwide.
  • C. Index Herbariorum
    Index Herbariorum is an international directory and reference resource that catalogs the world’s herbaria and their associated staff, providing standardized acronyms and institutional information for botanical research.
  • D. NHM Commons
    NHM Commons is a public gathering and exhibition space associated with the Natural History Museums of Los Angeles County, designed to host community programs, events, and educational experiences.
  • E. Central National Herbarium
    The Central National Herbarium is a major Indian repository of preserved plant specimens and taxonomic research, serving as a key reference center for the country's botanical diversity.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d8839174188190909f190097207065 completed April 10, 2026, 4:58 a.m.
NER Named-entity recognition batch_69e3b0349bc88190938750f1e5af192a completed April 18, 2026, 4:24 p.m.
NED1 Entity disambiguation (via context triple) batch_6a00a533e83481909966a7b86c8c8e64 completed May 10, 2026, 3:33 p.m.
NEDg Description generation batch_6a00a6d6a6d08190b103c2dfd30f0e28 completed May 10, 2026, 3:40 p.m.
NED2 Entity disambiguation (via description) batch_6a00a749f2688190af57bd13b9dedeb1 completed May 10, 2026, 3:42 p.m.
Created at: April 10, 2026, 5:21 a.m.