Triple
T16768104
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Flora Brasiliensis |
E407520
|
entity |
| Predicate | digitalAccessProvider |
P57
|
FINISHED |
| Object |
Biodiversity Heritage Library
The Biodiversity Heritage Library is a large open-access digital library that provides free online access to historical and contemporary literature on biodiversity and natural history from institutions around the world.
|
E1232188
|
NE FINISHED |
How this triple was built (4 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Biodiversity Heritage Library | Statement: [Flora Brasiliensis, digitalAccessProvider, Biodiversity Heritage Library]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Biodiversity Heritage Library Context triple: [Flora Brasiliensis, digitalAccessProvider, Biodiversity Heritage Library]
-
A.
Global Biodiversity Information Facility
The Global Biodiversity Information Facility (GBIF) is an international open-data infrastructure that provides free access to biodiversity occurrence records from institutions and citizen-science projects worldwide.
-
B.
HathiTrust Digital Library
HathiTrust Digital Library is a large-scale collaborative digital repository of scanned books and other materials from academic and research institutions worldwide.
-
C.
Index Herbariorum
Index Herbariorum is an international directory and reference resource that catalogs the world’s herbaria and their associated staff, providing standardized acronyms and institutional information for botanical research.
-
D.
NHM Commons
NHM Commons is a public gathering and exhibition space associated with the Natural History Museums of Los Angeles County, designed to host community programs, events, and educational experiences.
-
E.
Central National Herbarium
The Central National Herbarium is a major Indian repository of preserved plant specimens and taxonomic research, serving as a key reference center for the country's botanical diversity.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg
Description generation
gpt-5.1
Instruction
Generate a one-sentence description of the target entity. You are given a context triple in the form (subject, predicate, object), where the object is the target entity. # Instructions Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. Avoid repeating the information from the triple, unless really essential. # Response Format Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Biodiversity Heritage Library Triple: [Flora Brasiliensis, digitalAccessProvider, Biodiversity Heritage Library]
Generated description
The Biodiversity Heritage Library is a large open-access digital library that provides free online access to historical and contemporary literature on biodiversity and natural history from institutions around the world.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Biodiversity Heritage Library Target entity description: The Biodiversity Heritage Library is a large open-access digital library that provides free online access to historical and contemporary literature on biodiversity and natural history from institutions around the world.
-
A.
Global Biodiversity Information Facility
The Global Biodiversity Information Facility (GBIF) is an international open-data infrastructure that provides free access to biodiversity occurrence records from institutions and citizen-science projects worldwide.
-
B.
HathiTrust Digital Library
HathiTrust Digital Library is a large-scale collaborative digital repository of scanned books and other materials from academic and research institutions worldwide.
-
C.
Index Herbariorum
Index Herbariorum is an international directory and reference resource that catalogs the world’s herbaria and their associated staff, providing standardized acronyms and institutional information for botanical research.
-
D.
NHM Commons
NHM Commons is a public gathering and exhibition space associated with the Natural History Museums of Los Angeles County, designed to host community programs, events, and educational experiences.
-
E.
Central National Herbarium
The Central National Herbarium is a major Indian repository of preserved plant specimens and taxonomic research, serving as a key reference center for the country's botanical diversity.
- F. None of above. chosen
Provenance (5 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69d8839174188190909f190097207065 |
completed | April 10, 2026, 4:58 a.m. |
| NER | Named-entity recognition | batch_69e3b0349bc88190938750f1e5af192a |
completed | April 18, 2026, 4:24 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_6a00a533e83481909966a7b86c8c8e64 |
completed | May 10, 2026, 3:33 p.m. |
| NEDg | Description generation | batch_6a00a6d6a6d08190b103c2dfd30f0e28 |
completed | May 10, 2026, 3:40 p.m. |
| NED2 | Entity disambiguation (via description) | batch_6a00a749f2688190af57bd13b9dedeb1 |
completed | May 10, 2026, 3:42 p.m. |
Created at: April 10, 2026, 5:21 a.m.