Triple

T8651510
Position Surface form Disambiguated ID Type / Status
Subject CAS Registry Number E205108 entity
Predicate associatedWith P37 FINISHED
Object SciFinder database E204176 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: SciFinder database | Statement: [CAS Registry Number, associatedWith, SciFinder database]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: SciFinder database
Context triple: [CAS Registry Number, associatedWith, SciFinder database]
  • A. SciFinder-n chosen
    SciFinder-n is a comprehensive research discovery platform from Chemical Abstracts Service that enables scientists to search and analyze chemical substances, reactions, literature, and related scientific data.
  • B. Reaxys
    Reaxys is a comprehensive chemistry research database and workflow tool that provides curated information on chemical substances, reactions, and properties for scientists and researchers.
  • C. Chemical Abstracts Service
    Chemical Abstracts Service is a division of the American Chemical Society that provides a comprehensive database of chemical literature and substances, widely used for research and indexing in the chemical sciences.
  • D. CAS References database
    The CAS References database is a comprehensive bibliographic resource curated by the Chemical Abstracts Service that indexes scientific literature and patent references related to chemistry and related disciplines.
  • E. PaperChem
    PaperChem is a specialized database focused on chemical and materials science information related to paper and pulp research, accessible through the Engineering Village platform.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69ca834e56848190abb0eeaec9dedd32 completed March 30, 2026, 2:06 p.m.
NER Named-entity recognition batch_69cc48150e6c8190a7a3b92b4b640858 completed March 31, 2026, 10:17 p.m.
NED1 Entity disambiguation (via context triple) batch_69cf285de8c081908abca2189f206a40 completed April 3, 2026, 2:39 a.m.
Created at: March 30, 2026, 6:29 p.m.