Triple

T18204326
Position | Surface form | Disambiguated ID | Type / Status
Subject | DistilBERT | E435865 | entity
Predicate | paperArchive | P48111 | FINISHED
Object | arXiv | | NE NERFINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify whether it is an English named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement in which the phrase occurs as the object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a named entity).
Input
Phrase: arXiv | Statement: [DistilBERT, paperArchive, arXiv]
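The instruction above asks for a JSON object with a string `phrase` and a boolean `is_ne`. As a minimal sketch of how such a reply could be checked before it is stored, the following uses only the Python standard library. The function name `parse_ner_response` and the sample reply are hypothetical, chosen for illustration; the sample is not the logged model output.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Validate a NER reply and return its `is_ne` flag.

    Hypothetical helper: the two field names come from the instruction,
    everything else is an assumption made for this sketch.
    """
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")

    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    # bool is checked explicitly so that "yes", 1, or null are rejected.
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    if phrase != expected_phrase:
        raise ValueError(f"reply is about {phrase!r}, expected {expected_phrase!r}")
    return is_ne


# Illustrative reply for the input shown above (not the logged output).
sample_reply = '{"phrase": "arXiv", "is_ne": true}'
print(parse_ner_response(sample_reply, "arXiv"))  # True
```

Rejecting a reply whose `phrase` does not echo the input is one way to catch answers that were matched to the wrong request within a batch.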
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: arXiv
Context triple: [DistilBERT, paperArchive, arXiv]
  • A. arXiv (chosen)
    arXiv is an open-access repository of electronic preprints in fields such as physics, mathematics, computer science, and related disciplines, widely used by researchers to share and access scientific papers before formal peer-reviewed publication.
  • B. INSPIRE-HEP
    INSPIRE-HEP is a leading digital library and information system for high-energy physics literature, providing comprehensive indexing, citation data, and research tools for the global particle physics community.
  • C. Semantic Scholar
    Semantic Scholar is an AI-powered academic search engine that helps researchers discover and understand scientific literature more efficiently.
  • D. CiteSeerX
    CiteSeerX is a public digital library and search engine that focuses on indexing and providing access to scientific and academic research papers, particularly in computer and information science.
  • E. AAS Open Research
    AAS Open Research is an open-access publishing platform of the American Astronomical Society that supports rapid, transparent dissemination and peer review of research in astronomy and astrophysics.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
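The option list above follows a simple pattern: candidate entities receive letters in order, followed by two fallback options ("None of above." and "Unsure"). A minimal sketch of that lettering, and of mapping a picked letter back to a candidate, might look like the following. The helper names `letter_options` and `resolve_choice` are hypothetical, and how the pipeline actually handles the two fallbacks is not stated in this record; the sketch simply returns `None` for them.

```python
from string import ascii_uppercase

# Fallback labels copied from the option list above.
NONE_OF_ABOVE = "None of above."
UNSURE = "Unsure - the case is ambiguous/there is not enough information to decide."


def letter_options(candidates):
    """Map letters A, B, ... to the candidates, then to the two fallbacks."""
    labels = list(candidates) + [NONE_OF_ABOVE, UNSURE]
    if len(labels) > len(ascii_uppercase):
        raise ValueError("too many options to letter A-Z")
    return dict(zip(ascii_uppercase, labels))


def resolve_choice(options, letter, n_candidates):
    """Return the chosen candidate label, or None if a fallback was picked."""
    letter = letter.strip().upper()
    if letter not in options:
        raise ValueError(f"unknown option {letter!r}")
    index = ascii_uppercase.index(letter)
    return options[letter] if index < n_candidates else None


candidates = ["arXiv", "INSPIRE-HEP", "Semantic Scholar", "CiteSeerX", "AAS Open Research"]
options = letter_options(candidates)
print(resolve_choice(options, "A", len(candidates)))  # arXiv
print(resolve_choice(options, "F", len(candidates)))  # None
```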

Provenance (2 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69d8b90dba6481908e119eb9aa4ca0cb | completed | April 10, 2026, 8:47 a.m.
NER | Named-entity recognition | batch_69e4e222831081908f7d5500424e3acb | completed | April 19, 2026, 2:09 p.m.
Created at: April 10, 2026, 10:32 a.m.
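Because the timestamps in the Provenance table are batch-level, sorting steps by time only orders the batches, not the individual steps inside them. A minimal sketch of parsing the displayed timestamps and ordering the two batches follows. The helper `parse_batch_time` is hypothetical and handles only the exact format shown in this record (for example "April 10, 2026, 8:47 a.m."); it also assumes an English locale for month names.

```python
from datetime import datetime


def parse_batch_time(text: str) -> datetime:
    """Parse a timestamp such as 'April 10, 2026, 8:47 a.m.'.

    strptime's %p expects AM/PM, so the dotted lowercase markers
    are normalized first.
    """
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# (step, timestamp) pairs copied from the Provenance table,
# deliberately listed out of order.
batches = [
    ("NER", "April 19, 2026, 2:09 p.m."),
    ("creating", "April 10, 2026, 8:47 a.m."),
]

ordered = sorted(batches, key=lambda batch: parse_batch_time(batch[1]))
print([step for step, _ in ordered])  # ['creating', 'NER']
```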