Triple

T18204457
Position Surface form Disambiguated ID Type / Status
Subject BART E435868 entity
Predicate hasVariant P455 FINISHED
Object BART-large-XSum NE NERFINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: BART-large-XSum | Statement: [BART, hasVariant, BART-large-XSum]

Disambiguation candidates (2 decisions)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: BART-large-XSum
Context triple: [BART, hasVariant, BART-large-XSum]
  • A. mBART
    mBART is a multilingual sequence-to-sequence Transformer model designed for tasks like machine translation and text generation across many languages.
  • B. Longformer
    Longformer is a transformer-based neural network architecture designed for efficient processing of very long sequences using sparse attention mechanisms.
  • C. Maximal Marginal Relevance (MMR) for information retrieval and summarization
    Maximal Marginal Relevance (MMR) is an information retrieval and summarization technique that selects results by jointly maximizing relevance to a query while minimizing redundancy among the chosen items.
  • D. BERT
    BERT is a widely used transformer-based language model developed by Google that learns deep bidirectional representations of text for tasks like question answering and text classification.
  • E. Unitxt
    Unitxt is an experimental electronic music album by German artist Alva Noto, known for its minimalist, glitch-based sound design and conceptual approach to rhythm and data.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: BART-large-XSum
Target entity description: BART-large-XSum is a fine-tuned variant of Facebook AI’s BART-large model specialized for abstractive summarization on the XSum (Extreme Summarization) dataset.
  • A. mBART
    mBART is a multilingual sequence-to-sequence Transformer model designed for tasks like machine translation and text generation across many languages.
  • B. Longformer
    Longformer is a transformer-based neural network architecture designed for efficient processing of very long sequences using sparse attention mechanisms.
  • C. Maximal Marginal Relevance (MMR) for information retrieval and summarization
    Maximal Marginal Relevance (MMR) is an information retrieval and summarization technique that selects results by jointly maximizing relevance to a query while minimizing redundancy among the chosen items.
  • D. BERT
    BERT is a widely used transformer-based language model developed by Google that learns deep bidirectional representations of text for tasks like question answering and text classification.
  • E. Unitxt
    Unitxt is an experimental electronic music album by German artist Alva Noto, known for its minimalist, glitch-based sound design and conceptual approach to rhythm and data.
  • F. None of above. chosen

Provenance (2 batches)

Stage Batch ID Job type Status
creating batch_69d8b90dba6481908e119eb9aa4ca0cb elicitation completed
NER batch_69e4e222831081908f7d5500424e3acb ner completed
Created at: April 10, 2026, 10:32 a.m.