Triple
T18016182
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | CelebA |
E431002
|
entity |
| Predicate | isPopularBenchmarkFor |
P23745
|
FINISHED |
| Object | face attribute classification |
—
|
LITERAL FINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's LITERAL type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: face attribute classification | Statement: [CelebA, isPopularBenchmarkFor, face attribute classification]
Disambiguation candidates (1 decision)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
PD
Predicate disambiguation
gpt-5-mini-2025-08-07
Target predicate: isPopularBenchmarkFor Context triple: [CelebA, isPopularBenchmarkFor, face attribute classification]
-
A.
numberOfBenchmarksUsed
Indicates the quantity of distinct benchmarks that are utilized in a given context or evaluation.
-
B.
benchmarkVariant
Indicates that one entity is a specific version or variation of another entity used for benchmarking or performance comparison.
-
C.
benchmarkFamily
Indicates that one entity serves as a benchmark or reference standard for evaluating or comparing another entity within the same family or category.
-
D.
benchmarkFor
chosen
Indicates that one entity serves as a standard or reference point against which the performance, quality, or characteristics of another entity are measured or evaluated.
-
E.
primaryBenchmarkProvider
Indicates that one entity serves as the main or default source of benchmark data or performance standards for another entity.
- F. None of above.
Provenance (3 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69d8b904530081908bf341d842464856 |
elicitation | completed |
| NER | batch_69e4b523f588819097389e067dda7f23 |
ner | completed |
| PD | batch_69e3f904b8048190add43883cd7cb191 |
pd | completed |
Created at: April 10, 2026, 10:24 a.m.