Triple
T9110215
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Hard to Say I'm Sorry |
E218580
|
entity |
| Predicate | chartName |
P23845
|
FINISHED |
| Object | Billboard Hot 100 |
E5595
|
NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Billboard Hot 100 | Statement: [Hard to Say I'm Sorry, chartName, Billboard Hot 100]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Billboard Hot 100 Context triple: [Hard to Say I'm Sorry, chartName, Billboard Hot 100]
-
A.
U.S. Billboard Hot 100
chosen
The U.S. Billboard Hot 100 is the premier American music industry chart that ranks the most popular songs each week based on a combination of sales, radio airplay, and streaming data.
-
B.
Billboard
"Billboard" is a prominent abstract expressionist painting by American artist Grace Hartigan, reflecting her dynamic style and engagement with contemporary culture.
-
C.
Billboard
Billboard is an American entertainment media brand best known for its music charts, industry news, and analysis of trends in the recording industry.
-
D.
Billboard 200
Billboard 200 is a weekly U.S. music industry chart that ranks the most popular albums across all genres based on sales, streaming, and other consumption metrics.
-
E.
American Top 40
American Top 40 is a long-running syndicated radio countdown show that ranks and showcases the most popular songs in the United States each week.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69ca83dc94ac8190b9ef42684d36ff39 |
completed | March 30, 2026, 2:08 p.m. |
| NER | Named-entity recognition | batch_69cca845d9b0819084230e7cdd92dee0 |
completed | April 1, 2026, 5:08 a.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69d03038dba48190991cb76576349bc3 |
completed | April 3, 2026, 9:25 p.m. |
Created at: March 30, 2026, 7:16 p.m.