Triple
T5410711
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Cosmicflows-2 data set |
E121004
|
entity |
| Predicate | relatedTo |
P37
|
FINISHED |
| Object |
Cosmicflows project
The Cosmicflows project is an astronomical research initiative that maps the large-scale structure and motions of galaxies in the nearby universe to study cosmic flows and constrain cosmological models.
|
E121004
|
NE FINISHED |
How this triple was built (4 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Cosmicflows project | Statement: [Cosmicflows-2 data set, relatedTo, Cosmicflows project]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Cosmicflows project Context triple: [Cosmicflows-2 data set, relatedTo, Cosmicflows project]
-
A.
Cosmicflows-2 data set
The Cosmicflows-2 data set is a large compilation of galaxy distance and velocity measurements used to map the three-dimensional structure and motions of galaxies in the nearby universe.
-
B.
Zwicky catalog of galaxies
The Zwicky catalog of galaxies is an influential astronomical catalog compiled by Swiss astronomer Fritz Zwicky that systematically lists and classifies thousands of galaxies and galaxy clusters.
-
C.
The Large-Scale Structure of the Universe
The Large-Scale Structure of the Universe is a foundational cosmology book by James Peebles that systematically develops the theory and observations of how matter is distributed on the largest cosmic scales.
-
D.
Shapley Supercluster
The Shapley Supercluster is one of the most massive known concentrations of galaxies in the nearby universe, exerting a significant gravitational influence on surrounding cosmic structures.
-
E.
NASA Extragalactic Database
The NASA Extragalactic Database (NED) is an online astronomical resource that compiles and cross-references data on extragalactic objects such as galaxies, quasars, and clusters from numerous surveys and publications.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg
Description generation
gpt-5.1
Instruction
Generate a one-sentence description of the target entity. You are given a context triple in the form (subject, predicate, object), where the object is the target entity. # Instructions Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. Avoid repeating the information from the triple, unless really essential. # Response Format Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Cosmicflows project Triple: [Cosmicflows-2 data set, relatedTo, Cosmicflows project]
Generated description
The Cosmicflows project is an astronomical research initiative that maps the large-scale structure and motions of galaxies in the nearby universe to study cosmic flows and constrain cosmological models.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Cosmicflows project Target entity description: The Cosmicflows project is an astronomical research initiative that maps the large-scale structure and motions of galaxies in the nearby universe to study cosmic flows and constrain cosmological models.
-
A.
Cosmicflows-2 data set
chosen
The Cosmicflows-2 data set is a large compilation of galaxy distance and velocity measurements used to map the three-dimensional structure and motions of galaxies in the nearby universe.
-
B.
Zwicky catalog of galaxies
The Zwicky catalog of galaxies is an influential astronomical catalog compiled by Swiss astronomer Fritz Zwicky that systematically lists and classifies thousands of galaxies and galaxy clusters.
-
C.
The Large-Scale Structure of the Universe
The Large-Scale Structure of the Universe is a foundational cosmology book by James Peebles that systematically develops the theory and observations of how matter is distributed on the largest cosmic scales.
-
D.
Shapley Supercluster
The Shapley Supercluster is one of the most massive known concentrations of galaxies in the nearby universe, exerting a significant gravitational influence on surrounding cosmic structures.
-
E.
NASA Extragalactic Database
The NASA Extragalactic Database (NED) is an online astronomical resource that compiles and cross-references data on extragalactic objects such as galaxies, quasars, and clusters from numerous surveys and publications.
- F. None of above.
Provenance (5 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69bd463a41cc8190b32ff5af2b96ca93 |
completed | March 20, 2026, 1:06 p.m. |
| NER | Named-entity recognition | batch_69bd87985ee0819092a9a5cd6a948138 |
completed | March 20, 2026, 5:44 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69bf339e02dc8190bb2ca6e0a0ef4621 |
completed | March 22, 2026, 12:11 a.m. |
| NEDg | Description generation | batch_69bf34b0e22c819091aefd30a5e5a13d |
completed | March 22, 2026, 12:15 a.m. |
| NED2 | Entity disambiguation (via description) | batch_69bf35442cf481908053d3645e6e9968 |
completed | March 22, 2026, 12:18 a.m. |
Created at: March 20, 2026, 2:05 p.m.