Triple
T8805843
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | San Martin, California |
E209527
|
entity |
| Predicate | nearestCityNorth |
P24509
|
FINISHED |
| Object | Morgan Hill, California |
E155307
|
NE FINISHED |
How this triple was built (3 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Morgan Hill, California | Statement: [San Martin, California, nearestCityNorth, Morgan Hill, California]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Morgan Hill, California Context triple: [San Martin, California, nearestCityNorth, Morgan Hill, California]
-
A.
Morgan Hill
chosen
Morgan Hill is a suburban city in California’s Silicon Valley region, known for its affluent residential communities, wineries, and proximity to high-tech industry centers.
-
B.
Moreno Valley, California
Moreno Valley, California is a large suburban city in Riverside County known for its rapid growth, diverse population, and role as a residential and logistics hub in Southern California’s Inland Empire region.
-
C.
Portola Hills, California
Portola Hills, California is a residential community in the foothills of the Santa Ana Mountains within the city of Lake Forest in Orange County.
-
D.
Moraga, California
Moraga, California is a suburban town in the San Francisco Bay Area known for its residential character and as the home of Saint Mary's College of California.
-
E.
Mill Valley, California
Mill Valley, California is a small, affluent city in Marin County just north of San Francisco, known for its redwood forests, scenic hills, and vibrant arts and outdoor culture.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
PD
Predicate disambiguation
gpt-5-mini-2025-08-07
Target predicate: nearestCityNorth Context triple: [San Martin, California, nearestCityNorth, Morgan Hill, California]
-
A.
nearestLargeCityNorth
Indicates that one city is the closest large city located to the north of another city.
-
B.
nearestMajorTownOnNorthSide
chosen
Indicates that one entity is the closest significant town located to the north of another entity.
-
C.
nearestMajorCity
Indicates that one city is the closest significant urban center to another location or city compared to all other major cities.
-
D.
isNorthernmostCityOf
Indicates that a city is the one located furthest to the north within a specified region, area, or set of cities.
-
E.
majorCityNearMouth
Indicates that a major city is located close to the mouth (outflow point) of a river.
- F. None of above.
Provenance (4 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69ca836320e48190b5cf585b90a322c4 |
completed | March 30, 2026, 2:06 p.m. |
| NER | Named-entity recognition | batch_69cc5fd04eb88190acc4e085d82016c0 |
completed | March 31, 2026, 11:59 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69cf8921a13081909346b97c024110b6 |
completed | April 3, 2026, 9:32 a.m. |
| PD | Predicate disambiguation | batch_69cc5c1f28ec8190a34311cb412920c2 |
completed | March 31, 2026, 11:43 p.m. |
Created at: March 30, 2026, 6:45 p.m.