Triple
T15475740
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Samogitians |
E376775
|
entity |
| Predicate | dialectGroup |
P1254
|
FINISHED |
| Object | Western Aukštaitian–Samogitian continuum |
E132966
|
NE FINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Western Aukštaitian–Samogitian continuum | Statement: [Samogitians, dialectGroup, Western Aukštaitian–Samogitian continuum]
Disambiguation candidates (1 decision)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Western Aukštaitian–Samogitian continuum Context triple: [Samogitians, dialectGroup, Western Aukštaitian–Samogitian continuum]
-
A.
Samogitian language
chosen
The Samogitian language is a distinct variety of Lithuanian spoken primarily in the Samogitia region, notable for its unique phonetic and grammatical features that set it apart from standard Lithuanian.
-
B.
Old Prussian language
Old Prussian was an extinct West Baltic language once spoken by the Old Prussians in the area of modern-day northeastern Poland, Kaliningrad, and parts of Lithuania.
-
C.
Baltic languages
Baltic languages are an Indo-European language group spoken around the Baltic Sea, including living languages like Lithuanian and Latvian as well as several extinct varieties.
-
D.
Proto-Baltic
Proto-Baltic is the reconstructed common ancestor language of the Baltic branch of the Indo-European family, from which modern Baltic languages such as Lithuanian and Latvian developed.
-
E.
Tat dialect continuum
The Tat dialect continuum is a group of closely related Southwestern Iranian dialects spoken primarily in the eastern Caucasus region, notably in parts of Azerbaijan and Russia.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69d85cd21dcc81908646251b1c26ea00 |
elicitation | completed |
| NER | batch_69e03f6e859481909c3d08343b7ad27c |
ner | completed |
| NED1 | batch_69ff2d093ccc8190aefc355a837c83f4 |
ned_source_triple | completed |
Created at: April 10, 2026, 3:34 a.m.