Santalī
E199649
Santalī is an Austroasiatic language spoken primarily by the Santal people in eastern India, Bangladesh, Nepal, and Bhutan.
All labels observed (2)
How this entity was disambiguated
This entity first appeared as the object of triple T1770040; resolving that mention fixed its identity. The disambiguator weighed the candidate entities listed below and picked the one marked “chosen” (or “None”, which mints a new entity). This is how homonymy is resolved: the same surface form can point to different entities.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Santalī
Context triple: [Santhali, hasAlternativeName, Santalī]
- A. Anusapati: Anusapati was a 13th-century Javanese king who ruled the Singhasari Kingdom in East Java, Indonesia.
- B. Wiradhuri: Wiradhuri refers to the Wiradjuri people, one of the largest Aboriginal groups in New South Wales, Australia, known for their rich cultural traditions and strong connection to the land.
- C. Gajanana: Gajanana is a revered aspect of the Hindu deity Ganesha, emphasizing his elephant-faced form and role as the remover of obstacles and bestower of wisdom.
- D. Pranhita: Pranhita is a major river in central India that flows through the states of Maharashtra and Telangana before joining the Godavari River.
- E. Saketa: Saketa is the ancient name of the historic Indian city now known as Ayodhya, a major cultural and religious center in Hindu tradition.
- F. None of the above. (chosen)
- G. Unsure: the case is ambiguous or there is not enough information to decide.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Santalī
Target entity description: Santalī is an Austroasiatic language spoken primarily by the Santal people in eastern India, Bangladesh, Nepal, and Bhutan.
- A. Anusapati: Anusapati was a 13th-century Javanese king who ruled the Singhasari Kingdom in East Java, Indonesia.
- B. Wiradhuri: Wiradhuri refers to the Wiradjuri people, one of the largest Aboriginal groups in New South Wales, Australia, known for their rich cultural traditions and strong connection to the land.
- C. Gajanana: Gajanana is a revered aspect of the Hindu deity Ganesha, emphasizing his elephant-faced form and role as the remover of obstacles and bestower of wisdom.
- D. Pranhita: Pranhita is a major river in central India that flows through the states of Maharashtra and Telangana before joining the Godavari River.
- E. Saketa: Saketa is the ancient name of the historic Indian city now known as Ayodhya, a major cultural and religious center in Hindu tradition.
- F. None of the above. (chosen)
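The mapping from a disambiguator answer to an entity can be sketched as follows. This is a minimal illustration, not the pipeline's actual code: the `Candidate` class, the `resolve_choice` function, the `cand-…` and `NEW…` identifier formats, and the convention that the letter after "None of the above" means "Unsure" are all assumptions made for the example.

```python
from dataclasses import dataclass
from itertools import count
from typing import Optional


@dataclass(frozen=True)
class Candidate:
    entity_id: str  # hypothetical identifier format
    label: str


_fresh_ids = count(1)  # hypothetical source of ids for newly minted entities


def resolve_choice(
    choice: str, candidates: list[Candidate], surface_form: str
) -> Optional[Candidate]:
    """Turn an answer letter into an entity.

    Letters A.. select an existing candidate, the next letter means
    "None of the above" (mint a new entity labelled with the surface form),
    and the letter after that means "Unsure" (no decision, returns None).
    """
    if len(choice) != 1 or not choice.isalpha():
        raise ValueError(f"expected a single letter, got {choice!r}")
    index = ord(choice.upper()) - ord("A")
    if 0 <= index < len(candidates):
        return candidates[index]
    if index == len(candidates):
        return Candidate(entity_id=f"NEW{next(_fresh_ids)}", label=surface_form)
    if index == len(candidates) + 1:
        return None
    raise ValueError(f"answer {choice!r} is outside the option list")


# The five candidates shown above; answer "F" mints a new entity.
candidates = [
    Candidate(f"cand-{letter}", name)
    for letter, name in zip(
        "ABCDE", ["Anusapati", "Wiradhuri", "Gajanana", "Pranhita", "Saketa"]
    )
]
entity = resolve_choice("F", candidates, "Santalī")
print(entity.label)
```

In both records above the answer was F, so under this sketch a fresh entity labelled "Santalī" would be created rather than merged into any listed candidate.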
Statements (48)
| Predicate | Object |
|---|---|
| instanceOf | Austroasiatic language; Munda language; language |
| basicWordOrder | SOV |
| closelyRelatedTo | Ho language; Mundari |
| hasAlternativeName | Santhali (surface form: Santali language); Santhali |
| hasDialects | Khortha-influenced Santalī; Malto-influenced Santalī; Santhali (surface form: Manbhum Santalī) |
| hasEthnicity | Santalī (self-link; surface form: Santal) |
| hasGlottocode | sant1410 |
| hasNativeName | Santhali (surface form: Santali) |
| hasOfficialStatusIn | Assam; Jharkhand; Orissa (surface form: Odisha); West Bengal |
| hasPhonologicalFeature | contrastive nasalization; tone-like pitch distinctions |
| hasScript | Ol Chiki script (surface form: Ol Chiki) |
| hasStandardForm | standard Santalī |
| ISO639-2 | sat |
| ISO639-3 | sat |
| languageFamily | Austroasiatic |
| recognizedAs | scheduled language of India |
| region | Assam; Bihar; Jharkhand; Orissa (surface form: Odisha); West Bengal; eastern India |
| spokenBy | Santal people |
| spokenIn | People's Republic of Bangladesh (from East Pakistan) (surface form: Bangladesh); Bhutan; India; Nepal |
| subfamily | Munda |
| typology | agglutinative language |
| usedFor | folksongs; oral literature; primary education in some regions; religious rituals |
| writingSystem | Bengali script; Devanagari script; Latin alphabet (surface form: Latin script); Odia script; Ol Chiki script |
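The "surface form" notes in the table record the string used in the original triple when it differs from the canonical label of the entity it resolved to. A minimal sketch of that annotation rule, with the hypothetical helper name `render_object` and an output format chosen only for illustration:

```python
def render_object(canonical_label: str, surface_form: str) -> str:
    """Show the canonical label, adding the surface form only when it differs."""
    if surface_form == canonical_label:
        return canonical_label
    return f"{canonical_label} (surface form: {surface_form})"


# "Odisha" in the generated triple resolved to the entity labelled "Orissa".
print(render_object("Orissa", "Odisha"))
print(render_object("Bhutan", "Bhutan"))
```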
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
Instruction
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10.
# Requirements
- If you don't know the subject at all, return an empty list.
- If the subject is not a named entity, return an empty list.
- Include at least one triple where predicate is "instanceOf".
- Do not get too wordy.
- Separate several objects into multiple triples with one object.
Input
Subject: Santalī
Description of subject: Santalī is an Austroasiatic language spoken primarily by the Santal people in eastern India, Bangladesh, Nepal, and Bhutan.
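A response to this instruction can be checked mechanically against the stated requirements. The sketch below is one possible validator, not part of the documented pipeline: the function name `parse_triples` and the decision to reject extra keys are assumptions, and the two sample triples are taken from the statements listed for this entity.

```python
import json

REQUIRED_KEYS = {"subject", "predicate", "object"}


def parse_triples(raw: str) -> list[dict[str, str]]:
    """Parse a model response and enforce the instruction's structural rules."""
    data = json.loads(raw)
    if not isinstance(data, list):
        raise ValueError("expected a JSON list of triples")
    if not data:
        # An empty list is valid: unknown subject or not a named entity.
        return []
    for item in data:
        if not isinstance(item, dict) or set(item) != REQUIRED_KEYS:
            raise ValueError(f"malformed triple: {item!r}")
    if not any(item["predicate"] == "instanceOf" for item in data):
        raise ValueError("at least one instanceOf triple is required")
    return data


response = json.dumps(
    [
        {"subject": "Santalī", "predicate": "instanceOf", "object": "Munda language"},
        {"subject": "Santalī", "predicate": "ISO639-3", "object": "sat"},
    ],
    ensure_ascii=False,
)
triples = parse_triples(response)
print(len(triples))
```

The requirement "one object per triple" is not checked here, since a plain string object cannot be reliably told apart from a list that the model packed into one string.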
Referenced by (3)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity (surface form: Santal)
this entity (surface form: Santal)