Triple

T17577242
Position | Surface form | Disambiguated ID | Type / Status
Subject | Joumine | E428102 | entity
Predicate | languageUsed | P238 | FINISHED
Object | Tunisian Arabic | | NE NERFINISHED

How this triple was built (3 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick marked "chosen"), and the generated description. The batch and timestamp of each step are in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Tunisian Arabic | Statement: [Joumine, languageUsed, Tunisian Arabic]
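The instruction above asks for a JSON object with `phrase` and `is_ne`. As a minimal sketch of how such a reply could be checked, the snippet below parses and validates it. The function name `parse_ner_response` and the sample reply are hypothetical; the actual model output is not shown on this page.

```python
import json


def parse_ner_response(raw: str) -> tuple[str, bool]:
    """Parse a reply shaped like the NER instruction requests.

    Hypothetical helper: checks only the two fields the instruction names.
    """
    data = json.loads(raw)
    phrase = data["phrase"]
    is_ne = data["is_ne"]
    # The instruction specifies a string and a boolean; reject anything else.
    if not isinstance(phrase, str) or not isinstance(is_ne, bool):
        raise ValueError("expected `phrase` as string and `is_ne` as boolean")
    return phrase, is_ne


# Assumed reply for the input above (illustrative, not taken from the log).
example_reply = '{"phrase": "Tunisian Arabic", "is_ne": true}'
print(parse_ner_response(example_reply))
```

Rejecting a non-boolean `is_ne` (for example the string "true") keeps a malformed reply from being silently treated as a positive classification.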
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Tunisian Arabic
Context triple: [Joumine, languageUsed, Tunisian Arabic]
  • A. Maghrebi Arabic
    Maghrebi Arabic is a group of closely related Arabic dialects spoken in North Africa, particularly in countries like Morocco, Algeria, Tunisia, Libya, and Mauritania, characterized by significant Berber, French, and other linguistic influences.
  • B. Hassaniya Arabic
    Hassaniya Arabic is a variety of Arabic spoken primarily in Mauritania and parts of neighboring West African and Saharan countries, known for its Bedouin roots and distinctive phonology and vocabulary.
  • C. Ayt Seghrouchen dialect
    The Ayt Seghrouchen dialect is a variety of Central Atlas Tamazight spoken by the Ayt Seghrouchen Amazigh community in north-central Morocco.
  • D. Souss dialect
    Souss dialect is a regional variety of the Tashelhit (Shilha) Berber language spoken primarily in Morocco’s Souss region.
  • E. Levantine Arabic
    Levantine Arabic is a major colloquial variety of Arabic spoken primarily in the Eastern Mediterranean region, including countries such as Lebanon, Syria, Jordan, and Palestine.
  • F. None of above. (chosen)
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Tunisian Arabic
Target entity description: Tunisian Arabic is a Maghrebi Arabic dialect spoken primarily in Tunisia, characterized by significant Berber, French, and Italian influences.
  • A. Maghrebi Arabic (chosen)
    Maghrebi Arabic is a group of closely related Arabic dialects spoken in North Africa, particularly in countries like Morocco, Algeria, Tunisia, Libya, and Mauritania, characterized by significant Berber, French, and other linguistic influences.
  • B. Hassaniya Arabic
    Hassaniya Arabic is a variety of Arabic spoken primarily in Mauritania and parts of neighboring West African and Saharan countries, known for its Bedouin roots and distinctive phonology and vocabulary.
  • C. Ayt Seghrouchen dialect
    The Ayt Seghrouchen dialect is a variety of Central Atlas Tamazight spoken by the Ayt Seghrouchen Amazigh community in north-central Morocco.
  • D. Souss dialect
    Souss dialect is a regional variety of the Tashelhit (Shilha) Berber language spoken primarily in Morocco’s Souss region.
  • E. Levantine Arabic
    Levantine Arabic is a major colloquial variety of Arabic spoken primarily in the Eastern Mediterranean region, including countries such as Lebanon, Syria, Jordan, and Palestine.
  • F. None of above.
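The two disambiguation steps above show the same five candidates as lettered options, followed by a fallback option. A rough sketch of how a picked letter could be mapped back to a candidate is given below. The helper names `letter_options` and `resolve_pick` are assumptions for illustration, not the pipeline's actual code, and the "Unsure" option from NED1 is left out for brevity.

```python
from string import ascii_uppercase

NONE_OF_ABOVE = "None of above."


def letter_options(candidates, extra=(NONE_OF_ABOVE,)):
    """Assign letters A, B, C, ... to the candidates, then to the fallback options."""
    labels = list(candidates) + list(extra)
    return dict(zip(ascii_uppercase, labels))


def resolve_pick(options, letter, candidates):
    """Return the chosen candidate, or None when a fallback option was picked."""
    label = options[letter]
    return label if label in candidates else None


candidates = [
    "Maghrebi Arabic",
    "Hassaniya Arabic",
    "Ayt Seghrouchen dialect",
    "Souss dialect",
    "Levantine Arabic",
]
options = letter_options(candidates)
print(resolve_pick(options, "A", candidates))  # the NED2 pick shown above
print(resolve_pick(options, "F", candidates))  # the NED1 pick shown above
```

With these inputs, the NED2 pick resolves to a candidate while the NED1 pick resolves to no candidate, which matches the two outcomes recorded above.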

Provenance (2 batches)

The batch behind each pipeline step, in order, with the time it ran. Timestamps are batch-level: stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate and elicitation batches can fall in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69d889e0385081908a04b66f4dd4bd0d | completed | April 10, 2026, 5:25 a.m.
NER | Named-entity recognition | batch_69e463ca76848190a7beb6deb4b0f1a4 | completed | April 19, 2026, 5:10 a.m.
Created at: April 10, 2026, 5:50 a.m.
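Because the timestamps are batch-level, comparing them shows how far apart the waves ran. The sketch below parses the timestamp format used on this page and computes the gap between the elicitation and NER batches. The helper `parse_batch_time` is hypothetical, it assumes an English locale for the AM/PM marker, and it does not handle other spellings such as "noon" or times without minutes.

```python
from datetime import datetime


def parse_batch_time(text: str) -> datetime:
    """Parse a timestamp such as 'April 10, 2026, 5:25 a.m.' (naive, no timezone)."""
    # strptime's %p expects AM/PM, so normalize the dotted lowercase form first.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


elicitation = parse_batch_time("April 10, 2026, 5:25 a.m.")
ner = parse_batch_time("April 19, 2026, 5:10 a.m.")
created = parse_batch_time("April 10, 2026, 5:50 a.m.")

# Gap between the two batches listed in the Provenance table.
print(ner - elicitation)
# The triple's creation time falls after the elicitation batch.
print(created > elicitation)
```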