Triple

T17103171
Position Surface form Disambiguated ID Type / Status
Subject Optus E415028 entity
Predicate subjectOf P38 FINISHED
Object Optus data breach 2022
The Optus data breach 2022 was a major cybersecurity incident in Australia in which millions of customers’ personal details were exposed after a hack on the telecommunications company Optus.
E1250436 NE FINISHED

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Optus data breach 2022 | Statement: [Optus, subjectOf, Optus data breach 2022]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Optus data breach 2022
Context triple: [Optus, subjectOf, Optus data breach 2022]
  • A. SolarWinds
    SolarWinds is an American software company best known for its IT infrastructure management tools and for being at the center of a major 2020 supply-chain cyberattack.
  • B. Maysa Leak
    Maysa Leak is an American jazz and soul singer known for her smooth, expressive vocals and work both as a solo artist and as a member of the band Incognito.
  • C. Cambridge Analytica
    Cambridge Analytica was a now-defunct political consulting and data analytics firm notorious for its role in harvesting Facebook user data to influence elections, including the 2016 U.S. presidential campaign and the Brexit referendum.
  • D. Data61
    Data61 is an Australian national data science and digital innovation research organization within CSIRO, focused on advanced analytics, cybersecurity, and emerging technologies.
  • E. Shibboleth incident
    The Shibboleth incident is a biblical episode in which the pronunciation of the word “shibboleth” was used as a linguistic test to distinguish and execute enemy Ephraimites.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Optus data breach 2022
Triple: [Optus, subjectOf, Optus data breach 2022]
Generated description
The Optus data breach 2022 was a major cybersecurity incident in Australia in which millions of customers’ personal details were exposed after a hack on the telecommunications company Optus.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Optus data breach 2022
Target entity description: The Optus data breach 2022 was a major cybersecurity incident in Australia in which millions of customers’ personal details were exposed after a hack on the telecommunications company Optus.
  • A. SolarWinds
    SolarWinds is an American software company best known for its IT infrastructure management tools and for being at the center of a major 2020 supply-chain cyberattack.
  • B. Maysa Leak
    Maysa Leak is an American jazz and soul singer known for her smooth, expressive vocals and work both as a solo artist and as a member of the band Incognito.
  • C. Cambridge Analytica
    Cambridge Analytica was a now-defunct political consulting and data analytics firm notorious for its role in harvesting Facebook user data to influence elections, including the 2016 U.S. presidential campaign and the Brexit referendum.
  • D. Data61
    Data61 is an Australian national data science and digital innovation research organization within CSIRO, focused on advanced analytics, cybersecurity, and emerging technologies.
  • E. Shibboleth incident
    The Shibboleth incident is a biblical episode in which the pronunciation of the word “shibboleth” was used as a linguistic test to distinguish and execute enemy Ephraimites.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d886cfc8e88190b05ba466edd35591 completed April 10, 2026, 5:12 a.m.
NER Named-entity recognition batch_69e3dc2495c88190b5b16a006a994faf completed April 18, 2026, 7:31 p.m.
NED1 Entity disambiguation (via context triple) batch_6a0139ffbe808190a24e827331ee4a6c completed May 11, 2026, 2:07 a.m.
NEDg Description generation batch_6a013ae388548190b09d2c81e1ab0d02 completed May 11, 2026, 2:11 a.m.
NED2 Entity disambiguation (via description) batch_6a013b4df74c81908b3b99e276531e13 completed May 11, 2026, 2:13 a.m.
Created at: April 10, 2026, 5:35 a.m.