Bloom
E435874
Bloom is a large open-access multilingual language model developed by the BigScience research workshop for text generation and understanding tasks.
All labels observed (1)
| Label | Occurrences |
|---|---|
| Bloom canonical | 4 |
How this entity was disambiguated
This entity first appeared as the object of triple T4389201 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.
Target entity: Bloom Context triple: [Hugging Face Transformers, supportsModelType, Bloom]
-
A.
Bloom
Bloom is a common English and Jewish surname borne by numerous notable figures in literature, academia, and the arts.
-
B.
In Bloom
"In Bloom" is a popular grunge song by Nirvana, known for its heavy guitar riffs and critique of mainstream misinterpretation of the band's music.
-
C.
Bloomy
Bloomy is an informal nickname commonly used to refer to the city of Bloomington, Indiana.
-
D.
The Flower
The Flower is the nickname of Guy Lafleur, the legendary Montreal Canadiens right winger renowned for his speed, scoring prowess, and flowing blond hair.
-
E.
Flourish
Flourish is a positive psychology book by Martin Seligman that outlines his theory of well-being and practical strategies for enhancing happiness and life satisfaction.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Target entity: Bloom Target entity description: Bloom is a large open-access multilingual language model developed by the BigScience research workshop for text generation and understanding tasks.
-
A.
Bloom
Bloom is a common English and Jewish surname borne by numerous notable figures in literature, academia, and the arts.
-
B.
In Bloom
"In Bloom" is a popular grunge song by Nirvana, known for its heavy guitar riffs and critique of mainstream misinterpretation of the band's music.
-
C.
Bloomy
Bloomy is an informal nickname commonly used to refer to the city of Bloomington, Indiana.
-
D.
The Flower
The Flower is the nickname of Guy Lafleur, the legendary Montreal Canadiens right winger renowned for his speed, scoring prowess, and flowing blond hair.
-
E.
Flourish
Flourish is a positive psychology book by Martin Seligman that outlines his theory of well-being and practical strategies for enhancing happiness and life satisfaction.
- F. None of above. chosen
Statements (48)
| Predicate | Object |
|---|---|
| instanceOf |
autoregressive transformer model
ⓘ
large language model ⓘ multilingual language model ⓘ |
| accessModel | open-access ⓘ |
| architecture | decoder-only transformer ⓘ |
| contextWindowSize | 2048 tokens ⓘ |
| developer |
BigScience Research Workshop
NERFINISHED
ⓘ
BigScience community ⓘ BigScience workshop NERFINISHED ⓘ Hugging Face NERFINISHED ⓘ |
| hostingPlatform | Hugging Face Hub NERFINISHED ⓘ |
| intendedUse |
downstream NLP applications
ⓘ
experimentation ⓘ research ⓘ |
| languageSupport | multilingual ⓘ |
| license | Responsible AI License (RAIL) variant NERFINISHED ⓘ |
| notableFeature | one of the first open-access LLMs at 100B+ parameters ⓘ |
| parameterCount | 176 billion ⓘ |
| projectType | collaborative international research project ⓘ |
| releaseDate | 2022 ⓘ |
| safetyConsideration | subject to content and usage restrictions via license ⓘ |
| supportsLanguage |
Arabic
ⓘ
Chinese ⓘ English ⓘ French ⓘ German ⓘ Hindi NERFINISHED ⓘ Portuguese ⓘ Russian ⓘ Spanish ⓘ dozens of other languages ⓘ |
| task |
language modeling
ⓘ
text generation ⓘ text understanding ⓘ |
| tokenizerType |
SentencePiece
NERFINISHED
ⓘ
subword tokenizer ⓘ |
| trainingComputeType | GPU cluster ⓘ |
| trainingDataSize | over 300 billion tokens ⓘ |
| trainingDataSource | ROOTS corpus NERFINISHED ⓘ |
| trainingDataType |
academic publications
ⓘ
books ⓘ code ⓘ web text ⓘ |
| trainingDuration | approximately 3.5 months ⓘ |
| trainingHardware | Jean Zay supercomputer NERFINISHED ⓘ |
| trainingHardwareProvider |
GENCI
NERFINISHED
ⓘ
IDRIS NERFINISHED ⓘ |
| trainingObjective | causal language modeling ⓘ |
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10. # Requirements - If you don't know the subject at all, return an empty list. - If the subject is not a named entity, return an empty list. - Include at least one triple where predicate is "instanceOf". - Do not get too wordy. - Separate several objects into multiple triples with one object.
Subject: Bloom Description of subject: Bloom is a large open-access multilingual language model developed by the BigScience research workshop for text generation and understanding tasks.
Referenced by (4)
Full triples — surface form annotated when it differs from this entity's canonical label.