bzip2
E299198
bzip2 is a free and open-source data compression program known for its high compression ratios using the Burrows–Wheeler algorithm.
All labels observed (1)
| Label | Occurrences |
|---|---|
| bzip2 canonical | 2 |
How this entity was disambiguated
This entity first appeared as the object of triple T2792608 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: bzip2 Context triple: [GNU Tar, supportsCompression, bzip2]
-
A.
Zip2
Zip2 was an early online city guide and business directory software company from the late 1990s that provided web-based publishing tools for newspapers.
-
B.
UPX
UPX is an executable packer and compressor commonly used to reduce the size of binary programs.
-
C.
BZ
BZ is the two-letter ISO 3166-1 alpha-2 country code assigned to Belize.
-
D.
BZ
BZ is the commonly used abbreviation for the Ministry of Foreign Affairs of the Netherlands, which is responsible for the country’s foreign policy and international relations.
-
E.
LZA
LZA is the regional vehicle registration code assigned to motor vehicles registered in the city of Zamość in Poland.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: bzip2 Target entity description: bzip2 is a free and open-source data compression program known for its high compression ratios using the Burrows–Wheeler algorithm.
-
A.
Zip2
Zip2 was an early online city guide and business directory software company from the late 1990s that provided web-based publishing tools for newspapers.
-
B.
UPX
UPX is an executable packer and compressor commonly used to reduce the size of binary programs.
-
C.
BZ
BZ is the two-letter ISO 3166-1 alpha-2 country code assigned to Belize.
-
D.
BZ
BZ is the commonly used abbreviation for the Ministry of Foreign Affairs of the Netherlands, which is responsible for the country’s foreign policy and international relations.
-
E.
LZA
LZA is the regional vehicle registration code assigned to motor vehicles registered in the city of Zamość in Poland.
- F. None of above. chosen
Statements (46)
| Predicate | Object |
|---|---|
| instanceOf |
data compression program
ⓘ
file format ⓘ free software ⓘ open-source software ⓘ |
| blockBased | true ⓘ |
| category |
cross-platform software
ⓘ
data compression software ⓘ free data compression software ⓘ |
| compressionType | lossless data compression ⓘ |
| developer | Julian Seward ⓘ |
| distributionModel |
precompiled binaries
ⓘ
source code ⓘ |
| fileExtension |
.bz
ⓘ
.bz2 ⓘ |
| firstReleaseYear | 1996 ⓘ |
| hasMagicNumber | BZh ⓘ |
| hasParallelImplementation |
lbzip2
ⓘ
pbzip2 ⓘ |
| homepage | https://sourceware.org/bzip2/ ⓘ |
| influenced |
lbzip2
ⓘ
pbzip2 ⓘ xz ⓘ |
| isFreeSoftware | true ⓘ |
| latestStableReleaseVersion | 1.0.8 ⓘ |
| latestStableReleaseYear | 2019 ⓘ |
| license | bzip2 license ⓘ |
| maximumBlockSize | 900 kB ⓘ |
| notableFeature |
high compression ratio compared to gzip
ⓘ
uses Burrows–Wheeler transform for block sorting ⓘ widely available on Unix-like systems ⓘ |
| operatingSystem |
BSD
ⓘ
Linux ⓘ Unix-like systems ⓘ Windows ⓘ macOS ⓘ |
| programmingLanguage | C ⓘ |
| replacedBy | bzip3 ⓘ |
| replaces | bzip ⓘ |
| standardToolOn |
many BSD systems
ⓘ
many Linux distributions ⓘ |
| supportsStreaming | false ⓘ |
| typicalUse |
compressing tar archives
ⓘ
general-purpose file compression ⓘ |
| usesAlgorithm |
Burrows–Wheeler transform
ⓘ
Huffman coding ⓘ run-length encoding ⓘ |
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
Instruction
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10. # Requirements - If you don't know the subject at all, return an empty list. - If the subject is not a named entity, return an empty list. - Include at least one triple where predicate is "instanceOf". - Do not get too wordy. - Separate several objects into multiple triples with one object.
Input
Subject: bzip2 Description of subject: bzip2 is a free and open-source data compression program known for its high compression ratios using the Burrows–Wheeler algorithm.
Referenced by (2)
Full triples — surface form annotated when it differs from this entity's canonical label.
subject surface form:
Apache Ant