Internet Archive
E55012
The Internet Archive is a nonprofit digital library that preserves and provides free access to vast collections of websites, books, audio, video, and other cultural artifacts online.
All labels observed (8)
| Label | Occurrences |
|---|---|
| Internet Archive (canonical) | 25 |
| Wayback Machine | 5 |
| archive.org | 2 |
| Internet Archive PURL service | 1 |
| Internet Archive audio collections | 1 |
| Internet Archive digital library | 1 |
| Internet Archive headquarters | 1 |
| Internet Archive mirror in Egypt | 1 |
Statements (78)
| Predicate | Object |
|---|---|
| instanceOf | nonprofit digital library; online library; web archiving organization |
| abbreviation | IA |
| accessPolicy | free public access |
| archives | audio recordings; books; cultural artifacts; images; software; television news; videos; websites |
| archivesFrequency | periodic web crawls |
| businessModel | donation-funded |
| country | United States of America (surface form: United States) |
| dataAccessMethod | APIs; bulk data access; web interface |
| dataFormat | digital |
| focusesOn | cultural heritage preservation; long-term digital preservation; open access to information |
| foundedBy | Brewster Kahle |
| founder | Brewster Kahle |
| hasBuilding | former Fourth Church of Christ, Scientist, San Francisco |
| hasCollection | academic texts; digitized library books; government documents; historical web pages; live music recordings; open source software; podcasts; public domain books; radio broadcasts; user-uploaded media |
| hasIdentifier | ISNI:0000 0004 1930 1894; VIAF:151902901; Wikidata (surface form: Wikidata:Q461) |
| hasLanguage | English |
| hasLegalIssue | copyright lawsuits related to digital lending and archiving |
| hasLogo | Internet Archive logo |
| hasOffice | Richmond, California; San Francisco, California, United States of America (surface form: San Francisco, California); other scanning centers worldwide |
| hasPartner | cultural institutions; government agencies; libraries; universities |
| hasScanningCenter | library scanning centers |
| hasService | Live Music Archive; Open Library project (surface form: Open Library); TV News Archive; Internet Archive (surface form: Wayback Machine); audio archive; image archive; software archive; text archive; video archive; web archive |
| headquartersLocation | Richmond District, San Francisco; San Francisco, California, United States of America (surface form: San Francisco, California) |
| inception | 1996 |
| industry | digital library; digital preservation; web archiving |
| legalForm | 501(c)(3) nonprofit organization |
| mission | to provide universal access to all knowledge |
| motto | Universal access to all knowledge |
| name | Internet Archive |
| operates | Open Library project (surface form: Open Library); Internet Archive (surface form: Wayback Machine) |
| primaryDomain | Internet Archive (surface form: archive.org) |
| serviceArea | worldwide |
| supportsSearch | URL-based lookup in Wayback Machine; full-text search of many collections |
| taxID | 94-3242767 |
| website | https://archive.org |
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
Instruction
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10.
# Requirements
- If you don't know the subject at all, return an empty list.
- If the subject is not a named entity, return an empty list.
- Include at least one triple where predicate is "instanceOf".
- Do not get too wordy.
- Separate several objects into multiple triples with one object.
Input
Subject: Internet Archive
Description of subject: The Internet Archive is a nonprofit digital library that preserves and provides free access to vast collections of websites, books, audio, video, and other cultural artifacts online.
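The instruction asks for a JSON list of dictionaries with the keys "subject", "predicate" and "object", one object per triple. As a rough illustration of that output shape, the sketch below parses a small response and groups objects by predicate, which is how the statements table is laid out. The sample response is hypothetical (its values are copied from the table above), and the helper names `parse_triples` and `group_by_predicate` are assumptions made for this example; how the pipeline itself parses responses is not stated here.

```python
import json
from collections import defaultdict

# Keys required by the instruction for every triple.
REQUIRED_KEYS = {"subject", "predicate", "object"}

# Hypothetical model response; the values are taken from the statements table.
sample_response = """
[
  {"subject": "Internet Archive", "predicate": "instanceOf", "object": "nonprofit digital library"},
  {"subject": "Internet Archive", "predicate": "instanceOf", "object": "online library"},
  {"subject": "Internet Archive", "predicate": "founder", "object": "Brewster Kahle"},
  {"subject": "Internet Archive", "predicate": "inception", "object": "1996"}
]
"""


def parse_triples(raw):
    """Parse a JSON list of triple dictionaries into (subject, predicate, object) tuples."""
    data = json.loads(raw)
    if not isinstance(data, list):
        raise ValueError("expected a JSON list of triples")
    triples = []
    for item in data:
        # Reject anything that is not a dict with exactly the three required keys.
        if not isinstance(item, dict) or set(item) != REQUIRED_KEYS:
            raise ValueError(f"malformed triple: {item!r}")
        triples.append((item["subject"], item["predicate"], item["object"]))
    return triples


def group_by_predicate(triples):
    """Collect the objects of each predicate, preserving input order."""
    grouped = defaultdict(list)
    for _subject, predicate, obj in triples:
        grouped[predicate].append(obj)
    return dict(grouped)


triples = parse_triples(sample_response)
grouped = group_by_predicate(triples)
for predicate, objects in grouped.items():
    print(f"{predicate}: {'; '.join(objects)}")
```

An empty list, which the instruction requires for unknown subjects or non-named entities, parses to no triples and yields an empty grouping.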
Referenced by (37)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form: Wayback Machine
this entity surface form: Internet Archive mirror in Egypt
this entity surface form: archive.org
this entity surface form: Wayback Machine
this entity surface form: Wayback Machine
subject surface form: Brewster Kahle
this entity surface form: Wayback Machine
subject surface form: Brewster Kahle
this entity surface form: Internet Archive digital library
subject surface form: Open Library
subject surface form: Open Library
this entity surface form: Wayback Machine
this entity surface form: archive.org
this entity surface form: Internet Archive audio collections
this entity surface form: Internet Archive headquarters
this entity surface form: Internet Archive PURL service