PyTables
E459726
PyTables is a Python library that provides efficient management, querying, and storage of large amounts of data using the HDF5 format.
All labels observed (1)
| Label | Occurrences |
|---|---|
| PyTables canonical | 1 |
How this entity was disambiguated
This entity first appeared as the object of triple T4599959 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: PyTables Context triple: [Python scientific stack, hasComponent, PyTables]
-
A.
Tabularium
The Tabularium was the official records office of ancient Rome, a monumental state archive building overlooking the Roman Forum.
-
B.
HDF
HDF is the acronym for the Hungarian Defence Forces, the unified military organization responsible for Hungary’s national defense and participation in international security operations.
-
C.
Dask
Dask is an open-source parallel computing library for Python that enables scalable, distributed data processing and analytics using familiar interfaces like NumPy, pandas, and scikit-learn.
-
D.
pandas
pandas is a popular open-source Python library that provides powerful, easy-to-use data structures and tools for data analysis and manipulation.
-
E.
NumPy
NumPy is a fundamental Python library that provides efficient multi-dimensional arrays and numerical computing tools widely used in scientific computing and data analysis.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: PyTables Target entity description: PyTables is a Python library that provides efficient management, querying, and storage of large amounts of data using the HDF5 format.
-
A.
Tabularium
The Tabularium was the official records office of ancient Rome, a monumental state archive building overlooking the Roman Forum.
-
B.
HDF
HDF is the acronym for the Hungarian Defence Forces, the unified military organization responsible for Hungary’s national defense and participation in international security operations.
-
C.
Dask
Dask is an open-source parallel computing library for Python that enables scalable, distributed data processing and analytics using familiar interfaces like NumPy, pandas, and scikit-learn.
-
D.
pandas
pandas is a popular open-source Python library that provides powerful, easy-to-use data structures and tools for data analysis and manipulation.
-
E.
NumPy
NumPy is a fundamental Python library that provides efficient multi-dimensional arrays and numerical computing tools widely used in scientific computing and data analysis.
- F. None of above. chosen
Statements (56)
| Predicate | Object |
|---|---|
| instanceOf |
Python library
ⓘ
open-source software ⓘ |
| basedOn | HDF5 library NERFINISHED ⓘ |
| compatibleWith | CPython NERFINISHED ⓘ |
| designedFor |
large numerical datasets
ⓘ
out-of-core computation ⓘ |
| hasComponent |
array objects
ⓘ
file handler objects ⓘ group objects ⓘ table objects ⓘ |
| license | open-source license ⓘ |
| primaryUse |
efficient data querying
ⓘ
efficient data storage ⓘ hierarchical data management ⓘ management of large datasets ⓘ |
| programmingLanguage | Python ⓘ |
| supportsFeature |
I/O optimization
ⓘ
array-like data structures ⓘ chunked storage ⓘ columnar access ⓘ compressed storage ⓘ compression filters ⓘ data type conversion ⓘ extendable arrays ⓘ hierarchical groups ⓘ hierarchical organization of data ⓘ indexing of table columns ⓘ integration with NumPy ⓘ integration with SciPy ⓘ integration with pandas ⓘ iterative access to rows ⓘ memory-mapped access patterns ⓘ metadata storage ⓘ nested data structures ⓘ on-disk storage ⓘ partial I/O ⓘ querying with conditions ⓘ row-wise access ⓘ support for complex data types ⓘ support for compression libraries ⓘ support for filters like Blosc ⓘ support for filters like LZO ⓘ support for filters like Zlib ⓘ support for nested records ⓘ support for variable-length arrays ⓘ table-like data structures ⓘ time series storage ⓘ |
| supportsFormat | HDF5 NERFINISHED ⓘ |
| typicalDomain |
data science
ⓘ
high-performance computing ⓘ numerical analysis ⓘ scientific computing ⓘ |
| useCase |
large-scale data analytics
ⓘ
storing experimental data ⓘ storing simulation results ⓘ |
| writtenIn | Python NERFINISHED ⓘ |
How these facts were elicited
The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.
Instruction
You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10. # Requirements - If you don't know the subject at all, return an empty list. - If the subject is not a named entity, return an empty list. - Include at least one triple where predicate is "instanceOf". - Do not get too wordy. - Separate several objects into multiple triples with one object.
Input
Subject: PyTables Description of subject: PyTables is a Python library that provides efficient management, querying, and storage of large amounts of data using the HDF5 format.
Referenced by (1)
Full triples — surface form annotated when it differs from this entity's canonical label.