Delta Lake storage layer
E1038749
Delta Lake storage layer is an open-source data storage framework that brings ACID transactions, schema enforcement, and reliability to data lakes, particularly in big data and analytics environments.
Statements (49)
| Predicate | Object |
|---|---|
| instanceOf |
data lake technology
ⓘ
data storage framework ⓘ open-source software ⓘ table storage layer ⓘ |
| basedOn | Apache Parquet NERFINISHED ⓘ |
| category | data management technology ⓘ |
| compatibleWith |
Apache Spark
NERFINISHED
ⓘ
Apache Spark Structured Streaming NERFINISHED ⓘ SQL-based analytics ⓘ cloud object storage ⓘ on-premises storage ⓘ |
| designedFor |
analytics workloads
ⓘ
big data workloads ⓘ data lakes ⓘ |
| ensures | serializable isolation for transactions ⓘ |
| hostedBy | Linux Foundation NERFINISHED ⓘ |
| improves | reliability of data lakes ⓘ |
| openSourceLicense | Apache License 2.0 NERFINISHED ⓘ |
| originatedAt | Databricks NERFINISHED ⓘ |
| partOf | Delta Lake project NERFINISHED ⓘ |
| provides |
atomicity
ⓘ
consistency ⓘ durability ⓘ isolation ⓘ |
| storesMetadataIn | transaction log ⓘ |
| supports |
rollback to previous table versions
ⓘ
schema-on-write ⓘ |
| supportsFeature |
ACID transactions
ⓘ
Z-order clustering ⓘ batch processing ⓘ concurrent reads and writes ⓘ data compaction ⓘ data quality enforcement ⓘ data reliability ⓘ data versioning ⓘ deletes ⓘ file optimization ⓘ merges ⓘ partitioning ⓘ schema enforcement ⓘ schema evolution ⓘ streaming processing ⓘ time travel queries ⓘ upserts ⓘ |
| transactionLogFormat | JSON ⓘ |
| usedFor |
ETL pipelines
ⓘ
building lakehouse architectures ⓘ data warehousing on data lakes ⓘ machine learning data management ⓘ |
Referenced by (1)
Full triples — surface form annotated when it differs from this entity's canonical label.