HDFS
E187921
HDFS (Hadoop Distributed File System) is a fault-tolerant, distributed file system designed to store and manage large volumes of data across clusters of commodity hardware.
All labels observed (4)
| Label | Occurrences |
|---|---|
| HDFS canonical | 9 |
| Hadoop Distributed File System | 6 |
| DataNode | 1 |
| NameNode | 1 |
Statements (49)
| Predicate | Object |
|---|---|
| instanceOf |
Hadoop component
ⓘ
distributed file system ⓘ |
| accessibleVia |
Java Class Library
ⓘ
surface form:
Java API
WebHDFS REST API ⓘ command-line interface ⓘ |
| architecture | master-slave ⓘ |
| compatibleWith | YARN ⓘ |
| designedFor |
batch processing workloads
ⓘ
large-scale data storage ⓘ |
| developedBy | Apache Software Foundation ⓘ |
| ensuresReliabilityBy | replicating data blocks across multiple DataNodes ⓘ |
| fullName |
HDFS
self-linksurface differs
ⓘ
surface form:
Hadoop Distributed File System
|
| hasComponent |
HDFS
self-linksurface differs
ⓘ
surface form:
DataNode
Apache ZooKeeper ⓘ
surface form:
JournalNode
HDFS self-linksurface differs ⓘ
surface form:
NameNode
Secondary NameNode ⓘ ZKFailoverController ⓘ |
| hasDefaultBlockSize | 128 MB (typical modern default) ⓘ |
| implements | write-ahead logging for metadata (edit log) ⓘ |
| introducedIn | early versions of Apache Hadoop (circa mid-2000s) ⓘ |
| license | Apache License 2.0 ⓘ |
| notDesignedFor | low-latency random writes ⓘ |
| optimizedFor | large files ⓘ |
| partOf | Apache Hadoop ecosystem ⓘ |
| provides | single namespace for files in a cluster ⓘ |
| runsOn | clusters of commodity hardware ⓘ |
| storesBlocksOn | DataNode ⓘ |
| storesMetadataIn | fsimage and edit log files ⓘ |
| storesMetadataOn | NameNode ⓘ |
| supports |
fault tolerance
ⓘ
high throughput data access ⓘ |
| supportsFeature |
append operations
ⓘ
data replication ⓘ erasure coding ⓘ federation ⓘ high availability ⓘ rack awareness ⓘ snapshots ⓘ streaming data access ⓘ transparent data encryption ⓘ write-once-read-many semantics ⓘ |
| supportsOS |
Linux
ⓘ
Unix-like systems ⓘ |
| typicalReplicationFactor | 3 ⓘ |
| usedBy |
Apache HBase
ⓘ
Apache Hive ⓘ Apache Spark ⓘ MapReduce ⓘ |
| uses | block-based storage ⓘ |
Referenced by (17)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form:
Hadoop Distributed File System
this entity surface form:
Hadoop Distributed File System
this entity surface form:
Hadoop Distributed File System
this entity surface form:
Hadoop Distributed File System
this entity surface form:
Hadoop Distributed File System
this entity surface form:
Hadoop Distributed File System
this entity surface form:
NameNode
this entity surface form:
DataNode