HDFS

E187921

HDFS (Hadoop Distributed File System) is a fault-tolerant, distributed file system designed to store and manage large volumes of data across clusters of commodity hardware.

Try in SPARQL Jump to: Surface forms Statements Referenced by

All labels observed (4)

Label Occurrences
HDFS canonical 9
Hadoop Distributed File System 6
DataNode 1

Statements (49)

Predicate Object
instanceOf Hadoop component
distributed file system
accessibleVia Java Class Library
surface form: Java API

WebHDFS REST API
command-line interface
architecture master-slave
compatibleWith YARN
designedFor batch processing workloads
large-scale data storage
developedBy Apache Software Foundation
ensuresReliabilityBy replicating data blocks across multiple DataNodes
fullName HDFS self-linksurface differs
surface form: Hadoop Distributed File System
hasComponent HDFS self-linksurface differs
surface form: DataNode

Apache ZooKeeper
surface form: JournalNode

HDFS self-linksurface differs
surface form: NameNode

Secondary NameNode
ZKFailoverController
hasDefaultBlockSize 128 MB (typical modern default)
implements write-ahead logging for metadata (edit log)
introducedIn early versions of Apache Hadoop (circa mid-2000s)
license Apache License 2.0
notDesignedFor low-latency random writes
optimizedFor large files
partOf Apache Hadoop ecosystem
provides single namespace for files in a cluster
runsOn clusters of commodity hardware
storesBlocksOn DataNode
storesMetadataIn fsimage and edit log files
storesMetadataOn NameNode
supports fault tolerance
high throughput data access
supportsFeature append operations
data replication
erasure coding
federation
high availability
rack awareness
snapshots
streaming data access
transparent data encryption
write-once-read-many semantics
supportsOS Linux
Unix-like systems
typicalReplicationFactor 3
usedBy Apache HBase
Apache Hive
Apache Spark
MapReduce
uses block-based storage

Referenced by (17)

Full triples — surface form annotated when it differs from this entity's canonical label.

Hadoop hasComponent HDFS
Hadoop storageLayer HDFS
ORC compatibleWith HDFS
Apache Spark canUseStorage HDFS
this entity surface form: Hadoop Distributed File System
MapReduce commonlyUsedWith HDFS
this entity surface form: Hadoop Distributed File System
Apache Hive supportsPlatform HDFS
this entity surface form: Hadoop Distributed File System
Apache HBase integratesWith HDFS
this entity surface form: Hadoop Distributed File System
Apache Flume supports HDFS
Google File System inspired HDFS
this entity surface form: Hadoop Distributed File System
HDFS fullName HDFS self-linksurface differs
this entity surface form: Hadoop Distributed File System
HDFS hasComponent HDFS self-linksurface differs
this entity surface form: NameNode
HDFS hasComponent HDFS self-linksurface differs
this entity surface form: DataNode