big data framework
C11253
concept
A big data framework is a software platform that enables the distributed storage, processing, and analysis of large-scale, complex datasets across clusters of machines.
All labels observed (21)
| Label | Occurrences |
|---|---|
| data processing framework | 5 |
| Hadoop component | 3 |
| big data framework canonical | 3 |
| Apache Spark component | 2 |
| Spark | 2 |
| cluster computing framework | 2 |
| distributed data processing framework | 2 |
| managed big data platform | 2 |
| stream processing framework | 2 |
| Apache HBase component | 1 |
| Apache Storm component | 1 |
| MapReduce master service | 1 |
| big data framework component | 1 |
| big data processing engine | 1 |
| big data processing service | 1 |
| big data technology | 1 |
| big data tool | 1 |
| component of Apache Hadoop | 1 |
| data-parallel programming framework | 1 |
| distributed data processing engine | 1 |
| graph processing framework | 1 |
Instances (27)
| Instance | Via concept surface |
|---|---|
| Google Cloud Dataproc | big data processing service |
| Apache Spark | distributed data processing engine |
| Yet Another Resource Negotiator | component of Apache Hadoop |
| YARN | Hadoop component |
| Apache Storm | stream processing framework |
| Google MapReduce | data processing framework |
| HDFS | Hadoop component |
| Apache Pig | big data tool |
| Apache Flink | stream processing framework |
| Hadoop | — |
| Dask | data processing framework |
| Apache Beam | data processing framework |
| Hugging Face Datasets | data processing framework |
| Ray | cluster computing framework |
| Amazon EMR | managed big data platform |
| Oracle Big Data Service | managed big data platform |
| PySpark | big data framework component |
| Apache Tez | distributed data processing framework |
| FlumeJava | data-parallel programming framework |
| Storm UI | Apache Storm component |
| Structured Streaming | Apache Spark component |
| GraphX | graph processing framework |
| HMaster | Apache HBase component |
| Hadoop MapReduce v1 JobTracker | MapReduce master service |
|
Tez
surface form:
Apache Tez
|
data processing framework |
| Gilgamesh Wulfenbach | Spark |
| Baron Klaus Wulfenbach | Spark |