DataSet API
E711832
DataSet API is Apache Flink’s now-legacy batch processing API for defining and executing scalable, distributed data transformations.
Statements (44)
| Predicate | Object |
|---|---|
| instanceOf |
Apache Flink API
ⓘ
batch processing API ⓘ |
| category |
big data framework API
ⓘ
data-parallel programming API ⓘ |
| designedFor |
fault-tolerant processing
ⓘ
scalable processing ⓘ |
| developer | Apache Flink community ⓘ |
| documentationURL | https://nightlies.apache.org/flink/flink-docs-stable/dev/batch ⓘ |
| ecosystem | Apache Flink stack NERFINISHED ⓘ |
| executionEngine | Flink batch runtime ⓘ |
| executionModel | distributed ⓘ |
| feature |
custom partitioning
ⓘ
grouping and aggregation ⓘ iterations ⓘ joins ⓘ operators like map, flatMap, filter, reduce ⓘ support for user-defined functions ⓘ type-safe transformations ⓘ |
| inputFormat |
HDFS
NERFINISHED
ⓘ
collections ⓘ files ⓘ |
| integratesWith | Flink runtime ⓘ |
| license | Apache License 2.0 ⓘ |
| notDesignedFor | unbounded streaming data ⓘ |
| outputFormat |
HDFS
NERFINISHED
ⓘ
files ⓘ |
| partOf | Apache Flink NERFINISHED ⓘ |
| programmingLanguage |
Java
ⓘ
Scala NERFINISHED ⓘ |
| relation | predecessor of unified Flink APIs for batch and streaming ⓘ |
| replacedBy |
Flink DataStream API
NERFINISHED
ⓘ
Flink Table API NERFINISHED ⓘ |
| scope | bounded data ⓘ |
| status | legacy ⓘ |
| supports |
batch processing
ⓘ
data transformations ⓘ distributed data processing ⓘ |
| supportsOptimization |
automatic execution plan optimization
ⓘ
data pipelining ⓘ operator chaining ⓘ |
| targetUser |
Java developers
ⓘ
Scala developers ⓘ big data engineers ⓘ data engineers ⓘ |
Referenced by (1)
Full triples — surface form annotated when it differs from this entity's canonical label.