Apache Oozie
E185677
Apache Software Foundation project
Hadoop ecosystem component
open-source software
workflow scheduler system
Apache Oozie is a workflow scheduler system designed to manage and coordinate Hadoop jobs such as MapReduce, Pig, and Hive in complex data processing pipelines.
All labels observed (1)
| Label | Occurrences |
|---|---|
| Apache Oozie canonical | 1 |
Statements (49)
| Predicate | Object |
|---|---|
| instanceOf |
Apache Software Foundation project
ⓘ
Hadoop ecosystem component ⓘ open-source software ⓘ workflow scheduler system ⓘ |
| configurationFormat | XML ⓘ |
| deploymentModel | server-side web application ⓘ |
| designedFor |
coordinating complex data processing pipelines
ⓘ
managing Hadoop workflows ⓘ |
| developer | Apache Software Foundation ⓘ |
| integratesWith |
Hadoop
ⓘ
surface form:
Apache Hadoop
Apache Hive ⓘ Apache Pig ⓘ Apache Sqoop ⓘ HDFS ⓘ YARN ⓘ |
| license | Apache License 2.0 ⓘ |
| partOf |
Apache ecosystem
ⓘ
surface form:
Apache Hadoop ecosystem
|
| programmingLanguage | Java ⓘ |
| provides |
REST API
ⓘ
command-line interface ⓘ web console ⓘ |
| requires |
Hadoop cluster
ⓘ
relational database for state storage ⓘ |
| supports |
Apache Hive jobs
ⓘ
Apache Pig jobs ⓘ Apache Sqoop jobs ⓘ HDFS operations ⓘ Hadoop MapReduce jobs ⓘ Java programs ⓘ SLA monitoring ⓘ bundle jobs ⓘ coordinator jobs ⓘ data-availability-based scheduling ⓘ decision control nodes ⓘ email notifications ⓘ error handling and retries ⓘ fork and join control nodes ⓘ shell scripts ⓘ sub-workflows ⓘ time-based scheduling ⓘ workflow dependency management ⓘ |
| supportsVersion |
Hadoop
ⓘ
surface form:
Hadoop 1.x
Hadoop ⓘ
surface form:
Hadoop 2.x
|
| useCase |
ETL pipelines
ⓘ
batch data processing workflows ⓘ coordinated execution of multiple Hadoop jobs ⓘ periodic data ingestion ⓘ |
| website | https://oozie.apache.org/ ⓘ |
| workflowDefinitionLanguage | XML ⓘ |
Referenced by (1)
Full triples — surface form annotated when it differs from this entity's canonical label.