Apache Oozie

E185677

Apache Oozie is a workflow scheduler system designed to manage and coordinate Hadoop jobs such as MapReduce, Pig, and Hive in complex data processing pipelines.

Try in SPARQL Jump to: Surface forms Statements Referenced by

All labels observed (1)

Label Occurrences
Apache Oozie canonical 1

Statements (49)

Predicate Object
instanceOf Apache Software Foundation project
Hadoop ecosystem component
open-source software
workflow scheduler system
configurationFormat XML
deploymentModel server-side web application
designedFor coordinating complex data processing pipelines
managing Hadoop workflows
developer Apache Software Foundation
integratesWith Hadoop
surface form: Apache Hadoop

Apache Hive
Apache Pig
Apache Sqoop
HDFS
YARN
license Apache License 2.0
partOf Apache ecosystem
surface form: Apache Hadoop ecosystem
programmingLanguage Java
provides REST API
command-line interface
web console
requires Hadoop cluster
relational database for state storage
supports Apache Hive jobs
Apache Pig jobs
Apache Sqoop jobs
HDFS operations
Hadoop MapReduce jobs
Java programs
SLA monitoring
bundle jobs
coordinator jobs
data-availability-based scheduling
decision control nodes
email notifications
error handling and retries
fork and join control nodes
shell scripts
sub-workflows
time-based scheduling
workflow dependency management
supportsVersion Hadoop
surface form: Hadoop 1.x

Hadoop
surface form: Hadoop 2.x
useCase ETL pipelines
batch data processing workflows
coordinated execution of multiple Hadoop jobs
periodic data ingestion
website https://oozie.apache.org/
workflowDefinitionLanguage XML

Referenced by (1)

Full triples — surface form annotated when it differs from this entity's canonical label.

Hadoop ecosystemIncludes Apache Oozie