Apache Gobblin

E705297

Apache Gobblin is an open-source distributed data integration framework designed for large-scale data ingestion, replication, and lifecycle management across diverse data sources and sinks.

All labels observed (1)

Label Occurrences
Apache Gobblin canonical 1

Statements (53)

Predicate Object
instanceOf Apache Software Foundation project
distributed data ingestion framework
open-source data integration framework
developer Apache Software Foundation
donatedTo Apache Software Foundation
feature checkpointing
config-driven job specification
fault tolerance
job scheduling
metrics collection
monitoring and alerting
pluggable source and sink architecture
schema management
task parallelism
watermarking
license Apache License 2.0
originatedAt LinkedIn
programmingLanguage Java
repository https://github.com/apache/gobblin
supportsDataSourceType NoSQL databases
RDBMS
REST APIs
file systems
message queues
supportsDeploymentModel MapReduce-based deployment
YARN-based deployment
cluster mode
containerized deployment
service mode
standalone mode
supportsEnvironment cloud deployments
hybrid deployments
on-premises deployments
supportsPlatform Apache Helix
Apache Kafka
Apache YARN
Hadoop
Kubernetes
supportsSinkType HDFS
Kafka
RDBMS
data warehouses
object stores
supportsUseCase ETL
data compaction
data integration
data lifecycle management
data migration
data quality management
data replication
large-scale data ingestion
metadata management
website https://gobblin.apache.org/

Referenced by (1)

Full triples — surface form annotated when it differs from this entity's canonical label.

Apache Sqoop supersededBy Apache Gobblin