Apache Samza

E710969

Apache Samza is a distributed stream processing framework designed for scalable, fault-tolerant processing of real-time data streams, often used with Apache Kafka and YARN.

Statements (52)

Predicate Object
instanceOf Apache Software Foundation project
distributed stream processing framework
open-source software
deploymentModel YARN-based deployment
container-based deployment
standalone deployment
designedFor fault-tolerant processing of data streams
real-time data streams
scalable processing of data streams
stateful stream processing
developer Apache Software Foundation
feature checkpointing
durable state storage
fault tolerance
high-level API for stream processing
horizontal scalability
low-level API for fine-grained control
message reprocessing
metrics and monitoring support
partitioned streams
pluggable state stores
task-based execution model
integratesWith Apache Beam (via runners or adapters)
Apache Hadoop
Apache Hadoop HDFS
Apache Hadoop YARN
Apache Kafka
Apache Kafka Streams ecosystem
Apache Zookeeper
NoSQL stores via connectors
RDBMS systems via connectors
license Apache License 2.0
partOf Apache Big Data ecosystem
processingModel near-real-time processing
stream processing
programmingLanguage Java
supports at-least-once processing semantics
batch processing via integration
event-time processing
exactly-once processing semantics
local state storage
state management
windowed computations
supportsProgrammingLanguage Java
Scala
useCase ETL on streaming data
event-driven applications
fraud detection
log processing
monitoring and alerting
real-time analytics
website https://samza.apache.org/
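
The statements above list checkpointing, message reprocessing, and at-least-once processing semantics together. A minimal sketch of how these three relate is given below in plain Java. It does not use the Samza API: the class name `CheckpointReplayDemo`, the method `run`, and its parameters are hypothetical and exist only for illustration. The sketch assumes a single in-memory partition, a checkpoint committed after every `checkpointEvery` messages, and exactly one simulated crash.

```java
import java.util.ArrayList;
import java.util.List;

// Toy simulation only; not Samza code and not Samza's actual checkpoint logic.
public class CheckpointReplayDemo {

    /**
     * Processes messages in offset order and commits a checkpoint after every
     * checkpointEvery messages (must be greater than 0). One crash is simulated
     * right after the message at index crashAfterIndex has been processed; the
     * task then restarts from the last committed checkpoint.
     * Returns every message handed to the processing step, duplicates included.
     */
    public static List<String> run(List<String> messages, int checkpointEvery, int crashAfterIndex) {
        List<String> processed = new ArrayList<>();
        int checkpoint = 0;       // next offset to read after a restart
        boolean crashed = false;  // the crash is simulated only once
        int offset = 0;
        while (offset < messages.size()) {
            processed.add(messages.get(offset)); // "process" the message
            offset++;
            if (!crashed && offset - 1 == crashAfterIndex) {
                crashed = true;
                offset = checkpoint; // uncommitted progress is lost on restart
                continue;
            }
            if (offset % checkpointEvery == 0) {
                checkpoint = offset; // commit progress
            }
        }
        return processed;
    }

    public static void main(String[] args) {
        List<String> out = run(List.of("a", "b", "c", "d", "e"), 2, 2);
        System.out.println(out); // prints [a, b, c, c, d, e]
    }
}
```

In this run, message `c` is processed twice because the crash happens after it was processed but before the next checkpoint was committed. No message is skipped, which is the at-least-once guarantee; avoiding the duplicate would require idempotent processing or some form of transactional output.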

Referenced by (1)

Full triples — surface form annotated when it differs from this entity's canonical label.

Apache Storm competesWith Apache Samza