29 projects
Logstash
Logstash is a server-side data processing pipeline that ingests data from multiple sources simultaneously, transforms it, and then sends it to a destination of choice. It is commonly used to collect logs and other time-series data for search, analysis and visualization in Elasticsearch.
5,700
1,308
$5.5M
Apache Flink
Apache Flink is a distributed stream processing and batch computation framework. It provides high-throughput, low-latency data streaming engines as well as support for event-driven applications and batch processing. The framework enables stateful computations over data streams and features automatic memory management, fault tolerance, and exactly-once processing semantics.
4,683
567
$99M
Apache Kafka
Apache Kafka is a distributed event streaming platform designed for high-throughput, fault-tolerant handling of real-time data feeds. It enables building real-time streaming data pipelines and applications that can process, transform, and react to streams of events.
4,567
750
$45M
Strimzi
Strimzi provides a way to run an Apache Kafka cluster on Kubernetes in various deployment configurations.
3,640
864
$334M
Apache RocketMQ
Apache RocketMQ is a distributed messaging and streaming platform with low latency, high performance and reliability, trillion-level capacity and flexible scalability. It supports pub-sub and push-pull messaging models, providing a reliable message-based asynchronous communication and streaming data processing solution for distributed applications.
3,598
272
$10M
Apache Pulsar
Apache Pulsar is a cloud-native, distributed messaging and streaming platform designed to handle high-performance workloads. It provides pub-sub messaging, queuing, stream processing, and event streaming capabilities with built-in multi-tenancy, geo-replication, and persistent storage features.
3,528
540
$29M
Hazelcast
Hazelcast is an open-source distributed computing platform that provides in-memory data storage and processing capabilities. It offers features like distributed caching, distributed data structures, distributed computing, and clustering for building scalable applications.
2,979
462
$64M
Sarama
Sarama is a Go library for Apache Kafka.
2,157
616
$2M
AutoMQ
AutoMQ is a cloud-native alternative to Kafka by decoupling durability to S3 and EBS. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency.
1,854
206
$44M
Apache Storm
Apache Storm
1,138
201
$14M
Redpanda
Redpanda is a streaming data platform that serves as a Kafka-compatible event streaming engine, offering high performance, low latency, and simplified operations. It is designed to handle real-time data streams and event processing with features like JVM-free architecture, thread-per-core model, and cloud-native capabilities.
1,054
286
$37M
Pravega
Pravega is a storage system that uses Stream as the main building block for storing continuous and limitless data.
769
163
$70M
Apache BookKeeper
Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads
739
110
$13M
TiFlow
TiFlow is a data replication and migration tool that enables real-time data flow between different databases and systems. It includes CDC (Change Data Capture) capabilities for capturing and streaming database changes, and supports integration with TiDB database ecosystem.
512
61
$20M
EventStream
EventStream is a Node.js library that provides a powerful streaming interface for working with event-based data. It enables transforming, filtering, and manipulating streams of events using functional programming patterns, with support for both synchronous and asynchronous operations.
509
250
$38K
Franz-Go
franz-go contains a feature complete, pure Go library for interacting with Kafka from 0.8.0 through 3.8+. Producing, consuming, transacting, administrating, etc.
493
153
$4.1M
Bento Stream Processor
A stream processing engine and framework for building real-time data pipelines and analytics applications
489
104
$12M
Materialize
Materialize is a streaming database that processes real-time data streams and maintains materialized views, enabling fast SQL queries over streaming data. It allows users to build real-time applications and analytics by transforming complex streaming data into queryable views.
463
92
$34M
Conduit data streaming
Conduit is a streaming data library for Haskell that provides composable, memory-efficient processing of data streams. It enables handling large datasets and I/O operations through a unified streaming interface.
435
165
$466K
Apache Celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
417
58
$6.9M
Benthos
Benthos is a high-performance stream processing engine that enables users to build resilient data pipelines. It connects various sources and sinks, performs data transformations, and handles message routing with a declarative configuration approach.
406
83
$5.2M
Nussknacker
Nussknacker is a visual workflow designer and runtime engine that enables users to create and deploy real-time data processing applications. It provides a low-code platform for building streaming processes using Apache Flink, with features for ETL operations, data enrichment, and complex event processing.
241
16
$13M
Substreams
Substreams is a blockchain data processing engine that enables streaming of on-chain data through a series of WASM modules. It provides high-performance parallel processing capabilities for extracting, transforming, and analyzing blockchain data in real-time.
197
42
$5M
Tremor
Tremor is an early stage event processing system for unstructured data with rich support for structural pattern matching, filtering and transformation.
180
93
$13M
Pump
Pump is a streaming data pipeline library that allows piping between Node.js streams with error handling and automatic cleanup. It provides a simpler alternative to manually handling stream events and error propagation.
80
43
$6.5K
Combined Stream
A Node.js module that allows combining multiple streams into a single stream, enabling sequential reading from multiple sources as if they were a single source
63
29
$30K
streamx
A minimal and efficient Node.js streaming library that provides a lightweight alternative to the built-in streams implementation, focusing on performance and simplicity
54
33
$82K
Split2
Split2 is a Node.js module that splits text input into lines using a delimiter. It provides a transform stream interface for breaking up text streams by newlines or custom separators, with support for both string and buffer inputs.
53
32
$15K
Kafka Docker Playground
🐳✨ Fully automated Apache Kafka® and Confluent Docker based examples // 👷♂️ Easily build examples or reproduction models