LFX Insights

OLAP Datastores

Datastores optimized for real-time, distributed OLAP analytics, enabling low-latency complex queries over large-scale data.

13 projects

37,344 contributors

$2.4B software value

ClickHouse

ClickHouse is an open-source column-oriented database management system that enables real-time analytics using SQL queries. It is designed for high performance on large datasets, featuring fast data ingestion, efficient compression, and parallel processing capabilities.

Contributors: 11,652
Organizations: 1,629
Software value: $103M

DuckDB

DuckDB is an embeddable SQL OLAP database management system that runs queries directly on raw data files. It provides fast analytical query processing while maintaining ACID properties, and can be embedded within larger applications or used as a standalone database.

Contributors: 5,429
Organizations: 1,226
Software value: $70M

Apache Doris

Apache Doris is a modern, high-performance MPP (Massively Parallel Processing) analytical database system that provides real-time data warehouse solutions. It offers fast query capabilities, high concurrency, and strong data consistency while supporting both batch and streaming data ingestion.

Contributors: 4,532
Organizations: 244
Software value: $175M

TiDB

TiDB is an open-source, distributed SQL database that supports Hybrid Transactional and Analytical Processing (HTAP) workloads. It features horizontal scalability, strong consistency, and MySQL compatibility.

Contributors: 3,231
Organizations: 553
Software value: $83M

YDB

YDB is an open-source distributed SQL database management system that combines OLTP and OLAP workloads in a single DBMS. It offers ACID transactions, high availability, horizontal scalability, and strong consistency while maintaining high performance for both transactional and analytical queries.

Contributors: 3,018
Organizations: 86
Software value: $1.3B

TimescaleDB

TimescaleDB is an open-source time-series database built as an extension to PostgreSQL. It provides automatic partitioning across time and space (partitioning key), native compression, continuous aggregates, and other features optimized for time-series data while retaining full SQL compatibility.

Contributors: 2,817
Organizations: 626
Software value: $6.7M

Apache Pinot

Apache Pinot is a real-time distributed OLAP datastore designed to deliver scalable real-time analytics with low latency. It can ingest data from batch and streaming sources and provides a SQL interface for querying. The system is built to handle high throughput analytics workloads and supports rich indexing capabilities for optimized query performance.

Contributors: 1,727
Organizations: 232
Software value: $48M

QuestDB

QuestDB is a high-performance, open-source time-series database.

Contributors: 1,634
Organizations: 269
Software value: $50M

OceanBase Database

OceanBase is an enterprise-grade distributed relational database system that provides high availability, scalability, and performance for large-scale online transaction processing (OLTP) workloads. It features multi-tenancy support, strong consistency, and compatibility with MySQL protocols.

Contributors: 1,558
Organizations: 154
Software value: $397M

CrateDB

CrateDB is a distributed SQL database that enables real-time analytics and queries on large-scale machine data and IoT workloads. It features horizontal scalability, automatic data sharding, and support for structured and unstructured data.

Contributors: 819
Organizations: 177
Software value: $23M

TiFlash

TiFlash is a columnar storage engine and analytical processing component of TiDB, designed to handle real-time HTAP (Hybrid Transactional/Analytical Processing) workloads. It provides MPP (Massively Parallel Processing) acceleration for complex analytical queries while maintaining strong consistency with transactional data.

Contributors: 471
Organizations: 66
Software value: $43M

Materialize

Materialize is a streaming database that processes real-time data streams and maintains materialized views, enabling fast SQL queries over streaming data. It allows users to build real-time applications and analytics by transforming complex streaming data into queryable views.

Contributors: 456
Organizations: 91
Software value: $29M

ProtonSQL

ProtonSQL is a high-performance, low-footprint SQL database written in C++. It can process millions of rows per second from Kafka, Pulsar, or ClickHouse and write the results back. It supports features such as JOIN, CDC, UPSERT, and LOOKUP, enabling real-time analytics and ETL at scale.

This project hasn't been onboarded to LFX Insights.