5 projects
Trino
Trino is a distributed SQL query engine designed to query large data sets distributed across multiple heterogeneous data sources. It enables fast, interactive analytics across diverse data sources including Hadoop, object stores, relational databases, and other systems.
5,178
731
$69M
Apache DataFusion
Apache DataFusion is a fast, extensible query execution framework written in Rust that enables efficient processing of large-scale data using SQL. It provides a modular architecture for building high-performance data processing systems and analytics applications, with support for various data sources and formats.
2,500
581
$22M
Velox
Velox is a C++ database acceleration library and execution engine that provides high-performance data processing capabilities. It offers vectorized execution, SIMD optimization, and dynamic code generation to accelerate analytical queries and machine learning workloads.
1,424
155
$52M
Comunica
Comunica is a modular JavaScript framework for querying Linked Data on the Web. It provides a flexible architecture for building SPARQL query engines that can operate over various data sources and interfaces, supporting both local and remote data access.
258
95
$5.1M
Calcite
Apache Calcite