4 projects
Apache Iceberg
Apache Iceberg is an open source table format for huge analytic datasets that provides atomic transactions, concurrent reads/writes, schema evolution, hidden partitioning, time travel queries and efficient data maintenance. It enables reliable, high-performance analytics on massive data lakes.
4,023
521
$36M
Apache ORC
Apache ORC (Optimized Row Columnar) is a high-performance columnar storage file format for Hadoop workloads. It provides efficient data compression and encoding schemes with advanced column-level operations, enabling fast data processing and analytics. ORC files can store collections of structured data and are optimized for large-scale data processing frameworks like Apache Hive and Apache Spark.
448
89
$5.1M
Apache Parquet
Apache Parquet Format
loaders.gl
Loaders for big data visualization. Website: