Big Data Processing Frameworks

Tools for processing and analyzing extremely large and complex datasets.

by The Linux Foundation ・ 28 projects

| Project | Contributors | Organizations | Software value |
| --- | --- | --- | --- |
| ClickHouse | 11,861 | 1,669 | $107M |
| Polars | 5,839 | 1,196 | $23M |
| The Presto Foundation Fund | 5,433 | 749 | $2.1B |
| Trino | 5,240 | 738 | $69M |
| Apache Beam | 4,798 | 639 | $95M |
| Dask | 3,597 | 908 | $6.9M |
| Apache Hudi | 3,073 | 280 | $24M |
| Hazelcast | 2,984 | 466 | $64M |
| Apache DataFusion | 2,595 | 610 | $23M |
| Apache Hadoop | 2,337 | 273 | $188M |
| Apache Hive | 1,505 | 148 | $96M |
| Apache HBase | 1,481 | 130 | $44M |
| sparklyr | 1,296 | 143 | $2.1M |
| Apache Paimon | 1,048 | 102 | $22M |
| Vespa | 783 | 119 | $72M |
| Scio | 665 | 125 | $3.8M |
| Apache Drill | 636 | 101 | $34M |
| Scalding | 576 | 120 | $2.6M |
| Apache CarbonData | 485 | | $14M |
| Apache Impala | 418 | | $46M |
| ListenBrainz | 396 | | $9.6M |
| HPCC Systems Platform | 291 | | $90M |
| AsterixDB | 175 | | $38M |
| Vineyard | 149 | | $7.6M |
| PartD | | | $84K |
| GenevaERS | | | $57M |
| TonY Project (Archived) | | | |
| Apache Spark Website | | | |
