68 projects
LF AI & Data
LF AI & Data is a foundation under the Linux Foundation dedicated to advancing open-source artificial intelligence (AI), machine learning (ML), and data projects. It fosters collaboration between industry leaders, researchers, and developers to create scalable, trustworthy, and interoperable AI and data solutions.
vLLM
vLLM is a Linux Foundation project focused on optimizing large language model inference, providing high-throughput serving capabilities with efficient memory management for AI applications.
27,499
1,247
ONNX
ONNX is an open ecosystem for interoperable AI models. It's a community project: we welcome your contributions!
9,362
961
Delta Lake Project
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
9,229
491
DeepSpeed
DeepSpeed is a Linux Foundation project that optimizes deep learning training by enhancing speed, scale, and efficiency. It provides a high-performance training library that reduces computing costs while enabling larger AI models on limited hardware resources.
6,349
103
Milvus
The open source vector database designed for AI applications
6,173
401
OPEA
OPEA is an ecosystem orchestration framework to integrate performant GenAI technologies & workflows leading to quicker GenAI adoption and business value.
4,884
53
Kedro Project
Kedro is an open-source Python framework for creating reproducible, maintainable and modular data science code. It is hosted in incubation in LF AI & Data.
3,541
185
IREE
IREE (Intermediate Representation Execution Environment) is a Linux Foundation project providing a compiler toolchain and runtime for executing machine learning models across diverse hardware accelerators and platforms.
3,429
114
KServe
KServe is a highly scalable, standards-based Model Inference Platform on Kubernetes for Trusted AI. It is hosted in Incubation in LF AI & Data Foundation.
3,031
375
DELTA
DELTA is a deep learning based end-to-end natural language and speech processing platform. It's an incubation project in the LF AI & Data Foundation.
2,920
33
OpenFL
Open source framework for federated learning hosted in incubation in LF AI & Data Foundation
2,882
35
DeepRec
DeepRec is an open source high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.
2,755
212
FATE Project
Collaborative Learning and Knowledge Transfer with Data Protection
2,550
113
Horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
2,302
331
Flyte
Organization that hosts the Flyte Project with all of the core components. Flyte is an LF AI & Data Graduated Project
2,100
334
CLAIMED
CLAIMED is a runtime and programming language agnostic Data & AI component framework abstracting away all complexity for advanced MLOps and TrustedAI.
2,068
19
Feast
The Open Source Feature Store for AI/ML
1,807
342
JanusGraph
Open-source, distributed graph database
1,623
228
TonY Project
TonY, an incubation project in LF AI & Data Foundation, is a framework to natively run deep learning frameworks on Apache Hadoop.
1,611
29
Angel
Angel is a high-performance and full-stack distributed ML platform. It is an LF AI Graduated project.
1,496
56
Pyro
Pyro - Deep Universal Probabilistic Programming
1,439
232
sparklyr
R Interface to Apache Spark
1,319
137
Ludwig
Ludwig is a toolbox for training and testing deep learning models without the need to write code. It is an incubation-level project in LF AI Foundation.
1,304
148
Amundsen
Amundsen is a data discovery and metadata engine. It's an incubation project in LF AI & Data Foundation.
1,109
255
Adversarial Robustness Toolbox
This GitHub org hosts LF AI Foundation projects in the category of Trusted and Responsible AI.
945
56
Marquez
Collect, aggregate, and visualize a data ecosystem's metadata
934
58
OpenLineage
OpenLineage, an LF AI & Data hosted project, is an open source collaboration project aiming to standardize lineage and metadata collection.
720
84
Neural Network (NN) Streamer
🔀 Neural Network (NN) Streamer, Stream Processing Paradigm for Neural Network Apps/Devices.
708
43
Egeria
LF AI & Data is an umbrella foundation for artificial intelligence, machine learning, deep learning, and data.
698
58
Elyra
Elyra is an open source set of AI-centric extensions to JupyterLab Notebooks. The project is hosted in incubation in the LF AI & Data Foundation.
626
93
Recommenders
Recommenders offers examples & best practices for building recommendation systems, provided as Jupyter notebooks. An LF AI & Data Sandbox project.
605
116
RWKV
RWKV is an RNN with Transformer-level LLM performance. It is hosted as an incubation project in LF AI & Data Foundation.
568
31
Monocle
The goal of the open source Monocle project is to help GenAI developers trace their applications. Hosted in incubation as a Sandbox project in LF AI & Data.
514
9
AI Fairness 360
AI Fairness 360 is a Linux Foundation project that provides tools and resources to detect, measure, and mitigate bias in artificial intelligence systems, promoting ethical and equitable AI development across industries.
402
45
DocArray
DocArray is an open source project offering the data structure for multimodal data. It is hosted in incubation in the LF AI & Data Foundation.
334
48
FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models. The project is hosted in LF AI & Data.
331
26
LF AI & Data
LF AI & Data is a Linux Foundation initiative fostering innovation and collaboration in artificial intelligence and data technologies through open source projects, promoting standards, best practices, and sustainable ecosystem growth.
219
38
Kompute
Kompute is a Sandbox Project in LF AI & Data Foundation focused on advancing the GPU acceleration ecosystem through cross-vendor graphics card tooling.
196
30
Feathr
Feathr is an open source enterprise-grade, high performance feature store, hosted in incubation in the LF AI & Data Foundation.
190
29
Substra
Substra is the most proven federated learning software designed for healthcare research.
184
20
Xtreme1
The Next GEN Platform for Multisensory Training Data. Xtreme1 is hosted in incubation in LF AI & Data Foundation.
176
24
AI Explainability 360
AI Explainability 360 is a Linux Foundation project that provides tools and resources to help developers and users understand, interpret, and explain AI model decisions, promoting transparency and trust in artificial intelligence systems.
149
20
Datashim
Datashim, an LF AI & Data incubation project, enables and accelerates data access for Kubernetes/OpenShift workloads in a transparent & declarative way.
148
47
Open Voice Network Interoperability Initiative
The goal of the Open Voice Network Interoperability Initiative is to enable voice and conversational AI to work like the web.
135
30
Adlik
Adlik is an end-to-end optimizing framework for Deep Learning models. Adlik accelerates DL inference process both on cloud and embedded environments.
112
7
Bitol
Open Data Contract Standard (ODCS) describes a structure for a data contract. Hosted in LF AI & Data as a Sandbox project.
81
17
Open Model Initiative
A community-driven effort to promote the development & adoption of openly licensed AI models for image, video & audio generation. Hosted in LF AI & Data.
72
6
LakeSoul
LakeSoul provides a data management platform for lakehouse architecture, allowing users to query, explore and visualize data in a unified and scalable way.
71
5
Data Practices
A Linux Foundation initiative focused on establishing best practices, standards, and methodologies for effective data management, governance, and utilization across organizations to enhance data-driven decision making and innovation.
68
7
Elastic Deep Learning (EDL)
EDL, an LF AI incubation project, is an elastic deep learning framework designed to help cloud providers build cluster cloud services using DL frameworks.
64
10
Machine Learning eXchange (MLX)
MLX, a Sandbox Project at the LF AI & Data Foundation, is a Data and AI assets catalog and execution engine.
64
8
SapientML
Generative AutoML for Tabular Data
62
10
OpenDS4All
OpenDS4All is a Linux Foundation initiative providing open-source data science educational materials to democratize data science education, enabling academic institutions to develop comprehensive data science curricula efficiently.
58
14
RosaeNLG Project
RosaeNLG, an LF AI & Data Sandbox project, is open source Natural Language Generation software licensed under Apache 2.0.
40
10
SOAJS
SOAJS, an incubation project under the LF AI Foundation, offers a complete enterprise open source microservice management platform.
35
12
1chipML
test
31
7
OpenDataology
OpenDataology is a project for AI model training with trusted dataset compliance. It is hosted as a Sandbox project in LF AI & Data Foundation.
27
5
OpenBytes
OpenBytes, a Sandbox Project in the LF AI & Data Foundation, aims to bring transformational changes to AI by making open datasets more available & accessible.
22
6
DeepCausality
DeepCausality is a hyper-geometric computational causality library. It is hosted as a Sandbox Project in the LF AI & Data Foundation.
21
7
Artigraph
Artigraph is an open source tool focused on improving the authorship, management, and quality of data. It's hosted in LF AI & Data as a Sandbox Project.
21
10