84 projects
TensorFlow
TensorFlow is an open-source machine learning framework developed by Google that enables numerical computation and large-scale machine learning. It provides a flexible system for defining and executing computations involving tensors, which are multi-dimensional arrays. The framework supports deep learning and neural networks across multiple platforms and devices.
47,080
6,189
$196M
vLLM
The mission of the Project is to develop an open-source library for fast LLM inference and serving.
18,859
2,211
$24M
ONNX
ONNX is an open format built to represent machine learning models. ONNX defines a common set of operators - the building blocks of machine learning and deep learning models - and a common file format to enable AI developers to use models with a variety of frameworks, tools, runtimes, and compilers.
7,962
1,000
$47M
spaCy
spaCy is an industrial-strength natural language processing library for Python, designed for production use. It offers fast and accurate syntactic analysis, named entity recognition, text classification, and more. The library includes pre-trained statistical models and word vectors, and supports deep learning integration.
6,466
1,130
$7.8M
JAX
JAX is a high-performance numerical computing and machine learning library that combines Numpy's familiar API with GPU and TPU hardware acceleration. It features automatic differentiation, just-in-time compilation, and enables writing transformable numerical programs.
6,035
1,109
$20M
XGBoost
XGBoost is a scalable, distributed gradient boosting library that provides parallel tree boosting for machine learning tasks. It implements machine learning algorithms under the gradient boosting framework, offering high performance, flexibility and portability across multiple programming languages and platforms.
5,907
828
$6.3M
Gradio
Gradio is an open-source Python library that enables developers to quickly create customizable web interfaces for machine learning models, data processing pipelines, and other Python functions. It allows for easy demo creation and sharing of ML models with drag-and-drop interfaces, requiring minimal code.
4,414
696
$9.4M
Detectron2
Detectron2 is a computer vision library developed by Facebook AI Research (FAIR) that implements state-of-the-art object detection algorithms. It provides a modular, flexible platform for implementing and training computer vision models, with support for tasks like object detection, instance segmentation, keypoint detection, and panoptic segmentation.
4,292
606
$2.3M
Llama Models
A collection of large language models (LLMs) developed by Meta AI, including the Llama family of models. These models are designed for natural language processing tasks and are made available for research and commercial use under specific licensing terms.
3,952
676
$425K
CatBoost
CatBoost is a high-performance, open-source gradient boosting library developed by Yandex that implements gradient boosting on decision trees. It provides fast, scalable, and accurate machine learning algorithms for classification, regression, and ranking tasks, with built-in support for categorical features.
3,521
348
$236M
LightGBM
LightGBM is a gradient boosting framework that uses tree based learning algorithms. It is designed to be distributed and efficient with faster training speed and higher efficiency, lower memory usage, better accuracy, parallel and GPU learning, and handling large-scale data.
3,139
482
$3.4M
DeepRec
The mission of the Project is to develop a high-performance recommendation deep learning framework.
2,726
250
$149M
Sentence Transformers
Sentence Transformers is a Python framework for state-of-the-art sentence and text embeddings. It provides easy-to-use methods to compute dense vector representations for sentences, paragraphs and images, enabling semantic similarity comparisons and information retrieval tasks.
2,492
458
$2.3M
Whisper
Whisper is an automatic speech recognition (ASR) system developed by OpenAI that can transcribe and translate spoken language from audio into text. It is trained on a large dataset of multilingual speech data and can handle various languages, accents, and acoustic environments.
2,140
294
$606K
FATE Project
FATE is an open-source project initiated by Webank’s AI Department to provide a secure computing framework to support the federated AI ecosystem.
1,642
108
$36M
Stable Baselines3 (SB3)
Stable Baselines3 (SB3) is a reliable implementation of reinforcement learning algorithms in PyTorch. It provides a set of high-quality implementations of state-of-the-art reinforcement learning algorithms, including PPO, A2C, DQN, and SAC. The library focuses on providing clean, documented, and reliable implementations for research and development in reinforcement learning.
1,446
227
$745K
GGML
GGML is a tensor library for machine learning that enables efficient neural network inference on CPU. It provides low-level primitives for implementing deep learning models with a focus on performance and memory efficiency, particularly for running large language models on consumer hardware.
1,421
268
$6.7M
Ludwig
Ludwig is an open-source, declarative machine learning framework that makes it easy to define deep learning pipelines with a simple and flexible data-driven configuration system. Ludwig is a low-code framework for building custom AI models like LLMs and other deep neural networks.
942
155
$8.7M
Adversarial Robustness Toolbox
Adversarial Robustness Toolbox (ART) provides tools that enable developers and researchers to evaluate, defend, certify and verify Machine Learning models and applications against the adversarial threats.
708
65
$5.3M
Recommenders
The mission of the Project is to develop examples and best practices for building recommendation systems, provided as Jupyter notebooks.
525
111
$3M
Natural Language Toolkit (NLTK)
NLTK (Natural Language Toolkit) is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning.
507
60
$4.1M
Elyra
The mission of the Project is to create and maintain an open-source development workspace that simplifies the creation and orchestration of the AI model development lifecycle tasks.
488
93
$23M
Neural Network (NN) Streamer
? Neural Network (NN) Streamer, Stream Processing Paradigm for Neural Network Apps/Devices.
445
48
$30M
OpenFL
The mission of the OpenFL projet is to build a flexible, secure, scalable and easily learnable Federated Learning tool for data scientists and data owners.
311
39
$2.5M
DocArray
The mission of the DocArray project is to develop a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh.
308
47
$1.6M
Faiss
Faiss is a library for efficient similarity search and clustering of dense vectors, developed by Facebook Research. It contains algorithms that search in sets of vectors of any size, up to ones that possibly do not fit in RAM. It also includes support for different similarity metrics and various optimization methods for fast and accurate vector search.
265
34
$4.8M
SapientML
The mission of the Project is to help data scientists rapidly create and amend AI models.
50
9
$1.5M
BeyondML
The mission of the Project is to advance the state of artificial intelligence by designing and implementing an open-source framework for developing sparse, optimized neural networks capable of efficiently performing multiple tasks across multiple data domains.
7
4
$6M
Apache Mahout
Mirror of Apache Mahout
BugBug
Platform for Machine Learning projects on Software Engineering
Caffe
Caffe: a fast open framework for deep learning.
Chainer
A flexible framework of neural networks for deep learning
DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
FEDML
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
Flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
GNINA
A deep learning framework for molecular docking
GPy
Gaussian processes framework in python
GROBID
A machine learning software for extracting information from scholarly documents
Generative AI with Gemini on Vertex AI
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Gensim
Topic Modelling for Humans
Gloo
Collective communications library with various primitives for multi-machine training.
Graph Data Science
Source code for the Neo4j Graph Data Science library of graph algorithms.
Hongbo Miao R&D Lab
A personal research and development (R&D) lab that facilitates the sharing of knowledge.
Hugging Face JS
Utilities to use the Hugging Face Hub API
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Kornia
🐍 Geometric Computer Vision Library for Spatial AI
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Llama Cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
Lux.jl
Elegant and Performant Scientific Machine Learning in Julia
MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/README.md)
NNlib
Neural Network primitives with multiple backends
NetKet
Machine learning algorithms for many-body quantum systems
Nilearn
Machine learning for NeuroImaging in Python
ONNX Runtime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Optax
Optax is a gradient processing and optimization library for JAX.
Optuna
A hyperparameter optimization framework
PennyLane
PennyLane is a cross-platform Python library for quantum computing, quantum machine learning, and quantum chemistry. Train a quantum computer the same way as a neural network.
PySR
High-Performance Symbolic Regression in Python and Julia
PyTorch Geometric
Graph Neural Network Library for PyTorch