61 projects
Apache Spark
Apache Spark - A unified analytics engine for large-scale data processing
9,125
1,282
$82M
ipython
ipython is a sub-project of Project Jupyter.
8,696
1,714
$72M
Dask
Dask is a flexible parallel computing library for analytics that provides dynamic task scheduling optimized for computation and integrates with Python data science libraries like NumPy, Pandas and Scikit-learn. It enables parallel and distributed computing through intuitive APIs and scales Python code from multi-core machines to clusters.
3,572
901
$6.9M
OpenJDK Panama Foreign
https://openjdk.org/projects/panama
2,539
119
$431M
Open MPI
Open MPI is a high-performance message passing library implementation of the Message Passing Interface (MPI) standard, designed for distributed memory computing environments and parallel programming. It provides a portable and efficient framework for developing parallel applications across different computing platforms and architectures.
2,259
411
$21M
CuPy
NumPy & SciPy for GPU
1,993
428
$8.6M
Colossal-AI
Making large AI models cheaper, faster and more accessible
1,460
160
$9.2M
joblib
Joblib is a set of tools to provide lightweight pipelining in Python, designed to optimize disk I/O operations, caching, and parallel processing. It offers transparent disk-caching of functions and lazy re-evaluation, as well as simple parallel computing capabilities.
1,246
401
$694K
Kokkos Project
The mission of the Project is to develop and support a programming system for performance portability across hardware architectures relevant for compute heavy workloads. The Project is composed of subprojects addressing different aspects of a comprehensive performance portability solution for engineering, science and other compute intensive software.
1,223
181
$606M
MLX
MLX: An array framework for Apple silicon
1,028
245
$5.5M
Rayon
Rayon is a data parallelism library for Rust that makes it easy to convert sequential computations into parallel ones. It provides parallel iterators, parallel collections, and other utilities for writing fast, safe parallel code.
828
305
$972K
AMReX
The mission and charitable purposes (collectively, the “Charitable Purposes”) of the Project is to provide a parallel, performance-portable block-structured adaptive mesh refinement framework for science and engineering applications.
792
75
$185M
CUDA Core Compute Libraries (CCCL)
CUDA Core Compute Libraries
778
148
$28M
WarpX
The mission and charitable purposes (collectively, the “Charitable Purposes”) of the Project is to provide a community, performance-portable and modular Particle in-Cell code with advanced algorithms.
601
56
$12M
HYPRE
Parallel solvers for sparse linear systems featuring multigrid methods.
536
66
$24M
Daft
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
535
111
$15M
HOOMD-blue
Molecular dynamics and Monte Carlo soft matter simulation on GPUs.
498
37
$8.6M
RAPIDS Accelerator For Apache Spark
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
402
40
$13M
ASPECT
A parallel, extensible finite element code to simulate convection in both 2D and 3D models.
401
27
$50M
Parsl
Parsl - a Python parallel scripting library
389
84
$1.1M
Highway
Performance-portable, length-agnostic SIMD with runtime dispatch
371
99
$4.6M
NWChem
NWChem: Open Source High-Performance Computational Chemistry
322
43
$218M
Hardware Locality (hwloc)
Hardware locality (hwloc)
319
98
$3.5M
Legion
Legion is a high-performance data-centric parallel programming system that enables efficient distributed execution of applications on heterogeneous architectures. It provides a programming model and runtime system for writing portable high performance programs targeting distributed heterogeneous architectures.
311
51
$23M
QMCPACK
Main repository for QMCPACK, an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids with full performance portable GPU support
292
30
$74M
LinearSolve.jl
LinearSolve.jl: High-Performance Unified Interface for Linear Solvers in Julia. Easily switch between factorization and Krylov methods, add preconditioners, and all in one interface.
286
80
$697K
OpenPMIx
OpenPMIx Project Repository
271
70
$5.6M
PIConGPU
Performance-Portable Particle-in-Cell Simulations for the Exascale Era :sparkles:
261
32
$15M
QUDA
QUDA is a library for performing calculations in lattice QCD on GPUs.
253
28
$5.3M
preCICE
A coupling library for partitioned multi-physics simulations, including, but not restricted to fluid-structure interaction and conjugate heat transfer simulations.
253
29
$4.8M
Feel++
:gem: Feel++: Finite Element Embedded Language and Library in C++
243
18
$30M
Ginkgo
Numerical linear algebra software package
211
43
$9.7M
RAJA
RAJA Performance Portability Layer (C++)
205
19
$4.8M
ITensors.jl
A Julia library for efficient tensor computations and tensor network calculations. ITensor is supported by the Simons Foundation's Flatiron Institute.
201
36
$1.2M
Earth System Modeling Framework (ESMF)
The Earth System Modeling Framework (ESMF) is a suite of software tools for developing high-performance, multi-component Earth science modeling applications.
184
29
$44M
Yambo
This is the official GPL repository of the yambo code
152
18
$4.5M
TornadoVM
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
145
21
$5.7M
SCOREC Core
SCOREC Core is a scientific computing library that provides mesh data structures, parallel mesh adaptation, and geometric modeling capabilities for high-performance computing applications. It focuses on parallel mesh modification, load balancing, and geometric operations for simulation workflows.
109
4
$3.8M
Goma
A Full-Newton Finite Element Program for Free and Moving Boundary Problems with Coupled Fluid/Solid Momentum, Energy, Mass, and Chemical Species Transport
71
3
$13M
PISM (Parallel Ice Sheet Model)
PISM (Parallel Ice Sheet Model) is a computer model for simulating large-scale ice sheets and glaciers. It uses parallel computing techniques to simulate ice dynamics, thermodynamics, and interactions with climate and ocean systems, enabling research on past and future ice sheet behavior, sea level change, and climate dynamics.
39
2
$4.3M
Apache Uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
Castro
Castro (Compressible Astrophysics): An adaptive mesh, astrophysical compressible (radiation-, magneto-) hydrodynamics simulation code for massively parallel CPU and GPU architectures.
GPUArrays
Reusable array functionality for Julia's various GPU backends.
Gloo
Collective communications library with various primitives for multi-machine training.
HeAT
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
KernelAbstractions.jl
Heterogeneous programming in Julia
MPI.jl
MPI wrappers for Julia
NCCL
Optimized primitives for collective multi-GPU communication
OneFlow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Paboyle_Data_Parallel_Cpp_Grid
Data parallel C++ mathematical object library
SIMDe
Implementations of SIMD instruction sets for systems which don't natively support them.
SPHinXsys
SPHinXsys provides C++ APIs for engineering simulation and optimization. It aims at complex systems driven by fluid, structure, multi-body dynamics and beyond. The multi-physics library is based on a unique and unified computational framework by which strong coupling has been achieved for all involved physics.
SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
VOLK
The Vector Optimized Library of Kernels
Workflow
C++ Parallel Computing and Asynchronous Networking Framework
Xilinx Runtime
Run Time for AIE and FPGA based platforms
libEnsemble
A Python toolkit for coordinating asynchronous and dynamic ensembles of calculations.