132 projects
Pandas
Flexible and powerful data analysis and manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more.
23,357
3,547
$26M
NumPy
NumPy is a Python library for numerical computing, providing support for multi-dimensional arrays, mathematical functions, and efficient operations on large datasets.
10,108
2,271
$24M
SciPy
SciPy is a Python library that builds on NumPy to provide advanced algorithms for scientific computing, including optimization, integration, signal processing, and linear algebra.
10,037
2,059
$30M
SymPy
SymPy is a Python library for symbolic mathematics that provides computer algebra system (CAS) capabilities. It enables users to perform symbolic calculations, algebraic manipulations, calculus operations, and mathematical computations with exact precision rather than numerical approximations.
6,621
1,102
$25M
JAX
JAX is a high-performance numerical computing and machine learning library that combines NumPy's familiar API with GPU and TPU hardware acceleration. It features automatic differentiation and just-in-time compilation, and enables writing transformable numerical programs.
6,185
1,132
$20M
SageMath
Main repository of SageMath
3,779
268
$100M
Numba
Numba is a Just-In-Time (JIT) compiler that translates a subset of Python and NumPy code into fast machine code, specializing in numerical computing and scientific applications. It enables Python functions to be compiled to native machine instructions, significantly improving performance for computationally intensive operations.
3,629
875
$9.9M
Dask
Dask is a flexible parallel computing library for analytics that provides dynamic task scheduling optimized for computation and integrates with Python data science libraries like NumPy, Pandas and Scikit-learn. It enables parallel and distributed computing through intuitive APIs and scales Python code from multi-core machines to clusters.
3,575
902
$6.9M
statsmodels
Statsmodels is a Python library for statistical modeling and econometrics that provides classes and functions for estimating various statistical models, conducting statistical tests, and exploring statistical data. It includes linear regression models, time series analysis, discrete choice models, and many other statistical methods.
3,395
606
$24M
xarray
Xarray is a Python library that introduces labeled arrays and datasets, extending NumPy's capabilities by adding coordinates, dimensions, and attributes to N-dimensional arrays. It enables working with multi-dimensional data by providing data structures and operations for labeled arrays, making it particularly useful for scientific computing and analysis of structured data like climate and weather data.
2,981
705
$6.5M
Astropy
Astropy is a core package for astronomy in Python that provides tools and functionality for performing astronomical calculations, handling astronomical data, working with coordinates, time, units, and other common astronomy-related tasks. It serves as a foundational library for the astronomical computing ecosystem.
2,783
590
$20M
OpenBLAS
OpenBLAS is a high-performance open-source implementation of the Basic Linear Algebra Subprograms (BLAS) library, optimized for various processor architectures. It provides efficient matrix and vector operations essential for scientific computing, machine learning, and numerical analysis.
2,590
563
$165M
PyMC
Bayesian Modeling and Probabilistic Programming in Python
2,523
461
$3.3M
Qiskit
Qiskit is an open-source software development kit for quantum computing that enables users to create, manipulate and run quantum circuits on quantum computers and simulators. It provides tools for quantum programming, algorithm development, hardware access, and quantum education.
2,500
271
$17M
RDKit
RDKit is an open-source cheminformatics and machine learning software toolkit that provides functionality for processing, analyzing, and manipulating chemical structures. It includes tools for 2D and 3D molecular operations, chemical fingerprinting, molecular descriptor calculation, substructure searching, and integration with popular data science frameworks.
2,306
342
$49M
MOOSE
MOOSE (Multiphysics Object-Oriented Simulation Environment) is a finite element framework developed by Idaho National Laboratory for solving complex engineering and scientific problems. It provides a flexible, parallel computational framework for coupled systems of nonlinear partial differential equations.
2,079
78
$101M
Biopython
Biopython is an open-source library of tools for biological computation written in Python. It provides modules for working with biological sequences, file parsers for various bioinformatics file formats, access to online biological databases, and tools for performing common bioinformatics operations.
2,075
381
$62M
AliPhysics
AliPhysics is a physics analysis framework for analyzing data from high-energy particle physics experiments, particularly focused on heavy-ion collisions. It provides tools and algorithms for particle reconstruction, event analysis, and data processing in nuclear physics research.
1,833
25
$272M
ROOT
ROOT is a scientific software framework developed at CERN that provides tools for big data processing, statistical analysis, visualization and storage in high-energy physics research. It includes a C++ interpreter, advanced statistical tools, machine learning capabilities, and powerful graphics tools for data visualization.
1,800
253
$293M
Math.js
Math.js is an extensive mathematics library for JavaScript that provides a flexible expression parser and offers support for symbolic computation, complex numbers, units, matrices, and a wide range of mathematical functions. It can be used both in Node.js and in web browsers.
1,798
428
$3.6M
h5py
h5py is a Pythonic interface to the HDF5 binary data format, providing a high-level interface for storing and organizing large amounts of numerical data. It allows Python programs to store huge amounts of numerical data in a hierarchical format, and efficiently manipulate that data from NumPy.
1,540
476
$1.1M
Trilinos
Trilinos develops algorithms and enabling technologies for solving large-scale, complex multi-physics engineering and scientific problems on new and emerging high-performance computing (HPC) architectures. It is composed of packages that address various computational needs of engineering and scientific simulations.
1,370
118
$658M
LAMMPS
LAMMPS (Large-scale Atomic/Molecular Massively Parallel Simulator) is a classical molecular dynamics simulation code designed to efficiently compute atomic, polymeric, biological, metallic, granular, and coarse-grained systems using various interatomic potentials and boundary conditions on parallel computers.
1,362
168
$69M
OpenMM
OpenMM is a toolkit for molecular simulation using high-performance GPU code.
1,356
204
$40M
Pymatgen
Pymatgen (Python Materials Genomics) is a robust, open-source Python library for materials analysis. It provides core functionality for analyzing materials science data, performing crystallographic operations, generating phase diagrams, and analyzing electronic structures. The library serves as a key component in the Materials Project ecosystem for computational materials science research.
1,246
132
$49M
MFEM
MFEM is a free, lightweight, scalable C++ library for finite element methods. It provides high-performance mathematical algorithms for finite element discretization, linear algebra operations, mesh representation, and parallel computing capabilities for solving partial differential equations.
1,089
95
$18M
NIPYPE
NIPYPE is a Python-based neuroimaging data processing framework that provides a uniform interface to existing neuroimaging software and facilitates interaction between these packages within a single workflow. It enables reproducible, distributed analysis of neuroimaging data through workflows and interfaces to commonly used neuroimaging tools.
1,037
191
$7.2M
Drake
Drake is a robotics toolbox and model-based design tool for analyzing and controlling robotic systems. It provides a comprehensive suite of tools for simulation, optimization, and control of complex robotic systems, with support for rigid body dynamics, trajectory optimization, and control system design.
1,026
154
$32M
deal.II
deal.II is an open-source finite element analysis library designed for computational science and numerical simulations. It provides tools and algorithms for solving partial differential equations using adaptive finite element methods, with particular focus on scientific computing and engineering applications.
1,026
112
$154M
stdlib
stdlib is a comprehensive standard library for JavaScript and Node.js that provides a collection of mathematical, statistical, and scientific computing functions, along with utilities for data processing, machine learning, and numerical computing.
1,017
146
$334M
Energy Exascale Earth System Model (E3SM)
The Energy Exascale Earth System Model (E3SM) is a state-of-the-art Earth system model developed for high-performance computing to address challenging climate science problems. It focuses on modeling Earth's water cycle, biogeochemistry, and cryosphere systems at high resolution.
989
45
$75M
ITK
ITK (Insight Toolkit) is an open-source, cross-platform library providing an extensive suite of software tools for image analysis, focusing on medical image processing and registration. It offers algorithms for segmentation, registration, filtering, and analysis of multi-dimensional scientific images.
957
148
$77M
SU2
SU2 is an open-source suite of tools for performing computational fluid dynamics (CFD) simulations and solving partial differential equations. It is designed for aerodynamic shape optimization and is capable of simulating compressible, incompressible, and multiphysics flows.
870
55
$21M
OpenModelica
OpenModelica is an open-source Modelica-based modeling and simulation environment intended for industrial and academic usage. It supports model-based design and simulation of complex physical systems in multiple domains including mechanical, electrical, electronic, hydraulic, thermal, control, and electric power systems.
862
75
$70M
PyBaMM
PyBaMM (Python Battery Mathematical Modelling) is an open-source battery simulation package that enables fast and flexible simulations of battery models. It provides a unified framework for implementing and solving various physics-based battery models, from simple lumped parameter models to complex multiphysics simulations.
853
67
$6.2M
Zarr
Zarr is a format and library for chunked, compressed N-dimensional arrays, designed for efficient storage and access of large scientific datasets. It provides a Python implementation with support for cloud storage, parallel computing, and hierarchical organization of arrays.
825
292
$1.5M
ALICE O2
ALICE O2 (Online-Offline) is a computing system and software framework for the ALICE experiment at CERN. It handles data acquisition, reconstruction, calibration, and analysis for high-energy physics experiments, specifically designed to process data from heavy-ion collisions at the Large Hadron Collider.
814
32
$48M
Psi4
Open-Source Quantum Chemistry – an electronic structure package in C++ driven by Python
809
112
$29M
AMReX
AMReX is a parallel, performance-portable, block-structured adaptive mesh refinement framework for science and engineering applications.
794
75
$185M
Firedrake
Firedrake is an automated system for the portable solution of partial differential equations using the finite element method (FEM)
789
85
$10M
GEOS-Chem
GEOS-Chem is a global 3-D chemical transport model for atmospheric composition that simulates atmospheric chemistry, aerosols, and greenhouse gases. It is used by research groups worldwide for studying atmospheric chemistry and composition, air quality, and climate change.
741
19
$18M
Kratos Multiphysics
Kratos Multiphysics is an open-source framework for the implementation of numerical methods for solving engineering problems, particularly focused on finite element methods, computational fluid dynamics, and structural mechanics. It provides a unified environment for multi-physics simulations with parallel processing capabilities.
736
46
$124M
netCDF4-python
netcdf4-python is a Python/NumPy interface to the netCDF C library.
733
205
$674K
SpikeInterface
A Python-based module for creating flexible and robust spike sorting pipelines.
702
56
$4.1M
FieldTrip
The MATLAB toolbox for MEG, EEG and iEEG analysis
688
70
$23M
WRF
The Weather Research and Forecasting (WRF) Model is a next-generation mesoscale numerical weather prediction system designed for both atmospheric research and operational forecasting applications. It features multiple dynamical cores and a software architecture allowing for parallel computation and system extensibility.
667
59
$49M
ModelingToolkit.jl
ModelingToolkit.jl is a symbolic modeling framework for scientific computing and automated model transformation in Julia. It enables users to define, manipulate, and solve complex mathematical models through symbolic computation, automatic differentiation, and code generation capabilities.
660
156
$2.1M
wxMaxima
A GUI for the computer algebra system Maxima, built with wxWidgets.
651
88
$5.7M
Unidata NetCDF
NetCDF (Network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. This C implementation provides the reference library and tools for working with NetCDF files, enabling efficient storage and retrieval of multi-dimensional scientific data.
640
170
$20M
Symbolics.jl
Symbolic programming for the next generation of numerical software
639
167
$584K
OpenMC
OpenMC is a Monte Carlo particle transport code.
632
58
$6.8M
CP2K
Quantum chemistry and solid state physics software package
613
84
$39M
Mantid
Mantid (Manipulation and Analysis Toolkit for Instrument Data) is a scientific software framework for processing and analyzing neutron and muon scattering data from large-scale facilities. It provides tools for data reduction, visualization, and analysis used by scientists at neutron and muon sources worldwide.
601
41
$93M
WarpX
WarpX is a community-developed, performance-portable, modular particle-in-cell code with advanced algorithms.
601
56
$12M
OrdinaryDiffEq.jl
OrdinaryDiffEq.jl is a high-performance solver package for ordinary differential equations (ODEs) in Julia. It provides a comprehensive suite of numerical methods for solving various types of ODEs, including stiff and non-stiff problems, with features for automatic algorithm selection and performance optimization.
584
140
$7M
CaImAn
Computational toolbox for large-scale calcium imaging analysis, including movie handling, motion correction, source extraction, spike deconvolution, and result visualization.
561
66
$1.5M
PyCBC
Core package to analyze gravitational-wave data, find signals, and study their parameters. This package was used in the first direct detection of gravitational waves (GW150914), and is used in the ongoing analysis of LIGO/Virgo data.
555
38
$13M
QCoDeS
Modular data acquisition framework
543
45
$7.9M
Stan
Stan is a state-of-the-art platform for statistical modeling and high-performance statistical computation. It provides a probabilistic programming language for specifying statistical models and implements gradient-based Markov Chain Monte Carlo (MCMC) sampling for Bayesian inference.
543
108
$3.3M
PowSyBl
PowSyBl (Power System Blocks) is an open source library dedicated to electrical grid modeling and simulation. PowSyBl is written in Java and licensed under the Mozilla Public License 2.0. PowSyBl may be scripted for a quick solution, but it can also be used as a library to write complex software for power system simulation and analysis. Using PowSyBl, developers can create applications able to perform dynamic power flow simulations and security analyses on the network, handle a variety of formats including CGMES for European data exchanges, and many other tasks. PowSyBl has a completely modular design, enabling developers to extend or customize its features by providing their own plugins. Modularity is a key value of PowSyBl’s open source design.
511
54
$692M