33 projects
Elasticsearch
Elasticsearch is a distributed, RESTful search and analytics engine capable of addressing a growing number of use cases. As the heart of the Elastic Stack, it centrally stores data for lightning fast search, fine‑tuned relevancy, and powerful analytics.
18,348
3,665
$204M
Faiss
Faiss is a library for efficient similarity search and clustering of dense vectors, developed by Facebook Research. It contains algorithms that search in sets of vectors of any size, up to ones that possibly do not fit in RAM. It also includes support for different similarity metrics and various optimization methods for fast and accurate vector search.
2,750
501
$5.2M
Chroma
the AI-native open-source embedding database
2,422
461
$14M
Meilisearch
Meilisearch is an open-source search engine that provides fast, relevant, and typo-tolerant full-text search capabilities. It is designed to be easy to integrate into applications and offers features like instant search results, custom ranking rules, and faceted search.
1,692
480
$7.5M
InstantSearch
⚡️ Libraries for building performant and instant search and recommend experiences with Algolia. Compatible with JavaScript, TypeScript, React and Vue.
1,583
341
$9.4M
mu
mu is an email indexing and search tool that enables fast email search, filtering and organization. It works with Maildirs and supports integration with email clients like Emacs. The tool provides a command-line interface for indexing and searching emails, with features like full-text search, message threading, and attachment handling.
1,329
377
$2.6M
Apache Lucene
Apache Lucene is a high-performance, full-featured text search engine library written in Java. It provides indexing and search technology, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities.
1,282
177
$40M
Elasticsearch Curator
Elasticsearch Curator is a maintenance and management tool for Elasticsearch indices that helps users perform administrative tasks like creating, deleting, and managing indices based on time-based patterns, size thresholds, and other configurable criteria.
1,265
376
$1M
Fuse.js
Fuse.js is a lightweight fuzzy-search library that enables powerful searching with approximate string matching and ranking in JavaScript. It allows developers to search through lists and find items even when there are spelling mistakes or partial matches.
1,170
397
$301K
Laravel Scout
Laravel Scout provides a driver based solution to searching your Eloquent models.
1,028
268
$237K
Apache Solr
Apache Solr is an open-source enterprise search platform built on Apache Lucene. It provides distributed indexing, replication, load-balanced querying, automated failover and recovery, centralized configuration, and full-text search capabilities. Solr powers the search and navigation features of many large-scale internet sites.
993
136
$58M
OpenGrok
OpenGrok is a fast and usable source code search and cross reference engine that enables searching and navigating source code repositories. It helps developers understand and analyze code bases by providing features like full text search, cross-referencing, and syntax highlighting.
974
149
$4.7M
Anserini
Anserini is an information retrieval toolkit built on Lucene that provides indexing and search capabilities for academic research and production deployments. It offers efficient implementations of state-of-the-art IR techniques, reproducible research methods, and tools for working with common IR test collections.
919
88
$8M
PgSearch
PgSearch is a Ruby gem that adds full text search capabilities to PostgreSQL databases in Ruby on Rails applications. It provides a simple interface for creating scopes that perform full text search using PostgreSQL's built-in full text search features.
895
285
$183K
Bleve
Bleve is a modern text indexing and search library for Go that provides full-text search capabilities with support for multiple storage engines, rich query types, and faceted search. It enables developers to add search functionality to their Go applications with features like text analysis, term mapping, and scoring.
657
240
$4.5M
Lunr.js
Lunr.js is a lightweight, full-text search library for browser-based applications that enables offline search functionality without external dependencies. It provides features like field-based weighting, fuzzy matching, and stemming while being designed to run entirely on the client side.
656
257
$664K
FlexSearch
FlexSearch is a full-text search library for JavaScript that provides fast and memory-efficient search capabilities with advanced features like fuzzy matching, field-specific search, and customizable indexing strategies
608
208
$3.1M
DocSearch
:blue_book: The easiest way to add search to your documentation.
589
215
$796K
dnGrep
Graphical GREP tool for Windows
427
28
$10M
RoaringBitmap
A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others
343
73
$3.1M
Kernel Memory
Kernel Memory is an AI service and SDK that provides document ingestion, storage, and retrieval capabilities with semantic search and AI reasoning. It enables applications to efficiently process, index, and query documents using natural language, supporting multiple data sources and AI models.
305
46
$825K
emojilib
A comprehensive JSON library containing emoji data including keywords, categories, and annotations to help implement emoji functionality in applications
265
86
$619K
Apache Nutch
Apache Nutch is a highly extensible and scalable open source web crawler framework written in Java. It enables automated fetching, parsing, and data extraction from web pages at scale, with features for distributed crawling, link analysis, and integration with Apache Hadoop for processing large datasets.
221
41
$2.7M
DataWave
DataWave is an ingest/query framework that leverages Apache Accumulo to provide fast, secure data access.
218
20
$34M
Hibernate Search
Hibernate Search: full-text search for domain model
195
41
$18M
Earthdata Search
Earthdata Search is a web application developed by NASA that enables users to search, discover, visualize, and access Earth science data from NASA's Earth Observing System Data and Information System (EOSDIS). It provides a unified interface for browsing and downloading satellite imagery, climate data, and other Earth science datasets.
188
35
$22M
Xapian
Xapian is an open source search engine library that enables probabilistic information retrieval and full-text indexing capabilities. It provides a highly adaptable toolkit for adding intelligent searching to applications, supporting features like relevance-based ranking, boolean queries, and phrase searching.
146
32
$179M
NNTmux
NNTmux is a Usenet indexing application forked from Newznab, designed to help users search, browse and manage Usenet content. It provides features for indexing newsgroups, managing NZB files, and integrating with various Usenet services.
144
16
$4.9M
FTS Xapian
Dovecot FTS plugin based on Xapian
142
32
$103K
photon
Photon is an open-source geocoding engine that converts addresses and place names into geographic coordinates. It provides fast and accurate search functionality for OpenStreetMap data, offering both forward (text to coordinates) and reverse (coordinates to text) geocoding capabilities through a REST API.
69
8
$557K
Apache Lucene and Solr
Apache Lucene and Solr open-source search software