LFX Insights

Indexing Libraries & Query Tools

Software for creating and querying data indexes efficiently.

33 projects

41,823 contributors

$641M

Elasticsearch

Elasticsearch is a distributed, RESTful search and analytics engine capable of addressing a growing number of use cases. As the heart of the Elastic Stack, it centrally stores data for lightning-fast search, fine‑tuned relevancy, and powerful analytics.

Contributors

18,348

Organizations

3,665

Software value

$204M

Faiss

Faiss is a library for efficient similarity search and clustering of dense vectors, developed by Facebook Research. It contains algorithms that search in sets of vectors of any size, up to ones that possibly do not fit in RAM. It also includes support for different similarity metrics and various optimization methods for fast and accurate vector search.

Contributors

2,750

Organizations

501

Software value

$5.2M

Chroma

The AI-native open-source embedding database.

Contributors

2,422

Organizations

461

Software value

$14M

Meilisearch

Meilisearch is an open-source search engine that provides fast, relevant, and typo-tolerant full-text search capabilities. It is designed to be easy to integrate into applications and offers features like instant search results, custom ranking rules, and faceted search.

Contributors

1,692

Organizations

480

Software value

$7.5M

InstantSearch

⚡️ Libraries for building performant and instant search and recommend experiences with Algolia. Compatible with JavaScript, TypeScript, React and Vue.

Contributors

1,583

Organizations

341

Software value

$9.4M

mu

mu is an email indexing and search tool that enables fast email search, filtering and organization. It works with Maildirs and supports integration with email clients like Emacs. The tool provides a command-line interface for indexing and searching emails, with features like full-text search, message threading, and attachment handling.

Contributors

1,329

Organizations

377

Software value

$2.6M

Apache Lucene

Apache Lucene is a high-performance, full-featured text search engine library written in Java. It provides indexing and search technology, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities.

Contributors

1,282

Organizations

177

Software value

$40M

Elasticsearch Curator

Elasticsearch Curator is a maintenance and management tool for Elasticsearch indices that helps users perform administrative tasks like creating, deleting, and managing indices based on time-based patterns, size thresholds, and other configurable criteria.

Contributors

1,265

Organizations

376

Software value

$1M

Fuse.js

Fuse.js is a lightweight fuzzy-search library that enables powerful searching with approximate string matching and ranking in JavaScript. It allows developers to search through lists and find items even when there are spelling mistakes or partial matches.

Contributors

1,170

Organizations

397

Software value

$301K

Laravel Scout

Laravel Scout provides a driver-based solution for searching your Eloquent models.

Contributors

1,028

Organizations

268

Software value

$237K

Apache Solr

Apache Solr is an open-source enterprise search platform built on Apache Lucene. It provides distributed indexing, replication, load-balanced querying, automated failover and recovery, centralized configuration, and full-text search capabilities. Solr powers the search and navigation features of many large-scale internet sites.

Contributors

993

Organizations

136

Software value

$58M

OpenGrok

OpenGrok is a fast and usable source code search and cross-reference engine that enables searching and navigating source code repositories. It helps developers understand and analyze code bases by providing features like full-text search, cross-referencing, and syntax highlighting.

Contributors

974

Organizations

149

Software value

$4.7M

Anserini

Anserini is an information retrieval toolkit built on Lucene that provides indexing and search capabilities for academic research and production deployments. It offers efficient implementations of state-of-the-art IR techniques, reproducible research methods, and tools for working with common IR test collections.

Contributors

919

Organizations

88

Software value

$8M

PgSearch

PgSearch is a Ruby gem that adds full-text search capabilities to PostgreSQL databases in Ruby on Rails applications. It provides a simple interface for creating scopes that perform full-text search using PostgreSQL's built-in full-text search features.

Contributors

895

Organizations

285

Software value

$183K

Bleve

Bleve is a modern text indexing and search library for Go that provides full-text search capabilities with support for multiple storage engines, rich query types, and faceted search. It enables developers to add search functionality to their Go applications with features like text analysis, term mapping, and scoring.

Contributors

657

Organizations

240

Software value

$4.5M

Lunr.js

Lunr.js is a lightweight, full-text search library for browser-based applications that enables offline search functionality without external dependencies. It provides features like field-based weighting, fuzzy matching, and stemming while being designed to run entirely on the client side.

Contributors

656

Organizations

257

Software value

$664K

FlexSearch

FlexSearch is a full-text search library for JavaScript that provides fast and memory-efficient search capabilities with advanced features like fuzzy matching, field-specific search, and customizable indexing strategies.

Contributors

608

Organizations

208

Software value

$3.1M

DocSearch

The easiest way to add search to your documentation.

Contributors

589

Organizations

215

Software value

$796K

dnGrep

A graphical grep tool for Windows.

Contributors

427

Organizations

28

Software value

$10M

RoaringBitmap

A better compressed bitset in Java, used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others.

Contributors

343

Organizations

73

Software value

$3.1M

Kernel Memory

Kernel Memory is an AI service and SDK that provides document ingestion, storage, and retrieval capabilities with semantic search and AI reasoning. It enables applications to efficiently process, index, and query documents using natural language, supporting multiple data sources and AI models.

Contributors

305

Organizations

46

Software value

$825K

emojilib

A comprehensive JSON library containing emoji data including keywords, categories, and annotations to help implement emoji functionality in applications.

Contributors

265

Organizations

86

Software value

$619K

Apache Nutch

Apache Nutch is a highly extensible and scalable open-source web crawler framework written in Java. It enables automated fetching, parsing, and data extraction from web pages at scale, with features for distributed crawling, link analysis, and integration with Apache Hadoop for processing large datasets.

Contributors

221

Organizations

41

Software value

$2.7M

DataWave

DataWave is an ingest/query framework that leverages Apache Accumulo to provide fast, secure data access.

Contributors

218

Organizations

20

Software value

$34M

Hibernate Search

Hibernate Search provides full-text search for your domain model.

Contributors

195

Organizations

41

Software value

$18M

Earthdata Search

Earthdata Search is a web application developed by NASA that enables users to search, discover, visualize, and access Earth science data from NASA's Earth Observing System Data and Information System (EOSDIS). It provides a unified interface for browsing and downloading satellite imagery, climate data, and other Earth science datasets.

Contributors

188

Organizations

35

Software value

$22M

Xapian

Xapian is an open-source search engine library that enables probabilistic information retrieval and full-text indexing capabilities. It provides a highly adaptable toolkit for adding intelligent searching to applications, supporting features like relevance-based ranking, boolean queries, and phrase searching.

Contributors

146

Organizations

32

Software value

$179M

NNTmux

NNTmux is a Usenet indexing application forked from Newznab, designed to help users search, browse and manage Usenet content. It provides features for indexing newsgroups, managing NZB files, and integrating with various Usenet services.

Contributors

144

Organizations

16

Software value

$4.9M

FTS Xapian

A Dovecot full-text search (FTS) plugin based on Xapian.

Contributors

142

Organizations

32

Software value

$103K

photon

Photon is an open-source geocoding engine that converts addresses and place names into geographic coordinates. It provides fast and accurate search functionality for OpenStreetMap data, offering both forward (text to coordinates) and reverse (coordinates to text) geocoding capabilities through a REST API.

Contributors

69

Organizations

8

Software value

$557K

Apache Lucene and Solr

Apache Lucene and Solr open-source search software

This project hasn't been onboarded to LFX Insights.