LFX Platform

Know more about LFX Platform

LFX Insights
Curated Collections

Text Analysis & Corpus Management

Libraries and tools for processing, analyzing, and managing text corpora, enabling quantitative analysis of textual data for research and applications in computational linguistics and content analysis.

The Linux Foundation

by The Linux Foundation

19 projects

Project
Contributors
Organizations
Software value
spaCy
6,495
1,146
$7.8M
Natural Language Toolkit (NLTK)
2,713
627
$4.1M
Stanford CoreNLP
1,317
241
$26M
HanLP
1,277
94
$2.1M
Compromise
731
228
$2.7M
Natural
723
240
$39M
tiktoken
709
199
$81K
Chrono
651
219
$858K
Ansj Chinese Segmentation
621
57
$25M
Moses SMT Decoder
483
57
$17M
quanteda
440
69
$4.4M
Thinc
300
107
$1.7M
WebAnno
263
56
$5M
inflect
229
70
$223K
Snowball
207
80
$1.1M
Apache OpenNLP
172
39
$5.9M
DKPro Core
170
34
$15M
retext
68
36
$61K
preshed
31
17
$36K
Looking for a project that’s not listed?