LFX Platform

Know more about LFX Platform

LFX Insights

Speech Processing Toolkits

Toolkits specialized in processing spoken language data, including speech recognition, speaker diarization, speech enhancement, and spoken language understanding.

9 projects

9,758 contributors

$54M

Coqui TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Contributors

2,610

Organizations

356

Software value

$11M

Whisper

Whisper is an automatic speech recognition (ASR) system developed by OpenAI that can transcribe and translate spoken language from audio into text. It is trained on a large dataset of multilingual speech data and can handle various languages, accents, and acoustic environments.

Contributors

2,166

Organizations

295

Software value

$606K

Kaldi Speech Recognition Toolkit

kaldi-asr/kaldi is the official location of the Kaldi project.

Contributors

2,044

Organizations

254

Software value

$27M

SpeechBrain

SpeechBrain is an open-source speech toolkit built on PyTorch that provides state-of-the-art speech technologies, including speech recognition, speaker recognition, speech enhancement, multi-microphone signal processing and speech separation. It features a unified, flexible interface for speech research and applications.

Contributors

1,414

Organizations

176

Software value

$9.4M

eSpeak NG

eSpeak NG is an open-source speech synthesizer that supports multiple languages and can convert text to speech. It is a fork and continuation of the original eSpeak project, offering improved voice quality, additional language support, and various phonetic improvements.

Contributors

1,059

Organizations

172

Software value

$2.1M

VOICEVOX

無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXのエディター

Contributors

298

Organizations

35

Software value

$2.4M

Archived

DELTA

Delta is a deep learning based end-to-end natural language and speech processing platform. DELTA aims to provide easy and fast experiences for using, deploying, and developing natural language processing and speech models for both academia and industry use cases. DELTA is mainly implemented using TensorFlow and Python 3.

Contributors

167

Organizations

19

Software value

$2M

Lhotse

Tools for handling speech data in machine learning projects.

This project hasn't been onboarded to LFX Insights.

torchaudio

Data manipulation and transformation for audio signal processing, powered by PyTorch

This project hasn't been onboarded to LFX Insights.
Looking for a project that’s not listed?