🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
SoftVC VITS Singing Voice Conversion
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
ModelScope: bring the notion of Model-as-a-Service to life.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Officially maintained models supported by PaddlePaddle, covering CV, NLP, speech, recommendation (Rec), time series (TS), large models, and more.
Speech To Speech: an effort for an open-sourced and modular GPT-4o
Foundational model for human-like, expressive TTS
Voice Recognition to Text Tool: an offline, locally run tool that converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain-text formats
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Noise suppression using deep filtering
OpenAI Whisper ASR Webservice API
Lingvo
Data manipulation and transformation for audio signal processing, powered by PyTorch