-
Concordia University/Mila
- Montreal
- https://sites.google.com/site/mircoravanelli/
Stars
A general purpose task-agnostic speech augmentation policy
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
This repository contains the SpeechBrain Benchmarks
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
This repository provides a comprehensive pipeline for analyzing motor imagery EEG data using MNE-Python and PyTorch.
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
Asynchronous Distributed Hyperparameter Optimization.
Command-line tools for speech and intent recognition on Linux
A powerful and flexible machine learning platform for drug discovery
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DN…
Self-Supervised Speech Pre-training and Representation Learning Toolkit
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Extensions to YAML syntax for better python interaction
BabyAI platform. A testbed for training agents to understand and execute language commands.
Repository of QC Experiments based on D-Wave Leap
PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning
Various tutorials given for welcoming new students at MILA.
This library provides common speech features for ASR including MFCCs and filterbank energies.
Tools and kaldi baselines for the DIRHA English wsj dataset
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…
Implementation of speech recognition with TwinNet
Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
Tensors and Dynamic neural networks in Python with strong GPU acceleration