- San Francisco
Highlights
- Pro
Stars
[ICCV2025]LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
Expressive Anechoic Recordings of Speech (EARS)
Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
Audio Source Separation using the Non Negative Matrix Multiplication
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
This example shows how to build and train a convolutional neural network (CNN) from scratch to perform a classification task with an EEG dataset.
Checkout bot capable of monitoring site and sending a text message via twilio when the desired item is found
📚 Freely available programming books