Starred repositories
Enable macOS HiDPI and have a native setting.
Awesome speech/audio LLMs, representation learning, and codec models
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Align Anything: Training All-modality Model with Feedback
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
A generative speech model for daily dialogue.
A fast, simple & powerful blog framework, powered by Node.js.
Build real-time multimodal AI applications 🤖🎙️📹
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
Application of MB-iSTFT-VITS components to vits2_pytorch
Unofficial Implementation of Long-term Forecasting with TiDE: Time-series Dense Encoder
Train transformer language models with reinforcement learning.
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
✨✨Latest Advances on Multimodal Large Language Models
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻中国独立开发者项目列表 -- 分享大家都在做什么
A curated list of insanely awesome libraries, packages and resources for systematic trading. Crypto, Stock, Futures, Options, CFDs, FX, and more | 量化交易 | 量化投资
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.