Starred repositories
A powerful tool that translates ComfyUI workflows into executable Python code.
Scalable and memory-optimized training of diffusion models
This repository shows how to use Q8 kernels with `diffusers` to optimize inference of LTX-Video on ADA GPUs.
A pipeline parallel training script for diffusion models.
The python library for real-time communication
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
[NeurIPS 2024] Official code for "Splatter a Video: Video Gaussian Representation for Versatile Processing"
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Let us democratise high-resolution generation! (CVPR 2024)
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Attempt at cog wrapper for Panoramic SDXL inpainted image
Finetune ModelScope's Text To Video model using Diffusers 🧨
SD.Next: All-in-one WebUI for AI generative image and video creation
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Stable Diffusion with Core ML on Apple Silicon
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voic…
Open Academic Research on Improving LLaMA to SOTA LLM
A high-throughput and memory-efficient inference and serving engine for LLMs
Large Language Model Text Generation Inference