-
UC Santa Cruz
-
17:31
(UTC +01:00) - xk-huang.github.io
Highlights
- Pro
Lists (10)
Sort Name ascending (A-Z)
Stars
A latent text-to-image diffusion model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
StableLM: Stability AI Language Models
This repository contains the source code for the paper First Order Motion Model for Image Animation
High-Resolution Image Synthesis with Latent Diffusion Models
LAVIS - A One-stop Library for Language-Vision Intelligence
Code release for NeRF (Neural Radiance Fields)
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Using Low-rank adaptation to quickly fine-tune diffusion models.
CoreNet: A library for training deep neural networks
Taming Transformers for High-Resolution Image Synthesis
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning
Instant-ngp in pytorch+cuda trained with pytorch-lightning (high quality with high speed, with only few lines of legible code)
Official PyTorch repo for GAN's N' Roses. Diverse im2im and vid2vid selfie to anime translation.
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
yzhou359 / MakeItTalk
Forked from adobe-research/MakeItTalkFaceScape (PAMI2023 & CVPR2020)
VPoser: Variational Human Pose Prior