xk-huang: 60 stars written in Jupyter Notebook

A latent text-to-image diffusion model

Jupyter Notebook 71,543 10,509 Updated Jun 18, 2024

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 51,981 6,083 Updated Sep 18, 2024

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Jupyter Notebook 30,888 3,773 Updated Jul 23, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,505 2,450 Updated Mar 13, 2025

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 16,964 1,540 Updated Sep 5, 2024

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 16,955 2,872 Updated Sep 10, 2025

StableLM: Stability AI Language Models

Jupyter Notebook 15,802 1,032 Updated Apr 8, 2024

This repository contains the source code for the paper First Order Motion Model for Image Animation

Jupyter Notebook 14,933 3,288 Updated Nov 14, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,373 1,666 Updated Feb 29, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,511 879 Updated Sep 1, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,927 1,067 Updated Nov 18, 2024

Code release for NeRF (Neural Radiance Fields)

Jupyter Notebook 10,611 1,440 Updated Apr 12, 2025

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 9,260 979 Updated Feb 5, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,415 541 Updated May 18, 2025
Jupyter Notebook 7,452 1,110 Updated Jul 9, 2023

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,444 490 Updated Mar 22, 2024

CoreNet: A library for training deep neural networks

Jupyter Notebook 7,020 547 Updated Aug 24, 2025

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,324 1,207 Updated Jul 30, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,507 715 Updated Aug 5, 2024

Material for gpu-mode lectures

Jupyter Notebook 5,105 511 Updated Sep 23, 2025

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 4,254 1,114 Updated Jan 1, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,157 198 Updated May 19, 2025

NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning

Jupyter Notebook 2,802 477 Updated Aug 3, 2023
Jupyter Notebook 2,582 456 Updated Dec 16, 2023

Instant-ngp in pytorch+cuda trained with pytorch-lightning (high quality with high speed, with only a few lines of legible code)

Jupyter Notebook 1,285 157 Updated Jun 16, 2023

Official PyTorch repo for GAN's N' Roses. Diverse im2im and vid2vid selfie to anime translation.

Jupyter Notebook 1,155 151 Updated May 27, 2022

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Jupyter Notebook 1,045 75 Updated Mar 25, 2023
Jupyter Notebook 1,020 226 Updated Mar 20, 2024

FaceScape (PAMI2023 & CVPR2020)

Jupyter Notebook 918 103 Updated Oct 20, 2023

VPoser: Variational Human Pose Prior

Jupyter Notebook 911 157 Updated Oct 25, 2022