Skip to content
View LiNO3Dy's full-sized avatar

Highlights

  • Pro

Block or report LiNO3Dy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Swift 383 28 Updated Oct 1, 2024
Python 1,836 62 Updated Jun 28, 2024

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.

Python 635 32 Updated Jul 25, 2025

Implementation of all RL algorithms in a simpler way

Jupyter Notebook 1,146 198 Updated Aug 29, 2025

A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch

Jupyter Notebook 63 11 Updated Jun 16, 2025

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,749 999 Updated Sep 20, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,337 174 Updated Sep 30, 2025

📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.

275 17 Updated Sep 30, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 15,188 1,107 Updated Sep 29, 2025

End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Python 292 16 Updated Sep 22, 2025
Python 59 Updated Sep 25, 2025

A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.

Python 95 8 Updated Sep 19, 2025

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,049 34 Updated Sep 16, 2025

Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation

Python 31 1 Updated Aug 5, 2025

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think

Python 555 35 Updated Sep 30, 2025

T2I-Adapter

Python 3,746 226 Updated Jun 21, 2024

The collection of awesome papers on alignment of diffusion models.

339 16 Updated Sep 25, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,038 785 Updated Sep 22, 2025

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 685 58 Updated Mar 22, 2024

An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,372 66 Updated Sep 18, 2025

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 25,941 2,675 Updated Sep 30, 2025

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 321 14 Updated Sep 15, 2025

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 570 66 Updated Sep 19, 2025

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Python 363 10 Updated Sep 29, 2025

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,247 398 Updated Jun 28, 2024

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,305 273 Updated Sep 30, 2025

Ring attention implementation with flash attention

Python 883 84 Updated Sep 10, 2025

This repo powers my blog experiment where ChatGPT manages a real-money micro-cap stock portfolio.

Python 6,222 1,352 Updated Sep 30, 2025

"VideoRAG: Chat with Your Videos"

Python 1,155 165 Updated Sep 12, 2025
Next