Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v…

Python 9,914 872 Updated Sep 16, 2025

pengzhangzhi / Open-dLLM

The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 239 11 Updated Sep 16, 2025

josejg / how-to-imagenet

Tooling to download and prepare the ILSVRC2012 dataset

Jupyter Notebook 1 Updated Aug 12, 2025

fontra / fontra

A browser-based font editor

JavaScript 675 41 Updated Sep 13, 2025

p-doom / jasmine

Forked from FLAIROx/jafar

A simple, performant and scalable JAX-based world modeling codebase

Python 73 6 Updated Sep 16, 2025

UCSC-VLAA / MedVLThinker

MedVLThinker: Simple Baselines for Multimodal Medical Reasoning

Jupyter Notebook 34 2 Updated Aug 17, 2025

The-Pocket / PocketFlow

Pocket Flow: 100-line LLM framework. Let Agents build Agents!

Python 8,369 949 Updated Aug 13, 2025

X-Omni-Team / X-Omni

Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).

Python 372 10 Updated Aug 26, 2025

wyhlovecpp / GPT-Image-Edit

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Python 225 4 Updated Aug 15, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 5,409 419 Updated Sep 17, 2025

ximinng / PyTorch-SVGRender

SVG Differentiable Rendering: Generating vector graphics using neural networks. Support: text-to-SVG, Image-to-SVG, SVG Editing.

Python 437 47 Updated Feb 25, 2025

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 75,469 8,012 Updated Sep 17, 2025

google-gemini / gemini-fullstack-langgraph-quickstart

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 16,811 2,840 Updated Sep 10, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,908 191 Updated Sep 11, 2025

open-thought / reasoning-gym

procedural reasoning datasets

Python 1,102 89 Updated Sep 15, 2025

huggingface / smolagents

🤗 smolagents: a barebones library for agents that think in code.

Python 22,853 2,003 Updated Sep 12, 2025

CharlesQ9 / Alita

804 46 Updated Aug 30, 2025

SynthLabsAI / big-math

A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Python 64 4 Updated Feb 25, 2025

SkyworkAI / Matrix-Game

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,627 161 Updated Aug 21, 2025

AMAP-ML / UniVG-R1

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Python 139 6 Updated Jun 2, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 44,422 7,550 Updated Dec 9, 2024

shiyi-zh0408 / FlexiAct

[SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"

Jupyter Notebook 332 28 Updated Aug 18, 2025

microsoft / x-reasoner

X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

47 2 Updated May 9, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,597 272 Updated Sep 16, 2025

willccbb / verifiers

Verifiers for LLM Reinforcement Learning

Python 3,053 337 Updated Sep 13, 2025

FoundationAgents / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 58,430 7,061 Updated Jun 30, 2025

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 3,484 199 Updated Jun 17, 2025

openai / codex

Lightweight coding agent that runs in your terminal

Rust 41,622 4,856 Updated Sep 17, 2025

GuanxingLu / vlarl

Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.

Python 278 12 Updated Sep 13, 2025

SkyworkAI / Skywork-R1V

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.

Python 2,935 269 Updated Aug 2, 2025

Xiaoke Huang xk-huang

Highlights

Lists (10)

3D Generation Friends

3DMM Friends

Diffusion Model Friends

Face Reenactment

✨ Inspiration

NeRF Friends

Regression Friends

Utilities

Video Generation Friends

Visual-Language

Stars