University of Chinese Academy of Sciences
Beijing

Stars
A MemAgent framework that can extrapolate to 3.5M-token contexts, along with a framework for RL training of any agent workflow.
[ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark
(ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"
verl: Volcano Engine Reinforcement Learning for LLMs
verl-agent is an extension of veRL for training LLM/VLM agents via RL; it is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
Pixel-Level Reasoning Model trained with RL
Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v…
VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.
This is the first paper to explore how to effectively use R1-like RL for MLLMs; it introduces Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning capability.
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
Exploring the multimodal "Aha Moment" on a 2B model
MM-Eureka V0, also called R1-Multimodal-Journey; the latest version is in MM-Eureka.
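Several of the starred RL frameworks above (verl-agent, EasyR1) build on GRPO-style training. A minimal sketch of the core idea, group-relative advantage estimation — not code from any of these repos, just an illustration of the technique named in the GiGPO paper title:

```python
# Illustrative sketch: GRPO-style group-relative advantages.
# Several rollouts are sampled from the same prompt, and each rollout's
# reward is normalized against the group's mean and std, so no learned
# critic is needed to estimate a baseline.

def group_relative_advantages(rewards, eps=1e-6):
    """Normalize each reward within its rollout group."""
    mean = sum(rewards) / len(rewards)
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = var ** 0.5
    return [(r - mean) / (std + eps) for r in rewards]

# Example: 4 rollouts of one prompt with binary correctness rewards.
adv = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Correct rollouts receive positive advantages and incorrect ones negative, so the policy gradient pushes toward the better completions within each group.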
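The "rule-based reinforcement learning" in the MM-EUREKA entry refers to rewards computed by fixed rules rather than a learned reward model. A generic sketch of such a reward, assuming a `\boxed{}` answer format — the exact format checks and reward values here are hypothetical, not MM-EUREKA's actual code:

```python
# Illustrative sketch of an R1-style rule-based reward: a format check
# (answer wrapped in \boxed{...}) combined with an exact-match accuracy
# check. The 0.1 partial credit for correct format is an assumption.
import re

def rule_based_reward(response: str, ground_truth: str) -> float:
    match = re.search(r"\\boxed\{([^}]*)\}", response)
    if match is None:
        return 0.0  # no boxed answer: fails the format rule
    answer = match.group(1).strip()
    # Full reward for a correct answer, small format-only reward otherwise.
    return 1.0 if answer == ground_truth.strip() else 0.1

r = rule_based_reward(r"The answer is \boxed{42}", "42")
```

Because the reward is a deterministic function of the model's text output, it is cheap to compute at scale and cannot be gamed the way a learned reward model can.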