Skip to content
View davidluciolu's full-sized avatar
  • University of Chinese Academy of Sciences
  • Beijing

Block or report davidluciolu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 911 75 Updated Sep 2, 2025
Python 101 4 Updated Jul 11, 2025

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 665 50 Updated Jul 31, 2025

[ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark

Python 118 1 Updated Jul 9, 2025
Python 107 8 Updated Aug 20, 2025

(ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"

Python 40 1 Updated Jul 1, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 13,376 2,362 Updated Sep 15, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 886 69 Updated Sep 15, 2025

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

Python 131 6 Updated May 16, 2025
Python 9 Updated May 31, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

938 34 Updated Aug 30, 2025
Python 807 42 Updated Sep 3, 2025

Pixel-Level Reasoning Model trained with RL

Python 204 7 Updated Sep 10, 2025

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 689 39 Updated Sep 7, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v…

Python 9,867 869 Updated Sep 15, 2025

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Python 184 6 Updated Aug 18, 2025

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Python 373 31 Updated Apr 29, 2024

IJCAI Review & MetaReview Monitor

Python 107 4 Updated Apr 18, 2025

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Python 291 28 Updated May 14, 2025
Python 941 63 Updated Mar 24, 2025

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

641 25 Updated Aug 22, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,403 176 Updated Mar 28, 2025

Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,213 110 Updated Mar 28, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,584 269 Updated Sep 13, 2025

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.

Python 2,937 269 Updated Aug 2, 2025

This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reas…

Python 691 17 Updated Sep 10, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 733 27 Updated Sep 7, 2025

A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.

68 7 Updated Mar 18, 2025

Explore the Multimodal “Aha Moment” on 2B Model

Python 608 22 Updated Mar 18, 2025

MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka

Python 317 9 Updated Jun 21, 2025
Next