xujz18

Follow

Jiazheng Xu xujz18

Follow

Ph.D. student of Tsinghua CS @THUDM

143 followers · 65 following

Tsinghua University, KEG Group
Beijing, China
05:47 (UTC +08:00)
@xujz0703
in/jiazheng-xu-6a96a11b2
https://www.semanticscholar.org/author/Jiazheng-Xu/2214082934

Achievements

Achievements

Organizations

Stars

Liuziyu77 / Visual-RFT

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,238 56 Updated Mar 12, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,439 85 Updated Mar 14, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 4,101 255 Updated Mar 14, 2025

hkust-nlp / CodeIO

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 467 29 Updated Feb 21, 2025

FoundationVision / Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,004 42 Updated Feb 23, 2025

MoonshotAI / MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,658 99 Updated Mar 7, 2025

TideDra / lmm-r1

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 526 31 Updated Mar 13, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,689 616 Updated Mar 7, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,779 465 Updated Mar 14, 2025

Xiao9905 / AutoGLM

JavaScript 46 6 Updated Nov 5, 2024

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,285 221 Updated Feb 13, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,052 54 Updated Feb 8, 2025

FanqingM / R1-Multimodal-Journey

A jounery to real multimodel R1 ! We are doing on large-scale experiment

Python 272 5 Updated Mar 8, 2025

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 16,170 1,125 Updated Mar 14, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 7,239 559 Updated Feb 26, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 2,793 366 Updated Mar 14, 2025

thu-ml / STAIR

Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"

Python 26 1 Updated Feb 26, 2025

stanford-oval / storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 23,383 2,024 Updated Jan 23, 2025

bytedance / UI-TARS

2,833 171 Updated Feb 17, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,249 255 Updated Mar 1, 2025

deepseek-ai / DeepSeek-V3

Python 92,138 14,936 Updated Feb 24, 2025

mapo-t2i / mapo

Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).

Python 71 8 Updated Jun 11, 2024

google-research-datasets / richhf-18k

RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with the file name of the associated labeled images (no urls or im…

121 4 Updated Jun 25, 2024

MoonshotAI / Kimi-k1.5

3,209 193 Updated Mar 7, 2025

deepseek-ai / DeepSeek-R1

86,349 11,137 Updated Feb 24, 2025

NVIDIA / NeMo-Aligner

Scalable toolkit for efficient model alignment

Python 742 90 Updated Mar 14, 2025

Xiao9905 / Xiao9905

4 Updated Feb 4, 2025

THUDM / VisionReward

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 174 3 Updated Feb 17, 2025

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 82,149 12,066 Updated Mar 14, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,630 550 Updated Mar 14, 2025