Sea AI Lab

understand-r1-zero Public

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1.1k 51

zero-bubble-pipeline-parallelism Public

Zero Bubble Pipeline Parallelism

Python 428 29

lorahub Public

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 653 40

EditAnything Public

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Python 3.4k 201

oat Public

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 472 32

stde Public

Official implementation of Stochastic Taylor Derivative Estimator (STDE), NeurIPS 2024

Python 121 7
