Skip to content

Pinned Loading

  1. understand-r1-zero understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    Python 1.1k 51

  2. zero-bubble-pipeline-parallelism zero-bubble-pipeline-parallelism Public

    Forked from NVIDIA/Megatron-LM

    Zero Bubble Pipeline Parallelism

    Python 428 29

  3. lorahub lorahub Public

    [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

    Python 653 40

  4. EditAnything EditAnything Public

    Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

    Python 3.4k 201

  5. oat oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

    Python 472 32

  6. stde stde Public

    Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024

    Python 121 7

Repositories

Showing 10 of 93 repositories
  • feedback-conditional-policy Public

    Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"

    sail-sg/feedback-conditional-policy’s past year of commit activity
    Python 31 0 0 0 Updated Sep 29, 2025
  • variational-reasoning Public

    Code for "Variational Reasoning for Language Models"

    sail-sg/variational-reasoning’s past year of commit activity
    Python 37 0 0 0 Updated Sep 29, 2025
  • jrystal Public

    A JAX-based Differentiable Density Functional Theory Framework for Materials

    sail-sg/jrystal’s past year of commit activity
    Python 35 Apache-2.0 2 5 0 Updated Sep 27, 2025
  • sail-sg/LifelongSafetyAlignment’s past year of commit activity
    Python 10 0 1 0 Updated Sep 20, 2025
  • autofd Public

    Automatic Functional Differentiation in JAX

    sail-sg/autofd’s past year of commit activity
    Python 77 Apache-2.0 1 6 0 Updated Sep 18, 2025
  • oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

    sail-sg/oat’s past year of commit activity
    Python 472 Apache-2.0 32 1 1 Updated Sep 15, 2025
  • BanditSpec Public
    sail-sg/BanditSpec’s past year of commit activity
    Python 0 0 0 0 Updated Sep 2, 2025
  • understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    sail-sg/understand-r1-zero’s past year of commit activity
    Python 1,099 MIT 51 7 0 Updated Aug 27, 2025
  • SkyLadder Public Forked from jzhang38/TinyLlama

    The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling

    sail-sg/SkyLadder’s past year of commit activity
    Python 34 Apache-2.0 574 0 0 Updated Aug 25, 2025
  • sail-sg/Video-Next-Event-Prediction’s past year of commit activity
    Python 19 MIT 1 3 0 Updated Aug 9, 2025