Pinned Loading
Repositories
Showing 10 of 93 repositories
- feedback-conditional-policy Public
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
sail-sg/feedback-conditional-policy’s past year of commit activity - LifelongSafetyAlignment Public
sail-sg/LifelongSafetyAlignment’s past year of commit activity - BanditSpec Public
sail-sg/BanditSpec’s past year of commit activity - Video-Next-Event-Prediction Public
sail-sg/Video-Next-Event-Prediction’s past year of commit activity