Fix `UnslothTrainingArguments` not patching `trl.Config` properly #2873

Erland366 · 2025-07-03T19:07:00Z

Few updates ago, we patch trl.Config especially in the bf16 and fp16 args to bypass the Ampere GPU error -> https://github.com/unslothai/unsloth/blob/main/unsloth/models/rl.py#L235-L267

But the patching is not propagate to the inheritance class that is using dataclass. Therefore, we should inherit it using Python Class instead to propagate the patching.

This fixes training error on Mistral CPT

…uments import

…te in constructor

Erland366 added 4 commits July 3, 2025 14:08

Always use SFTConfig

a2fa52f

Refactor UnslothTrainingArguments to support fallback for TrainingArg…

18c73a4

…uments import

Refactor UnslothTrainingArguments to initialize embedding_learning_ra…

fcb0b94

…te in constructor

Initialize parent class in UnslothTrainingArguments constructor

3bad31a

Erland366 requested a review from danielhanchen July 3, 2025 19:08

shimmyshimmer merged commit 51a7023 into unslothai:main Jul 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix `UnslothTrainingArguments` not patching `trl.Config` properly #2873

Fix `UnslothTrainingArguments` not patching `trl.Config` properly #2873

Uh oh!

Erland366 commented Jul 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Fix UnslothTrainingArguments not patching trl.Config properly #2873

Fix UnslothTrainingArguments not patching trl.Config properly #2873

Uh oh!

Conversation

Erland366 commented Jul 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix `UnslothTrainingArguments` not patching `trl.Config` properly #2873

Fix `UnslothTrainingArguments` not patching `trl.Config` properly #2873