FP8 training enhancements #3496

Datta0 · 2025-10-23T09:21:30Z

For FP8 models, make it supported for non fast inference.
Also fix for when weight shape is not multiple of 8. Read here.

Also move hf quantizer patch to unsloth from unsloth zoo to make it run for non fast inference.

Patch Fbgemmfp8linear and fp8linear classes' forward methods here to make them work for compiled models (which don't use mamtul_lora explicitly)

Sample Qwen 2.5 VL 7B on FP8 GRPO :)

Datta0 added 3 commits October 23, 2025 07:01

Fix FP8 for models with non 8 multiple weights

78f7c79

patch fp8 forward methods for compiled models

c315cba

patch hf quantizer for fp8

87cff2e

Datta0 changed the title ~~Fix FP8 for models with non 8 multiple weights~~ FP8 training enhancements Oct 23, 2025

Datta0 mentioned this pull request Oct 23, 2025

FP8 training enhancements unslothai/unsloth-zoo#337

Merged

Datta0 added 2 commits October 27, 2025 05:20

Failsafe import of fbgemmfp8linear and fp8linear

37a019a

Beautify

0d61b24

danielhanchen merged commit fc178b5 into unslothai:main Oct 27, 2025

Provide feedback