Conversation

@mmathew23
Collaborator

Compilation adds 20-30 seconds to inference for Sesame CSM on the T4 and doesn't give much of a speedup, so we can turn it off for now.
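For a rough sense of the tradeoff, here is a minimal break-even sketch. The function name `break_even_calls` and the per-call timings are hypothetical and were not measured in this PR; only the 20-30 second compile overhead comes from the comment above (25 seconds is used as a midpoint).

```python
import math


def break_even_calls(compile_overhead_s: float, eager_s: float, compiled_s: float) -> float:
    """Return how many inference calls are needed before compilation pays off.

    All arguments are in seconds. Returns math.inf when the compiled path
    is not faster than the eager path, since the overhead is never recovered.
    """
    saved_per_call = eager_s - compiled_s
    if saved_per_call <= 0:
        return math.inf
    return compile_overhead_s / saved_per_call


# Hypothetical timings: 10.0 s eager vs 9.5 s compiled per generation,
# with 25 s of one-time compile overhead.
calls_needed = break_even_calls(25.0, 10.0, 9.5)
print(calls_needed)
```

With a small per-call saving like the assumed one, the one-time overhead takes many generations to recover, which is the kind of case where leaving compilation off is reasonable for short sessions.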

@danielhanchen merged commit b347ec5 into unslothai:main on Jul 2, 2025
