Fix Mistral, Qwen #1565

danielhanchen · 2025-01-20T09:26:50Z

No description provided.

* Update granite.py: grab residual multiplier directly from layer
* Update llama.py: version should read >= 4.47.1, as that is the version requiring the changes
* Update granite.py
* Update llama.py
---------
Co-authored-by: Daniel Han <danielhanchen@gmail.com>
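The version threshold mentioned in this commit body can be illustrated with a small comparison helper. This is a minimal sketch and not the code in llama.py: the function names `parse_release` and `at_least` are hypothetical, and only the 4.47.1 threshold comes from the commit message. It compares numeric release segments rather than raw strings, because a plain string comparison would order "4.100.0" before "4.47.1".

```python
import re


def parse_release(version: str) -> tuple[int, ...]:
    """Return the leading numeric release segment, e.g. '4.47.1.dev0' -> (4, 47, 1).

    Hypothetical helper: any pre-release or local suffix is ignored.
    """
    match = re.match(r"\d+(?:\.\d+)*", version.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in match.group().split("."))


def at_least(installed: str, required: str) -> bool:
    """True if `installed` is greater than or equal to `required`, compared numerically."""
    a, b = parse_release(installed), parse_release(required)
    # Pad the shorter tuple with zeros so '4.47' compares as '4.47.0'.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


if __name__ == "__main__":
    # Assumed threshold taken from the commit message above.
    for candidate in ("4.46.3", "4.47", "4.47.1", "4.100.0"):
        print(candidate, at_least(candidate, "4.47.1"))
```

In practice, `packaging.version.Version` is the usual choice for this kind of comparison, since it also orders pre-releases correctly; the stdlib-only version here just keeps the sketch self-contained.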

* support modelscope
* change modelscope args
* remove useless import
* remove useless import
* fix
* wip
* fix
* remove useless code
* add readme
* add some comments
* change print to raise error
* update comment
* Update loader.py
---------
Co-authored-by: Daniel Han <danielhanchen@gmail.com>

tajimagrp and others added 30 commits November 26, 2024 20:20

use exact model name

12f97c8

Update save.py

c4cb50b

Update _utils.py

75e4756

Update _utils.py

e86b18f

Update _utils.py

f565ccf

Update _utils.py

c5d0aa9

print

af7d6cc

Update _utils.py

281cb73

Update _utils.py

b60acda

Update llama.py

855d0f8

Update _utils.py

fe4e9b8

Update vision.py

48161a2

Update _utils.py

52b2451

Update _utils.py

8d39e73

Update _utils.py

a7e5803

Update _utils.py

5038ba7

Update _utils.py

0882287

Update _utils.py

ab71dce

Update _utils.py

dd054c3

Update _utils.py

6c80d0f

Update loader.py

ea8e8a2

accurate_accumulation

33ed089

Update loader.py

c3b41b8

Update loader.py

142f026

Update _utils.py

eecab40

Update loader.py

8cec2fa

Update loader.py

c68007c

Update loader.py

5495311

Update loader.py

ea2c647

Update pyproject.toml

f1da2a6

danielhanchen and others added 29 commits January 7, 2025 03:33

Update llama.py

0cb9c5f

Update llama.py

e3a92e0

Update granite to work with latest post_patch methods (#1502)

422c033

* Update granite to work with latest post_patch methods
* Pass position_embeddings for granite even if transformers<4.47
* Update llama.py
---------
Co-authored-by: Daniel Han <danielhanchen@gmail.com>

Merge branch 'main' into nightly

63ad366

Phi 4

a7d7838

Merge branch 'main' into nightly

62f074c

Merge branch 'main' into nightly

8d28389

Update llama.py

2ced650

Merge branch 'main' into nightly

a76953a

Torch.Cuda Is Available Condition and Warning (#1545)

dd9b4e1

* Check whether torch.cuda and triton are available (on my machine, a Mac M3, CUDA was not available)
* Update pyproject.toml
* Update __init__.py
---------
Co-authored-by: Daniel Han <danielhanchen@gmail.com>
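The kind of availability check this commit describes might look roughly like the following. This is a sketch under stated assumptions, not the code added to `__init__.py`: the function name `detect_accelerator_support` and the warning text are hypothetical. It only relies on `importlib.util.find_spec` and `torch.cuda.is_available()`, and it reports missing pieces instead of failing on machines such as Apple Silicon Macs where CUDA is absent.

```python
import importlib.util
import warnings


def detect_accelerator_support() -> dict[str, bool]:
    """Report whether torch, CUDA, and triton appear usable on this machine.

    Hypothetical helper for illustration only.
    """
    has_torch = importlib.util.find_spec("torch") is not None
    has_triton = importlib.util.find_spec("triton") is not None
    cuda_ok = False
    if has_torch:
        import torch  # imported only when the package is present

        cuda_ok = bool(torch.cuda.is_available())
    return {"torch": has_torch, "cuda": cuda_ok, "triton": has_triton}


if __name__ == "__main__":
    support = detect_accelerator_support()
    missing = [name for name, ok in support.items() if not ok]
    if missing:
        # Warn rather than raise, so importing on a CPU-only machine still works.
        warnings.warn("Unavailable on this machine: " + ", ".join(missing))
    print(support)
```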

Update mistral.py

bc37b7a

Update mistral.py

2e7a886

Update _utils.py

15e6036

Update _utils.py

0b6bb12

Update _utils.py

76403f9

Update _utils.py

3c4ef99

Update _utils.py

b4c0b02

Fix

24a24bf

Bug fixes

a953bfc

Update mapper.py

e6d677b

Add dropout to granite to match HF's implementation (#1557)

d8d8bdc

Signed-off-by: datta0 <venkatadattasainimmaturi@gmail.com>

Merge branch 'nightly' of https://github.com/unslothai/unsloth into nightly

aa53ed4

Update llama.py

f42d0e9

Merge branch 'main' into nightly

a2b55ef

Update llama.py

b667bc6

Bug fixes

1ce40ce

fix: flash_attn_detection_error (#1556)

cdb3259

* fix: flash_attn_detection_error
* Update _utils.py
---------
Co-authored-by: Daniel Han <danielhanchen@gmail.com>

danielhanchen merged commit d8c58fb into main Jan 20, 2025
1 check passed

11 participants
