[maca] support deepseekv2 for maca backend. #2918

Reinerzhou · 2024-12-18T02:40:47Z

Modification

add DlinferYarnRotaryEmbedding and add softmax_scale para to support deepseekv2 for maca backend.

…oad_state_dict * commit 'f6f7a5d707e3ccbc69af10babf1c9afcaf72a402': fix deepseekv2 has no attribute use_mla error (InternLM#3188) fix blocked fp8 moe (InternLM#3181) [Feature] support deepseek-vl2 for pytorch engine (InternLM#3149) make turbomind support gpu embedding inputs (InternLM#3177) fix temperature=0 (InternLM#3176) Update qwen2.py (InternLM#3174) Fix tool call prompt for InternLM and Qwen (InternLM#3156) Use pad_token_id as image_token_id for vl models (InternLM#3158) fix default temperature value (InternLM#3166) fix min length penalty (InternLM#3150) update cuda runtime package dependencies (InternLM#3142) fix typing (InternLM#3153) support deepseekv2 for maca backend. (InternLM#2918) fix the issue that stop_token may be less than defined in model.py (InternLM#3148) [fix] fix vl gradio, use pipeline api and remove interactive chat (InternLM#3136) [feature] add dlinfer w8a8 support. (InternLM#2988) Use aiohttp inside proxy server && add --disable-cache-status argument (InternLM#3020) support eos_token list in turbomind (InternLM#3044)

lvhan028 added the enhancement New feature or request label Dec 18, 2024

lvhan028 requested a review from jinminxi104 December 18, 2024 04:19

jinminxi104 marked this pull request as draft December 18, 2024 08:00

Reinerzhou force-pushed the zhousl/maca_dev branch 2 times, most recently from c4e740e to 5c3f925 Compare January 27, 2025 05:56

support deepseekv2 for maca backend.

e6dd548

Reinerzhou force-pushed the zhousl/maca_dev branch from 5c3f925 to e6dd548 Compare February 12, 2025 03:16

jinminxi104 approved these changes Feb 19, 2025

View reviewed changes

jinminxi104 marked this pull request as ready for review February 19, 2025 03:03

jinminxi104 requested review from grimoire and lvhan028 February 19, 2025 03:03

grimoire approved these changes Feb 19, 2025

View reviewed changes

lvhan028 merged commit 33236e9 into InternLM:main Feb 19, 2025
5 checks passed

Reinerzhou deleted the zhousl/maca_dev branch February 20, 2025 08:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[maca] support deepseekv2 for maca backend. #2918

[maca] support deepseekv2 for maca backend. #2918

Reinerzhou commented Dec 18, 2024

[maca] support deepseekv2 for maca backend. #2918

[maca] support deepseekv2 for maca backend. #2918

Conversation

Reinerzhou commented Dec 18, 2024

Modification