Update tokenizer #3061

lvhan028 · 2025-01-21T06:39:56Z

remove never used tokenizer "SentencePieceTokenizer"
Let async_engine instead of inference engines init tokenizer
converter places the tokenizer-related files under the root directory of workspace rather than the weight directory

AllentDan · 2025-01-21T07:30:30Z

Unit tests failed.

lmdeploy/pytorch/engine/engine.py

AllentDan

Does it mean our convert.py should deprecate meta_llama models?

lvhan028 · 2025-01-22T12:15:30Z

Does it mean our convert.py should deprecate meta_llama models?

Yes, you are right

AllentDan

LGTM

lmdeploy/turbomind/chat.py

* main: (90 commits) Fix cogvlm and phi3vision (InternLM#3137) support release pipeline (InternLM#3069) [ci] fix some fail in daily testcase (InternLM#3134) Fix internvl2.5 error after eviction (InternLM#3122) fix UT of deepseek chat template (InternLM#3125) Update benchmark script and user guide (InternLM#3110) bump version to v0.7.0.post3 (InternLM#3115) fix postional argument (InternLM#3086) remove logitswarper (InternLM#3109) [Fix] fix the URL judgment problem in Windows (InternLM#3103) fix user guide about cogvlm deployment (InternLM#3088) add option max-concurrent-requests for api_server(InternLM#2961) bump version to v0.7.0.post2 (InternLM#3094) Fix xcomposer2d5 (InternLM#3087) Add system role to deepseek chat template (InternLM#3031) Update tokenizer (InternLM#3061) Add deepseek-r1 chat template (InternLM#3072) bump version to v0.7.0.post1 (InternLM#3076) More arguments in api_client, update docstrings (InternLM#3077) fix sliding window mgr (InternLM#3068) ... # Conflicts: # lmdeploy/turbomind/turbomind.py

lvhan028 added 4 commits January 20, 2025 17:36

change tokenizer path in converter

9ccc469

update

8f3cb0f

remove sentencepiece tokenizer

d26e81a

merge main

71f6d4c

lvhan028 added the improvement label Jan 21, 2025

lvhan028 requested review from lzhangzz and AllentDan January 21, 2025 06:43

fix

71e3a45

lvhan028 requested a review from grimoire January 21, 2025 13:58

fix

0a7ac4b

grimoire reviewed Jan 22, 2025

View reviewed changes

lmdeploy/pytorch/engine/engine.py Show resolved Hide resolved

grimoire approved these changes Jan 22, 2025

View reviewed changes

AllentDan reviewed Jan 22, 2025

View reviewed changes

remove meta_llama.py

623927f

AllentDan approved these changes Jan 23, 2025

View reviewed changes

irexyc reviewed Jan 23, 2025

View reviewed changes

lmdeploy/turbomind/chat.py Show resolved Hide resolved

fix

a02a616

lzhangzz approved these changes Jan 23, 2025

View reviewed changes

lvhan028 merged commit 26622b8 into InternLM:main Jan 27, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update tokenizer #3061

Update tokenizer #3061

lvhan028 commented Jan 21, 2025

AllentDan commented Jan 21, 2025

AllentDan left a comment

lvhan028 commented Jan 22, 2025

AllentDan left a comment

Update tokenizer #3061

Update tokenizer #3061

Conversation

lvhan028 commented Jan 21, 2025

AllentDan commented Jan 21, 2025

AllentDan left a comment

Choose a reason for hiding this comment

lvhan028 commented Jan 22, 2025

AllentDan left a comment

Choose a reason for hiding this comment