Add system role to deepseek chat template #3031

AllentDan · 2025-01-15T02:34:15Z

No description provided.

lvhan028 · 2025-01-15T03:27:50Z

model = MODELS.get('deepseek')()
messages = [{
    'role': 'system',
    'content': 'you are a helpful assistant'
}, {
    'role': 'user',
    'content': 'who are you'
}, {
    'role': 'assistant',
    'content': 'I am an AI'
}, {
    'role': 'user',
    'content': 'hi'
}]
from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained(
    'deepseek-ai/DeepSeek-V2-Lite', trust_remote_code=True)
ref = tokenizer.apply_chat_template(messages, tokenize=False)
res = '<｜begin▁of▁sentence｜>' + model.messages2prompt(messages)
assert res.startswith(ref)

It tests failed using the above case.

AllentDan · 2025-01-15T03:42:25Z

But in real testing, 'who are you' should be responsed with ' I am an AI'

lvhan028 · 2025-01-15T06:54:24Z

Can we add UTs to cover ti?

RunningLeon

LGTM

lvhan028 · 2025-01-20T04:21:07Z

[:-1] should be removed

Conflicts: lmdeploy/model.py

lmdeploy/model.py

zhyncs · 2025-01-22T12:47:37Z

tests/test_lmdeploy/test_model.py

+        'content': 'hi'
+    }]
+    from transformers import AutoTokenizer
+    tokenizer = AutoTokenizer.from_pretrained('deepseek-ai/DeepSeek-V2-Lite', trust_remote_code=True)


How about adding test cases for DeepSeek V3 and DeepSeek R1 here?

Done in #3072

lmdeploy/model.py

* main: (90 commits) Fix cogvlm and phi3vision (InternLM#3137) support release pipeline (InternLM#3069) [ci] fix some fail in daily testcase (InternLM#3134) Fix internvl2.5 error after eviction (InternLM#3122) fix UT of deepseek chat template (InternLM#3125) Update benchmark script and user guide (InternLM#3110) bump version to v0.7.0.post3 (InternLM#3115) fix postional argument (InternLM#3086) remove logitswarper (InternLM#3109) [Fix] fix the URL judgment problem in Windows (InternLM#3103) fix user guide about cogvlm deployment (InternLM#3088) add option max-concurrent-requests for api_server(InternLM#2961) bump version to v0.7.0.post2 (InternLM#3094) Fix xcomposer2d5 (InternLM#3087) Add system role to deepseek chat template (InternLM#3031) Update tokenizer (InternLM#3061) Add deepseek-r1 chat template (InternLM#3072) bump version to v0.7.0.post1 (InternLM#3076) More arguments in api_client, update docstrings (InternLM#3077) fix sliding window mgr (InternLM#3068) ... # Conflicts: # lmdeploy/turbomind/turbomind.py

remove the space following assistant

cd2272a

lvhan028 self-requested a review January 15, 2025 03:08

lvhan028 added the Bug:P2 label Jan 15, 2025

add deepseek system role and keep the same with HF

3e44d37

AllentDan changed the title ~~remove the space following assistant~~ Add system role to deepseek chat template Jan 15, 2025

lvhan028 requested a review from RunningLeon January 15, 2025 06:53

update the unit test

650b94d

RunningLeon approved these changes Jan 16, 2025

View reviewed changes

AllentDan added 4 commits January 21, 2025 10:59

remove [:-1]

74a3aa4

fix UT but not aligned with HF

34850f7

Merge branch 'main' into fix-chat-template

e8a64d1

Conflicts: lmdeploy/model.py

fix

8f0a470

lvhan028 reviewed Jan 22, 2025

View reviewed changes

lmdeploy/model.py Outdated Show resolved Hide resolved

lvhan028 reviewed Jan 22, 2025

View reviewed changes

lmdeploy/model.py Outdated Show resolved Hide resolved

zhyncs reviewed Jan 22, 2025

View reviewed changes

lvhan028 reviewed Jan 22, 2025

View reviewed changes

lmdeploy/model.py Outdated Show resolved Hide resolved

fix

818932f

lvhan028 approved these changes Jan 27, 2025

View reviewed changes

lvhan028 merged commit 31adcf9 into InternLM:main Jan 27, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add system role to deepseek chat template #3031

Add system role to deepseek chat template #3031

AllentDan commented Jan 15, 2025

lvhan028 commented Jan 15, 2025

AllentDan commented Jan 15, 2025 •

edited

Loading

lvhan028 commented Jan 15, 2025

RunningLeon left a comment

lvhan028 commented Jan 20, 2025

zhyncs Jan 22, 2025

AllentDan Jan 23, 2025

Add system role to deepseek chat template #3031

Add system role to deepseek chat template #3031

Conversation

AllentDan commented Jan 15, 2025

lvhan028 commented Jan 15, 2025

AllentDan commented Jan 15, 2025 • edited Loading

lvhan028 commented Jan 15, 2025

RunningLeon left a comment

Choose a reason for hiding this comment

lvhan028 commented Jan 20, 2025

zhyncs Jan 22, 2025

Choose a reason for hiding this comment

AllentDan Jan 23, 2025

Choose a reason for hiding this comment

AllentDan commented Jan 15, 2025 •

edited

Loading