Comments (4)
Yes, you're right about the cause! Rather than trying to merge a proper chat template for Blenderbot (which is very obsolete by now), I'll just rewrite the doc to use a different model.
from transformers.
@Rocketknight1 I'm getting the same error when I try to use some models like Gemma. I can try to use the chat template parameter, but I'm not sure what the format is for the Gemma model (I can look it up in tokenizer_config.json, right?). Is manually setting the template now what we have to do whenever we get this error, for models that don't accept "role": "system"? What would be the workaround?
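To answer the parenthetical: yes, when a model ships a chat template, it is stored under the "chat_template" key of tokenizer_config.json in the model repo, so you can look it up there with nothing but the standard library. A minimal sketch (the config snippet below is a trimmed, hypothetical example, not the real Gemma file):

```python
# Sketch: read the chat template out of a tokenizer_config.json.
# NOTE: this config is a trimmed, hypothetical example; download the real
# file from the model repo on the Hub to see the actual template.
import json

config_text = """
{
  "tokenizer_class": "GemmaTokenizer",
  "chat_template": "{% for m in messages %}<start_of_turn>{{ m['role'] }} {{ m['content'] }}<end_of_turn>{% endfor %}"
}
"""

config = json.loads(config_text)
template = config.get("chat_template")  # None for base models without one
print(template)
```

Base models typically have no "chat_template" key at all, which is exactly why `apply_chat_template` errors out on them.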
Hi @PhilipAmadasun, the most likely cause is that you're loading the base Gemma models, like gemma-2-2b, instead of the models that are "instruction tuned" for chat, like gemma-2-2b-it. The base models are just simple language models and don't support chat, and therefore don't have a chat template. If you use a model trained for chat, it should work!
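For anyone hitting this, here is a rough sketch of the kind of string a chat-capable tokenizer's `apply_chat_template` builds for a Gemma-style model, including the rejection of the "system" role mentioned above. The turn markers are an assumption based on Gemma's published chat format; the authoritative version is the "chat_template" entry in the model's tokenizer_config.json.

```python
# Minimal sketch of what tokenizer.apply_chat_template produces for an
# instruction-tuned Gemma model. Turn markers are an assumption based on
# Gemma's published chat format, not pulled from the real template.

def gemma_style_prompt(messages, add_generation_prompt=True):
    prompt = ""
    for message in messages:
        role = message["role"]
        if role == "system":
            # Gemma's template has no system role; a common workaround is
            # to fold the system text into the first user turn instead.
            raise ValueError("Gemma chat templates do not accept a system role")
        # Gemma names the assistant turn "model".
        turn = "model" if role == "assistant" else role
        prompt += f"<start_of_turn>{turn}\n{message['content']}<end_of_turn>\n"
    if add_generation_prompt:
        # Open a model turn so generation continues from here.
        prompt += "<start_of_turn>model\n"
    return prompt

messages = [{"role": "user", "content": "Hi!"}]
print(gemma_style_prompt(messages))
```

With a real tokenizer you would call `tokenizer.apply_chat_template(messages, add_generation_prompt=True)` instead; the sketch only illustrates the shape of the output and why a "system" message fails.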
Also @NielsRogge, the fix has now been merged.