Comments (13)

robertgshaw2-neuralmagic commented on August 24, 2024

Specifically, you seem to be using the meta-llama/Meta-Llama-3-8B model in a chat context, so I suspect you are querying the /chat/completions API

This model does not have a chat template, so the OpenAI-compatible server falls back to a default template, which will not work well.

Please try with meta-llama/Meta-Llama-3-8B-Instruct

from vllm.

simon-mo commented on August 24, 2024

@robertgshaw2-neuralmagic does this ring a bell? any clue?

robertgshaw2-neuralmagic commented on August 24, 2024

@eamonnlambda - can you share some sample requests you are sending so I can debug?

njhill commented on August 24, 2024

@eamonnlambda what is the eos_token_id in the model's config.json, and in generation_config.json if it exists? The model will only stop if the <|im_end|> token is configured as the EOS token.

It looks like you're trying to do chat stuff with a non-chat-tuned model?
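
To check this locally, something like the following could help. It is only a sketch: it assumes the model files have already been downloaded to a local directory, and the helper name read_eos_token_ids is made up for illustration. It reads eos_token_id from config.json and generation_config.json, using only the Python standard library.

```python
import json
from pathlib import Path


def read_eos_token_ids(model_dir):
    """Return the eos_token_id found in each config file, keyed by file name.

    A file that is missing, or that has no eos_token_id key, maps to None.
    (Hypothetical helper, not part of vLLM or transformers.)
    """
    found = {}
    for name in ("config.json", "generation_config.json"):
        path = Path(model_dir) / name
        if path.is_file():
            with path.open(encoding="utf-8") as fh:
                found[name] = json.load(fh).get("eos_token_id")
        else:
            found[name] = None
    return found


# Example usage, with a placeholder path to a local model snapshot:
# print(read_eos_token_ids("/path/to/local/model"))
```

If the id reported here does not correspond to the token that the chat template uses to end a turn, generation would be expected to run until the token limit.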

robertgshaw2-neuralmagic commented on August 24, 2024

So I think what is going on is that you are sending chat requests and we apply the default chat template (which I think uses im_end), so the model responds as if it were in a few-shot scenario.
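
To illustrate why a base model treats this as few-shot text, here is a rough sketch of how a ChatML-style template flattens a message list into a single prompt. The exact default template is an assumption (this thread only says it probably uses im_end), and render_chatml is a made-up helper, not a vLLM function.

```python
def render_chatml(messages, add_generation_prompt=True):
    """Flatten chat messages into one ChatML-style prompt string.

    Assumed layout: each turn is wrapped as
    <|im_start|>{role}\n{content}<|im_end|>\n
    """
    parts = []
    for message in messages:
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Open an assistant turn for the model to complete.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Who won the world series in 2020?"},
]
print(render_chatml(messages))
```

A model that was never trained on these markers sees them as ordinary text to continue, and it has no reason to emit <|im_end|> to finish its turn.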

eamonnlambda commented on August 24, 2024

Hi!

I appreciate all the responses here. I just tried the -Instruct model, and it appears to be working better. However, I will note, I originally saw this issue using the exact examples provided in the documentation.

Using facebook/opt-125m, I receive the output:

It's not a problem, people will just be logging out of the lobby on their phone so you can still use it.\n> You'll still be able to access the lobby when you're in the lobby, so you can still use it  ...  ...  ...  ......  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ... ...  ...  ...  ...  ...  ...  ...  ...  ...  ...   ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ... ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  ...  
...\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\nGame\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\nGame\"\n\n\"Game\"\n\nGame\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\nGame\"\n\nGame\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\nGame\"\n\n\"Game\"\n\n\"Game\"\n\nGame\"\n\n\"Game\"\n\nGame\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\n\"Game\"\n\nGame\"\n\n\"Game\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\n\nGame\"\n\nGame\"\n\n\"Game\"\n\nGame\"\n\n\"Game\"\n\n\"Game\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\n\"Game\"\n\nGame\"\n\nGame\"\n\n\"Game\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\n\"Game\"\n\n\"Game\"\n\nGame\"\n\n\"Game\"\n\nGame\"\n\nGame\"\n\n\"Game\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\n\"Game\"\n\nGame\"\n\nGame\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\
nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\nGame\"\n\n

Now, as much as I like games, I would expect a result slightly different from the above for the question being asked. Would it then make sense to modify the documentation to use a model that does not produce spurious output?

Thanks again for everyone dropping in!

robertgshaw2-neuralmagic commented on August 24, 2024

Can you post the command you are running so that I can re-create?

robertgshaw2-neuralmagic commented on August 24, 2024

We can only do as well as the model! opt-125m is not a good model!

robertgshaw2-neuralmagic commented on August 24, 2024

Which documentation are you referring to?

eamonnlambda commented on August 24, 2024

Certainly!

python -m vllm.entrypoints.openai.api_server --model facebook/opt-125m
curl --location --request POST 'localhost:8000/v1/chat/completions' \
--header 'Content-Type: application/json' \
--data-raw '{
    "model": "facebook/opt-125m",
    "messages": [
        {
            "role": "system",
            "content": "You are a helpful assistant."
        },
        {
            "role": "user",
            "content": "Who won the world series in 2020?"
        }
    ]
}'

It should be the same as in the Quickstart page: https://docs.vllm.ai/en/stable/getting_started/quickstart.html#using-openai-chat-api-with-vllm
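
For anyone reproducing this from Python instead of curl, the same request could be built roughly as follows. This is a sketch using only the standard library; build_chat_request is a made-up helper name, and the base URL assumes the server started by the command above is listening on localhost:8000.

```python
import json
import urllib.request


def build_chat_request(model, question, base_url="http://localhost:8000"):
    """Build (but do not send) the POST request from the curl example above."""
    payload = {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": question},
        ],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_chat_request(
    "facebook/opt-125m", "Who won the world series in 2020?"
)
# To actually send it, with the server running:
# with urllib.request.urlopen(request) as response:
#     print(json.load(response))
```

Swapping the model argument for meta-llama/Meta-Llama-3-8B-Instruct (with the server started on that model) gives the request suggested earlier in this thread.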

robertgshaw2-neuralmagic commented on August 24, 2024

Thanks - I'm going to swap out the model here. It's best to use llama-3-instruct for the samples, as opt gives garbage in the chat setup.

eamonnlambda commented on August 24, 2024

Perfect. This can be safely closed, then. I really appreciate all the help!

In terms of opt-125m not being a good model... I'm not going to disagree with you there!

robertgshaw2-neuralmagic commented on August 24, 2024

Sounds good - let me know if you need any other help.
