Your current environment <div class="snippet-clipboard-content notranslate posit

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

My PR <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-

[Bug]: OpenAI LogProbs format for Chat-Completion is incorrect about vllm HOT 5 CLOSED

br3no commented on September 26, 2024

[Bug]: OpenAI LogProbs format for Chat-Completion is incorrect

from vllm.

Comments (5)

br3no commented on September 26, 2024

@DarkLight1337 you were quicker by one hour, but you still have failing tests, so I win 😜

from vllm.

DarkLight1337 commented on September 26, 2024

It's not a competition xD We can combine our solutions in your PR if need be.

from vllm.

br3no commented on September 26, 2024

I know! I was being (not so) funny.

from vllm.

DarkLight1337 commented on September 26, 2024

I have updated my PR with more test cases. From my understanding, the behaviour of disabling logprobs and specifying zero top logprobs should be distinct. In particular:

Disabling logprobs should return no logprobs at all:
- Completions API: Input logprobs=None should result in output top_logprobs==None
- Chat Completions API: Input logprobs=False should result in output len(top_logprobs)==0
Specifying zero top logprobs should return the logprob for the output token only for Completions API (if it exists):
- Completions API: Input logprobs=0 should result in output len(top_logprobs)<=1
- Chat Completions API: Input logprobs=True,top_logprobs=0 should result in output ~~len(top_logprobs)<=k~~ len(top_logprobs)==0
Specifying k top logprobs should return the top k items, plus the logprob for the output token only for Completions API (if it exists):
- Completions API: Input logprobs=k should result in output len(top_logprobs)<=k+1
- Chat Completions API: Input logprobs=True,top_logprobs=k should result in output ~~len(top_logprobs)<=k+1~~ len(top_logprobs)==k

Edit: Thanks @br3no for the correction!

from vllm.

DarkLight1337 commented on September 26, 2024

My PR #5026 now passes all tests as well.

from vllm.

[Bug]: OpenAI LogProbs format for Chat-Completion is incorrect about vllm HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent