Comments (5)
I have the same error message with 4*A100. I tried both the latest version and 0.4.2; neither of them works with Phi-3-medium-128k-instruct. My env is built from the Dockerfile.
@UCASZ I think the issue is with the number of GPUs. Once I changed tensor parallel size from 4 to 2, the errors all went away.
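For anyone hitting the same thing, a minimal sketch of the kind of launch that worked for me; the model ID matches this thread, but the prompt and sampling values are placeholders:

```python
# Sketch, not a verified repro: offline vLLM engine for Phi-3-medium
# with tensor parallelism reduced from 4 to 2.
from vllm import LLM, SamplingParams

llm = LLM(
    model="microsoft/Phi-3-medium-128k-instruct",
    trust_remote_code=True,   # the Phi-3 repos ship a custom config class
    tensor_parallel_size=2,   # 4 triggered the error on my setup
)
outputs = llm.generate(["Hello!"], SamplingParams(max_tokens=32))
print(outputs[0].outputs[0].text)
```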
Phi-3-medium-* is supported, and I can confirm it's working for me with `vllm==0.4.2`. Medium has the same architecture as mini, `Phi3ForCausalLM`, vs. small, which for whatever reason has `Phi3SmallForCausalLM` as its architecture and isn't supported. And looking at your error, it doesn't look related to support for the model architecture.
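If you want to verify the architectures yourself, they are declared in each repo's config on the Hub. A quick sketch (repo IDs assumed from the HF hub; the Phi-3 repos needed `trust_remote_code` at the time):

```python
# Print the architecture each Phi-3 checkpoint declares in its config.
from transformers import AutoConfig

for repo in (
    "microsoft/Phi-3-mini-128k-instruct",
    "microsoft/Phi-3-medium-128k-instruct",
    "microsoft/Phi-3-small-128k-instruct",
):
    cfg = AutoConfig.from_pretrained(repo, trust_remote_code=True)
    print(repo, "->", cfg.architectures)
# Expected: mini and medium report ['Phi3ForCausalLM'] (supported),
# while small reports ['Phi3SmallForCausalLM'] (not supported by vLLM 0.4.2).
```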
Doesn't work for me. Using vllm-worker on Runpod.
Getting:
```
/usr/local/lib/python3.10/dist-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
  warnings.warn(
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
engine.py :105 2024-05-30 07:28:53,428 Error initializing vLLM engine: Failed to load the model config. If the model is a custom model not yet available in the HuggingFace transformers library, consider setting `trust_remote_code=True` in LLM or using the `--trust-remote-code` flag in the CLI.
Traceback (most recent call last):
  File "/vllm-installation/vllm/transformers_utils/config.py", line 30, in get_config
    config = AutoConfig.from_pretrained(
  File "/usr/local/lib/python3.10/dist-packages/transformers/models/auto/configuration_auto.py", line 931, in from_pretrained
    trust_remote_code = resolve_trust_remote_code(
  File "/usr/local/lib/python3.10/dist-packages/transformers/dynamic_module_utils.py", line 627, in resolve_trust_remote_code
    raise ValueError(
ValueError: Loading microsoft/Phi-3-medium-128k-instruct requires you to execute the configuration file in that repo on your local machine. Make sure you have read the code there to avoid malicious use, then set the option `trust_remote_code=True` to remove this error.

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/src/handler.py", line 6, in <module>
    vllm_engine = vLLMEngine()
  File "/src/engine.py", line 24, in __init__
    self.llm = self._initialize_llm() if engine is None else engine
  File "/src/engine.py", line 106, in _initialize_llm
    raise e
  File "/src/engine.py", line 103, in _initialize_llm
    return AsyncLLMEngine.from_engine_args(AsyncEngineArgs(**self.config))
  File "/vllm-installation/vllm/engine/async_llm_engine.py", line 622, in from_engine_args
    engine_configs = engine_args.create_engine_configs()
  File "/vllm-installation/vllm/engine/arg_utils.py", line 287, in create_engine_configs
    model_config = ModelConfig(
  File "/vllm-installation/vllm/config.py", line 111, in __init__
```
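The `ValueError` in the first traceback is the real failure: transformers refuses to execute the custom configuration code shipped in the Phi-3 repo unless remote code is trusted. With vLLM directly, the fix is exactly what the log suggests, e.g. (minimal sketch):

```python
# Allow transformers to run the Phi-3 repo's custom config code,
# as the error message itself asks for.
from vllm import LLM

llm = LLM(
    model="microsoft/Phi-3-medium-128k-instruct",
    trust_remote_code=True,  # without this, config loading raises ValueError
)
```

or the `--trust-remote-code` flag when launching the API server from the CLI. For the RunPod vllm-worker, the setting has to be forwarded through the worker's own configuration; I believe it is exposed as a `TRUST_REMOTE_CODE` environment variable, but check the worker's README to be sure.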
Related Issues (20)
- [Bug]: The openai deployment model takes twice as long to deploy as fastapi's approach to offline inference. HOT 1
- [Feature]: Linear adapter support for Mixtral
- [Feature]: vLLM support for function calling in Mistral-7B-Instruct-v0.3 HOT 1
- [Bug]: Issue with Token Processing Efficiency and Key-Value Cache Utilization in AsyncLLMEngine
- [Bug]: WSL2(Including Docker) 2 GPU problem --tensor-parallel-size 2 HOT 1
- [Bug]: Unable to Use Prefix Caching in AsyncLLMEngine HOT 10
- [Performance]: What can we learn from OctoAI HOT 7
- [Bug]: Model Launch Hangs with 16+ Ranks in vLLM HOT 2
- [Usage]: Prefix caching in VLLM HOT 1
- [Bug]: Incorrect Example for the Inference with Prefix
- [Feature]: BERT models for embeddings HOT 1
- [Bug]: The Offline Inference Embedding Example Fails HOT 5
- [Bug]: Offline Inference with the OpenAI Batch file format yields unnecessary `asyncio.exceptions.CancelledError` HOT 2
- [Feature]: MoE kernels (Mixtral-8x22B-Instruct-v0.1) are not yet supported on CPU only ?
- [Bug]: vLLM api_server.py when using with prompt_token_ids causes error.
- [Bug]: loading squeezellm model
- [Usage]: How can I deploy llama3-70b on a server with 8 3090 GPUs with lora and CUDA graph.
- [Usage]: how to use the gpu_cache_usage_perc as a custom metric in k8s HPA?
- [Bug]: Issues with Applying LoRA in vllm on a T4 GPU HOT 10