[Usage]: If I want to run a 34B model like yi-34B-chat, how can I use multiple GPUs? I only have A100 40G

hellostronger commented on May 27, 2024

If I want to run a 34B model such as yi-34B-chat, how can I use multiple GPUs? I only have A100 40G GPUs.

Comments (5)

hellostronger commented on May 27, 2024

I have tried engine_use_ray = True in AsyncEngineArgs or CUDA_VISIBLE_DEVICES = 0,1, but it still does not work: GPU 0 runs out of memory (OOM).
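
For context on why a single card runs out of memory, here is a rough back-of-the-envelope estimate. It assumes about 34e9 parameters stored in 16-bit precision (2 bytes each) and counts only the weights; the KV cache, activations, and CUDA overhead are ignored, so real usage is higher. The function name weight_gb_per_gpu is made up for illustration.

```python
def weight_gb_per_gpu(num_params: float, bytes_per_param: int, num_gpus: int) -> float:
    """Weight memory per GPU in GB (1 GB = 1e9 bytes), assuming an even split."""
    return num_params * bytes_per_param / num_gpus / 1e9


NUM_PARAMS = 34e9  # assumption: roughly 34 billion parameters
FP16_BYTES = 2     # assumption: float16/bfloat16 weights
GPU_GB = 40        # A100 40G

for n in (1, 2, 4):
    per_gpu = weight_gb_per_gpu(NUM_PARAMS, FP16_BYTES, n)
    fits = per_gpu < GPU_GB
    print(f"{n} GPU(s): {per_gpu:.0f} GB of weights per GPU, fits in {GPU_GB} GB: {fits}")
```

Under these assumptions the weights alone exceed one 40 GB card, so the model has to be split across GPUs. Making both GPUs visible is not enough by itself if the engine still places the whole model on GPU 0.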

kir1to00 commented on May 27, 2024

Use tensor-parallel-size to shard the model across your GPUs.
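
A minimal sketch of that suggestion using vLLM's offline Python API, where the tensor_parallel_size argument of LLM sets how many GPUs the weights are sharded across. The model ID 01-ai/Yi-34B-Chat, the GPU count of 4, and the max_model_len and gpu_memory_utilization values are assumptions for illustration, and build_engine_kwargs is a hypothetical helper, not part of vLLM.

```python
def build_engine_kwargs(model: str, num_gpus: int, max_model_len: int = 4096) -> dict:
    """Collect keyword arguments for vllm.LLM (hypothetical helper)."""
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return {
        "model": model,
        "tensor_parallel_size": num_gpus,  # shard the model across this many GPUs
        "max_model_len": max_model_len,    # cap context length to limit KV-cache memory
        "gpu_memory_utilization": 0.90,    # fraction of each GPU that vLLM may use
    }


def main() -> None:
    # Imported here so the helper above can be used without vLLM installed.
    from vllm import LLM, SamplingParams

    # Assumed model ID and GPU count; adjust to your setup.
    llm = LLM(**build_engine_kwargs("01-ai/Yi-34B-Chat", num_gpus=4))
    outputs = llm.generate(["Hello, my name is"], SamplingParams(max_tokens=32))
    print(outputs[0].outputs[0].text)
```

Call main() on a machine with vLLM installed and the GPUs visible. When starting the server from the command line, the equivalent setting is the --tensor-parallel-size flag.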

hellostronger commented on May 27, 2024

Thanks, the GPU 0 OOM problem is solved, but a new problem has come up. It shows "ValueError: When using LoRA, vocab size must be 32000 >= vocab_size <= 33024". Has anybody loaded yi-34B-chat with LoRA successfully? I would appreciate any suggestions.

kir1to00 commented on May 27, 2024

Maybe use max-model-len?

jeejeelee commented on May 27, 2024

"ValueError: When using LoRA, vocab size must be 32000 >= vocab_size <= 33024" has already been resolved by #4015. You can build from source on the main branch or wait for the 0.4.1 release. With that fix, Yi-34B can load LoRA successfully.
