Comments (2)
can you try #5473 ? it should fix your error i think.
from vllm.
can you try #5473 ? it should fix your error i think.
I pulled the latest vllm code and tried to install it
but it encountered some problems during installation
if I use pip install -e .
,it will get stuck here:
Looking in indexes: https://mirrors.aliyun.com/pypi/simple
Obtaining file:///workspace/huj11%40xiaopeng.com/code/vllm
Installing build dependencies ... |
if I use pip install --editable ./ --no-build-isolation
, it will get stuck here:
Looking in indexes: https://mirrors.aliyun.com/pypi/simple
Obtaining file:///workspace/huj11%40xiaopeng.com/code/vllm
Checking if build backend supports build_editable ... done
Preparing editable metadata (pyproject.toml) ... done
Requirement already satisfied: cmake>=3.21 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (3.29.5)
Requirement already satisfied: ninja in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (1.11.1.1)
Requirement already satisfied: psutil in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (5.9.7)
Requirement already satisfied: sentencepiece in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (0.1.99)
Requirement already satisfied: numpy in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (1.26.3)
Requirement already satisfied: requests in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (2.31.0)
Requirement already satisfied: py-cpuinfo in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (9.0.0)
Requirement already satisfied: transformers>=4.40.0 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (4.41.2)
Requirement already satisfied: tokenizers>=0.19.1 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (0.19.1)
Requirement already satisfied: fastapi in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (0.109.0)
Requirement already satisfied: openai in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (1.33.0)
Requirement already satisfied: uvicorn[standard] in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (0.27.0.post1)
Requirement already satisfied: pydantic>=2.0 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (2.5.3)
Requirement already satisfied: prometheus-client>=0.18.0 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (0.19.0)
Requirement already satisfied: prometheus-fastapi-instrumentator>=7.0.0 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (7.0.0)
Requirement already satisfied: tiktoken==0.6.0 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (0.6.0)
Requirement already satisfied: lm-format-enforcer==0.10.1 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (0.10.1)
Requirement already satisfied: outlines==0.0.34 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (0.0.34)
Requirement already satisfied: typing-extensions in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (4.9.0)
Requirement already satisfied: filelock>=3.10.4 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (3.13.1)
Requirement already satisfied: ray>=2.9 in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (2.9.1)
Requirement already satisfied: nvidia-ml-py in /opt/conda/lib/python3.10/site-packages (from vllm==0.4.2) (12.555.43)
Collecting vllm-nccl-cu12<2.19,>=2.18 (from vllm==0.4.2)
Downloading https://mirrors.aliyun.com/pypi/packages/41/07/c1be8f4ffdc257646dda26470b803487150c732aa5c9f532dd789f186a54/vllm_nccl_cu12-2.18.1.0.4.0.tar.gz (6.2 kB)
from vllm.
Related Issues (20)
- [Performance]: How use vllm.attention.ops.triton_flash_attention replace flash_attn package HOT 1
- [Bug]: Performance : very slow inference for Mixtral 8x7B Instruct FP8 on H100 with 0.5.0 and 0.5.0.post1 HOT 2
- [Bug]: CUDA illegal memory access error when `enable_prefix_caching=True` HOT 4
- [Bug]: Vllm 0.3.0 got weired output
- [Feature]: LoRA support for Mixtral GPTQ and AWQ HOT 1
- [Feature]: asymmetric tensor parallel
- [Bug]: prefix-caching: inconsistent completions HOT 1
- [Bug]: Distribute Tests PR test fails
- [Bug]: llava-v1.6-mistral-7b-hf prompt template handling error HOT 3
- [Bug]: RuntimeError: CUDA error: no kernel image is available for execution on the device HOT 1
- [Bug]: OOM when setting prompt_logprobs=1 HOT 3
- [RFC]: Refactor Worker and ModelRunner to consolidate control plane communication HOT 10
- [Bug]: Using tensor-parallel-size 4 fails for some models with pyo3_runtime.PanicException: The global thread pool has not been initialized.: ThreadPoolBuildError {"Resource temporarily unavailable" })
- [RFC]: Implement KV cache transferring mechanism in vLLM HOT 2
- [Performance] [Speculative decoding] Speed up autoregressive proposal methods by making sampler CPU serialization optional HOT 2
- [Bug]: Speculative decoding server: `ValueError: could not broadcast input array from shape (513,) into shape (512,)` HOT 11
- [Bug]: Regression in LoRA Adapter loading speed between vllm 0.4.3 and 0.5.0 HOT 3
- [RFC]: Branch based version control, and development version
- [Usage]: how to use marlin kernel for GPTQ model HOT 9
- [Performance]:
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vllm.