Comments (4)
I suggest you contact your support team. You are using a custom built pytorch, which we can answer very limited questions about.
from vllm.
Thanks, will do. But does anyone have rough idea what might have caused this?
from vllm.
Same issue here
Appears right after fresh Jetson AGX flashing
here some sw info
jetson_release
Model: NVIDIA Jetson AGX Orin Developer Kit - Jetpack 6.0 [L4T 36.3.0]
NV Power Mode[0]: MAXN
Hardware:
- Module: NVIDIA Jetson AGX Orin
Platform:
- Distribution: Ubuntu 22.04 Jammy Jellyfish
- Release: 5.15.136-tegra
jtop:
- Version: 4.2.8
- Service: Active
Libraries:
- CUDA: 12.2.140
- cuDNN: 8.9.4.25
- TensorRT: 8.6.2.3
- VPI: 3.1.5
- Vulkan: 1.3.204
- OpenCV: 4.10.0-dev - with CUDA: YES
Python 3.10.12 [GCC 11.4.0] on linux
>>> import torch
>>> print(torch.version.cuda)
12.2
from vllm.
Same problem here. Using print(torch.__version__)
, I found that current vllm tries to use PyTorch version 2.3.0 in setup.py . However, my currently installed version is 2.1.0a0+41361538.nv23.06, which is the latest supported by my Jetpack 5.1.2. According to the NVIDIA documentation, PyTorch 2.3.0 is not supported on my setup.
To work around this, I attempted to install vllm-0.2.4, which requires PyTorch only 2.1.0. However, I encountered an OSError: CUDA_HOME environment variable is not set. Please set it to your CUDA install root.
error, even though I have already added the correct CUDA_HOME to my .bashrc
from vllm.
Related Issues (20)
- [Bug]: Model "talking to itself" and ignoring `<|im_end|>` HOT 13
- [New Model]: Florence-2
- Virtual Office Hours: July 9 and July 25
- [Bug]: Illegal memory access for MoE kernel with large workloads HOT 1
- [Bug]: "work_use_ray" not work anymore in the latest version HOT 2
- [Usage]: can I save log to a file? HOT 1
- [Usage]: 是否可以多节点多CPU推理 HOT 1
- [Feature]: Way to using LLM's last hidden state embedding vector
- [Bug]: Can't support Phi-3-medium-* models with more than 2 GPUs
- [Bug]: Chunked prefill vs. non-chunked output is different for a long prompt HOT 1
- Gemma2 models from google HOT 3
- [Bug]: qwen1.5-32b-chat no response HOT 1
- [Feature]: `/info` endpoint for OpenAI-compatible API Server HOT 1
- [Bug]: vLLM crash when running Phi-3-small-8k-instruct with enable-chunked-prefill HOT 3
- [Bug]: Phi-3 vision crash: TypeError: only integer tensors of a single element can be converted to an index HOT 2
- [Bug]: New bug in last few days for phi-3-vision. The model's max seq len (131072) is larger than the maximum number of tokens that can be stored in KV cache (50944) HOT 5
- [Bug]: AttributeError: 'NoneType' object has no attribute 'prefill_metadata' HOT 1
- [Bug]: TypeError: FlashAttentionMetadata.__init__() missing 10 required positional arguments HOT 5
- about the RotaryEmbedding
- [New Model]: support for BartForSequenceClassification
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vllm.