Comments (9)
Hi @MYaoBQ,
This may be due to a bug in IPEX that occurs when both an iGPU and a dGPU are available, especially when the dGPU is not recognized as 'xpu:0'.
To work around this and run on the Arc A770, try disabling UHD Graphics 770 on your machine first and then run again :)
You can do that through:
Device Manager (设备管理器) -> Display adapters (显示适配器) -> UHD Graphics 770 -> right click (右击) -> Disable device (禁用设备)
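To see which index each GPU gets, a quick check like the one below may help. It is only a sketch: it assumes IPEX is installed so that torch.xpu is available, the helper names list_xpu_devices and find_arc_index are made up for illustration, and matching on the substring "arc" in the device name is a guess about how the A770 reports itself.

```python
def find_arc_index(device_names):
    """Return the index of the first device whose name mentions 'Arc', or None.

    Matching on the substring 'arc' is an assumption about the reported name.
    """
    for index, name in enumerate(device_names):
        if "arc" in name.lower():
            return index
    return None


def list_xpu_devices():
    """Return the names of all XPU devices visible to PyTorch (needs IPEX)."""
    import torch
    import intel_extension_for_pytorch  # noqa: F401  (registers the 'xpu' backend)

    return [torch.xpu.get_device_name(i) for i in range(torch.xpu.device_count())]


# Usage on a machine with IPEX installed:
# names = list_xpu_devices()
# print(names, "Arc index:", find_arc_index(names))
```

If the Arc card does not show up at index 0 while the iGPU is enabled, that matches the situation described above.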
Please let us know if you run into any further problems :)
from ipex-llm.
Hi @MYaoBQ,
There are a couple of things you could try:
- Restart the machine after disabling iGPU
- No need to set ONEAPI_DEVICE_SELECTOR any more, as only one GPU is available :)
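If the variable is still set from an earlier session, it can be dropped from inside Python before anything GPU-related is imported. This is a minimal sketch; clear_device_selector is a hypothetical helper name, and it only affects the current process and the child processes it starts, not the system-wide setting.

```python
import os


def clear_device_selector(environ=None):
    """Remove ONEAPI_DEVICE_SELECTOR from the given mapping (default: os.environ).

    Returns the previous value, or None if the variable was not set.
    """
    if environ is None:
        environ = os.environ
    return environ.pop("ONEAPI_DEVICE_SELECTOR", None)


# Usage: call this before importing torch / IPEX.
# previous = clear_device_selector()
```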
from ipex-llm.
Hi @MYaoBQ,
We have recently fixed this "communication error" (通信错误); please update to our latest code and try again :)
Besides, for Arc A770, here are the recommended configurations:
set USE_XETLA=OFF
set SYCL_PI_LEVEL_ZERO_USE_IMMEDIATE_COMMANDLISTS=1
set SYCL_CACHE_PERSISTENT=1
set BIGDL_QUANTIZE_KV_CACHE=1
set BIGDL_LLM_XMX_DISABLED=1
set no_proxy=localhost,127.0.0.1
set BIGDL_IMPORT_IPEX=0
python startup.py -a
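For anyone who prefers to launch from a script rather than typing the set commands each time, the same settings can be passed to a child process from Python. This is just a sketch: the values are copied from the list above, build_env and launch are made-up helper names, and it assumes startup.py is in the current working directory.

```python
import os
import subprocess
import sys

# Values copied from the recommended Arc A770 settings above.
ARC_A770_SETTINGS = {
    "USE_XETLA": "OFF",
    "SYCL_PI_LEVEL_ZERO_USE_IMMEDIATE_COMMANDLISTS": "1",
    "SYCL_CACHE_PERSISTENT": "1",
    "BIGDL_QUANTIZE_KV_CACHE": "1",
    "BIGDL_LLM_XMX_DISABLED": "1",
    "no_proxy": "localhost,127.0.0.1",
    "BIGDL_IMPORT_IPEX": "0",
}


def build_env(base=None):
    """Return a copy of the base environment with the Arc settings applied."""
    env = dict(os.environ if base is None else base)
    env.update(ARC_A770_SETTINGS)
    return env


def launch(script="startup.py"):
    """Run `python <script> -a` with the settings applied; return its exit code."""
    result = subprocess.run([sys.executable, script, "-a"], env=build_env(), check=False)
    return result.returncode


# Usage: exit_code = launch()
```

Passing the variables through env keeps them scoped to the launched process, so nothing is left behind in the shell afterwards.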
Please let us know if you run into any further problems :)
from ipex-llm.
Based on your reply, I tried it on the Arc dGPU but got an error.
set BIGDL_IMPORT_IPEX=0 means that IPEX won't be imported, right?
from ipex-llm.
Hi @violet17,
We have now updated our Windows guide for Arc A-Series; we recommend following our guide for Langchain-chatchat deployment on Windows with Arc A-Series.
Besides, set BIGDL_IMPORT_IPEX=0 means that ipex is not imported automatically; instead, we import it manually in our Langchain-chatchat support code.
Please let us know if you run into any further problems :)
from ipex-llm.
Hi @Oscilloscope98, thank you for your reply. I fixed the "could not create a primitive" error by disabling the iGPU when using the dGPU for inference, instead of using set BIGDL_IMPORT_IPEX=0.
from ipex-llm.
Hi @violet17,
I'm glad you solved the problem :) Just to clarify, set BIGDL_IMPORT_IPEX=0 is not meant to resolve the "could not create a primitive" issue. It is related to this issue.
Please let me know if you run into any further problems :)
from ipex-llm.