i'm using NUC with win 11 on ARC 770 (driver version: 31.0.101.5379) and UHD Graphics

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-ho

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-ho

LangChain-Chatchat shows RuntimeError: could not create a primitive about ipex-llm HOT 9 OPEN

MYaoBQ commented on June 9, 2024

LangChain-Chatchat shows RuntimeError: could not create a primitive

from ipex-llm.

Comments (9)

Oscilloscope98 commented on June 9, 2024

Hi @MYaoBQ,

This may due to a bug from IPEX when iGPU and dGPU are both available, and especially when dGPU is not recognized as 'xpu:0'.

To solve this problem and run on ARC A770, you may try to disable UHD Graphics 770 on your machine first and have a try again :)

You could do that through

Device manager (设备管理器) -> Display adapter (显示适配器) -> UHD Graphics 770 - right click (右击) -> Disable device (禁用设备)

Please let us know for any further problems :)

from ipex-llm.

MYaoBQ commented on June 9, 2024

Yuwen, Just tried after disable the iGPU (UHD Graphics 770), but it hangs on “converting the current model to sym_int4…” like forever, I have to kill the process by using Ctl+C. any advice? Thanks, ***@***.*** [A close up of a sign Description automatically generated] Michael Yao Technical Sales Specialist for Enterprise Client | Sales and Marketing M +8613501999436 ***@***.******@***.***> Intel Corporation | intel.com<http://intel.com/> From: Yuwen Hu ***@***.***> Sent: Monday, April 1, 2024 2:47 PM To: intel-analytics/ipex-llm ***@***.***> Cc: Yao, Michael BQ ***@***.***>; Mention ***@***.***> Subject: Re: [intel-analytics/ipex-llm] LangChain-Chatchat shows RuntimeError: could not create a primitive (Issue #10605) Hi @MYaoBQ<https://github.com/MYaoBQ>, This may due to a bug from IPEX when iGPU and dGPU are both available, and especially when dGPU is not recognized as 'xpu:0'. To solve this bug and run on ARC A770, you may try to disable UHD Graphics 770 on your machine first and have a try again :) You could do that through Device manager (设备管理器) -> Display adapter (显示适配器) -> UHD Graphics 770 -> Disable device (禁用设备) and have a try again? :) — Reply to this email directly, view it on GitHub<#10605 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/BHPRSHJFBPMKFU73ST7XLLTY3D7GRAVCNFSM6AAAAABFQ5UT5OVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDAMRZGI3DAMZXGE>. You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>

from ipex-llm.

Oscilloscope98 commented on June 9, 2024

Hi @MYaoBQ,

There are several things you could have a try:

Restart the machine after disabling iGPU
No need to set ONEAPI_DEVICE_SELECTOR any more as only one GPU is available :)

from ipex-llm.

MYaoBQ commented on June 9, 2024

Thanks, it works after disabling the iGPU and reboot. When testing RAG, sometimes it shows below error message “API 通信错误”, please see below example. No issue when ask “what’s ArrowLake”. But error pop up when ask “what is ARL”. What’s the reason behind? Both “Arrow Lake” and “ARL” are in the documents that uploaded. Thanks. ***@***.*** [A close up of a sign Description automatically generated] Michael Yao Technical Sales Specialist for Enterprise Client | Sales and Marketing M +8613501999436 ***@***.******@***.***> Intel Corporation | intel.com<http://intel.com/> From: Yuwen Hu ***@***.***> Sent: Monday, April 1, 2024 6:23 PM To: intel-analytics/ipex-llm ***@***.***> Cc: Yao, Michael BQ ***@***.***>; Mention ***@***.***> Subject: Re: [intel-analytics/ipex-llm] LangChain-Chatchat shows RuntimeError: could not create a primitive (Issue #10605) Hi @MYaoBQ<https://github.com/MYaoBQ>, There are several things you could have a try: 1. Restart the machine after disabling iGPU 2. No need to set ONEAPI_DEVICE_SELECTOR any more as only one GPU is available :) — Reply to this email directly, view it on GitHub<#10605 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/BHPRSHI6CFVTVTYZBKCGRLDY3EYPLAVCNFSM6AAAAABFQ5UT5OVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDAMRZGUZTQOJRGI>. You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>

from ipex-llm.

Oscilloscope98 commented on June 9, 2024

Hi @MYaoBQ ,

We have recently fixed this 通信错误 error, please kindly update to our latest code and have a try again :)

Besides, for Arc A770, here are the recommended configuations:

set USE_XETLA=OFF
set SYCL_PI_LEVEL_ZERO_USE_IMMEDIATE_COMMANDLISTS=1
set SYCL_CACHE_PERSISTENT=1
set BIGDL_QUANTIZE_KV_CACHE=1
set BIGDL_LLM_XMX_DISABLED=1

set no_proxy=localhost,127.0.0.1
set BIGDL_IMPORT_IPEX=0

python startup.py -a

Please let us know for any further problems :)

from ipex-llm.

violet17 commented on June 9, 2024

Hi @MYaoBQ ,

We have recently fixed this 通信错误 error, please kindly update to our latest code and have a try again :)

Besides, for Arc A770, here are the recommended configuations:
set USE_XETLA=OFF
set SYCL_PI_LEVEL_ZERO_USE_IMMEDIATE_COMMANDLISTS=1
set SYCL_CACHE_PERSISTENT=1
set BIGDL_QUANTIZE_KV_CACHE=1
set BIGDL_LLM_XMX_DISABLED=1

set no_proxy=localhost,127.0.0.1
set BIGDL_IMPORT_IPEX=0

python startup.py -a
Please let us know for any further problems :)

Base on your reply, I try it on Arc dGPU, but get error.
set BIGDL_IMPORT_IPEX=0 means that IPEX won't be imported, right?

from ipex-llm.

Oscilloscope98 commented on June 9, 2024

Hi @violet17,

We have currently updated our Windows guide on Arc A-Series, it is recommended to follow our guide for Langchain-chatchat deployment on WIndows Arc A-Series.

Besides, set BIGDL_IMPORT_IPEX=0 means not automatically importing ipex, but we manually did that in our Langchain-chatchat support code.

Please let us know for any further problems :)

from ipex-llm.

violet17 commented on June 9, 2024

Hi @Oscilloscope98 , thank you for your reply. I fix could not create a primitive with disable iGPU when using dGPU for inference instead of using set BIGDL_IMPORT_IPEX=0.

from ipex-llm.

Oscilloscope98 commented on June 9, 2024

Hi @Oscilloscope98 , thank you for your reply. I fix could not create a primitive with disable iGPU when using dGPU for inference instead of using set BIGDL_IMPORT_IPEX=0.

Hi @violet17,

I'm glad you solved the problem :) Just some clarification that set BIGDL_IMPORT_IPEX=0 is not for resolving the could not create a primitive issue. It is related to this issue.

Please let me know for any further problems :)

from ipex-llm.

LangChain-Chatchat shows RuntimeError: could not create a primitive about ipex-llm HOT 9 OPEN

Comments (9)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent