Comments (6)
In my case, on Linux, when I try to use the GPU I see this in the logs:
ERROR Could not load engine: Could not load library "/home/jose/.config/Jan/data/extensions/@janhq/inference-cortex-extension/dist/bin/linux-cuda-12-0/engines/cortex.llamacpp/libengine.so"
libcudart.so.12: cannot open shared object file: No such file or directory - server.cc:299
@milen-prg Which previous version did it work on for you?
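The `libcudart.so.12` line in the log above means the dynamic loader could not open the CUDA 12 runtime library that `libengine.so` depends on. A minimal sketch for checking this outside of Jan, assuming a Linux system with Python 3; the helper name `can_load` is hypothetical and not part of Jan:

```python
import ctypes


def can_load(library_name: str) -> bool:
    """Return True if the dynamic loader can open the named shared library."""
    try:
        ctypes.CDLL(library_name)
        return True
    except OSError:
        # Raised when the library (or one of its dependencies) cannot be opened.
        return False


if __name__ == "__main__":
    # libcudart.so.12 is the library named in the error message.
    name = "libcudart.so.12"
    status = "found" if can_load(name) else "NOT found by the dynamic loader"
    print(f"{name}: {status}")
```

If this reports that the library is not found, the CUDA 12 runtime is probably either not installed or not on the loader's search path (for example via `LD_LIBRARY_PATH`).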
from jan.
@josepgl, it worked on v0.5.3 (there was another problem then: the models had disappeared, but I easily got help with that here).
Where can I see the logs on Windows?
from jan.
@milen-prg The data path is shown in Settings > Advanced Settings > Jan Data Folder. On Linux, I see a logs folder there.
from jan.
There is a logs folder, but it is empty.
from jan.
> There is a logs folder, but it is empty.

We only store your logs for 24 hours.
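To check whether any log files are still present, a small sketch like the following could list what is in the logs folder. It assumes only what this thread states (a `logs` folder inside the Jan data folder, with roughly 24 hours of retention); the function name `recent_logs` is hypothetical, and the data folder path must be taken from Settings > Advanced Settings > Jan Data Folder:

```python
import sys
import time
from pathlib import Path


def recent_logs(data_folder: str, max_age_hours: float = 24.0) -> list[Path]:
    """Return files in <data_folder>/logs modified within max_age_hours, newest first."""
    logs_dir = Path(data_folder) / "logs"
    if not logs_dir.is_dir():
        return []
    cutoff = time.time() - max_age_hours * 3600
    files = [
        p for p in logs_dir.iterdir()
        if p.is_file() and p.stat().st_mtime >= cutoff
    ]
    return sorted(files, key=lambda p: p.stat().st_mtime, reverse=True)


if __name__ == "__main__":
    # Pass the Jan data folder as the first argument; defaults to the current directory.
    folder = sys.argv[1] if len(sys.argv) > 1 else "."
    for path in recent_logs(folder):
        print(path)
```

An empty result is consistent with either no recent activity or logs that have already been cleaned up.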
from jan.
This is a known issue. Downloading a Hugging Face model from the search box in the model hub is pretty broken for now: it doesn't retrieve the GGUF model's metadata correctly. We've filed an issue on this and are working on a fix in the engine: #3558
In the meantime, please add an `ngl` setting to the settings section to enable GPU acceleration. It worked before because previous versions hardcoded an `ngl` setting, which is hacky and not correct for all models.
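As an illustration of the workaround, here is a minimal sketch of adding an `ngl` value to a settings section. It assumes the model configuration is a JSON object with a top-level `settings` object; the helper name `with_ngl`, the example fields, and the value `33` are hypothetical. `ngl` is the number of model layers offloaded to the GPU (llama.cpp's `--n-gpu-layers`), so the right value depends on the model and the available VRAM:

```python
import copy
import json


def with_ngl(model_config: dict, ngl: int) -> dict:
    """Return a copy of model_config with settings["ngl"] set to ngl."""
    updated = copy.deepcopy(model_config)
    # Create the settings section if it is missing, then set the GPU layer count.
    updated.setdefault("settings", {})["ngl"] = ngl
    return updated


if __name__ == "__main__":
    # Hypothetical, trimmed-down model configuration for illustration only.
    config = {"id": "example-model", "settings": {"ctx_len": 4096}}
    print(json.dumps(with_ngl(config, 33), indent=2))
```

The same change can be made by hand in the model's settings; the sketch only shows where the key would go under the stated assumption.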
from jan.