Comments (5)
@M-Fannilla this one isn't related to the model checkpoint but to the packages you have installed. It looks like flash-attn wasn't installed properly or has a dependency issue. Try uninstalling flash-attn and loading the model again.
If that doesn't work, feel free to open a new issue :)
from transformers.
@M-Fannilla hi, I just updated the llava weights on the Hub, which caused the error. I will revert the changes soon.
@zucchini-nlp Great, Thanks!
@M-Fannilla should be working now, closing the issue as resolved!
There is a new issue:
Failed to import transformers.models.llama.modeling_llama because of the following error (look up to see its traceback):
/usr/local/lib/python3.10/dist-packages/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops5zeros4callEN3c108ArrayRefINS2_6SymIntEEENS2_8optionalINS2_10ScalarTypeEEENS6_INS2_6LayoutEEENS6_INS2_6DeviceEEENS6_IbEE
I did not have this one before.
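An undefined symbol inside `flash_attn_2_cuda` like the one above often means the compiled flash-attn extension no longer matches the installed PyTorch build, though that is an assumption here and not something confirmed in this thread. A minimal diagnostic sketch is shown below: it tries to import a module and reports the installed version together with any import error. The helper name `check_extension` is hypothetical, and `flash-attn` is assumed to be the pip distribution name.

```python
import importlib
from importlib import metadata


def check_extension(module_name: str, dist_name: str) -> dict:
    """Report the installed version of a package and whether it imports cleanly.

    `check_extension` is a hypothetical helper for illustration only.
    """
    # Look up the installed distribution version, if any.
    try:
        version = metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        version = None

    # Try the import; a broken compiled extension (e.g. an undefined symbol)
    # surfaces here as an exception instead of crashing later during model load.
    try:
        importlib.import_module(module_name)
        error = None
    except Exception as err:
        error = repr(err)

    return {"module": module_name, "version": version, "error": error}
```

Calling `check_extension("torch", "torch")` and `check_extension("flash_attn", "flash-attn")` side by side shows both versions and whether flash-attn itself fails to import. If it does, uninstalling flash-attn as suggested above is the first thing to try.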