Comments (3)
Thanks for reporting this bug @axsaucedo ! We have a fix in #162 that should solve the issue :)
Regarding your question:
For completeness I am getting the notice The model 'ORTModelForCausalLM' is not supported for text-generation. - is that expected or does that suggest my transformers/hugging-face library is also not compatible with the latest live branch of optimum?
This is indeed expected because we're wrapping the pipeline()
function from transformers
which is only aware of the model classes in that library. We will have a look at providing a better warning for users, so thanks for flagging this!
We'll also look at adding support to set the target device in the transformers.pipeline()
function by adding a ORTModel.to()
method :)
from optimum.
Thank you for the fast turnaround @lewtun - I just tested it and works perfectly. Also thank you for the clarification on the warning, makes sense - I have added #163 for context.
I still cannot define device=X
in the pipeline parametres, however I can confirm that it runs automatically in GPU, so it's not necessary. Would it still be required to ensure that the ORT models support the to(...)
as well as other params like config
? It's not currently a blocker from our side but just wanted to highlight.
from optimum.
I still cannot define
device=X
in the pipeline parametres, however I can confirm that it runs automatically in GPU, so it's not necessary. Would it still be required to ensure that the ORT models support theto(...)
as well as other params likeconfig
? It's not currently a blocker from our side but just wanted to highlight.
Yes, currently the device
arg can't be set in the pipeline()
function for ORTModelForXxx
classes - it's on our TODO list and will hopefully be added soon :)
And thanks for opening a new issue about the warning - that really helps for tracking!
from optimum.
Related Issues (20)
- model.ByteSize() is negative when converting microsoft/phi2 model HOT 3
- Converted LayoutLM ONNX model - Required input `bbox` missing from input feed `['input_ids', 'attention_mask', 'token_type_ids']` HOT 3
- Mixtral quantization hard-freezes Python HOT 4
- Xenova mbart-large-50-many-to-many-mmt conversion is not translating. HOT 2
- Clarity on the convert.py for a model to ONNX.py.. documentation issue HOT 2
- TextStreamer not supported for ORTCausalLM? HOT 1
- Whisper-v3 ValueError: Transformers now supports natively BetterTransformer optimizations
- Bitsandbytes integration in ORTModelForCausalLM.from_pretrained()
- the onnx in optimum can not accelerate the speed of batch feature extraction. HOT 3
- task text-classification to be supported in the ONNX export for bart HOT 2
- Assistant model in generate method
- GPTQQuantizer hard coded the device to 0 HOT 2
- ONNX support for Mixtral text-classification HOT 1
- export `audio-classification` model based on OpenAI whisper model to other formats HOT 1
- KeyError: 'last_hidden_state' HOT 1
- Non-pinned Transformers Version Incompatability HOT 2
- ORTStableDiffusionXLPipeline sdxl onnx fp16 unet load error
- ORTModelForCausalLM inference fails (after converting transformer to ONNX) HOT 4
- optimum-cli export neuron failed for 'meta-llama/Llama-2-13b-hf' HOT 2
- Issue in Quantizer decoeder_model_merged.onnx of MT5 HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from optimum.