Comments (2)
Some useful resources from the docs:
- On Static Cache and compile for faster generation (not all models support compile though): https://huggingface.co/docs/transformers/v4.44.0/en/llm_optims#static-kv-cache-and-torchcompile
- General docs on cache classes: https://huggingface.co/docs/transformers/v4.44.0/kv_cache
- Static Cache docs: https://huggingface.co/docs/transformers/v4.44.0/en/internal/generation_utils#transformers.StaticCache
from transformers.
Related Issues (20)
- Community contribution: Adding GGUF support for more architectures HOT 7
- LlavaNextProcessor bug in `_get_unpadded_features` HOT 1
- latest 44.4.2 doesn't support falcon_mamba HOT 1
- Add Log-Softmax Temperature Option to transformers.Seq2SeqTrainingArguments for CE Loss HOT 1
- prepare_fa2_from_position_ids error in training with batch_size > 1
- Custom pipeline in remote repo cannot load custom model from remote repo. HOT 5
- Qwen2-VL from_config broken? HOT 2
- TypeError: MistralForSequenceClassification.forward() got an unexpected keyword argument 'token_type_ids'
- transformers 4.44.2 doesn't work with torch.compile and torch.export on T5 generate() HOT 3
- 'DepthEstimationPipeline' object has no attribute 'image_size' when num_workers > 0 HOT 1
- Qwen2-VL Doesn't Execute on TPUs
- oom when using adafactor optimizer in deepspeed HOT 1
- Is it possible to make wasm support all models in huggingface?
- "Qwen2-VL FP16 inference results in errors or gibberish output." HOT 3
- Mask2FormerImageProcessor - fails to process multichannel image HOT 1
- Decoder and cross-attention shape is different when obtained by model.generate() and model() HOT 2
- Out-of-Index Error when training by `Qwen2VLFlashAttention2` HOT 1
- tokenizer `save_pretrained` can not handle non-string value in dtype HOT 1
- [Help] Correct Way to do Simple Model Constraints HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from transformers.