Comments (6)
The API server's log output shows no errors at all.
from chinese-llama-alpaca-2.
After running overnight, GPU memory usage dropped back down to 17 GB.
Roughly once every 50-100 calls I hit the following error: torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 3.51 GiB (GPU 0; 23.65 GiB total capacity; 17.10 GiB already allocated; 2.61 GiB free; 20.28 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CON
For a GPU like the 4090 running a model like Chinese-Alpaca-2 7B, is this level of GPU memory usage reasonable?
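The error text above suggests setting max_split_size_mb to reduce fragmentation. A minimal sketch of one way to do that follows, using PyTorch's `PYTORCH_CUDA_ALLOC_CONF` environment variable. The helper name `configure_cuda_allocator` and the value 128 are illustrative assumptions, not settings taken from this thread, and the variable has to be set before PyTorch initializes CUDA (in practice, before the server script imports `torch`).

```python
import os


def configure_cuda_allocator(max_split_size_mb: int = 128) -> str:
    """Set the allocator option named in the OOM message (hypothetical helper).

    128 is only an illustrative starting value; a suitable size depends on
    the workload and should be found by experiment.
    """
    value = f"max_split_size_mb:{max_split_size_mb}"
    os.environ["PYTORCH_CUDA_ALLOC_CONF"] = value
    return value


# Call this at the very top of the server script, before importing torch,
# so the caching allocator reads the setting when CUDA is first initialized.
configure_cuda_allocator()
```

The same setting can be exported in the shell that launches the API server instead. It only addresses fragmentation (reserved memory much larger than allocated memory) and will not help if the request genuinely needs more memory than the card has free.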
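The model weights alone take up about 14 GB, so 17 GB of GPU memory usage is reasonable. The growth in memory usage while the server is running is most likely because the cache from each inference is not being cleared; try clearing the previous records and other leftover state after every inference. The script is meant as a guiding example and is not necessarily complete, so modifications and improvements are welcome.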
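The cleanup suggested above could be sketched roughly as follows. This is not the project's own code: `generate_with_cleanup` is a hypothetical wrapper, and `generate_fn` stands in for whatever function the API server calls to produce a reply. It assumes the leftover memory comes from unreferenced tensors and from blocks cached by PyTorch's allocator.

```python
import gc

try:
    import torch  # optional here so the sketch still runs without PyTorch
except ImportError:
    torch = None


def generate_with_cleanup(generate_fn, *args, **kwargs):
    """Run one inference call, then release memory that is no longer in use."""
    try:
        return generate_fn(*args, **kwargs)
    finally:
        # Collect unreachable Python objects, e.g. tensors kept alive only
        # by reference cycles from the finished request.
        gc.collect()
        # Hand cached but unused blocks back to the GPU driver.
        if torch is not None and torch.cuda.is_available():
            torch.cuda.empty_cache()


# Usage with a stand-in function; a real server would pass its own
# generation function and arguments here.
reply = generate_with_cleanup(lambda prompt: prompt.upper(), "hello")
```

Note that `torch.cuda.empty_cache()` cannot free tensors that are still referenced, so any per-request history or intermediate outputs the server keeps in lists or globals would need to be dropped first.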
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your consideration.
Closing the issue, since no updates observed. Feel free to re-open if you need any further assistance.