Comments (4)
If I want to use 4 consumer-grade graphics cards, can I do it this way?
just change --device "0,1,2,3"
python -u text2sql.py \
--batch_size 6 \
--gradient_descent_step 16 \
--device "0,1,2,3" \
--learning_rate 5e-5 \
--epochs 128 \
--seed 42 \
--save_path "./models/text2sql-t5-3b" \
--tensorboard_save_path "./tensorboard_log/text2sql-t5-3b" \
--model_name_or_path "t5-3b" \
--use_adafactor \
--mode train \
--train_filepath "./data/preprocessed_data/resdsql_train_spider.json"
However, it seems that this flag I saw in the code is the GPU's ID.
I would like to ask if there is a way to perform multi-card training and inference. thank you.
from resdsql.
There is currently no distributed version.
I've been busy lately, so maybe you can follow Huggingface Accelerate (https://huggingface.co/docs/accelerate/index) and implement the distributed code yourself. It's simple and easy to use.
from resdsql.
ok. thank you for your answer. i will try the t5-small to represent it.
from resdsql.
I have made a PR about dataParallel, which can accelerate the training speed. However, for running the larger model, you may have to use model parallel, which i will create a PR lately
from resdsql.
Related Issues (20)
- Execuse me. What happened to paper CodeS? Isn't this article open source before? HOT 9
- Low training metrics HOT 14
- Support for Historical Conversation in RESDSQL HOT 4
- Question about evaluation scripts HOT 2
- 请问推理方法 HOT 2
- 最低支持的GPU内存是多少,我怎么跑不起来。
- Dev result file?
- 部分带有别名的sql在经过normalization处理后出现错误 HOT 2
- Inference script not working HOT 5
- CoSQL HOT 1
- 训练Cross-Encoder的时候为什么24G的显存还不够用? HOT 1
- 关于RESDSQL在BIRD上的运行时间 HOT 2
- Training cross-coder error HOT 1
- xlm_roberta_text2natsql_schema_item_classifier HOT 3
- Evaluation detail on CSpider HOT 1
- 你好,请问如何将自己的数据集处理成CSpider的形式? HOT 3
- 你好,请问如何SQL2NatSQL?我想用自己的数据集跑text2NatSQL的方法。 HOT 2
- 请问模型训练有多gpu并行支持吗 HOT 1
- Can the ranking-filter successfully choose all the right schema items? HOT 1
- 为什么我使用对bird训练的classifier时出现了truncated_dataset.json文件,而且陷入了循环无法结束运行 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from resdsql.