Comments (6)
训练的是什么数据集呢?其他的超参数可以提供一下吗?
from paddleclas.
我也碰到了这个问题,自己的数据集,身份证、医保卡、银行卡等数据分类,在pytorch上面跑过,没有问题
用的MobileNetV3_small_x0_35.yaml,就改了跟数据集有关的地方,比如类别数这些,其他都是官方提供的
2020-07-27 17:35:21 INFO: epoch:4 train step:170 loss: 0.1664 top1: 0.9688 top5: 1.0000 lr: 2.299093 elapse: 0.881s
2020-07-27 17:35:28 INFO: epoch:4 train step:180 loss: 0.3008 top1: 0.9297 top5: 0.9922 lr: 2.312960 elapse: 0.652s
2020-07-27 17:35:34 INFO: epoch:4 train step:190 loss: 0.2220 top1: 0.9141 top5: 1.0000 lr: 2.326827 elapse: 0.759s
2020-07-27 17:35:42 INFO: epoch:4 train step:200 loss: 0.2745 top1: 0.9375 top5: 0.9922 lr: 2.340693 elapse: 0.737s
2020-07-27 17:35:48 INFO: epoch:4 train step:210 loss: 0.2979 top1: 0.9219 top5: 1.0000 lr: 2.354560 elapse: 0.692s
2020-07-27 17:35:55 INFO: epoch:4 train step:220 loss: 0.2368 top1: 0.9297 top5: 1.0000 lr: 2.368427 elapse: 0.548s
2020-07-27 17:36:01 INFO: epoch:4 train step:230 loss: 0.3466 top1: 0.8828 top5: 1.0000 lr: 2.382293 elapse: 0.666s
2020-07-27 17:36:08 INFO: epoch:4 train step:240 loss: 0.2592 top1: 0.9141 top5: 1.0000 lr: 2.396160 elapse: 0.689s
2020-07-27 17:36:15 INFO: epoch:4 train step:250 loss: 0.3722 top1: 0.9531 top5: 0.9844 lr: 2.410027 elapse: 0.595s
2020-07-27 17:36:21 INFO: epoch:4 train step:260 loss: 0.7002 top1: 0.8828 top5: 0.9844 lr: 2.423893 elapse: 0.698s
2020-07-27 17:36:28 INFO: epoch:4 train step:270 loss: 57.0000 top1: 0.1094 top5: 0.9219 lr: 2.437760 elapse: 0.651s
2020-07-27 17:36:34 INFO: epoch:4 train step:280 loss: nan top1: 0.0312 top5: 0.8984 lr: 2.451627 elapse: 0.699s
2020-07-27 17:36:41 INFO: epoch:4 train step:290 loss: nan top1: 0.0156 top5: 0.8906 lr: 2.465493 elapse: 0.621s
2020-07-27 17:36:47 INFO: epoch:4 train step:300 loss: nan top1: 0.0156 top5: 0.9531 lr: 2.479360 elapse: 0.494s
2020-07-27 17:36:54 INFO: epoch:4 train step:310 loss: nan top1: 0.0234 top5: 0.9297 lr: 2.493227 elapse: 0.715s
2020-07-27 17:37:01 INFO: epoch:4 train step:320 loss: nan top1: 0.0156 top5: 0.8750 lr: 2.507093 elapse: 0.684s
2020-07-27 17:37:08 INFO: epoch:4 train step:330 loss: nan top1: 0.0312 top5: 0.9141 lr: 2.520960 elapse: 0.723s
2020-07-27 17:37:14 INFO: epoch:4 train step:340 loss: nan top1: 0.0156 top5: 0.9219 lr: 2.534827 elapse: 0.704s
2020-07-27 17:37:21 INFO: epoch:4 train step:350 loss: nan top1: 0.0156 top5: 0.9453 lr: 2.548693 elapse: 0.582s
2020-07-27 17:37:28 INFO: epoch:4 train step:360 loss: nan top1: 0.0078 top5: 0.9062 lr: 2.562560 elapse: 0.596s
2020-07-27 17:37:35 INFO: epoch:4 train step:370 loss: nan top1: 0.0000 top5: 0.8750 lr: 2.576427 elapse: 1.042s
2020-07-27 17:37:37 INFO: END epoch:4 train loss_avg: nan top1_avg: 0.6615 top5_avg: 0.9625 elapse_sum: 250.617s
from paddleclas.
多次测试,nan100%会出现,并非偶然
from paddleclas.
可以提供完整的log文件吗?出nan的话,可以减小学习率试下
from paddleclas.
@littletomatodonkey 单卡和多卡,在CosineWarmup模式下,初始学习率分别怎么设置合理点?
from paddleclas.
降低学习率work
from paddleclas.
Related Issues (20)
- 训练SwinTransformer模型,loss不下降 HOT 9
- 二分类训练评估时如何输出,正样本的精确率、召回率以及f1-score HOT 1
- PaddleClas支持龙芯吗 HOT 1
- 关于特征提取后进行feature_normalize的疑问 HOT 3
- 您好,是否能提供MixFormer的源码? HOT 3
- 缺少PULC_table_attribute.md文档 HOT 2
- 在进行训练时如果自己真实数据样本没有翻转的情况,数据增强RandFlipImage是不是可以不加 HOT 1
- 图片分类训练时报错 HOT 1
- 关于tripletangularmarginloss.py中的负样本类距离loss计算absolut_loss_an HOT 13
- 请问 paddleClas适合会计票据的分类吗 HOT 2
- 使用Python命令索引库如何更新 HOT 4
- list index out of range的原因是?
- 使用PPLCNetV2_base_ShiTu模型,在GPU上运行加速效果不明显 HOT 1
- PaddleClas,图像识别部署,根据2.5文档服务化部署预测过程中出现报错 HOT 4
- 一张图片中两行文字发生了折叠,如果有文字发生折叠的图片和文字未发生折叠的图片,用哪一个分类模型效果会好一些? HOT 3
- PaddleClas 如何实现模型在train 以及 infer 的时候使用不同分支的forword HOT 1
- KeyError: 'save_infer_model/scale_0.tmp_1.lod' HOT 1
- PPLCNetV2_base_ShiTu模型增加图片的分辨率跟输出的维数会增加检索精度吗? HOT 1
- Direct prediction API HOT 1
- 关于paddleClas和paddlepaddle版本的问题 HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from paddleclas.