Comments (11)
高度32 宽度可变,字符数可以变
from crnn_chinese_characters_rec.
如果我生成的数据字符数多于10个,那图片是不是也要成比例加长?我的意思是必须保证每个字符占28个像素吗,少于28个像素可不可以
from crnn_chinese_characters_rec.
没有要求,字体大小是随机的,只要字符能出现在图片中就行
from crnn_chinese_characters_rec.
@Sierkinhane 训练不定长度的数据代码需要修改么?是要对一个batch中的不用长度进行补齐么? 我lmdb制作ok了,直接跑的话跑不起来,代码运行到 cost = criterion(preds, text, preds_size, length) / batch_size 这行就会段错误,求教
from crnn_chinese_characters_rec.
我在训练时字符数和宽高都是定的,只需要在测试时修改一小部分代码就可以实现不定长识别, 可以把报错的信息发出来吗
from crnn_chinese_characters_rec.
你好,我的训练数据是定宽32,不定长的, 是训练代码出错的 ,目前只显示了Segment fault ..
from crnn_chinese_characters_rec.
from crnn_chinese_characters_rec.
可以
from crnn_chinese_characters_rec.
我的意思是如果我生成的间距与真实数据不一样,会不会影响识别效果呀?
from crnn_chinese_characters_rec.
高度固定到32的话,也就是说,竖排文字就没法用了。
from crnn_chinese_characters_rec.
高度固定到32的话,也就是说,竖排文字就没法用了。
对的,因为高度固定32是为了保证CNN的部分最后一层输出的宽是1,这样才能进到RNN层里面。我感觉如果你想做竖版文字识别的话首先训练集得有竖版文字,其次要把crop出来的竖版文字图像旋转一下再做resize然后传入网络。
from crnn_chinese_characters_rec.
Related Issues (20)
- 这个网络是否采用了GRU?
- 尝试了安装这个包也不行,请问要怎么处理
- 字符位置预测问题
- demo和validate结果不一致 HOT 1
- export to onnx issue
- The disk data is full, and the GPU operation efficiency is very low
- ctc loss error
- 作者提供的数据集为什么只有156万多,没有360w HOT 1
- 如何进行测试呢?
- Test loss: 5.1653, accuracy: 0.0000. I tried to train 2 images as own data, accuracy is always 0.0000. What is the probem? [email protected]
- 过拟合问题
- 训练loss不降,acc=0, 预测为空 HOT 2
- 训练速度太慢了,怎么解决??
- 怎么使用多卡训练 HOT 2
- log_softmax HOT 1
- demo里识别不了车牌和验证码,只能识别截图的字 HOT 3
- loss一开始就0.5左右,后续能降到0.00几 HOT 2
- mm
- 数据集问题
- 修改完8P后loss不收敛
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from crnn_chinese_characters_rec.