Comments (13)
感谢大佬指点。
我想50%的识别率,每2张中就能认出1张,还是可以用的。
我程序加了个小技巧:每次验证码都按时间戳做文件名存盘。凡是通过验证进入下一页面的,我就将文件名改为验证码。这样至少有一半的图片是自动打了标签的(手工打太累了)。
于是乎一上午又多了300多张可用的样板,重新学习后,识别率高达93%。
from captcha_break.
@armstrong1972 你好,请问应该如何修改读取自己的图片数据集呢?小白完全不懂该如何下手....
修改 CaptchaDataset 类即可,给你个文件做参考
只要在主程序下,建2个子目录:
img/train 放train的图片
img/valid 放valid的图片
主程序内删除 CaptchaDataset 类所有代码,头部增加 :
from dset_files import CaptchaDataset
from captcha_break.
增加数据量
from captcha_break.
凡是通过验证进入下一页面的,我就将文件名改为验证码。
这是非常好的思路。
from captcha_break.
@armstrong1972 你好,请问应该如何修改读取自己的图片数据集呢?小白完全不懂该如何下手....
from captcha_break.
from captcha_break.
from captcha_break.
请问您是怎么实现保存验证码的?我目前是用selenium + chrome + pyautogui实现的,但是现在需要将chrome设为headless,那pyautogui的右键保存功能就不能用了。请问您有别的方法吗?
from captcha_break.
不要用 pyautogui ,selenium 本身就有存图片的功能
from selenium import webdriver
PATH_ChromeDriver = "chromedriver.exe"
drvChrome = webdriver.Chrome(PATH_ChromeDriver)
drvChrome.get(HOME_PAGE)
imgVC=drvChrome.find_element_by_id("id_of_captcha")
f_vcode = basepath + "/img/__temp.png"
with open(f_vcode,'wb') as fp:
fp.write(imgVC.screenshot_as_png)
from captcha_break.
很好用,非常感谢!因为变成120*30像素的png了,用作者的模型就报错了,
RuntimeError: Given groups=1, weight of size 32 3 3 3, expected input[128, 4, 30, 120] to have 3 channels, but got 4 channels instead
我想是因为jpg3个通道,png4个通道的缘故,但是不会改模型参数,劳请您再指教下!谢谢!
from captcha_break.
抱歉忘了提醒了,需要你去找个工具批量将他们改为 jpg 3通道文件。
from captcha_break.
或者 :用程序读取4通道png,然后转成3通道
from PIL import Image
im = Image.open(im_path)
if im.shape[2] == 4:
im =im.convert("RGB")
from captcha_break.
或者 :用程序读取4通道png,然后转成3通道
from PIL import Image
im = Image.open(im_path)
if im.shape[2] == 4:
im =im.convert("RGB")
您好,我在修改为读取图片测试时,遇到这样的问题:
RuntimeError: input.size(-1) must be equal to input_size. Expected 512, got 256
应该是输入尺寸的问题,但是我不清楚在哪里修改,劳烦指教一下,谢谢
from captcha_break.
Related Issues (20)
- 如何把保存下来的ctc模型载入继续训练呢?
- 您可否告知下这几个文件是独立运行的吗,如何训练自己的中文验证码呢 HOT 5
- RNN分类之后, 在评估处怎么获得每个字符的概率呢 HOT 1
- 楼主您好,请问3500常用汉字的验证码识别,该模型大小够吗? HOT 5
- 多行验证码如何识别呢? HOT 4
- loss为负数且不断减小
- cannot import name '_imaging' from 'PIL'
- 变长标签怎么处理
- 如果验证码最后两位相同,似乎一定识别错误 HOT 4
- 运行winpy/main.py遇到的问题 HOT 5
- train和val的acc都可以到99%,但是eval,只有0.00265,这是怎么回事呢
- 尝试把n_class+1程序可以运行,但是不知道对不对 HOT 1
- 请问如果是不定长的验证码 可以使用吗 HOT 2
- 使用CTC, 识别时不限制4个字符长度,识别率如何? HOT 3
- CTC模型不定长输出问题 HOT 1
- 效果不理想
- 训练完了怎么用啊,纯小白 HOT 2
- cnn_2019.ipynb(防止 tensorflow 占用所有显存)tensorflow2.0要怎么改 HOT 1
- tensorflow 2.0 训练的时候 日志不显示 不知道 训练到哪一步 HOT 1
- 请问我改如何替换掉ctc_2019中的lambda方法呢 因为lambda在加载保存的模型会有错误 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from captcha_break.