Comments (7)
Thanks for the feedback. The bug has been fixed; please pull the latest version and try again.
from baiduimagespider.
$ python crawling.py -h
usage: crawling.py [-h] -w WORD -tp TOTAL_PAGE -sp START_PAGE
                   [-pp [{10,20,30,40,50,60,70,80,90,100}]] [-d DELAY]

optional arguments:
  -h, --help            show this help message and exit
  -w WORD, --word WORD  keyword to crawl
  -tp TOTAL_PAGE, --total_page TOTAL_PAGE
                        total number of pages to crawl
  -sp START_PAGE, --start_page START_PAGE
                        starting page
  -pp [{10,20,30,40,50,60,70,80,90,100}], --per_page [{10,20,30,40,50,60,70,80,90,100}]
                        page size (images per page)
  -d DELAY, --delay DELAY
                        crawl delay (interval between requests)
---------------------
$ python crawling.py --word "美女" --total_page 10 --start_page 0 --per_page 30
You could try running it from the command line, as shown above.
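For readers who want to see how flags like these fit together, here is a minimal sketch of an argparse parser that accepts the same options as the help text above. It is not the project's actual code: the function name `build_parser`, the default values, and the sample keyword are assumptions made for illustration only.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags listed in the help output."""
    parser = argparse.ArgumentParser(prog="crawling.py")
    parser.add_argument("-w", "--word", type=str, required=True,
                        help="keyword to crawl")
    parser.add_argument("-tp", "--total_page", type=int, required=True,
                        help="total number of pages to crawl")
    parser.add_argument("-sp", "--start_page", type=int, required=True,
                        help="starting page")
    # The default of 30 is an assumption; the help text does not state one.
    parser.add_argument("-pp", "--per_page", type=int, nargs="?", default=30,
                        choices=range(10, 101, 10), help="page size")
    # The default delay is also an assumption.
    parser.add_argument("-d", "--delay", type=float, default=0.05,
                        help="crawl delay (interval between requests)")
    return parser


# Same flags as the example command, with a placeholder keyword.
args = build_parser().parse_args(
    ["--word", "cat", "--total_page", "10", "--start_page", "0", "--per_page", "30"]
)
# Number of images to expect if every page returns a full set of results.
expected_images = args.total_page * args.per_page
```

With 10 pages of 30 images each, a complete run would be expected to save 300 images, so a run that stops at 150 has fetched only half of the requested pages.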
OK, I'll try it after I get home from work today.
Still the same: it only downloaded 5 pages, i.e. 150 images. Are you able to download 300 images on your end?
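Would it be possible to save each image under its default string name instead of an incrementing number? I'd like the code to check whether an image has already been downloaded and saved locally and, if so, skip saving it again.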
I'd suggest modifying the source code yourself.
I'm not sure how to get the image name in code. Could you give me some pointers?
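One possible approach, sketched here under assumptions: derive a stable file name from the image URL (here a SHA-256 hash of it) and skip the write when a file with that name already exists. This is not code from the project; `image_filename` and `save_if_new` are hypothetical helpers, and the sketch assumes the image URL and the downloaded bytes are both available at the point where the crawler saves a file.

```python
import hashlib
import os


def image_filename(url, ext=".jpg"):
    """Return a stable file name derived from the image URL (hypothetical helper)."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return digest + ext


def save_if_new(url, data, directory):
    """Write data to directory unless this URL was already saved.

    Returns True if a new file was written, False if it already existed.
    """
    path = os.path.join(directory, image_filename(url))
    if os.path.exists(path):
        return False  # already downloaded, skip it
    with open(path, "wb") as f:
        f.write(data)
    return True
```

Because the name depends only on the URL, running the crawler again over the same results would skip files that are already on disk. The same image served from a different URL would still be saved twice; hashing the downloaded bytes instead of the URL would catch that case, at the cost of downloading before checking.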