Comments (7)
It looks like the batch spider only supports MySQL and not MongoDB, is that right?
from feapder.
@Leezj9671
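It looks like the batch spider only supports MySQL and not MongoDB, is that right?
The task table and the batch record table only support MySQL. Where the data table is stored can be customized; see: https://boris.org.cn/feapder/#/source_code/pipeline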
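To make the "customizable storage" point concrete, here is a minimal sketch of what a custom pipeline could look like. It assumes the pipeline interface receives a table name plus a list of item dicts and returns a success flag; check that against the linked documentation before relying on it. `InMemoryPipeline` is a hypothetical name, and it stores items in a plain dict so the example runs without feapder or a database.

```python
from typing import Dict, List


class InMemoryPipeline:
    """Hypothetical stand-in for a custom storage pipeline.

    In a real project this class would follow the pipeline base class
    described in the linked docs and write to MongoDB or another store
    instead of a Python dict.
    """

    def __init__(self) -> None:
        # Maps table name -> list of stored items.
        self.tables: Dict[str, List[Dict]] = {}

    def save_items(self, table: str, items: List[Dict]) -> bool:
        """Store a batch of items; return True to signal success."""
        self.tables.setdefault(table, []).extend(items)
        return True


pipeline = InMemoryPipeline()
pipeline.save_items("news", [{"title": "hello"}])
```

Only the data items go through a pipeline like this; per the reply above, the task table and batch record table still live in MySQL.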
Can batches be cycled at minute-level intervals, for example starting once every 5 minutes?
batch_interval=7, # batch period in days; for hours, write 1 / 24. Can other modes be set? For example, run Monday to Friday and rest on Saturday and Sunday.
@lhsnet347
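batch_interval=7, # batch period in days; for hours, write 1 / 24. Can other modes be set? For example, run Monday to Friday and rest on Saturday and Sunday.
Bro, batch_interval is used to separate one batch of data from the next. For example, if you set it to 7 and the crawl finishes in 3 days, then when you restart the spider later it sees that fewer than 7 days have passed and does not crawl again. It is not a scheduler. For what you want, just use a spider management system to start it on a schedule.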
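The distinction between a batch interval and a schedule can be sketched in a few lines. This is not feapder's implementation, only an illustration of the behavior described in the reply; `is_new_batch_due` and `is_workday` are hypothetical helper names.

```python
from datetime import datetime, timedelta


def is_new_batch_due(last_batch_start: datetime, now: datetime,
                     batch_interval_days: float) -> bool:
    """Return True once a full batch interval has passed since the last batch began."""
    return now - last_batch_start >= timedelta(days=batch_interval_days)


def is_workday(now: datetime) -> bool:
    """Monday is 0 and Sunday is 6, so values below 5 are Monday to Friday."""
    return now.weekday() < 5


last_start = datetime(2024, 1, 1)  # a Monday
# Restarting 3 days into a 7-day interval does not open a new batch.
print(is_new_batch_due(last_start, datetime(2024, 1, 4), 7))
# A weekday-only rule belongs in whatever launches the spider, not in batch_interval.
print(is_workday(datetime(2024, 1, 6)))
```

The interval check only answers "has enough time passed to start a fresh batch?". A rule such as "weekdays only" is a launch decision, which is why the reply points to an external scheduler.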
@Leezj9671
It looks like the batch spider only supports MySQL and not MongoDB, is that right?
Data storage supports MongoDB; the task table does not.
@zzjj1988
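Can batches be cycled at minute-level intervals, for example starting once every 5 minutes?
You can use a spider management system to set that up and manage it.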
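If no management system is available, the same idea (an external launcher that starts the spider on a fixed cadence) can be approximated with a small loop. This is only a sketch using the standard library, not a replacement for a real scheduler; `run_every` is a hypothetical helper, and `job` stands for whatever function starts your spider.

```python
import time
from typing import Callable


def run_every(interval_seconds: float, job: Callable[[], None], max_runs: int) -> int:
    """Call job() max_runs times, spacing the start of each run by interval_seconds."""
    runs = 0
    while runs < max_runs:
        started = time.monotonic()
        job()
        runs += 1
        if runs < max_runs:
            # Sleep only for the part of the interval the job did not use.
            remaining = interval_seconds - (time.monotonic() - started)
            if remaining > 0:
                time.sleep(remaining)
    return runs


# Demo with a very short interval; for a 5-minute cadence pass 300 seconds.
run_every(0.01, lambda: print("start spider"), max_runs=2)
```

A production setup would still want what a management system or cron provides: restart on failure, logging, and protection against overlapping runs.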