Comments (2)
一种另类的思路,用 js 爬虫,"曲线救国" 来解决搜狗登陆问题。用户具体的使用过程如下:
step 1. 开发者在自己的 browser 里,访问 weixin.sogou.com
step 2. 在网页上,手动登陆
step 3. 在登陆后的 weixin.sogou.com 页面上,打开 browser 的 console.
step 4. 调用一段,我们预先写好的 JS 爬虫代码,爬取获得文章的临时链接
step 5. 临时链接可以打包下载,交给我们的 python 爬虫来执行内容爬取&保存
这个思路是在,与@hczhcz 于2016年3、4月份讨论的时候,他所提出的
from wechatsogou.
开了个坑~希望我自己能尽快填完哈(~~~怎么有点不相信自己的填坑效率呢~~~)
https://github.com/ax4/WechatSogouJS
已知 Issue:
- 搜狗微信 - 搜文章, 只能显示 100页内容(未登陆仅前 10页,登陆后 100页)
- 使用JS爬虫仍然会跳出验证码。尝试添加Ruokuai
from wechatsogou.
Related Issues (20)
- 现在还可以获取微信的profile_url链接吗?
- 为什么报没有API接口的错误? HOT 2
- 现在import 就报错找不到模块是什么问题
- 请问这个项目还可以用吗,还在维护吗 HOT 1
- 获取不到公众号文章链接,profile_url为空 HOT 1
- bug: ModuleNotFoundError: No module named 'werkzeug.contrib' HOT 2
- 这代码咋使用,运行test里面的文件吗,通过不了,报下面错误,大佬怎么操作的
- 怎么获取微信公众号的biz
- [Bug report]有依赖损坏 HOT 1
- 关于验证码解决的问题。就是禁止验证码出现 HOT 7
- 网络请求太频繁,微信觉得框架异常,所以会出现验证码 HOT 1
- 模块已经安装,报错ModuleNotFoundError: No module named 'werkzeug.contrib' HOT 4
- 怎么解决验证码问题 HOT 4
- get_gzh_article_by_history文章列表为空 HOT 2
- 无法解析带有*的文章链接 HOT 1
- python3.8 安装完包执行报错 HOT 2
- ws_api.get_gzh_info 调用这个接口报这个错误 ('WechatSogouAPI get img', <Response [403]>)
- 运行demo没有反应,直接退出了 HOT 1
- 博主有联系方式吗?想谈合作~ HOT 2
- ('WechatSogouAPI get img', <Response [403]>)加上代理也一样报,如何解决403? HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from wechatsogou.