Giter Club home page Giter Club logo

dature's Issues

抓取报错

范例中抓取牛根生的博客测试没问题,但下面这个博客抓取则会报错
http://blog.sina.com.cn/u/1587015434

报错:(node:19969) UnhandledPromiseRejectionWarning: TypeError: Cannot read property 'replace' of null at Object.extract [as sina] (/usr/local/lib/node_modules/dature/lib/extractor/sina.js:45:8) at processTicksAndRejections (internal/process/task_queues.js:93:5) at async fetch (/usr/local/lib/node_modules/dature/lib/fetch.js:18:16) (Use node --trace-warnings ...to show where the warning was created) (node:19969) UnhandledPromiseRejectionWarning: Unhandled promise rejection. This error originated either by throwing inside of an async function without a catch block, or by rejecting a promise which was not handled with .catch(). To terminate the node process on unhandled promise rejection, use the CLI flag--unhandled-rejections=strict (see https://nodejs.org/api/cli.html#cli_unhandled_rejections_mode). (rejection id: 1) (node:19969) [DEP0018] DeprecationWarning: Unhandled promise rejections are deprecated. In the future, promise rejections that are not handled will terminate the Node.js process with a non-zero exit **code.**

恳请您的帮助!!

❗当我用您的程序爬取博文到一定数量时,它就这样报错了。❗
1

  • ❌我甚至用Linux笔记本跑了一下,也在一定页数出现了相同的错误(另外一台电脑,不是虚拟机)
  • ❌不管是Windows10还是Linux文件夹权限都是对的,用管理员方式运行也不行😥
  • ❌我删除了node_modules文件夹好几遍重新安装也没用😭
  • ❌自己处理问题的能力有限,Google&bing&百度都找过问题的解决办法,但是我自己还是不会解决😥
  • 🙏非常希望您可以帮助我解决这个问题,甚至💰有偿💰也愿意,我想帮我姥姥把她写了十来年的文章导出来。🙏
  • 🎄等待您在百忙之中的回复的,这个问题困扰我两个星期了😢🎄
  • ❤爱您

博文配图部分丢失

尝试了很多次,每次都会丢部分配图,显示不出来,如图所示,希望可以得到解决
谢谢作者开发出这么棒的软件,帮了我们的大忙
屏幕截图 2022-09-16 184257

能不能让程序调用IE的登录状态呀?这样就可以抓取成功了吧。

博客现在不对外开放了,但是作者本人登录后,文章列表什么的都和以前一样。
现在抓取的时候提示抓取成功,0文章。(也就是没有登录状态)
能不能修改一下让程序调用IE的登录状态呀?这样就可以抓取成功了吧。
望百忙之中抽空回复一下。谢谢

报个错

给老哥请个安先 LL

然后是报错,抓取完毕之后显示下面这个,而且博文图片有漏抓,不知跟这个错有关西不
抓取完毕, 博客存储目录:/Users/qulo/blog

/usr/local/lib/node_modules/dature/node_modules/picture-downloader/pictureDownloader.js:26
const request = url.startsWith('https') ? https.request : http.request
^

TypeError: Cannot read properties of undefined (reading 'startsWith')
at pictureDownloader (/usr/local/lib/node_modules/dature/node_modules/picture-downloader/pictureDownloader.js:26:23)
at ClientRequest. (/usr/local/lib/node_modules/dature/node_modules/picture-downloader/pictureDownloader.js:41:33)
at ClientRequest.emit (node:events:394:28)
at Socket.socketCloseListener (node:_http_client:423:9)
at Socket.emit (node:events:406:35)
at TCP. (node:

两个问题

挺好用,发现以下两个过期项目:
1、新浪博客;size结尾的图片下载不了,去掉后才能下载
2、csdn下载失效了

自己的博客无法导出

执行命令后,我自己的博客无法导出
能识别到我博客的名字,但是导出0条
然后发现点击任何人的博客都提示:系统维护中,博文仅作者可见。登陆后可查看本人文章。
可能新浪博客快要停运了。
关闭这个权限了?

关于cookie的问题

sss
为什么在这个页面获取到的cookie是这一个呀,与大佬教程的cookie格式不符合。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.