rssworker's Issues

Xiaohongshu RSS issue

When a followed Xiaohongshu account has a pinned post, its RSS feed keeps showing only the pinned post and never updates.
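
One possible workaround, sketched under assumptions: if the scraped notes carried some marker for pinned posts, the feed could move pinned notes behind the regular ones so that new posts show up first. The note shape and the `sticky` flag below are hypothetical and are not taken from RSSWorker's code.

```javascript
// Hypothetical note shape: { id, title, sticky }. The `sticky` flag is an assumption.
function demotePinned(notes) {
  // filter() preserves relative order, so regular notes keep their original sequence.
  const regular = notes.filter((note) => !note.sticky);
  const pinned = notes.filter((note) => note.sticky);
  // Regular notes first, pinned notes last.
  return [...regular, ...pinned];
}
```

Whether this helps depends on whether the upstream data distinguishes pinned notes at all, which the report does not confirm.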

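A Weibo feed returns 500

Original Weibo URL: https://weibo.com/u/7045535355
RSS feed URL: https://rss-worker.jerry30yang.workers.dev/rss/weibo/user/7045535355
I subscribe to more than 30 Weibo feeds without problems; so far only the one above fails.

The error is as follows: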
RSSWorker - Made with ❤
500 - Internal Server Error

debug message: TypeError: Cannot read properties of undefined (reading 'url')

stack:
TypeError: Cannot read properties of undefined (reading 'url')
at worker.js:2008:86
at Array.forEach (<anonymous>)
at Object.formatExtended (worker.js:2007:16)
at worker.js:21123:51
at async Promise.all (index 7)
at async deal4 (worker.js:21103:21)
at async dispatch (worker.js:3737:17)
at async worker.js:4246:25

See GitHub for more information.
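
The trace suggests that a callback inside `forEach` reads `.url` from a value that is sometimes undefined. A minimal defensive sketch follows; the `pics` and `large` field names are hypothetical stand-ins, since the report does not show which object is missing.

```javascript
// Hypothetical shape: status.pics is an array whose entries may be missing or lack `large`.
function collectImageUrls(status) {
  const urls = [];
  (status?.pics ?? []).forEach((pic) => {
    // Optional chaining yields undefined instead of throwing when a level is missing.
    const url = pic?.large?.url ?? pic?.url;
    if (url) urls.push(url);
  });
  return urls;
}
```

With a guard like this, a post with an unexpected media entry would be rendered without that image rather than failing the whole feed.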

bani0q42 14a

500 - Internal Server Error

It was working before, but today it suddenly became inaccessible.

debug message: TypeError: Cannot read properties of undefined (reading 'slice')

stack:
TypeError: Cannot read properties of undefined (reading 'slice')
at getUser (worker.js:20997:19)
at async deal4 (worker.js:21010:7)
at async worker.js:4087:62
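
Here `getUser` appears to call `slice` on a value that the upstream response no longer provides. A hedged sketch of validating the response first is shown below; `payload.list` is a hypothetical stand-in for whichever field is actually sliced.

```javascript
// Hypothetical helper: `payload.list` stands in for the field that getUser slices.
function firstItems(payload, limit = 10) {
  const list = payload?.list;
  if (!Array.isArray(list)) {
    // Fail with a message that names the likely cause instead of a bare TypeError.
    throw new Error("Upstream response did not contain the expected list");
  }
  return list.slice(0, limit);
}
```

This does not fix a changed upstream format, but it makes the failure easier to diagnose.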


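Xiaohongshu feed does not include the post body

As the title says, taking the Xiaohongshu account posted by the author as an example.

Only the text in the title is captured. To read the body text you have to open https://www.xiaohongshu.com/explore/658ffffb0000000011030663, that is, the page referenced in the link tag.

Is there another way to include the body text?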

<item>
        <title><![CDATA[大家都深爱着地球呀˵>ㅿ<˵!!~]]></title>
        <link>https:&#x2F;&#x2F;www.xiaohongshu.com&#x2F;user&#x2F;profile&#x2F;5d2aec020000000012037401&#x2F;658ffffb0000000011030663</link>
        <description><![CDATA[<img src ="http://sns-webpic-qc.xhscdn.com/202405161226/c4e6f9154001e7d6917e3900962c7a55/1040g2sg30taa7k6t3q005n9atg14mt01mhe2cm0!nc_n_nwebp_mw_1"><br>大家都深爱着地球呀˵>ㅿ<˵!!~]]></description>
        <author>雪糍</author>
    </item>

Bilibili RSS subscription anomaly

Something odd happens when subscribing to Bilibili. The same feed sometimes opens normally and sometimes fails to load, as shown in the screenshot. After a while, the same feed loads again.

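Xiaohongshu feed has no pubDate

Figure 1 shows the Xiaohongshu feed and Figure 2 the Bilibili feed. As you can see, the XML scraped for the Xiaohongshu feed lacks pubDate, the publication date of each note, so when subscribing to the Xiaohongshu feed, historical notes are not delivered in order. I don't know programming, so I don't know how to add pubDate...
Xiaohongshu feed: /rss/xiaohongshu/user/5efdeba4000000000101e7e6
Bilibili feed (may sometimes fail, apparently related to crawl frequency): /rss/bilibili/user/dynamic/1405395281
Figure 1: XML format of the scraped Xiaohongshu feed
Figure 2: XML format of the scraped Bilibili feed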
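
For illustration, a minimal sketch of how a pubDate element could be produced, assuming each scraped note exposes a publish time in milliseconds since the epoch. That field is an assumption; the report does not say whether Xiaohongshu provides it. The function name `pubDateTag` is also hypothetical.

```javascript
// Hypothetical input: timeMs is the note's publish time in milliseconds since the epoch.
function pubDateTag(timeMs) {
  if (!Number.isFinite(timeMs)) return "";
  const date = new Date(timeMs);
  // Out-of-range timestamps produce an invalid Date; skip the tag in that case.
  if (Number.isNaN(date.getTime())) return "";
  // toUTCString() gives a date of the form "Thu, 01 Jan 1970 00:00:00 GMT".
  return `<pubDate>${date.toUTCString()}</pubDate>`;
}
```

The returned string would go inside each `<item>` next to `<title>` and `<link>`; when no timestamp is available it returns an empty string so the item is still emitted.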
