Comments (8)
Sorry, last thing, but I think the type for headers in FetchOptions is incorrect. I believe it should be something like Record<string, string>:
export interface FetchOptions {
  /**
   * list of request headers
   * default: null
   */
  headers?: string[];
  /**
   * the values to configure proxy
   * default: null
   */
  proxy?: ProxyConfig;
}
from feed-extractor.
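To make the suggested change concrete, here is a minimal sketch of how the type could look with headers as name/value pairs. The interfaces are declared locally for illustration; the one-field ProxyConfig and the buildHeaders helper are assumptions, not part of the library.

```typescript
// Assumed shape: only `target` is mentioned in this thread.
interface ProxyConfig {
  target: string;
}

// Local copy of the interface with the proposed headers type.
interface FetchOptions {
  /** request headers as name/value pairs; default: null */
  headers?: Record<string, string>;
  /** the values to configure proxy; default: null */
  proxy?: ProxyConfig;
}

// Hypothetical helper: merge caller headers over a default set.
function buildHeaders(options: FetchOptions = {}): Record<string, string> {
  const defaults: Record<string, string> = { accept: 'application/xml' };
  return { ...defaults, ...(options.headers ?? {}) };
}

const merged = buildHeaders({ headers: { 'user-agent': 'my-feed-reader/1.0' } });
console.log(merged);
```

With string[] the compiler would reject the object literal passed to buildHeaders, which is the mismatch the comment above describes.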
Awesome! Will check it out. It would be great to expose the utils that validate whether a feed is XML or JSON, or to have a unified entrypoint that runs the validation & normalization, but I will copy those over for now!
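Until such utils are exposed, a rough format check can be done on the caller's side. The sketch below is not the library's validation logic; detectFeedFormat is a hypothetical helper that only sniffs the first character of the payload and confirms that JSON parses.

```typescript
type FeedFormat = 'json' | 'xml' | 'unknown';

// Hypothetical helper: a cheap sniff, not a full validator.
function detectFeedFormat(text: string): FeedFormat {
  // Drop a leading byte-order mark and surrounding whitespace.
  const trimmed = text.replace(/^\uFEFF/, '').trim();
  if (trimmed.startsWith('{')) {
    try {
      JSON.parse(trimmed);
      return 'json';
    } catch {
      return 'unknown';
    }
  }
  // Anything starting with '<' is treated as XML; well-formedness is not checked.
  if (trimmed.startsWith('<')) {
    return 'xml';
  }
  return 'unknown';
}

console.log(detectFeedFormat('<?xml version="1.0"?><rss></rss>'));
console.log(detectFeedFormat('{"version":"https://jsonfeed.org/version/1.1","items":[]}'));
```

The result can then decide which string-based extraction method to call.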
Regarding https://www.nature.com/nature.rss, we have no plans to support the RDF format right now, because this format is quite rarely used.
Makes sense
Could you share more info about your code here? This lib does not modify or verify the proxy URL. It simply prefers the URL from the proxy config if one is present.
I'm doing something like
const res = await read(
  feed.xmlUrl,
  {},
  {
    proxy: {
      target: 'http://127.0.0.1:3001',
    },
  },
);
where the target is the URL I initially shared, or any IP/port combination, and I'm getting back an Invalid URL error.
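The thread does not establish which string triggers the error. One way to narrow it down is to pass each candidate (the feed URL, the proxy target, and any combination of the two) through the standard URL constructor, which throws when a string cannot be parsed. The checkUrl helper below is a hypothetical diagnostic, not part of the library.

```typescript
interface UrlCheck {
  ok: boolean;
  reason?: string;
}

// Hypothetical diagnostic: reports whether a string parses as an http(s) URL.
function checkUrl(candidate: string): UrlCheck {
  try {
    const parsed = new URL(candidate);
    if (parsed.protocol !== 'http:' && parsed.protocol !== 'https:') {
      return { ok: false, reason: `unsupported protocol ${parsed.protocol}` };
    }
    return { ok: true };
  } catch (err) {
    // The constructor throws a TypeError for unparseable input.
    return { ok: false, reason: (err as Error).message };
  }
}

for (const candidate of ['http://127.0.0.1:3001', '127.0.0.1:3001']) {
  console.log(candidate, checkUrl(candidate));
}
```

A target without a scheme, such as a bare IP/port pair, fails this check, so it is worth confirming that every string reaching the fetch call still carries its http:// or https:// prefix.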
Also, what would be the lift on supporting RDF feeds? https://rss.slashdot.org/Slashdot/slashdotMain is another big one I'm interested in. Seeing the format quite a bit through my explorations.
@kylealwyn thank you. RDF can reuse almost all of the logic from the RSS parser. I will try to implement a draft.
Curious if something similar to extractus/article-extractor#326 is viable for this library - it'd be great to fetch the xml on my own and provide that to this parser
@kylealwyn same idea, this lib should have that method too.
@kylealwyn https://www.nature.com/nature.rss uses RDF. It's been a long time since I've seen this format!
@kylealwyn v6.2.1 has just been released with 2 new methods for extracting feed data from an XML or JSON string. That may resolve your case.
Regarding https://www.nature.com/nature.rss, we have no plans to support the RDF format right now, because this format is quite rarely used.
Somewhat related, I'm also trying to use a proxy but to no avail as http://[email protected]:8887 is throwing Invalid URL
Could you share more info about your code here? This lib does not modify or verify the proxy URL. It simply prefers the URL from the proxy config if one is present.