Comments (5)
The issues with this solution are:
- It is going to be extremely slow;
- A vast majority of Twitter profiles have never been snapshot by the WBM, hence, we will not be able to get any data.
However, it is a possible solution to the situation.
from snscrape.
The issues with this solution are:
- It is going to be extremely slow;
- A vast majority of Twitter profiles have never been snapshot by the WBM, hence, we will not be able to get any data.
However, it is a possible solution to the situation.
1: Better than nothing
2: Yes I know this is the case, however it can scrape whatever twitter profile is used. This should not be a replacement for the regular Twitter scraped, more of a way to get posts of deleted or banned accounts and deleted Tweets.
from snscrape.
The additional complexity of supporting every past version of Twitter's web layout (rather than just the single current one) is not something I consider an adequate use of developer time, especially given the spotty coverage.
from snscrape.
The additional complexity of supporting every past version of Twitter's web layout (rather than just the single current one) is not something I consider an adequate use of developer time, especially given the spotty coverage.
I'd say to just support the first two or three most recent versions, as desire to archive Twitter only really gained motion since Elon took over, and luckily for us, Twitter's web layout has remained stagnant from about 2016 to 2022, and some captures shuffle the mobile layout which has not changed either. See http://web.archive.org/web/2/https://www.twitter.com/jack/status/20 as an example.
from snscrape.
'The site looks the same' doesn't mean there were no changes relevant for a scraper's code. The WBM also contains snapshots using at least four completely different Twitter website designs in just the last few years (the old design, the old simple/mobile design, the current simple design, and the current usual site which generally doesn't work in the WBM).
And you misunderstood me: I don't think supporting even a single additional version is worth the effort. I certainly won't be doing it. I might consider a well-written PR. Otherwise, this should be done outside of snscrape.
from snscrape.
Related Issues (20)
- Retrieve user metadata on Twitter HOT 1
- phpBB support HOT 1
- x HOT 1
- Username scraping available for Twitter? HOT 4
- Does this still work after the tweets view limits? HOT 1
- What is the specific date range of Instagram? HOT 2
- Example scripts for Telegram and Instagram HOT 2
- Issue: snscrape.base.ScraperException: Unable to find guest token HOT 2
- Cannot get tweets HOT 1
- List members of a Facebook group HOT 1
- vkontakte-user scrapes get redirected to badbrowser.php HOT 1
- Error scraping mastodon-profile HOT 1
- Twitter comment download HOT 1
- Instagram change something! index Error. HOT 1
- Question about the future of snscrape HOT 14
- AttributeError: 'FileFinder' object has no attribute 'find_module' HOT 3
- always 429 rate limit error HOT 1
- Twitter is not working HOT 1
- AttributeError: 'FileFinder' object has no attribute 'find_module' HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from snscrape.