Comments (4)
Thanks, @KOLANICH.
Web scraping is a tricky topic. We worked very hard to write a balanced policy that allows some scraping, but limits abuses and requires scrapers to enact certain privacy-protective policies.
We appreciate your suggestions, but at this time, the site-policy repository is only for discussions of policies. We are not able to approve changes to GitHub functionality, such as new APIs.
from site-policy.
I came here googling for scraping policy information. This policy text about scraping "Researchers may scrape..." "Archivists may scrape..." is that present in live github policy docs anywhere? I guess it used to be, but was since rearranged. I can only find a GitHub Acceptable Use Policies - Spam and inauthentic activity section which seems relevant. "excessive automated bulk activity" is disallowed, but that's less specific, not mentioning "scraping" directly, and certainly no "You may scrape the website for the following reasons"
Over on this github-crawler-lib repo I see they've lifted out the same text about researchers and archivist, but is that text old?
from site-policy.
@harry-wood yes, it's in this repo at https://github.com/github/site-policy/blob/f05aeb95464408a8d854250b2033302e11cf395f/Policies/github-acceptable-use-policies.md#7-information-usage-restrictions which is rendered two sections below the one you linked at https://docs.github.com/en/site-policy/acceptable-use-policies/github-acceptable-use-policies#7-information-usage-restrictions but yes the language was made more general in #309
from site-policy.
is there a data dump for public research somewhere? preferably a .torrent file?
from site-policy.
Related Issues (20)
- We want to thank everyone for their review and feedback on the Privacy Statement Update. We appreciate and share your passion for developer privacy. GitHub remains committed to having the highest privacy standards and will continue to center the needs of developers in all of our platform decisions. We intend for this to be a minimally invasive change that will enable us to provide the best tools to our users. In response to your comments, we are providing the following changes and points of clarification:
- Hello world HOT 1
- Bhdrmms HOT 2
- Some one hack this polli y please block the Lunix window site HOT 3
- Outdated how to update all HOT 1
- Recuperando
- My payment back
- Typo in site-policy/CONTRIBUTING.md HOT 1
- Policy
- 油猴
- gearky forsk
- My neighbor was cloning my phone
- Contribute
- Reporting typos in doc: "Building a CLI with a GitHub App" HOT 3
- Yo
- Manager
- We want to thank everyone for their review and feedback on the Privacy Statement Update. We appreciate and share your passion for developer privacy. GitHub remains committed to having the highest privacy standards and will continue to center the needs of developers in all of our platform decisions. We intend for this to be a minimally invasive change that will enable us to provide the best tools to our users. In response to your comments, we are providing the following changes and points of clarification: HOT 1
- ###???
- services.google.com/corporate/publickey.txt
- Manager
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from site-policy.