zaidzentan / openwebtext Goto Github PK
View Code? Open in Web Editor NEWThis project forked from jcpeterson/openwebtext
Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.
License: GNU General Public License v3.0