This repository contains python code for scrapping newspapers article using pythonfast and powerful web-crawling and scraping framework scrapy.
Scrapy open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way. To install scrapy create a virtualenv and use pip
pip install scrapy
-
Create a new scrapy project. Run
scrapy startproject news_scrapper
in the terminal. This will create news_scrapper directorynews_scrapper/ scrapy.cfg news_scrapper/ __init__.py items.py middlewares.py pipelines.py settings.py spiders/ __init__.py
Add more steps