Busca Bike Scraper

A project for scraping bicycle sale listings from platforms such as OLX and Mercado Livre, among others.

Installation

  1. Clone the project:
$ git clone https://github.com/rochacbruno/buscabike-scraper.git
  2. Create the virtual environment and install the dependencies:
$ cd buscabike-scraper
$ python3 -m venv .venv
$ source .venv/bin/activate
$ pip install -r requirements.txt
  3. Run the desired spider. This example collects listings from OLX.
$ scrapy crawl olx

The collected data follows the example structure below:

{
  "_id": "<document id>",
  "url": "http://df.olx.com.br/distrito-federal-e-regiao/ciclismo/bicicleta-aro-24-435226286",
  "type": "Ciclismo",
  "price": " R$ 500,00",
  "created_at": "ISODate('2018-01-04T16:56:42.669Z')",
  "posted_at": "8 Janeiro às 16:15",
  "image": "http://img.olx.com.br/images/35/357804005117894.jpg",
  "district": "Santa Maria",
  "cep": "72505-222",
  "title": "Setor Total Ville",
  "description": "Listing description",
  "owner": "Name of the bicycle's owner",
  "city": "Brasília",
  "phone": "(61) 99999-9999"
}
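
The sample record stores `price` and `cep` as display strings. A minimal sketch of how a consumer of this data might normalize them, assuming prices always use the Brazilian format seen in the example (`.` for thousands, `,` for decimals). The helper names `parse_price` and `normalize_cep` are hypothetical and are not part of the project.

```python
import re
from decimal import Decimal


def parse_price(raw: str) -> Decimal:
    """Convert a price string such as ' R$ 500,00' to a Decimal (hypothetical helper)."""
    # Keep only digits, thousands separators (.) and the decimal comma.
    cleaned = re.sub(r"[^\d.,]", "", raw)
    if not cleaned:
        raise ValueError(f"no numeric value in {raw!r}")
    # Drop thousands separators, then turn the decimal comma into a point.
    return Decimal(cleaned.replace(".", "").replace(",", "."))


def normalize_cep(raw: str) -> str:
    """Reduce a CEP such as '72505-222' to its 8 digits (hypothetical helper)."""
    digits = re.sub(r"\D", "", raw)
    if len(digits) != 8:
        raise ValueError(f"expected 8 digits in CEP, got {raw!r}")
    return digits


# Field values copied from the sample record above.
ad = {"price": " R$ 500,00", "cep": "72505-222"}
price = parse_price(ad["price"])
cep = normalize_cep(ad["cep"])
```

`Decimal` is used here instead of `float` so that monetary values are not subject to binary rounding.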

How to Contribute

See CONTRIBUTING.md for ways to help with the project, and AUTHORS.md to find out who leads it and can assist you.
