Giter Club home page Giter Club logo

spliffli / denner-scraper Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 7 KB

A single-use web scraper which gets the contact information of all the Denner Sattelit, Partner & Express stores in Switzerland because it's recently become a requirement at my day job to contact all of them, and currently my colleagues are doing it by manually reading and typing each contact into HubSpot CRM one by one which would take ages

License: MIT License

Python 100.00%

denner-scraper's Introduction

denner-scraper

A single-use web scraper which gets the contact information of all the Denner Sattelit, Partner & Express stores in Switzerland because it's recently become a requirement at my day job to contact all of them, and currently my colleagues are doing it by manually reading and typing each contact into HubSpot CRM one by one which would take ages.


Steps

  • With selenium webdriver, navigate to https://www.denner.ch/de/filialen/

  • Apply the filters to show only the Sattelit, Partner & Express stores, and not the other types. This is given as a big list on one page which makes it fairly simple.

  • Iterate through the entire list:

    • extract data points from the dom/html, using either class, id or xpath selectors
    • add each data point as column values for a new row in a pandas dataframe.
  • Possibly clean the data if it needs it, or that could be left until after the next step.

  • Save the dataframe in .xls or .xlsx format

  • Do some validation check e.g. make sure it's the correct amount of rows

  • Review the created excel sheet to make sure it's prepared and compatible with HubSpot.

  • Import all the contacts into Hubspot at once from the excel file, saving weeks of time.

denner-scraper's People

Contributors

spliffli avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.