gh-scraper

This simple tool parses saved Grubhub restaurant pages and outputs the menu items in an easy-to-use JSON format.

It was created for use with McD4Me. Note that this is not an actual Grubhub scraper: Grubhub loads the content of its pages via HTTP requests, so the tool does not fetch anything itself. Instead, you feed it a saved HTML file, which it parses and converts to JSON.

Usage

  1. First, make sure BeautifulSoup is installed. See the BeautifulSoup documentation for installation details.
  2. Head to the Grubhub page for your desired restaurant.
  3. Save the HTML page after it has finished loading. This can be done by viewing the page source and saving it, or simply by pressing CTRL-S/COMMAND-S on the page.
  4. Move the HTML file to the same directory as scrape.py.
  5. Change the 'FILENAME.html' and 'FILENAME.json' parameters on lines 38 and 59 respectively, replacing FILENAME with the file name of your choice.
  6. Run python3 scrape.py

The JSON file is written to the same folder as the input HTML file. For a sample input and its output, refer to the example folder.
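Once the JSON file exists, it can be consumed with Python's standard json module. The following is a minimal sketch, assuming the file holds an array of objects shaped as described in the JSON output section below. The helper names load_menu and group_by_category are hypothetical and are not part of scrape.py; an inline sample stands in for a real output file.

```python
import json
from collections import defaultdict


def load_menu(path):
    """Read the JSON array written by scrape.py from the given path."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def group_by_category(items):
    """Map each "group" value to the list of item names in that group."""
    grouped = defaultdict(list)
    for item in items:
        grouped[item["group"]].append(item["name"])
    return dict(grouped)


# Inline sample standing in for the contents of a real output file.
sample = json.loads(
    '[{"group": "Kung Fu Classic", "name": "Kung Fu Black Tea",'
    ' "id": "KunFuBlaTea", "price": 3.58}]'
)
menu_by_group = group_by_category(sample)
print(menu_by_group)
```

With a real file, replace the inline sample with load_menu('FILENAME.json'), using whatever output name was set in step 5.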

JSON output

The JSON output comes in the following format:

{
  "group":"Kung Fu Classic",
  "name":"Kung Fu Black Tea",
  "id":"KunFuBlaTea",
  "price":3.58
}

The tool identifies each menu item's category (stored under "group"), name, and price, and generates a string id for it. It creates one object per menu item with this information and collects all of the objects in a single array.
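The sample above suggests how the id might be derived: "Kung Fu Black Tea" becomes "KunFuBlaTea", which matches taking up to the first three characters of each word and joining them. The sketch below works under that assumption only. The rule is inferred from a single example, and the functions make_id and make_item are hypothetical, so the actual logic in scrape.py may differ.

```python
import json


def make_id(name):
    # Assumption: keep up to the first three characters of each
    # whitespace-separated word, then join them without separators.
    return "".join(word[:3] for word in name.split())


def make_item(group, name, price):
    # Build one menu-item object with the four keys shown in the sample output.
    return {"group": group, "name": name, "id": make_id(name), "price": price}


# Collect every item in a single list, then serialize it as one JSON array.
items = [make_item("Kung Fu Classic", "Kung Fu Black Tea", 3.58)]
output = json.dumps(items, indent=2)
print(output)
```

Ids built this way are not guaranteed to be unique: two names whose words share the same leading characters would collide, so treat the id as a convenience label rather than a key.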
