Giter Club home page Giter Club logo

yelp-ratings-prediction's Introduction

Yelp-Ratings-Prediction

Analysed Yelp Dataset to Identify which factors contribute most to whether a business has a high or low rating.

Dataset Description:

Dataset was obtained from Kaggle.It includes information regarding business attributes to check ins and reviews on 174,000 businesses .Used 4 of the 7 provided datasets;Business,Business Attributes,Tips, Checkins.Kaggle Dataset Link

Project Workflow:

Data Preparation--->Exploratory Analysis---->Detailed Analysis

Findings & Recommendation:

Finding:1

Finding 2 Whether the business is a restaurant or not is the most defining factor for ratings, since it splits the tree at the first level. A more interesting finding is that if the business is not a restaurant, the data is split by review count <= 3.5. This is a really low number of reviews, suggesting that people tend to review restaurants much more than they review other types of businesses on Yelp.

Thus, restaurants, coffee shops, etc. leverage Yelp reviews as a way to boost their online presence and attract more customers.

Finding 2:

Finding 2

Number of positive tip mentions is the most defining factor… if number of positive count is greater than 21.5 and number of negative tip count is less than 3.5, then there is a high chance of high rating for the restaurant. If the number of negative tip count is higher but number of positive tip comment is higher than number of negative count than there is high chance of high rating. Surprisingly, for restaurants with a low number of reviews, Dogs Allowed is an important defining factor for whether they have a high or lower rating.

So it's very important that managers give special attention to the comments that customers give. In case they are negative, they should talk to customers immediately to know their reasons of dissatisfaction and improve their restaurant and rating.

Finding 3:

Finding 3

The finding based on the above is that the Starbucks in the US is rated lower than those outside of the US (which valideates our finding). So, for the cluster 2 where Starbucks is in the US we see that it has the lowest star rating. Also, we can see that those that have the highest star ratings(cluster 1) are reviewed the least. We can see that the results for Starbucks stayed almost the same as for McDonald's and differes from Subway.

Customer perceptions of a restaurant can be very different within the US versus outside the US. Chain restaurants can do some market research so that they can tailor their food offerings and their marketing strategy to each particular country/region.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.