Giter Club home page Giter Club logo

scraping-and-advanced-nlp-analysis-on-imdb-tlou-reviews's Introduction

Scraping and Advanced EDA and NLP Analysis on IMDB "The Last of Us" Reviews

The series TLOU has become very trendy these days and there have been many contradictions about the main actors of this series, Pedro Pascal and Bella Ramsey. In this project, I tried to understand the critics' feelings about the acting of these two actors based on the reviews of this series on IMDB

Project Overview :

- Scraping TLOU Reviews from IMDB

I used Beautifulsoup and Selenium to scrape data from TLOU review's page (over 1400 reviews)

  • Understanding the data

    • Shape of the data
    • Check column dtypes
    • Check is there any null values
  • Data Preprocessing

    • Convert Date Format to Datetime
    • Convert Rating Format to Int
  • Feature Engineering

    • Create Sentiment by Review Rating
  • Analyzing Reviews

    • What Rating Did the Users Give to the Series?
    • Whats the Min, AVG, Max of Ratings?
    • What's the number of reviews according to episodes release date?
    • Review Sentiments Based on Rating
    • What's the Most Frequent Words in the Reviews?
  • Extract Reviews with Bella Ramsey and Pedro Pascal Mentions and Apply Sentiment Analysis on Them for this task I used huggingface transformers for Sentiment Analysis and Spacy for NER

Libraries used in the project

scraping-and-advanced-nlp-analysis-on-imdb-tlou-reviews's People

Contributors

meysamraz avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.