Giter Club home page Giter Club logo

twitter-movies-recommender-system's Introduction

Recommending movies from tweets

Wouldn't it be great to be able to recommend movies for every single twitter user ? This is the main goal of the project, we want to be able to ask you for your twitter handle, and if you've sent a few tweets, we should be able to recommend movies for you to watch !

We want to be able to recommend movies related to what you like generally and not forcibly linked to which movies you previously watched.

Dataset

We use a twitter dataset that EPFL's Data Lab provided coupled with the well known TMDB Kaggle database. The idea here is to select english tweets rating movies from the dataset, and select the users of these tweets and all their own tweets. This means that we restrain ourselves to the movies rated by users inside the dataset we hold.

Data Analysis

You can find in the notebook the analysis we made upon the genres of the rated movies, and the contents of the tweets.

We started off by visualizing the movies clustered by genres. As expected, the clusters are defined by a mix of multiple genres, which makes sense since movies are categorized as multiples genres most of the time.

Afterwards, we took our focus to the tweets' contents, performing a topic detection using Latent Dirichlet Allocation. It turns out that the most important topics were the movies ! Which made us think that recommending movies using the tweets themselves as a bag of words might just work !

Recommender system

Here is how we recommend movies to twitter users. We find the most similar users from our target by cosine distance, and then compute the recommendations by averaging out the ratings of the closest few. To learn more about the technique, read the notebook and the Medium piece ! The idea was to provide a simple proof-of-concept, and not a full blown accurate recommender system, to simply show that it could be done. There are loads of possible improvements to be done.

DISCLAIMER: TO SEE THE PLOTS EASILY, YOU SHOULD USE THIS LINK INSTEAD OF GITHUB'S VIEWER.

twitter-movies-recommender-system's People

Contributors

fouchepie avatar loomisloud avatar matthiasleroyepfl avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar

twitter-movies-recommender-system's Issues

The dataset is no longer available?

Hi LoomisLoud,

I really could not find the dataset anywhere on EPFL lab. Could you please recheck and see help us with a link to the dataset. It would be really helpful.

Regards,
Nilav Ghosh

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.