Giter Club home page Giter Club logo

eluvio-data-challenge's Introduction

Eluvio-Data-Challenge

REQUIREMENTS

  • pyLDAvis
  • plotly.express
  • scikit learn
  • wordcloud

PROBLEM DESCRIPTION

This work seeks to do three main things:

  • Draw insight from the data as to the user reaction over the years as a measure of the growth of the news channel.
  • Visualize the new titles to have a fair idea what the most common topics are about.
  • Cluster the titles to draw insight into the main types of news posted and how users react to each of these news types.
  • Do a predictive analysis on how different types of news will be reacted to by the readers using the news titles to predict the whether or not the news will receive a high number of upvotes.

Findings

  1. The number of news posts are similar across the different months of the year.
  2. The number of news posted has grown steadily over the years and saw a huge spike between middle of 2012 and the first quarter of 2013
  3. The news titles most frequently contain the names of countries such as China, India, and Israel.
  4. Clustering the news into three main groups, the themes seen are:
  • News surrounding human right issues, politics, and civil unrests.
  • International trade relations and finance.
  • International war and terrorism
  1. The number of posts in these different categories is similar over the years
  2. News surrounding international war and terrorism are generally upvoted less

Predicitve Analysis

Two categories were created using a box plot the number of upvotes received for each post. The outliers above the upper fence of 38 were chosen to be the 'high' class since these news seems as news that receives more reactions than the regular distribution of upvotes. Different classification algorithms are applied to varying levels of success.

eluvio-data-challenge's People

Contributors

brighttsevi avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.