mp-pru15's Introduction

Analysis of PRU-15

About

The primary objective of this project is to demonstrate methods for handling messy real-world data. The domain of interest is the results of Malaysia's 15th General Election (PRU-15).

Notebook 1: Scraping the Winners' Names from a News Article

Web scraping is an essential skill for a data professional. Extracting the correct information requires an understanding of the data and its context. Without domain knowledge, we may pass wrong or misleading information to the next analyst, who might then build a narrative on it to suit their own agenda. Our task is to collect the most accurate data possible. Here, I present a method for extracting election results from a news website article.
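As a rough illustration of how domain knowledge can guard a scrape, the sketch below parses result lines and rejects any seat code outside P001 to P222 (the Dewan Rakyat has 222 seats). It is not the notebook's actual code: the line format `P001 Seat: Winner (Party)`, the function names, and the `css_selector` default are all assumptions made for this example, and the real article layout may differ.

```python
import re

# Assumed (hypothetical) line format: "P001 Seat Name: Winner Name (Party)"
ROW_PATTERN = re.compile(r"^P\.?\s*(\d{1,3})\s+(.+?):\s*(.+?)\s*\((.+?)\)\s*$")

# Domain knowledge: parliamentary seats are numbered P001 to P222.
MAX_SEAT_NUMBER = 222


def parse_result_lines(lines):
    """Turn raw text lines into result records, skipping anything implausible."""
    records = []
    for line in lines:
        match = ROW_PATTERN.match(line.strip())
        if match is None:
            continue  # not a result line (e.g. ordinary article prose)
        number = int(match.group(1))
        if not 1 <= number <= MAX_SEAT_NUMBER:
            continue  # seat code outside the valid range: likely a scraping error
        records.append(
            {
                "code": f"P{number:03d}",
                "seat": match.group(2).strip(),
                "winner": match.group(3).strip(),
                "party": match.group(4).strip(),
            }
        )
    return records


def fetch_article_lines(url, css_selector="p"):
    """Load a page with Selenium and return the text of the matching elements.

    The selector is a placeholder; inspect the target article to choose one.
    Selenium is imported here so the parser above works without a browser.
    """
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome()
    try:
        driver.get(url)
        elements = driver.find_elements(By.CSS_SELECTOR, css_selector)
        return [element.text for element in elements]
    finally:
        driver.quit()
```

The records returned by `parse_result_lines` are plain dictionaries, so they can be passed straight to `pandas.DataFrame(records)` for further cleaning. A further check worth adding is that no seat code appears twice.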

Prerequisites

  1. Python
  2. Jupyter Notebook
  3. Pandas
  4. Selenium
  5. Matplotlib

Notebook 2: Visualisation of MPs' Relationships Based on Mentions in News

The relationships between MPs can be visualised as a network diagram built from mentions in news articles: each MP is a node, and a line connects two MPs when one is mentioned in an article about the other. This lets users see which MPs are strongly connected and which are more isolated, giving a picture of MPs' relationships as reflected in news coverage.
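A minimal sketch of the idea follows, under two simplifying assumptions that may not match the notebook: an edge is counted whenever two MPs appear in the same article (co-mention), and names are found by case-insensitive substring matching rather than by SpaCy entity recognition. The function names and the sample inputs are hypothetical.

```python
from collections import Counter
from itertools import combinations


def count_co_mentions(articles, mp_names):
    """Count, for each pair of MPs, the number of articles mentioning both."""
    edge_weights = Counter()
    for text in articles:
        lowered = text.lower()
        # Simplification: substring match stands in for named-entity recognition.
        mentioned = sorted(name for name in mp_names if name.lower() in lowered)
        for first, second in combinations(mentioned, 2):
            edge_weights[(first, second)] += 1
    return edge_weights


def build_graph(edge_weights, mp_names):
    """Build a weighted, undirected NetworkX graph from the co-mention counts.

    NetworkX is imported here so the counting step runs without it installed.
    """
    import networkx as nx

    graph = nx.Graph()
    graph.add_nodes_from(mp_names)  # isolated MPs stay visible as lone nodes
    for (first, second), weight in edge_weights.items():
        graph.add_edge(first, second, weight=weight)
    return graph
```

Edge weights then indicate how often two MPs share coverage, and MPs with no edges appear as isolated nodes. The resulting graph can be handed to PyVis (for example through `Network.from_nx`) to render an interactive diagram.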

Prerequisites

  1. Python
  2. Jupyter Notebook
  3. Pandas
  4. Requests
  5. Newspaper3K
  6. SpaCy
  7. Scikit-Learn
  8. Numpy
  9. NetworkX
  10. PyVis

Contributors

elvinado
