Giter Club home page Giter Club logo

capstone1fccnn's Introduction

Dimitri Kourouniotis Data Science Enthusiast

Analysing FCC Net Neutrality Comments using Machine Learning and NLP

Supervised Machine Learning using NLP by Dimitri Kourouniotis In the winter of 2017 there were numerous articles about quantity of fake comments submitted regarding the repeal of Net Neutrality laws by the FCC.

A blog post published by Jeff Kao caught my attention and I followed up with him on his analysis of the text. He provided me with the unedited 22 million filings available. I analyzed a sample from 3 million of them to see what I could find to develop my own features based around the text of faked comments. Image of Jeff Kao Article


Capstone Report (pdf)

Capstone Summary Slidedeck (pdf)

00 Summary and Table of Contents

01 Importing 3 million FCC records from SQL

02 Email domains Emails from fake domains or fake accounts

03 WordCloud Wordcloud of comments

04 Submission Frequency Suspcious submission timing

05 State Population Estimates 2016 and Comment Percentages

06 Plotting Differences from Average Comments by state variations from population

07 Choropleth grid Map of US Variations mapped

08 Statistics Proportions by State Relative to Population Comments by state variations from Normal

09 Classifiers and Feature Selection


Acknowledgements

Many thanks to my mentor, Rajiv Shah!

Thanks to the following for the data and code help for this capstone:

Data: Jeff Kao
More than a million pro-repeal net neutrality comments were likely faked
https://hackernoon.com/more-than-a-million-pro-repeal-net-neutrality-comments-were-likely-faked-e9f0e3ed36a6

Word Cloud: Nikhil Kumar Singh
wordcloud example
https://github.com/nikhilkumarsingh/wordcloud-example/blob/7a77e97c4da135b67ad924be96269d6bb68a0fe6/mywc.py

Chorogrid Plot: lavinben88
chorogrid tutorial part 2
https://plot.ly/~lavinben88/116/chorogrid-tutorial-part-2-chorogri/

Classifier Iterator: Evgeny Volkov
SMS spam detection with various classifiers
https://www.kaggle.com/muzzzdy/sms-spam-detection-with-various-classifiers

capstone1fccnn's People

Contributors

dimitrikourouniotis avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.