Giter Club home page Giter Club logo

data-analytics-projects's Introduction

Data-Analytics-Projects

Included in this repo are some interesting data manipulation and modelling projects that I worked on over the last few months. All analysis was performed in Python 3 (Jupyter Notebook). Below is a brief introduction to each of the projects included.

For more information on the individual projects including some interesting finds during exploratory analysis, please go into the sub-folders. Also looking to improve existing code and extend current functionality so if anyone has got interesting ideas or suggestions for future work, please do let me know!

Projects using Supervised Learning Models:

  1. Analysis on United Kingdoms road safety and traffic demographics dataset obtained from UK Traffic Dataset - Kaggle with the following key goals:

    • Identify common factors responsible for higher accident rates through various feature engineering techniques
    • Carry out a restrospective study of the historical dataset and perform descriptive analysis (Tableau, Power BI and Excel Power Pivot)
    • Attempt to correct an imbalanced target class (SMOTE, Cluster Centroid, Tomek Links)
    • Perform hyper-paramter tuning using GridsearchCV (scikit-learn python package) to enhance predictive power of several supervised learning models (KNN, SVM, Naive Bayes, Logistic Regression, Random Forest, Gradient Boost - Scikit-learn)
  2. Analyze several thousand tweets collected using Twitters Streaming API in JSON format to perform sentiment analysis and classify them into sub categories for a more general consensus. The topic for this NLP project was the 106th #Greycup/#greycup held in Edmonton in November, 2018. Key analytic goals:

    • Perform a clean data pull from Twitter and transform data for analysis in python (Tweepy)
    • Various descriptive and time series analysis for insights (matplotlib (Basemap), Mapboxgl)
    • Build predictive models to classify sentiment of a tweet (Naive Bayes, SVM - Linear/Polynomial)

data-analytics-projects's People

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.