Giter Club home page Giter Club logo

Hi 👋 Welcome to Kar's Github!


🌱 Demo projects:

Life Expectancy Statistical Analysis (Unsupervised ML + Statistical Analysis) | MFA, PCA, MLR, t-test, Wilcoxon test, Mixed Modeling
Human Resource Data Mining (Supervised + Unsupervised ML) | Gower-PAM clustering, CA, MCA, MLR, KNN, SVM, and Random Forest
Credit Card Market Segmentation (Unsupervised - Clustering + PCA) | CLARA, Hk-means Hybrid, FCM, Model-Based Clustering, DBSCAN
Student Retention Rate of AUS Universities (Visualisation + PCA) | Data Cleaning, EDA, PCA
Food Poison Survey Analysis using MCA (Unsupervised - PC Methods) | Scree plot, Var-Ind-cor Plot, biplot, Cos2, Contrib Statistics
Loan EDA and Machine Learning Prediction (Binomial) | Feature Engineering + ROSE's BOTH-sampling + ROC + 4 Tree Algorithms
Titanic Mortality Analysis (Binomial) | Interative Map + 3 ML algorithms + Interactive Dashboard (ShinyApp.io)
Recommendation of Crop Classes by Predictive Model (Multinomial) | EDA + Feature Engineering + 7 ML algorithms + Plumber App
Statistical Analysis and Machine Learning of Iris Flowers (Multinomial) | Feature Engineering + Statistical Comparison + 14 ML Models
Marketing Analytics (Regression) | Inferential Models + Machine Learning Techniques + Statistical Group Comparison
E-commence Clothes Sales (Regression) | EDA + Text Mining + Inferential Model + MLR + Random Forest
Boston Housing Prices Prediction (Regression) | EDA + 7 ML algorithms + RShiny App
Avocado prices EDA and Forecast | EDA + Forecast + PowerBI
Resort Hotel vs City Hotel | EDA
Bike-Share Big Data Analysis (RPubs) | EDA + Mapping
Brisbane Real Estate Sales (RPubs) | EDA + Mapping
Cucumber Multi-Env Latin Square Field Experiment | Statistical Group Comparison
Maize: Soil-Nutrient CRD Glasshouse Experiment | Statistical Group Comparison
Oats Variety-Fertilizer Split-plot Field Experiment | Statistical Group Comparison
Dirty Data Challenge | Text Wangling, Imputation Model, Dummy Matrix, Data Cleaning and Manipulation


Interactive Dashboard (ShinyApp.io) (Page load time: 35 sec)


github_projects


github_graph

yes


Kar is having fun with some data animation


📫 LinkedIn: https://www.linkedin.com/in/kar-hou-ng/

KAR's Projects

analysis-of-titanic-mortality icon analysis-of-titanic-mortality

Data manipulation, imputation, feature engineering, and machine learning algorithms (K-Nearest neightbour, random forest, and extreme-gradient boosting) were applied to clean the dataset. A final, perfectly cleaned dataset was synthesised for data visualisation to understand the trend in the tragedy.

bike-share_big_data_analysis icon bike-share_big_data_analysis

12 datasets, 3.7 million obs, & 13 vars were cleaned and manipulated for 6 graphs, dynamic map, and statistics to convert casual riders into members.

brisbane_real_estate_sales_2020 icon brisbane_real_estate_sales_2020

320k obs and 11 vars cleaned and manipulated for EDA and mapping (choropleth, cluster, points) to find a new home for a Brisbane family.

credit-card-market-segmentation icon credit-card-market-segmentation

VEV model from Mclust among 5 clustering algorithms has optimal performance and detected 8 distinct groups of users. Data was cleaned, standardized and feature-selected, PCA’s biplot, Ggplot, Radar plots, and parallel coordinate plots were applied for EDA.

food-poison-survey-analysis-using-multiple-correspondence-analysis icon food-poison-survey-analysis-using-multiple-correspondence-analysis

This project applies multiple correspondence analysis (MCA) with the techniques in scree plot, variable plots, individual plots, biplot, cosine square (CO2) and contribution statistcs (contrib) to detect trends in the multivariate food poisoning survey dataset and identified the most probable food that caused the food poison. MCA is one of the principal component methods, and principal componet methods belong to the "unsupervised" machine learning branch.

houston_avocado_prices_eda_-_forecast icon houston_avocado_prices_eda_-_forecast

18k obs & 14 vars cleaned and manipulated for EDA, assumption tests, PP, WO, Ljung-Box, and forecasting (ETS & ARIMA) for avocado prices in the US and Houston.

human-resource-data-mining icon human-resource-data-mining

5 analytical tasks have been completed using VAT validated gower-PAM clustering, Correspondence Analysis (CA), Asym-Biplot, Multiple Correspondence Analysis (MCA), Chi-Squared test, Regression, and predictive classification models with KNN, SVM, and Random Forest.

life-expectancy-statistical-analysis-who- icon life-expectancy-statistical-analysis-who-

Statistically answered 8 research questions using Multiple Factor Analysis (MFA), Principal Component Analysis (PCA), Multiple Linear Regression, Welch's t-test, Wilcoxon signed-rank test, and Longitudinal Multilevel Mixed-effect Modeling with time trajectories.

loan-eda-and-machine-learning-prediction icon loan-eda-and-machine-learning-prediction

Solved 7 business tasks and identified statistical important variables related to loan application. Many plots were synthesised during EDA and machine learning. Models built include Logistic regression, Decision Tree, Bootstrap Aggregating, Random Forest, Fine tuned Extremely Gradient boosting.

marketing_analytics icon marketing_analytics

Solved 9 biz tasks by 18 graphs and 10 statistical methods include dummy data partitioning (RMSE & R2), stepwise model selection, multicollinearity (correlation, VIF), MLR, GLM for logistic regression.

resorthotel_versus_cityhotel icon resorthotel_versus_cityhotel

119k obs & 32 vars cleaned and manipulated to create 14 distinct graphs and statistic tables for an extensive EDA to draw insights.

sales-of-summer-clothes-in-e-commerce- icon sales-of-summer-clothes-in-e-commerce-

Solve 9 analysis tasks and identified the most important variables in driving the success of clothes sales. Achieved via 22 plots, multiple linear regression and random forest

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.