Hwai Teng Teoh's Projects
To develop an Airbnb database and create a pipeline using MongoDB and Hadoop architecture to ease the process of managing, loading, processing, querying, and analyzing Airbnb data based on location
Repository with all what is necessary for sentiment analysis and related areas
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
Malaysia COVID-19 dataset by country, states, districts, confirmed cases types and death cases.
Code Repository for the online course Feature Selection for Machine Learning
Flight delays prediction and analysis: Machine Learning Approach
To predict house prices with creative feature engineering process and adoption of advanced regression techniques
Config files for my GitHub profile.
a free python grammar checker šā
Code repository for the online course Machine Learning with Imbalanced Data
To demonstrate the use of market basket analysis to identify valuable and significant associations between items by using lists of groceries transactions
matplotlib: plotting with Python
Exercise notebooks for Machine Learning modules on Microsoft Learn
NLP based Classification Model that predicts a person's personality type as one of the 16 Myers Briggs personality types. Extremely challenging project dealing with correlation between human psychology and casual writing styles and handling heavily imbalanced classes. Check the app here - https://mb-predictor-motetuzs5q-uc.a.run.app/
To predict baby birthweight using regression and classify baby birthweight groupings (Low, Normal, Overweight) using classification with machine learning techniques.
Text classification task to classify whether the post exhibit cyberbullying contents.
Text classification task to classify the identity of the participant's role associated within the cyberbullying activities
Train unsupervised LDA Topic Model on raw Yelp review text, use topic distributions as feature inputs to supervised classifier of review sentiment
Cuztomized Preprocessing Package for Text data
A case study on the use of Yelp Data for the use of retail sector in big data management that focus on data storage and data access using MongoDB, HDFS and PySpark.
A RShiny web application that provides rapid analysis of the restaurants in different cities of Ontorio based on user's selection of preferences