aml_final_project's Introduction

Predictive Modeling Of MBTI Personality Type Based On Social Media Posts

While much existing research focuses on training machine learning models to predict MBTI type from text-based input, few studies are dedicated to improving the performance of a given machine learning model through hyperparameter tuning. In this paper, we use an existing dataset to predict users’ personality types based on their Twitter posts. Using a random forest as our prediction model, we found that a few parameters have a significant impact on performance:

the splitting ratio of the training and testing sets,
the decision trees’ maximum depth,
the number of estimators, and
the minimum number of samples per leaf.

Our analysis indicates that the best ratio of training to test data is 60:40. In our case, the optimal minimum number of samples per leaf is 13, the decision trees' maximum depth is 15, and the number of estimators is 1500. While these values will not transfer directly to other studies, since every dataset and machine learning model differs, we hope this paper provides some insight into how hyperparameters can affect the performance of a machine learning model.
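As a rough illustration, the reported settings could be expressed with scikit-learn as in the sketch below. This is not the project's actual code: the use of scikit-learn, the TF-IDF featurization, the `evaluate` helper, and the toy posts and labels are all assumptions made for the example. Only the split ratio and the three hyperparameter values come from the text above.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline

# Values reported in the text, written with scikit-learn's parameter names.
REPORTED_PARAMS = {"max_depth": 15, "n_estimators": 1500, "min_samples_leaf": 13}
TEST_SIZE = 0.4  # 60:40 train/test split


def evaluate(posts, labels, test_size=TEST_SIZE, seed=0, **rf_params):
    """Fit TF-IDF + random forest; return (n_train, n_test, test accuracy)."""
    X_train, X_test, y_train, y_test = train_test_split(
        posts, labels, test_size=test_size, stratify=labels, random_state=seed
    )
    model = make_pipeline(
        TfidfVectorizer(),  # assumed featurization; the source does not name one
        RandomForestClassifier(random_state=seed, **rf_params),
    )
    model.fit(X_train, y_train)
    return len(X_train), len(X_test), model.score(X_test, y_test)


# Hypothetical toy data standing in for the real posts and MBTI labels.
posts = (["i love quiet evenings with a book"] * 25
         + ["let us go out and meet new people tonight"] * 25)
labels = ["I"] * 25 + ["E"] * 25

n_train, n_test, accuracy = evaluate(posts, labels, **REPORTED_PARAMS)
print(n_train, n_test, accuracy)
```

Calling `evaluate` with other values for `test_size` or the forest parameters gives a simple way to compare configurations on the same data. A fixed `seed` keeps such comparisons repeatable.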

**KEYWORDS** Machine Learning, MBTI, Social Media, KNN, Random Forest, SVM

kattt999 / aml_final_project
