Giter Club home page Giter Club logo

telco-customer-churn's Introduction

Telco Customer Churn Project Report

Author: Mehmet Reşit Çağan

Introduction

In this project, I aimed to analyze telco customer churn data and build machine learning models to predict customer churn. This report summarizes the steps I took, the achievements, and the evaluation results of my models.

Data Preprocessing

I performed several data preprocessing steps, including:

  • Visualizing missing values to identify any data gaps.
  • Dropping unnecessary columns that do not contribute to the churn prediction.
  • Imputing missing values using appropriate strategies.
  • Conducting data visualization to explore the relationship between churn and other attributes.

Data Splitting and Preprocessing

To prepare the data for modeling, I split it into train and test sets. I also applied the following preprocessing techniques:

  • Standardizing numeric attributes to ensure they have zero mean and unit variance.
  • Performing one-hot encoding and label encoding to handle categorical variables.

Model Evaluation

I evaluated several machine learning models using the test set and obtained the following accuracy scores:

Model Accuracy Score
Gradient Boosting Classifier 0.8078
Random Forest 0.8026
AdaBoost Classifier 0.7936
Logistic Regression 0.7823
SVM 0.7345
Dummy Classifier 0.7345
Decision Tree 0.7335
KNN 0.7113

From the evaluation, I observed that the Gradient Boosting Classifier achieved the highest accuracy score of 0.8078.

Hyperparameter Tuning

To further improve the performance of the Gradient Boosting Classifier, I performed hyperparameter tuning using RandomizedSearchCV. Before tuning, the model achieved an accuracy score of 0.8078. After conducting the tuning process, I obtained an increased accuracy score of 0.8083. This improvement demonstrates the effectiveness of hyperparameter tuning in enhancing the performance of the model.

Conclusion

In conclusion, I successfully analyzed the telco customer churn data and built machine learning models to predict churn. The Gradient Boosting Classifier showed the highest accuracy, and after hyperparameter tuning, the accuracy slightly improved. My project highlights the importance of data preprocessing, model evaluation, and hyperparameter tuning in achieving accurate predictions for telco customer churn.

Data Link: https://www.kaggle.com/blastchar/telco-customer-churn

telco-customer-churn's People

Contributors

mehmetresitcagan avatar

Stargazers

Ali Dikme avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.