loan-defaulters-classifier's Introduction

Loan Default Prediction Model

Overview

This repository contains the implementation of a loan default prediction model using XGBoost. The model is trained to predict whether a loan applicant is likely to default based on various features such as income, credit score, loan amount, etc.

Dataset

The dataset used for training and evaluation contains information on loan applicants, including their financial profiles, employment details, and loan terms. It consists of both numerical and categorical features.

Workflow

The project follows a systematic workflow, including:

  1. Exploratory Data Analysis (EDA): Analyzing the dataset to understand the distributions and relationships of features.
  2. Feature Engineering: Creating new features or transforming existing ones to improve model performance.
  3. Data Preprocessing: Handling missing values, encoding categorical variables, and scaling numerical features.
  4. Handling Imbalanced Data: Using techniques such as SMOTE to address class imbalance.
  5. Model Selection and Hyperparameter Tuning: Experimenting with various classifiers and optimizing hyperparameters using GridSearchCV with StratifiedKFold cross-validation.
  6. Model Evaluation: Assessing model performance using metrics such as accuracy, F1-score, precision, recall, and the area under the ROC curve (AUC).
  7. Selection of Best Model: Identifying the XGBoost classifier as the best-performing model based on evaluation results.
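
Step 4 can be illustrated with a minimal, standard-library sketch of the idea behind SMOTE: each synthetic minority sample is placed at a random point on the line segment between a minority sample and one of its nearest minority neighbours. This is not the repository's code. The function name, the toy feature values, and the choice of `k` are all hypothetical, and a real project would more likely rely on the `SMOTE` class from the imbalanced-learn library.

```python
import math
import random


def smote_oversample(minority, n_synthetic, k=3, seed=0):
    """Return n_synthetic points interpolated between minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_synthetic):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base point (excluding itself)
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical toy data: (income in thousands, credit score / 100)
majority = [(55.0, 7.1), (60.0, 6.8), (48.0, 7.4),
            (70.0, 7.9), (65.0, 7.0), (52.0, 6.9)]
minority = [(30.0, 5.2), (28.0, 4.9), (35.0, 5.5)]

new_points = smote_oversample(
    minority, n_synthetic=len(majority) - len(minority), k=2
)
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))  # prints "6 6"
```

Oversampling of this kind is normally applied to the training split only, so that the evaluation data keeps the original class distribution.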

Model Performance Evaluation

  • Accuracy: 86.14%
  • F1-score (Class 1): 83.91%
  • Precision (Class 1): 98.3%
  • Recall (Class 1): 73.91%
  • AUC (Class 1): 91.8%
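
For reference, the first four metrics above can all be derived from a confusion matrix. The sketch below shows the standard formulas; the counts passed in are hypothetical and are not the repository's actual confusion matrix. AUC is not included because it depends on predicted scores rather than on a single set of counts.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall and F1 for the positive class."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)   # share of predicted defaulters that are real
    recall = tp / (tp + fn)      # share of real defaulters that are caught
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical counts for illustration only
metrics = classification_metrics(tp=74, fp=2, fn=26, tn=98)
for name, value in metrics.items():
    print(f"{name}: {value:.2%}")
```

Applying the harmonic-mean formula to the reported precision (98.3%) and recall (73.91%) gives roughly 84.4%, close to but not identical with the reported F1-score of 83.91%. The gap may reflect rounding or averaging across folds, which the source does not specify.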

Conclusion

Of the classifiers evaluated, XGBoost performed best at predicting loan defaulters, with an accuracy of about 86% and an AUC of about 92%. Its precision on the default class is high (about 98%), so applicants flagged as likely defaulters are rarely false alarms, while its recall of about 74% means that roughly one in four actual defaulters is still missed.

Future Work

Potential areas for further improvement include:

  • Experimenting with additional feature engineering techniques.
  • Exploring advanced algorithms or ensemble methods.
  • Conducting more extensive hyperparameter tuning to fine-tune model performance.
  • Evaluating model robustness using cross-validation or validation on external datasets.
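
The cross-validation idea in the last point can be sketched as follows. This is a simplified, standard-library illustration of stratified k-fold splitting, in which every fold keeps roughly the same share of defaulters as the full dataset. The function name and the labels are hypothetical, and the sketch does not shuffle; scikit-learn's `StratifiedKFold`, which the workflow names, would be the usual choice in practice.

```python
from collections import defaultdict


def stratified_kfold_indices(labels, n_splits):
    """Yield (train_idx, test_idx) pairs with similar class ratios per fold."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(n_splits)]
    for indices in by_class.values():
        # Deal each class's indices round-robin so every fold gets its share
        for position, idx in enumerate(indices):
            folds[position % n_splits].append(idx)

    for k in range(n_splits):
        test_idx = sorted(folds[k])
        train_idx = sorted(
            i for j, fold in enumerate(folds) if j != k for i in fold
        )
        yield train_idx, test_idx


# Hypothetical labels: 1 = default, 0 = repaid (3 defaulters out of 12)
labels = [0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 1, 0]
splits = list(stratified_kfold_indices(labels, n_splits=3))
for train_idx, test_idx in splits:
    positives = sum(labels[i] for i in test_idx)
    print(len(train_idx), len(test_idx), positives)  # prints "8 4 1" each time
```

Comparing a metric such as recall across the folds gives a rough sense of how stable the model is on different subsets of the data.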

Contributors

  • vn33
