Giter Club home page Giter Club logo

loan-eligibility-project's Introduction

Loan-eligibility-project

In this project, I built machine learning models to predict the eligibility of intended loan borrower. These models could help money lenders and investors automate the loan eligibility process (real time) based on customer detail provided while filling online application form. These details are Gender, Marital Status, Education, Number of Dependents, Income, Loan Amount, Credit History and others. I used LendingClub dataset with 614 loans and 13 variables for each loan. In training the models, I only used features that where readily available on the table. These features included, among others, the borrower's gender, marital status, education, loan amount and credit history. Credit history was found to be the variable that affected loan status the most with 80% of those recorded to have a credit history being eligible.

The modeling process took several steps, including: removing loan features with significant missing data, or that aren't known to investors; exploring, transforming, and visualizing the data; creating dummy variables for categorical features; and fitting four models: logistic regression, random forest, k-nearest neighbors and support vector machines. I use machine learning pipelines to combine imputation, standardization, dimension reduction, and model fitting into one pipeline object. I found that the four models performed similarly well according to the jaccard similarity scores on the testing data.

The algorithms scored as follows: KNN 0.81, Decision Tree 0.79, SVM 0.79, Logistic Regression 0.80,

I chose K nearest neighbors as the final model, which obtained an jaccard score of 0.81 on a test set consisting of the most recent 30% of the loans. This proved to be the highest scoring model. I also found that the most impactful variables for predicting charge-off was the credit history variable. All the analysis is done in a Python Jupyter Notebook, utilizing the packages numpy, pandas, matplotlib, seaborn, and scikit-learn.

loan-eligibility-project's People

Contributors

obonggodfrey avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.