Light

hardikroutray / customersegmentation Goto Github PK

View Code? Open in Web Editor NEW

0.0 1.0 0.0 2.26 MB

Jupyter Notebook 100.00%

customersegmentation's Introduction

Using credit info of about 9000 customers, we attempt to cluster customers for targeted recommendations or advertisement. The dataset is obtained from kaggle. K-Means clustering is employed. PCA and Autoencoders are also used for dimensionality reduction. This is an unsupervised learning problem. The features are explained in detail inside the notebook.

KEY TAKEAWAYS

Data visualization already indicates a couple of features ('PURCHASE_FREQUENCY' and 'TENURE') that could be used for segmentation. However, for a deeper analysis, we use K-means clustering.
The optimal no. of clusters is found to be 7 for the original dataset without any dimensionality reduction. The elbow method is used for determining this.
On examining the clusters, we find some key features that help segment the customers further. The key features are 'BALANCE', 'CASH ADVANCE', 'CREDIT LIMIT' and '% FULL PAYMENTS'.
PCA is used to visualize the clusters with clear decision boundaries on a reduced 2-dimensional plane.
Autoencoder is used for dimensionality reduction of the original dataset and K-Means employed on the encoded dataset. This leads us to a reduced no. of optimal clusters. PCA is used for visualizing this on a 2-dimensional plane.

Features and their correlation

Clusters using PCA

customersegmentation's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.