top10_kaggle_datathon_e-commerce_customer_classification's Introduction

Predicting customer segments from e-commerce data

Problem type: Multiclass Classification

This repository includes my solution and jury presentation for the BTK Akademi Datathon 2023. I entered the competition solo and was ranked in the top 10 by the jury's selection, out of 359 competitors and 255 teams.

Solution

  • A detailed exploratory data analysis (EDA) phase, followed by multiple pivot tables
  • Feature engineering: extracting new numerical features, trying the experimental "cluster feature" method, and computing statistical features per cluster group
  • Feature selection with Sequential Feature Selection, RFECV, and SHAP (not included in this repo)
  • Model selection / model re-evaluation
  • Detection and analysis of the samples misclassified by each of the Random Forest, XGBoost, CatBoost, and LightGBM models
  • Hyperparameter tuning with Optuna
  • Creating the final submission with the chosen final feature set and model architecture

  • I also included every helper function that I used throughout the different sections of the solution
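
To make the "cluster feature" step in the list above more concrete, here is a minimal sketch of one way it could look. It is not the code from this repository: the choice of KMeans, the scaling step, the function name `add_cluster_features`, the column names, and the toy data are all assumptions made for illustration. The idea shown is to assign each row a cluster label and then attach per-cluster statistics (mean and standard deviation) as new columns.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler


def add_cluster_features(df, cols, n_clusters=3, random_state=0):
    """Return a copy of df with a cluster label and per-cluster statistics.

    Hypothetical helper: KMeans is an assumed choice of clustering method.
    """
    out = df.copy()
    # Scale first so no single column dominates the distance computation.
    scaled = StandardScaler().fit_transform(out[cols])
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=random_state)
    out["cluster"] = km.fit_predict(scaled)

    # Per-cluster mean and std of each column, broadcast back to every row.
    grouped = out.groupby("cluster")[cols]
    for stat in ("mean", "std"):
        stats = grouped.transform(stat)
        stats.columns = [f"{c}_cluster_{stat}" for c in cols]
        out = pd.concat([out, stats], axis=1)
    return out


# Hypothetical toy data; the real competition columns are not shown here.
rng = np.random.default_rng(0)
toy = pd.DataFrame({
    "annual_income": rng.normal(50_000, 15_000, 200),
    "basket_size": rng.normal(5, 2, 200),
})
features = add_cluster_features(toy, ["annual_income", "basket_size"])
print(features.head())
```

In a real pipeline the scaler and the clustering model would normally be fitted on the training split only and then applied to the validation and test splits, so that the new features do not leak information across splits.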

External Data/Sources


https://www.kaggle.com/so24def

