
km_challenge

Machine Learning with Kernel Methods 2021

KKML Kaggle Challenge: https://www.kaggle.com/c/machine-learning-with-kernel-methods-2021/overview

Team Name: Ker et Paix

Louis Bouvier, Roman Castagné & Julia Linhart

MVA 2020/21 - ENS Paris-Saclay

Overview:

This repository contains the Python implementation of our solution to the Kaggle challenge linked above. Our methods use several kernels, working either on bags of words or on raw string data. They are briefly described in the provided report, which also gives a justified ranking.

We have implemented from scratch the following models:

  • Logistic Regression
  • Ridge Regression
  • Kernel Ridge Regression
  • Kernel Support Vector Machines
  • Multiple Kernel Learning
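As an illustration of what such a from-scratch model involves, here is a minimal sketch of kernel ridge regression on a precomputed Gram matrix. The function names `fit_krr` and `predict_krr`, the parameter `lam`, and the `lam * n` scaling of the penalty are assumptions made for this example; they are not taken from the repository's `KernelRidgeRegressor` class.

```python
import numpy as np


def fit_krr(K, y, lam=1e-3):
    """Solve (K + lam * n * I) alpha = y for the dual coefficients.

    K is the (n, n) Gram matrix of the training set, y the (n,) targets.
    The names and the penalty scaling are hypothetical choices.
    """
    n = K.shape[0]
    return np.linalg.solve(K + lam * n * np.eye(n), y)


def predict_krr(K_test_train, alpha):
    """Predict with f(x) = sum_i alpha_i k(x, x_i).

    K_test_train has shape (n_test, n_train).
    """
    return K_test_train @ alpha


# Toy usage with a linear kernel and labels in {-1, +1}.
X_train = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]])
y_train = np.array([1.0, -1.0, -1.0])
alpha = fit_krr(X_train @ X_train.T, y_train, lam=1e-3)
scores = predict_krr(X_train @ X_train.T, alpha)
labels = np.sign(scores)  # threshold the real-valued scores at zero
```

For classification, the real-valued scores are thresholded at zero, as in the last line of the sketch.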

We have worked with the following kernels:

  • Gaussian kernel on bags of words
  • Spectrum kernel on raw sequences
  • Mismatch kernel on raw sequences
  • Substring kernel on raw sequences
  • Fisher kernel on raw sequences
  • TF-IDF extraction
  • Sum of kernels
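To make the string kernels more concrete, here is a minimal sketch of the spectrum kernel, which compares two sequences through the counts of their length-k substrings (k-mers). The function names, the default `k=3`, and the example strings are assumptions for illustration only and do not reflect the actual contents of `kernels.py`.

```python
from collections import Counter

import numpy as np


def kmer_counts(seq, k):
    """Count every contiguous substring of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(s, t, k=3):
    """Inner product of the k-mer count vectors of s and t."""
    cs, ct = kmer_counts(s, k), kmer_counts(t, k)
    # A Counter returns 0 for a missing key, so only shared k-mers contribute.
    return sum(count * ct[kmer] for kmer, count in cs.items())


def spectrum_gram_matrix(seqs, k=3):
    """Symmetric Gram matrix of the spectrum kernel over a list of strings."""
    n = len(seqs)
    G = np.zeros((n, n))
    for i in range(n):
        for j in range(i, n):
            G[i, j] = G[j, i] = spectrum_kernel(seqs[i], seqs[j], k)
    return G


# Hypothetical example strings.
G = spectrum_gram_matrix(["ACGT", "ACGA", "TTTT"], k=2)
```

Because a sum of positive semidefinite kernels is itself a valid kernel, Gram matrices built this way for different values of k can simply be added together.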

Content of the Repository:

Models and kernels are implemented from scratch in:

  • linear_models.py containing the classes LogisticRegressor and RidgeRegressor
  • kernel_models.py containing the classes KernelRidgeRegressor, KernelSVM and KernelMKL
  • kernels.py containing all the functions necessary to build the kernels listed above

Supplementary files:

  • functions.py: optimization routines used by these models (e.g. the Newton method used for Logistic Regression)
  • utils.py: utility functions to load precomputed kernels, initialize a model, run it (training and evaluation on all 3 datasets), and save the results to a .csv file in the format required for submission
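For readers unfamiliar with the Newton method mentioned above, here is a minimal sketch of Newton's method applied to L2-regularized logistic regression with labels in {0, 1}. The function name `newton_logistic`, its parameters, and the regularization term are assumptions for this example; the routine in `functions.py` may be organized differently.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def newton_logistic(X, y, lam=1e-2, n_iter=20, tol=1e-8):
    """Minimize the mean logistic loss plus (lam / 2) * ||w||^2.

    X has shape (n, d), y has shape (n,) with entries in {0, 1}.
    All names and defaults here are hypothetical.
    """
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_iter):
        p = sigmoid(X @ w)
        grad = X.T @ (p - y) / n + lam * w
        # Hessian: X^T diag(p * (1 - p)) X / n + lam * I
        hess = (X.T * (p * (1.0 - p))) @ X / n + lam * np.eye(d)
        step = np.linalg.solve(hess, grad)
        w -= step
        if np.linalg.norm(step) < tol:
            break
    return w


# Toy usage: an intercept column plus one feature.
X = np.array([[1.0, -2.0], [1.0, -1.0], [1.0, 1.0], [1.0, 2.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])
w = newton_logistic(X, y)
pred = (sigmoid(X @ w) > 0.5).astype(int)
```

The regularization term keeps the Hessian invertible and the solution finite even when the classes are linearly separable, as they are in the toy data.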

Finally, the Python notebook KM_challenge.ipynb contains example runs of our different methods.
