Giter Club home page Giter Club logo

nonlinearcomponentanalysis's Introduction

Binder

Nonlinear Component Analysis as a Kernel Eigenvalue Problem

Course project for IE529: Stats of Big data & Clustering, 2017 Fall, UIUC

Description

We implemented the experiments presented in the paper Nonlinear Component Analysis as a Kernel Eigenvalue Problem by Bernhard Schölkopf, Alexander Smola, Klaus-Robert Müller [1]. All the articles, references and documentations are present in docs folder. Also, we encourage you to have an interactive notebook session via Binder either by clicking the above launch binder tag.

Note: click on the direct link if the above binder tag is non responsive

Direct link - https://mybinder.org/v2/gh/namanUIUC/NonlinearComponentAnalysis/master

Dependencies

In order to run the experiments, make sure you have all dependencies installed

matplotlib (>= 2.0.0)
scipy (>=0.19.0)
numpy (>=1.12.1)
sklearn (>=0.0)

You can install them by typing

pip3 install requirements.txt

The prgramming languages we use are Python and MATLAB. If you do not have access to MATLAB on your laptop. We advise you to install Octave instead. You can refer to this webpage for installing.

Experiments in paper

In the paper, there are two major experiments:

  • Toy example: 4-degree Polynomial Kernel PCA
  • Character Recognition (USPS Dataset)

Project implementation

SVM and KPCA on Iris Dataset

Principal Component Analysis (PCA) is a dimensionality reduction technique that is used to transform and a high-dimensional dataset into a smaller dimensional subspace to give a directed impression of the dataset prior to running a machine learning algorithm on the data. We have extracted features for classification of standard IRIS dataset via KPCA.

SVC on USPS Dataset

The dataset contains numeric data obtained from the scanning of handwritten digits from envelopes by the U.S. Postal Service. The original scanned digits are binary and of different sizes and orientations; the images here have been deslanted and size normalized, resulting in 16 × 16 grayscale. We will first extract features via Kernel PCA and apply that to a SVM classifier to train and test on the splitted USPS dataset.

Team

Name Github Homepage
Jvn Karthik N/A
Naman Shukla namanUIUC
Shubham Bansal N/A
Zhenye Na Zhenye-Na
Ziyu Zhou Ziyu0

References

[1]. Nonlinear Component Analysis as a Kernel Eigenvalue Problem

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.