Giter Club home page Giter Club logo

examples's Introduction

In mlpack-model-zoo project, we provide state of the art pre-built models and solutions to some standard datasets.

0. Contents

  1. Introduction
  2. Dependencies
  3. Building-From-Source
  4. Running Models
  5. Current Models
  6. Datasets

1. Introduction

This repository contains a number of different models implemented in C++ using mlpack. To understand more about mlpack refer to the following links. The sample folder contains snippets and smaller models that demonstrate features of mlpack.

2. Dependencies

To run this project you need:

  mlpack
  Armadillo     >= 8.400.0
  Boost (program_options, math_c99, unit_test_framework, serialization,
         spirit)
  CMake         >= 3.3.2
  ensmallen     >= 2.10.0

To install mlpack refer to the install guide.

All of those should be available in your distribution's package manager. If not, you will have to compile each of them by hand. See the documentation for each of those packages for more information.

3. Building from source

To install this project run the following command.

mkdir build && cd build && cmake ../

Use the optional command -D DEBUG=ON to enable debugging.

Run the make file. Use -jN with the following command where N is the number of cores to be used for the build. For instance,

make -j4

4. Running Models

After building the projects all datasets will be unzipped and executables of the model will be made available in bin. You can either execute the same file using:

./bin/fileName

5. Current Models

Currently model-zoo project has the following models implemented:

  • Simple Convolutional Neural Network on MNIST dataset.
  • Multivariate Time Series prediction using LSTM on Google Stock Prices.
  • Univariate Time Series prediction using LSTM on Electricity Consumption Dataset.
  • Variational Auto-Encoder on MNIST dataset.
  • Variational Convolutional Auto-Encoder on MNIST.

6. Datasets

Model-Zoo project has the following datasets available:

1. MNIST

MNIST("Modified National Institute of Standards and Technology") is the de facto “hello world” dataset of computer vision. Since its release in 1999, this classic dataset of handwritten images has served as the basis for benchmarking classification algorithms. As new machine learning techniques emerge, MNIST remains a reliable resource for researchers and learners alike.

Each image is 28 pixels in height and 28 pixels in width, for a total of 784 pixels in total. Each pixel has a single pixel- value associated with it, indicating the lightness or darkness of that pixel, with higher numbers meaning darker. This pixel- value is an integer between 0 and 255, inclusive. The training data set, (train.csv), has 785 columns. The first column, called "label", is the digit that was drawn by the user. The rest of the columns contain the pixel-values of the associated image. For more information refer to this MNIST Database.

2. Google Stock-Prices Dataset

Google Stock-Prices Dataset consists of stock prices for each day from 27th June, 2016 to 27th June, 2019. Each tuple is seperated from its adjacent tuple by 1 day. It consists of following rows that indicate opening, closing, volume and high and low of stocks associated with Google on that day.

3. Electricity Consumption Dataset

Contains electricity consumption of a city for 2011 to 2012, where each tuple is seperated from its adjacent tuple by 1 day.
Each tuple has consumption in kWH and binary values for each Off-peak, Mid-peak, On-peak rows.

examples's People

Contributors

akhandait avatar birm avatar daiviknema avatar e-freiman avatar flying-sheep avatar jeffin143 avatar kartikdutt18 avatar manthan-r-sheth avatar mulx10 avatar rcurtin avatar s1998 avatar shikharj avatar sreenikss avatar yashsharan avatar zoq avatar zsogitbe avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.