Light

jinwoo-jeon / cnn_matlab Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 0.0 32.39 MB

CNN MNIST classification from scratch

MATLAB 100.00%

cnn_matlab's Introduction

#Make CNN for MNIST dataset

###Jinwoo Jeon

##1. 코드 설명 및 실행 Matlab code (repelem 함수 때문에 MATLAB 2015a 이상 버전에서만 실행 가능할 듯)
No external libraries ####RUN_script.m

실행용 스크립트
preproc_data 주석을 해제하면 데이터를 새로 만들고 init 주석을 해제하면 모델을 새로 만듬
학습된 모델을 불러오려는 경우 init을 주석처리하고, .mat파일을 수동으로 불러온 뒤 RUN_script.m을 실행하면 됨
테스트만 하려는 경우 train을 주석하고 test를 주석 해제한 후 실행

####preproc_data.m

MNIST.mat을 로드해서 데이터를 가공하는 스크립트
Data augmentation 및 Mean제거 (test data에서도 train mean제거)

####init.m

학습에 필요한 파라미터 및 모델을 정의하는 스크립트
opt.solver struct에 학습 파라미터가 저장되고, opt.layer struct에 모델 파라미터가 정의됨

####makeModel.m

init.m에서 정의된 모델 파라미터를 바탕으로 weight, bias 등 학습 대상 변수들을 초기화하여 model struct 및 option을 return하는 함수
weight의 경우 Var(1,0)*sqrt(2/n)으로 초기화
bias의 경우 0로 초기화
PReLU의 alpha값은 해당 layer weight로 저장되고 논문에서 사용한 값인 0.25로 초기화

####train.m

학습 loop를 실행하는 함수
train data를 batch size대로 나누고 랜덤한 index를 부여함
learning rate를 iteration이 지나면서 decay 하도록 설정 (inv)
forward.m 함수를 이용하여 각 layer의 output을 계산
forward 계산결과를 이용하여 Cross Entropy (Softmax output) 나 MSE (MLP output) 를 Cost function으로 계산
error와 각 layer outut을 이용하여 backward.m 함수를 실행 (back-propagation)
error와 test 결과를 이용하여 그래프를 plot

####test.m

forward.m 함수를 이용하여 test data에 대한 error rate를 return

####drawFromMat.m

학습된 모델을 import 하여 cost graph와 error rate graph를 그리는 스크립트

####forward.m

CNN model에 input batch를 넣어 각 layer의 output을 return
Convolution layer

(X, Y: 4-dim matrix, batch_size * width * height * channel) (W: 5-dim matrix, 1 * kernel_width * kernel_height * input_channel * output_channel)

MAX pooling layer
Fully-connected layer

(X, Y: 4-dim matrix, batch_size * width * height * channel) (W: 5-dim matrix, 1 * kernel_width * kernel_height * input_channel * output_channel)

ReLU

- PReLU

- Softmax

Cross Entropy 를 Cost function으로 이용하기 위해 softmax를 이용하였음

Dropout

rand함수로 임의의 node를 0으로 비활성화시킴

####backward.m

forward한 결과와 label과의 에러를 이용하여 back-propagation을 수행하여 weight를 update하는 함수
Convolution layer

본 코드에서는 activation layer를 따로 설계했으므로 다음과 같이 error가 propagate됨.

(∂E/∂y는 상위 layer에서 전파된 값을 사용)

또한, weight는 다음과 같이 update됨

Weight Decay term:

Momentum term:

- MAX pooling layer

어디에서 온 에러인지 따로 저장은 안하고 error를 그대로 이전 layer에 전달

Fully-connected layer

Convolution layer와 같은 방식으로 error propagate, weight update

ReLU

ReLU Layer의 input value이 음수이면 error를 죽이고 양수이면 error를 그대로 전달

PReLU

ReLU Layer의 input value이 음수이면 alpha*error를 전달하고 양수이면 error를 그대로 전달
alpha는 논문에 나와있는 대로 0.25로 initialize한 뒤 다음과 같이 update 하였다.

- Softmax

Softmax의 Cross Entropy Cost function

와 Activation function

의 derivative를 이용하여 다음과 같이 Error propagation 식을 얻을 수 있다.

Dropout

Forward path에서 임의로 고른 zero-mask를 이용하여 꺼진 node들은 backward path에서도 error propagation이 안되도록 한다.

##2. 학습 결과 및 성능 ① CPCPFRF

② CPCPFRFS

③ CRPCRPFRFS

④ CPrPCPrPFPrFS

⑤, ⑥ CPrPCPrPFPrDFS, CPrPCPrPFPrDFS (2)

⑦ CPrPDCPrPDFPrDFS

최종 성능: Accuracy 99.33%

cnn_matlab's People

Contributors

Stargazers

Watchers

cnn_matlab's Issues

Images for readme

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.