Jeff Bezos vs Bill Gates (scratch convnet accuracy tops out around 80%)

We're going to classify images, folks. Instead of the lame dogs and cats, which are basically ImageNet categories, we're going to classify two dudes who look very similar (Caucasian, over 50, wealthy, etc.). Fire up your conda environment, install some packages, and let's get on our way :)

Border Collie or Yellow Labrador? (scratch convnet accuracy of 90%!)

Since it's challenging to get our validation accuracy over 80% on the Bezos vs Gates dataset, we move on to a binary classification problem with more distinct subject matter. Is it a border collie or a yellow labrador? Some adventures in this dataset:

deep dive into Keras generator properties (filenames, classes)
confusion matrix overview using sklearn
analyzing misclassified photos
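
A minimal sketch of the last two items, written in plain Python so it runs without Keras or sklearn. It assumes the validation generator was created with shuffle=False, so that its filenames and classes lists line up one-to-one with the model's predictions. The file names, labels, and probabilities below are made up for illustration, and the helper names are hypothetical.

```python
# Hypothetical stand-ins for generator.filenames, generator.classes and the
# sigmoid outputs of the model; real values would come from Keras.
filenames = [
    "border_collie/001.jpg",
    "border_collie/002.jpg",
    "border_collie/003.jpg",
    "yellow_lab/001.jpg",
    "yellow_lab/002.jpg",
    "yellow_lab/003.jpg",
]
true_classes = [0, 0, 0, 1, 1, 1]
probabilities = [0.10, 0.70, 0.20, 0.90, 0.40, 0.80]


def to_labels(probs, threshold=0.5):
    """Turn sigmoid outputs into 0/1 class labels."""
    return [1 if p >= threshold else 0 for p in probs]


def confusion_matrix_2x2(y_true, y_pred):
    """Rows are true classes, columns are predicted classes."""
    matrix = [[0, 0], [0, 0]]
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    return matrix


def misclassified(names, y_true, y_pred):
    """Return the file names whose predicted class differs from the true one."""
    return [n for n, t, p in zip(names, y_true, y_pred) if t != p]


predicted = to_labels(probabilities)
matrix = confusion_matrix_2x2(true_classes, predicted)
wrong = misclassified(filenames, true_classes, predicted)

print(matrix)
print(wrong)
```

sklearn.metrics.confusion_matrix(true_classes, predicted) uses the same layout (rows are true classes, columns are predicted classes), so the hand-rolled helper here is only a dependency-free stand-in. The list in wrong is what you would loop over to open and eyeball the misclassified photos.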

Misc Topics We Talk About

Using Keras to train a convnet from scratch to classify images of Bill Gates or Jeff Bezos (binary classification)
Using the generator utilities Keras provides to avoid reading all the image data into RAM, and to apply transformations to images
Comparing the bottleneck feature generation strategy using VGG16 with the scratch convnet, and why the ImageNet weights didn't provide much of an advantage
Differences in performance between taking the first 50 images Google served as validation data vs taking a large, random subset. The dangers of small data!
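
To put a rough number on the small-data point, here is a sketch that treats each validation image as an independent coin flip and computes the binomial standard error of a measured accuracy. The 80% figure matches the accuracy quoted above; the 500-image size is just an illustrative assumption, not the size used in the videos.

```python
import math


def accuracy_standard_error(accuracy, n_images):
    """Binomial standard error of an accuracy measured on n_images samples."""
    return math.sqrt(accuracy * (1.0 - accuracy) / n_images)


def approx_95_interval(accuracy, n_images):
    """Normal-approximation 95% interval, clipped to [0, 1]."""
    half_width = 1.96 * accuracy_standard_error(accuracy, n_images)
    return max(0.0, accuracy - half_width), min(1.0, accuracy + half_width)


for n in (50, 500):  # 500 is an illustrative size, not the one from the videos
    low, high = approx_95_interval(0.80, n)
    print("n=%d: %.3f to %.3f" % (n, low, high))
```

With 50 images the interval is roughly plus or minus 11 points around 80%, versus about 3.5 points with 500. This only covers sampling noise: the first 50 images Google serves are also not a random sample, which is a separate source of bias that this calculation does not capture.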

Details about my environment (sorry, no requirements.txt)

Python 3.5.2 |Anaconda custom (64-bit)| (default, Jul  2 2016, 17:52:12) 
[GCC 4.2.1 Compatible Apple LLVM 4.2 (clang-425.0.28)] on darwin

>>> import keras
Using TensorFlow backend.
>>> keras.__version__
'2.0.2'

You'll need Firefox along with Selenium WebDriver to get this working. Again, use conda as a package manager and it should be OK, at least on Mac :/

Videos and brief descriptions

First Video: downloading the images, getting a convnet working from scratch with around 75% validation accuracy

Second Video: using a pretrained feature extractor, VGG16, to generate better features, which we'll feed into an MLP (not a conv network).

Third Video: here we delve further into issues around splitting training and validation sets, and step through a basic Python script that splits our images into training and validation sets. I also reflect a bit on these issues.
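
The script from the video isn't reproduced here, so the following is only a sketch of what a random split could look like. It assumes one sub-folder per class inside a source folder; the function name, folder layout, and the 20% validation fraction are all assumptions.

```python
import os
import random
import shutil


def split_images(source_dir, train_dir, validation_dir,
                 validation_fraction=0.2, seed=0):
    """Copy a random subset of each class folder into validation_dir and the
    rest into train_dir. Assumes source_dir/<class_name>/<image files>."""
    rng = random.Random(seed)  # fixed seed so the split is repeatable
    counts = {}
    for class_name in sorted(os.listdir(source_dir)):
        class_path = os.path.join(source_dir, class_name)
        if not os.path.isdir(class_path):
            continue
        files = sorted(
            f for f in os.listdir(class_path)
            if os.path.isfile(os.path.join(class_path, f))
        )
        rng.shuffle(files)
        n_validation = int(len(files) * validation_fraction)
        subsets = {
            validation_dir: files[:n_validation],
            train_dir: files[n_validation:],
        }
        for target_root, subset in subsets.items():
            target = os.path.join(target_root, class_name)
            os.makedirs(target, exist_ok=True)
            for name in subset:
                shutil.copy(os.path.join(class_path, name),
                            os.path.join(target, name))
        # (number of training images, number of validation images)
        counts[class_name] = (len(files) - n_validation, n_validation)
    return counts
```

A call such as split_images("images", "data/train", "data/validation") (hypothetical folder names) would leave the originals untouched and produce a directory layout that Keras's flow_from_directory can read. Shuffling before slicing is the point: it avoids validating on whatever images happened to come first.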

Fourth Video: we switch gears and move on to a better dataset to apply our incremental learning strategies (scratch convnet --> transfer learning). I also move to a Windows machine with a GTX 1080 Ti card.

Fifth Video: we reuse our validation generator, tweaked to generate ordered predictions, and hack some code together to view the images behind some of the inaccurate predictions. We also introduce the confusion matrix.

Sixth Video: using our border collie vs yellow lab model, we begin working on a new problem, "image similarity". We create a new model from our trained one which outputs a 96-d vector for each image, then use some numpy to create a cosine similarity matrix. Finally we spot check the data and find some really cool results!
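
The numpy step can be sketched as follows. The 96-d vectors here are random stand-ins rather than real model outputs, and the helper names are hypothetical; only the cosine similarity arithmetic itself is the part described above.

```python
import numpy as np


def cosine_similarity_matrix(vectors):
    """Return an (n, n) matrix whose [i, j] entry is the cosine similarity
    between rows i and j of vectors."""
    vectors = np.asarray(vectors, dtype=np.float64)
    norms = np.linalg.norm(vectors, axis=1, keepdims=True)
    unit = vectors / np.maximum(norms, 1e-12)  # guard against all-zero rows
    return unit.dot(unit.T)


def most_similar(similarity, index, top_k=3):
    """Indices of the top_k rows most similar to row index, excluding itself."""
    order = np.argsort(-similarity[index])
    return [int(i) for i in order if i != index][:top_k]


# Random stand-ins for the 96-d vectors the truncated model would output.
rng = np.random.RandomState(0)
embeddings = rng.rand(5, 96)
embeddings[4] = 2.0 * embeddings[0]  # same direction as image 0, different length

similarity = cosine_similarity_matrix(embeddings)
print(most_similar(similarity, 0, top_k=1))
```

Because cosine similarity ignores vector length, image 4 comes back as the closest match to image 0 even though its vector is twice as long. With real embeddings, most_similar gives the indices you would map back to file names for spot checking.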

medp1030 / keras-gates-vs-bezos-image-classifier
