Giter Club home page Giter Club logo

region-proposal-object-detection-comparative-performance-analysis's Introduction

Region Proposal Object Detection

Final Project for COMSE 6998: Practical Deep Learning Systems Performance. Region Proposal Object Detection Comparative Performance Analysis.

This project has been implemented using Keras and executed on the K80 GPU on Google Colab. We implement an object detection pipeline as follows:

  • Download the Food101 dataset.
  • Finetune different classifier architectures on the dataset after initializing with ImageNet weights(transfer learning).
  • Tune hyperparameters of selective search and run on the image.
  • Pick the best classifier based on performance metrics. Classify each proposal.
  • Apply Non-Maxima Suppression to filter redundant proposal windows.
  • Return final object detection result.

The repo contains:

  • Python script for object detection.
  • Jupyter notebooks for training and downloading Classifier architectures in the 'Training Logs and Notebooks' folder.
  • Notebook to implement the pipeline in Google Colab as an alternative to the Python script.
  • Directory containing source code of Android App

Instructions

First, pick a classifier architecture of your choice, and execute the corresponding Jupyter Notebook in the 'Training Logs and Notebooks' folder. Download and store the trained model. Execute the Jupyter Notebook for object detection or run the script by following the instructions below:

Clone the Repo:

git clone https://github.com/vm2656/COMSE6998_Final_Project.git

Install Requirements by opening a terminal and running:

pip install -r requirements.txt

Download the Food101 dataset from https://data.vision.ee.ethz.ch/cvl/datasets_extra/food-101/ and extract it.

Open terminal and run the following command:

python region_proposal_detection.py --image example.png

To filter by label, enter the following:

python region_proposal_detection.py --image example.png --filter pizza

Tables

Example Results

After training till about 92% validation accuracy, some of the results produced by the model are shown below:

As you can see, the bounding boxes aren't perfect, but the classification works. For further work, these labelled images can be fed to a YOLO, or an RCNN object detection pipeline can be implemented for better accuracy.

The app was used to classify images using the live camera feed using the tflite model. An example still for the same:

Credits

Credits to the tutorial on transfer learning found in here:

https://medium.com/@manasnarkar/transfer-learning-getting-started-9cebf5855a08

Credits to the app building tutorial found here: https://developers.google.com/learn/topics/on-device-ml#build-your-first-on-device-ml-app

Dataset:

https://data.vision.ee.ethz.ch/cvl/datasets_extra/food-101/

region-proposal-object-detection-comparative-performance-analysis's People

Contributors

vasanthmbandhu avatar aiman9 avatar vm2656 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.