Computer vision is an interdisciplinary scientific field that deals with how computers can be made to gain a high-level understanding of digital images or videos. From the perspective of engineering, it seeks to automate tasks that the human visual system can do. In this Computer Vision Repository, I have upload five projects Related to Computer vision which I have done.
In this project, I made face Recognition using open CV Libraries. Basically, these face Detection projects detect your face with Green Rectangle eyes with Blue Detection and Red Rectangle detect Smile face which your face make a smile. I use haarcascade_eye for detecting eyes, haarcascade_frontalface_default for face detection and haarcascade_smile to detect smile face.
https://github.com/opencv/opencv/tree/master/data/haarcascades
- Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola Michael Jones [email protected] [email protected] Mitsubishi Electric Research Labs Compaq CRL 201 Broadway, 8th FL One Cambridge Center Cambridge, MA 02139 Cambridge, MA 02142
- A Discriminative Feature Learning Approach for Deep Face Recognition Yandong WenKaipeng Zhang Zhifeng LiEmail author Yu Qia
Project Link: https://github.com/AmirAli5/Computer-Vision/tree/master/1.Face%20Recognition%20with%20Emotion%20Capture%20using%20Open%20CV
In this project, I made a detection system that detects real time objects. I use Single Shot Detection (SSD) and Open CV in this project. SSD (Single Shot MultiBox Detector) is a popular algorithm in object detection. It's generally faster than Faster RCNN. I use a randomly 2-second video and apply our model on it detects everything (objects) in video with the rectangle and specify with the name.
- SSD: Single Shot Multi-Box Detector Wei Liu1, Dragomir Anguelov2, Dumitru Erhan3, Christian Szegedy3, Scott Reed4, Cheng-Yang Fu1, Alexander C. Berg1 1UNC Chapel Hill 2Zoox Inc. 3Google Inc. 4University of Michigan, Ann-Arbor 1 [email protected], 2 [email protected], 3 {dumitru,szegedy}@google.com, [email protected], 1 {cyfu,aberg}@cs.unc.edu
- Object recognition and detection with deep learning for autonomous driving applications Ayşegül Uçar, Yakup Demir, Cüneyt Güzeliş
- Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
- Robust Real-time Object Detection Paul Viola Michael J. Jones
Project Link: https://github.com/AmirAli5/Computer-Vision/tree/master/2.Real%20Time%20Object%20Detection%20using%20SSD
In this project create fake images from the real image using Deep Convolutional GANs (Generative Adversarial Networks).For creating new images from the real image we use the CIFAR-10 dataset. In this project, we PyTorch library and build a model (Generator & Discriminator) from scratch. During training a model I run only one number of an epoch because I am the student I can’t have a complete source to run this model for a maximum number of epochs because It’ required a high-level system to run this model which I can’t offer. I run only one number of epochs and achieved more than 80% accuracy.
https://www.cs.toronto.edu/~kriz/cifar.html
- Generative Adversarial Nets Authors: Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley,Sherjil Ozair, Aaron Courville, Yoshua Bengio
- Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec Radford, Luke Metz, Soumith Chintala
- Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks Authors: Emily L. Denton, Soumith Chintala, arthur szlam, Rob Fergus,
- Conditional Generative Adversarial Nets Authors: Mehdi Mirza, Simon Osindero
Project Link: https://github.com/AmirAli5/Computer-Vision/tree/master/3.Image%20Creation%20with%20Deep%20Convolutional%20GANs
Used deep neural network model trained on MNIST dataset to classify images of handwritten digits. It generates automatic predictions for images. Trained the model on the MNIST (Modified National Institute of Standards and Technology) dataset, has a training set of 60,000 examples, and a test set of 10,000 examples. Achieved an accuracy of 0.9871 after training for 5 epochs. Achieved a loss value of 0.0404 after training for 5 epochs. Used Keras with Tensorflow backend for the code. Everything is implemented in the Jupyter notebook which will hopefully make it easier to understand the code.
- Y.LeCun, L. Bottou, Y. Bengio, and P. Haffner. "Gradient-based learning applied to document recognition." Proceedings of the IEEE, 86(11):2278-2324, November 1998.
- Official site for MNIST dataset: http://yann.lecun.com/exdb/mnist/
- Yann LeCun, Professor, The Courant Institute of Mathematical Sciences, New York University
- Corinna Cortes, Research Scientist, Google Labs, New York
- Christopher J.C. Burges, Microsoft Research, Redmond
Project Link: https://github.com/AmirAli5/Computer-Vision/tree/master/4.MNIST-Handwritten-Digits-Recognition-Using-Deep-Neural-Network
In this project, we used a deep convolutional neural network model trained on CIFAR-10 (Canadian Institute For Advanced Research) dataset to recognize multiple objects present in various images. Recognize 10 different objects which are airplanes, cars, birds, cats, deer, dogs, frogs, horses, ships and trucks. Trained the model on the CIFAR-10 (Canadian Institute For Advanced Research) dataset. There are 50000 training images and 10000 test images. Achieved an accuracy of 0.8005 after training for 100 epochs. Achieved a loss value of 0.5659 after training for 100 epochs. Achieved validation accuracy of 0.8147 after training for 100 epochs. Achieved a validation loss value of 0.5372 after training for 100 epochs. Used Keras with Tensorflow backend for the code. CIFAR-10 is a common benchmark in machine learning for image recognition. Code in this directory demonstrates how to use TensorFlow to train and evaluate a convolutional neural network (CNN) on GPU.
- Convolutional Deep Belief Networks on CIFAR-10, Alex Krizhevsky, CS department, University of Toronto
- Learning Multiple Layers of Features from Tiny Images, Alex Krizhevsky, April 8, 2009
- Official site for CIFAR-10 dataset: Computer Science department, University of Toronto (https://www.cs.toronto.edu/~kriz/cifar.html)
- Exploding, Vainishing Gradient descent / deeplearning.ai — Andrew Ng.