Dynamic-Hand-Gesture-Recognition-using-3D-CNN

Hand gesture recognition, in computer science and language translation, is the recognition of hand gestures through mathematical methods. It has become a growing field of research with many applications, including human–computer interaction, sign language, and virtual/augmented-reality gaming technology. Users can perform gestures to control or interact with devices without physically touching them. Many architectures have been designed for gesture detection, but traditional solutions are not robust enough to detect hand gestures with high accuracy in real time when the gestures involve complex patterns. In this paper, we present a fast and efficient algorithm for classifying dynamic hand gestures using 3D convolutional neural networks. Unlike 2D convolutional neural networks, 3D convolutional networks also extract features along the temporal dimension, which suits the analysis of gestures performed in videos. The paper also focuses on improving accuracy and describes data preprocessing and optimization techniques for running model inference in real time at 30 fps. Our method achieves a recognition accuracy of 90.7% on the test videos of the ChaLearn LAP Continuous Gesture dataset. The detection process can be tested on laptops with standard specifications.
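To make the difference between 2D and 3D convolution concrete, here is a minimal NumPy sketch. It is not the repository's model: `conv3d_valid`, the toy clip, and the hand-written temporal kernel are all illustrative assumptions. It only shows that a kernel with extent along the time axis responds to motion between frames, which a purely spatial (per-frame) kernel cannot see.

```python
import numpy as np

def conv3d_valid(clip, kernel):
    """Naive 'valid' 3D cross-correlation of a (T, H, W) clip with a (kt, kh, kw) kernel."""
    t, h, w = clip.shape
    kt, kh, kw = kernel.shape
    out = np.zeros((t - kt + 1, h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            for k in range(out.shape[2]):
                # Each output value mixes kt consecutive frames, not just one.
                out[i, j, k] = np.sum(clip[i:i + kt, j:j + kh, k:k + kw] * kernel)
    return out

# Toy clip (assumption): 4 frames of 5x5 pixels, one bright pixel
# moving one column to the right per frame.
clip = np.zeros((4, 5, 5))
for frame in range(4):
    clip[frame, 2, frame] = 1.0

# Hand-written temporal-difference kernel: next frame minus current frame.
temporal_kernel = np.zeros((2, 1, 1))
temporal_kernel[0, 0, 0] = -1.0
temporal_kernel[1, 0, 0] = 1.0

# Moving pixel: non-zero response wherever the pixel appears or disappears.
motion = conv3d_valid(clip, temporal_kernel)

# Same first frame repeated 4 times (no motion): the response is zero everywhere.
static = conv3d_valid(np.repeat(clip[:1], 4, axis=0), temporal_kernel)
```

In a trained 3D CNN the kernels are learned rather than hand-written, and they usually span space and time at once, but the mechanism of sliding one kernel over several consecutive frames is the same.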


Output Samples

[Image: output_gesture]


Neural Network Architecture

[Image: Neural-Network]


The repository contains self-explanatory Python 3 code covering the following steps:

  1. Fetching the data.
  2. Dataset generation.
  3. Dataset preprocessing.
  4. Defining the 3D CNN model.
  5. Training the model.
  6. Retraining the model.
  7. Creating TensorBoard callbacks for graphical visualization.
  8. Saving the model as a TensorFlow protobuf (.pb) file for future mobile deployment.
  9. Saving and loading the model as a Keras file (.h5).
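As a rough illustration of the dataset generation and preprocessing steps only, the sketch below turns a variable-length list of video frames into a fixed-size clip tensor. The function names, the clip length of 16, and the scaling to [0, 1] are assumptions made for this example; the repository's actual preprocessing may differ, and the frames are assumed to be already resized to a common height and width.

```python
import numpy as np

def sample_frame_indices(num_frames, clip_len):
    """Pick clip_len indices spread evenly over a video of num_frames frames."""
    if num_frames <= 0:
        raise ValueError("video has no frames")
    # When the video is shorter than clip_len, some indices repeat.
    return np.linspace(0, num_frames - 1, clip_len).round().astype(int)

def make_clip(frames, clip_len=16):
    """Turn a list of HxWxC uint8 frames into a float32 (clip_len, H, W, C) tensor in [0, 1]."""
    idx = sample_frame_indices(len(frames), clip_len)
    clip = np.stack([frames[i] for i in idx]).astype(np.float32)
    return clip / 255.0  # assumed normalization: scale pixel values to [0, 1]
```

A fixed clip length matters because a 3D CNN expects every input to have the same temporal extent; uniform sampling keeps the start, middle, and end of each gesture regardless of how long the video is.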

Last but not least:

  1. **Real-time visualization of results on ChaLearn ConGD videos using OpenCV 3.**
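One common way to run a clip-based classifier on a live or continuous video is a sliding window over the most recent frames. The sketch below shows that idea using only the standard library; it is not taken from the repository. `SlidingClipBuffer`, the `classify` callable, and the `clip_len` and `stride` values are hypothetical names and parameters chosen for illustration.

```python
from collections import deque

class SlidingClipBuffer:
    """Keep the most recent clip_len frames and classify once the window is full."""

    def __init__(self, classify, clip_len=16, stride=4):
        self.classify = classify              # hypothetical callable: list of frames -> label
        self.frames = deque(maxlen=clip_len)  # oldest frame is dropped automatically
        self.stride = stride                  # run the model only every `stride` frames
        self.seen = 0
        self.label = None                     # last prediction, reused between model runs

    def push(self, frame):
        """Add one frame and return the current label (None until the window fills)."""
        self.frames.append(frame)
        self.seen += 1
        full = len(self.frames) == self.frames.maxlen
        if full and (self.seen - self.frames.maxlen) % self.stride == 0:
            self.label = self.classify(list(self.frames))
        return self.label
```

In a real loop, each frame would typically come from OpenCV's `cv2.VideoCapture.read()` and the returned label could be drawn on the frame with `cv2.putText` before display. Running the model only every few frames is one way to stay within a 30 fps budget, at the cost of a slightly delayed label.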

Cite as:

Channayanamath M. et al. (2021) Dynamic Hand Gesture Recognition Using 3D-Convolutional Neural Network. In: Satapathy S.C., Bhateja V., Ramakrishna Murty M., Gia Nhu N., Jayasri Kotti (eds) Communication Software and Networks. Lecture Notes in Networks and Systems, vol 134. Springer, Singapore. https://doi.org/10.1007/978-981-15-5397-4_16


Please follow the copyright procedures when downloading the ChaLearn dataset from the official website.

https://gesture.chalearn.org/2016-looking-at-people-cvpr-challenge/isogd-and-congd-datasets
