
sih2k18's Introduction

Smart India Hackathon 2018

Team – Dark_Forest

KPIT: Innovation Award Winner, Smart India Hackathon, 2018

Ministry : Ministry of Environment, Forest and Climate Change

Team Members:

Amartya Ranjan Saikia (Team Lead), Joy Dutta, Pulkit Singh, Sandeep Talukdar, Priyangshu Yogi, Tahera Aktar Laskar; Assam Engineering College, Guwahati, India

Team Mentors:

Lakhya Jyoti Bora, Biju Pegu; Gratia Technology Pvt. Ltd.

ABSTRACT

We provided an affordable, scalable, state-of-the-art end-to-end solution for tackling poaching and tracking animals and humans in protected areas. The solution consists of three parts:

1. Android
2. Web
3. Intelligence (Deep Learning)

The idea is that rangers carry a lightweight Android app on a low-cost handheld device and use it to monitor and report activity in the wildlife area. The app UI has a map showing the real-time location of all rangers in black, along with readings such as temperature and humidity. The UI also has a red button that rangers can press at any time if they detect or suspect poaching activity. As soon as the button is pressed, the longitude and latitude of that ranger are sent to the web server, and the server updates every ranger's app to show the shortest path from each ranger's location to the location in DANGER (in red). The shortest path is computed over the hazy, broken paths inside the jungle, using routing algorithms similar to those used by Uber, with some fine-tuning. The Image Processing part of the Deep Learning section in this repository explains in more detail how these paths are found. The whole idea is large and involves many parts, including hardware set-ups such as Raspberry Pi and camera configuration, to make the solution work in a real-life scenario. Our strategy for installing them and connecting them with the sensors, the monitoring station and the rangers is unique to us.
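The routing step can be pictured as a shortest-path search over a graph of known trail segments. The sketch below uses Dijkstra's algorithm for illustration only; it is not the team's routing code, and the trail network, node names and distances are hypothetical.

```python
import heapq


def shortest_path(graph, start, goal):
    """Return (total_distance, path) from start to goal using Dijkstra's algorithm.

    graph maps a node to a dict of {neighbour: distance_in_metres}.
    Returns (float("inf"), []) when the goal cannot be reached.
    """
    queue = [(0.0, start, [start])]
    best = {start: 0.0}
    while queue:
        dist, node, path = heapq.heappop(queue)
        if node == goal:
            return dist, path
        if dist > best.get(node, float("inf")):
            continue  # stale queue entry, a shorter route was already found
        for neighbour, weight in graph.get(node, {}).items():
            new_dist = dist + weight
            if new_dist < best.get(neighbour, float("inf")):
                best[neighbour] = new_dist
                heapq.heappush(queue, (new_dist, neighbour, path + [neighbour]))
    return float("inf"), []


# Hypothetical trail network: edge weights are trail lengths in metres.
TRAILS = {
    "ranger_A": {"junction_1": 400.0, "junction_2": 900.0},
    "junction_1": {"ranger_A": 400.0, "junction_2": 300.0, "alert": 1200.0},
    "junction_2": {"ranger_A": 900.0, "junction_1": 300.0, "alert": 500.0},
    "alert": {"junction_1": 1200.0, "junction_2": 500.0},
}

if __name__ == "__main__":
    print(shortest_path(TRAILS, "ranger_A", "alert"))
```

In a deployment along these lines, the server would run one such search per ranger whenever the red button is pressed, with `alert` placed at the reported coordinates.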

I personally worked on and led the Deep Learning parts of our solution, and I share some insights into them here. The Android and Web parts are unique and are not discussed here. The connectivity and integration of the standalone parts (Deep Learning, Android, Web, Hardware, etc.) were demonstrated to the judges in the Grand Finale, together with a cost-effective business plan.

DEEP LEARNING

The Deep Learning section comprises Object Detection, Image Classification, Audio Classification, Image Processing and Predictive Analysis. Each section is described in detail below.

➊ Object Detection

1. Real Time
2. Video Feed

YOLOv2 trained on COCO was used for object detection. Darkflow, a TensorFlow port of Darknet, was used to process the frames. Darknet is an open-source neural network framework written in C and CUDA.

The output videos from the hackathon are: Webcam, Mobile Camera

Several images of guns, poachers and rangers were scraped from the internet, and YOLOv2 was trained from scratch to detect these object classes in the frames. Due to the lack of data and time in the hackathon, high accuracy and confidence were not achieved.

A wildlife video feed was passed through YOLO, and animals and humans were tracked with high confidence. Some of the frames are:

Deep learning depends heavily on data. Because COCO has few animal categories, strong detection was not achieved. ImageNet has many more animal categories, and we compare it with COCO in the Image Classification section. The best strategy would be to build a large animals-only dataset, which would also let us classify breeds and types.

We also tried YOLO on thermal, night-vision, IR and normal video feeds, and the differences were astonishing. Please check 1:00 of this video to get the gist.

Several frames showing object localization and classification follow.

The classes of the detected bounding boxes are sent to the servers in real time in .json and .csv format. This is the text sent to the servers for the detected classes.
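As an illustration of what such a payload might look like, here is a minimal sketch that serializes one frame's detections to both JSON and CSV using only the Python standard library. The field names (`label`, `confidence`, `x_min`, and so on) and the sample values are assumptions made for this example, not the format used in the hackathon.

```python
import csv
import io
import json

# Hypothetical column order for the CSV form of a detection.
FIELDS = ["label", "confidence", "x_min", "y_min", "x_max", "y_max"]


def detections_to_json(frame_id, detections):
    """Serialize the detections of one frame as a JSON string."""
    return json.dumps({"frame": frame_id, "detections": detections}, sort_keys=True)


def detections_to_csv(frame_id, detections):
    """Serialize the detections of one frame as CSV text, one row per box."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["frame"] + FIELDS)
    for det in detections:
        writer.writerow([frame_id] + [det[name] for name in FIELDS])
    return buffer.getvalue()


# Made-up detections for a single frame.
SAMPLE = [
    {"label": "person", "confidence": 0.87,
     "x_min": 34, "y_min": 50, "x_max": 120, "y_max": 260},
    {"label": "elephant", "confidence": 0.64,
     "x_min": 200, "y_min": 80, "x_max": 480, "y_max": 330},
]

if __name__ == "__main__":
    print(detections_to_json(12, SAMPLE))
    print(detections_to_csv(12, SAMPLE))
```

JSON keeps the nesting of frames and boxes, while the flat CSV form is easier to append to a log and load into a spreadsheet.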

➋ Image Classification

Classifying images of wildlife and drawing a bounding box are essential for detection. We used both ResNet50 and VGG16, trained on the ImageNet and COCO datasets. Since COCO is a dataset of everyday real-life objects, animal classification with it is harder and less accurate: it can detect and bound the animal, but often does not classify it correctly. The COCO outputs are:

As we can see:

1> Correct Bounding Box, Incorrect Classification
2> Correct Bounding Box, Correct Classification
3> Correct Bounding Box, Incorrect Classification though image has camouflage
4> Correct Bounding Box, Correct Classification

The second output, from ImageNet:

The ImageNet classification is more accurate for animals. Its output classes included different types of tigers and elephants, whereas with COCO the classifier was confused between a tiger and a bear for an image of a cheetah. Comparing the COCO and ImageNet outputs, we conclude that higher accuracy requires a dataset consisting only of animals.

➌ Audio Classification

With sensors placed in the jungle, we intended to classify audio to track and detect animals. A sound can be represented by its frequency spectrum, and each animal's call occupies a different range of frequencies. In other words, each animal has a different frequency image (spectrogram), which can be learned with a Convolutional Neural Network (CNN). The ESC-50 dataset, distributed through the Harvard Dataverse, is a collection of environmental sounds and serves the purpose for demonstration. A large audio dataset of animals would be required for a real-life implementation. Google AI has a similar experiment based on similar concepts: BIRD SOUNDS. The difference in frequencies of CAT vs DOG can be viewed here: Here is the sample output.
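To make the idea of a "frequency image" concrete, the sketch below computes a small spectrogram with a naive discrete Fourier transform and reads off the dominant frequency of two synthetic tones. It is a pure-Python illustration under simplifying assumptions (pure sine tones, no windowing, tiny frames), not the pipeline used in the project. A practical system would more likely compute mel spectrograms with an audio library and feed them to the CNN.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 of one audio frame."""
    n = len(frame)
    return [
        abs(sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2 + 1)
    ]


def spectrogram(signal, frame_size=64, hop=32):
    """List of magnitude spectra, one per overlapping frame (time x frequency)."""
    return [
        magnitude_spectrum(signal[i:i + frame_size])
        for i in range(0, len(signal) - frame_size + 1, hop)
    ]


def dominant_frequency(signal, sample_rate, frame_size=64, hop=32):
    """Frequency in Hz of the strongest non-DC bin, averaged over all frames."""
    frames = spectrogram(signal, frame_size, hop)
    mean = [sum(f[k] for f in frames) / len(frames) for k in range(len(frames[0]))]
    peak = max(range(1, len(mean)), key=mean.__getitem__)
    return peak * sample_rate / frame_size


def tone(freq, sample_rate, n):
    """Synthetic sine tone standing in for a recorded animal call."""
    return [math.sin(2 * math.pi * freq * t / sample_rate) for t in range(n)]


if __name__ == "__main__":
    rate = 8000
    print(dominant_frequency(tone(500, rate, 256), rate))
    print(dominant_frequency(tone(2000, rate, 256), rate))
```

The two tones produce energy in different rows of the spectrogram, which is the kind of difference a CNN can learn to separate when the inputs are real recordings.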

➍ Image Processing

We used image processing algorithms to find roads and paths within jungles. We approached this in two ways:

1) Image Processing
2) GPS Tracking

Image processing algorithms inspired by DSTL Satellite Imagery Feature Detection, along with some of our own, were used to find paths within jungles, so that we can compute realistic shortest paths between a ranger and the danger/poacher location. Image processing can only account for paths that are visible from a satellite; it misses tunnels and paths hidden from above. We planned to tackle this by tracking the rangers' GPS locations for a month or so and asking them to patrol the sanctuary via the tunnels and the shortest hidden paths. The recorded GPS tracks can then be overlaid on the map to find the shortest paths within the jungle.
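One way to turn recorded patrols into something a routing algorithm can use is to snap each GPS fix to a coarse grid and connect consecutive cells with edges weighted by great-circle distance. The following is a minimal sketch of that idea, not the team's implementation; the grid resolution, function names and coordinates are assumptions chosen for illustration.

```python
import math

EARTH_RADIUS_M = 6371000.0


def haversine_m(a, b):
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(h))


def snap(point, decimals=4):
    """Round a fix to a grid cell; 4 decimals is roughly 11 m in latitude."""
    return (round(point[0], decimals), round(point[1], decimals))


def build_path_graph(tracks, decimals=4):
    """Build an undirected graph {cell: {neighbour_cell: metres}} from GPS tracks."""
    graph = {}
    for track in tracks:
        cells = [snap(p, decimals) for p in track]
        for a, b in zip(cells, cells[1:]):
            if a == b:
                continue  # ranger has not left the current cell
            d = haversine_m(a, b)
            graph.setdefault(a, {})[b] = d
            graph.setdefault(b, {})[a] = d
    return graph


# Hypothetical patrol tracks as lists of (latitude, longitude) fixes.
TRACKS = [
    [(26.5700, 93.1700), (26.57001, 93.17001), (26.5710, 93.1700)],
    [(26.5710, 93.1700), (26.5710, 93.1710)],
]

if __name__ == "__main__":
    for cell, neighbours in build_path_graph(TRACKS).items():
        print(cell, neighbours)
```

A graph built this way includes paths that satellite imagery cannot see, and it can be merged with the edges extracted by image processing before running a shortest-path search.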

➎ Predictive Analysis

Machine learning is learning from data, and data contains patterns. In our case, we can collect data from past field study reports, journals and articles on the wildlife sanctuary or national park, and study reports by rangers, NGOs, etc. We then analyze this large body of data and predict some of the features.

We made a dummy poaching data report, [data.csv], and derived the following outcomes as graphs.
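A first pass over such a report is often just counting incidents per zone and per month before plotting them. The sketch below shows that step with the standard library; the column names (`date`, `zone`, `incident`) and the rows are invented for illustration and do not reflect the contents of data.csv.

```python
import csv
import io
from collections import Counter

# Made-up rows in an assumed layout; the real data.csv may differ.
SAMPLE_CSV = """date,zone,incident
2017-01-14,north,snare
2017-01-29,east,gunshot
2017-02-03,north,snare
2017-07-21,north,gunshot
2017-07-30,west,snare
"""


def load_rows(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


def count_by(rows, key):
    """Count rows grouped by the value that key(row) returns."""
    return Counter(key(row) for row in rows)


ROWS = load_rows(SAMPLE_CSV)
BY_ZONE = count_by(ROWS, lambda r: r["zone"])
BY_MONTH = count_by(ROWS, lambda r: r["date"][5:7])  # "MM" part of YYYY-MM-DD

if __name__ == "__main__":
    print(BY_ZONE.most_common())
    print(sorted(BY_MONTH.items()))
```

Counts like these can be passed directly to a plotting library to produce bar charts of incidents by zone or by season.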

Conclusion

Thus we provided a low-cost, state-of-the-art solution that can be implemented within the existing infrastructure of wildlife sanctuaries and national parks. Integrating the hardware and software stacks gives a complete solution.

With this, we won the KPIT: Innovation Award in Smart India Hackathon 2018.
