Giter Club home page Giter Club logo

Hi, I'm Girin 👋

  • 🔭 I’m currently working on Computer Vision models from Automatic Defect Recognitions (ADR).
  • 📫 How to reach me: [email protected]

Achievement & Recognitions

Out of a pool of over 2100 data scientists competing for over four months in the largest industrial AI data science competition in the world, I achieved an Global Rank 6 in the BMW SORDI.ai Hackathon 2022. This intense and prestigious competition was organized by some of the biggest names in tech, including Microsoft, NVIDIA, BMW Group + QUT Design Academy, idealworks, and the BMW Group. It was an incredible experience to showcase my skills and compete at such a high level.

The Writer Verification task began as a way to detect potential fraud in the banking sector by verifying signatures. This task is difficult because people's handwriting can vary significantly, making it necessary for the model to learn these variations. The task becomes even more complex in offline settings, where dynamic information about the writing process is not available, such as when writing on electronic equipment. The task of the challenge was to given a pair of handwritten text images, automatically determine whether they are written by the same writer or different writers.

Solution Codebase

Computer Vision Projects

A pip package designed to seamlessly integrate and leverage the potent capabilities of the SAM (Segment Anything Model), requiring only the most minimal dependencies.

This is a project where I use a CNN that can recognize the currency of different denominations. I have also implemented a Streamlit app for easy inference of the trained model. Some practical use cases of the model include:

  1. Currency recognition system for visually impaired/blind people
  2. The project has the potential to develop a currency verification system.

This repository provides a user-friendly solution for training a Faster R-CNN model utilizing any custom COCO dataset. The Faster R-CNN algorithm is a widely used object detection framework known for its efficiency and accuracy in localizing and classifying objects within images. With this repository, one can seamlessly tailor the model to your specific needs. The primary focus of this repository is to streamline the process of training the Faster R-CNN model with any custom COCO dataset.

Publications

Abstract:

Recognition of human activities plays a pivotal role in recent times for surveillance and security. The convolution neural network (CNN) based models are growing to classify human activities using micro- Doppler (μD) signatures. However, a larger number of parameters of the CNN models increases the computation cost and increases the size. The present work introduces a novel lightweight model, “LW −μ DCNN,” to classify human activities. The architecture of LW −μ DCNN has 438998 parameters with 7 layers. A total of six human activities are recorded in the FM CWR dataset, which is in the form of μD signatures. These μD signatures are converted into spectrogram images and are considered as input for the experiments. The size of the LW −μ DCNN model is only 5.2 MB, which is further optimized by considering quantization aware training, “QAT-LW- μ DCNN,” has size of 0.43 MB with minimal loss of accuracy. The extensive analysis shows that the LW −μ DCNN model achieves 97% of classification accuracy with a higher F1-score for every class than the other state-of-the-art models. The present paper also proposed two transfer learning approaches, i.e., InceptionV3 and MobileNetV1, for the experimental studies to classify human activities.


Girin Chutia's Projects

cleanvision icon cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

cvplayer_gc icon cvplayer_gc

The easiest way to visualize your computer vision models inferences.

espcn_pytorch_lighting icon espcn_pytorch_lighting

Pytorch Lightning implementation of Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network

indcurr icon indcurr

Indian Currency Recognition using AI

keras_cv_attention_models icon keras_cv_attention_models

Keras beit,botnet,CMT,CoaT,CoAtNet,convnext,cotnet,davit,efficientdet,edgenext,efficientformer,efficientnet,fbnet,flexivit,gcvit,ghostnet,gmlp,gpvit,halonet,hornet,iformer,lcnet,levit,maxvit,mlp-mixer,mobilevit,nat,nfnets,pvt,regnet,resmlp,resnest,resnext,resnetd,swin,tinynet,uniformer,volo,wavemlp,yolor,yolov7,yolox

sam_onnx icon sam_onnx

A simple package for installing & running Segment Anything (SAM) model in ONNX format.

vision icon vision

Datasets, Transforms and Models specific to Computer Vision

yolact icon yolact

A simple, fully convolutional model for real-time instance segmentation.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.