techthiyanes Goto Github PK

followers: 18 following: 210 repos: 7.2K gists: 2

Name: Thiya

Type: User

Bio: Data Scientist

Location: Bengaluru

Thiya's Projects

vformer

A PyTorch library for Vision Transformers

vg-gplms

The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".

vi-svs

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.

video-captioning

Video Captioning is an encoder-decoder model based on sequence-to-sequence learning

video-classification

Tutorial for video classification / action recognition using 3D CNN and CNN+RNN on UCF101

video-classification-cnn-and-lstm-

Classifies videos into various classes using the Keras library with TensorFlow as the back end.

video-pre-training

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

video-sentiment-analysis

Analyze any video with the help of a deep learning emotion detection model. The model has 72% accuracy. Users can upload a video or capture one for analysis.

video2numpy

Optimized library for large-scale extraction of frames and audio from video.

video2text

📺 An Encoder-Decoder Model for Sequence-to-Sequence learning: Video to Text

videobert

Using VideoBERT to tackle video prediction

videomae

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

videos

Code for the manim-generated scenes used in 3blue1brown videos

videotransformer-pytorch

PyTorch implementation of a collection of scalable Video Transformer Benchmarks.

vila

Incorporating VIsual LAyout Structures for Scientific Text Classification

vilt

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

virtex

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

vision-language-modelling-series

Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations

vissl

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

visual-clustering

Visual Clustering: Clustering Plotted Data by Image Segmentation

visual-question-answering-for-medical-domain

Visualkeras is a Python package to help visualize Keras (either standalone or included in TensorFlow) neural network architectures. It allows easy styling to fit most needs. This module supports layered style architecture generation which is great for CNNs (Convolutional Neural Networks), and a graph style architecture, which works great for most models including plain feed-forward networks.
