techthiyanes
Name: Thiya
Type: User
Bio: Data Scientist
Location: Bengaluru
A PyTorch library for Vision Transformers
The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".
Uses VITS and Opencpop to develop singing voice synthesis; different from VISinger.
Video captioning with an encoder-decoder model based on sequence-to-sequence learning
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Classifies videos into various classes using the Keras library with TensorFlow as the back end.
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Analyze any video with a deep learning emotion detection model. The model has 72% accuracy. Users can upload a video or capture one on the spot for analysis.
Optimized library for large-scale extraction of frames and audio from video.
📺 An Encoder-Decoder Model for Sequence-to-Sequence learning: Video to Text
Using VideoBERT to tackle video prediction
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Code for the manim-generated scenes used in 3blue1brown videos
PyTorch implementation of a collection of scalable Video Transformer benchmarks.
Incorporating VIsual LAyout Structures for Scientific Text Classification
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
Visual Clustering: Clustering Plotted Data by Image Segmentation
Visualkeras is a Python package to help visualize Keras (either standalone or included in TensorFlow) neural network architectures. It allows easy styling to fit most needs. This module supports layered style architecture generation which is great for CNNs (Convolutional Neural Networks), and a graph style architecture, which works great for most models including plain feed-forward networks.
Audio-Visual Speech Separation with Cross-Modal Consistency
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
JAX implementation of ViT-VQGAN
Unofficial implementation of Exploring Plain Vision Transformer Backbones for Object Detection