Thiya's Projects

vformer

A PyTorch library for Vision Transformers

vg-gplms

The code repository for the EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".

vi-svs

Uses VITS and Opencpop to develop singing voice synthesis; different from VISinger.

video-captioning

Video Captioning is an encoder-decoder model based on sequence-to-sequence learning.

video-sentiment-analysis

Analyzes any video using a deep learning emotion detection model with a reported accuracy of 72%. Users can upload a video or capture one directly for analysis.

video2numpy

Optimized library for large-scale extraction of frames and audio from video.

video2text

📺 An Encoder-Decoder Model for Sequence-to-Sequence learning: Video to Text

videomae

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

videos

Code for the manim-generated scenes used in 3blue1brown videos

vila

Incorporating VIsual LAyout Structures for Scientific Text Classification

vilt

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

virtex

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

vision-language-modelling-series

Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations

vissl

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

visualkeras

Visualkeras is a Python package to help visualize Keras (either standalone or included in TensorFlow) neural network architectures. It allows easy styling to fit most needs. This module supports layered style architecture generation which is great for CNNs (Convolutional Neural Networks), and a graph style architecture, which works great for most models including plain feed-forward networks.

visualvoice

Audio-Visual Speech Separation with Cross-Modal Consistency

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

vit-pytorch-1

PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

vitdet

Unofficial implementation of Exploring Plain Vision Transformer Backbones for Object Detection
