Giter Club home page Giter Club logo

Thiya's Projects

vert-papers icon vert-papers

This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).

vescale icon vescale

A PyTorch Native LLM Training Framework

vformer icon vformer

A PyTorch library for Vision Transformers

vg-gplms icon vg-gplms

The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".

vgae_pytorch icon vgae_pytorch

This repository implements variational graph auto encoder by Thomas Kipf.

vggsound icon vggsound

VGGSound: A Large-scale Audio-Visual Dataset

vi-svs icon vi-svs

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.

vico icon vico

Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"

vid2densepose icon vid2densepose

Convert your videos to densepose and use it on MagicAnimate

video-bgm-generation icon video-bgm-generation

Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)

video-captioning icon video-captioning

Video Captioning is an encoder decoder mode based on sequence to sequence learning

video-chatgpt icon video-chatgpt

Video-ChatGPT is a large vision-language model with a dedicated video-encoder and large language model (LLM), enabling video understanding and conversation about videos.

video-llama icon video-llama

Video-LLaMA: An Instruction-Finetuned Visual Language Model for Video Understanding

video-llava icon video-llava

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

video-p2p icon video-p2p

Video-P2P: Video Editing with Cross-attention Control

video-retalking icon video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

video-sentiment-analysis icon video-sentiment-analysis

Analyze any video with the help of the Deep Learning Emotion Detection model. The model is of 72% accuracy. User can Upload a video or can also Capture a video at a time for the analysis.

video2numpy icon video2numpy

Optimized library for large-scale extraction of frames and audio from video.

video2text icon video2text

📺 An Encoder-Decoder Model for Sequence-to-Sequence learning: Video to Text

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.