Giter Club home page Giter Club logo

Yepeng Jin's Projects

animateanyone icon animateanyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

avid icon avid

This respository contains the code for AVID: Any-Length Video Inpainting with Diffusion Model.

awesome-video-diffusion icon awesome-video-diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

cmu-multimodalsdk icon cmu-multimodalsdk

CMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing and processing multimodal datasets.

control-a-video icon control-a-video

Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"

controlvideo icon controlvideo

[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

convokit icon convokit

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.

cross-modal-bert icon cross-modal-bert

CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis(MM2020)

cvpr23_lfdm icon cvpr23_lfdm

The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"

disco icon disco

DisCo: Referring Human Dance Generation in Real World

followyourpose icon followyourpose

[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"

gpt-sovits icon gpt-sovits

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

head_movement_detection icon head_movement_detection

Jupyter notebooks and training data containing manual head movement annotations, speech data and velocity, acceleration and jerk data.

i2vgen-xl icon i2vgen-xl

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

imagen-pytorch icon imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

livephoto-important icon livephoto-important

Official implementations for paper: LivePhoto: Real Image Animation with Text-guided Motion Control

llava icon llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

magic-animate icon magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

magicdance icon magicdance

MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer

meld icon meld

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

mmpose icon mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

mmssl icon mmssl

[WWW'2023] Multi-Modal Self-Supervised Learning for Recommendation

mosei_umons icon mosei_umons

A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.