Shengkun Tang's Projects
This is a repository to record the code during learning algorithm. This repository belongs to KunKun and CongCong.
video detection via YOLO v3 and v4
This project summarize awesome papers related to efficient multimodal
Denoising Diffusion Implicit Models
Unofficial PyTorch Implementation of Denoising Diffusion Probabilistic Models (DDPM)
The official implementation of "DDR-Net: Learning Multi-Stage Multi-View Stereo With Dynamic Depth Range"
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
LLM training code for MosaicML foundation models
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
An open source implementation of CLIP.
Code for the Paper "Improving Diffusion Model Efficiency Through Patching"
PCA-SVD-Autoencoder-Fourier-Wavelet-Transformation-for-denoising
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
Pixel-Perfect Structure-from-Motion with Featuremetric Refinement (ICCV 2021, Oral)
Deep Learning (with PyTorch)
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".