Kingmin's Projects
(CVPR2021) ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic
[NeurIPS 2022] Denoising Diffusion Restoration Models -- Official Code Repository
Code for FMM-Attack: A Flow-based Multi-modal Adversarial Attack on Video-based LLMs
This is the PyTorch implementation of paper: FSR (AAAI 2023 Oral).
[CVPR 2022 Oral] Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations
Arxiv2022 - Activating More Pixels in Image Super-Resolution Transformer
Unofficial implementation of Image Super-Resolution via Iterative Refinement by Pytorch
[ECCV 2020] Invertible Image Rescaling
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
A simple baseline for image restoration with state-space model.
[CVPRW 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Count the MACs / FLOPs of your PyTorch model.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A latent text-to-image diffusion model
Official implementation of our CVPR2023 paper "A Unified Pyramid Recurrent Network for Video Frame Interpolation"
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.