CVPR2023-Papers-with-Code-Demo

☪️ CVPR 2022 paper downloads: add WeChat nvshenj125 with the note "CVPR 2022" to get PDFs of all the papers

☪️ Bonus: register to claim 200 yuan of computing resources: https://www.bkunyun.com/wap/console?source=aistudy (usage instructions)

Follow our WeChat official account: AI算法与图像处理

🌟 CVPR 2023: continuously updated with the latest papers and their open-source code!

Bilibili demos: https://space.bilibili.com/288489574

✋ Note: issues are welcome. Share CVPR 2022 papers and open-source projects to help improve this project!

Paper collections from previous top conferences:

CVPR2021

CVPR2022

ICCV2021

ECCV2022

🎆 Join the group | Welcome

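The CVPR 2023 paper discussion group is now open! If your paper has been accepted, add WeChat nvshenj125 with a note in the form CVPR + name + school/company. Requests must follow this format to be added to the group.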

🔨 Table of Contents (click to jump)

Contents (click on the right to collapse)

Backbone
Dataset
NAS
Knowledge Distillation
Multimodal
Contrastive Learning
Graph Neural Networks
Capsule Network
Image Classification
Object Detection
Object Tracking
Trajectory Prediction
Segmentation
Weakly Supervised Semantic Segmentation
Medical Image Segmentation
Video Object Segmentation
Interactive Video Object Segmentation
Visual Transformer
Depth Estimation
Face Recognition
Face Detection
Face Anti-Spoofing
Age Estimation
Facial Expression Recognition
Facial Attribute Recognition
Facial Editing
Face Reconstruction
Face Swap
Human Pose Estimation
6D Pose Estimation
Hand Pose Estimation (Hand Mesh Recovery)
Video Action Detection
Sign Language Translation
3D Human Reconstruction
Person Re-identification
Person Search
Crowd Counting
GAN
Color-Pattern Makeup Transfer
Font Generation
Scene Text Detection/Recognition
Image Retrieval/Video Retrieval
Image Animation
Image Matting
Super Resolution
Image Restoration
Image Inpainting
Image Denoising
Image Editing
Image Stitching
Image Matching
Image Blending
Image Dehazing
Image Compression
Reflection Removal
Lane Detection
Autonomous Driving
Fluid Reconstruction
Scene Reconstruction
Frame Interpolation
Video Super-Resolution
3D Point Cloud
Label Noise
Adversarial Examples
Other

Backbone

Back to contents

Dataset

Back to contents

NAS

Back to contents

Knowledge Distillation

Generic-to-Specific Distillation of Masked Autoencoders

Paper: https://arxiv.org/abs/2302.14771
Code: https://github.com/pengzhiliang/G2SD

Back to contents

Multimodal

PolyFormer: Referring Image Segmentation as Sequential Polygon Generation

Paper: https://arxiv.org/abs/2302.07387
Code: None

Multimodal Industrial Anomaly Detection via Hybrid Fusion

Paper: http://arxiv.org/pdf/2303.00601
Code: https://github.com/nomewang/m3dm

Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision

Paper: http://arxiv.org/pdf/2303.00462
Code: https://github.com/toytiny/cmflow

Back to contents

Contrastive Learning

Back to contents

Capsule Network

Back to contents

Image Classification

Back to contents

Object Detection

Back to contents

Object Tracking

3D Object Tracking

Back to contents

Trajectory Prediction

IPCC-TP: Utilizing Incremental Pearson Correlation Coefficient for Joint Multi-Agent Trajectory Prediction

Paper: http://arxiv.org/pdf/2303.00575
Code: None

Back to contents

Segmentation

Interactive Segmentation as Gaussian Process Classification

Paper: http://arxiv.org/pdf/2302.14578
Code: None

Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation

Paper: http://arxiv.org/pdf/2302.14250
Code: None

PolyFormer: Referring Image Segmentation as Sequential Polygon Generation

Paper: https://arxiv.org/abs/2302.07387
Code: None

ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution

Paper: http://arxiv.org/pdf/2303.00246
Code: None

Back to contents

Weakly Supervised Semantic Segmentation

Back to contents

Medical Image Segmentation

Back to contents

Video Object Segmentation

Back to contents

Interactive Video Object Segmentation

Back to contents

Visual Transformer

Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors

Paper: http://arxiv.org/pdf/2302.14746
Code: None

ProxyFormer: Proxy Alignment Assisted Point Cloud Completion with Missing Part Sensitive Transformer

Paper: http://arxiv.org/pdf/2302.14435
Code: https://github.com/I2-Multimedia-Lab/ProxyFormer

Back to contents

Depth Estimation

Back to contents

Face Recognition

Back to contents

Face Detection

Back to contents

Face Anti-Spoofing

Back to contents

Face Reconstruction

ProxyFormer: Proxy Alignment Assisted Point Cloud Completion with Missing Part Sensitive Transformer

Paper: http://arxiv.org/pdf/2302.14435
Code: https://github.com/I2-Multimedia-Lab/ProxyFormer

Back to contents

Age Estimation

Back to contents

Facial Expression Recognition

Back to contents

Hand Pose Estimation (Hand Mesh Recovery)

Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes

Paper: http://arxiv.org/pdf/2302.14348
Code: https://github.com/jyunlee/Im2Hands

Back to contents

Frame Interpolation

Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

Paper: http://arxiv.org/pdf/2303.00440
Code: https://github.com/MCG-NJU/EMA-VFI

Back to contents

3D Point Cloud

ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution

Paper: http://arxiv.org/pdf/2303.00246
Code: None

Back to contents

Other

PA&DA: Jointly Sampling PAth and DAta for Consistent NAS

Paper: http://arxiv.org/pdf/2302.14772
Code: https://github.com/ShunLu91/PA-DA

Generic-to-Specific Distillation of Masked Autoencoders

Paper: http://arxiv.org/pdf/2302.14771
Code: https://github.com/pengzhiliang/G2SD

Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger

Paper: http://arxiv.org/pdf/2302.14677
Code: None

Turning a CLIP Model into a Scene Text Detector

Paper: http://arxiv.org/pdf/2302.14338
Code: None

Adversarial Attack with Raindrops

Paper: http://arxiv.org/pdf/2302.14267
Code: None

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning

Paper: http://arxiv.org/pdf/2302.14115
Code: None

DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks

Paper: http://arxiv.org/pdf/2302.14685
Code: None

Neural Video Compression with Diverse Contexts

Paper: http://arxiv.org/pdf/2302.14402
Code: https://github.com/microsoft/DCVC

Learning to Retain while Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation

Paper: http://arxiv.org/pdf/2302.14290
Code: None

Efficient and Explicit Modelling of Image Hierarchies for Image Restoration

Paper: http://arxiv.org/pdf/2303.00748
Code: https://github.com/ofsoundof/grl-image-restoration

Quality-aware Pre-trained Models for Blind Image Quality Assessment

Paper: http://arxiv.org/pdf/2303.00521
Code: None

Renderable Neural Radiance Map for Visual Navigation

Paper: http://arxiv.org/pdf/2303.00304
Code: None

Single Image Backdoor Inversion via Robust Smoothed Classifiers

Paper: http://arxiv.org/pdf/2303.00215
Code: https://github.com/locuslab/smoothinv

Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training

Paper: http://arxiv.org/pdf/2303.00040
Code: None
