Giter Club home page Giter Club logo

awesome-learning-mvs's Introduction

Awesome-Learning-MVS (Methods and Datasets)

Learning-based MVS Methods

  1. Volumetric methods (SurfaceNet)
  2. Depthmap based methods (MVSNet/R-MVSNet and so on)

( 💻 means code available)

ICCV2017

  • 💻 SurfaceNet: An End-to-end 3D Neural Network for Multiview Stereopsis [paper] [Github] [T-PAMI]
  • Learning a Multi-View Stereo Machine [paper] (LSMs can produce two kinds of outputs - voxel occupancy grids decoded from 3D Grid or per-view depth maps decoded after a projection operation.)
  • Learned Multi-Patch Similarity [paper] [supp] (Note: Learning to measure multi-image patch similiarity, NOT end-to-end learning MVS pipeline)

CVPR2018

ECCV2018

  • 💻 MVSNet: Depth Inference for Unstructured Multi-view Stereo [paper] [supp] [Github]

CVPR2019

  • 💻 Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference [paper] [supp] [Github]

ICCV2019

  • 💻 Point-Based Multi-View Stereo Network [paper] [supp] [Github] [T-PAMI] (Point-MVSNet performs multi-view stereo reconstruction in a coarse-to-fine fashion, learning to predict the 3D flow of each point to the groundtruth surface based on geometry priors and 2D image appearance cues)
  • P-MVSNet: Learning Patch-wise Matching Confidence Aggregation for Multi-view Stereo [paper]
  • MVSCRF: Learning Multi-view Stereo with Conditional Random Fields [paper]

AAAI2020

  • Learning Inverse Depth Regression for Multi-View Stereo with Correlation Cost Volume [paper] [Github]

CVPR2020

  • 💻 Cascade Cost Volume for High-Resolutoin Multi-View Stereo and Stereo Matching [paper] [Github]

  • 💻 Deep Stereo using Adaptive Thin Volume Representation with Uncertainty Awareness [paper] [supp] [Github]

  • 💻 Cost Volume Pyramid Based Depth Inference for Multi-View Stereo [paper] [supp] [Github]

  • 💻 Fast-MVSNet: Sparse-to-Dense Multi-View Stereo with Learned Propagation and Gauss-Newton Refinement [paper] [supp] [Github]

  • Attention-Aware Multi-View Stereo [paper]

  • 💻 A Novel Recurrent Encoder-Decoder Structure for Large-Scale Multi-view Stereo Reconstruction from An Open Aerial Dataset [paper] [Github] [data]

ECCV2020

  • 💻 Pyramid Multi-view Stereo Net with Self-adaptive View aggregation [paper] [Github]
  • 💻 Dense Hybird Recurrent Multi-view Stereo Net with Dynamic Consistency Checking [paper] [Github]

BMVC2020

  • 💻 Visibility-aware Multi-view Stereo Network [paper] [Github]

WACV2021

  • Long-range Attention Network for Multi-View Stereo [paper]

CVPR2021

  • 💻 PatchmatchNet: Learned Multi-View Patchmatch Stereo [paper] [Github]

ICCV2021

  • 💻 AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network [paper] [supp] [Github]
  • EPP-MVSNet: Epipolar-Assembling Based Depth Prediction for Multi-View Stereo [paper]
  • Just a Few Points are All You Need for Multi-view Stereo: A Novel Semi-supervised Learning Method for Multi-view Stereo [paper] [supp]

3DV 2021

CVPR 2022

  • 💻 IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo [paper] [supp] [Github]

  • 💻 Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation and Focal Loss [paper][supp] [Github]

  • 💻 RayMVSNet: Learning Ray-Based 1D Implicit Fields for Accurate Multi-View Stereo [paper] [supp] [Github]

  • Non-Parametric Depth Distribution Modelling Based Depth Inference for Multi-View Stereo [paper] [supp]

  • 💻 TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers [paper] [supp] [Github]

  • 💻 Generalized Binary Search Network for Highly-Efficient Multi-View Stereo [paper] [supp] [Github]

  • 💻 Efficient Multi-View Stereo by Iterative Dynamic Cost Volume [paper] [supp] [Github]

  • 💻 MVS2D: Efficient Multi-view Stereo via Attention-Driven 2D Convolutions [paper] [supp] [Github]

ECCV 2022

  • 💻 MVSTER: Epipolar Transformer for Efficient Multi-View Stereo [paper] [Github]
  • 💻 Multiview Stereo with Cascaded Epipolar RAFT [paper] [Github]

Journal Paper

  • MVSNet++: Learning Depth-Based Attention Pyramid Features for Multi-View Stereo. IEEE Transactions on Image Processing [paper]
  • HighRes-MVSNet: A Fast Multi-View Stereo Network for Dense 3D Reconstruction From High-Resolution Images. IEEE Access [paper]
  • 💻 AACVP-MVSNet: Attention-aware cost volume pyramid based multi-view stereo network for 3D reconstruction. ISPRS Journal of Photogrammetry and Remote Sensing [paper] [Github]
  • Learning Inverse Depth Regression for Pixelwise Visibility-Aware Multi-View Stereo Networks. International Journal of Computer Vision [paper]
  • Sparse prior guided deep multi-view stereo. Computers & Graphics [paper]

To Be Continued

Survey Paper

  • A Survey on Deep Learning Techniques for Stereo-based Depth Estimation. IEEE T-PAMI [ArXiv] [IEEE Xplore]
  • Deep Learning for Multi-view Stereo via Plane Sweep: A Survey [paper]
  • Multi-view stereo in the Deep Learning Era: A comprehensive review [paper]

PhD Thesis

Multi-view Stereo Benchmark

  • Middlebury [CVPR06']

    • A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms [paper] [website]
  • EPFL [CVPR08']

    • On Benchmarking Camera Calibration and Multi-View Stereo for High Resolution Imagery [paper]
  • DTU [CVPR2014, IJCV2016]

  • Tanks and Temples [ACM ToG2017]

  • ETH3D [CVPR2017]

    • A Multi-View Stereo Benchmark with High-Resolution Images and Multi-Camera Videos [paper] [supp] [website] [Github]
  • BlendedMVS [CVPR2020]

    • BlendedMVS: A Large-Scale Dataset for Generalized Multi-View Stereo Network [paper] [supp] [Github] [visual]
  • GigaMVS [T-PAMI2021]

    • GigaMVS: A Benchmark for Ultra-large-scale Gigapixel-level 3D Reconstruction [paper] [website]
  • Multi-sensor large-scale dataset for multi-view 3D reconstruction [CVPR2023]

    • Multi-sensor large-scale dataset for multi-view 3D reconstruction [paper] [website]

Large-scale Real-world Scenes

  1. Chinese Style Architectures
  1. Western Style Architectures
  1. Aerial Dataset

Other similar collections

Future works (Personal Perspective)

  • ultra-large-scale 3D Reconstruction: GigaMVS
  • Semantic multi-view 3D Reconstruction

部分论文讲解

如果想看经典MVS论文介绍,可以参照👉中文论文讲解

awesome-learning-mvs's People

Contributors

krahets avatar xyz-qiyh avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

awesome-learning-mvs's Issues

微信二维码提示过期

非常感谢您建立了MVS交流群,但现在扫码提示已过期,请问是否可以再次更新或拉我入群呢
非常感谢~

交流群

你好,可以更新一下交流群吗?感谢

咨询

交流的二维码失效,能否重新提供一下?
十分感谢

colmap请教

嗨,您好,我是从您colmap的文章中过来的,我看了您“已知相机位姿进行colmap重建”,有一些问题,方便加您联系方式交流一下吗?我的微信是15343179786,邮箱是[email protected],如果能得到您的指导,我将感到十分的荣幸!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.