Giter Club home page Giter Club logo

ecodepth's Introduction

ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation

CVPR 2024
Project Page arXiv page IEEE Xplore Paper IEEE Xplore Paper

Suraj Patni*, Aradhye Agarwal*, Chetan Arora

Architecture Diagram

News

  • [Coming soon] Pretrained checkpoints for NYUv2 and KITTI datasets.
  • [March 2024] Training and Inference code released!
  • [Feb 2024] ECoDepth accepted in CVPR'2024.

Installation

git clone https://github.com/Aradhye2002/EcoDepth
cd EcoDepth
conda env create -f env.yml
conda activate ecodepth

Dataset

You can see the dataset preparation guide for NYUv2 and KITTI from here. After Update the paths in the desired bash scripts for evaluation and training accordingly.

Pretrained Models

Please download the pretrained weights form this link and save .ckpt inside <repo root>/depth/checkpoints directory.

Evaluation

To evaluate our performance on NYUv2 and KITTI datasets, use test.py file. The trained models are publicly available, download the models using above links.

  1. Train on NYUv2 dataset:
    bash test_nyu.sh <path_to_saved_model_of_NYU>

  2. Train on KITTI dataset:
    bash test_kitti.sh <path_to_saved_model_of_KITTI>

Training

We trained our models on 32 batch size using 8xNVIDIA A100 GPUs. Please set the NPROC_PER_NODE variable and --batch_size argument to set the batch size. We set them as NPROC_PER_NODE=8 and --batch_size=4. So our effective batch_size is 32.

  1. Evaluate on NYUv2 dataset:
    bash train_nyu.sh

  2. Evaluate on KITTI dataset:
    bash train_kitti.sh

Contact

If you have any questions about our code or paper, kindly raise an issue on Github.

Acknowledgment

We thank Kartik Anand for assistance with the experiments. Our source code is inspired from VPD and PixelFormer, we thank their authors for publicly releasing the code.

BibTeX (Citation)

If you find our work useful in your research, please consider citing the following:

TODO
.
.
.
.

ecodepth's People

Contributors

surajiitd avatar aradhye2002 avatar eltociear avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.