Without lidar

Depth Estimation from Monocular Images and Sparse Radar Data

This is the official implementation of the paper Depth Estimation from Monocular Images and Sparse Radar Data. In this repo, we provide code for dataset preprocessing, training, and evaluation.

Some parts of the implementation are adapted from sparse-to-dense. We thank the authors for sharing their implementation.

Updates

Training and evaluation code.
Trained models.
Download instructions for the processed dataset.
Detailed documentation for the processed dataset.
Code and instructions to process data from the official nuScenes dataset.

Installation

git clone https://github.com/brade31919/radar_depth.git
cd radar_depth

Dataset preparation

Use our processed files

We provide our processed files specifically for the RGB + Radar depth estimation task. The download and setup instructions are:

mkdir DATASET_PATH # Set the path you want to use on your own PC/cluster.
cd DATASET_PATH
wget https://data.vision.ee.ethz.ch/daid/NuscenesRadar/Nuscenes_depth.tar.gz
tar -zxcf Nuscenes_depth.tar.gz

⚠️ Since the processed dataset is an adapted material (non-commercial purpose) from the official nuScenes dataset, the contents in the processed dataset are also subject to the official terms of use and the licenses.

Package installation

cd radar_depth # Go back to the project root
pip install -r requirements.txt

If you encounter error message like "ImportError: libSM.so.6: cannot open shared object file: No such file or directory" from cv2, you can try:

sudo apt-get install libsm6 libxrender1 libfontconfig1

Project configuration setting

we put important path setting in config/config_nuscenes.py. You need to modify them to the paths you use on your own PC/cluster.

Project and dataset root setting

In line 14 and 18, please specify your PROJECT_ROOT and DATASET_ROOT

PROJECT_ROOT = "YOUR_PATH/radar_depth"
DATASET_ROOT = "DATASET_PATH"

Experiment path setting

In line 53, please specify your EXPORT_PATH (the path you want to put our processed dataset).

EXPORT_ROOT = "YOUR_EXP_PATH"

Training

Downlaod the pre-trained models

We provide some pretrained models. They are not the original models used to produce the numbers on the paper but they have similar performances (I lost the original checkpoints due to some cluster issue...).

Please download the pretrained models from here, and put them to pretrained/ folder so that the directory structue looks like this:

pretrained/
├── resnet18_latefusion.pth.tar
└── resnet18_multistage.pth.tar

Train the late fusion model yourself

python main.py \
    --arch resnet18_latefusion \
    --data nuscenes \
    --modality rgbd \
    --decoder upproj \
    -j 12 \
    --epochs 20 \
    -b 16 \
    --max-depth 80 \
    --sparsifier radar

Train the full multi-stage model

To make sure that the training process is stable, we'll initialize each stage from the reset18_latefusion model. If you want to skip the trainig of resnet18_latefusion, you can use our pre-trained models.

python main.py \
    --arch resnet18_multistage_uncertainty_fixs \
    --data nuscenes \
    --modality rgbd \
    --decoder upproj \
    -j 12 \
    --epochs 20 \
    -b 8 \
    --max-depth 80 \
    --sparsifier radar

Here we use batch size 8 (instead of 16). This allows us to train the model on cheaper GPU models such as GTX1080Ti, GTX2080Ti, etc., and the training process is more stable.

Evaluation

After the training process finished, you can evaluate the model by (replace the PATH_TO_CHECKPOINT with the path to checkpoint file you want to evaluate):

python main.py \
    --evaluate PATH_TO_CHECKPOINT \
    --data nuscenes

Code Borrowed From

Citation

Please use the following citation format if you want to reference to our paper.

@InProceedings{radar:depth:20,
   author = {Lin, Juan-Ting and Dai, Dengxin and {Van Gool}, Luc},
   title = {Depth Estimation from Monocular Images and Sparse Radar Data},
   booktitle = {International Conference on Intelligent Robots and Systems (IROS)},
   year = {2020}
}

If you use the processed dataset, remember to cite the offical nuScenes dataset.

@article{nuscenes2019,
  title={nuScenes: A multimodal dataset for autonomous driving},
  author={Holger Caesar and Varun Bankiti and Alex H. Lang and Sourabh Vora and 
          Venice Erin Liong and Qiang Xu and Anush Krishnan and Yu Pan and 
          Giancarlo Baldan and Oscar Beijbom},
  journal={arXiv preprint arXiv:1903.11027},
  year={2019}
}

	if pretrained is True:
	# Get pretrained weights
	pretrained_path = os.path.join(cfg.PROJECT_ROOT, "pretrained/resnet18_latefusion.pth.tar")
	if not os.path.exists(pretrained_path):
	raise ValueError("[Error] Can't find pretrained latefusion model. "\
	"Please follow the instructions in README.md to download the weights!")
	checkpoint = torch.load(pretrained_path)

	cs_record = self.get('calibrated_sensor', lidar_record['calibrated_sensor_token'])
	point_clouds.rotate(Quaternion(cs_record['rotation']).rotation_matrix)
	point_clouds.translate(np.array(cs_record['translation']))

	# Second step: transform to the global frame.
	poserecord = self.dataset.get('ego_pose', lidar_record['ego_pose_token'])
	point_clouds.rotate(Quaternion(poserecord['rotation']).rotation_matrix)
	point_clouds.translate(np.array(poserecord['translation']))

	# Third step: transform into the ego vehicle frame for the timestamp of the image.
	poserecord = self.dataset.get('ego_pose', camera_record['ego_pose_token'])
	point_clouds.translate(-np.array(poserecord['translation']))
	point_clouds.rotate(Quaternion(poserecord['rotation']).rotation_matrix.T)

	# Fourth step: transform into the camera.
	cs_record = self.dataset.get('calibrated_sensor', camera_record['calibrated_sensor_token'])
	point_clouds.translate(-np.array(cs_record['translation']))
	point_clouds.rotate(Quaternion(cs_record['rotation']).rotation_matrix.T)

	def get_train_val_table(self):
	seed = cfg.TRAIN_VAL_SEED
	val_num = int(cfg.VAL_RATIO * self.num_scenes)

	# Get train / val scenes
	all = set(list(range(self.num_scenes)))
	np.random.seed(seed)
	val = set(np.random.choice(np.arange(0, self.num_scenes, 1), val_num, replace=False))
	train = all - val

	# Split number set
	train_scenes = list(train)
	val_scenes = list(val)

	parser.add_argument('--no-pretrain', dest='pretrained', action='store_false',
	help='not to use ImageNet pre-trained weights')

	point_clouds, times = RadarPointCloud.from_file_multisweep(self.dataset, sample_obj,
	radar_record["channel"], "LIDAR_TOP",
	nsweeps=1)

brade31919 / radar_depth Goto Github PK

radar_depth's Introduction

Depth Estimation from Monocular Images and Sparse Radar Data

Updates

Installation

Dataset preparation

Use our processed files

Package installation

Project configuration setting

Project and dataset root setting

Experiment path setting

Training

Downlaod the pre-trained models

Train the late fusion model yourself

Train the full multi-stage model

Evaluation

Code Borrowed From

Citation

radar_depth's People

Contributors

Stargazers

Watchers

Forkers

radar_depth's Issues

1. Learning rate and batch size:

2. Weight decay:

3. Processed dataset:

4. Transform points:

Recommend Projects

Recommend Topics

Recommend Org