Giter Club home page Giter Club logo

cmkd's Introduction

CMKD: Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV 2022 Oral)

Paper

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (arXiv, Supplimentary Included)

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV Open Access)

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (Waymo Challenge Report)

If you find our papers helpful for your research, you may cite our paper as

@inproceedings{yuhong-CMKD-ECCV2022,
author = {Yu Hong and
Hang Dai and
Yong Ding},
title = {Cross-Modality Knowledge
Distillation Network for Monocular 3D Object
Detection},
booktitle = {{ECCV}},
series = {Lecture Notes in Computer Science},
publisher = {Springer},
year = {2022}
}

Introduction

This is the official implementation of CMKD with OpenPCDet for KITTI and Waymo datasets.

We have also implemented another version with MMDetection3D for Nuscenes dataset.

News

[2023.3.25] We have several updates.

  • Release the multi-camera version CMKD for Nuscenes dataset, refer to this link for detail
  • Release more pre-trained models for KITTI dataset

[2023.2.14] We have several updates.

Notice: Due to the short schedule, instructions and pre-trained models will be released gradually in the near future, and there may be many issues and bugs, please feel free to let us know if you have any questions.

  • Support center head in response distillation
  • Support more teacher models in the framework (We now have SECOND, CenterPoint and PointPillar), more pre-trained models will be released later
  • Support to set different feature level as the distillation guidance
  • Add visualization utils to visualize the BEV feature maps and the detection results
  • Support for Nuscenes dataset will be released very soon

[2022.11.20] We release some instructions and pre-trained models covering the KITTI experiments.

This implementation has some differences from our paper, but the core idea is the same. Overall, the current version is faster to train, uses less memory, and has similar performance to the older version.

Waymo experiments and Nuscenes experiments are on the way.

[2022.7.9] Our paper has been accepted by ECCV 2022 as Oral presentation. ๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ

[2022.7.4] Our paper has been accepted by ECCV 2022. ๐Ÿ”ฅ๐Ÿ”ฅ

[2022.5.24] CMKD gets the 3rd place in the 2022 Waymo 3D camera-only detection challenge. ๐Ÿ”ฅ

In the challenge, we simply extend our baseline model from single-camera version to multi-camera version without any challenge-specific skills and achieve good results. Specifically, we use a lightweight res-50 backbone with 20% of the total training samples, no previous frames, no data augmentation, and no training and testing tricks to rank 3rd in the challenge.

Framework Overview

image

BEV Features Generation

image

Use CMKD

Installation

Please follow INSTALL to install CMKD.

Getting Started

Please follow GETTING_START to train or evaluate the models.

Models

KITTI

Teacher Model Car Easy@R40 Car Moderate@R40 Car Hard@R40 Model Teacher Model
CMKD-R50 (kitti train + eigen clean) SECOND 33.36 21.61 17.97 model model
CMKD-R50 (kitti train) SECOND 24.02 15.80 13.22 model model
CMKD-R50 (kitti train + eigen clean) CenterPoint 29.78 21.17 18.41 model model
CMKD-R50 (kitti train) CenterPoint 22.56 16.02 13.52 model model
CMKD-R50 (kitti train + eigen clean) PointPillar 32.25 21.47 18.21 model model
CMKD-R50 (kitti train) PointPillar 23.84 16.44 13.58 model model

Waymo

Coming Soon

Nuscenes

mAP NDS Model
BEVDet-R50 30.7 38.2 -
BEVDet-R50 + CMKD 34.7 42.6 -

cmkd's People

Contributors

cc-hy avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.