Giter Club home page Giter Club logo

eddyhkchiu / dmstrack Goto Github PK

View Code? Open in Web Editor NEW
52.0 4.0 4.0 236.96 MB

[ICRA2024] Official code of the paper "Probabilistic 3D Multi-Object Cooperative Tracking for Autonomous Driving via Differentiable Multi-Sensor Kalman Filter"

Home Page: https://eddyhkchiu.github.io/dmstrack.github.io/

Python 98.64% C++ 0.14% Shell 0.88% Cython 0.35%
3d-multi-object-tracking autonomous-driving computer-vision cooperative-perception deep-learning robotics sensor-fusion v2v-communication cooperative-tracking differentiable-kalman-filter

dmstrack's Introduction

Probabilistic 3D Multi-Object Cooperative Tracking for Autonomous Driving via Differentiable Multi-Sensor Kalman Filter

Hsu-kuang Chiu1, Chien-Yi Wang2, Min-Hung Chen2, Stephen F. Smith1

1Robotics Institute, Carnegie Mellon University, 2NVIDIA Research

IEEE International Conference on Robotics and Automation (ICRA), 2024

project arxiv

Overview

We propose a novel algorithm: Differentiable Multi-Sensor Kalman Filter for 3D Multi-Object Cooperative Tracking (DMSTrack). Our algorithm is designed to be capable of estimating observation noise covariance of each detection from different Connected Autonomous Vehicles (CAVs) to better take advantage of the Kalman Filter’s theoretical optimality property: minimizing the expected error of state estimation.

To train our Differentiable Multi-Sensor Kalman Filter neural network model, we use each detection's local point cloud BEV feature and the positional feature as input. We calculate the regression loss by measureing the difference between the tracking result and the ground-truth. For more details, please refer to our paper at arxiv.

The experiment results show that our algorithm improves the tracking accuracy by 17% with only 0.037x communication costs compared with the state-of-the-art method in V2V4Real[1].

Experiment Result

We exeriment on the V2V4Real [1] dataset, which is the first worldwide available public real-world dataset for vehicle-to-vehicle perception with 3D tracking benchmarks. The following table shows the cooperative tracking performance and communication cost in the testing split of V2V4Real [1] dataset in comparison with the state-of-the-art CoBEVT+AB3DMOT method. The baseline results contain the one from the V2V4Real [1] paper and another one of our implementation.

Method AMOTA(↑) AMOTP(↑) sAMOTA(↑) MOTA(↑) MT(↑) ML(↓) Cost (MB) (↓)
CoBEVT+AB3DMOT [1] 32.12 55.61 77.65 63.75 47.29 30.32 0.20
CoBEVT+AB3DMOT (our implementation) 37.16 57.20 84.54 84.14 57.07 15.83 0.20
DMSTrack (ours) 43.52 57.94 91.50 88.32 68.35 13.19 0.0073

The following video shows the visualization of our tracking result. Different colors represent different tracking IDs.

tracking_result_video_53670_compressed.mp4

Using our code

You can follow the step-by-step instructions to setup the environment, download the dataset and model checkpoints, run model inference, visualize the tracking results, and train the model from scratch.

References

We mainly use the open source code of the following two papers as the reference to implement our algorithm.

  • [1] "V2V4Real: A Real-world Large-scale Dataset for Vehicle-to-Vehicle Cooperative Perception", Runsheng Xu, Xin Xia, Jinlong Li, Hanzhao Li, Shuo Zhang, Zhengzhong Tu, Zonglin Meng, Hao Xiang, Xiaoyu Dong, Rui Song, Hongkai Yu, Bolei Zhou, Jiaqi Ma, CVPR 2023.
  • [2] "A Baseline for 3D Multi-Object Tracking", Xinshuo Weng and Kris Kitani, IROS 2020.

To Cite Our Paper

@ARTICLE{chiu2024probabilistic,
  title={Probabilistic 3D Multi-Object Cooperative Tracking for Autonomous Driving via Differentiable Multi-Sensor Kalman Filter},
  author={Chiu, Hsu-kuang and Wang, Chien-Yi and Chen, Min-Hung and Smith, Stephen F.},
  journal={IEEE International Conference on Robotics and Automation (ICRA)},
  year={2024}
}

dmstrack's People

Contributors

eddyhkchiu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

dmstrack's Issues

Question About Tracking Data Processing for V2V4Real

Hi! Thanks for the amazing work; I noticed that you have some preprocessing step to get the tracks for the V2V4Real dataset at v2v4real_[train/val]_evaluate_tracking.seqmap.val. I was wondering how this is generated? Do you have the scripts to create these files, and the preprocessing for the label file?

In addition, I was wondering about the KITTI format labels in AB3DMOT/scripts/KITTI/v2v4real_[train/val]_label. Do you have a script to convert this for the v2v4real dataset? I presume that the track ID is the "Vehicle" index of v2v4real dataset annotation yaml file?

Thank you in advance for the clarifications!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.