Giter Club home page Giter Club logo

destr's Introduction

DESTR: Object Detection with Split Transformer (CVPR 2022)

This repository is an official implementation of the CVPR 2022 paper "DESTR: Object Detection with Split Transformer".

Split Cross-attention Pipeline (insert miniDet) Pair Attention

Contributions:

  1. Split estimation of cross attention into two independent branches: one tailored for classification and the other for box regression;
  2. Insert a mini-detector between encoder and decoder to initialize objects’ classification, regression and positional embeddings;
  3. Augment self-attention in decoder to pair self-attention for every two pairs of spatially adjacent queries to improve inductive bias.

Model Zoo

We provide conditional DETR and conditional DETR-DC5 models. AP is computed on COCO 2017 val.

Method Epochs Params (M) AP APS APM APL URL
DETR-R50 500 41 42.0 20.5 45.8 61.1 model
log
DETR-R50 50 41 34.8 13.9 37.3 54.4 model
log
Conditional DETR-R50 50 44 41.0 20.6 44.3 59.3 model
log
DESTR-R50 50 69 43.6 23.5 47.6 62.4 model
log

Note:

  1. The numbers in the table are slightly differently from the numbers in the paper. We re-ran some experiments when releasing the codes.
  2. More weights will be release in future

Installation, Requirement, and Usage

Please see Conditional DETR

License

DESTR is released under the Apache 2.0 license. Please see the LICENSE file for more information.

Aknowledgement

DESTR is build on Conditional DETR . We appreciate the contributions from them!

Citation

@inproceedings{he2022destr,
  title={DESTR: Object Detection with Split Transformer},
  author={He, Liqiang and Todorovic, Sinisa},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={9377--9386},
  year={2022}
}

destr's People

Contributors

helq2612 avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

destr's Issues

Code for Attention Map as in Fig 6 of the paper

Hi,
Thank you for the great work. I want to ask could you please help me regarding code to generate attention map on the images as you have shown in the Fig 6 of your paper? That would be very helpful for me. Thank you.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.