Giter Club home page Giter Club logo

iou-aware-single-stage-object-detector's Introduction

IoU-aware single-stage object detector

Introduction

The repository is for the paper: IoU-aware single-stage object detector for accurate localization. The full paper is available at: https://arxiv.org/abs/1912.05992

IoU-aware RetinaNet is implemented based on MMDetection. The installation is the same as MMDetection.

Most of the implemented codes are in the file "mmdet/models/anchor_heads/iou_aware_retina_head.py" and the corresponding configuration file is in "configs/iou_aware_single_stage_detector".

Citation

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follows.

@article{wu2020iou,
  title={IoU-aware single-stage object detector for accurate localization},
  author={Wu, Shengkai and Li, Xiaoping and Wang, Xinggang},
  journal={Image and Vision Computing},
  pages={103911},
  year={2020},
  publisher={Elsevier}
}

mmdetection

Introduction

The master branch works with PyTorch 1.1 or higher. If you would like to use PyTorch 0.4.1, please checkout to the pytorch-0.4.1 branch.

mmdetection is an open source object detection toolbox based on PyTorch. It is a part of the open-mmlab project developed by Multimedia Laboratory, CUHK.

demo image

Major features

  • Modular Design

    One can easily construct a customized object detection framework by combining different components.

  • Support of multiple frameworks out of box

    The toolbox directly supports popular detection frameworks, e.g. Faster RCNN, Mask RCNN, RetinaNet, etc.

  • Efficient

    All basic bbox and mask operations run on GPUs now. The training speed is nearly 2x faster than Detectron and comparable to maskrcnn-benchmark.

  • State of the art

    This was the codebase of the MMDet team, who won the COCO Detection 2018 challenge.

Apart from mmdetection, we also released a library mmcv for computer vision research, which is heavily depended on by this toolbox.

License

This project is released under the Apache 2.0 license.

Updates

v0.6.0 (14/04/2019)

  • Up to 30% speedup compared to the model zoo.
  • Support both PyTorch stable and nightly version.
  • Replace NMS and SigmoidFocalLoss with Pytorch CUDA extensions.

v0.6rc0(06/02/2019)

  • Migrate to PyTorch 1.0.

v0.5.7 (06/02/2019)

  • Add support for Deformable ConvNet v2. (Many thanks to the authors and @chengdazhi)
  • This is the last release based on PyTorch 0.4.1.

v0.5.6 (17/01/2019)

  • Add support for Group Normalization.
  • Unify RPNHead and single stage heads (RetinaHead, SSDHead) with AnchorHead.

v0.5.5 (22/12/2018)

  • Add SSD for COCO and PASCAL VOC.
  • Add ResNeXt backbones and detection models.
  • Refactoring for Samplers/Assigners and add OHEM.
  • Add VOC dataset and evaluation scripts.

v0.5.4 (27/11/2018)

  • Add SingleStageDetector and RetinaNet.

v0.5.3 (26/11/2018)

  • Add Cascade R-CNN and Cascade Mask R-CNN.
  • Add support for Soft-NMS in config files.

v0.5.2 (21/10/2018)

  • Add support for custom datasets.
  • Add a script to convert PASCAL VOC annotations to the expected format.

v0.5.1 (20/10/2018)

  • Add BBoxAssigner and BBoxSampler, the train_cfg field in config files are restructured.
  • ConvFCRoIHead / SharedFCRoIHead are renamed to ConvFCBBoxHead / SharedFCBBoxHead for consistency.

Benchmark and model zoo

Supported methods and backbones are shown in the below table. Results and models are available in the Model zoo.

ResNet ResNeXt SENet VGG
RPN
Fast R-CNN
Faster R-CNN
Mask R-CNN
Cascade R-CNN
Cascade Mask R-CNN
SSD
RetinaNet
Hybrid Task Cascade
FCOS
Libra R-CNN

Other features

  • DCNv2
  • Group Normalization
  • Weight Standardization
  • OHEM
  • Soft-NMS
  • Mixed Precision (FP16) Training (coming soon)

Installation

Please refer to INSTALL.md for installation and dataset preparation.

Get Started

Please see GETTING_STARTED.md for the basic usage of mmdetection.

Contributing

We appreciate all contributions to improve mmdetection. Please refer to CONTRIBUTING.md for the contributing guideline.

Citation

If you use our codebase or models in your research, please cite this project. We will release a paper or technical report later.

@misc{mmdetection2018,
  author =       {Kai Chen and Jiangmiao Pang and Jiaqi Wang and Yu Xiong and Xiaoxiao Li
                  and Shuyang Sun and Wansen Feng and Ziwei Liu and Jianping Shi and
                  Wanli Ouyang and Chen Change Loy and Dahua Lin},
  title =        {mmdetection},
  howpublished = {\url{https://github.com/open-mmlab/mmdetection}},
  year =         {2018}
}

iou-aware-single-stage-object-detector's People

Contributors

shengkaiwu avatar yancie-yjr avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

iou-aware-single-stage-object-detector's Issues

IoU-aware在Guided Anchor和FCOS上的效果

您好,在mmdet/models/anchor_heads看到你们不仅试了retina-net上的IoU-aware的效果,好像还试了在Guided Anchor和FCOS上IoU-aware的效果。不知道方不方便交流一下这两个实验的结果?比如FCOS原本已经有了Centerness,我看您在代码里比较了Centerness和IoU score,很好奇有什么收获。

预测出来的IoU和真实IoU的对比?

作者您好,我想请问一下你们有没有做过基于你们预测出来的IoU和真实的IoU的对比,以此来看看预测出来的IoU准不准确。我自己做的实验结果是预测出来的IoU并不是非常准确,整体会偏高。由于训练的时候并没有回归neg proposal的IoU,导致预测的结果基本都大于0.5。

关于锚框正负的判断

您好,想问下您论文中的锚框正负的判断是和RPN中的判断是一样的吗?
那您的iouloss 是不是就不是根据:l,,t,b,r来计算的iouscore了,而是用了像NMS算法中计算两个矩形的交并比计算了iouscore这种方式?

Baseline in the paper

The scores of Baseline in Table 2-4 are the same.
What does the baseline mean?

Thank you!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.