
vovnet_sku110's Introduction

This repository is based on VoVNet-v2

Faster R-CNN on SKU-110K dataset

Note

We measure the inference time of all models with batch size 1 on the same RTX2080Ti GPU machine.

  • PyTorch 1.4.0
  • CUDA 10.2
  • cuDNN 7.3
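
The README does not say how the per-image time is computed, so the following is only a minimal sketch of one common way to time batch-size-1 inference: a few warm-up calls, then an average over timed calls. The names `time_inference`, `run_once`, and `sync` are hypothetical and not part of this repository. On a GPU, `sync` would typically be `torch.cuda.synchronize`, so that asynchronous kernels are finished before the clock is read.

```python
import time
from statistics import mean


def time_inference(run_once, warmup=5, iters=20, sync=None):
    """Return the mean wall-clock seconds per call of `run_once`.

    run_once: hypothetical zero-argument callable that performs one
              forward pass on a single image (batch size 1).
    sync:     optional callable that blocks until pending device work
              has finished (for example torch.cuda.synchronize).
    """
    if iters < 1:
        raise ValueError("iters must be at least 1")

    # Warm-up calls are not timed; they absorb one-off start-up costs.
    for _ in range(warmup):
        run_once()
    if sync is not None:
        sync()

    durations = []
    for _ in range(iters):
        start = time.perf_counter()
        run_once()
        if sync is not None:
            sync()  # wait for the device before stopping the clock
        durations.append(time.perf_counter() - start)
    return mean(durations)
```

For a loaded model, a call could look like `time_inference(lambda: model(inputs), sync=torch.cuda.synchronize)`, where `model` and `inputs` are whatever the evaluation script already builds.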

Lightweight-VoVNet with FPNLite

Backbone         Param.  lr sched  inference time  AP     AP75   AP50   download
MobileNetV2-0.5  3.5M    1x        0.037           42.93  44.27  77.31  model | metrics
MobileNetV2      3.5M    3x        0.044           40.55  47.25  64.58  model | metrics
V2-19            11.2M   3x        N/A             N/A    N/A    N/A    model | metrics
V2-19-DW         6.5M    3x        N/A             N/A    N/A    N/A    model | metrics
V2-19-Slim       3.1M    3x        N/A             N/A    N/A    N/A    model | metrics
V2-19-Slim-DW    1.8M    3x        N/A             N/A    N/A    N/A    model | metrics
  • DW and Slim denote depthwise separable convolution and a thinner model with half the channel size, respectively.
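
To give a feel for why the DW and Slim variants are smaller, here is a rough per-layer weight count. It is a sketch only: the channel width of 128 is a hypothetical value, not taken from the repository's configs, and biases are ignored.

```python
def conv_params(c_in, c_out, k=3):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def dw_separable_params(c_in, c_out, k=3):
    """Weights in a depthwise k x k conv followed by a 1 x 1 pointwise conv."""
    return k * k * c_in + c_in * c_out


c = 128  # hypothetical channel width, chosen only for illustration

standard = conv_params(c, c)             # 9 * 128 * 128 = 147456
separable = dw_separable_params(c, c)    # 9 * 128 + 128 * 128 = 17536
slim = conv_params(c // 2, c // 2)       # 9 * 64 * 64 = 36864

print(standard, separable, slim)
```

Halving both input and output channels cuts a standard convolution to a quarter of its weights, while the depthwise separable form removes most of the k x k cost. The whole-model parameter counts in the table will not follow these ratios exactly, since they also include layers that these changes may not touch.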

Backbone   Param.  lr sched  inference time  AP   AP75  AP50  download
V2-19-FPN  37.6M   3x        N/A             N/A  N/A   N/A   model | metrics
R-50-FPN   51.2M   3x        N/A             N/A  N/A   N/A   model | metrics
V2-39-FPN  52.6M   3x        N/A             N/A  N/A   N/A   model | metrics

Inference time is measured with the following command, using --num-gpus 1:

python /path/to/vovnet_sku110/train_net.py --config-file /path/to/vovnet_sku110/configs/<config.yaml> --eval-only --num-gpus 1 MODEL.WEIGHTS <model.pth>

Installation

Because vovnet-detectron2 is implemented as an extension on top of detectron2 (in the style of detectron2/projects), you only need to install detectron2 by following its INSTALL.md.

Prepare the SKU-110K dataset:

  • To download the dataset, please visit here
  • Extract the downloaded file to datasets/sku110/images
  • Extract datasets/sku110/Annotations.zip; it contains two folders, Annotations and ImageSets
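
Before training, it can help to confirm that the folders from the steps above are in place. The sketch below assumes that Annotations and ImageSets end up next to images under datasets/sku110, which the README implies but does not state; `missing_dataset_dirs` is a hypothetical helper, not part of this repository.

```python
from pathlib import Path

# Folder names taken from the preparation steps; their location directly
# under the dataset root is an assumption.
EXPECTED_DIRS = ("images", "Annotations", "ImageSets")


def missing_dataset_dirs(root="datasets/sku110"):
    """Return the expected sub-folders that do not exist under `root`."""
    root = Path(root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]
```

Calling `missing_dataset_dirs()` from the repository root returns an empty list when all three folders are present, and otherwise names the ones that still need to be extracted.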

Training

To train a model, run

python /path/to/vovnet_sku110/train_net.py --config-file /path/to/vovnet_sku110/configs/<config.yaml>

For example, to launch end-to-end Faster R-CNN training with VoVNetV2-39 backbone on 8 GPUs, one should execute:

python /path/to/vovnet_sku110/train_net.py --config-file /path/to/vovnet_sku110/configs/faster_rcnn_V_39_FPN_3x.yaml --num-gpus 8

Evaluation

Model evaluation can be done similarly:

python /path/to/vovnet_sku110/train_net.py --config-file /path/to/vovnet_sku110/configs/faster_rcnn_V_39_FPN_3x.yaml --eval-only MODEL.WEIGHTS <model.pth>
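
When several configs or checkpoints need to be trained or evaluated, the command lines shown in this README can be assembled programmatically. The helper below is a hypothetical convenience sketch; it uses only the flags that appear in this README and keeps the MODEL.WEIGHTS override at the end of the command, as in the examples.

```python
import shlex


def build_command(script, config, num_gpus=1, eval_only=False, weights=None):
    """Build the argument list for train_net.py as used in this README."""
    cmd = ["python", script, "--config-file", config, "--num-gpus", str(num_gpus)]
    if eval_only:
        cmd.append("--eval-only")
    if weights is not None:
        # Config overrides such as MODEL.WEIGHTS come after all flags.
        cmd += ["MODEL.WEIGHTS", weights]
    return cmd


# Placeholder paths; substitute the real script, config, and checkpoint.
cmd = build_command(
    "/path/to/vovnet_sku110/train_net.py",
    "/path/to/vovnet_sku110/configs/faster_rcnn_V_39_FPN_3x.yaml",
    eval_only=True,
    weights="model.pth",
)
print(shlex.join(cmd))
```

The returned list can be passed directly to `subprocess.run(cmd, check=True)`; `shlex.join` (Python 3.8+) is used here only to print a copy-pastable command.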

Visualization

To visualize the results, run

python /path/to/vovnet_sku110/demo.py --config-file /path/to/vovnet_sku110/configs/faster_rcnn_V_39_FPN_3x.yaml --input image.jpg --output image.jpg MODEL.WEIGHTS <model.pth>

Citing VoVNet

If you use VoVNet, please use the following BibTeX entry.

@inproceedings{lee2019energy,
  title = {An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object Detection},
  author = {Lee, Youngwan and Hwang, Joong-won and Lee, Sangrok and Bae, Yuseok and Park, Jongyoul},
  booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops},
  year = {2019}
}
