Giter Club home page Giter Club logo

mcar's Introduction

MCAR.pytorch

This repository is a PyTorch implementation of Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition. The paper is accepted at [IEEE Trans. Image Processing (TIP 2021). This repo is created by Bin-Bin Gao.

PWC PWC PWC

MCAR Framework

Requirements

Please, install the following packages

  • numpy
  • torch-0.4.1
  • torchnet
  • torchvision-0.2.0
  • tqdm

Options

  • topN: number of local regions
  • threshold: threshold of localization
  • ps: global pooling style, e.g., 'avg', 'max', 'gwp'
  • lr: learning rate
  • lrp: factor for learning rate of pretrained layers. The learning rate of the pretrained layers is lr * lrp
  • batch-size: number of images per batch
  • image-size: size of the image
  • epochs: number of training epochs
  • evaluate: evaluate model on validation set
  • resume: path to checkpoint

MCAR Training and Evaluation

bash run.sh
Model Input-Size VOC-2007 VOC-2012 COCO-2014
MobileNet-v2 256 x 256 88.1 model - 69.8 model
ResNet-50 256 x 256 92.3 model - 78.0 model
ResNet-101 256 x 256 93.0 model - 79.4 model
MobileNet-v2 448 x 448 91.3 model 91.0 75.0 Model
ResNet-50 448 x 448 94.1 model 93.5 82.1 model
ResNet-101 448 x 448 94.8 model 94.3 83.8 model

MCAR Demo

bash run_demo.sh

mcar-demo

Citing this repository

If you find this code useful in your research, please consider citing us:

@ARTICLE{MCAR_TIP_2021,
         author = {Bin-Bin Gao, Hong-Yu Zhou},
         title = {{Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition}},
         booktitle = {IEEE Transactions on Image Processing (TIP)},
         year={2021},
         volume={30},
         pages={5920-5932},
}

Reference

This project is based on the following implementations:

Tips

If you have any questions about our work, please do not hesitate to contact us by emails.

mcar's People

Contributors

gaobb avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.