Giter Club home page Giter Club logo

tbmsl-net's Introduction

TBMSL-Net

This is a PyTorch implementation of the paper "Three-branch and Mutil-scale learning for Fine-grained Image Recognition (TBMSL-Net)" (Fan Zhang, Guisheng Zhai, Meng Li, Yizhao Liu). Welcome to discuss with us in issues!

avatar

Table of Contents

Requirements

  • python 3.7
  • pytorch 1.3.1
  • numpy 1.17.3
  • scikit-image 0.16.2
  • Tensorboard 1.15.0
  • TensorboardX 2.0
  • tqdm 4.41.1
  • imageio 2.6.1
  • pillow 6.1.0

Datasets

Download the CUB-200-2011 datasets and copy the contents of the extracted images folder into datasets/CUB 200-2011/images.

Download the FGVC-Aircraft datasets and copy the contents of the extracted data/images folder into datasets/FGVC_Aircraft/data/images)

You can also try other fine-grained datasets.

Training TBMSL-Net

If you want to train the TBMSL-Net, please download the pretrained model of ResNet-50 and move it to models/pretrained before run python train.py. You may need to change the configurations in config.py if your GPU memory is not enough. The parameter N_list is N1, N2, N3 in the original paper and you can adjust them according to GPU memory. During training, the log file and checkpoint file will be saved in model_path directory.

Evaluation

If you want to test the TBMSL-Net, just run python test.py. You need to specify the model_path in test.py to choose the checkpoint model for testing.

Model

We also provide the checkpoint model trained by ourselves, you can download if from Google Drive for CUB-200-2011 or download from here for FGVC-Aircraft. If you test on our provided model, you will get 89.6% and 94.5% test accuracy, respectively.

Reference

If you are interested in our work and want to cite it, please acknowledge the following paper:

@misc{zhang2020threebranch,
    title={Three-branch and Mutil-scale learning for Fine-grained Image Recognition (TBMSL-Net)},
    author={Fan Zhang and Guisheng Zhai and Meng Li and Yizhao Liu},
    year={2020},
    eprint={2003.09150},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

tbmsl-net's People

Contributors

zf4444 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.