Giter Club home page Giter Club logo

rfnext's Introduction

RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks

A general receptive field searching method for CNN.If your network has Conv with kernel larger than 1, RF-Next can further improve your model. The official implementation of:

TPAMI2022 paper: 'RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks'

CVPR2021 paper: 'Global2Local: Efficient Structure Search for Video Action Segmentation'

News

Introduction

Temporal/spatial receptive fields of models play an important role in sequential/spatial tasks. Large receptive fields facilitate long-term relations, while small receptive fields help to capture the local details. Existing methods construct models with hand-designed receptive fields in layers. Can we effectively search for receptive field combinations to replace hand-designed patterns? To answer this question, we propose to find better receptive field combinations through a global-to-local search scheme. Our search scheme exploits both global search to find the coarse combinations and local search to get the refined receptive field combinations further. The global search finds possible coarse combinations other than human-designed patterns. On top of the global search, we propose an expectation-guided iterative local search scheme to refine combinations effectively. Our RF-Next models, plugging receptive field search to various models, boost the performance on many tasks, e.g., temporal action segmentation, object detection, instance segmentation, and speech synthesis.

Applications and Codes

RF-Next supports many applications.

Citation

If you find this work or code is helpful in your research, please cite:

@article{gao2022rfnext,   
title={RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks},   
author={Gao, Shanghua and Li, Zhong-Yu and Han, Qi and Cheng, Ming-Ming and Wang, Liang},   
journal=TPAMI,   
year={2022} }

@inproceedings{gao2021global2local,
  title={Global2Local: Efficient Structure Search for Video Action Segmentation},
  author={Gao, Shang-Hua and Han, Qi and Li, Zhong-Yu and Peng, Pai and Wang, Liang and Cheng, Ming-Ming},
  booktitle=CVPR,
  year={2021}
}

License

The source code is free for research and education use only. Any comercial use should get formal permission first.

Contact

If you have any questions, feel free to E-mail Shang-Hua Gao (shgao(at)live.com)

rfnext's People

Contributors

gasvn avatar hanqer avatar lzyhha avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

rfnext's Issues

about MS-TCN++

hi, thank u for your awesome work. I wonder how to applying G2L to MS-TCN++ receptive field search, because MS-TCN++ have DDL module

Can searched receptive field model finetune from pretrained weight or from scratch?

Thanks your wonderful work and code.
Searching efficient receptive field is such a interesting topic!
I have one question. can searched receptive field model finetune from pretrained weight or from scratch?
If i have a faster rcnn pretrain model from https://github.com/open-mmlab/mmdetection/blob/master/configs/faster_rcnn/faster_rcnn_r50_fpn_2x_coco.py.
The pretrain ap is 38.4.
Now i use rfnext to search.
First search phase. I have two choice.

  1. search from scratch
  2. search from pretrain faster_rcnn_r50_fpn_2x_coco weight
    Which one is prefer?

After search, I have got a new search model.. I have two finetune choice

  1. finetune from scratch
  2. finetuen from pretrain faster_rcnn_r50_fpn_2x_coco weight.
    Which one is prefer?

For my view, reuse pretrain weight is better. But i donot know if it will harm ap.

how long did you train?

hi, it's me again. Though you reduce searching params, but you have to run like 5 epochs each structure. I wonder how long you spent to get final model params.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.