Giter Club home page Giter Club logo

sirst-5k's Introduction

SIRST-5K (TGRS 2024)

SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection

arXiv

Contents

Introduction

curve

Overview

Visual

Dependencies and Installation

  • Following DNANet
  • Python == 3.8
  • pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
  • pip install scikit-image
  • pip install tqdm
  • pip install matplotlib
  • pip install tensorboard==2.14.0
  • pip install opencv-python==4.8.0.76

Dataset

Download the dataset download dir models[Baidu Drive][Google Drive]. Currently, the available dataset are:

  • SIRST-5K: The dataset synthesized using negatives generation strategies (Fig 2).

Codes Demos

Noise Sampling

# Run Noise_Sampling.py directly
python codes/Noise_Sampling/Noise_Sampling.py

Noise displacement

# Run add_noise.py directly
python codes/Mix_Rot/add_noise.py

Negative

# Run rot_patch.py directly
python codes/Mix_Rot/rot_patch.py
# Run rot_mask.py directly
python codes/Mix_Rot/rot_mask.py

Our negative augmentation strategies can produce large amounts of challenging image data. You can download the SIRST-5K directly for training.

Usage

1. Train.

python train.py --base_size 256 --crop_size 256 --epochs 1500 --dataset [dataset-name] --split_method 50_50  --deep_supervision True --train_batch_size 16 --test_batch_size 16 --mode TXT

2. Test.

python test.py --base_size 256 --crop_size 256 --st_model [trained model path] --model_dir [model_dir] --dataset [dataset-name] --split_method 50_50    --deep_supervision True --test_batch_size 1 --mode TXT 

3. Visulize your predicts.

python visulization.py --base_size 256 --crop_size 256 --st_model [trained model path] --model_dir [model_dir] --dataset [dataset-name] --split_method 50_50    --deep_supervision True --test_batch_size 1 --mode TXT 

Quantative Results

Model mIoU (x10(2)) Pd (x10(2)) Fa (x10(6))
Ours 92.78 98.84 2.735 [Weights]

Citation

If you find this project useful for your research, please consider citing our paper. ๐Ÿ˜ƒ

@ARTICLE{10496142,
  author={Lu, Yahao and Lin, Yupei and Wu, Han and Xian, Xiaoyu and Shi, Yukai and Lin, Liang},
  journal={IEEE Transactions on Geoscience and Remote Sensing}, 
  title={SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection}, 
  year={2024},
  publisher={IEEE}
  doi={10.1109/TGRS.2024.3387125}
}

Acknowledgement

This project is build based on DNANet. We thank the authors for sharing their code.

sirst-5k's People

Contributors

luy0222 avatar 7ywx avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.