Giter Club home page Giter Club logo

deep-learning-for-tracking-and-detection's Introduction

Collection of papers, datasets, code and other resources for object detection and tracking using deep learning

Papers

Static Detection

Region Proposal

  • Scalable Object Detection Using Deep Neural Networks [cvpr14] [pdf] [notes]
  • Selective Search for Object Recognition [ijcv2013] [pdf] [notes]

RCNN

YOLO

  • You Only Look Once Unified, Real-Time Object Detection [ax1605] [pdf] [notes]
  • YOLO9000 Better, Faster, Stronger [ax1612] [pdf] [notes]
  • YOLOv3 An Incremental Improvement [ax1804] [pdf] [notes]

SSD

  • SSD Single Shot MultiBox Detector [ax1612/eccv16] [pdf] [notes]
  • DSSD Deconvolutional Single Shot Detector [ax1701] [pdf] [notes]

RetinaNet

  • Feature Pyramid Networks for Object Detection [ax1704] [pdf] [notes]
  • Focal Loss for Dense Object Detection [ax180207/iccv17] [pdf] [notes]

Anchor Free

Misc

  • OverFeat Integrated Recognition, Localization and Detection using Convolutional Networks [ax1402/iclr14] [pdf] [notes]
  • LSDA Large scale detection through adaptation [ax1411/nips14] [pdf] [notes]
  • Acquisition of Localization Confidence for Accurate Object Detection [ax1807/eccv18] [pdf] [notes] [code]

Video Detection

Tubelet

  • Object Detection from Video Tubelets with Convolutional Neural Networks [cvpr16] [pdf] [notes]
  • Object Detection in Videos with Tubelet Proposal Networks [ax1704/cvpr17] [pdf] [notes]

FGFA

  • Deep Feature Flow for Video Recognition [cvpr17] [Microsoft Research] [pdf] [arxiv] [code]
  • Flow-Guided Feature Aggregation for Video Object Detection [ax1708/iccv17] [pdf] [notes]
  • Towards High Performance Video Object Detection [ax1711] [Microsoft] [pdf] [notes]

RNN

  • Online Video Object Detection using Association LSTM [iccv17] [pdf] [notes]
  • Context Matters Refining Object Detection in Video with Recurrent Neural Networks [bmvc16] [pdf] [notes]

Multi Object Tracking

Association

  • Deep Affinity Network for Multiple Object Tracking [ax1810/tpami19] [pdf] [notes] [code] [pytorch]

Deep Learning

  • Online Multi-Object Tracking Using CNN-based Single Object Tracker with Spatial-Temporal Attention Mechanism [ax1708/iccv17] [pdf] [arxiv] [notes]
  • Online multi-object tracking with dual matching attention networks [ax1902/eccv18] [pdf] [arxiv] [notes] [code]
  • FAMNet Joint Learning of Feature, Affinity and Multi-Dimensional Assignment for Online Multiple Object Tracking [iccv19] [pdf] [notes]
  • MOTS Multi-Object Tracking and Segmentation [cvpr19] [pdf] [notes] [code] [project/data]
  • Exploit the Connectivity: Multi-Object Tracking with TrackletNet [ax1811/mm19] [pdf] [notes]
  • Tracking without bells and whistles [ax1903/iccv19] [pdf] [notes] [code] [pytorch]

RNN

  • Tracking The Untrackable: Learning To Track Multiple Cues with Long-Term Dependencies [ax1704/iccv17] [Stanford] [pdf] [notes] [arxiv] [project],
  • Multi-object Tracking with Neural Gating Using Bilinear LSTM [eccv18] [pdf] [notes]
  • Eliminating Exposure Bias and Metric Mismatch in Multiple Object Tracking [cvpr19] [pdf] [notes] [code]

Unsupervised Learning

  • Tracking by Animation: Unsupervised Learning of Multi-Object Attentive Trackers [ax1809/cvpr19] [pdf] [arxiv] [notes] [code]

Reinforcement Learning

Network Flow

Graph Optimization

  • A Multi-cut Formulation for Joint Segmentation and Tracking of Multiple Objects [ax1607] [highest MT on MOT2015] [University of Freiburg, Germany] [pdf] [arxiv] [author] [notes]

Baseline

Single Object Tracking

Reinforcement Learning

  • Deep Reinforcement Learning for Visual Object Tracking in Videos [ax1704] [USC-Santa Barbara, Samsung Research] [pdf] [arxiv] [author] [notes]
  • Visual Tracking by Reinforced Decision Making [ax1702] [Seoul National University, Chung-Ang University] [pdf] [arxiv] [author] [notes]
  • Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning [cvpr17] [Seoul National University] [pdf] [supplementary] [project] [notes] [code]
  • End-to-end Active Object Tracking via Reinforcement Learning [ax1705] [Peking University, Tencent AI Lab] [pdf] [arxiv]

Siamese

Misc

Deep Learning

  • Do Deep Nets Really Need to be Deep [nips14] [pdf] [notes]

Synthetic Gradients

  • Decoupled Neural Interfaces using Synthetic Gradients [ax1608] [pdf] [notes]
  • Understanding Synthetic Gradients and Decoupled Neural Interfaces [ax1703] [pdf] [notes]

Unsupervised Learning

  • Learning Features by Watching Objects Move (cvpr17) [pdf] [notes]

Interpolation

Autoencoder

Variational

  • beta-VAE Learning Basic Visual Concepts with a Constrained Variational Framework [iclr17] [pdf] [notes]
  • Disentangling by Factorising [ax1806] [pdf] [notes]

Datasets

Multi Object Tracking

Single Object Tracking

Video Detection

Video Understanding / Activity Recognition

Static Detection

Animals

Boundary Detection

Static Segmentation

Video Segmentation

Classification

Optical Flow

Code

Multi Object Tracking

Single Object Tracking

GUI Application / Large Scale Tracking / Animals

Video Detection

Static Detection and Matching

Frameworks

Region Proposal

FPN

RCNN

SSD

RetinaNet

YOLO

Anchor Free

Misc

Matching

Boundary Detection

Optical Flow

Instance Segmentation

Frameworks

Semantic Segmentation

Video Segmentation

Autoencoders

Classification

Deep RL

Annotation

Misc

Collections

Datasets

Static Detection

Video Detection

Single Object Tracking

Multi Object Tracking

Segmentation

Deep Compressed Sensing

Misc

Tutorials

Multi Object Tracking

Static Detection

Video Detection

Instance Segmentation

Deep RL

Autoencoders

deep-learning-for-tracking-and-detection's People

Contributors

abhineet123 avatar minhnhat93 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.