Computer Vision Course

Module 1. Image processing basics. Local image processing and feature descriptors

Interest point and distinguished region detection:
Harris operator (corner detection)
Hessian detector, affine covariant version, Maximally Stable Extremal Regions (MSER)
Descriptors of measurement regions
SIFT (scale invariant feature transform), RootSIFT
Shape context.
LBP (local binary patterns)
Deep learned descriptors (HardNet)
Deep learned detectors and descriptors (R2D2, SuperPoint)
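As a small illustration of the RootSIFT item above: RootSIFT is usually described as L1-normalizing each SIFT descriptor and then taking the element-wise square root, so that Euclidean distance on the result corresponds to the Hellinger kernel on the original histograms. The following is a minimal sketch under that description; the function name root_sift and the eps parameter are assumptions made for this example, not part of any library.

```python
import numpy as np

def root_sift(descriptors, eps=1e-7):
    """Convert SIFT descriptors (one per row, non-negative) to RootSIFT.

    `eps` is a hypothetical guard against division by zero for all-zero rows.
    """
    d = np.asarray(descriptors, dtype=np.float64)
    # Step 1: L1-normalize each descriptor so its entries sum to (almost) 1.
    d = d / (np.abs(d).sum(axis=1, keepdims=True) + eps)
    # Step 2: element-wise square root; each row now has (almost) unit L2 norm.
    return np.sqrt(d)

# Usage with synthetic, non-negative 128-D vectors standing in for SIFT output.
rng = np.random.default_rng(0)
fake_sift = rng.random((5, 128))
rs = root_sift(fake_sift)
```

Because the rows are L1-normalized before the square root, the output rows have approximately unit L2 norm, so no further normalization is needed before nearest-neighbour matching.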

Histograms and statistical models
Hidden Markov Models
Integral images
HOG detector
KLT tracker
Mean-Shift and CamShift trackers
Kalman Filter
Binary features
Bag of visual words
MinHash
Image Retrieval for large image collections: image description, indexing, geometric consistency
Neural nets for object tracking and search
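To make the integral images item in the list above concrete: an integral image stores, at each position, the sum of all pixels above and to the left, so the sum over any axis-aligned rectangle can be read off with four lookups. This is a minimal sketch assuming a single-channel image stored as a NumPy array; the names integral_image and box_sum are chosen for this example only.

```python
import numpy as np

def integral_image(img):
    """Return a (H+1, W+1) table where ii[y, x] = sum of img[:y, :x]."""
    img = np.asarray(img).astype(np.int64)  # widen to avoid overflow on uint8 input
    ii = np.zeros((img.shape[0] + 1, img.shape[1] + 1), dtype=np.int64)
    # Cumulative sum down the rows, then across the columns.
    ii[1:, 1:] = np.cumsum(np.cumsum(img, axis=0), axis=1)
    return ii

def box_sum(ii, top, left, bottom, right):
    """Sum of img[top:bottom, left:right] using four table lookups."""
    return ii[bottom, right] - ii[top, right] - ii[bottom, left] + ii[top, left]

# Usage: the box sum matches a direct sum over the same slice.
img = np.arange(12).reshape(3, 4)
ii = integral_image(img)
total = box_sum(ii, 1, 1, 3, 3)
```

The extra zero row and column are a design choice that removes boundary checks from box_sum; the cost of each rectangle sum is constant regardless of rectangle size.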

ML basics: data retrieval, synthesis, augmentation, train and test sets
Non-deep ML models (PCA, SVM)
Deep learning
Different layers. Math and implementation.
Optimization for size and speed (MobileNet, ...)
Running in real time (Core ML 2, etc.)
Study of different architectures:
Image classification (AlexNet)
Object detection (R-CNN)
Object tracking (SAE and CNN)
Object segmentation (SegNet)
Instance segmentation (Mask R-CNN)
Optical flow (PWC-Net)
Human pose estimation (VNect)
3D reconstruction (LayoutNet)
Transfer-based AR (LSTM neural nets, GANs)
Metric learning (face recognition)
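As a small worked example for the "Optimization for size and speed (MobileNet, ...)" topic in this block: MobileNet-style networks replace a standard convolution with a depthwise convolution followed by a 1x1 pointwise convolution. The sketch below only counts weights (biases ignored) for the two variants; the function names and the layer sizes in the usage lines are assumptions chosen for illustration.

```python
def standard_conv_params(k, c_in, c_out):
    """Weights in a k x k convolution mapping c_in channels to c_out channels."""
    return k * k * c_in * c_out

def depthwise_separable_params(k, c_in, c_out):
    """Weights in a k x k depthwise conv plus a 1 x 1 pointwise conv."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes channels
    return depthwise + pointwise

# Usage: a hypothetical 3x3 layer going from 64 to 128 channels.
standard = standard_conv_params(3, 64, 128)
separable = depthwise_separable_params(3, 64, 128)
ratio = standard / separable
```

The ratio grows with the kernel size and the number of output channels, which is why the substitution reduces model size and multiply-accumulate cost most in wide layers.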