Giter Club home page Giter Club logo

computervisioncourse's Introduction

Computer Vision Course

Module 1. Image processing basics. Local image processing and feature descriptors

Basic image processing:

  • Scale pyramid, bluring, Image gradients, Contour detection.

Correspondences and wide baseline stereo:

  • Interest point and distinguished regions detection:
  • Harris operator (corner detection)
  • Hessian detector, affine covariant version, Maximally Stable Extrema Regions (MSER).
  • Descriptors of measurement regions
  • SIFT (scale invariant feature transform), RootSIFT
  • Shape context.
  • LBP (local binary patterns)
  • Deep learned features (HardNet).
  • Deep learned features (R2D2, SuperPoint)

Graph neural networks for matching (LoFTR, SuperGlue)

Robust model fitting:

  • RANSAC.
  • Hough transform

Other:

Module 2. Image Segmentation and Correspondence

Intro into segmentation, matting and correspondence problems

Image segmentation:

  • Level-set methods
  • Otsu thresholding
  • Postprosessing (erosion, dilation, NL-means)
  • Edge-based methods
  • Connected components labeling
  • Active contour model
  • Super-pixel segmentation
  • Watershed segmentation
  • Expectation-maximization in computer vision
  • Gaussian mixture model and K-means
  • Segmentation based on graph cuts
  • Markov models (chain, trees, P2D, CRF)
  • YOLACT

Image correspondence:

  • Sparse and dense optical flow
  • Affine and projective transformations
  • Birchfield–Tomasi dissimilarity
  • Dynamic programming for stereo correspondence
  • Polar coordinates and 360 photo
  • Panorama stitching
  • Lukas-Kanade Optical Flow
  • Horn-Schunck optical flow
  • Gunnar-Farneback algorithm
  • PatchMatch algorithm
  • PWC-Net, MaskFlownet

Module 3. Object Tracking and Search

  • Histograms and statistical models
  • Hidden Markov Models
  • Integral images
  • HOG detector
  • KLT tracker
  • Mean-Shift, CamShift tracker
  • Kalman Filter
  • Binary features
  • Bag of visual words
  • minHash
  • Image Retrieval for large image collections: image description, indexing, geometric consistency
  • Neural nets for object tracking and search

Module 4. Geometry and Augmented Reality (AR)

  • Homogeneous coordinates
  • Translation, rotation, scale, and projection matrices
  • Camera models
  • Rendering 3D scenes
  • Camera calibration and removal of lens distortion
  • Search for lines, circles and ellipses in photo
  • Stereo and epipolar geometry
  • Match moving and 3D reconstruction
  • Light and materials
  • AR examples

Module 5. Deep Learning for CV

  • ML basics: data retrieval, synthesis, augmentation, train and test sets
  • Non-deep ML models (PCA, SVM)
  • Deep learning
  • Different layers. Math and implementation.
  • Optimization for size and speed (MobileNet, ... )
  • Running in real-time (Core ML 2, etc)
  • Study of different architectures: Image classification (AlexNet), Object detection (R-CNN), Object Tracking (SAE and CNN), Object segmentation (SegNet), Instance segmentation (Mask R-CNN), Optical flow (PWC Net), Human pose estimation (VNect), 3D reconstruction (LayoutNet), Transfer-based AR (LSTM Neural nets, GANs). Metrics learning (face recognition)

computervisioncourse's People

Contributors

mdavydov avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.