Giter Club home page Giter Club logo

reading-paper's Introduction

Paper Review : My:[124], My_Light:[4], Link:[15],

  • 개인 공부라 열심히는 하고 있으나, 완벽한 리뷰가 아닙니다.
  • 리뷰가 끝나더라도 계속 의문/생각/교정/좋은자료가 있다면 꾸준히 업데이트 됩니다.
  • link review는 다른 분들이 하신 좋은 리뷰를 링크한 것입니다.
  • lihgt_link는 빠르게 개념(abstract)정도로 본 논문을 의미합니다.

Self Supervised Learninig (일단 정리만)

  • Unsupervised Representation Learning by Predicting Image Rotations : [paper][]
  • Unsupervised Visual Representation Learning by Context Prediction : [paper][]
  • Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles : [paper][]
  • Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks : [paper][]
  • Rethinking Pre-training and Self-training : [paper][]
  • Selfie: Self-supervised Pretraining for Image Embedding : [paper] [light_review]
  • Self-training with Noisy Student improves ImageNet classification : [paper] [review]

Visual Transformers (일단 정리만)

  • Stand-Alone Self-Attention in Vision Models : [paper][review]
  • Selfie: Self-supervised Pretraining for Image Embedding : [paper] [light_review]
  • Visual Transformers: Token-based Image Representation and Processing for Computer Vision : [paper][]
  • 2D Attentional Irregular Scene Text Recognizer : [paper][]
  • NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition : [paper][]
  • On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention : [paper][]
  • End-to-End Object Detection with Transformers : [paper][]

Image Retrieval & Deep Feature

  • Large-Scale Image Retrieval with Attentive Deep Local Features : [paper][review]
  • NetVLAD: CNN architecture for weakly supervised place recognition : [paper][review]
  • Learning visual similarity for product design with convolutional neural networks : [paper][review]
  • Bags of Local Convolutional Features for Scalable Instance Search : [paper][review]
  • Neural Codes for Image Retrieval : [paper][review]
  • Conditional Similarity Networks : [paper][review]
  • End-to-end Learning of Deep Visual Representations for Image Retrieval : [paper][review]
  • CNN Image Retrieval Learns from BoW: Unsupervised Fine-Tuning with Hard Examples : [paper][review]
  • Image similarity using Deep CNN and Curriculum Learning : [paper][review]
  • Faster R-CNN Features for Instance Search : [paper][review]
  • Regional Attention Based Deep Feature for Image Retrieval : [paper][review]
  • Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination : [paper][review]
  • Object retrieval with deep convolutional features : [paper][review]
  • Cross-dimensional Weighting for Aggregated Deep Convolutional Features : [paper][review]
  • Learning Embeddings for Product Visual Search with Triplet Loss and Online Sampling : [paper][review]
  • Saliency Weighted Convolutional Features for Instance Search : [paper][review]
  • 2018 Google Landmark Retrieval Challenge 리뷰 : [review]
  • 2019 Google Landmark Retrieval Challenge 리뷰 : [review]
  • REMAP: Multi-layer entropy-guided pooling of dense CNN features for image retrieval : [paper][review]
  • Large-scale Landmark Retrieval/Recognition under a Noisy and Diverse Dataset : [paper][review]
  • Fine-tuning CNN Image Retrieval with No Human Annotation : [paper][review]
  • Large Scale Landmark Recognition via Deep Metric Learning : [paper][review]
  • Deep Aggregation of Regional Convolutional Activations for Content Based Image Retrieval : [paper][review]
  • Challenging deep image descriptors for retrieval in heterogeneous iconographic collections : [paper][review]
  • A Benchmark on Tricks for Large-scale Image Retrieval : [paper][review]
  • Attention-Aware Generalized Mean Pooling for Image Retrieval : [paper][review]
  • Class-Weighted Convolutional Features for Image Retrieval : [paper][review] # 100th
  • deep image retrieval loss (계속 업데이트):[paper][review]
  • Matchable Image Retrieval by Learning from Surface Reconstruction:[paper][review]
  • Regional Maximum Activations of Convolutions with Attention for Cross-domain Beauty and Personal Care Product Retrieval:[paper][review]
  • Combination of Multiple Global Descriptors for Image Retrieval:[paper][review]
  • Unifying Deep Local and Global Features for Efficient Image Search:[paper][review]
  • ACTNET: end-to-end learning of feature activations and multi-stream aggregation for effective instance image retrieval:[paper][review]
  • Google Landmarks Dataset v2 A Large-Scale Benchmark for Instance-Level Recognition and Retrieval:[paper][review]
  • Detect-to-Retrieve: Efficient Regional Aggregation for Image Search:[paper][review]
  • Local Features and Visual Words Emerge in Activations:[paper][review]
  • Image Retrieval using Multi-scale CNN Features Pooling: [paper][review]
  • MultiGrain: a unified image embedding for classes and instances: [paper][link_review]
  • Divide and Conquer the Embedding Space for Metric Learning: [paper][link_review]
  • An Effective Pipeline for a Real-world Clothes Retrieval System: [paper][light_review]

Metric Learning

Fashion Image Retrieval

  • Learning Embeddings for Product Visual Search with Triplet Loss and Online Sampling : [paper][review]
  • Conditional Similarity Networks : [paper][review]

Fashion Recommendation

  • FashionNet: Personalized Outfit Recommendation with Deep Neural Network: [paper][review]
  • Context-Aware Visual Compatibility Prediction: [paper][review]
  • Learning Type-Aware Embeddings for Fashion Compatibility : [paper][review]

Fashion Generative Adversarial Nets

  • Be Your Own Prada: Fashion Synthesis with Structural Coherence : [paper][review]
  • Fashion-Gen: The Generative Fashion Dataset and Challenge : [paper][review]
  • DwNet: Dense warp-based network for pose-guided human video generation: [paper][review]

Image Retrieval using Deep Hash

  • Deep Learning of Binary Hash Codes for Fast Image Retrieval : [paper][review]
  • Feature Learning based Deep Supervised Hashing with Pairwise Labels : [paper][review]
  • Deep Supervised Hashing with Triplet Labels : [paper][review]

Video Classification

  • NetVLAD: CNN architecture for weakly supervised place recognition : [paper][review]
  • Learnable pooling with Context Gating for video classification : [paper][review]
  • Less is More: Learning Highlight Detection from Video Duration : [paper][review]
  • Efficient Video Classification Using Fewer Frames : [paper][review]

OCR - Recognition

  • Synthetically Supervised Feature Learning for Scene Text Recognition : [paper][review]
  • FOTS: Fast Oriented Text Spotting with a Unified Network : [paper][review]
  • Robust Scene Text Recognition with Automatic Rectification : [paper][review]

OCR - Detection

Attention & Deformation

Visual & Textual Embedding

Recommendation

  • FashionNet: Personalized Outfit Recommendation with Deep Neural Network: [paper][review]
  • Context-Aware Visual Compatibility Prediction: [paper][review]

CNN

Transfer Learning

Generative Adversarial Nets

  • Generative Adversarial Nets : [paper][review]
  • Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks : [paper][review]
  • Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks : [paper][review]
  • Progressive Growing of GANs for Improved Quality, Stability, and Variation : [paper][review]
  • Beholder-GAN: Generation and Beautification of Facial Images with Conditioning on Their Beauty Level : [paper][review]
  • Synthetically Supervised Feature Learning for Scene Text Recognition : [paper][review]
  • A Style-Based Generator Architecture for Generative Adversarial Networks : [paper][review]
  • High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs : [paper][review]
  • Everybody Dance Now : [paper][review]
  • Be Your Own Prada: Fashion Synthesis with Structural Coherence : [paper][review]
  • Fashion-Gen: The Generative Fashion Dataset and Challenge : [paper][review]
  • StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks : [paper][review]
  • DwNet: Dense warp-based network for pose-guided human video generation: [paper][review]

Face

Pose Estimation

NLP/NLU

  • Efficient Estimation of Word Representations in Vector Space : [paper][review]
  • node2vec: Scalable Feature Learning for Networks : [paper][review]
  • Transfomer(self attention) 기본 이해하기 : PPT정리
  • BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding : [paper][review](~ing)
  • DeepRank: A New Deep Architecture for Relevance Ranking in Information Retrieval : [paper][review]
  • SNRM: From Neural Re-Ranking to Neural Ranking: Learning a Sparse Representation for Inverted Indexing : [paper][review]
  • TF-Ranking: Scalable TensorFlow Library for Learning-to-Rank : [paper][review]
  • ConvRankNet: Deep Neural Network for Learning to Rank Query-Text Pairs : [paper][review]
  • KNRM: End-to-End Neural Ad-hoc Ranking with Kernel Pooling : [paper][review]
  • Conv-KNRM: Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search : [paper][review]
  • PACRR: A position-aware neural IR model for relevance matching : [paper][link_review]
  • CEDR: Contextualized Embeddings for Document Ranking #262 : [paper][link]
  • Deeper Text Understanding for IR with Contextual Neural Language Modeling : [paper][link]
  • Simple Applications of BERT for Ad Hoc Document Retrieval : [paper][link]
  • Document Expansion by Query Prediction : [paper][link]
  • Passage Re-ranking with BERT : [paper][link]

Domain Adaptation

Curriculum Learning

  • CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images : [paper][review]

Image Segmentation

  • U-Net: Convolutional Networks for Biomedical Image Segmentation : [paper][review]
  • Mask R-CNN : [paper][review]
  • Fully Convolutional Networks for Semantic Segmentation : [paper][review]
  • Cascade Decoder: A Universal Decoding Method for Biomedical Image Segmentation : [paper][review]
  • FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stochastic Inference : [link_review]

Deep Learning

Localization

AutoML

Image Quality

  • Learning to Compose with Professional Photographs on the Web : [paper][review]
  • Photo Aesthetics Ranking Network with Attributes and Content Adaptation : [paper][review]
  • Composition-preserving Deep Photo Aesthetics Assessment : [paper][review]
  • Deep Image Aesthetics Classification using Inception Modules and Fine-tuning Connected Layer : [paper][review]
  • NIMA: Neural Image Assessment : [paper][review]

Others

reading-paper's People

Contributors

chullhwan-song avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.