willard-yuan / cnn-for-image-retrieval Goto Github PK

:sunrise:The code of post "Image retrieval using MatconvNet and pre-trained imageNet"

MATLAB 32.45% Python 7.31% Makefile 1.29% TeX 10.29% CSS 0.54% JavaScript 0.06% C++ 17.63% Cuda 25.28% C 2.04% Shell 2.41% HTML 0.67% M 0.04%

image-retrieval cnn matconvnet

cnn-for-image-retrieval's Introduction

CNN for Image Retrieval

博文：Image retrieval using MatconvNet and pre-trained imageNet，对应web演示主页picSearch。

2017/10/08: 构建CBIR检索对比框架cnn-cbir-benchmark，包括Fisher Vector, VLAD, FC, RMAC, CROW.

2017/08/15更新：增加Python版本，Caffe版本，Keras版本。

2015/12/31更新：添加对MatConvNet最新版version 1.0-beta17的支持，预训练的模型请到Matconvnet官网下载最新的模型。

2015/10/20更新：Web演示部分代码公开CNN-Web-Demo-for-Image-Retrieval。

2015/09/24更新：添加对MatConvNet最新版version 1.0-beta14的支持。

2015/12/31更新：添加对MatConvNet最新版version 1.0-beta17的支持，删掉原来的版本(预训练的模型请到matconvnet官网下载最新的模型)。

2015/06/29更新：添加对MatConvNet最新版version 1.0-beta12的支持。

注意：其中文件夹matconvnet-1.0-beta17是已经编译好了的，鉴于MatConvNet只能在Matlab 2014及其以上以及系统必须是64位，所以在使用此工具箱之前得满足这两个条件。如果是Pythoner，推荐使用flask-keras-cnn-image-retrieval，纯Python，非常易于写成在线图像搜索应用。

Caltech-256图像数据库上搜索结果

运行步骤

1). 如果不需要计算mAP的话，那就直接把你的图像库文件夹名字命名为database，并将图片全部放在放在database文件夹下即可。如果你要在后面计算MAP（平均检索精度）的话，要确保图像数据库做成文件夹databaseClassified中的形式，然后执行下面命令：

python movefiles.py

2). 接着便可以抽取特征。运行extractCNN.m，要用parfor并行的话，直接修改注释部分即可。

3). 检索可视化。这一步运行queryInDatabaseDemo.m即可。

4). 计算mAP。不需要计算MAP的这步略过。运行compute_MAP.m，关于mAP的计算，请参阅我画的mAP计算过程示意图：信息检索评价指标，这个计算mAP的脚本是按照那个流程中定义的mAP计算方式来写的。

降维

非常的amazing, 除了验证降维到128D后损失不减外，惊奇地发现4096D的CNN降维到128D后精度还有提高，一种可能的解释：CNN特征也有一定的信息冗余，信息冗余所带来的影响比降维所带来的损失的影响要更大。结论：You should reduce the dimension of CNN when you use if.

PCA降维对CNN特征的影响

上面实验使用的是本项目代码，图像数据集使用的是Caltech101。

关于PCA对PCA降维的影响，Neural Codes for Image Retrieval中也有探讨，以及曾跟Adrian Rosebrock也有过这方面的交流：

ANN is really fantastic, it makes such much easier. You could also try something like PCA on your 4096-d vector and try to get it down to 128-d. It would save some space and (ideally) not hurt accuracy.

所以，如果采用了CNN特征的话，推荐将其降维到128D。

CNN资源列表

C++

conv-net-version-3，对应博客Convolutional Neural Networks III

Python

Keras，强力推荐

Keras资源列表：

DeepLearning tutorial（6）易用的深度学习框架Keras简介

DeepLearning tutorial（7）深度学习框架Keras的使用-进阶

Keras VGG-16模型 VGG16 model for Keras

PDNN，对应主页PDNN: A Python Toolkit for Deep Learning

Matlab

GoogLeNet, A GPU Implementation of GoogLeNet.

cnn-for-image-retrieval's People

Contributors

Stargazers

Watchers

Forkers

wenxiaobai vickkyy lgbwust peterpan1990 qss2012 xaccc summer7q blsky garfielder007 digitalimagep milestonesvn julianyu123456 deepxkn hastyj lukeandshuo lansatiankong echohenry2006 sanwenjohnny linzhineng fujianhai chizhizhen cvml simpsonyinsc sunyinhuicoding caomw haotaolv pxjw zhouxiazx qiansen swearos brettll cezi127 wang-lei xysoul hopef silasxue solertis matrixplayer shinexunju angrysquirrel shawnshuailin zhouzhenkun papamadeleine2022 happyimageretrieval dcfucheng tonychouzju aihgf liumaoyang sayiho foreveract matrixping wenyafei4 lijiannuist alululululululu wwwanghao eric-jixiang zyms5244 clarkwang1214 han3732 goofysong jianweilin xiuyu999 huaiwen cavalleria tangwu22 cuixue leezqcst sunxingxingtf wangxiaofanw htyao89 garfield2005 wenxuanliu marklly lijian8 elviswf kevinwenya boluoyu kylinxu zhangxiaodi syhawk mayanxin89 cluo1989 algpower guoshengcv queenie88 zgsxwsdxg farshidfarhat yangqimaya panda409 heroonline sailinghang woshidaerduotu99 andygoo bikong2 songyandong linsong8208 henuzxk pustar perfect28 nansbas

cnn-for-image-retrieval's Issues

faiss

大神有没有评估过faiss在处理咱这一系列绕着image retrieval的工作中有多大实际的提升吗

ESP GAME

Do you have the ESP Game dataset? If you have, would you give me the dataset？
Thank you.

You might be interested in Deep Video Analytics

Hey, saw your repo and blog about visual search.
You might be interested in my project Deep Video Analytics.
Which implements full fledged visual search and data analytics engine.

https://github.com/AKSHAYUBHAT/DeepVideoAnalytics/

retrieval_virsulazation.m建议计算score用矩阵相乘,图像几十万张的时候会快很多

tools/retrieval_virsulazation.m

% %for loop = 1:n
% % VecTemp = featNorm(loop, :);
% % score(loop) = QueryVec*VecTemp';
% %end

score = (QueryVec*featNorm')';

用了http://www.vlfeat.org/matconvnet/pretrained/下的vgg-m-1024模型提取特征,CPU差不多一秒钟5张,GPU(GTX970)快了不少一秒钟50张左右

queryInDatabaseDemo里的256feat2048Norml.mat或者256feat4096Norml.mat需要自己生成吗？

请问博主queryInDatabaseDemo里的256feat2048Norml.mat或者256feat4096Norml.mat需要自己生成吗？自己改extractCNN这个文件？

修复reference to non-existent field 'filters'

具体修改地方为1和2。

Error using - Matrix dimensions must agree.

Error using -
Matrix dimensions must agree.
我有5000+张图片，跑到1982张的时候就报这个错误

PicSearch demo?

楼主您好，请问您博客中的Pic Search 的demo- “CNN-Web-Demo-for-Image-Retrieval”. 还开放吗？链接好像失效了.....也没有找到，不知道可否发一下链接呢? 谢谢。

>> extractCNN boost::filesystem::canonical 我遇到这个问题　　请指导！

does this method support to run on mobile device

I want to know how the compute cost and model size of this method.
does this method support to run on the mobile device?

首先感谢大佬的cnn case和各种hash benchmark！致敬！想请教下大佬，我用vgg提取了2048d的特征，在用hash索引的方法是不是不太好，我看到你用的是矩阵相乘排序的方式，因为我的图片大概有500多G 抽取下来的特征大概也有几个G，加载到内存有点太伤了，另外看到你关于CNN 抽取特征做PCA的实验非常感谢指点，这个也是个思路（有点好奇cnn输出的特征做dense输出和pca的效果），但是最终有必要用hash搜索吗，是否会有很大的损失。目前我试过2048D 做LSH 效果很不太好。

Mistake in computing average precision

Hi, I think there is mistake in computing average precision as ap(i,1) = sum(precision)/queryClassNum. According to the formula, the denominator should be the non-zero items in precision vector rather than the number of retrieval categories, right?

error in extractCNN

hi ,
my error is :

res = vl_simplenn(net, im_) ;

"Attempt to execute SCRIPT vl_nnconv as a function:
C:\Users\Mehrdad\Source\Repos\CNN-for-Image-Retrieval\matconvnet-1.0-beta17\matlab\vl_nnconv.m"
how can i solve it?

question for video retrieval

hi, i am very interested in your work on image retrieval, and my question is how can i apply it to the video retrieval domains. there mainly two pints make me confused:
firstly, image retrieval take CNN as the feature extractor , and the CNN mainly trained with the classification loss on such cifar-10 or landmark dataset(Oxf5k), can it generalize to the actual video scenarios(usually are not landmark)? fine-tuning needed? and how?
secondly, how can i calculate the similarity between the query video and reference video, my method is dividing video into key frames and match the similar frames by extracting the frame features
can you give me some opinion. thanks!