Hi, I add a test net for the MobileNet-V1, and get caffe testing net accuracy numb

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

About Top1 and Top5 result about mobilenet-caffe HOT 5 CLOSED

fanghuaqi commented on June 6, 2024

About Top1 and Top5 result

from mobilenet-caffe.

Comments (5)

shicai commented on June 6, 2024

please use original images (not lmdb).

from mobilenet-caffe.

fanghuaqi commented on June 6, 2024

Hi shicai, I also tried to use original images to do inference on mobilenet-v2, the result I get is:

I0704 09:38:38.086778  3370 caffe.cpp:310] Loss: 1.17715
I0704 09:38:38.086797  3370 caffe.cpp:322] accuracy@1 = 0.712339
I0704 09:38:38.086804  3370 caffe.cpp:322] accuracy@5 = 0.901795
I0704 09:38:38.086813  3370 caffe.cpp:322] loss = 1.17715 (* 1 = 1.17715 loss)

This is the test prototxt data input section:

layer {                                                                                                                                                                                                                                     
   name: "data"                                                                                                                                                                                                                              
   type: "ImageData"                                                                                                                                                                                                                         
   top: "data"                                                                                                                                                                                                                               
   top: "label"                                                                                                                                                                                                                              
   include {                                                                                                                                                                                                                                 
     phase: TEST                                                                                                                                                                                                                             
   }                                                                                                                                                                                                                                         
   transform_param {                                                                                                                                                                                                                         
     scale: 0.017                                                                                                                                                                                                                            
     mean_value: [103.94, 116.78, 123.68]                                                                                                                                                                                                    
     mirror: false                                                                                                                                                                                                                           
     crop_size: 224                                                                                                                                                                                                                          
   }                                                                                                                                                                                                                                         
   image_data_param {                                                                                                                                                                                                                        
     source: "val.txt"                                                                                                                                                                                                                       
     new_height: 256                                                                                                                                                                                                                         
     new_width: 256                                                                                                                                                                                                                          
     batch_size: 20                                                                                                                                                                                                                          
     root_folder: "ILSVRC2012/val/"                                                                                                                                                                                                          
   }                                                                                                                                                                                                                                         
 }

The result is similar to the values I directly use LMDB in mobilenet-v2. And still lower than the official one, is there anything wrong in this test prototxt?

Is the steps in eval_image.py correct for inference? I see in that script, the image is cropped then resize to 224x224 not resize to 256xN, then crop to 224x224.

Thanks
Huaqi

from mobilenet-caffe.

shicai commented on June 6, 2024

eval_image.py is just an example for evaluating single image.
to reproduce the performance on imagenet val dataset, you should resize the image to 256xN, then crop the center 224x224 regions, then feed it into the model.

from mobilenet-caffe.

17759205390 commented on June 6, 2024

@fanghuaqi @shicai @lutzroeder @mn-robot 弱弱的请问一下top1和top5可以详细解释一下吗？？？

from mobilenet-caffe.

fanghuaqi commented on June 6, 2024

@17759205390 https://caffe.berkeleyvision.org/tutorial/layers/accuracy.html

from mobilenet-caffe.

Recommend Projects

About Top1 and Top5 result about mobilenet-caffe HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent