Why the speed in minicaffe is much worse than caffe with same prototxt

Question

I compare the resnet from your run_test.cpp.
But the performance is like below. Th

luoyetx · Answer

would you mind to paste your test code?

yonghenglh6 · Answer

In mini-caffe, I just add a "for(int i=0;i<100;i++)" before "test.Forward();" in ru

yonghenglh6 · Answer

I get the memory use from "nvidia-smi" with watching in the flesh.

yonghenglh6 · Answer

would u like to put up your performance with the resnet for a bug checking of mine? T

luoyetx · Answer

I will test the network prototxt on 1070 later. With more details on mini-caffe and of

yonghenglh6 · Answer

I use those code to test your every layer's time. But cannot find the reason.

yonghenglh6 · Answer

Your net->Forward(2,3) give me an error. So I can only use net->Forward(0,x) to

yonghenglh6 · Answer

As the net get longer, the performance diff between mini-caffe and caffe become larger

yonghenglh6 · Answer

Update the performance up.

yonghenglh6 · Answer

I checked the cudnn and assured it ran well by adding some output info.

luoyetx · Answer

please refer to <a href="https://github.com/luoyetx/mini-caffe/blob/master/profile.md"

yonghenglh6 · Answer

I have tried Profile in the beginning, but the time shown in chrome is not consistent

luoyetx · Answer

Pay attention to the Timer, it's not accuracy, use Profiler:

luoyetx · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

luoyetx · Answer

I find the performance is not stable under Windows platform, I will test on Linux late

yonghenglh6 · Answer

With your new benchmark tool, I found the bn layer is the main part that cause the dif

luoyetx · Answer

There is an optimization in this <a href="https://github.com/BVLC/caffe/commit/e93fcd2

yonghenglh6 · Answer

Everytime you request memory from pool, the blob will be in uninitial state, then it w

luoyetx · Answer

The default behave is the same in official Caffe <a href="https://github.com/BVLC/caff

yonghenglh6 · Answer

Yes, but the official Caffe need not reallocate blob every forward and not call the fu

luoyetx · Answer

you mean <a href="https://github.com/luoyetx/mini-caffe/blob/master/src/syncedmem.cpp#

yonghenglh6 · Answer

Yes.

Why the speed in minicaffe is much worse than caffe with same prototxt about mini-caffe HOT 22 CLOSED