Comments (2)
It is benchmarked on cpu
branch of convent-benchmarks, with some modifications:
- We use Caffe+OpenBLAS as a reference
- Batch size is 64 for VGG-A and 128 for AlexNet & OverFeat. It is important that you use protobufs from
convnet-benchmarks
: specifications from other sources (e.g. in Caffe itself) can use different image sizes and numbers of channels in hidden layers. - Use caffe-nnpack to integrate NNPACK into Caffe. As
caffe-nnpack
is currently incomplete, and doesn't implement backward pass, you'd need to remove this line, so thatcaffe-nnpack
doesn't abort on backward pass. - Next, you need to modify prototxt with models to make some layers use NNPACK. Inside
conv_param
specifyengine: NNPACK
to make the layer compute through NNPACK.
from nnpack.
Thanks a lot! With your help I'm able to track down the problem: OpenBLAS I compiled myself leads to about 1.9x speedup compared to the one that comes with Ubuntu 14.04.
from nnpack.
Related Issues (20)
- NNPACK with Windows support HOT 4
- A compilation error occurs in the Linux ARM environment HOT 1
- potential unitialized variable in nnp_sgemm_upto_4x8__psimd HOT 1
- not found /bin/banchmarkxxx
- Why do more threads take longer?
- AltiVec/PowerPC (OpenPOWER ISA 3.0B or greater) Acceleration Support HOT 1
- CMakeLists.txt broken on MSYS2/MINGW64/AMD64 (Windows) HOT 3
- Real-time human detection on Pi 4 HOT 1
- 'vdotq_lane_s32' is invalid in C99 [-Wimplicit-function-declaration] HOT 1
- Build failed, cos_npi_over_8 is not available in common HOT 1
- ModuleNotFoundError: No module named 'peachpy.x86_64.avx' HOT 7
- make install dont link to libcpuinfo.so HOT 1
- NNPACK builds are not bit-for-bit reproducible HOT 1
- Unsupported Hardware on VM with compatible CPU HOT 3
- Does NNPACK fall back to non-accelerated code when "Could not initialize NNPACK! Reason: Unsupported hardware." occurs? HOT 1
- ld: in lib/libnnpack.a(conv1x1.py.o), section __TEXT/__const address out of range for architecture x86_64
- Use CPack for packaging HOT 1
- After Installing NNPACK on MacBook Pro 15, late 2012 retina, I still get: [W NNPACK.cpp:51] Could not initialize NNPACK! Reason: Unsupported hardware.
- CMake error cpuinfo-gitclone.cmake:40 (message): Failed to checkout tag: 'master'
- [W NNPACK.cpp:64] Could not initialize NNPACK! Reason: Unsupported hardware
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nnpack.