dongyp13 / stochastic-quantization Goto Github PK

Training Low-bits DNNs with Stochastic Quantization

Shell 0.38% CMake 1.21% Makefile 0.27% HTML 0.08% CSS 0.10% Jupyter Notebook 57.19% C++ 33.16% Python 4.00% Cuda 2.59% MATLAB 0.36% M 0.01% Protocol Buffer 0.66%

caffe cifar deep-neural-networks imagenet quantization

stochastic-quantization's People

Contributors

Stargazers

Watchers

Forkers

liuguoyou suzhenghang baiyancheng20 ml-lab mornydew barongeng alphalfc yogsin yamlong xugithub1 haiyang21 keilsmart ewenwan minhson greenfigo2015 sroot0 axc888

stochastic-quantization's Issues

About bwn model

Hi . Thank you for provide your great code!

I have a two question.!

In paper, vgg9 model(cifar10) don't use data augmentation. But , this code use data augmentation like crop and mirror. In paper vgg 9 get 10% error in cifar 10. Is this the result of using data augmentation? ...
Is this code saved as a 32bit model or binary model ?

Thank you !!

Quantize the first and last layers

@dongyp13 Hi, I read your paper and thanks for sharing code.
I am now trying to reproduce your paper with tensorflow. Since I do not know anything about caffe, so I just need to reproduce it with the information only on paper.

My question is,

did you quantize the first and last layers of the network?
and I wonder if you got the results by ensemble method.
Finally, as an example of the first stage, I wonder whether 50% stochastic sampling was performed in the entire layer or 50% sampling was performed in each layer.

Again, thanks for sharing the code and congratulations on your acceptance at the BMVC conference.
Best Regards.

您好，我想请教一下您BWN的训练细节。

我最近在研究量化相关的方向，看到了您的论文。注意到您使用ResNet-56在cifar100上取得了35.01的错误率，但我自己实现时最高只有43。

我去看了您的resnet56的代码，但因为我没学过caffe所以看不懂。

想请教您一下，您选取的optimizer、学习率退火方式以及相应的超参数是什么？

还有，我看您在别的issue里提到您只量化了weight，请问这里的weight包括bn层及bias吗？

十分感谢

Ternary weights not reflected in caffe model

Hi @dongyp13,
Thank you for sharing the code.
I am training the ImageNet/AlexNet-BN/SQ-TWN from scratch. The network seems to be learning, 50K iterations of the first stage have run so far, and I can see the Top1/Top5 accuracy as 0.12/0.28.
I extracted the trained weights from this caffemodel snapshot, and was expecting that 50% (= r) of the weight values would be +1/0/-1, but I dont see such quantized weights for any conv/fc layer in the caffe model.
Could you please guide from where to obtain the quantized weights? Is there any special way to dump them, or will they get reflected in the caffe model after all the stages have run?

Line 144 in fa384c1

(Dtype)1., top_diff, binary_weight_.gpu_data(),

dongyp13 / stochastic-quantization Goto Github PK

stochastic-quantization's People

Contributors

Stargazers

Watchers

Forkers

stochastic-quantization's Issues

About bwn model

Quantize the first and last layers

您好，我想请教一下您BWN的训练细节。

Ternary weights not reflected in caffe model

input与bias的量化

Pretrained models

BWN反传梯度更新

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent