Comments (7)
you could also check the following references for more tips:
http://research.microsoft.com/pubs/192769/tricks-2012.pdf (gradient optimization tips)
http://www.springer.com/computer/theoretical+computer+science/book/978-3-642-35288-1 (if you don't have access to the book, you can find the individually the papers)
from matconvnet.
Hi,
what is a common practice is that on the validation set you tune hyperparameters - which is in case of CNNs the structure of the network itself. Because you are effectively optimising these parameters 'by hand' it may be prone to overfit on the validation dataset.
That is why most of the challenges do have seperate test set where the ground truth labels are not known and which is usually done on a remote server of the organiser. In this way, the organiser is able to limit the number of evaluation on the test set preventing any further over-fitting (in compute vision this is the case of Pascal VOC and ILSVRC datasets).
And if you want to prevent overfitting on the validation dataset, of course you can do any sort of cross validation. It's just that you usually do not have access to test labels...
There is also a lot of other resources on the internets describing this, e.g. this question...
from matconvnet.
Great. Sorry, being dumb. My question should have been avoided. Thanks
But, one questions, what parameters can I tune by validation set? Like learning rate, hidden layer etc?
from matconvnet.
Why sorry, I had no clue about this as well a year ago, so I'm quite happy to help :) But what I found is that google usually knows the most in the end...
Yeah, exactly what you say are the hyperparameters which people usually tune... Hyperparameters as those are parameters influencing the training and the model architecture (number of hidden layers, number of hidden units, size of the convolution kernels and their stride, max pooling sizes, ... it's gazilion of them in the end...).
from matconvnet.
Great! Thanks a lot. I am refreshing memory and learn a lot too. :)
from matconvnet.
thanks!
from matconvnet.
Thanks for the tips too! Haven't heard about the "Neural Networks: Tricks of the Trade" book before...
from matconvnet.
Related Issues (20)
- Latest versions vl_compilenn fail
- vl_compilenn directory
- Error in vl_testnn('gpu', true). MATLAB: mex: ErrInvalidMEXFile
- Error vl_compilenn('enableGpu', true) HOT 1
- cuda11.1 HOT 1
- 未定义的函数 'vl_testnn'
- this is why
- D:\Software\matconvnet-1.0-beta25\matlab\src\bits\datacu.hpp(89): error: identifier "cudnnConvolutionFwdPreference_t" is undefined D:\Software\matconvnet-1.0-beta25\matlab\src\bits\datacu.hpp(94): error: identifier "cudnnConvolutionBwdFilterPreference_t" is undefined D:\Software\matconvnet-1.0-beta25\matlab\src\bits\datacu.hpp(99): error: identifier "cudnnConvolutionBwdDataPreference_t" is undefined D:\Software\matconvnet-1.0-beta25\matlab\src\bits\datacu.hpp(141): error: identifier "cudnnConvolutionFwdPreference_t" is undefined D:\Software\matconvnet-1.0-beta25\matlab\src\bits\datacu.hpp(147): error: identifier "cudnnConvolutionBwdFilterPreference_t" is undefined D:\Software\matconvnet-1.0-beta25\matlab\src\bits\datacu.hpp(153): error: identifier "cudnnConvolutionBwdDataPreference_t" is undefined 6 errors detected in the compilation of "D:/Software/matconvnet-1.0-beta25/matlab/src/bits/data.cu". HOT 1
- every time i reopen the matlab porject,need to recompile the matconvnet,why?
- Question about the matrix format of vgg19
- Error in vl_compilenn>mex_compile (line 583) and vl_compilenn (line 492) HOT 1
- How to solve?
- The NVCC compiler 'C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\bin\nvcc' does not appear to be valid. HOT 1
- Issue when compiling with GPU enabled HOT 1
- error in vl_compilenn HOT 1
- Is vl_nnconv implemented based on the FFT operator? HOT 1
- Convert pytorch model to matconvnet model
- Matlab crack after run again Matconvnet
- Reference to non-existent field ‘dilate‘.
- Have trouble compiling MatConvNet
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from matconvnet.