Comments (5)
That's a good point. There will be some shift in this case, and TD (targeted dropout) would likely benefit from some form of rescaling of the neurons that fall within the "queue".
It would be nice to see experiments measuring this effect.
from targeted-dropout.
By the way, I just applied at FOR.ai!
We do this because, with normal dropout, the intention is to put the dropped neurons back into the network, which changes the expected norm of the activations. The kept neurons therefore need to be scaled so that the expected norm stays about the same once the dropped ones are restored.
In our case, we don't intend to restore the dropped weights/neurons, so there's no need to perform any rescaling.
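To make the rescaling argument concrete, here is a minimal sketch of standard (inverted) dropout in plain Python. It is not code from this repository: the function names, the example activations, and the 0.5 drop rate are all assumptions chosen for illustration. It only shows that scaling the kept units by 1 / (1 - drop_rate) keeps the average activation close to the value the unit has when nothing is dropped.

```python
import random


def inverted_dropout(activations, drop_rate, rng):
    """Zero each unit with probability drop_rate; scale survivors by 1 / (1 - drop_rate)."""
    keep_prob = 1.0 - drop_rate
    return [a / keep_prob if rng.random() < keep_prob else 0.0 for a in activations]


def mean_over_samples(activations, drop_rate, n_samples=20000, seed=0):
    """Monte Carlo estimate of each unit's expected value under inverted dropout."""
    rng = random.Random(seed)
    totals = [0.0] * len(activations)
    for _ in range(n_samples):
        for i, value in enumerate(inverted_dropout(activations, drop_rate, rng)):
            totals[i] += value
    return [t / n_samples for t in totals]


# Hypothetical activations, used only for illustration.
activations = [1.0, -2.0, 0.5]
estimated = mean_over_samples(activations, drop_rate=0.5)

# Each estimate should be close to the undropped activation.
for original, estimate in zip(activations, estimated):
    print(f"undropped={original:+.2f}  mean under dropout={estimate:+.3f}")
```

Without the division by `keep_prob`, the mean during training would shrink by a factor of `1 - drop_rate`, which is the mismatch the rescaling is meant to remove when all units are restored.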
Thank you!
What I noticed is that targeted dropout does not preserve the expected value of a weight/neuron between train and test time:
- during training, you sort the neurons/weights under a certain rule and perform dropout on the "queue";
- during testing, you sort the neurons/weights under the same rule but drop the "queue" deterministically;
so the average value of the neurons/weights is not the same at train and test time.
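The mismatch described above can be sketched with a few lines of plain Python. This is an illustration under stated assumptions, not the repository's implementation: the "queue" is taken to be the lowest-magnitude fraction of the weights, and the names `targ_frac` (fraction of weights targeted) and `drop_rate` (probability of dropping a targeted weight during training) are hypothetical parameters introduced here. No rescaling is applied, matching the behaviour discussed in this thread.

```python
def targeted_indices(weights, targ_frac):
    """Indices of the lowest-magnitude fraction of the weights (the "queue")."""
    n_target = int(len(weights) * targ_frac)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    return set(order[:n_target])


def expected_train_weights(weights, targ_frac, drop_rate):
    """Expected value of each weight when the queue is dropped stochastically, without rescaling."""
    queue = targeted_indices(weights, targ_frac)
    return [(1.0 - drop_rate) * w if i in queue else w for i, w in enumerate(weights)]


def inference_weights(weights, targ_frac):
    """Weights at test time, when the whole queue is dropped deterministically."""
    queue = targeted_indices(weights, targ_frac)
    return [0.0 if i in queue else w for i, w in enumerate(weights)]


# Hypothetical weights, used only for illustration.
weights = [0.1, -2.0, 0.3, 1.5]
train_mean = expected_train_weights(weights, targ_frac=0.5, drop_rate=0.5)
test_vals = inference_weights(weights, targ_frac=0.5)

# Per-weight gap between the training-time expectation and the test-time value.
gap = [t - s for t, s in zip(train_mean, test_vals)]
print(gap)
```

The gap is nonzero only for the weights in the queue, and it scales with `(1 - drop_rate)` times their magnitude, so it is small when the targeted weights are already close to zero.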
That's great to hear! Looking forward to reviewing your application.