Comments (2)
torch.nn.CrossEntropyLoss
VS torch.nn.NLLLoss
same operation but different approaches
CrossEntropy
contains both LogSoftmax
and NLLLoss
(more descriptive)
NLLLoss
on the other hand takes logit output of F.log_softmax()
(more imperative)
from knowledge-distillation-pytorch.
conventional CrossEntropy
knowledge-distillation-pytorch/#playground/torch_cee_imperatively.py
Lines 11 to 14 in 4a7c13d
torch.nn.NLLLose
takes input
, target
; target must be 1d tensor directing to the max indexes
e.g. input = [[0.25, 0.25, 0.5], [0.1, 0.2, 0.7]], target = [2, 2]
knowledge-distillation-pytorch/#playground/torch_cee_imperatively.py
Lines 17 to 26 in 4a7c13d
torch.nn.CrossEntropyLoss
torch.nn.CrossEntropyLoss
consists of nn.LogSoftmax
and nn.NLLLoss
knowledge-distillation-pytorch/#playground/torch_cee_imperatively.py
Lines 29 to 35 in 4a7c13d
wait, why log softmax to calculate cross entropy err?
https://datascience.stackexchange.com/questions/40714/what-is-the-advantage-of-using-log-softmax-instead-of-softmax
https://stats.stackexchange.com/questions/436766/cross-entropy-with-log-softmax-activation
TODO
- read more about the references above
from knowledge-distillation-pytorch.
Related Issues (4)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from knowledge-distillation-pytorch.