Comments (3)
Thanks! The content of the linked post is the following.
It usually depends on the problem you are trying to solve. For a binary classification problem, you might want a sigmoid activation function for the last layer (or softmax in the case of a multiclass problem), so that you get an estimate of the probability of your input belonging to the specified class(es). For regression, however, there may be no need for a final activation function, because we want the network to predict a continuous range of values rather than something restricted to a range like (0, 1).
It may also depend on the cost function you use, because certain loss functions in PyTorch include the final non-linearity in their own implementation, so we can avoid defining a final activation function explicitly in our network (e.g. `CrossEntropyLoss` combines `LogSoftmax` and `NLLLoss`).
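As a rough illustration of the last point, the sketch below checks that applying `CrossEntropyLoss` to raw network outputs gives the same value as applying `LogSoftmax` followed by `NLLLoss`. The layer sizes, the single linear layer standing in for a network, and the random inputs are assumptions made up for this example; they do not come from the course material.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical sizes, chosen only for illustration.
batch_size, in_features, num_classes = 4, 10, 3

# Stand-in "network": it ends in a linear layer with no final activation,
# so its outputs are raw scores (logits).
model = nn.Linear(in_features, num_classes)

x = torch.randn(batch_size, in_features)
target = torch.randint(0, num_classes, (batch_size,))

logits = model(x)

# Option 1: CrossEntropyLoss applied directly to the logits.
loss_ce = nn.CrossEntropyLoss()(logits, target)

# Option 2: an explicit LogSoftmax over the class dimension, then NLLLoss.
log_probs = nn.LogSoftmax(dim=1)(logits)
loss_nll = nn.NLLLoss()(log_probs, target)

# The two losses should agree up to floating-point tolerance.
same = torch.allclose(loss_ce, loss_nll)
print(same)
```

If the network already ended in a softmax, passing its output to `CrossEntropyLoss` would apply the non-linearity twice, which is why the final activation is usually left out in that setup.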
We can either add this to the blog or remove it altogether.
What is your opinion?
from nyu-dlsp20.
Sorry, I hadn't seen the comment. I think the better option is to add the text to the blog. I can do that if you are still interested.
Yes, please, go ahead.
Then we have to track this across all languages. Comment to issue #144.