Comments (2)
Hi worulz, thanks for your careful experiment; it really clears up my confusion.
As for your no_grad operation: I think main.py doesn't include a validation or prediction step; it only trains the model. The predict function, in my opinion, just reports the loss for that training epoch, so you can consider it part of the training process.
I'm not sure whether this is correct, but I think no_grad is meant for the validation or test process. It's necessary if you want to evaluate the model, just not in this place; it probably belongs in a separate function.
Thank you again for your clear pics for comparison.
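For reference, here is a minimal sketch of what a separate evaluation function with eval mode and no_grad could look like in PyTorch. The model, the loss, and the `evaluate` helper are hypothetical stand-ins for illustration; they are not the repo's actual encoder/decoder or its predict function.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in model; the repo's encoder/decoder are not reproduced here.
model = nn.Sequential(nn.Linear(4, 8), nn.Dropout(p=0.5), nn.Linear(8, 1))
loss_fn = nn.MSELoss()


def evaluate(model, x, y):
    """Compute predictions and loss without tracking gradients."""
    was_training = model.training
    model.eval()                  # disables dropout, uses running batch-norm stats
    with torch.no_grad():         # skips building the autograd graph
        pred = model(x)
        loss = loss_fn(pred, y)
    model.train(was_training)     # restore whatever mode the model was in
    return pred, loss.item()


torch.manual_seed(0)
x_val, y_val = torch.randn(16, 4), torch.randn(16, 1)  # made-up validation batch
pred_a, loss_a = evaluate(model, x_val, y_val)
pred_b, loss_b = evaluate(model, x_val, y_val)
```

Because dropout is switched off in eval mode, the two calls return identical predictions, and the outputs carry no gradient history. The training loop itself would keep calling `model.train()` and would not be wrapped in no_grad.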
from da-rnn.
Here's my experiment with and without tanh in the encoder. Note that I set the model to eval mode and used no_grad before predicting, and used no_grad during validation. This differs from the repo, and I believe it should have been implemented there.
In addition, the validation loss decreases faster during training with tanh. Results after 10 epochs:
[Plot: with tanh]
[Plot: with tanh in encoder]
Note: for testing purposes, I trained, validated, and predicted over the whole dataset. My assumption was that I should get 99%+ accuracy if the underlying equations are working properly.
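To make the tanh comparison concrete, here is a rough sketch of the two variants of the encoder's input-attention score. It assumes the score has the form v^T tanh(W[h; s] + U x^k) as I understand it from the DA-RNN paper; the tensor sizes and the parameters `W`, `U`, and `v` are random, made-up values for illustration and are not taken from the repo.

```python
import torch

torch.manual_seed(0)
hidden, T, n_series, batch = 8, 5, 3, 2   # illustrative sizes only

# Hypothetical attention parameters (random, not trained)
W = torch.randn(T, 2 * hidden)   # projects the concatenated [h; s] state
U = torch.randn(T, T)            # projects one driving series of length T
v = torch.randn(T)               # reduces each projected vector to a scalar score

h_s = torch.randn(batch, 2 * hidden)   # concatenated hidden and cell state
x = torch.randn(batch, n_series, T)    # one row per driving series

# Pre-activation, broadcast over the series axis: (batch, n_series, T)
pre = (h_s @ W.T).unsqueeze(1) + x @ U.T

scores_tanh = torch.tanh(pre) @ v      # with tanh: (batch, n_series)
scores_linear = pre @ v                # without tanh

# Attention weights over the driving series
alpha_tanh = torch.softmax(scores_tanh, dim=1)
alpha_linear = torch.softmax(scores_linear, dim=1)
```

With tanh, each score is bounded by the sum of the absolute values of `v`, so the softmax inputs cannot grow without limit. Without it, the scores are an unbounded linear function of the inputs. This sketch only shows the structural difference and says nothing about which variant trains better.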