Comments (15)
just a reminder to start working on this
I have, but training is too slow and it maxes out my CPU.
from srs-benchmark.
Transformers have many different implementations and hyperparameters, and they are mainly used for NLP rather than general time-series prediction. I don't know what the best practice is for our case.
How about a bidirectional LSTM? According to the PyTorch documentation, it's very easy to do: there is a hyperparameter for that.
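For reference, the hyperparameter in question is the `bidirectional` flag of `torch.nn.LSTM`. Here is a minimal sketch; the feature and hidden sizes are made up for illustration and are not the benchmark's actual settings:

```python
import torch
import torch.nn as nn

# Hypothetical sizes, chosen only for illustration.
n_features, hidden_size = 2, 8

# bidirectional=True adds a second LSTM that reads the sequence in reverse.
lstm = nn.LSTM(input_size=n_features, hidden_size=hidden_size, bidirectional=True)

# Default input layout is (seq_len, batch, features).
x = torch.randn(5, 3, n_features)
output, (h_n, c_n) = lstm(x)

# Forward and backward outputs are concatenated along the last axis,
# so the output feature size is 2 * hidden_size.
print(output.shape)
# h_n holds one final hidden state per direction.
print(h_n.shape)
```

Note that any layer consuming `output` needs to expect `2 * hidden_size` features, and that the reverse direction roughly doubles the parameter count of the recurrent layer.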
I tried adding the transformer code: 4801583
But training is very slow. Maybe I will train the transformer next month.
Btw, to make the results comparable, ensure that the transformer has roughly the same number of parameters as the LSTM.
It only has 492 parameters.
Yes. So the transformer should have around 500. If the transformer has many more parameters, it will be impossible to tell whether the RMSE differs because there are more parameters or because the architecture is different. By keeping the number of parameters more or less the same, we can see how much the architecture matters.
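One way to check this in PyTorch is to sum the element counts of the trainable tensors in each model. The sketch below assumes both models are `nn.Module` instances; `count_parameters` is a hypothetical helper, and the layer sizes are placeholders rather than the ones used in the benchmark:

```python
import torch.nn as nn


def count_parameters(model: nn.Module) -> int:
    """Return the number of trainable scalar parameters in a model."""
    return sum(p.numel() for p in model.parameters() if p.requires_grad)


# Placeholder models with made-up sizes, only to show the comparison.
lstm = nn.LSTM(input_size=2, hidden_size=8)
transformer_layer = nn.TransformerEncoderLayer(d_model=4, nhead=1, dim_feedforward=8)

n_lstm = count_parameters(lstm)
n_transformer = count_parameters(transformer_layer)

print(f"LSTM: {n_lstm} parameters")
print(f"Transformer layer: {n_transformer} parameters")
print(f"Ratio: {n_transformer / n_lstm:.2f}")
```

Adjusting `hidden_size`, `d_model`, or `dim_feedforward` until the ratio is close to 1 gives a roughly size-matched comparison.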
I mean that my current implementation of the transformer has 492 parameters, but I forgot how many parameters the LSTM has.
OK. Now their parameter counts are very close.
@L-M-Sherlock just a reminder to start working on this
Progress: 50%
It's worse than the LSTM, which is not surprising. Here is a related work: Are Transformers Effective for Time Series Forecasting?
Interesting, I'll take a look at the article later.
What is this?
It sets the size to 1 for each collection.