Giter Club home page Giter Club logo

Comments (15)

L-M-Sherlock avatar L-M-Sherlock commented on June 24, 2024 1

just a reminder to start working on this

image

I have. But it's too slow and run out of my CPU.

from srs-benchmark.

L-M-Sherlock avatar L-M-Sherlock commented on June 24, 2024

Transformer has many different implementations and hyperparameters. And it's mainly used for NLP, instead of common time-series prediction. I don't know what's the best practice for our case.

from srs-benchmark.

Expertium avatar Expertium commented on June 24, 2024

How about a bidirectional LSTM? According to pytorch documentation, it's very easy to do, there is a hyperparameter for that.

from srs-benchmark.

L-M-Sherlock avatar L-M-Sherlock commented on June 24, 2024

I try to add the code of transformer: 4801583

But the training is very slow. Maybe I will train the transformer in the next month.

from srs-benchmark.

Expertium avatar Expertium commented on June 24, 2024

Btw, to make the results comparable ensure that the transformer has roughly the same number of parameters as the LSTM.

from srs-benchmark.

L-M-Sherlock avatar L-M-Sherlock commented on June 24, 2024

It only has 492 parameters.

from srs-benchmark.

Expertium avatar Expertium commented on June 24, 2024

Yes. So the transformer should have around 500. If the transformer has much more parameters, it will be impossible to tell whether RMSE is different because there are more parameters or because the architecture is different. By making the number of parameters more or less the same, we can see how much the architecture matters.

from srs-benchmark.

L-M-Sherlock avatar L-M-Sherlock commented on June 24, 2024

I mean my current implementation of transformer has 492 parameters. But I forget how many parameters the LSTM has.

from srs-benchmark.

Expertium avatar Expertium commented on June 24, 2024

image

from srs-benchmark.

L-M-Sherlock avatar L-M-Sherlock commented on June 24, 2024

OK. Now their sizes of parameters are very close.

from srs-benchmark.

Expertium avatar Expertium commented on June 24, 2024

@L-M-Sherlock just a reminder to start working on this

from srs-benchmark.

L-M-Sherlock avatar L-M-Sherlock commented on June 24, 2024

Progress: 50%

image

It's worse than LSTM. It's not surprised. Here is a related work: Are Transformers Effective for Time Series Forecasting?

from srs-benchmark.

Expertium avatar Expertium commented on June 24, 2024

Interesting, I'll take a look at the article later.

from srs-benchmark.

Expertium avatar Expertium commented on June 24, 2024

What is this?
image

from srs-benchmark.

L-M-Sherlock avatar L-M-Sherlock commented on June 24, 2024

What is this?

It sets the size at 1 for each collection.

from srs-benchmark.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.