Comments (15)
Most parameters need "hard" bounds, because, for example, negative values or values outside of [0, 1] wouldn't make sense in the formulas. In fact, I don't think there are any parameters in FSRS that can span from -∞ to +∞.
Our current optimizer supports L2 regularization (it's called weight_decay in the documentation). I can benchmark it to see if it helps to decrease RMSE.
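For reference, a minimal sketch of what enabling this could look like, assuming the optimizer is a standard torch.optim optimizer; the tensor values and variable names below are illustrative, not the real FSRS ones:

```python
import torch

# Illustrative stand-in for the trainable FSRS parameter tensor.
w = torch.nn.Parameter(torch.tensor([0.4, 0.6, 2.4, 5.8]))

# weight_decay applies an L2 penalty on w at every update step,
# shrinking all parameters towards 0.
optimizer = torch.optim.Adam([w], lr=4e-2, weight_decay=1e-4)
```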
Aside from using default parameters as a starting point for optimization and choosing reasonable ranges for parameters, there is no way (that I can think of) to utilize the parameters of user A (or multiple users) to train FSRS on user B's data.
The default parameters are chosen by running FSRS on all collections, recording the optimal values, and taking the median. Btw, if you are curious about the distributions of parameters, check this out: https://github.com/open-spaced-repetition/fsrs-benchmark/tree/main/plots
Simply grouping all reviews into a single dataset wouldn't be meaningful.
Also, I don't know what you mean by "apply regularization to the parameters relative to the individual mean"; please explain it in detail.
For each parameter x_i, where i is the user, apply the regularization loss l_2 * (x_i - x̄)**2.
x̄ can either be the median, like it is now, or it can be a free parameter.
Using the median only as a default or to determine the range is less effective than regularization, and you can also use a validation set to optimize the l_2 coefficient, etc.
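A minimal sketch of this idea in PyTorch (values and names are illustrative; x̄ is the per-parameter median across users, or a trainable tensor if it is made a free parameter):

```python
import torch

# Per-user parameters being optimized (illustrative values).
x_i = torch.nn.Parameter(torch.tensor([0.4, 0.6, 2.4, 5.8]))
# Anchor x̄: per-parameter median across users; make it a
# torch.nn.Parameter instead to treat it as a free parameter.
x_bar = torch.tensor([0.4, 0.9, 2.3, 10.9])

l2 = 1e-3  # regularization coefficient, tuned on a validation set

def loss_with_regularization(prediction_loss: torch.Tensor) -> torch.Tensor:
    # Penalize deviation from the anchor instead of deviation from 0.
    return prediction_loss + l2 * torch.sum((x_i - x_bar) ** 2)
```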
Sure, having a hard bound could still be useful with regularization.
You'll have to offset each parameter by its default value; otherwise weight_decay will regularize all parameters towards 0, which isn't good. Thanks for having a look.
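One way to get that offset while still using the optimizer's built-in weight_decay is to train a delta from the defaults instead of the raw parameters; a sketch under that assumption (the default values below are placeholders):

```python
import torch

defaults = torch.tensor([0.4, 0.9, 2.3, 10.9])       # placeholder default values
delta = torch.nn.Parameter(torch.zeros_like(defaults))

# weight_decay now shrinks delta towards 0, i.e. it pulls the effective
# parameters towards the defaults rather than towards 0.
optimizer = torch.optim.Adam([delta], lr=4e-2, weight_decay=1e-4)

def effective_parameters() -> torch.Tensor:
    return defaults + delta
```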
Grouping all users into one single dataset for training would exceed my device's RAM.
It would also be biased towards users who have more reviews.
Sometimes I even want to use the mode as the default values instead of the median.
Sometimes I even want to use the mode as the default values instead of the median.
Well, it's time to go deep down the rabbit hole of estimating the mode of a continuous variable.
I know 3 ways of doing that: half-range mode, half-sample mode, and kernel density estimation. The first one is based on a simple principle: take (x_max - x_min)/2, use it as a sliding window and slide across the sample until you find the densest range. Repeat this process within that range. The second one is similar: divide the sample into two groups with an equal number of elements, and find the group with the smallest value of x_max - x_min. The last one is based on creating an empirical probability density function. So which one is better? No idea.
Here's the code, have fun:
Modes.zip
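For readers who don't want to open the archive, here is a minimal, stand-alone sketch of the half-sample mode idea described above (this is not the code from Modes.zip, just an illustration):

```python
import numpy as np

def half_sample_mode(sample):
    """Estimate the mode of a continuous sample by repeatedly keeping the
    densest contiguous half of the sorted data, then averaging what's left."""
    x = np.sort(np.asarray(sample, dtype=float))
    while len(x) > 2:
        half = (len(x) + 1) // 2                         # size of each candidate half
        widths = x[half - 1:] - x[: len(x) - half + 1]   # range of every contiguous half
        start = int(np.argmin(widths))                   # densest half wins
        x = x[start:start + half]
    return float(x.mean())
```

For example, half_sample_mode([0.1, 0.93, 0.95, 0.97, 3.0]) returns 0.94, landing inside the dense cluster rather than near the sample mean of 1.19.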
At least for the first interval after a Good rating, the mode is smaller than the median.
@L-M-Sherlock I think we should use the median in cases where the mode is the min (or max) allowed value, like for w_0:
Here, using the mode would just make w_0 = 0.1, the lower bound.
But when the mode is not the min/max value, I think it makes sense to use the mode. For example, here:
Do you have all of the parameters of all users saved? If so, can you give them to me via a Google Drive link (.json files from the result folder are fine too)? I'll calculate the new default parameters using the median in some cases and the mode in other cases.
So here's my idea: we will do a dry run of 3 sets of default parameters:
- Median parameters (already done)
- Mode parameters (I'll calculate them myself)
- Hybrid set where some values are modes and other values are medians
And then we'll see which set results in the lowest RMSE during the dry run.
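A sketch of how the three sets could be computed from the per-collection result files; the file layout and the "weights" key are assumptions about the .json format, and half_sample_mode refers to the estimator sketched earlier in this thread:

```python
import json
from pathlib import Path

import numpy as np

from modes import half_sample_mode  # the HSM sketch above, saved as modes.py (hypothetical)

# Assumed layout: one JSON file per collection, with the optimal parameters
# stored under a "weights" key; the real files under result/FSRSv4 may differ.
rows = [json.loads(p.read_text())["weights"] for p in Path("result/FSRSv4").glob("*.json")]
params = np.asarray(rows)  # shape: (n_collections, n_parameters)

median_set = np.median(params, axis=0)
mode_set = np.array([half_sample_mode(col) for col in params.T])

# Hybrid set: keep the mode only where it is not stuck on a clamping bound,
# otherwise fall back to the median. The observed min/max are used here as a
# stand-in for the real bounds.
lower, upper = params.min(axis=0), params.max(axis=0)
on_bound = np.isclose(mode_set, lower) | np.isclose(mode_set, upper)
hybrid_set = np.where(on_bound, median_set, mode_set)
```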
I saved all parameters in the .json files. You can refer to this code: https://github.com/open-spaced-repetition/fsrs-benchmark/blob/main/analysis.py
And then we'll see which set results in the lowest RMSE during the dry run.
I think it's not a problem of RMSE; it's a problem for new users. Using the median for w[0], w[1], w[2] and w[3] means the default first intervals are too long for half of the learners (and too short for the other half). But users complain that the first intervals are too long far more often than that they are too short.
I saved all parameters in the .json files. You can refer to this code
Yes, but I need the files themselves, and I don't want to run the benchmark myself if you have all the parameters saved.
EDIT: nevermind, I can just download them from here: https://github.com/open-spaced-repetition/fsrs-benchmark/tree/main/result/FSRSv4
Seems like mode estimation will have to wait: open-spaced-repetition/fsrs4anki#461 (comment)
So I did some preliminary testing on a smaller dataset, and it seems like I was right: the mode isn't useful in cases where it doesn't arise naturally but instead arises as an artifact of clamping. Here's an example:
Interestingly, in this case the mode is in the middle.
Btw, in order to calculate the mode, I use all three estimators (HRM, HSM, KDE) and then take the average of the two closest estimates.
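That combination rule is small enough to show directly; a sketch, with the three estimates passed in as plain numbers:

```python
from itertools import combinations

def combined_mode(hrm: float, hsm: float, kde: float) -> float:
    """Average the two estimates that agree most closely, ignoring the outlier."""
    a, b = min(combinations((hrm, hsm, kde), 2), key=lambda pair: abs(pair[0] - pair[1]))
    return (a + b) / 2
```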
I'll make a new issue about modes and all of this because it's technically unrelated to the current issue.