Occasionally one would like to revalidate only a certain subset of entries. For example, if the process crashed around entry 10 000 of 20 000, the following request should revalidate only entries 10 000 -> 20 000, skipping the first 10 000.
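A minimal sketch of what range-based revalidation could look like. The `Entry` type, the `from`/`to` parameters and the `revalidateRange` name are assumptions for illustration, not the real API:

```typescript
// Hypothetical range-based revalidation helper.
type Entry = { id: string };

async function revalidateRange(
  entries: Entry[],
  from = 0,              // inclusive start index, e.g. 10000 after a crash
  to = entries.length,   // exclusive end index, defaults to the full list
): Promise<number> {
  const subset = entries.slice(from, to);
  for (const entry of subset) {
    // await revalidate(entry); // the actual revalidation call would go here
  }
  return subset.length;  // number of entries actually processed
}
```

The `from`/`to` values could be taken from optional query parameters on the existing endpoint, defaulting to the full range when omitted.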
Currently, batch sizes are defined per dictionary. While fine for most cases, changing a batch size requires a code change and a redeploy of the API.
Let's allow an override in the controller endpoint, e.g. an optional batchSize=200 query parameter, which would then be fed to the revalidate method. It should still default to the original config values when not provided.
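A sketch of how the optional override could be parsed, assuming an Express-style controller; `DEFAULT_BATCH_SIZE`, `parseBatchSize` and the route shape are all placeholders, not the actual code:

```typescript
// Hypothetical default; the real value lives in the per-dictionary config.
const DEFAULT_BATCH_SIZE = 250;

// Accepts the raw query-string value; falls back to the configured
// default when the parameter is absent or not a positive integer.
function parseBatchSize(
  raw: string | undefined,
  fallback: number = DEFAULT_BATCH_SIZE,
): number {
  const n = Number(raw);
  return Number.isInteger(n) && n > 0 ? n : fallback;
}

// Sketch of the controller wiring (names assumed):
// app.post("/revalidate", (req, res) => {
//   const batchSize = parseBatchSize(req.query.batchSize as string | undefined);
//   revalidate({ batchSize });
// });
```

Validating the parameter server-side keeps a typo like batchSize=abc from silently breaking the batching.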
Currently, logs are only output in a way that can be observed from render.com. They could also be exposed via a simple log endpoint to avoid the roundtrip to Render. For example:
Move logging to a utility that preserves the current behavior but also (temporarily) persists the logs.
The persistence mechanism can be as simple as in-memory storage. It will be lost once the instance spins down, but that does not matter.
Expose that in-memory dump via the endpoint, preferably sorted latest first.
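The steps above could be sketched roughly like this; the names (`log`, `logStore`, `latestLogs`) and the entry cap are assumptions:

```typescript
type LogEntry = { timestamp: number; message: string };

const MAX_ENTRIES = 1000;        // cap so memory stays bounded
const logStore: LogEntry[] = [];

// Drop-in replacement for console.log: keeps the current behavior
// (visible in render.com's log view) and also persists in memory.
function log(message: string): void {
  console.log(message);
  logStore.push({ timestamp: Date.now(), message });
  if (logStore.length > MAX_ENTRIES) logStore.shift(); // discard oldest
}

// Latest-first dump, suitable for a simple GET /logs endpoint.
function latestLogs(limit = 100): LogEntry[] {
  return [...logStore].reverse().slice(0, limit);
}
```

Since the store is per-instance and capped, it needs no cleanup job; losing it on restart is acceptable per the note above.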
Currently, the same batch size of 250 entries is used for all dictionaries. However, the time it takes to revalidate the entries can vary quite a bit based on the size of the dataset being processed. For example, the Old Icelandic dictionary is quite fast, with few and concise entries, whereas the Old Swedish dictionary is both larger and generally has larger entries.
Add batch size to the per-dictionary setup. Preliminary values could be:
Old Norse: 250
Old Icelandic: 300
Old Swedish: 200
Old Norwegian: 200
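The per-dictionary setup could carry the preliminary values above in a simple lookup; the config shape, the dictionary keys, and `getBatchSize` are assumptions about how the existing setup might be extended:

```typescript
// Hypothetical per-dictionary batch sizes (preliminary values from above).
const BATCH_SIZES: Record<string, number> = {
  "old-norse": 250,
  "old-icelandic": 300,
  "old-swedish": 200,
  "old-norwegian": 200,
};

// Unknown dictionaries fall back to the current global default of 250.
function getBatchSize(dictionary: string, fallback = 250): number {
  return BATCH_SIZES[dictionary] ?? fallback;
}
```

Keeping the fallback means adding a new dictionary without a tuned value still works with the old global behavior.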
Something seems to have changed in Vercel, as more and more batches are timing out. Batches of up to 250 entries used to take a matter of a few seconds; now they can take more than 10 seconds (the timeout).
As this is not exactly time-critical, decrease the batch sizes even further. For example, 100 seems to work fine for Old Swedish.