We should support some way to serialize the state of our queries to disk and then relo

A very wise observation from: <a class="issue-link js-issue-link" data-error-text="Fai

No, this is not being actively worked on at the moment. <span clas

I think serialization should be generally opt-in: maybe at <co

Serialization to disk about salsa HOT 7 OPEN

salsa-rs commented on August 28, 2024 13

Serialization to disk

from salsa.

Comments (7)

matklad commented on August 28, 2024 1

A very wise observation from: rust-lang/rfcs#1317 (comment)

In a strictly on-demand setting (IDE, not a compiler), serialization to disk creates more problems than it solves.

from salsa.

lnicola commented on August 28, 2024 1

In a strictly on-demand setting (IDE, not a compiler), serialization to disk creates more problems than it solves.

Note that some popular IDEs like Visual Studio actually use a disk database. VS migrated a while ago from a custom format to a SQLite database: https://devblogs.microsoft.com/cppblog/introducing-c-experimental-editor-tools/.

from salsa.

lpil commented on August 28, 2024 1

Hi! This would be a desirable feature for me. Is this being worked on?

Not trying to rush you, just trying to evaluate how suitable this library is for my use-case. Thank you. :)

from salsa.

matklad commented on August 28, 2024 1

No, this is not being actively worked on at the moment.

…

On Thu, 5 Mar 2020 at 16:38, Louis Pilfold ***@***.***> wrote: Hi! This would be a desirable feature for me. Is this being worked on? Not trying to rush you, just trying to evaluate how suitable this library is for my use-case. Thank you. :) — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#10?email_source=notifications&email_token=AANB3M3YCOSNRH5FCEPGGTLRF7BPRA5CNFSM4FYGX55KYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEN5XSAQ#issuecomment-595294466>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AANB3M56WYJNTWGXN5QGE73RF7BPRANCNFSM4FYGX55A> .

from salsa.

zseri commented on August 28, 2024 1

I think serialization should be generally opt-in:

maybe at salsa::database level:
rather coarse, without lazy loading or transparent spilling, useful for "whole session" store/load and short-term-running scenarios
or even per query:
fine-grained, with lazy loading and maybe transparent spilling, useful to reduce RAM usage in long-term-running scenarios

I think I already have a kind of usage scenario ("scenario" as in "salsa is currently not used, but I investigate potential usages") in zs-filecrawler.

Click to expand

That program first walks through a file list and computes the hash of each file. Then it iterates over the list of hashes, takes the first associated file, and calls a user-defined hook script on that file. It caches the hash list and the progress. It might not really fit the usual `salsa` usage scenario, but the target is similiar: avoid redoing work.

QueryGroup 1: 
  file_content(filepath) <-- hash_data(filepath)
  ^-[maybe lazy input]      --> association [filepath -> hash_of_file_data]

QueryGroup 2:
  hash2file(hash)    <-- call_hook(hash)
  ^-[input, from QG1]   --> implicit association [hash -> done(hook return value)]

Currently, I just take the "session serialization approach", deserialize at startup, and serialize at shutdown/interrupt, but this may lose some progress. I think that the zs-filecrawler utility program could benefit from salsa, but it requires some way to serialize the state (the split into two QueryGroups would simulate that, but it makes interleaving both parts more difficult, and reduces potential benefits).

from salsa.

matklad commented on August 28, 2024

A similar, but different feature is to allow to transparently spill rarely used values to disk.

IntelliJ relies on similar feature heavily: when you open a multi-million line project with lots of dependencies, indices become really huge.

Note that this is a significantly different setup from rustc, which operates on a crate at a time, and has a reasonable natural cap on the amount of data it must process simultaneously.

from salsa.

lpil commented on August 28, 2024

Thank you

from salsa.

Serialization to disk about salsa HOT 7 OPEN

Comments (7)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent