Giter Club home page Giter Club logo

Comments (3)

roddypr avatar roddypr commented on September 3, 2024 1

Dear Katarzyna,

Thank you for your very clear explanation. It helps a lot because I would like to configure snakePipes to be used across different people in my department.

Just to make sure I understand, in my case I should add the following to a local cluster config file (running with --clusterConfigFile):

plotCorr_bed_spearman:
    memory: 6G

The job state report is indeed a bit confusing. My colleagues mentioned that sometimes there are peaks of memory use that are not accurately measured by seff, which could explain the problem (in fact I will try running the pipeline with 2G for this step, which is probably enough).

Best wishes,

Roddy

from snakepipes.

katsikora avatar katsikora commented on September 3, 2024

Hi Roddy,

it looks like you've managed to fix the issue on your own.

Indeed, there is one global memory resource setting defined as __default__ , and then rule-specific memory declarations .
The rule-specific declarations 'override' the default setting for the listed rules. All other rules are consuming the default memory amount.

If a rule fails due to hitting a memory limit, you can add a declaration for that rule to your cluster_config.yaml and assign a value.
I'd recommend to copy the cluster_config.yaml from your output directory and modify the copy, the submit it to snakePipes workflow with --clusterConfigFile .
Increasing the default memory as you did will also work, but you will end up reserving more resources for all the rules that are using the 'default' memory value on your cluster environment.

To be honest, this job state report is a bit confusing:
there is an out_of_memory state listed, but exit code is 0? Also the reported memory usage looks like well under the 1Gb assigned to it. But if you're saying that assigning 6Gb as default fixed it, then it might indeed have been the issue.

Hope this helps,
Best wishes,

Katarzyna

from snakepipes.

katsikora avatar katsikora commented on September 3, 2024

Dear Roddy,

I'd have thought that you might be facing a one-off issue with a particularly large dataset e.g. a very high number of samples.
In that case, rerunning the workflow with a custom cluster config file would be good enough.

If you think this is rather going to be a recurrent issue, you might consider configuring your snakePipes installation with snakePipes config --clusterConfig and passing a custom "shared" cluster config file. You can see the default "shared" cluster config here.
For the rule plotCorr_bed_spearman, it should be alright to put it in this "shared" cluster config. A couple of other deepToools-based rules are also defined there.

The cluster config that you see in your workflow output folder is a merge between the "shared" and the workflow-specific cluster configs. The merge is done at runtime.

Let me know if you have any other questions,

Best wishes,

Katarzyna

from snakepipes.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.