Comments (13)
Hi @MichelMoser ,
This is unexpected behavior, did you let it run to completion. Maybe it's an issue with coverage of one region that is maybe causing this. What is the coverage of your bam file?
I will discuss this with @tpesout to see if it's something we've seen before.
from helen.
Hi @kishwarshafin ,
I let it run for additional 24 hours once it reached the
> Polishing 99% complete (541585/546365). Estimated time remaining: 17m 57s
but h5 files did not change since then.
Average coverage is about 60 x.
I thought downsampling was implemented in the .json "maxDepth" when generating images?
Best,
Michel
from helen.
I ran two polishing runs since last night with docker and both finished correctly. Would it be possible to prune the images and run it one more time?
If you are spending too much time on this then if you share the files I can try to see what is causing the issue.
You are right, the maxDepth controls the downsampling.
from helen.
What do you mean by pruning the images?
Yes will rerun marginpolish one more time and report back.
from helen.
docker rmi <helen_docker_image>
remove the existing docker image of and pull it again. It would be crazy if this works.
from helen.
i assume using singluarity instead of docker is not the source of error
from helen.
hmm, its running for 1.5 half days now straight and still at "Polishing 99% complete". Gave it 96 threads.
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
177576 michelmo 20 0 374.1g 369.8g 1488 S 291.8 12.2 125225:18 marginPolish
How long are marginpolish runtimes for human genomes normally?
from helen.
@MichelMoser , sorry about this. Usually, a human genome takes about 10-15 hours, on 96 threads it should take less than that. This would be your second run where it got stuck, is that right?
from helen.
yes, its the second run. still at "Polishing 99% complete". You said it might be coverage problems?
I ran polishing with PEPPER simultaneously and it ran through without a hitch within 23.5 hours (on GPUs).
from helen.
@MichelMoser , at this point, with all the improvements in the basecaller, you should be able to see similar results with PEPPER and MarginPolish-HELEN.
If it's not inconvenient for you, I'd like to keep this issue open and get back to it to see if it happens to any other assemblies. This is very unusual and should be looked into.
from helen.
Hi @MichelMoser, I'm sorry you're having issues running this. I have seen something like this happen (not ever for 24 extra hours) with human reads aligned to GRCh38 with minimap2, in a very deep region flanked by very shallow regions (generally satellite DNA). There are some ways to verify this which I'm happy to do if you're willing to share your data. Also, running MarginPolish with the -a info
flag will produce log messages that can help diagnose.
from helen.
Hi @tpesout and @kishwarshafin ,
Thank you for the help and I am happy to share files. But before transferring the 180 GByte file, i could generate some coverage stats with mosdepth and send you the results if that's helpful.
Also I can rerun with the logging option and send you the output.
from helen.
@MichelMoser , coverage plots, and the log would be great!
from helen.
Related Issues (20)
- Plant species HOT 3
- MarginPolish HOT 3
- margin docker run fail HOT 27
- Issue on stitch.py HOT 7
- Compatibility with draft assembly from other assemblers HOT 3
- Models for Guppy 3.6 HOT 2
- some problem occurs when evaluating the data of na12878 chromosome 21 HOT 4
- Add helen to bioconda. HOT 5
- Are existing models suitable to polish a high heterozygosity plant genome assemblies? HOT 3
- How to install helen in a gcc4.8 enrivments? HOT 3
- MarginPolish output zero .fa file HOT 9
- torch.set_num_threads error in docker image HOT 3
- How to train a new model with in lab testing data HOT 2
- Exception: process 6 terminated with signal SIGKILL HOT 5
- wrong script in example Usage in README.md HOT 1
- Only Polish interested sequences HOT 3
- Question about model's training data HOT 2
- there are no other mammal models other than human? HOT 2
- helen polish UserWarning on batch size
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from helen.