Comments (3)
I feel like busco results might be not such accurate. In most cases, duplicated genes come from the corresponding homologous regions which haven't been fully purged. Probably you can check how do the corresponding nodes of these duplicated gene looks like in the assembly graph. I guess you should have a sense if these genes come from the homologous regions.
from hifiasm.
Taking a look at the GFA in Bandage, the specific contigs do not share edges (no bubbles or forks). None of the contigs have unusual GC content or read average read depth either. I am currently running a mummer alignment to gauge the level of sequence level similarity.
Interestingly, scaffolding the purge_dups assembly with 5 rounds of ntLink using the hifireads takes this BUSCO from being single copy complete to having 5 duplicates, 2 on the positive and 3 on the negative strands across 4 scaffolds. So using hifireads to gap fill and scaffold seems to reintroduce the duplicates. Lowering the s parameter to 0.35 in hifiasm does not lower the number of repeats.
from hifiasm.
Thanks... So it is hard to say
from hifiasm.
Related Issues (20)
- Possible missing one haplotype in human assemblies HOT 2
- No haploid.gfa files output in trio-binning mode HOT 3
- Hifi + Hi-c + ONT assembly fails
- In Trio-binning, always more on hap1 despite (almost) same sequences for paternal and maternal
- discontinuous assembly with shorter pacbio hifi reads but high coverage HOT 2
- Is x20 of Hifi data enough to construct draft assembly of 6.5Gb genome? HOT 1
- line 8: 110334 Aborted(core dumped) HOT 1
- Ultra Long intergration failed: no output for UL kmer counting HOT 3
- missing 8Mb sequences in the assembly HOT 6
- Empty haplotype 2 gfa files by ONT integration HOT 1
- Basic Question About HiFi Input HOT 3
- Spend too long times to run hifiasm HOT 6
- Switch error on X and Y chromosome HOT 2
- *.ovlp.bin file HOT 1
- Resolving switching error (?)
- Interchromosomal misjoin HOT 2
- Read error correction does not reduce the number of kmers present once, twice or three times HOT 1
- Recreate p_ctg from p_utg HOT 3
- In the diploid assembly, hfiiasm identified a value that did not exist in the k-mer plot as the "homozygous read coverage threshold".
- fungi diploid assemble phasing errors
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hifiasm.