Comments (7)
Not at the moment.
from pggb.
If this is pending the arrival of the -N
option, I could look at another branch.
from pggb.
I have align 5 5MB genomes and get 22 out of 224 blocks with loops. (This occurs with or without -N flag.) All paths contain many re-used segments in forward orientation and therefore form cycles (Interestingly, we never see a segment used more than twice). I could see this cycle behavior being desired - sibeliaz does a good job absorbing tandem repeats in this way - but the pggb result are less obvious. I have attached a few examples. pggb seems to refuse to smooth short, local (but not tandem) repeats creating these odd structures that, by my eye, need to be flattened. Thanks for any advice.
testDuplicates.txt
from pggb.
from pggb.
from pggb.
Thanks very much. pggb seems be doing well on STRs and VNTRs as evidence here and on some other simulated datasets. The issue above is related more to small local repeats triggering a loop. I have looked at one of the examples above in detail using flanking sequence (text file attached). The little bit of loop-triggering homology is shown in red square in attached dotplot. (On side note, block 700 in maf appears to be a spurious indel related to divergence.) All gfas and raw sequences available at https://github.com/USDA-ARS-GBRU/PanPipes/tree/main/testSets/pggb_5_salmonella.
dotplot.pdf
blockContext.txt
from pggb.
@bredelings @jnvaughn do you have more questions? Else I would like to close this one.
from pggb.
Related Issues (20)
- DRB1-3123 example not producing a nice graph anymore after `biwflambda` update. HOT 5
- PGGB use case with hexaploidy genomes HOT 1
- force reference output in VCF HOT 2
- Three chromosome take too long time HOT 16
- High heterogeneity in sequences identity HOT 2
- extracting node path-coverage information HOT 3
- wfmash -Y option HOT 3
- About the result study HOT 4
- Question about the example "scerevisiae7.fasta.gz " HOT 1
- ValueError: too many values to unpack (expected 13) HOT 3
- Annotating the 1D pangenome graph visualisation with centromere coordinates
- Get the fasta file of non reference sequence
- [W::vcf_parse] Contig '2' is not defined in the header. (Quick workaround: index the file with tabix.) HOT 4
- PGGB get the fasta file of non reference sequence
- Building a graph from fragmented assemblies
- interoperability with vg - error:[vg::SmallSnarlSimplifier] Invalid graph on iteration 0 HOT 14
- Current Bioconda release does not find python scripts HOT 7
- Possible community detection bug HOT 5
- Skip sequence partitioning? HOT 2
- Add/remove one assembly (or more) from a pangenome graph HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pggb.