Comments (4)
Hi @LiaOb21, could you please share a little piece of data to reproduce the problem? Or could you share the output_c1/pggb_input.community.1.fa.gz.7659a9d.417fcdf.171f02d.smooth.final.TAIR10.vcf.gz
file?
About the partition-before-pggb
, it should be equivalent to pggb
, so they should both work. I have no idea about it at the moment. Have you done any other tests?
from pggb.
I dowloaded the S. cerevisiae data to test the pipeline, how explained in your Sequence partitioning tutorial.
1. partition-before-pggb
About partition-before-pggb
I fixed the problem.
I'm going to explain how if this can be helpful for someone else:
I installed pggb through guix-genomic (more or less 2 months ago) and some packages were not installed automatically: wfmash
and python-igraph
(I installed both through conda).
Then, another important thing is that installing pggb through guix-genomics, I don't have the directories partition-before-pggb
is looking for. So, I had to clone the repostitory from GitHub and make executable (with the command chmod +x
) all the scripts contained in the ~/Software/pggb/scripts
directory. Then, I used the partition-before-pggb
script giving the entire path:
~/Software/pggb/partition-before-pggb -i scerevisiae7.fasta.gz -o output -n 7 -t 16 -p 90 -s 5k -V 'S288C:#:1000'
- [W::vcf_parse] Contig '[name]' is not defined in the header. (Quick workaround: index the file with tabix.)
I encountered the same error using S. cerevisiae data. I tried to run pggb on the community.0. The command I gave is this:
pggb -i scerevisiae7.fasta.gz.e363e5b.community.0.fa -o output -n 7 -t 16 -V 'S288C:#:1000'
And here the error:
[vg::deconstruct] decompose VCF
vcfbub -l 0 -a 1000 --input output/scerevisiae7.fasta.gz.e363e5b.community.0.fa.f5e9856.417fcdf.7493449.smooth.final.S288C.vcf.gz
0.04s user 0.00s system 15% cpu 0.28s total 3232Kb max memory
vcfwave -I 1000 -t 16
3.92s user 0.07s system 967% cpu 0.41s total 72388Kb max memory
[W::vcf_parse] Contig 'chrI' is not defined in the header. (Quick workaround: index the file with tabix.)
Encountered an error, cannot proceed. Please check the error output above.
If feeling adventurous, use the --force option. (At your own risk!)
Is there anything wrong in my command? Or maybe do I need someother package?
I installed separately:
vg
downloading the repository from GitHub (version v1.44.0 "Solara").vcfbub
through cargo (vcfbub 0.1.0).bcftools
(I previously had an old version) through miniconda (bcftools 1.16).
To try to solve the issue (even if I think these were installed with the guix installation of pggb, since I didn't have errors), I installed also:
seqwish
through miniconda (0.7.7)smoothxg
through miniconda (0.6.7)gfaffix
through miniconda (0.1.4)
I attach the .vcf.gz file.
Thank you so much in advance!
scerevisiae7.fasta.gz.e363e5b.community.0.fa.f5e9856.417fcdf.7493449.smooth.final.S288C.vcf.gz
from pggb.
@LiaOb21, I think I have understood your problem!
In the file you've shared, you have this line
##contig=<ID=S288C#1#chrI#0,length=219929>
where it was added #0
and the end of the reference name (that is S288C#1#chrI
). Moreover, the CHROM
column in the VCF contains chrI
and not that contig ID (that is 288C#1#chrI#0
).
This is happened because you've used a version of > 1.40.0
. In the Docker/Singularity/Bioconda version of pggb
, we are stick at vg 1.40.0
. At the moment, we do not recommend using vg
versions later than 1.40.0
to avoid these problems. Please use vg 1.40.0
and let me know if the issue is resolved!
from pggb.
Sorry for the late reply!
Apparently changing the version of vg resolved the problem. I am using 1.40.0 now and I don't get that error anymore.
Thank you so much for your help.
Best regards,
Lia
from pggb.
Related Issues (20)
- empty VCF after running PanGenIe on the pggb assembly HOT 1
- GFA with no P lines HOT 5
- DRB1-3123 example not producing a nice graph anymore after `biwflambda` update. HOT 5
- PGGB use case with hexaploidy genomes HOT 1
- force reference output in VCF HOT 2
- Three chromosome take too long time HOT 16
- High heterogeneity in sequences identity HOT 2
- extracting node path-coverage information HOT 3
- wfmash -Y option HOT 3
- About the result study HOT 4
- Question about the example "scerevisiae7.fasta.gz " HOT 1
- ValueError: too many values to unpack (expected 13) HOT 3
- Annotating the 1D pangenome graph visualisation with centromere coordinates
- Get the fasta file of non reference sequence
- PGGB get the fasta file of non reference sequence
- Building a graph from fragmented assemblies
- interoperability with vg - error:[vg::SmallSnarlSimplifier] Invalid graph on iteration 0 HOT 14
- Current Bioconda release does not find python scripts HOT 7
- Possible community detection bug HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pggb.