Comments (13)
Thank you for reporting this issue!
While I couldn't reproduce the problem myself, I believe I've fixed it in the dev branch. Could you try to compile that branch and confirm that the issue is resolved?
Thank you,
Pierre
from epa-ng.
Compiling now works. Thanks!
But I ran into another problem. Converting fasta to bfast format worked normally, but then I tried to do the phylogenetic placement and this error occurred:
terminate called after throwing an instance of 'std::runtime_error'
what(): Invalid substitution rate specification: {/////}+FU{///}+G4{}
[sfr1:69184] *** Process received signal ***
[sfr1:69184] Signal: Aborted (6)
[sfr1:69184] Signal code: (-6)
[sfr1:69184] [ 0] /usr/lib64/libpthread.so.0(+0xf6d0)[0x2b6d0a0d06d0]
[sfr1:69184] [ 1] /usr/lib64/libc.so.6(gsignal+0x37)[0x2b6d0a313277]
[sfr1:69184] [ 2] /usr/lib64/libc.so.6(abort+0x148)[0x2b6d0a314968]
[sfr1:69184] [ 3] /storage/software/gcc-5.2.0/lib64/libstdc++.so.6(_ZN9__gnu_cxx27__verbose_terminate_handlerEv+0x15d)[0x2b6d09689b6d]
[sfr1:69184] [ 4] /storage/software/gcc-5.2.0/lib64/libstdc++.so.6(+0x8cbb6)[0x2b6d09687bb6]
[sfr1:69184] [ 5] /storage/software/gcc-5.2.0/lib64/libstdc++.so.6(+0x8cc01)[0x2b6d09687c01]
[sfr1:69184] [ 6] /storage/software/gcc-5.2.0/lib64/libstdc++.so.6(+0x8ce18)[0x2b6d09687e18]
[sfr1:69184] [ 7] epa-ng(_ZN5raxml5Model15init_model_optsERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEERK13mixture_model+0x2f9b)[0x4cc4fb]
[sfr1:69184] [ 8] epa-ng(_ZN5raxml5Model16init_from_stringERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE+0x1b2)[0x4cdff2]
[sfr1:69184] [ 9] epa-ng(_ZN5raxml5ModelC1ENS_8DataTypeERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE+0x14e)[0x4ce39e]
[sfr1:69184] [10] epa-ng(main+0x3560)[0x488f50]
[sfr1:69184] [11] /usr/lib64/libc.so.6(__libc_start_main+0xf5)[0x2b6d0a2ff445]
[sfr1:69184] [12] epa-ng[0x4af7fe]
[sfr1:69184] *** End of error message ***
Any idea what might have gone wrong?
from epa-ng.
Can you post the exact command line call? Looks like you didn't properly specify the model parameters. Try to get them from your reference tree, or of you don't have them you can re-infer them using RAxML with the -f e
option, or using the new raxml-ng with the --evaluate
option.
RAxML will cretae a .info
file, which you can directly pass to epa-ng's --model
option.
from epa-ng.
see also here: https://github.com/Pbdas/epa-ng#setting-the-model-parameters
from epa-ng.
I used this method to create my reference tree https://www.polarmicrobes.org/phylogenetic-placement-re-re-visited/
And so I used the .info file from building the reference tree. But now I re-created it with -f e
option and the problem is gone. However, my query sequences were not aligned and caused another error. Can you recommend a MSA aligner with multiprocessor support?
from epa-ng.
If you want to be thorough, you can try our PaPaRa aligner, which was specifically developed with phylogenetic placement in mind: https://sco.h-its.org/exelixis/web/software/papara/index.html
It uses the reference tree as an additional source of information, and hence yields better alignments for this use case (at an increased computational cost...), as shown in https://doi.org/10.1093/bioinformatics/btr320 as well as http://sco.h-its.org/exelixis/pubs/Exelixis-RRDR-2012-5.pdf
Also, I created a simple MPI version of PaPara: https://github.com/lczech/papara_nt
from epa-ng.
papara works well, I reccommend it.
See also here for a full example of using placement: https://github.com/Pbdas/epa-ng/wiki/Full-Stack-Example
from epa-ng.
Thanks, I will try that. But two questions.
Papara only accepts reference alignment in phylip format, however I cannot convert my reference MSA from fasta to phylip, as my fasta headers are longer than 10 characters and cannot be truncated. Is it possible to use MSA in fasta format for Papara?
And the other question comes from your tutorial.
Papara, for example, outputs the aligned queries together with the reference MSA, in phylip format. (there will be a convenience function for this shortly)
How do you separate queries after aligning against reference MSA?
Thank you for the help!
from epa-ng.
Yes, this is a downside of PaPaRa - it has tob be phylip for reference and fasta for queries, nothing else works. We are currently starting with a project that will improve PaPaRa, and one of the first things I'll suggest is to allow for more flexible file formats.
Luckily for you, PaPaRa already supports relaxed Phylip, with headers as long as they need to be, and which are instead separated from the sequences by (at least one) space. See here for a converter: https://github.com/npchar/Phylogenomic/blob/master/fasta2relaxedPhylip.pl
from epa-ng.
I had this fatal error in Papara. I assume that there's something fishy with my reference sequences?
papara: papara.cpp:538: papara::references<pvec_t, seq_tag>::references(const char*, const char*, papara::queries<seq_tag>*) [with pvec_t = pvec_pgap; seq_tag = sequence_model::tag_dna]: Assertion 'unmasked.size() == seq.size()' failed.
from epa-ng.
The convenience function already exists and is called --split
. I will update the tutorial accordingly. The basic usage is: epa-ng --split ref_alignment query_alignments+
I'm closing this issue as the original bug has been fixed, but we would be happy to help you further on the placement google group (where any errors and solutions become much easier to search for for future users): https://groups.google.com/forum/#!forum/phylogenetic-placement
from epa-ng.
Re fatal error: Please post this again to our forum that @Pbdas mentioned, and maybe add the first few lines of your alignment.
from epa-ng.
please also note #20 since you are using the MPI version. Will fix that shortly.
EDIT: that issue is fixed as well, please pull again!
from epa-ng.
Related Issues (20)
- Split command using fasta files HOT 2
- Address issues with heuristic placement
- Memory management strategy for large trees / alignments
- Intuition for a common taxonomic-assignment procedure HOT 3
- The model parameters under data partitioning HOT 7
- Placements sensitive to outlier sequences? HOT 3
- -INF logl at branch 0 error HOT 3
- PLL assertion error HOT 3
- Issue with building from source: ld throws errors HOT 4
- Error running this command HOT 5
- Zero-length branches altered HOT 7
- Hello,I find issue running this program HOT 3
- error when doing the final epa-ng placement HOT 11
- Support for QMaker and nQMaker from IQ-TREE? HOT 1
- Treeparsing error HOT 6
- Compilation error on Bioconda HOT 4
- model info error HOT 2
- How to convert the epa results (jplace) to the newick format? HOT 3
- FastTree HOT 4
- Help on Aborted (core dumped) message HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from epa-ng.