General schematic of the steps in the workflow
- Quick Start
- Introduction
- Installation
- Usage
- Parameters
- Resource Managers
- Output
- Troubleshooting
- Contributions and Support
- Citations
Run the built-in test set to confirm all parts are working as-expected. It will also download all dependencies to make subsequent runs much faster.
nextflow pull bacterial-genomics/wf-ani
nextflow run \
bacterial-genomics/wf-ani \
-r main \
-profile <docker|singularity>,test
Example command on FastAs in "new-fasta-dir" data using BLAST (ANIb) with singularity:
nextflow pull bacterial-genomics/wf-ani
nextflow run \
bacterial-genomics/wf-ani \
-r main \
-profile singularity \
--input new-fasta-dir \
--outdir my-results \
--ani blast
This workflow performs average nucleotide identity on assembled and/or annotated files (FastA/Genbank).
- Nextflow
>=22.04.03
- Docker or Singularity
>=3.8.0
- Conda is currently unsupported
nextflow run main.nf -profile <docker|singularity> --input <input directory> --outdir <directory for results> --ani <blast|fastani|skani>
Please see the usage documentation for further information on using this workflow.
Note the "--
" long name arguments (e.g., --help
, --input
, --outdir
) are generally specific to this workflow's options, whereas "-
" long name options (e.g., -help
, -latest
, -profile
) are general nextflow options.
These are the most pertinent options for this workflow:
============================================
Input/Output
============================================
--input Path to input data directory containing FastA/Genbank assemblies or samplesheet. Recognized extensions are: {fa,fas,fsa,fna,fasta,gb,gbk,gbf,gbff} with optional gzip compression.
--query Path to input data FastA/Genbank file or samplesheet. Recognized extensions are: {fa,fas,fsa,fna,fasta,gb,gbk,gbf,gbff} with optional gzip compression.
--refdir Path to reference panel data directory containing FastA/Genbank assemblies or samplesheet. Recognized extensions are: {fa,fas,fsa,fna,fasta,gb,gbk,gbf,gbff} with optional gzip compression.
--outdir The output directory where the results will be saved.
============================================
Container platforms
============================================
-profile singularity Use Singularity images to run the workflow. Will pull and convert Docker images from Dockerhub if not locally available.
-profile docker Use Docker images to run the workflow. Will pull images from Dockerhub if not locally available.
============================================
Optional ANI tools
============================================
--ani Specify what algorithm should be used to compare input files. Recognized arguments are: blast, fastani, skani. [Default: blast]
View help menu of all workflow options:
nextflow run \
bacterial-genomics/wf-ani \
-r main \
--help \
--show_hidden_params
The most well-tested and supported is a Univa Grid Engine (UGE) job scheduler with Singularity for dependency handling.
- UGE/SGE
- Additional tips for UGE processing are here.
- No Scheduler
- It has also been confirmed to work on desktop and laptop environments without a job scheduler using Docker with more tips here.
Please see the output documentation for a table of all outputs created by this workflow.
Q: It failed, how do I find out what went wrong?
A: View file contents in the <outdir>/log
directory.
If you would like to contribute to this pipeline, please see the contributing guidelines.
An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md
file.