Giter Club home page Giter Club logo

wf-ani's Introduction

bacterial-genomics/wf-ani

GitHub release (latest by date) Nextflow run with docker run with singularity MegaLinter

workflow

General schematic of the steps in the workflow

Contents

Quick Start: Test

Run the built-in test set to confirm all parts are working as-expected. It will also download all dependencies to make subsequent runs much faster.

Pull workflow from GitHub

nextflow pull bacterial-genomics/wf-ani

Run test workflow

nextflow run \
  bacterial-genomics/wf-ani \
  -r main \
  -profile <docker|singularity>,test

Quick Start: Run

Example command on FastAs in "new-fasta-dir" data using BLAST (ANIb) with singularity:

Pull workflow from GitHub

nextflow pull bacterial-genomics/wf-ani

Run workflow

nextflow run \
  bacterial-genomics/wf-ani \
  -r main \
  -profile singularity \
  --input new-fasta-dir \
  --outdir my-results \
  --ani blast

Introduction

This workflow performs average nucleotide identity on assembled and/or annotated files (FastA/Genbank).

Installation

Usage

nextflow run main.nf -profile <docker|singularity> --input <input directory> --outdir <directory for results> --ani <blast|fastani|skani>

Please see the usage documentation for further information on using this workflow.

Parameters

Note the "--" long name arguments (e.g., --help, --input, --outdir) are generally specific to this workflow's options, whereas "-" long name options (e.g., -help, -latest, -profile) are general nextflow options.

These are the most pertinent options for this workflow:

Required parameters

  ============================================
        Input/Output
  ============================================
  --input                 Path to input data directory containing FastA/Genbank assemblies or samplesheet. Recognized extensions are:  {fa,fas,fsa,fna,fasta,gb,gbk,gbf,gbff} with optional gzip compression.

  --query                 Path to input data FastA/Genbank file or samplesheet. Recognized extensions are:  {fa,fas,fsa,fna,fasta,gb,gbk,gbf,gbff} with optional gzip compression.

  --refdir                Path to reference panel data directory containing FastA/Genbank assemblies or samplesheet. Recognized extensions are:  {fa,fas,fsa,fna,fasta,gb,gbk,gbf,gbff} with optional gzip compression.

  --outdir                The output directory where the results will be saved.


  ============================================
        Container platforms
  ============================================
  -profile singularity    Use Singularity images to run the workflow. Will pull and convert Docker images from Dockerhub if not locally available.

  -profile docker         Use Docker images to run the workflow. Will pull images from Dockerhub if not locally available.


  ============================================
        Optional ANI tools
  ============================================
  --ani               Specify what algorithm should be used to compare input files. Recognized arguments are: blast, fastani, skani. [Default: blast]

Additional parameters

View help menu of all workflow options:

nextflow run \
  bacterial-genomics/wf-ani \
  -r main \
  --help \
  --show_hidden_params

Resource Managers

The most well-tested and supported is a Univa Grid Engine (UGE) job scheduler with Singularity for dependency handling.

  1. UGE/SGE
    • Additional tips for UGE processing are here.
  2. No Scheduler
    • It has also been confirmed to work on desktop and laptop environments without a job scheduler using Docker with more tips here.

Output

Please see the output documentation for a table of all outputs created by this workflow.

Troubleshooting

Q: It failed, how do I find out what went wrong?

A: View file contents in the <outdir>/log directory.

Contributions and Support

If you would like to contribute to this pipeline, please see the contributing guidelines.

Citations

An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.

wf-ani's People

Contributors

gregorysprenger avatar chrisgulvik avatar

wf-ani's Issues

Parentheses unsupported?

Description of the bug

Need to test more a clarify if parentheses are not supported. Especially for type strain comparisons it would be expected the assembly file to contain them such as "Leptospira_interrogans_(GCA_001292615.2).fna.gz"

This is an example error reported in the errors.tsv file for filenames containing parentheses.

877e682e483/.command.sh: line 28: syntax error near unexpected

Command used and terminal output

No response

Relevant files

No response

System information

No response

new assembly file doesn't get used in a precomputed outpath

Description of the bug

Adding a new assembly file into a precomputed outdir path didn't modify/update the comparisons/pairs.fofn and the comparisons/genomes.fofn as expected, so no new pairs were computed.

Command used and terminal output

nextflow \
 -log /my-proj/ANI/assemblies.all-vs-all/nextflow_log.2898751.txt \
 run /workflows/wf-ani/main.nf \
 -w /scratch/work \
 --ani blast \
 -ansi-log false \
 -profile rosalind_hpc \
 --input /my-proj/all-assems \
 --outdir /my-proj/ANI/assemblies.all-vs-all \
 -resume


### Relevant files

_No response_

### System information

_No response_

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.