rDnaTools

rDnaTools is a Python package of tools and pipelines for working with ribosomal DNA sequence data generated with PacBio(R) SMRT sequencing. rDnaTools works by wrapping existing tools from microbial ecology, primarily the Mothur suite of utilities. Currently rDnaTools implements a single pipeline for the export, filtering, and clustering of 16S sequences.

Though primarily intended for use in analyzing 16S rDNA sequences, the same tools and approaches should apply equally well to 18S, 23S, or ITS sequences, provided that suitable reference sequences are supplied.

Requirements

The core functionality of rDnaTools is built upon Python 2.7, using the pbcore framework for accessing PacBio data files. In addition, rDnaTools wraps the functionality of a number of stand-alone command-line tools that must be available for the package to function.

Optionally, some secondary scripts in the rDnaTools package also require other Pacific Biosciences tools.
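Because the wrapped command-line tools must be available before the package can run, it can help to check for them up front. The following is a minimal sketch, not part of rDnaTools: the helper name `find_missing_tools` is hypothetical, the executable name `mothur` is an assumption, and the code uses `shutil.which` from the Python 3 standard library rather than the Python 2.7 that the package itself targets.

```python
import shutil


def find_missing_tools(tool_names):
    """Return the names in tool_names that cannot be found on PATH."""
    return [name for name in tool_names if shutil.which(name) is None]


if __name__ == "__main__":
    # "mothur" is an assumed executable name; extend the list as needed.
    missing = find_missing_tools(["mothur"])
    if missing:
        print("Missing required tools: " + ", ".join(missing))
    else:
        print("All required tools were found on PATH.")
```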

rDnaPipeline

The primary tool for analyzing rDNA sequence data is a script called "rDnaPipeline", which takes as input PacBio sequence data from ribosomal DNA amplicons. The pipeline accepts data in FOFN, BAS.H5, FASTA, or FASTQ format, and runs a sequential series of analyses, similar to Mothur's Batch Mode. The analysis is based on Mothur's recommended SOP for analyzing 454 rDNA sequence tags, with some modifications to account for the unique nature of PacBio's data.
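To illustrate how the four accepted input formats might be told apart, here is a minimal sketch that guesses the format from the file name. How rDnaPipeline actually detects its input is not described here, so the function `guess_input_format` and the extensions it recognizes are assumptions made for illustration only.

```python
def guess_input_format(path):
    """Guess the input format from the file name (assumed extensions)."""
    name = path.lower()
    # Check the compound ".bas.h5" suffix before any single-part suffix.
    suffix_to_format = [
        (".bas.h5", "BAS.H5"),
        (".fofn", "FOFN"),
        (".fasta", "FASTA"),
        (".fa", "FASTA"),
        (".fastq", "FASTQ"),
        (".fq", "FASTQ"),
    ]
    for suffix, fmt in suffix_to_format:
        if name.endswith(suffix):
            return fmt
    raise ValueError("Unrecognized input format: " + path)
```

A file name with any other extension raises `ValueError`, so an unsupported input is reported before any analysis would start.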

Since rDnaPipeline analyzes PacBio Circular-Consensus Sequence (CCS) data, the basic call to rDnaPipeline will vary slightly depending on which version of SMRT Analysis is being used, as the methods by which CCS data is generated and presented have changed.

If the rDNA sequence data was generated on a PacBio RS running SMRT Analysis v2.0, the basic call to rDnaPipeline.py looks as follows:

    rDnaPipeline.py FOFN -n PROCS -A ALIGNMENT_REF -C CHIMERA_REF

If the rDNA sequence data was generated on a PacBio RS running SMRT Analysis v2.1, running rDnaPipeline is a two-step process. First, the Reads_of_Insert protocol must be run as normal on the sequencing data to generate CCS data for the data set. Then rDnaPipeline.py is called as follows:

    rDnaPipeline.py READS_OF_INSERT -r FOFN -n PROCS -A ALIGNMENT_REF -C CHIMERA_REF
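The two call forms differ only in whether a Reads_of_Insert result is given as the positional input, with the FOFN moved behind -r. The sketch below assembles either argument list in Python. The helper `build_pipeline_command` is hypothetical and not part of rDnaTools, and it uses only the arguments shown in the two calls above.

```python
def build_pipeline_command(fofn, procs, alignment_ref, chimera_ref,
                           reads_of_insert=None):
    """Build the argument list for rDnaPipeline.py.

    Without reads_of_insert, this mirrors the SMRT Analysis v2.0 call;
    with it, the v2.1 call, where the FOFN is passed through -r.
    """
    shared = ["-n", str(procs), "-A", alignment_ref, "-C", chimera_ref]
    if reads_of_insert is None:
        return ["rDnaPipeline.py", fofn] + shared
    return ["rDnaPipeline.py", reads_of_insert, "-r", fofn] + shared
```

The returned list can be passed to `subprocess.call` once the script is on the PATH. Building a list rather than a single string avoids shell quoting problems when file paths contain spaces.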

For the reference files, we recommend using the curated SILVA alignments provided on the Mothur website (http://www.mothur.org/wiki/Silva_reference_files).

Citation

rDnaTools would not have been possible without the hard work of the microbial ecology community and their existing tools for analyzing ribosomal DNA sequence data. Since the core of the analyses wrapped by rDnaTools comes from the Mothur suite, please cite their publication if you use rDnaTools in your work:

Schloss, P.D., et al., Introducing mothur: Open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol, 2009. 75(23):7537-41.

Disclaimer

THIS WEBSITE AND CONTENT AND ALL SITE-RELATED SERVICES, INCLUDING ANY DATA, ARE PROVIDED "AS IS," WITH ALL FAULTS, WITH NO REPRESENTATIONS OR WARRANTIES OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, ANY WARRANTIES OF MERCHANTABILITY, SATISFACTORY QUALITY, NON-INFRINGEMENT OR FITNESS FOR A PARTICULAR PURPOSE. YOU ASSUME TOTAL RESPONSIBILITY AND RISK FOR YOUR USE OF THIS SITE, ALL SITE-RELATED SERVICES, AND ANY THIRD PARTY WEBSITES OR APPLICATIONS. NO ORAL OR WRITTEN INFORMATION OR ADVICE SHALL CREATE A WARRANTY OF ANY KIND. ANY REFERENCES TO SPECIFIC PRODUCTS OR SERVICES ON THE WEBSITES DO NOT CONSTITUTE OR IMPLY A RECOMMENDATION OR ENDORSEMENT BY PACIFIC BIOSCIENCES.
