bcbio-nextgen

A Python toolkit providing best-practice pipelines for fully automated high-throughput sequencing analysis. You write a high-level configuration file specifying your inputs and analysis parameters. This input drives a parallel pipeline that handles distributed execution, idempotent processing restarts and safe transactional steps. The goal is to provide a shared community resource that handles the front-end data processing component of sequencing analysis, allowing us to focus on the downstream biology.
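
As a rough illustration of what idempotent restarts and transactional steps can look like, the sketch below writes a step's output to a temporary file, renames it into place only when the step succeeds, and skips the step entirely when the final output already exists. The names `atomic_output` and `run_step` are hypothetical and exist only for this example; it shows the general pattern and is not a description of how bcbio-nextgen implements these guarantees internally.

```python
import contextlib
import os
import tempfile


@contextlib.contextmanager
def atomic_output(final_path):
    """Yield a temporary path; move it to final_path only if the block succeeds."""
    out_dir = os.path.dirname(os.path.abspath(final_path))
    fd, tmp_path = tempfile.mkstemp(dir=out_dir, suffix=".tmp")
    os.close(fd)
    try:
        yield tmp_path
        # Same-directory rename, so the final file appears all at once.
        os.replace(tmp_path, final_path)
    finally:
        # Clean up the temporary file if the step failed before the rename.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)


def run_step(final_path, produce):
    """Run produce(tmp_path) unless final_path already exists (idempotent)."""
    if os.path.exists(final_path):
        return False  # finished in an earlier run; nothing to redo
    with atomic_output(final_path) as tmp_path:
        produce(tmp_path)
    return True
```

With this pattern, a run that is interrupted partway through leaves no half-written final file behind, so rerunning the same command redoes only the steps whose outputs are missing.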

Quick start

  1. Install bcbio-nextgen with all tool dependencies and data files:

       wget https://raw.github.com/chapmanb/bcbio-nextgen/master/scripts/bcbio_nextgen_install.py
       python bcbio_nextgen_install.py install_directory data_directory
    

    producing a system configuration file referencing the installed software and data.

  2. Edit a sample configuration file to describe your samples.

  3. Run analysis, distributed across 8 local cores:

       bcbio_nextgen.py bcbio_system.yaml bcbio_sample.yaml -n 8
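
If you prefer to launch the run from a script, a minimal sketch is shown here. It only assembles the same command line used in step 3; the helper names `build_command` and `run_analysis` are assumptions made for this example and are not part of bcbio-nextgen.

```python
import subprocess


def build_command(system_config, sample_config, cores):
    """Return the argument list for the Quick start invocation."""
    if cores < 1:
        raise ValueError("cores must be at least 1")
    return ["bcbio_nextgen.py", system_config, sample_config, "-n", str(cores)]


def run_analysis(system_config, sample_config, cores=8):
    """Run the pipeline and raise CalledProcessError on a non-zero exit."""
    subprocess.run(build_command(system_config, sample_config, cores), check=True)
```

Calling `run_analysis("bcbio_system.yaml", "bcbio_sample.yaml")` is then equivalent to the command above, assuming `bcbio_nextgen.py` is on your `PATH`.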
    

Documentation

See the full documentation at ReadTheDocs.

Pipelines

Variant calling

The pipeline implements the GATK best-practice guidelines for variant calling.

Features

Distributed

The pipeline runs on single multicore machines, on compute clusters managed by LSF or SGE using IPython parallel, or on the Amazon cloud. This tutorial describes running the pipeline on Amazon with CloudBioLinux and CloudMan.

Galaxy integration

The scripts can be tightly integrated with the Galaxy web-based analysis tool. Samples are tracked via a web-based LIMS system, and processed results are uploaded into Galaxy Data Libraries for researcher access and additional analysis. See the installation instructions for the front end and a detailed description of the full system.

