tron-bioinformatics / tronflow-bam-preprocessing

Nextflow pipeline for the preprocessing of BAM files based on GATK best practices. Marking duplicates, realignment around indels, base quality score recalibration (BQSR) and reporting of metrics are optional to maintain flexibility for different use cases.

License: MIT License

Languages: Makefile 0.96%, Nextflow 67.35%, Shell 31.68%
Topics: bioinformatics, gatk, nextflow, picard, tronflow

tronflow-bam-preprocessing's People

Contributors

ibn-salem, lbresadola, lkress, priesgo

tronflow-bam-preprocessing's Issues

Calculate deduplication rate

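A possible approach is to add a process like the following, which runs samtools flagstat on the BAM file and derives the duplication rate from its output: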


process DUPLICATION_RATE {
    cpus params.cpus
    memory params.memory
    tag "${name}"
    //publishDir "${params.output}/${name}/metrics/coverage", mode: "copy"
    //publishDir "${params.output}/${name}/", mode: "copy", pattern: "software_versions.*"

    conda (params.enable_conda ? "bioconda::samtools=1.15.1" : null)

    input:
        tuple val(name), file(bam), file(bai)

    output:
        file("${name}.duplication_rate.tsv")
        
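    script:
    // parse_duplication_rate.py is a helper script that still has to be written;
    // it is assumed here to print the duplication rate as TSV to stdout
    """
    samtools flagstat -@ ${task.cpus} ${bam} > ${name}.flagstat.txt
    parse_duplication_rate.py ${name}.flagstat.txt > ${name}.duplication_rate.tsv
    """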
}
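
The helper script called in the process above does not exist yet. A minimal sketch of its parsing logic might look like the following, assuming the rate is defined as QC-passed duplicates divided by QC-passed total reads (the issue does not fix a definition). The function names `parse_flagstat` and `duplication_rate` are hypothetical. Note that samtools flagstat only counts reads that already carry the duplicate flag, so this is only meaningful on a BAM where duplicates were marked rather than removed.

```python
import re

# Each flagstat line has the form "<QC-passed> + <QC-failed> <label>".
LINE_RE = re.compile(r"^(\d+) \+ (\d+) (.+)$")


def parse_flagstat(text):
    """Return {label: QC-passed count} for each line of flagstat output."""
    counts = {}
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            passed, _failed, label = match.groups()
            # Drop trailing annotations such as "(QC-passed reads + ...)".
            key = label.split(" (")[0]
            counts[key] = int(passed)
    return counts


def duplication_rate(text):
    """Assumed definition: QC-passed duplicates / QC-passed total reads."""
    counts = parse_flagstat(text)
    total = counts.get("in total", 0)
    duplicates = counts.get("duplicates", 0)
    return duplicates / total if total else 0.0
```

A script built on this would read the flagstat file given as its first argument and print the value returned by `duplication_rate` as a one-column TSV.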

Add flag to select tool used for marking/removing duplicates

I suggest adding a new flag that selects the tool used for marking or removing duplicate reads. The choices would be

  1. picard (in particular MarkDuplicates) or
  2. sambamba (in particular markdup).

With this feature, the pipeline remains backwards compatible for users who want to keep running Picard's MarkDuplicates for any reason, while still letting them pick up updates and features introduced in versions later than v1.9.1.
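
The selection logic behind such a flag could be sketched as below. This is only an illustration under assumptions: the function name `build_dedup_command`, its parameters, and the metrics file name are hypothetical, the Picard call assumes a `picard` wrapper on the PATH (as provided by conda installations) with the legacy `KEY=value` argument style, and in the pipeline itself the switch would live in the Nextflow process rather than in Python.

```python
def build_dedup_command(tool, bam, out_bam, threads=1, remove_duplicates=False):
    """Build the command line for the selected duplicate-marking tool."""
    if tool == "picard":
        cmd = [
            "picard", "MarkDuplicates",
            f"I={bam}",
            f"O={out_bam}",
            f"M={out_bam}.metrics.txt",  # hypothetical metrics file name
        ]
        if remove_duplicates:
            cmd.append("REMOVE_DUPLICATES=true")
        return cmd
    if tool == "sambamba":
        cmd = ["sambamba", "markdup", "-t", str(threads)]
        if remove_duplicates:
            cmd.append("-r")  # remove duplicates instead of only marking them
        return cmd + [bam, out_bam]
    raise ValueError(f"unknown deduplication tool: {tool!r}")
```

Defaulting the flag to `picard` would preserve the current behavior for existing users.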
