Giter Club home page Giter Club logo

test-data's Introduction

MultiQC

Aggregate bioinformatics results across many samples into a single report

PyPI Version Bioconda Version DOI


MultiQC is a tool to create a single report with interactive plots for multiple bioinformatics analyses across many samples.

Reports are generated by scanning given directories for recognised log files. These are parsed and a single HTML report is generated summarising the statistics for all logs found. MultiQC reports can describe multiple analysis steps and large numbers of samples within a single plot, and multiple analysis tools making it ideal for routine fast quality control.

A very large number of Bioinformatics tools are supported by MultiQC. Please see the MultiQC website for a complete list. MultiQC can also easily parse data from custom scripts, if correctly formatted / configured - a feature called Custom Content.

More modules are being written all the time. Please suggest any ideas as a new issue (please include example log files).

Installation

You can install MultiQC from PyPI using pip as follows:

pip install multiqc

Alternatively, you can install using Conda from Bioconda (set up your channels first):

conda install multiqc

If you would like the development version from GitHub instead, you can install it with pip:

pip install --upgrade --force-reinstall git+https://github.com/MultiQC/MultiQC.git

MultiQC is also available via Docker and Singularity images, Galaxy wrappers, and many more software distribution systems. See the documentation for details.

Usage

Once installed, you can use MultiQC by navigating to your analysis directory (or a parent directory) and running the tool:

multiqc .

That's it! MultiQC will scan the specified directory (. is the current dir) and produce a report detailing whatever it finds.

cd test-data/data/modules/fastqc/v0.10.1 && multiqc .

The report is created in multiqc_report.html by default. Tab-delimited data files are also created in multiqc_data/, containing extra information. These can be easily inspected using Excel (use --data-format to get yaml or json instead).

For more detailed instructions, run multiqc -h or see the documentation.

Citation

Please consider citing MultiQC if you use it in your analysis.

MultiQC: Summarize analysis results for multiple tools and samples in a single report.
Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
Bioinformatics (2016)
doi: 10.1093/bioinformatics/btw354
PMID: 27312411

@article{doi:10.1093/bioinformatics/btw354,
 author = {Ewels, Philip and Magnusson, Måns and Lundin, Sverker and Käller, Max},
 title = {MultiQC: summarize analysis results for multiple tools and samples in a single report},
 journal = {Bioinformatics},
 volume = {32},
 number = {19},
 pages = {3047},
 year = {2016},
 doi = {10.1093/bioinformatics/btw354},
 URL = { + http://dx.doi.org/10.1093/bioinformatics/btw354},
 eprint = {/oup/backfile/Content_public/Journal/bioinformatics/32/19/10.1093_bioinformatics_btw354/3/btw354.pdf}
}

Contributions & Support

Contributions and suggestions for new features are welcome, as are bug reports! Please create a new issue for any of these, including example reports where possible. Pull-requests for fixes and additions are very welcome. Please see the contributing notes for more information about how the process works.

MultiQC has extensive documentation describing how to write new modules, plugins and templates.

If in doubt, feel free to get in touch with the author directly: @ewels ([email protected])

Contributors

MultiQC is developed and maintained by Phil Ewels (@ewels) at Seqera Labs. It was originally written at the National Genomics Infrastructure, part of SciLifeLab in Sweden.

A huge thank you to all code contributors - there are a lot of you! See the Contributors Graph for details.

MultiQC is released under the GPL v3 or later licence.

test-data's People

Contributors

a-frantz avatar apeltzer avatar avilella avatar barrymoore avatar cerebis avatar chuan-wang avatar cpavanrun avatar dpryan79 avatar erikdanielsson avatar erinyoung avatar ewels avatar gartician avatar hammarn avatar iimog avatar jfy133 avatar joachimwolff avatar lpantano avatar matthdsm avatar millironx avatar nh13 avatar pontushojer avatar redmar-van-den-berg avatar remiolsen avatar ssjunnebo avatar tanaes avatar tbooth avatar tclamnidis avatar vladsavelyev avatar zethson avatar zwdzwd avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

test-data's Issues

Help wanted: interop data

Hi Phil,

I don't know if you've had the time to take a look at the testdata I've submitted, but I'm having some issues parsing it to a suitable json structure.

Can you offer any advice about how to best structure the data dict, so we can get some nice tables?

Thanks
M

Expose additional statistics in Picard Alignment Summary

The issue

The Picard Alignment Summary currently only shows the number of aligned reads in the barchart. This makes sense for Illumina data, where all reads are the same size. However, for long read sequencing, the number of aligned reads is not so relevant, and the number of aligned bases or the mean read size is more relevant.

The proposal

Add additional datasets to the picard Alignment Summary section. This way, the default behaviour will be the same (show the number of aligned reads), but users can switch over to number of aligned bases or mean read length, if this is more relevant for them.

Add testing instructions to the readme

Hi!

I understand this repo contains test data for multiqc. Is there a recommended way of running these tests locally?

Where I'm at:

  1. I've made some changes to multiqc
  2. I want to test them

In multiqc dev docs, I see that github actions will likely run the testdata against my changes to multiqc: https://github.com/ewels/MultiQC/blob/master/.github/CONTRIBUTING.md#review-workflow

But I also want to modify testdata too. I want to add a test and run it against my branch of multiqc. Feels like adding anything in the readme could be helpful!

If I figure something out I'll PR it.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.