Giter Club home page Giter Club logo

fastqp's People

Contributors

danielecook avatar essut avatar mdshw5 avatar ucpete avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

fastqp's Issues

Exit early if the file is empty

0 reads in input file.
Bin size (-s) set to 1.
...
ZeroDivisionError    : if (act_nlines / est_nlines) * 100 >= percent_complete:
integer division or modulo by zero
ZeroDivisionError: integer division or modulo by zero

Calculate observed/expected ratio for kmers

Two cases:

  1. Unguided null distribution where expected frequency is computed using draws from a uniform distribution of ACTG
  2. Transcriptome-guided null distribution where a 1) GTF and 2) fasta file are provided and sequences are sampled from exons and UTRs.

cc: @jmerkin

Import of tidy data table into Spotfire

I'd like to test importing across multiple experiments into Spotfire for visualization. This should just involve concatenation of multiple results tables.

Plotting raises exceptions with matplotlib-2.2.2

Hi all,

With matplotlib 2.1.0 I get MatplotlibDeprecationWarning: The set_axis_bgcolor function was deprecated in version 2.0. Use set_facecolor instead.. With the latest matplotlib-2.2.2 it now raises an exception.

Is there a recommended matplotlib version?

ZeroDivisionError: division by zero

Hello @mdshw5 @ucpete @danielecook @tomsherborne @edawine , I am using python.2.7.12 and the following OS

CentOS release 6.5 (Final)
Cluster Manager v7.0
slave
LSB_VERSION=base-4.0-amd64:base-4.0-noarch:core-4.0-amd64:core-4.0-noarch:graphics-4.0-amd64:graphics-4.0-noarch:printing-4.0-amd64:printing-4.0-noarch
CentOS release 6.5 (Final)
CentOS release 6.5 (Final)

I am getting the error

657,845,172 reads in input file.
Bin size (-s) set to 328.
Traceback (most recent call last):
  File "/cm/shared/apps/python/2.7.12/bin/fastqp", line 10, in <module>
    sys.exit(main())
  File "/cm/shared/apps/python/2.7.12/lib/python2.7/site-packages/fastqp/cli.py", line 412, in main
    run(arguments)
  File "/cm/shared/apps/python/2.7.12/lib/python2.7/site-packages/fastqp/cli.py", line 352, in run
    mismatchplot(positions, cycle_mismatch, zip_archive, fig_kw)
  File "/cm/shared/apps/python/2.7.12/lib/python2.7/site-packages/fastqp/plots.py", line 659, in mismatchplot
    for pos in positions
ZeroDivisionError: division by zero 

I checked the issues board, specifically the closed #26

(#26)

which had a "hack-ish" solution #28 . In the source code (current version) the plots.py does not contain the lines mentioned

(3648187)

. Please advise!

Fix axis for qualplot in newer versions of matplotlib

Traceback (most recent call last):
  File "/usr/prog/scicomp/pythonds/v1.0/pyds_v1.0/lib/python3.8/multiprocessing/process.py", line 315, in _bootstrap
    self.run()
  File "/usr/prog/scicomp/pythonds/v1.0/pyds_v1.0/lib/python3.8/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "/da/onc/BFx/pisces/0.1.5.3/src/novartis-pisces/pisces/__init__.py", line 135, in read_level_qc
    run(arguments)
  File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/fastqp/cli.py", line 341, in run
    qualplot(positions, quantiles, zip_archive, fig_kw)
  File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/fastqp/plots.py", line 311, in qualplot
    ax.yaxis.grid(b=True, which='major', **{'color': 'gray', 'linestyle': ':'})
  File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/matplotlib/axis.py", line 1698, in grid
    self.set_tick_params(which='major', **gridkw)
  File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/matplotlib/axis.py", line 944, in set_tick_params
    kwtrans = self._translate_tick_params(kwargs)
  File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/matplotlib/axis.py", line 1088, in _translate_tick_params
    raise ValueError(
ValueError: keyword grid_b is not recognized; valid keywords are ['size', 'width', 'color', 'tickdir', 'pad', 'labelsize', 'labelcolor', 'zorder', 'gridOn', 'tick1On', 'tick2On', 'label1On', 'label2On', 'length', 'direction', 'left', 'bottom', 'right', 'top', 'labelleft', 'labelbottom', 'labelright', 'labeltop', 'labelrotation', 'grid_agg_filter', 'grid_alpha', 'grid_animated', 'grid_antialiased', 'grid_clip_box', 'grid_clip_on', 'grid_clip_path', 'grid_color', 'grid_dash_capstyle', 'grid_dash_joinstyle', 'grid_dashes', 'grid_data', 'grid_drawstyle', 'grid_figure', 'grid_fillstyle', 'grid_gapcolor', 'grid_gid', 'grid_in_layout', 'grid_label', 'grid_linestyle', 'grid_linewidth', 'grid_marker', 'grid_markeredgecolor', 'grid_markeredgewidth', 'grid_markerfacecolor', 'grid_markerfacecoloralt', 'grid_markersize', 'grid_markevery', 'grid_mouseover', 'grid_path_effects', 'grid_picker', 'grid_pickradius', 'grid_rasterized', 'grid_sketch_params', 'grid_snap', 'grid_solid_capstyle', 'grid_solid_joinstyle', 'grid_transform', 'grid_url', 'grid_visible', 'grid_xdata', 'grid_ydata', 'grid_zorder', 'grid_aa', 'grid_c', 'grid_ds', 'grid_ls', 'grid_lw', 'grid_mec', 'grid_mew', 'grid_mfc', 'grid_mfcalt', 'grid_ms']

An ZeroDivisionError when I run fastqp.py

Dear, I just install fastqp and an error occurred when i run the script, this is the error.

fastqp Sanger_F_sample.fastq
At 2186 bytes per read of 1086 length we estimate 96 reads in input file.
Bin size (-s) set to 1.
Traceback (most recent call last):
File "/usr/local/bin/fastqp", line 11, in
sys.exit(main())
File "/usr/local/lib/python2.7/site-packages/fastqp/cli.py", line 370, in main
run(arguments)
File "/usr/local/lib/python2.7/site-packages/fastqp/cli.py", line 270, in run
cycle_nuc[i]['T']]) * 100 for i in positions]
ZeroDivisionError: integer division or modulo by zero

Any idea why this error occurred?
I use OSx, python 2.7.13 and a sample of the fastq file is on the attachment.

Regards
Sanger_F.fastq.zip

Bug in count-duplicates feature

Running fastqp input.bam -o output -d raises UnboundLocalError: local variable 'ScalableBloomFilter' referenced before assignment

The bug can be traced to cli.py lines 132-133 and 138-139. The order of two blocks need to be reversed (or better, the two blocks need to be merged since they are testing for the same thing), so that ScalableBloomFilter is imported before it's used.

fastqp stuck with very long consensus fastq file

I'm trying to generate a plot of fastq quality along a very long (800kb) assembly sequence. When I run fastqp I get the following and seemingly no non-stdout output:

At 1775681 bytes per read of 887828 length we estimate 1 reads in input file.
Bin size (-s) set to 1.

It might be the case that this is simply not the appropriate tool. The goal is simply a summary of support of given nucleotide calls along the de novo reference. All advice is much appreciated!

output text files

I see that the info is output in an unstructured way on the stout, but for those of us that would like to do some other types of viz, it would be nice to have the raw data.

Thanks, Matt

not detecting adapter seq

Hi!

We are doing riboseq analysis on some fastq files. We are using Fastp for adapter identification and removal. But, we are facing some problems. fastp is able to remove the adapter sequences only if the adapter seq is given in the command line as an argument. It is not able to detect the adapter and remove it by itself and is giving the output as adapter is not detected. The sra file used, adapter seq and codes are as follows:

SRA file : SRR810103 (fastq size 31.6GB)

adapter seq: TCGTATGCCGTCTTCTGCTTG

./fastp -i ./SRR810103.fastq -a TCGTATGCCGTCTTCTGCTTG -q 33 -w 10 -o./output.fastq

./fastp -i ./SRR810103.fastq -w 10 -o ./output.fastq

Can you please help us?

Eagerly waiting for your help!

Make plots using plot.ly

It seems like most plots could be generated using plotly which will let me focus on performance and correctness.

Request module support for ewels/MultiQC

https://github.com/ewels/MultiQC

From @ewels:

If you're requesting a new module

Checklist of requirements:

  • Description of tool, link to homepage
  • Log files - please submit as a PR to https://github.com/ewels/MultiQC_TestData
  • Any standard log file naming patterns to get sample name (eg. sample_name.processed.log)
  • Say which value is most important, to go into the General Stats table
  • Specify what should be plotted, if anything

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.