mdshw5 / fastqp Goto Github PK
View Code? Open in Web Editor NEWSimple FASTQ quality assessment using Python
Home Page: https://pypi.python.org/pypi/fastqp
License: MIT License
Simple FASTQ quality assessment using Python
Home Page: https://pypi.python.org/pypi/fastqp
License: MIT License
Maybe look at either regression coefficient or residuals to flag abnormal positional kmers.
0 reads in input file.
Bin size (-s) set to 1.
...
ZeroDivisionError : if (act_nlines / est_nlines) * 100 >= percent_complete:
integer division or modulo by zero
ZeroDivisionError: integer division or modulo by zero
Two cases:
cc: @jmerkin
A fast strategy would be to break adapter sequences into kmers, and then classify previously counted kmers into "possible adapter" kmers.
I'd like to test importing across multiple experiments into Spotfire for visualization. This should just involve concatenation of multiple results tables.
Hi all,
With matplotlib 2.1.0 I get MatplotlibDeprecationWarning: The set_axis_bgcolor function was deprecated in version 2.0. Use set_facecolor instead.
. With the latest matplotlib-2.2.2
it now raises an exception.
Is there a recommended matplotlib version?
Hello @mdshw5 @ucpete @danielecook @tomsherborne @edawine , I am using python.2.7.12 and the following OS
CentOS release 6.5 (Final)
Cluster Manager v7.0
slave
LSB_VERSION=base-4.0-amd64:base-4.0-noarch:core-4.0-amd64:core-4.0-noarch:graphics-4.0-amd64:graphics-4.0-noarch:printing-4.0-amd64:printing-4.0-noarch
CentOS release 6.5 (Final)
CentOS release 6.5 (Final)
I am getting the error
657,845,172 reads in input file.
Bin size (-s) set to 328.
Traceback (most recent call last):
File "/cm/shared/apps/python/2.7.12/bin/fastqp", line 10, in <module>
sys.exit(main())
File "/cm/shared/apps/python/2.7.12/lib/python2.7/site-packages/fastqp/cli.py", line 412, in main
run(arguments)
File "/cm/shared/apps/python/2.7.12/lib/python2.7/site-packages/fastqp/cli.py", line 352, in run
mismatchplot(positions, cycle_mismatch, zip_archive, fig_kw)
File "/cm/shared/apps/python/2.7.12/lib/python2.7/site-packages/fastqp/plots.py", line 659, in mismatchplot
for pos in positions
ZeroDivisionError: division by zero
I checked the issues board, specifically the closed #26
(#26)
which had a "hack-ish" solution #28 . In the source code (current version) the plots.py does not contain the lines mentioned
(3648187)
. Please advise!
Some recent BS-seq programs store methylation information in a SAM tag. One example is Bismark. A flag for --methylation
would activate routines for summarizing:
Multiple input files should be handled as a batch where one file corresponds to one set of figures and data tables.
Traceback (most recent call last):
File "/usr/prog/scicomp/pythonds/v1.0/pyds_v1.0/lib/python3.8/multiprocessing/process.py", line 315, in _bootstrap
self.run()
File "/usr/prog/scicomp/pythonds/v1.0/pyds_v1.0/lib/python3.8/multiprocessing/process.py", line 108, in run
self._target(*self._args, **self._kwargs)
File "/da/onc/BFx/pisces/0.1.5.3/src/novartis-pisces/pisces/__init__.py", line 135, in read_level_qc
run(arguments)
File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/fastqp/cli.py", line 341, in run
qualplot(positions, quantiles, zip_archive, fig_kw)
File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/fastqp/plots.py", line 311, in qualplot
ax.yaxis.grid(b=True, which='major', **{'color': 'gray', 'linestyle': ':'})
File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/matplotlib/axis.py", line 1698, in grid
self.set_tick_params(which='major', **gridkw)
File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/matplotlib/axis.py", line 944, in set_tick_params
kwtrans = self._translate_tick_params(kwargs)
File "/da/onc/BFx/pisces/0.1.5.3/lib/python3.8/site-packages/matplotlib/axis.py", line 1088, in _translate_tick_params
raise ValueError(
ValueError: keyword grid_b is not recognized; valid keywords are ['size', 'width', 'color', 'tickdir', 'pad', 'labelsize', 'labelcolor', 'zorder', 'gridOn', 'tick1On', 'tick2On', 'label1On', 'label2On', 'length', 'direction', 'left', 'bottom', 'right', 'top', 'labelleft', 'labelbottom', 'labelright', 'labeltop', 'labelrotation', 'grid_agg_filter', 'grid_alpha', 'grid_animated', 'grid_antialiased', 'grid_clip_box', 'grid_clip_on', 'grid_clip_path', 'grid_color', 'grid_dash_capstyle', 'grid_dash_joinstyle', 'grid_dashes', 'grid_data', 'grid_drawstyle', 'grid_figure', 'grid_fillstyle', 'grid_gapcolor', 'grid_gid', 'grid_in_layout', 'grid_label', 'grid_linestyle', 'grid_linewidth', 'grid_marker', 'grid_markeredgecolor', 'grid_markeredgewidth', 'grid_markerfacecolor', 'grid_markerfacecoloralt', 'grid_markersize', 'grid_markevery', 'grid_mouseover', 'grid_path_effects', 'grid_picker', 'grid_pickradius', 'grid_rasterized', 'grid_sketch_params', 'grid_snap', 'grid_solid_capstyle', 'grid_solid_joinstyle', 'grid_transform', 'grid_url', 'grid_visible', 'grid_xdata', 'grid_ydata', 'grid_zorder', 'grid_aa', 'grid_c', 'grid_ds', 'grid_ls', 'grid_lw', 'grid_mec', 'grid_mew', 'grid_mfc', 'grid_mfcalt', 'grid_ms']
Dear, I just install fastqp and an error occurred when i run the script, this is the error.
fastqp Sanger_F_sample.fastq
At 2186 bytes per read of 1086 length we estimate 96 reads in input file.
Bin size (-s) set to 1.
Traceback (most recent call last):
File "/usr/local/bin/fastqp", line 11, in
sys.exit(main())
File "/usr/local/lib/python2.7/site-packages/fastqp/cli.py", line 370, in main
run(arguments)
File "/usr/local/lib/python2.7/site-packages/fastqp/cli.py", line 270, in run
cycle_nuc[i]['T']]) * 100 for i in positions]
ZeroDivisionError: integer division or modulo by zero
Any idea why this error occurred?
I use OSx, python 2.7.13 and a sample of the fastq file is on the attachment.
Regards
Sanger_F.fastq.zip
This is something FastQC does automatically.
Running fastqp input.bam -o output -d raises UnboundLocalError: local variable 'ScalableBloomFilter' referenced before assignment
The bug can be traced to cli.py lines 132-133 and 138-139. The order of two blocks need to be reversed (or better, the two blocks need to be merged since they are testing for the same thing), so that ScalableBloomFilter is imported before it's used.
Cycler replaces the older axes.color_cycle
in matplotlib >= 1.5
I'm trying to generate a plot of fastq quality along a very long (800kb) assembly sequence. When I run fastqp I get the following and seemingly no non-stdout output:
At 1775681 bytes per read of 887828 length we estimate 1 reads in input file.
Bin size (-s) set to 1.
It might be the case that this is simply not the appropriate tool. The goal is simply a summary of support of given nucleotide calls along the de novo reference. All advice is much appreciated!
I see that the info is output in an unstructured way on the stout, but for those of us that would like to do some other types of viz, it would be nice to have the raw data.
Thanks, Matt
Hi!
We are doing riboseq analysis on some fastq files. We are using Fastp for adapter identification and removal. But, we are facing some problems. fastp is able to remove the adapter sequences only if the adapter seq is given in the command line as an argument. It is not able to detect the adapter and remove it by itself and is giving the output as adapter is not detected. The sra file used, adapter seq and codes are as follows:
SRA file : SRR810103 (fastq size 31.6GB)
adapter seq: TCGTATGCCGTCTTCTGCTTG
./fastp -i ./SRR810103.fastq -a TCGTATGCCGTCTTCTGCTTG -q 33 -w 10 -o./output.fastq
./fastp -i ./SRR810103.fastq -w 10 -o ./output.fastq
Can you please help us?
Eagerly waiting for your help!
It seems like most plots could be generated using plotly which will let me focus on performance and correctness.
See #20
https://github.com/ewels/MultiQC
From @ewels:
Checklist of requirements:
sample_name.processed.log
)A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.