seqpostproc's People
seqpostproc's Issues
Have a script to get stats from reseq after batch reseq
Summarize # of mutations, coverage of samples, etc, in a CSV file.
Upfront, the # of mutations for each same will be valuable to know which sample reseq's are potentially problematic.
Use Docker to download all necessary programs
- breseq
- fastx
- trim_galore
- cut adapt
- fastqc
automatically run gdtools ANNOTATE for each breseq sample
We want to grab the HTML annotation for the mutation annotation and sequence change info from the GD file output by gdtools ANNOTATE rather than by parsing the HTML breseq report.
Trying to avoid scraping/parsing HTML when possible.
Have CSV as input file for samples
Constantly getting issues with read file names causing auto-grouping of file names to malfunction. Should instead simply have CSV input file that describes all of the files to Breseq and the name of the output.
Potentially use Trimmomatic instead of trim_galore
According to https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103590/, Trimmomatic give best alignment after adapter and quality trimming. Trimmomatic differs from trim_galore(Cutadapt) in that it uses a sliding window quality trimmer, which generally give better results than other quality trimming approaches.
Questions that need to be answered before substituting trimming software, where trim_galore currently accomplishes these tasks:
- Does Trimmomatic automatically detect adapter sequences?
Automate steps
- adapter trimming
- quality trimming
- resequencing
Should use rm -rf **/0*_* any longer
It is deleting the distribution data for reads. Rather, write this steps to know the specific folders that need to be deleted from each report.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.