Comments (6)
It looks like we need to look into the functionality of the custom --lca and --genus flags. Try running one of Tara's mixture read sets (50/50 anthrax/cereus and 1/99 cereus/anthrax).
Tsosie, you're primary on this one.
from mtsv.
I think I'm still seeing the bug. The following should illustrate what I'm talking about:
# Run mtsv pipeline with "--tax_level genus"
[lovetts@gull081 BA]$ mtsv pipeline -c mtsv.cfg --threads 5 --cluster-config /nbacc/data/MTSv/cluster.cfg --cluster "qsub -q centos6 -N {cluster.job-name} -o /home/lovetts/test_mtsv/test_NAU_data/BA/Logs/{cluster.job-name}.log" --jobs 10 --tax_level genus
# Pipeline stdout/stderr removed for clarity
# Count reads in sample one that are assigned to anthracis/cereus/thuringiensis only
# In order to count, reads must be unique to S1, unique within S1, and assigned to those three species only
[lovetts@gull081 BA]$ grep :1392,1396,1428$ Binning/merged.clp | grep _1_0_0_0_0_0 | wc -l
26770
# Results for genus Bacillus from BA-BaA00455percentspikein-xx-uu-uuu-xxxx-082-VF_S1_L001_R2_001.fastq
[lovetts@gull081 BA]$ (head -n2 Summary/summary.csv; grep -w ^1386 Summary/summary.csv) | cut -f 1-7 -d "," | column -t -s ','
# BA-BaA00455percentspikein-xx-uu-uuu-xxxx-082-VF_S1_L001_R2_001.fastq
TaxID  Division  Sci. Name  Total Hits (S1)  Unique Hits (S1)  Signature Hits (S1)  Unique Signature Hits (S1)
1386   Bacteria  Bacillus   47546            46899             143                  143
My interpretation of Signature Hits is that these 26770 reads (at least) should be signature hits for genus Bacillus, but I'm only seeing 143.
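To make that expectation concrete, here is a minimal Python sketch of the genus-level rollup I have in mind. It is not mtsv's implementation. The `read_id:taxid,taxid,...` line layout is inferred from the grep pattern above, and `SPECIES_TO_GENUS` and `genus_signature_counts` are hypothetical names used only for illustration; a real run would derive the lookup from the NCBI taxonomy. Each line counts as one read, and any per-sample counts encoded in the read ID are ignored.

```python
from collections import Counter

# Hypothetical species -> genus lookup (an assumption for illustration only).
SPECIES_TO_GENUS = {
    1392: 1386,  # Bacillus anthracis -> Bacillus
    1396: 1386,  # Bacillus cereus -> Bacillus
    1428: 1386,  # Bacillus thuringiensis -> Bacillus
    1280: 1279,  # Staphylococcus aureus -> Staphylococcus
}


def genus_signature_counts(lines, species_to_genus):
    """Count reads whose hits all roll up to a single genus."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        # Assumed layout: "<read id>:<taxid>,<taxid>,..."
        _read_id, _, taxid_field = line.rpartition(":")
        genera = {species_to_genus.get(int(t)) for t in taxid_field.split(",")}
        # A read is a genus-level signature hit only if every taxid it hits
        # maps to the same known genus.
        if len(genera) == 1 and None not in genera:
            counts[genera.pop()] += 1
    return counts


# Made-up example lines, not taken from merged.clp.
example = [
    "R1_1_0_0_0_0_0:1392,1396,1428",  # three Bacillus species -> signature for 1386
    "R2_1_0_0_0_0_0:1392",            # one Bacillus species -> signature for 1386
    "R3_1_0_0_0_0_0:1396,1280",       # spans two genera -> not a signature hit
]
counts = genus_signature_counts(example, SPECIES_TO_GENUS)
print(counts[1386])
```

Under this reading, every read matched by the grep above would add one to the Signature Hits total for 1386, which is why I expected at least 26770.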
@lovettse
Thanks for submitting the issue. We are working on a fix and should have a solution by next week.
@lovettse This issue should be resolved in the latest build linux-64/mtsv-1.0.0-py36_18.tar.bz2.
Here is an example of some results with a mixture of Bacillus anthracis and Bacillus cereus rolled up to the family level.
TaxID | Division | Sci. Name | Total Hits (S1) | Unique Hits (S1) | Signature Hits (S1) | Unique Signature Hits (S1) |
---|---|---|---|---|---|---|
186817 | Bacteria | Bacillaceae | 1730545 | 1562779 | 1691893 | 1530382 |
@tfursten Can we count this as resolved?
@vyfofanov The future development we have planned around this issue is covered in #11, so I think we can close this now that the bug has been fixed.