Giter Club home page Giter Club logo

Comments (6)

vyfofanov avatar vyfofanov commented on August 10, 2024

It looks like we need to look into the functionality of the custom --lca and --genus flag. Try running one of Tara's mixture read sets (50/50 anthrax/cereus and 1/99 cereus/anthrax)

Tsosie, you're primary on this one.

from mtsv.

lovettse avatar lovettse commented on August 10, 2024

I think I'm still seeing the bug. The following should illustrate what I'm talking about:

# Run mtsv pipeline with "--tax_level genus"
[lovetts@gull081 BA]$ mtsv pipeline -c mtsv.cfg --threads 5 --cluster-config /nbacc/data/MTSv/cluster.cfg --cluster "qsub -q centos6 -N {cluster.job-name} -o /home/lovetts/test_mtsv/test_NAU_data/BA/Logs/{cluster.job-name}.log" --jobs 10 --tax_level genus

# Pipeline stdout/stderr removed for clarity

# Count reads in sample one that are assigned to anthracis/cereus/thuringiensis only
# In order to count, reads must be unique to S1, unique within S1, and assigned to those three species only
[lovetts@gull081 BA]$ grep :1392,1396,1428$ Binning/merged.clp | grep _1_0_0_0_0_0 | wc -l
26770


# Results for genus Bacillus from BA-BaA00455percentspikein-xx-uu-uuu-xxxx-082-VF_S1_L001_R2_001.fastq
[lovetts@gull081 BA]$ (head -n2 Summary/summary.csv; grep -w ^1386 Summary/summary.csv) | cut -f 1-7 -d "," | column -t -s ','
#      BA-BaA00455percentspikein-xx-uu-uuu-xxxx-082-VF_S1_L001_R2_001.fastq
TaxID  Division                                                              Sci. Name  Total Hits (S1)  Unique Hits (S1)  Signature Hits (S1)  Unique Signature Hits (S1)
1386   Bacteria                                                              Bacillus   47546            46899             143                  143

My interpretation of Signature Hits is that these 26770 reads (at least) should be signature hits for genus Bacillus, but I'm only seeing 143.

from mtsv.

tfursten avatar tfursten commented on August 10, 2024

@lovettse
Thanks for submitting the issue, we are working on a fix for this and we should have a solution by next week.

from mtsv.

tfursten avatar tfursten commented on August 10, 2024

@lovettse This issue should be resolved in the latest build linux-64/mtsv-1.0.0-py36_18.tar.bz2.

Here is an example of some results with a mixture of Bacillus anthracis and Bacillus cereus rolled up to the family level.

TaxID Division Sci. Name Total Hits (S1) Unique Hits (S1) Signature Hits (S1) Unique Signature Hits (S1)
186817 Bacteria Bacillaceae 1730545 1562779 1691893 1530382

from mtsv.

vyfofanov avatar vyfofanov commented on August 10, 2024

@tfursten Can we count this as resolved?

from mtsv.

tfursten avatar tfursten commented on August 10, 2024

@vyfofanov The future development that we have planned around this issue is covered in #11 so I think we can close this because the bug has been fixed.

from mtsv.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.