Giter Club home page Giter Club logo

Comments (6)

hyphaltip avatar hyphaltip commented on July 27, 2024 1

yes

  • filter is for reads uses BBMap by default against a contamination and primer/adaptor database. It can also be given a set of accessions to screen against if you have known contamination you want to remove at the read level.
  • vecscreen is for assembled contigs - this is probably going to be replaced with the ncbi fcs-adaptor step - it recapitulated a vector screening tool that was at ncbi online https://www.ncbi.nlm.nih.gov/tools/vecscreen/ - this step will remove parts or whole contigs that match a vector and adaptor database - if a vector match is found within a contig the contig is split into pieces since one cannot be sure the linkage of the contig is sound now.
  • sourpurge is looking for cross-species contamination and removes the entire contig if it is classified as contamination. this is may be subsituted with ncbi fcs-gx

from aaftf.

hyphaltip avatar hyphaltip commented on July 27, 2024

Vecscreen removes primer and obvious contam (phiX) and trims
Or splits contigs using univec db.

Sourpurge screens scaffolds for contamination by looking for matches to groups outside the phylum. It removes scaffolds that are likely matching to a contamination source.

I have also just written in support for NCBI fcs-Gx tool which does similar thing to sourmash/sourpurge step.

from aaftf.

maruiqi0710 avatar maruiqi0710 commented on July 27, 2024

In addition to vecscreen and sourpurge, the filter function is also involved in removing gene fragments of non-target species. The filter step removes reads before the assemble step, and also removes reads of mitochondrial genes from the mito step. Therefore, in AAFTF, there are three steps involved in removing gene fragments of non-target species.

Did I summarize it correctly?

from aaftf.

hyphaltip avatar hyphaltip commented on July 27, 2024

someday soon I'll finish writing up a manuscript on the tool to give this more description.

the ncbi fcs-adapator and fcs-gx are being tested but gx is a little unwieldy as it needs a lot of memory and large DB while sourpurge is pretty small footprint (though the version of the DB I had been using before was removed from OSF.io so the default install may not work right now till we see if the replacement larger DB can still work efficiently).

from aaftf.

hyphaltip avatar hyphaltip commented on July 27, 2024

fcs-gx support is now in AAFTF - available in live code and will be part of the v0.5.0 release

from aaftf.

hyphaltip avatar hyphaltip commented on July 27, 2024

0.5.0 has these features. closing this query since answered above.

from aaftf.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.