Giter Club home page Giter Club logo

Comments (4)

marcelm avatar marcelm commented on July 24, 2024

From [email protected] on February 15, 2011 12:31:11

Finding an adapter of length 33 in a read of length 30 is actually possible due to deletions in the read. For example, if your adapter is ACGTACGT (8 nt) and the read is AGTACGT (7 nt), then an alignment could be this:
adapter: ACGTACGT
read: A-GTACGT
In this example, cutadapt would report that it has trimmed an adapter of length 8 although the read has only length 7.

The default error rate of cutadapt allows 10% errors. In your case (reads of length 30), that amounts to at most 3 errors, which is why you see at most a removed adapter length of 33 (read length 30 plus 3 deletions).

I guess it makes sense to change the histogram such that more intuitive numbers are reported. I'll keep your report open until I have done so. Thanks for your report!

from cutadapt.

marcelm avatar marcelm commented on July 24, 2024

From [email protected] on May 19, 2011 14:24:45

Hi there,

When I see:

Histogram of adapter lengths
length count
3 113604

What the length 3 represents for me (consider the same parameters and data as was explained above)?.

Thanks.

from cutadapt.

marcelm avatar marcelm commented on July 24, 2024

From [email protected] on May 20, 2011 08:05:10

This means that 113604 of the input reads end with 'CCC' (the beginning of the given adapter), and that in all those cases, the 'CCC' was trimmed from the read.

from cutadapt.

marcelm avatar marcelm commented on July 24, 2024

From [email protected] on July 09, 2012 15:20:05

This has been changed in cutadapt 1.1: The histogram now reports the lengths of trimmed sequences.

Status: Fixed

from cutadapt.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.