Comments (2)
Thanks for the suggestion. That's a reasonable extension. One limitation we have at the moment is that we don't have a "private deletions" feature yet.
Labeled mutations are a subset of private mutations, so to keep with the logic we'd first need to support private deletions. Not at all unreasonable and actually something I've been thinking about as well.
One challenge with deletions is that if you keep them as single bases they can get overwhelmingly many. If you make them ranges, single indel difference artefacts can make a large difference.
I'm leaning towards using ranges nonetheless.
To be symmetric, it would make sense to add private insertions.
from nextclade.
Having read your ubio post @ammaraziz, I think you are suggesting two things:
-
Extend concept of labeled private mutations (substitutions) to labeled private indels. For this we first need to introduce the concept/feature of private indels. Then split the private indels into "reversions, labeled, neither of the two" just as we currently do with substitutions.
-
Extend the concept of labeled mutations beyond private mutations (private mutations are those mutations that differ wrt nearest neighbour sequence on reference tree). This makes sense for things where you are looking at broader trends and are not so concerned with the individual sequence and whether it is good quality or not (private mutations originated as QC metric, and the labeled/reversion feature is an extension, still coming from the QC perspective).
We already calculate some custom global (non-private only) metrics like:
- Immune escape and ACE2 binding affinity scores in the SARS-CoV-2 BA.2 dataset
- Glycosylation sites in flu (see explanation here, aa motifs: https://github.com/nextstrain/nextclade/releases/tag/2.10.0)
Reporting presence of certain mutations (irrespective of whether they are private or not) for antiviral resistance, RAT escape etc would make sense.
from nextclade.
Related Issues (20)
- How to decide if the reversionSubstitutions are valid variants or not and whether to keep them? HOT 2
- Direct Auspice SVG Download
- ENH: enlarge Visualization when more nucleotides/Codons even if mutations not fall in striclty adjacent codons but close enough to need a whole view. HOT 2
- Updating certain parts of the JSON file output from Map to Array HOT 2
- linux-musl artefact gets slower rather than faster when parallelizing in contrast to gnu HOT 5
- Parsing PCR primers HOT 1
- how many SARS-COV-2 sequences can nextclade handle in a MSA file? HOT 8
- Is there any example for handling HIV data? HOT 2
- Web crash: The target <seqName> could not be identified in the dom HOT 2
- Show "browser not supported HOT 1
- Show "browser not supported" modal only once per session HOT 3
- if the qc.overallStatus of my sequences are mediocre, can we keep them for next step analysis? HOT 5
- web-based nextclade issue when using another reference HOT 8
- web(minor): when customizing dataset files, it always says "pasted sequences" even if the field is for tree HOT 2
- I upload 1490 sequences to nextclade, and upload to auspice.us, why it shows me 4255 sequences? HOT 2
- Allow suppression of ` |(reverse complement)` suffix in header of alignment output HOT 2
- how many genomes can nextclade handle? HOT 1
- PCR primer mutation functionality[v3] HOT 13
- 3.0.0 version not retrieving RSV datasets HOT 1
- Can support be extended for SC2 Datasets V2 for another month? HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nextclade.