Giter Club home page Giter Club logo

varskip's Introduction

VarSkip multiplex PCR designs for SARS-CoV-2 sequencing

This repository contains reference files that describe the VarSkip primers used to amplify overlapping fragments of the SARS-CoV-2 virus.

In an attempt to reduce the impact of variants on amplification efficiency, these primers were designed using PrimalSeq after masking the SARS-CoV-2 reference sequence for SNVs observed more than 4000 times in GISAID. More details can be found in scientific presentations:

Primer Set Name Primer positions Primer sequence details
VarSkip 1a neb_vss1a.primer.bed neb_vss1a.primer.tsv
VarSkip 2 neb_vss2a.primer.bed neb_vss2a.primer.tsv
VarSkip 2b (2a + spike-ins) neb_vss2b.primer.bed neb_vss2b.primer.tsv
VarSkip Long 1a neb_vsl1a.primer.bed neb_vsl1a.primer.tsv

VarSkip primers are commerically available at NEB.com as part of the ARTIC sequencing kits. As of 2022-02-15 VarSkip 2 primers have replaced VarSkip 1a primers in these products. As of 2022-04-12, 2 supplemental primers have been tested as spike-in additions to the VarSkip2 pools to improve coverage of amplicons 49 and 57 which are affected by common BA.2 variants.

VarSkip Long primers are available as a custom product for larger volume sequencing.

The files in this repository can be used with analyis methods designed for other multiplex designs (e.g. ARTIC). To avoid miscalling or masking variants in priming regions, it's important to use primer coordinates that match the primers used during amplification (e.g. do not use ARTICv3 primer files with VarSkip libraries).

The same information is presented in multiple formats for convenience. Most workflows should use neb_vss1a.primer.bed or neb_vss2b.primer.bed.

Analysis methods known to work:

  • galaxy ARTIC

  • viralrecon (tested with illumina data)

  • ARTIC field bioinformatics docs

    The ARTIC field bioinformatics pipeline is commonly used to analyize results produced on the Oxford Nanopore instruments. It requires a specific directory structure and primer naming to operate correctly. The schemes directory contains a reformatted primer annotation and reference genome that meet those constraints.

    Example commands (tune this to your instrument-specific model, primer set, and directories):

    VarSkip 1a: artic minion --threads 4 --read-file example_vss1a.fastq --scheme-directory schemes --scheme-version 1a NEB_VarSkip Example --skip-nanopolish --medaka --medaka-model r941_min_fast_g303

    VarSkip 2a: artic minion --threads 4 --read-file example_vss2a.fastq --scheme-directory schemes --scheme-version 2a NEB_VarSkip Example --skip-nanopolish --medaka --medaka-model r941_min_fast_g303

    VarSkip 1a Long: artic minion --threads 4 --read-file example_vsl1a.fastq --scheme-directory schemes --scheme-version 1a-long NEB_VarSkip Example --skip-nanopolish --medaka --medaka-model r941_min_fast_g303

Many libraries produced using VarSkip primers are available via the NCBI SRA. e.g. SRR16375192

varskip's People

Contributors

bwlang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

varskip's Issues

VarSkip V2b Issue

Hello,

I noticed one of the primer names is incorrect:

varskip-0371 instead of varskip-0317 e.g.varskip-0371-1_49_RIGHT_ALT1

Question about alt primers in V2b

Hi there,
I am documenting widely used primer schemes for sequencing viral pathogens as part of a @pha4ge standardisation effort with the aim of facilitating reuse of viral amplicon sequencing data.

It is my understanding that all of these schemes (V1a, V1a-long, V2a, V2b) use primers exactly matching the reference sequence at their respective coordinates. Is this correct?

In particular, I see that the V2b contains two primers marked as ALT:

Do these also match the reference sequence exactly at their specified coordinates? If so, I can easily create 7 column primer.bed definitions of these schemes for standardisation purposes. If not, would you be able to share the exact sequences of any non-ref primers?

Perhaps Brad (@bwlang) is able to comment?

With many thanks,
Bede

Error: could not find primer scheme and reference sequence

Hi!
I used the Varskip 2a primers for the first time and I am having issues running medaka in the artic bioinformatics protocol (which is the one I always use when I use the V3 primers). I am using the pipeline described in your page, but getting the same error over and over again:
(artic-ncov2019) Lotorres-mac5:S5-027 admin$ artic minion --threads 4 --read-file /Users/admin/Run_07/porechop/S5-027_BC68.fastq --skip-nanopolish --scheme-directory /Users/admin/artic-ncov2019/primer_schemes/nCoV-2019/NEB_VarSkip --medaka --medaka-model r941_min_fast_g303 NEB_VarSkip S5-027 --scheme-version 2a
could not find primer scheme and reference sequence, attempting to download
Running: artic-tools get_scheme NEB_VarSkip --schemeVersion 2a
Could not convert: --schemeVersion = 2a
Run with --help for more information.
scheme download failed

We have used several combinations of commands (includin adding the command --scheme-version) but finally getting the same error, where it doesn't find the primer scheme and the reference sequence. We have created another folder inside the nCoV-2019 folder where the V1, 2, 3 and 4 of the artic primer schemes are located. Inside this new folder (named NEB_VarSkip) we have put the files downloaded from your repository primer.bed, the reference.fasta, the primer.tsv and we even created an insert.bed. Nevertheless, the error remains the same.
Please let me know if there is something I am missing.

Thanks a lot for your help!!

Question: using VarSkip primers for poreCov

Hi,

I was adding your primer scheme into poreCov (a SARS-CoV - nanopore analysis pipeline to simplify high throughput analysis). And I wanted to check in, if the current code references are in order for you:

  • your primers are stored here
  • the poreCov help message references it like this:
Parameters - nCov genome reconstruction (optional)
    --primerV       Supported primer variants - choose one [default: V3]
                        ARTIC: V1, V2, V3, V4 
                        NEB: VarSkipV1a
                        Other: V1200

example run with your primers looks like this (without the need to provide the primer files):

nextflow run replikation/poreCov -r 0.11.1 --fastq_pass fastq_pass --primerV VarSkipV1a --update -profile local,docker 

thank you and best

V1a-Long Bed Reference

Hello,

I noticed the 1a-Long scheme bed has the reference as .2 should it be .3?

Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.