Giter Club home page Giter Club logo

splintr_preprocessing's Introduction

SPLINTR Barcode Library Pre-processing.

For more information see the paper Fennell and Vassiliadis et al., 2021

Software Dependencies

Procedure to generate SPLINTR reference barcode libraries:

This process assumes familiarity with and access to a HPC cluster running the SLURM scheduler.

  1. Amplify SPLINTR barcodes from each library plasmid pool by PCR (see attached PCR protocol).
  • Perform PCRs in technical duplicate
  • Use a different index combination in PCR2 for each replicate
  1. Sequence library pool to a depth of at least 100-200M reads (~100X coverage for a 1 million barcode library) per PCR reaction.
  • Lower diversity libraries will require less sequencing.
  • Ideally perform single end 150bp or paired end 75bp sequencing, DO NOT perform single end 75bp as reads will not cover the entire barcode.
  1. Generate a text file containing full path of each library fastq file. See example fastqs.txt

  2. Confirm sequencing quality by running FastQC

  • [OPTIONAL] if paired end 75bp sequencing was performed. Run flash.sh to combine paried-end reads into a single overlapped read that covers the entire barcode region.
  1. Extract barcode reads from fastq files using filter_barcodes.sbatch which submits extractBarcodeReads.py scripts to a SLURM HPC

  2. Run Starcode on the extracted barcode reads to cluster barcodes according to Edit distance. See starcode.sbatch

  3. Perform QC and analysis of Starcode output in R using library-analysis-template.Rmd

Output will include a fasta file of barcode sequences within each library which can be used to generate a reference library for subsequent barcode alignment. See example output in library-analysis-template.html and test

splintr_preprocessing's People

Contributors

danevass avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.