Giter Club home page Giter Club logo

adsim's Introduction

Ad-Sim

Pipeline for consistent and standardized simulation of admixed genotypes. Creates simulated genotypes as well as pruned and thinned genotypes for benchmarking at different snp levels. Based off of the code run by Angela Andaleon which in turn uses the admixture simulation tool created by the makers of RFMix.

This pipeline has been made mostly to maintain consistency for ease of benchmarking different admixture analysis softwares. Adsim pipeline first creates simulated genotypes using the admixture simulation tool. From there these can be consistently pruned and or thinned as the user desires and as different software demands. This makes it easy to benchmark at different numbers of snps. Note, adsim is designed to prune and thin each input population independently. This results in different snps across each files. That said, it is fairly trivial to analyze them jointly by concatonating vcf files and pruning/thinning from there.

Data

Pipeline is tested on publicly available data from 1000 genomes

Workflow

Pipeline runs in two steps

  1. Simulate the population for a particular chromosome
  2. Prune and/or thin the simulated genotypes to a certain number of snps

Design principles

This pipeline is only designed to run on one chromosome at a time. If you wish to run it on multiple chromosomes you'll either have to run it multiple times or edit the pipeline accordingly. This pipeline has been designed with the intention of running each population individually, resulting in different snps appearing in each of your pruned/thinned files. In theory, however, it should be simple to keep your snp list consistent across all populations by combining each into one vcf file, pruning/thinning from there, and finally separating the vcf file by population.

Software

At testing all software is run on a linux machine running ubuntu

adsim's People

Contributors

ryanschu avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.