Giter Club home page Giter Club logo

hapdab's People

Contributors

schae234 avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Forkers

rafalcode

hapdab's Issues

Design the Cassandra database schema for variants

I need to know if the philosophy behind the Cassandra schema differs from the one we used for SQLite. I know that Cassandra believes in denormalization so we might be sacrificing efficiency for space (which might have sped up SQLite in the first place)

Read up on Spark and Cassandra

As a developer I need to know the basics on Cassandra and Spark. I need this information to implement the features we are proposing to use.

Create a test VCF

This test VCF needs to be small enough to fit into the GitHub Repo but large enough to be useful.

Create a prototype of HapDab that is backed by Spark

Value Statement

As a geneticist, I need easy and fast access to genotype data. I need this because I want to be able to quickly explore my data and evaluate genomic structure in regions that interest me. The tools that are out there are hard to use and often require me to stitch many different tools together in order to get the data I want.

Completed When

All tasks assigned to this epic are QA'd and Closed. In general, code pushed to GitHub will be backed by Spark/Cassandra and will cover the use cases we performed in the MNEc2M paper. There will also be benchmarks to identify if this tech is better than the default Minus80 stack.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.