Giter Club home page Giter Club logo

generatemutationdata's Introduction

GenerateMutationData

#List of research questions

  • Can the model represent the location correctly? Basically, the question is if the model can copy the numbers from the src-text
  • Is the model able to recognize different surface forms of AA (Can we train a model on long/tripple/or single-forms only?)
    • Long-form (Alanine)
    • Tripple-form (ala)
    • Single-form (A)
  • How does our model perform on PubMed? (i.e., use our data from tmVar or SETH)
  • Train a model on instances from Pattern 1-100 and test it on unseen patterns
  • Can we train on longer context? E.g., use the sentence instead of single instances?
  • Can we (to some degree) predict the reference sequence (e.g., cDNA, proteim) or different mutation types (e.g., substitution, deletion)

TODO!

  • The list of amino acids is not complete (e.g., Ter, Termination, X are missing)
  • The amino acids currently are all uppercase (I think that is nat what we want :)
  • I generated the three amino-acids maps by hand! They are incorrect and some long forms are missing!
  • Currently, I only generated patterns for Amino acid subsititutions
  • Currently, I genereated only traning instances with the long form amino-acids (this can easily be changed)

generatemutationdata's People

Contributors

erechtheus avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.