Giter Club home page Giter Club logo

ecmatlas's People

Contributors

aberhane13 avatar spencersteers avatar

Watchers

 avatar  avatar  avatar

ecmatlas's Issues

Create fixtures for models

  • Protein
  • Tissue
  • ProteinHit
  • Experiment
  • VariableModification

Refer to the original database for some dummy data

Create data loader interface

HTML page to upload files

  • Multiple file uploads
  • Asynchronous uploads
  • Read the files and leave a stub for the parser

Create JSON apis for models

  • Protein /proteins/
  • Tissue /tissues/
  • Experiment /tissues/
  • ProteinHit /proteinhits/
  • VariableModification /variablemodifications/

Wikipedia Integration

Example file of pulling wikipedia intro paragraph for a given protein.
http://cl.ly/code/3l1p0e442806

Will need to make a table for 'Editorial Content'. One to one relation with 'Proteins' that holds wikipedia information. Will need to store the summary and a link to the page on wikipedia.

Proteins need more high level data

Currently the Protein model is pretty plain. Most of the data comes from it's relationship to Experiments/ProteinHits.

Should have more general knowledge like:

https://www.proteomicsdb.org/proteomicsdb/#human/proteinDetails/52924/summary
http://en.wikipedia.org/wiki/MYH6 (sidebar)

Examples Proteomics
Localization: Chromosome 14: 23851199 - 23877486 reverse strand
Ensemble Gene/Transcript: ENSG00000197616/ENST00000356287
Gene Name: MYH6(MYHCA)
UniProt AC/ID: P13533/MYH6_HUMAN
Organism: Homo sapiens (human)
Evidence: isoform

Fetch Uniprot ID's for proteins

Programmatic access guide: http://www.uniprot.org/help/programmatic_access

Example query for gene col1a1 to get id: http://www.uniprot.org/uniprot/?query=gene:col1a1+AND+organism:rat&format=tab

Look up how to make http queries in python and how to iterate through a tab delim string.

  • Link proteins in database to a uniprot identifier
  • Provide a link to Uniprot for each protein
  • Fetch Uniprot protein descriptions and insert into database (might be done with wikipedia now #16 )
  • Allow for fetched Uniprot protein descriptions to be changed and updated (might be done with wikipedia now #16)

Admin Interface

Set up the Django admin interface so that the tables are available for editing. Start with small models like Tissue, Family and FunctionalGroup. Eventually the admin will need to be updated to allow editing of the Uniprot (#15) and Wikipedia (#16) data.

Dataloader Updates

New fields and tables will be needed. Experiment table with a date and name. Dataset table will need a foreign key relationship to Experiment.

All data loading happens in ecmdatabase.views.py

CSV parsing is handled in ecmdatabase.dataset_parser.py. This is where column names would be changed to match the new dataset.

The template being rendered is ecmdatabase/templates/upload.html and the loader can be viewed at /datasets/upload/. I'd start by displaying the upload date of a dataset on this page.

  • Add support for experiment ID, experiment date, and relative concentration fields to dataset parser
  • Allow for multiple datasets to be grouped under an experiment ID
  • Add experiment date to data loader
  • Display dataset upload date when viewing list of uploads in the data loader

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.