sgosline / hyphalnet Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 1.0 30.66 MB

Basic implementation of community detection in networks

License: MIT License

Python 68.80% R 30.10% Dockerfile 1.10%

hyphalnet's People

Contributors

Stargazers

Watchers

Forkers

lthura

hyphalnet's Issues

Compute distances within and across hyphal networks for AML/CPTAC data

For the pan-cancer data we can compute 'distances' between the various files. We need these distances to better assess patients activity. See below.

hyphalnet/examples/cancerProtFeatures/panCancerTest.py

Line 132 in a77ae88

if args.getDist:

Update the significant genes script to compute these values.

Replace omicsIntegrator with pcst_fast

We do not need the full omicsintegrator package. instead we can build out tools to build igraph/networkx/array versions of graphs in a much easier format.

Compute GO enrichment for communities in AML/CPTAC data

currently the code computes GO enrichment for the individual patients, but not the communities. update the code to compute enrichment for the communities. See pancancer script for example.

Create function that pulls somatic data from files into dictionary

Current somatic mutations for MPNST datasets are located in syn22136398. To process these data in a network we first need to pull download the files and collect the per-gene allele depth for each sample. The final data structure will be a dictionary of dictionaries. The primary key will be the specimenID, the internal dictionary will be keyed on gene symbol. the value will be the allele depth.

Downloading a synapse file is straightforward once the package is downloaded.

from synapseclient import Project, File, Folder
from synapseclient import Schema, Column, Table, Row, RowSet, as_table_columns

syn = synapseclient.Synapse()
syn.login()

project = syn.get('syn123')

The results for this will go into the [./examples/mutationDrugResponse] directory for now (we can consider making a synapse utility but i'm not sure about this.

Create function that pulls proteomics data from table and computes z-score.

Using the CPTAC patient data, pull the 'LogRatio' column from the molecular table that you accessed in #2 and for each gene, compute the z-score (i.e. (x-mean(x))/sd(x)) so we can assess the relative expression of each gene for each patient. Then create a dictionary of dictonaries - one for each sample, one for each gene/protein value.

Create function that queries from synapse table to create mutation dictionary

This is similar to #1 but might be trickier (or easier) because mutational data for CPTAC is stored in a synapse table. this requires a slightly different syntax to pull the data and put it into a dictionary of dictionaries. The primary key on the dictionaries is the sample identifier, the internal one is the gene symbol.

The table is here: https://www.synapse.org/#!Synapse:syn22172602/tables/

sgosline / hyphalnet Goto Github PK

hyphalnet's People

Contributors

Stargazers

Watchers

Forkers

hyphalnet's Issues

Compute distances within and across hyphal networks for AML/CPTAC data

Replace omicsIntegrator with pcst_fast

Compute GO enrichment for communities in AML/CPTAC data

Create function that pulls somatic data from files into dictionary

Create function that pulls proteomics data from table and computes z-score.

Create function that queries from synapse table to create mutation dictionary

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent