ovcsubtypes's Introduction

Analysis and report generation for ovarian subtyping project

(this will be updated shortly to reflect new directory structure)

Instructions:

First, install MetaGxOvarian (contact Greg Chen or Deena Gendoo for the latest version). Then install MetaGx with the commands:

git clone https://github.com/bhklab/MetaGx.git
R CMD build MetaGx
R CMD INSTALL MetaGx_0.9.9.tar.gz

Clone this repository:

git clone https://github.com/bhklab/OvcSubtypes.git

In the "reports" directory, there are three main knitr files:

  • reproduceResults.Rnw
  • classificationAcrossDatasets.Rnw
  • robustness.Rnw

Note that classificationAcrossDatasets.Rnw produces the file "esets.not.rescaled.RData", which contains samples of high-grade serous ovarian cancer whose expression values have not been z-score rescaled by gene. To ensure consistency across all analyses, this file is used by robustness.Rnw and batch.cluster.all.R.

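The three knitr files can be run within an R session with the commands below. knit() writes a .tex file for each .Rnw file, which is then compiled to PDF with LaTeX (knitr's knit2pdf() performs both steps):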

library(knitr)
knit("reproduceResults.Rnw")
knit("classificationAcrossDatasets.Rnw")
knit("robustness.Rnw")

(Alternatively, they can be compiled in RStudio, making sure Preferences -> Sweave -> Weave Rnw files is set to knitr.)

Note that several dependencies may need to be installed. Packages on CRAN can be installed with, for example,

install.packages("xtable")

Packages on Bioconductor can be installed with, for example,

source("https://bioconductor.org/biocLite.R")
biocLite("survcomp")

(On R 3.5 or later, biocLite has been replaced by the BiocManager package: run install.packages("BiocManager"), then BiocManager::install("survcomp").)

Within robustness.Rnw, there are two main components:

  • a reproduction of clustering algorithms and comparison to the cluster labels given in original supplementary texts (producing heatmaps demonstrating concordance between our implementation and the original results)
  • evaluation of robustness using prediction strength

The prediction strength analysis depends on the output of a fairly computationally intensive run, which produced the output directory jan20clusters. In order to read this output, cd into the "reports" directory and extract the compressed directory:

tar -xvzf jan20clusters.tar.gz
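
The extraction step can be wrapped in a small guard so that it fails clearly when the archive is missing and does nothing when the output is already unpacked. The following is a minimal sketch, not part of the repository: extract_clusters is a hypothetical helper, and it assumes the archive unpacks to a directory with the same base name (jan20clusters).

```shell
#!/bin/sh
# Sketch: extract the precomputed clustering output only when needed.
# extract_clusters is a hypothetical helper, not a script shipped with the repository.
extract_clusters() {
    reports_dir=$1                       # path to the "reports" directory
    archive=${2:-jan20clusters.tar.gz}   # archive name taken from the README
    out_dir=${archive%.tar.gz}           # ASSUMPTION: the archive unpacks to this directory

    if [ ! -f "$reports_dir/$archive" ]; then
        echo "missing archive: $reports_dir/$archive" >&2
        return 1
    fi
    if [ -d "$reports_dir/$out_dir" ]; then
        echo "already extracted: $reports_dir/$out_dir"
        return 0
    fi
    # -x extract, -z decompress with gzip, -f archive file; -C sets the target directory
    tar -xzf "$reports_dir/$archive" -C "$reports_dir"
}
```

From the repository root, this would be called as extract_clusters reports. Note that -f must come last in the option cluster because it takes the archive name as its argument.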

(Addendum) Performing cluster analysis for prediction strength analysis:

In the prediction strength analysis, we re-cluster each dataset with each algorithm 100 times. Because this is computationally expensive, the work is parallelized on an SGE cluster using the files "submit_batch_clustering.sh" and "batch.cluster.all.R". Once "esets.not.rescaled.RData" exists, the command

./submit_batch_clustering.sh

submits an array job to the cluster, creating a directory with clustering output. This directory is read in "robustness_validation.Rnw".

This was most recently performed on January 20, producing the output jan20clusters.tar.gz
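
The contents of submit_batch_clustering.sh are not reproduced here, so the following is only a sketch of one common way an SGE array job can cover every combination of dataset, algorithm and repetition. SGE_TASK_ID is the variable SGE sets for each array task; decode_task, N_DATASETS and N_ALGORITHMS are hypothetical names, and the dataset and algorithm counts are placeholders rather than the project's real values.

```shell
#!/bin/sh
# Sketch: map a 1-based SGE array task ID onto (dataset, algorithm, repetition).
# N_DATASETS and N_ALGORITHMS are placeholder counts, not the project's real values.
N_DATASETS=4
N_ALGORITHMS=3
N_REPS=100   # the README states 100 re-clusterings per dataset and algorithm

decode_task() {
    idx=$(( $1 - 1 ))                                   # SGE task IDs start at 1
    rep=$(( idx % N_REPS + 1 ))                         # repetition varies fastest
    alg=$(( idx / N_REPS % N_ALGORITHMS + 1 ))          # then the algorithm
    dataset=$(( idx / (N_REPS * N_ALGORITHMS) + 1 ))    # then the dataset
    echo "$dataset $alg $rep"
}

TOTAL_TASKS=$(( N_DATASETS * N_ALGORITHMS * N_REPS ))

# Inside an array task SGE sets SGE_TASK_ID to a number; otherwise it is
# unset or the literal string "undefined", and nothing is printed.
if [ "${SGE_TASK_ID:-undefined}" != "undefined" ]; then
    decode_task "$SGE_TASK_ID"
fi
```

With these placeholder counts, decode_task 101 prints "1 2 1" (first dataset, second algorithm, first repetition). An array job covering all tasks would be requested with qsub -t 1-1200, and each task would pass its decoded indices to the clustering script.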

ovcsubtypes's People

Contributors

bhaibeka, dgendoo, gmchen, lavkan, lwaldron, natchar

ovcsubtypes's Issues

error in classificationAcrossDatasets.Rnw

I can't compile this script - can you, @lavkan or @gmchen? The final lines of output I get are:

 3 : echo keep.source term verbatim (label = load_data2, classificationAcrossDatasets.Rnw:48)
Clean up the esets.

Error:  chunk 3 (label = load_data2) 
Error in .validate_assayDataElementReplace(obj, value) : 
  object and replacement value have different dimensions
Execution halted
