Giter Club home page Giter Club logo

modas's Introduction

MODAS

MODAS: Multi-Omics Data Association Study toolkit

Installation

Installation using conda

git clone https://github.com/liusy-jz/MODAS.git
cd MODAS
conda create -n modas python=3.7 -y
conda activate modas
python setup.py build
python setup.py install

pip install pyranges
conda install -y -c conda-forge r-rcppeigen r=3.6 rpy2
Rscript -e 'install.packages(c("data.table", "ggplot2", "ggsignif", "Matrix"), repos="https://cloud.r-project.org")'
Rscript -e 'install.packages("bigsnpr", dependence=T, repos="https://cloud.r-project.org")'

echo "export PATH=`pwd`/utils:\$PATH" >> ~/.bashrc
source ~/.bashrc

Mannual Installation

#Depends: R(>=3.6), python(>=3.7)

git clone https://github.com/liusy-jz/MODAS.git
cd MODAS
python setup.py build
python setup.py install

pip3 install rpy2 pyranges
Rscript -e 'install.packages(c("data.table", "ggplot2", "ggsignif", "Matrix"), repos="https://cloud.r-project.org")'
Rscript -e 'install.packages("bigsnpr",dependence=T, repos="https://cloud.r-project.org")'

echo "export PATH=`pwd`/utils:\$PATH" >> ~/.bashrc
source ~/.bashrc

A toy try

Downloading example data

MODAS_data containing sample data for MODAS and omics data used in the article uploaded by Git extension Git Large File Storage (LFS), first download Git LFS from https://git-lfs.github.com/, and place the git-lfs binary on your system’s executable $PATH or equivalent, then set up Git LFS for your user account by running:

git lfs install

next download MODAS_data by running:

git clone https://github.com/liusy-jz/MODAS_data.git

When the download is complete, first check the integrity of the downloaded data, MODAS_data contains five folders, namely agronomic_traits, genotype, metabolome, transcriptome and example_data, also contains a gene annotaion file for maize. The example folder contains sample data for MODAS, while other folders contain the omics data used in the article.

Then, enter the MODAS_data directory,

cd MODAS_data

Generate pseudo-genotype files

MODAS.py genoidx -g example_data/example_geno -genome_cluster -o example_geno

Pseudo-genotype files generated by genoidx subcommand will be saved as example_geno.genome_cluster.csv.

Prescreen candidate genomic regions for omics data

The prescreen subcommand uses genome-wide genotype files to calculate the kinship matrix, first extract genotype files by:

tar -xvf genotype/chr_HAMP_genotype.tar.gz

Then, the pseudo-genotype file example_geno.genome_cluster.csv generated by genoidx and the example_phe.csv file under the example_data folder are used for prescreen analysis,

MODAS.py prescreen -g ./chr_HAMP -genome_cluster example_geno.genome_cluster.csv -phe example_data/example.phe.csv -o example

Prescreen subcommand generates two files including example.sig_omics_phe.csv containing phenotype data and example.phe_sig_qtl.csv containing candidate genomic regions of phenotype.

Perform regional association analysis to identify QTLs

The prescreen subcommand outputs are used for regional association analysis,

MODAS.py regiongwas -g ./chr_HAMP -phe example.sig_omics_phe.csv -phe_sig_qtl example.phe_sig_qtl.csv -o example

Regiongwas subcommand generates two QTL files including example.region_gwas_qtl_res.csv containing reliable QTL results and example.region_gwas_bad_qtl_res.csv containing unreliable QTL results.

Perform Mendelian randomization analysis

MODAS.py mr -g ./chr_HAMP -exposure ./example_data/example.exp.csv -outcome agronomic_traits/blup_traits_final.new.csv -qtl example_data/example_qtl_res.csv -mlm -o example

The results of Mendelian randomization analysis are saved as example.MR.csv.

MR-based network analysis

MR-based network analysis is carried out by the parameter net of mr subcommand. It uses transcriptome data for subnetwork modules analysis,

MODAS.py mr -g ./chr_HAMP -exposure ./example_data/network_example.exp.csv -outcome ./example_data/network_example.exp.csv -qtl example_data/network_example_qtl.csv -mlm -net -o network_example

Network analysis generated four files, including network_example.MR.csv containing gene pairs with MR effect, network_example.edgelist containing gene pairs with weight, network_example.cluster_one.result.csv containing all identified subnetwork modules, network_example.sig.cluster_one.result.csv containing significant subnetwork modules.

co-associated gene analysis

Co-associated genes analysis is not a modas function. It is implemented by script co-associated.py. The analysis command line is as follows:

python3 example_data/co-associated.py example_data/co_associated.test.pvalue.csv co-associated_test

Then, a file containing co-associated gene labels and a heatmap showing relationship between co-associated genes are saved as co-associated_test.cluster.csv and co-associated_test.cluster.heatmap.pdf.

Document

detail in https://modas-bio.github.io/

modas's People

Contributors

liusy-jz avatar crazyhsu avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.