Giter Club home page Giter Club logo

gpn's Introduction

Genotype and Phenotype Network (GPN)

A novel method for multiple phenotype association studies by constructing a bipartite signed network, linking phenotypes and genotypes into a Genotype and Phenotype Network (GPN), which is a new insight to investigate correlation among phenotypes. The GPN can be constructed by both quantitative and qualitative traits, especially phenotypes have extremely unbalanced case-control ratios. The bipartite network shelds the light to Multiple Phenotype Assotiation Test, Colocalization analysis, Heritability Enrichment analysis, and Phenome-Wide Association studies.

This package "GPN" includes the following parts:

  • Construction of bipartite GPN: Construction of the Genotype and Phenotype Network with the individual-level phenotype and genotype data or GWAS Summary statistics; Then detect the well-defined representations of GPN by comparing the network topologies with the random networks, including connectivity, centrality, and system entropy;
  • Community detection method: Novel community detection method to partition K phenotypes into disjoint network modules based on the similarity matrix from GPN;
  • Multiple Phenotype Association Tests with and without considering the network modules: test the association between K phenotypes with a SNP. The phenotypes can be either qualitative or binary, especially the binary phenotypes with the extremely case-control ratio (the test statistics has been adjusted by the saddlepoint approximation)
    • ceCLC: Modified computational efficient clustering linear combination method
    • CLC: Modified clustering linear combination method
    • HCLC: Modified clustering linear combination method based on hierarchical clustering
    • MultiPhen: joint model of multiple phenotypes can increase discovery in GWAS
    • O'Brien: Modified O'Brien (SSU) method
    • Omnibus: Modified omnibus (Chi-square) method
    • TATES: Modified TATES method
    • ACAT: Cauchy combination method
    • HMP: Harmonic mean p-value method
  • Heritability Enrichment Analysis: Construct network topology annotations of genetic variants that quantify the possibility of pleiotropy, and apply stratified linkage disequilibrium (LD) score regression to highly correlated phenotypes to identify enriched annotations.

Installation

You can install the released version of APGD from Github with:

devtools::install_github("xueweic/GPN")
library(GPN)

Reference

Xuewei Cao, Shuanglin Zhang, Qiuying Sha*. A novel method for multiple phenotype association studies based on genotype and phenotype network.

Xuewei Cao #, Lirong Zhu #, Xiaoyu Liang, Shuanglin Zhang, Qiuying Sha*. Constructing genotype and phenotype network helps reveal disease heritability and phenome-wide association studies

Any questions? [email protected]; [email protected]

Graphical Overview

Construction of bipartite GPN
  • a โ€“ c. Construction of the denser and well-defined representations of GPN by comparing the network properties with the random networks, including connectivity, centrality, and system entropy. Specifically, each phenotype and each SNP form a directed edge which represents the strength of the association, where the red color indicates that the minor allele of the SNP is a protective allele to the phenotype, and the blue color indicates that the minor allele of the SNP is a risk allele to the phenotype.
  • d. The weighted degree distributions with different thresholds and the examples of two network topology annotations, approximate betweenness centrality and degree centrality, used in the heritability enrichment analysis;
  • e. The one-mode projection of GPN onto phenotypes that are linked through shared genetic architecture. Heritability enrichment analysis and phenome-wide association studies are introduced as two important applications of the constructed GPN.
Heritability Enrichment Analysis
  • Enrichment Analysis Construct network topology annotations of genetic variants that quantify the possibility of pleiotropy, and apply stratified linkage disequilibrium (LD) score regression to highly correlated phenotypes to identify enriched annotations. The constructed network topology annotations are informative for disease heritability after conditioning on a broad set of functional annotations from the baseline-LD model.
Phenome-wide Association Studies (PheWAS)
  • Community Detection. The powerful community detection method is used to partition phenotypes into disjoint network modules with different colors.
  • FDR Controlling Precedure.
Multiple Phenotype Assocaition Tests
  • Construction of PPN. Construction of a signed network, named the Phenotype and Phenotype Network (PPN), which is the one-mode projection of GPN on phenotypes.
  • Community Detection. The powerful community detection method is used to partition phenotypes into disjoint network modules with different colors.
  • Multiple Phenotype Assocaition Tests. Multiple phenotype association tests are applied to test the association between phenotypes in each of the network modules and a SNP, then the Bonferroni correction is used to obtain the overall p-value.
  • Colocalization analysis. Colocalization of GWAS signals and eQTL analysis.

gpn's People

Contributors

julia-lirong avatar xueweic avatar

Stargazers

 avatar

Watchers

 avatar

Forkers

julia-lirong

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.