Giter Club home page Giter Club logo

multi_dti's Introduction

Deep learning-based integration of multi-interactome data for protein-compound interaction prediction

Requirements

  • python anaconda3(installed rdkit, chainer)

    • pubchempy (for converting PunChem ID into smiles)

    • node2vec, networkx(for applying node2vec algorithm)

    • pyensembl (for converting ensembl protein ID into amino acid sequences)

    pyensembl install --release 93 --species human

Usage

  1. Make input file for integrate model Hard dataset already processed is contained in dataset_hard except for 'protein.npy'(protein onehot). Easy dataset and middle dataset is saved as input files (csv format) that haven't preprocessed yet in dataset_easy and dataset_middle.

1.1 run preprocessed_dataset.py

python preprocessed_dataset.py --input <INPUT_FILE>

<INPUT_FILE> input file (only csv format is available)

  1. Apply node2vec to multi-interactome data

node2vec model of chemical-chemical interaction data are already saved as data_multi/modelcc.pickle. node2vec model of protein-protein interaction data are already saved as data_multi/modelpp.pickle.

construct graph network connecting compounds(proteins) interacting each other and apply the graph to node2vec here.

2.1 run omics_compound.py

python omics_compound.py

chemical_chemical_interaction.csv (multi-interactome data) is setteing as default input file.

2.2 run omics_protein.py

python omics_protein.py

protein_protein_interaction.csv (multi-interactome data) is setteing as default input file.

  1. Learn and predict by integrated model

Example

You can demonstrate our program with test data.

cd /path/to/multiDTI/data_multi
python omics_compound.py
python omics_protein.py
  • 5 fold Cross-validation for hard-dataset

    • PreProcessed train dataset
    python preprocessed_dataset.py --input <FILEPATH> --data <DATA>

    <FILEPATH> './dataset_hard'

    <DATA> './train'

    • PreProcessed test dataset
    python preprocessed_dataset.py --input <FILEPATH> --data <DATA>

    <FILEPATH> './dataset_hard'

    <DATA> './test'

    • Learn
    cd /path/to/multiDTI
    python training.py --input <FILEPATH> --output <DIRECTORY>
    • Predict
    python evaluation.py --input <FILEPATH> --output <DIRECTORY> --place <PLACE_DIRECTORY>

Reference

  • Watanabe, N., Ohnuki, Y., Sakakibara, Y.
    • Deep learning integration of molecular and interactome data for protein-compound interaction prediction

multi_dti's People

Contributors

njk-901aru avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.