Giter Club home page Giter Club logo

phd_code's Introduction

Ranking Microbial Metabolomic and Genomic Links using Complementary Scoring Functions

This repository contains the code used to run the experiments described in the PhD thesis Ranking Microbial Metabolomic and Genomic Links using Compelmentary Scoring Functions.

The requirements.txt file describes the environment used to run the Jupyter notebooks. It surely includes packages that are not required, and does not include libraries that are. In particular, the NPLinker library (https://github.com/sdrogers/nplinker) and some other tools from Simon are used.

Other libraries may require manual install.

data

The data folder contains the structural annotations for the MIBiG database, as well as a mock .mgf file containing the structure entries from GNPS without any ions. This may very well break some parsers.

The microbial_data_sets subfolder contains the config files needed for the NPLinker downloader to download the microbial data sets from PoDP.

iokr_analysis

The iokr_analysis folder contains the scripts used to test the IOKR model. The MS2-MIBiG and BGC-MIBiG IOKR models uses the implementation from NPLinker. The MS2-BGC IOKR model is included in the directory.

mibig_gnps_data_set

The notebooks required to assemble the MIBiG-GNPS data set

phylogeny_adjustment

The notebooks used for the experiments on the phylogenetically adjusted strain correlation score

strain_correlation

The notebook used to generate the graph of the expected value and variance of the strain correlation score. The code to test the strain correlation score on the microbial data set is contained in the iokr_analysis notebooks, along with the code for the MS2-MIBiG IOKR model.

mibig_bgc_fp_distance_test.ipynb

Calculate the correlation between the BGC distance and the difference in molecular fingerprints for the associated metabolites.

phd_code's People

Contributors

grimur avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.