Giter Club home page Giter Club logo

oxmatch's Introduction

OXMatch

A simple script for matching sequences from two different MSAs based on the OX identifiers resulting from an hhblits search. The top hits from each MSA and OX will be selected since these are currently the most likely ortholog selections

Input: MSAs to be matched in a3m format

Output: Merged MSA i a3m format

Requirements: python3 and numpy (v.1.16.6 or higher)

Run like this:

A3M1=./test/1KA9_F.a3m #Path to a3m1

A3M2=./test/1KA9_H.a3m #Path to a3m2

MGF=0.9 #Max gap fraction allowed in each sequence

OUTDIR=./test/ #Output directory

python3 oxmatch.py --a3m1 $A3M1 --a3m2 $A3M2 --max_gap_fraction $MGF --outdir $OUTDIR

Some statistics of different ways of matching MSAs

Comparison of the results from matching the top and bottom hits for each OX from an hhblits (https://toolkit.tuebingen.mpg.de/tools/hhblits) search with that of no pairing whatsoever (noselection). The dataset used for the comparison consists of around 1600 protein-protein interactions and can be found here: https://www.nature.com/articles/s41467-021-21636-z The evaluation refers to how many of the top 10 GaussDCA correlations in the interface are actually true contacts (having CBs within 8ร… from each other). It is twice as likely to obtain at least 5 true contacts when matching the top hits vs the bottom ones (and virtually infinite compared with no matching).

HHblits has been run with the following options: -E 0.001 -all -oa3m

Image 1

oxmatch's People

Contributors

patrickbryant1 avatar

Stargazers

Alex avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.