Giter Club home page Giter Club logo

pkpdb's Introduction

PKPDB

A database of over 12M theoretical pKa values calculated over 120k protein structures deposited in the Protein Data Bank. DOI: 10.1093/bioinformatics/btab518

While this repo contains all the necessary code to replicate pKPDB, you may download the full data from pypka.org/pKPDB.

@article{reis2021pKPDB,
    author = {Reis, Pedro B. P. S. and Clevert, Djork-Arné and Machuqueiro, Miguel},
    title = "{pKPDB: a Protein Data Bank extension database of pKa and pI theoretical values}",
    journal = {Bioinformatics},
    year = {2021},
    month = {07},
    issn = {1367-4803},
    doi = {10.1093/bioinformatics/btab518}
}

DB diagram

Create/Update Protein Data Bank proteins

dbs/init.sql contains the schema of the database

dbs/update_db.sh downloads the latest entries to the Protein Data Banks and inserts them into the database

cd initial/
psql -d pkpdb -f init.sql

cd PDB_data/
bash update_db.sh

mmseqs database of fasta files also needs to be updated

cd src/extra_properties/DB_PDB/
mmseqs databases PDB ./DB_PDB tmp
rm -rf tmp

Add proteins to the database

# Add a random protein
python3 fill.py

# Add the protein with the IDCODE <IDCODE>
python3 fill.py --idcode <IDCODE>

Extra options:

  • ncpus NCPUS
  • verbose {DEBUG,INFO,WARNING}
  • nres-limit NRES_LIMIT

Dependencies

python3 -m pip install psycopg2-binary sqlalchemy pypka biopython python-decouple

mmseqs and DSSP are also required for running extra_properties/solvent_exposure.py

pkpdb's People

Contributors

pedrishi avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.