Giter Club home page Giter Club logo

review-nar-databases's Introduction

Review Nucleic Acid Research (NAR) publications of databases for PDB-releated databases

The program searches database publication at NAR to find any database that uses PDB data. The program look for PDB keywords in the summary and abstract of each database published at NAR and measure PDB relation in three tiers:

  • Tier 1 uses PDB data
  • Tier 2 likely uses PDB data
  • Tier 3 may use PDB data

The results searve as a guide for next-step manual review

How to run the program

The scripts are to be run in 3 steps:

Step 1: Find all databases to be searched.

Run p1_parse_main.py to parse databases from NAR database summary, plus the category, the summary page etc. Give output file "p1_db_summary.tsv" of tabular form of databases' name, summary link, category, subcategory.

Step 2: Review summary pages of each database:

Run p21_process_all_db_summary.py, look for PDB keywords. Give output file "p2_db_summary_review.tsv" of tabular form of db name, categories, db url, year, abstract url, length description, keywords t1/t2/t3.

Step 3: Review abstract of each database:

Run p31_process_all_nar_abstract.py, look for PDB keywords and test accessibility of the url of each database. Give output file "p3_db_abstract_review.tsv" of tabular form of db name, categories, db url, year, abstract url, length description, keywords t1/t2/t3, and database url accessibility.

Notes:

Other python scripts such as those state with "p0" are utility scripts.

review-nar-databases's People

Contributors

shaochenghua avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.