Giter Club home page Giter Club logo

scilhs-ontology's Introduction

SCILHS i2b2 PCORnet Common Data Model Ontology

The Scalable Collaborative Infrastructure for a Learning Health System (SCILHS, pronounced "skills") is a network of 10 health centers across the United States that will cover over 8 million patients. SCILHS is a Clinical Data Research Network (CDRN) in the Patient-Centered Outcomes Research Institute's PCORnet.

SCILHS (with much help from others) has developed an i2b2 information model that represents the PCORnet Common Data Model (CDM). This information model consists of an i2b2 ontology/terminology and a process for mapping local data elements to the ontology without changing the underlying imported data. This approach highlights i2b2's ability to separate data model from both information model and the underlying data format.

By conforming to this ontology, SCILHS sites can query across the network with an interoperable data model using SHRINE, and we provide a script to programmatically generate data marts in the PCORnet data model format. This will enable detailed analysis through the PCORnet Distributed Research Network (DRN). Likewise, if PCORNet Queries were rewritten to run against the i2b2 schema, they could run reliably at every site using the PCORNet CDM i2b2 ontology.

The ontology is live at https://www.i2b2.org/webclient/. Change the username to pcori, leave the password as demouser, and click 'Login'. The existing demo data has been mapped to the PCORI ontology, so most queries involving demographics, diagnoses, procedures, and enrollment will work. This did not require any changes to the demo data, only to the information model (ontology). Adding new demo data for the remaining sections of the ontology is on our roadmap.

This release, v1.5.2, is our first public release. It has been vetted by seven of our sites - the mapping has been completed and it is successfully being used for inter- and intra- network querying. This release corresponds to the PCORnet CDM v1.0. A significant expansion will become available by the end of 2015, which will correspond to PCORnet CDM v2.1.

  • Jeffrey Klann, PhD

About the ontology

The core ontology was originally generated from Dan Connolly's code to generate ontologies from the PCORnet CDM v1 spec, and edited significantly based on site feedback. Note that some clarifications on the meaning of various elements appear in the tooltips. Many terminology trees were added, as described below. The ontology has been modified to do as much as possible out-of-the-box. By default, the ontology is mapped to relevant sections in the i2b2 metadata, and some elements are computed automatically (such as number of patients enrolled). Sites can adapt the ontology to their local data through a mapping process described in the documentation, and no changes to actual data are necessary.

What does the ontology cover?

  • Demographics: Similar to the i2b2demodata, with a more granular age tree to support pediatric age queries.
  • Diagnoses: Includes ICD-10 2014AA and ICD-9 2014AA, generated by Lori Phillips from ontologies on BioPortal.
  • Procedures: Includes ICD-9 2014AA, Partners' RPDR CMS-DRG tree, Beth Israel's MS-DRG tree (version unknown), and Nathan Wilson's HCPCS tree
  • Encounters: Includes PCORnet-specific additions, and a tree of 3-digit zip codes derived from the i2b2 demo data
  • Enrollment: To allow sites to specify which patients have "complete longitudinal data" for selected date ranges.
  • Vitals: A simple vitals implementation including height, weight, and blood pressure.

What's not in the ontology?

  • Medications, labs, and patient-reported outcomes will be included in CDM v2.1
  • LOINC v236 and SNOMED Clinical Findings v1.2 are available on BioPortal, but they are not included in this version of the ontology. We suspect SNOMED is not presently used, and the language from the coordinating center indicates CPT and HCPCS are preferred over LOINC for procedures at present: "Only billed procedures should be included in the PROCEDURE table. The ORDER concept may be incorporated into future phases of the CDM."
  • CPT is not included because it is not freely available. We have an integrated CPT tree that has retired codes reconciled across sites. If you have a CPT license, you can contact us to get it.

Download the ontology

Take a look at the documentation first if you like, or download the whole package:

  • Zipfile download: The full release, including pipe-delimited exports of the six ontology tables, documentation, and spreadsheets to assist in mapping.
  • Ontology Mapping Tool for PCORnet: The i2b2 ontology mapping tool, modified for PCORnet (not necessary, but useful if mapping a large number of terms from local codes).
  • Wiki pages:Additional documentation on the ontology implementation process
  • Enabling Patient-Centric Comparative Effectiveness Research in i2b2: A conference poster illustrating the mapping process at a high level.
  • SHRINE support files: Coming soon...
  • PopMedNet transform: Available at this link. Transforms an i2b2 instance mapped to this ontology into the physical CDM schema.
  • Procedures with CPT codes: Available if you have a CPT license, in this repository. Contact us for access if you are eligible.

scilhs-ontology's People

Contributors

jklann avatar

Stargazers

George Kowalski avatar

Watchers

Nathan Graham avatar George Kowalski avatar James Cloos avatar Tony French avatar  avatar

Forkers

kumc-bmi jgaupp

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.