Giter Club home page Giter Club logo

hcris-class's Introduction

Healthcare Cost Report Information System (HCRIS)

This repository provides the necessary code and links to download and organize key hospital information contained in the Healthcare Cost Report Information System.

Raw Data

All of the raw data are publicly available from the Centers for Medicare and Medicaid Services (CMS) website: HCRIS DATA. These data are also available through the NBER: NBER HCRIS Data.

The flat files in the NBER source should match to the raw files downloadable from CMS, but the NBER page includes additional details and links to more documentation. You can also access a subset of variables directly as SAS, Stata, or .csv files, in which case you don't need any of the raw data or the code below.

Raw Code Files

After downloading the flat files, the following code files will extract selected variables and form a final analytic dataset. There are two R code files, one for each of the different versions of HCRIS worksheets (v1996 at H1_HCRISv1996.R and v2010 at H2_HCRISv2010.R, respectively).

The top sections of each of the code files assign the location of the selected variables. These code files are almost identical; however, not all variables are in the same location of the HCRIS worksheets under v1996 and v2010. New variables can easily be added if you know the worksheet, line number, and column number of the relevant variable. If the variable is numeric, it should be in the numeric tables, and if its a character, it should be in the alphanumeric tables. This information should be indicated in the "source" column when assigning each variable's location, with possible values of 'numeric' or 'alpha'.

Master Code File

There is also a master code file that calls each of the individual code files and organizes the resulting data into unique hospital/year combinations: _HCRIS_Data.R

There are two common issues with the HCRIS data that are resolved (at least, attempted to be resolved) in the master code file. There are of course other issues that you will encounter when trying to work with these data, including missing variables in some years and clear evidence of misreporting in some cases, not to mention just very noisy data in general.

  1. Negative values: Variables that reflect a "loss" or a discount are often intended to take positive values in the worksheets but may sometimes take negative values for certain hospitals in certain years. For the subset of variables in these code files, I take the absolute value of all such variables. This may or may not apply to any additional variables that you want to add. To be sure if this is a mistake in the data, check the worksheet instruments in the documentation.

  2. Duplicate reports: Many hospitals change fiscal years at some point over the panel. Since HCRIS reports reflect data for each hospital's fiscal year, such a change will tend to result in more than one report for a given hospital in a given year. There also does not appear to be any systematic way in which hospitals submit data during these transitions. Some hospitals, for example, will transition with two reports that each cover less than a 12 month period; others will transition with one report that covers a longer period; and still others will transition with two reports that each cover a different 12 month period (one starting at the beginning of the original FY and going through the end of the original FY, and another starting at the new FY and ending at the new FY). The master code file collapses these instances with a series of different rules as commented in the code. The end result is a set of unique hospital/year combinations, where hospitals are defined by Medicare provider numbers and years are defined as hospital fiscal years.

hcris-class's People

Contributors

imccart avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.