Giter Club home page Giter Club logo

sdi-health's Introduction

SDI-Health

A short description of the data and materials contained in this repository follows:

Deidentified Raw Data

The data provided in the "rawData" folder has been deidentified. The administrative regions in which the health facilities are located have been encoded. GPS coordinates are not available in this dataset. The following set of variables was recoded to prevent identification:

  • publicprivate: for-profit and non-profit private facilities were grouped under the heading "private"
  • ruralurban: urban and semi-urban were grouped under the heading "urban"
  • facility_level: facilities were reclassified as "hospitals", "health centers", and "health posts"
  • provider_cadre: providers were reclassified as "doctors", "clinical officers", "nurses", and "other"
  • provider_mededuc: medical education attainment was reclassified as "advanced", "diploma", "certificate", and "none"
  • provider_educ: general education attainment was reclassified as "primary", "secondary", and "post-secondary"

In addition, the following set of variables were removed from the data to prevent possible identification of the facility:

  • Date of visit
  • Questions related to finances and staff salaries
  • Questions related to access or distance to district headquarters or other hospital
  • Questions related to days and hours of operation and catchment area
  • Whether maternity waiting center or delivery room are present
  • Availability of emergency obstetric care, c-section services, blood transfusion, and surgery
  • Questions related to number of patient beds, hospitalizations, maternal and neonatal deaths or complications, and transfers for maternal cases
  • Question related to power sources, water sources, toilet infrastructure, and presence of ambulances

Cleaning code

In the "scripts/cleaning" folders, the do-files for cleaning each of the modules are saved under separate folders: "Infrastructure-Module1", "Roster-Module2", and "Vignettes-Module3". In each do-file, the following set of actions are executed:

  • Variables are recoded for consistency with the value label that will be applied during the harmonization step (see metadata files for lists of these value labels).
  • Variables are created, combined, or modified to align with the Tanzania-2014 benchmark survey.
  • Variables are checked to confirm that all missing values are coded as missing.
  • Survey weights are added to the module 1 datasets.
  • Indicator variables related to the module are created.

Metadata Files

The metaData folder contains excel files for each of the three modules (vignettes, absenteeism, infrastructure) for each of the 10 surveys that have occured to date. There is also a template for each module that can be used to build a metadata file for future surveys. Each excel file contains all of the variables that have ever been included in the survey instrument to-date and specifies whether those variables were included in the given country survey.

On each sheet, the first column, "rename" specifies the name the variable should have in the final harmonized dataset. The second column, "varlab", specifies the variable's label. The third column, "varname", specifies the name the variable had in the raw data. The fourth column, "vallab", specifies whether a particular value label should be attached to the variable. In the vignettes metadata files, there is an additional "notes" column where differences across instruments as compared to the Tanzania-2014 benchmark are described.

This information is used to harmonize the data in the final step of the cleaning process.

Harmonized data

In the "scripts/harmonization" folder, there are three do-files, each responsible for harmonizing one module of the SDI survey. These do-files produce country specific data files that contain renamed, harmonized variables. The do-files also produce a single data file where all country datasets have been appended.

Using the data

Should you use this data for analysis, we recommend including this repository in your analysis repository as a submodule using the Git command: git submodule add https://github.com/worldbank/SDI-Health.git SDI-Health

In this way, Git will associate your analysis with a specific commit of the data respository. The commit used in your analysis respository can be updated using the Git command: git submodule update

Acknowledgements

Data prepared and do-files written by Anna Konstantinova and Benjamin Daniels, with support from Jishnu Das, Waly Wane, Christophe Rockmore, and Matthew Collins.

sdi-health's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Forkers

jsvine ebergam cc631

sdi-health's Issues

Availability of data in CSV format?

Thanks for publishing this data! It looks very interesting, but appears only to be available in Stata's .dta format. Are there any plans to publish the data as CSV, or in any other plain-text format?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.