Giter Club home page Giter Club logo

swiss-hospital-data's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

swiss-hospital-data's Issues

Track data sources

In our project we have started from a single source, but are expanding to additional ones. It's becoming clear that there will be likely one or more sources per region, in some cases even per medical facility. Therefore the list that we are currently tracking in our ideas notebook should be formalized. The Data Package spec provides a good way to do this:

"The raw sources for this data package. It MUST be an array of Source objects. Each Source object MUST have a title and MAY have path and/or email properties. Example:"

"sources": [{
  "title": "World Bank and OECD",
  "path": "http://data.worldbank.org/indicator/NY.GDP.MKTP.CD"
}]
  • title: title of the source (e.g. document or organization name)
  • path: A url-or-path string, that is a fully qualified HTTP address, or a relative POSIX path (see the url-or-path definition in Data Resource for details).
  • email: An email address

See also: neherlab/covid19_scenarios_data#3

Set up data flows

We need to set up a simple process to ensure reproducible results. I would use DataFlows, but Snakemake, Renku, Nextflow and Workbench have been suggested by our team as well. I think we should at least stay true to DataFlows philosophy:

  • use it for data that easily fits on your hard drive
  • all data produced by these flows is easily reusable by others
  • not a heavy-weight framework
  • machine and human readable interface

Define the data schema

Since starting this project we have debated a lot about what kind of information we need and don't need. It is clear that our starting point, the available Open Government Data, has both too much and too little information. We want to expand and improve this schema, and write a converter to stay compatible in case there are future updates.

We are working on a data dictionary that is complementary to the schema, and explains the fields in clearly accessible terms. Currently we are especially interested in these indicators, which need to be clearly explained and defined:

    N of hospitals in CH (with name and geolocation)
    N of beds by hospital
    N of ICU beds by hospital
    N of respirators by hospital
    average % bed availability pre-COVID-19 (“Bettenauslastung”) by hospital
    N ICU nurses by hospital (“IPS Pflege”)

A repository was also started yesterday with an example dataset which aims to simplify and standardize the requests we are making to regional health authorities: https://github.com/openZH/covid19_hospitalinfra

Our current data package has an automatically generated schema, which can be edited in the datapackage.json file, or loaded in the Frictionless Data Creator. The results of the data dictionary and schema proposal work can be merged in this way.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.