Giter Club home page Giter Club logo

gregbeaumont / powerpophealth Goto Github PK

View Code? Open in Web Editor NEW
18.0 2.0 2.0 20.88 MB

Power Pop Health is a collection of content intended to simplify the process of ingesting and prepping Healthcare Open Data using Azure data tools and Power BI. Moving forward the overarching theme will be data related to Population Health, but other sources pertinent to Healthcare will also be included.

License: MIT License

healthcare healthcare-datasets healthcare-analysis healthcare-data cms population-health cdc microsoft azure powerbi power-bi

powerpophealth's Introduction

Intro to Power Pop Health

Power Pop Health is a collection of content intended to simplify the process of ingesting and prepping Healthcare Open Data for Analytics, Business Intelligence, and more. Power Pop Health has a simple mission: Make it easy for you to ingest, transform and format Healthcare Open Data and common reference tables so that you can achieve more. Three basic factors contributed to the initial release:

  1. Quite often I encounter teams eager to do Analytics and BI, but first they need to get source data into an easy-to-use and scalable location.
  2. Very few examples exist in the public domain for shaping and modeling Healthcare data for Business Intelligence and Analytics. Concepts such as efficient ETL/ELT, Data Modeling, and scalable enterprise data marts/warehouses often require extrapolating architectures from examples unrelated to Healthcare.
  3. Over the last few years I have accumulated examples and tutorials for items 1-2. This first release is a repository to share those examples in a unified format, and in one place. Future additions to this repository will be based on feedback from the community, with an initial plan to focus primarily on Population Health data such as Social Determinants of Health.

What User Personas are the Target Audience?

  1. Business Intelligence Professionals
  2. Data Scientists
  3. Business Analysts

What Data is Currently Accessible using Power Pop Health?

Available Data

Roadmap for New Content in Power Pop Health

Roadmap

What Types of Content are to be Expected?

All attempts will be made to make deployment of the content no code or low code. Documentation and content organization will focus on three type of content for specific audiences:

  1. Raw Data - Primarily Azure ARM Templates designed to get Open Data into a Data Lake without making any transformations. The ARM Templates will be low code or no code whenever possible.
  2. Curated Data - When source data requires significant transformations or cleanup to be usable, it will be labeled Curated Data. This data will still be in raw form without transformartions to granularity, but easier to use.
  3. Relational Data - Relational Data will allow for queries amongst different tables, but it will not necessarily be optimized for Business Intelligence.
  4. Optimized Data - Optimized Data will be cleaned, curated, transformed, and modeled to be ideal for efficient and performant queries. Most of this content will be optimized for use with Power BI Power Query, DataFlows, and DataSets (Tabular Models).

What do You Need to Use Power Pop Health Content?

Different technologies below may be needed for different components, depending on what you decide to deploy:

  1. Azure subscription - Note that some of the Power BI DataFlows / Power Query scripts can be redirected to other sources if you don't have an Azure subscription.
  2. Azure Data Lake Gen 2 - see below for depoloyment instructions
  3. Azure Data Factory - see below for depoloyment instructions
  4. Azure SQL Database - for specific data sets, I'll also be adding an Azure SQL DB layer in the future
  5. Azure Synapse - for specific solutions with large data volumes
  6. Power BI - DataFlows and/or Power Query, Power BI Desktop
  7. Power BI Premium - only required for some of the DataFlows

Deployment Instructions

  1. Deploy an Azure Data Lake - https://youtu.be/AS3OhGCQ8EQ
  2. Deploy an Azure Data Factory - https://youtu.be/OIbfhVov3YY
  3. Each set of Data will have a separate Azure ARM template you can deploy using the method in this video - https://youtu.be/Bg3zV_GIOJY . Here's the link in Azure where the cut-and-paste code can be deployed: https://ms.portal.azure.com/#create/Microsoft.Template
  4. Each set of Data will usually have an M script for Power Query - https://youtu.be/EZXOQylIjVo
  5. Each set of Data will usually have an M script that also works for DataFlows - https://youtu.be/_G2WniVVkMw

powerpophealth's People

Contributors

gregbeaumont avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

cajetzer pauluk

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.