Giter Club home page Giter Club logo

heart_disease_mortality_by_state's Introduction

Heart Disease Mortality by State

What factors – if any – contribute to disparities in rates of heart disease mortality between U.S. states?

Contributors

  • Hany Dief -- (@hanydief)
  • Katie Djahan -- (@katiedjahan)
  • Alejandro Gutierrez -- (@alejfxguti)
  • Bijoyeta Kayal -- (@BijoyetaK)
  • Tristan Marcum -- (@TinTesla)
  • Niti Patel -- (@niti2442)

Google Slides Presentation Link

Problem Overview

Heart Disease Mortality is a serious public health issue and one of the leading causes of death in the US. Understanding the factors that are associated with heart disease mortality is crucial in identifying potential solutions to this problem.

We analyzed various data sources to determine whether insurance coverage, healthcare spending, or geographic location have an impact on the rate of heart disease mortality at the state level.

Data Sources

Installations/Prerequisites for choropleth maps:

- plotly
- plotly_express
- plotly chart_studio
- geopandas
- fiona
- folium

Reference guides used:

- plotly
   - https://plotly.com/python/getting-started/
   - https://towardsdatascience.com/geographical-plotting-of-maps-with-plotly-4b5a5c95f02a
   - https://plotly.com/python/choropleth-maps/
   
- folium 
   - https://towardsdatascience.com/folium-and-choropleth-map-from-zero-to-pro-6127f9e68564
   - https://towardsdatascience.com/folium-mapping-displaying-markers-on-a-map-6bd56f3e3420
   - https://www.analyticsvidhya.com/blog/2020/06/guide-geospatial-analysis-folium-python/
   - https://github.com/python-visualization/folium/issues/403 --> choosing color schemes
   - https://github.com/python-visualization/folium/issues/1202 --> adding titles to maps
   - https://getbootstrap.com/docs/3.3/components/#glyphicons-glyphs --> icon set options

Instructions to open the html image files -

   - 1) Click on the html file 
   - 2) View Raw -> opens HTML code 
   - 3) Right click on the code 
   - 4) Save as HTML file in your local
   - 5) Open to view as HTML

Research Questions

  • Is there a relationship between health spending per capita and heart disease mortality rates?
  • Is there a relationship between health Insurance coverage and heart disease mortality rates?
  • Do states with higher healthcare spending per capita have higher rates of health insurance coverage?
  • Is there a relationship between a state’s location and its heart disease mortality rate?

Analysis

Is there a relationship between health spending per capita and heart disease mortality rates?

The bar chart shows the deaths rates per 100k by state from Highest to Lowest:

image

Map shows the distribution of Average Healthcare spending by State from Highest to Lowest spending states by color:

image

We can see that - Utah, Texas, Idaho, Arizona, Nevada being the states that tends to spend less towards healthcare services, this acts like a visual basis to do our analysis of the first question.

We wanted to investigate whether the amount a state pays per person on health care services impacts heart disease mortality rates. We hypothesized that states who spend more per person would likely have lower heart disease mortality rates.

Scatter plot was used to determine the correlation and its linearity, followed by the geographical visualization.

Scatter plot:

  • Spending vs Mortality SpendingVsMortality

  • The rvalue is -0.2378, indicating a weak negative correlation between health spending per capita and heart disease mortality rate.

  • The r-squared value is 0.0566, indicating that only 5.7% of the variation in heart disease mortality rate can be explained by health spending per capita.

  • The p-value is 0.0963, which is not statistically significant at the standard 0.05 level, suggesting that there is not strong enough evidence to say that there is a significant relationship between health spending per capita and heart disease mortality rate.

  • The standard deviation of 0.0028 suggests that the regression line is a relatively decent fit for the data, as the distance between the actual data points and the predicted values is relatively small.

Map visualization:

The map acts like a visual journey to establish the correlation of Heart disease Mortality rate with Health Spending per capita. Orange and Red indicating the Lowest and Highest Deaths per 100k respectively.

It somewhat appears that states spending the most in Healthcare have lower Mortality rates than states spending the least on Healthcare. However, if there was a significantly strong correlation between heart mortality rate and health spending by state, then we would have expected to see Texas, Nevada, Idaho, Utah, Arizona,indicating highest deaths per 100k, but it is not. This tells us there could be other factors that could account for this outcome which were not considered in this analysis.

image image

Is there a relationship between health Insurance coverage and heart disease mortality rates?

By replying to this question we are trying to identify if heart disease mortality has any correlation with insurance coverage or if insurance coverage has any impact on reducing the heart disease mortality within the United States.

Map shows the distribution of % Uninsured by State from Highest to Lowest by color:

image

We can clearly see that - Texas, Oklahama,Florida,Georgia,Mississippi being the top 5 states that has the highest % uninsured which helps as a reference to correlation calculations in this question.

Scatter plots were used to determine the correlation and its linearity, followed by the geographical visualization.

Scatter plots:

  • Insured vs Mortality LRInsuredVsMortality

    • The Linear Regression Statistics between Mortality & Insured per state has weak NEGATIVE correlation coefficient "r-value" -0.31
    • The 0.094 is a small r-squared value means 9.4% of Mortality (Death) has dependency on Insurance coverage & that’s a WEAK direct impact relationship.
    • The "p-value" 0.031 is lower than the significance level (P ≤ 0.05) means that the test hypothesis is false or should be rejected.
    • 1.29 Std. Dev shows that the data points spread around the regression line values are generally far & scattered from the mean
    • Graph is a good proof that the more insured citizens the less heart disease mortality but with a very weak impact.
  • Uninsured vs Mortality LRUninsuredVsMortality

    • The Linear Regression Statistics between Mortality & Uninsured per state has weak POSITIVE correlation coefficient "r-value" +0.31
    • The 0.094 is a small r-squared value means 9.4% of Mortality (Death) has dependency on Insurance coverage & that’s a WEAK direct impact relationship.
    • The "p-value" 0.031 is lower than the significance level (P ≤ 0.05) means that the test hypothesis is false or should be rejected.
    • 1.29 Std. Dev shows that the data points spread around the regression line values are generally far & scattered from the mean.
    • Graph is a good proof that the more uninsured citizens the less heart disease mortality but with a very weak impact.

Map visualization:

This map acts like a visual journey to establish the correlation of Mortality rate with Uninsured percentage. Blue and Red indicating the Lowest and Highest Deaths per 100k respectively. It somewhat appears that states having the most uninsured population have higher mortality rates than states having the most insured population. However, if there was a strong correlation between heart mortality rate and % uninsured, then we would have seen Texas to be having the Highest deaths per 100k, Texas displaying the highest % uninsured.This tells us there could be other factors that could account for this outcome which were not considered in this analysis.

image image

Do states with higher healthcare spending per capita have higher rates of health insurance coverage?

Screenshot 2023-04-18 194921

This scatter plot shows the correlation between healthcare spending vs health insurance coverage. Different types of health coverage is identified across all the states in the US during 2019. The types of health coverage includes employer, non-group, Medicaid, Medicare, and Military. States with higher healthcare spending tend to have a greater percentage of insured coverage.

The r-value 0.52 identifies moderate correlation. The r-squared value of 0.27 is fairly low and does no explain the variation in the data much. The low p-value is statistically significant and most likely the null hypothesis is rejected. The low standard deviation indicates that the data is close to the mean.

Unknown

Is there a relationship between a state’s location and its heart disease mortality rate?

This scatter plot visualizes a state's latitude vs. its death rate per 100,000 people:

lat_vs_rate

The r-value is: -0.337
The r-squared is: 0.114
The p-value is: 0.016
The Std. Dev. is: 0.680

The linear regression for Latitude vs Death Rate per State shows a negative correlation, meaning that as a state's latitude increases, their death rate decreases. The r-value of -0.337 indicates a fairly weak negative correlation between latitude and death rate per state. The r-squared value of 0.114 indicates that 91% of the variability in the outcome is due to factors not accounted for in the model. Additionally, the standard deviation of 0.680 is quite low, indicating that there is low variance in the data and the values are all very close to the mean. The p-value is quite strong at roughly 0.02, indicating very high accuracy of these findings.

This scatter plot visualizes a state's latitude vs. its death rate per 100,000 people:

lon_vs_rate

The r-value is: 0.263
The r-squared is: 0.069
The p-value is: 0.065
The Std. Dev. is: 0.214

The linear regression for Longitude vs Death Rate per State shows a positive correlation, meaning that as a state's longitude increases, so does its death rate. The r-value shows a weak correlation between state longitude and death rate at 0.263. The r-sqared value is roughly 0.07, indicating that 93% of the variability in the outcome is due to factors not accounted for in the model. The standard deviation of 0.214 is quite low, indicating that there is low variance in the data and the values are all very close to the mean. The p-value is fairly average at roughly 0.06, meaning that these findings are likely accurate.

Screenshot 2023-04-19 at 2 56 01 PM

This map shows the death rate per state. The size of the points is the death rate per 100,000 people in that state. This visual shows that, while there is not much variation in the death rates per state, the points do become larger as the latitude increases and the longitude increases. This provides a more visual representation of the results shown by the scatter plots above.

It is unlikely that the latitude or longitude of a state has much of an impact on the state's heart disease mortality rate, but there is some correlation. Based on these findings, it is very likely that the outcome is the result of other factors not accounted for in our analysis.

heart_disease_mortality_by_state's People

Contributors

bijoyetak avatar hanydief avatar tintesla avatar alejfxguti avatar niti2442 avatar katiedjahan avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.