Source: https://github.com/CSSEGISandData/COVID-19
Credit to Johns Hopkins CSSE for compiling and reporting COVID-19 data on a daily basis.
Each data file in here reports the figures as a cumulative amount. Hence, to obtain incremental figures, all the historical files must be compiled.
However, data files have inconsistent format from day to day. Codes in here aim to clean and compile the daily data into a tidy data. Please find the tidy data here.
In R, use the below code to read the clean tidy data:
covid19_data <-
read.csv(text = RCurl::getURL("https://raw.githubusercontent.com/ashten28/covid19_analysis/master/01_data/covid19_daily_cases_cleaned.csv"))
The data consist of these main columns:
- report_date
- country_region - Country
- province_state - Province (earlier dates may not have breakdown)
- cum_confirmed - Cumulative confirmed cases
- cum_deaths - Cumulative deaths cases
- cum_recovered - Cumulative recovered cases
- inc_confirmed - Incremental confirmed cases (calculated field)
- inc_deaths - Incremental death cases (calculated field)
- inc_recovered - Incremental recovered cases (calculated field)
- lat - Lattitude
- long - Longitude