govex / covid-19
Data analysis and visualizations of daily COVID cases report
License: MIT License
The hourly data seems to be reporting 2,386,987 having received first dose, but the state dashboard only says 2,173,322.
https://covid19vaccine.health.ny.gov/covid-19-vaccine-tracker
Hello,
First, thanks a lot for compiling this amazing data!
A new vaccine is starting to get approval in various countries, the J&J one.
It requires only 1 dose to be administered.
I just wanted to know how it will be counted in the Global vaccine data here:
Regarding the US state timeline, here, will there be a new column for the J&J vaccine?
Thanks a lot for helping me understand that!
Best,
Louis
July 28, 2021:
The Mills Administration announced that the State of Maine will follow the U.S. Centers for Disease Control and Prevention’s (U.S. CDC) updated face covering guidance, which recommends that:
If I read and understood correctly, the licence does not permit building applications using the data. Is that intentional? Could it be altered to something like MIT (same licence as JHU COVID data) so that we can use it to build stuff?
Or have I misunderstood and the "Website Use" covers the dashboard linked in the readme only?
First, I just would like to thank and congratulate everyone in this repository for the hard work in pulling and consolidating all this different data sources regarding the pandemic.
That being said, like many users right now, most of my interest in this data is in consuming the vaccination information (https://github.com/govex/COVID-19/blob/master/data_tables/vaccine_data/raw_data/vaccine_data_us_state_timeline.csv). With that in mind, I've found a number of data issues, verified the data against the provided sources, and compiled a list of problematic regions (I did not go through all states):
Alaska: Doses administered do not match the dashboard information (https://www.arcgis.com/apps/opsdashboard/index.html#/84691dc5b0184827af0fd8e4c20034d9)
Alabama: Doses administered over time do not match any of the historical data points in the dashboard report.
Arizona: The 2021-01-20 administered numbers are far from the reported number https://www.azdhs.gov/documents/preparedness/epidemiology-disease-control/infectious-disease-epidemiology/novel-coronavirus/vaccine-phases.pdf. Also, I couldn't find in the provided sources how the second-dose split was calculated.
Colorado: Assuming you are using the CDC data, it doesn't match. Also, for some dates doses_admin_total = people_total + people_total_2nd_dose, and for others doses_admin_total = people_total + 2 * people_total_2nd_dose.
Delaware: Doses administered do not follow the values reported on https://myhealthycommunity.dhss.delaware.gov/locations/state/vaccine-tracker#vaccine_tracker
DC: I think the CSV is updating only the cumulative numbers, but not the daily numbers that are displayed (https://coronavirus.dc.gov/data/vaccination)
Georgia: Numbers do not match dashboard https://dph.georgia.gov/covid-vaccine nor CDC
Hawaii: Historical numbers do not match dashboard report (https://health.hawaii.gov/coronavirusdisease2019/what-you-should-know/current-situation-in-hawaii/#vaccine)
Idaho: Dates of data points have a 2 day lag to the reported data https://public.tableau.com/profile/idaho.division.of.public.health#!/vizhome/COVID-19VaccineDataDashboard/Residence
Iowa: Numbers do not match report https://idph.iowa.gov/Portals/1/userfiles/61/COVID19%20Vaccine%20Administration.pdf
Illinois: Historicals are lagged two days to the report in the dashboard http://www.dph.illinois.gov/covid19/vaccinedata?county=Illinois
Indiana: Most recent number does not match the displayed value - could be a date issue https://www.coronavirus.in.gov/2680.htm
Kansas: Data does not match the dashboard historicals https://www.kansasvaccine.gov/158/Data
Kentucky: Numbers do not match the dashboard https://govstatus.egov.com/ky-covid-vaccine
Massachusetts: Numbers do not match the weekly report values https://www.mass.gov/doc/weekly-covid-19-vaccination-report-january-14-2021/download
Michigan: Data does not match historicals presented on dashboard https://www.michigan.gov/coronavirus/0,9753,7-406-98178_103214_103272-547150--,00.html
Maryland: Historicals do not match numbers on https://coronavirus.maryland.gov/#Vaccine
Maine: Most recent data point does not match https://www.maine.gov/covid19/vaccines
Minnesota: Data does not match dashboard historicals https://mn.gov/covid19/vaccine/data/index.jsp
Missouri: Numbers do not match CDC report
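The two relationships mentioned in the Colorado item can be told apart row by row. The following is a minimal sketch, not the maintainers' method: the function name `dose_identity` and its labels are hypothetical, the column names come from the issue text, and it assumes a two-dose regimen. Under that assumption the first identity holds when people_total counts everyone with at least one dose, and the second when it counts only people with exactly one dose.

```python
def dose_identity(doses_admin_total, people_total, people_total_2nd_dose):
    """Label which of the two relationships from the issue a row satisfies.

    Hypothetical helper: the labels are illustrative, not repository terms.
    """
    first = doses_admin_total == people_total + people_total_2nd_dose
    second = doses_admin_total == people_total + 2 * people_total_2nd_dose
    if first and second:
        # Only possible when people_total_2nd_dose is 0.
        return "ambiguous"
    if first:
        return "at_least_one_dose"
    if second:
        return "exactly_one_dose"
    return "neither"


# Illustrative numbers only, not real Colorado data.
print(dose_identity(130, 100, 30))
print(dose_identity(160, 100, 30))
```

Running this over every date for a state would show where the definition of people_total appears to switch.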
The Johns Hopkins Coronavirus Resource Center will begin providing U.S. state testing data from publicly reported sources, a service that replaces the COVID Tracking Project’s year-long collection effort ending March 7.
The new dataset can be found here. It includes the entire time series provided by the Covid Tracking Project. The data dictionary and list of sources can also be found in that folder, together with details about how the CRC’s testing variables map to those used by the Covid Tracking Project.
The testing data will be published in a CSV on this GitHub folder. We will also provide a public API.
Regarding the vaccination data:
Is there any reason why some dates are skipped and why data is sometimes left blank to (I assume) indicate the previous value should be used instead of empty?
Would it be possible to fill out the data more explicitly?
Thank you
The first dose vaccination data for New Jersey doubles on May 2nd 2021 and then goes back to normal after that. Can this be corrected in the source data?
combined_key date stage_one_doses
<chr> <date> <dbl>
1 New Jersey, US 2021-05-01 4615887
2 New Jersey, US 2021-05-02 8537206
3 New Jersey, US 2021-05-03 4646283
4 New Jersey, US 2021-05-04 4660766
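A one-day jump like this can be flagged automatically, because a cumulative count should not fall back afterwards. The sketch below is one possible check, not something the repository provides: the function name `isolated_outliers` is hypothetical, and it only catches single-day anomalies whose two neighbours agree with each other.

```python
def isolated_outliers(values):
    """Return indices i where values[i] lies outside [values[i-1], values[i+1]]
    although the two neighbours are themselves in non-decreasing order."""
    flagged = []
    for i in range(1, len(values) - 1):
        prev, cur, nxt = values[i - 1], values[i], values[i + 1]
        if prev <= nxt and not (prev <= cur <= nxt):
            flagged.append(i)
    return flagged


# stage_one_doses for New Jersey, 2021-05-01 through 2021-05-04 (from the table above).
nj = [4615887, 8537206, 4646283, 4660766]
print(isolated_outliers(nj))  # [1], i.e. 2021-05-02
```

The same check also flags a single-day dip, such as a cumulative value that is reset to zero for one day. It will not flag an anomaly that lasts two or more consecutive days.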
Hi folks, just wanted to drop you a thank-you!
I'm using your data here: https://chart-studio.plotly.com/~yfarjoun/3/#/plot
Feel free to close this issue, I just didn't know how else to reach out.
The Johns Hopkins Coronavirus Resource Center will begin incorporating global vaccine data from publicly reported national sources on May 10, 2021 for the following countries: Austria; Belgium; Bulgaria; Canada; Chile; Denmark; France; Germany; India; Ireland; Italy; Latvia; Luxembourg; Poland; Spain; UAE; Ukraine; United Kingdom. We already sourced the United States total independently.
This will replace data previously sourced from Our World in Data. We will continue to provide a composite of the country and aggregated sources. As we stand up additional global sources, we will continue to incorporate them. We will continue to supply this data as a daily timeseries and hourly snapshot in a long format, but will also supply a wide format.
The new dataset can be found here. The data dictionary and list of sources can also be found in that repository.
The JHU US vaccine data map ( https://coronavirus.jhu.edu/vaccines/us-states ) seems to be using the Stage_Two_Doses value in vaccine_data_us.csv for its "People Fully Vaccinated" metric, instead of the People_Fully_Vaccinated value in vaccine_people_vaccinated_US.csv. This means that people who received a one-dose vaccine (around 10 million people in total, I think) are not counted as fully vaccinated on the JHU site.
For example: California currently is listed in the data files as 14,845,598 people fully vaccinated, 13,271,020 stage 2 doses (all vaccines). The JHU map shows 13,271,020 people fully vaccinated in California.
As of a few days ago, the number of people fully vaccinated in AZ is wrong: e.g., as of 4/11, the state site says 1,686,236. The number currently showing on the Hopkins site is 1,445,774. The rest of the numbers appear to be correct.
Vaccine data csv contains an error in formatting:
GitHub message: We can make this file beautiful and searchable if this error is corrected: It looks like row 87 should actually have 18 columns, instead of 17. in line 86.
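A check along these lines can find such rows before a file is published. This is a minimal sketch using Python's standard `csv` module; the function name `ragged_rows` and the sample text are illustrative assumptions, not part of the repository.

```python
import csv
import io


def ragged_rows(csv_text):
    """Return (line_number, column_count) for every row whose column count
    differs from the header's."""
    reader = csv.reader(io.StringIO(csv_text, newline=""))
    header = next(reader)
    bad = []
    for row in reader:
        if len(row) != len(header):
            # reader.line_num is the number of source lines read so far.
            bad.append((reader.line_num, len(row)))
    return bad


# Illustrative data: the third line is one column short.
sample = "a,b,c\n1,2,3\n4,5\n6,7,8\n"
print(ragged_rows(sample))  # [(3, 2)]
```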
http://dph.illinois.gov/covid19/vaccinedata?county=Illinois
The data here does not match the state data
Thanks for carrying the torch from the Covid Tracking Project! It's so very important to have current testing data.
In today's time_series_covid19_US.csv, there are two records that look a little strange to me:
Finally, Missouri on 3/11/21 (record #20976) appeared to have had about 7.5 weeks of positive results in a single day. This appears to be correct, but I don't know how that happened. Perhaps they finally decided to include probable cases as per federal guidance?
One last question: Is this the best way to bring issues like this to your attention?
Thanks,
Robert
Could someone take a look at the testing time series data for 3/10? It looks like 2.5 million tests were added in the tests_combined_total category, a huge increase from the 1.4 million seven-day average the day prior. There might be an issue with Kansas which has an increase of 1.3 million from the previous day. Thank you!
it's smaller by a factor of 10 from both the date before and after it. I suspect a typo happened there.
The US vaccine numbers for Pennsylvania on April 17th and 18th 2021 look mistaken.
They jump from a total of 8.08mn administered on April 16 to 13.770mn on the 17th, 13.999mn on the 18th, then back down to 8.52mn on the 19th and 8.59mn on the 20th. This also impacts the stage one and stage two dose numbers.
Edit: Fix decimal points.
The Johns Hopkins Coronavirus Resource Center will begin providing U.S. state-level antigen testing data from publicly reported sources on Friday, April 12, 2021.
The new data will be incorporated into the existing U.S. testing data file “time_series_covid19_US.csv”, located here. The new columns will be named: tests_antigen_positive, tests_antigen_total, people_antigen_positive, people_antigen_total. Data will include the entire time series provided by the Covid Tracking Project up to March 7, 2021, and new data collected from April 12, 2021 moving forward. Note that data shows cumulative counts, and therefore the number of counts for antigen fields will remain stale between March 7 and April 12, 2021.
The data dictionary and list of sources can be found in the same folder, along with details about how the CRC’s testing variables map to those used by the Covid Tracking Project.
I have a question about the vaccine time series. For states with data from multiple days, some metrics are not repeated for all days. For example, Texas had doses_alloc_total listed for 12/14 but not for 12/17 or 12/18. Going forward, I'm curious if each cell will be included only if it's updated, or if you'll start filling in the dates with the most recent figure.
date | Province_State | doses_alloc_total | doses_shipped_total | people_total |
---|---|---|---|---|
12/17/2020 | Texas | 91650 | 4187 | |
12/18/2020 | Texas | 91650 | 4187 | |
12/14/2020 | Texas | 1400000 | 19500 |
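Until the file is filled in explicitly, a consumer can carry the last reported value forward per state. The sketch below assumes that a blank cell means "not updated" rather than "zero", which the maintainers have not confirmed. The function name `forward_fill` is hypothetical, the column names follow the table above, and the sample values are illustrative rather than real Texas figures.

```python
from datetime import datetime


def forward_fill(rows, key="Province_State", date_col="date", fmt="%m/%d/%Y"):
    """Return copies of rows, sorted by (state, date), with blank cells
    replaced by the most recent non-blank value for the same state."""
    ordered = sorted(
        rows, key=lambda r: (r[key], datetime.strptime(r[date_col], fmt))
    )
    last_seen = {}  # (state, column) -> last non-blank value
    filled = []
    for row in ordered:
        new_row = dict(row)
        for col, value in row.items():
            if col in (key, date_col):
                continue
            if value is None or str(value).strip() == "":
                # Blank cell: reuse the previous value if there is one.
                new_row[col] = last_seen.get((row[key], col), "")
            else:
                last_seen[(row[key], col)] = value
        filled.append(new_row)
    return filled


# Illustrative rows, deliberately out of date order.
rows = [
    {"date": "12/17/2020", "Province_State": "Texas", "doses_alloc_total": ""},
    {"date": "12/14/2020", "Province_State": "Texas", "doses_alloc_total": "100"},
    {"date": "12/18/2020", "Province_State": "Texas", "doses_alloc_total": ""},
]
for r in forward_fill(rows):
    print(r["date"], r["doses_alloc_total"])
```

Cells that are blank before the first reported value stay blank, since there is nothing to carry forward.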
Thanks very much for providing this excellent data.
I have a question regarding the data dictionary: what does “doses allocated” actually mean?
Hello all
Since mid-December, the Coronavirus Resource Center team has been manually collecting U.S. state vaccine data from publicly-reported sources. The data reported have evolved over the past two months, as well as the vaccine types being distributed. To accommodate the changing landscape, we are switching over to an automated data collection.
Not all states are reporting the same variables, so the dataset will be a mix of U.S. state-reported and CDC-reported data. Details about the logic/criteria regarding the use of U.S. state dashboard vs. CDC data will be explained in a repository readme file. As a result, there will also be a change in data structure. Soon, we will also add a "people" table to the repository, which will allow us to track doses and people vaccinated separately.
The new dataset can be found here. The data dictionary and list of sources can also be found in that repository.
We will concurrently update the original dataset, available here, until 2/23/21, to give consumers of the data an opportunity to update their systems. After that date, it will be archived in a different folder.
Thanks for administering this data.
Currently, the data here differs from the CDC site:
https://covid.cdc.gov/covid-data-tracker/#vaccinations
Are there plans to ensure consistency with the CDC vaccine distribution and administration data moving forward?
Dear CCI team,
Apologies if this isn't the right place to do this, but we could not find any contact information on https://coronavirus.jhu.edu/about and figured you could maybe forward this message to the right person.
There is an International Vaccination Efforts page in the JHU Coronavirus Resource Center that shows a table with vaccinations in all countries in the world. This page credits the Centers for Civic Impact as the source, and notes that "The CRC relies upon publicly available data from multiple sources."
Our understanding (based on this repo) is that the source of this data is actually our dataset of global vaccinations at @owid, made available here: https://github.com/owid/covid-19-data/tree/master/public/data/vaccinations
If this is indeed the case, would it be possible to correct the source metadata on that page?
Many thanks,
Edouard
Head of Data, Our World in Data
I've noticed that the new data format doesn't have any data prior to 2021-01-21. Should the data from the old format still be considered valid or was there a reason it wasn't included?
Thanks for the great data source!
The Vaccine Dose figures for New Jersey have been the same since 5/3. The numbers match what is currently in the hourly file as well.
New Jersey,2021-05-03,All,34.0,US,40.2989,-74.521,,8769895.0,7430120.0,4634016.0,3005463.0,"New Jersey, US"
New Jersey,2021-05-04,All,34.0,US,40.2989,-74.521,,8769895.0,7430120.0,4634016.0,3005463.0,"New Jersey, US"
New Jersey,2021-05-05,All,34.0,US,40.2989,-74.521,,9006975.0,7430120.0,4634016.0,3005463.0,"New Jersey, US"
In vaccine_data_us_state_timeline.csv, all of the Vaccine Doses Administered figures for Puerto Rico, North Carolina, and Arizona show a sharp increase starting June 30 and July 1. Are these data valid?
FWIW, the link to Arkansas' vaccine tracker is here. They don't appear to grab or store historical data, but at least it's being updated (the general dashboard doesn't have this data). Hope that helps.
Virgin Islands have duplicate rows for 12/26/2020. Could you have a look?
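Duplicates of this kind can be listed by counting how often each (state, date) pair appears. This is a minimal sketch with the standard library; the function name `duplicate_keys` is hypothetical, and the column names `Province_State` and `date` are assumed to match the timeline file.

```python
from collections import Counter


def duplicate_keys(rows, key_cols=("Province_State", "date")):
    """Return the sorted key tuples that occur in more than one row."""
    counts = Counter(tuple(row[c] for c in key_cols) for row in rows)
    return sorted(key for key, n in counts.items() if n > 1)


# Illustrative rows only.
rows = [
    {"Province_State": "Virgin Islands", "date": "12/25/2020"},
    {"Province_State": "Virgin Islands", "date": "12/26/2020"},
    {"Province_State": "Virgin Islands", "date": "12/26/2020"},
]
print(duplicate_keys(rows))  # [('Virgin Islands', '12/26/2020')]
```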
Previous spikes in historical data for total and positive tests in the graphic were anomalies caused by the shift in data collection that began March 3 when the Coronavirus Resource Center (CRC) began obtaining data from the Johns Hopkins Centers for Civic Impact rather than from the COVID Tracking Project (CTP), which ceased operations March 7. The CRC also now includes non-resident tests in Alaska and Florida and probable cases in Hawaii.
Hi all,
thank you so much for maintaining this repository!
Do you know why there sometimes is a difference between the sum of people_partially_vaccinated and people_fully_vaccinated vs. doses_admin? I would expect that the sum and the total should be roughly the same. For UAE, we see
people_partially_vaccinated | people_fully_vaccinated | Sum | Doses_admin | Difference
3836521 | 5081853 | 8,918,374 | 13,347,300 | -4,428,926
All the best,
Christoph
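One possible contributor to the gap, offered as an assumption rather than a confirmed explanation: with a two-dose vaccine, each fully vaccinated person accounts for two administered doses, so the plain sum undercounts doses. The arithmetic below uses the UAE figures quoted above; the variable names are illustrative.

```python
partial = 3_836_521       # people_partially_vaccinated
full = 5_081_853          # people_fully_vaccinated
doses_admin = 13_347_300  # Doses_admin

simple_sum = partial + full             # one dose counted per person
two_dose_estimate = partial + 2 * full  # two doses per fully vaccinated person

print(simple_sum - doses_admin)         # -4428926
print(two_dose_estimate - doses_admin)  # 652927
```

Counting two doses per fully vaccinated person narrows the gap considerably but does not close it. This sketch does not explain the remainder.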
hi, the numbers for Texas Stage_Two_Doses on 3/13/21 are off
Texas | 3/12/21 | 2602278 |
Texas | 3/13/21 | 5057046 |
Texas | 3/14/21 | 2832158 |
can you please fix them in the data set?
For the past two days Virginia has been presenting null values in the Vaccine Type field.
I don't know if you moved from the Newsletter to the state dashboard (https://www.vdh.virginia.gov/coronavirus/covid-19-vaccine-summary/), but this is likely the number of doses administered for J&J.
In this table:
The number of stage 1 doses in Guam is much greater than the population of Guam.
In the COVID-19 vaccines data dictionary, "unknown providers" is described as "COVID-19 vaccines from an unknown provider." Could that definition be clarified? Specifically, are these doses for which you could not identify the manufacturer (Pfizer, Moderna, etc.), or doses from a manufacturer other than Pfizer and Moderna, such as Johnson & Johnson?
Thanks again for your work on the testing data!
Once again, the number of total tests (tests_combined_total) for Puerto Rico was set to zero on a day when the number of new tests was probably zero, this time on 3/17. Perhaps that number should be the same as the previous day (407299)?
Thanks,
Robert
The Hourly Vaccine Data has not been updated. The last Date is 3/22. Is there an issue with the hourly bot?
In both GitHub and in the repository I cloned locally, the CSV file at https://github.com/govex/COVID-19/blob/master/data_tables/testing_data/time_series_covid19_US.csv only reports through March 2nd.
It reports daily commits in the History, so I'm not sure why it's missing data.
Thank you,
Simon St.Laurent
Will you accept a pull request if I add the most recent MA data to the CSV?
Formatting
Mortality ratio bar plot
Mortality ratio scatter plot
Good evening!
While calculating the doses allocated per provider, I found that Delaware and Kentucky have no classification for their vaccines. I would like to confirm whether those doses are unknown, Pfizer, or Moderna. In addition, when comparing the total doses administered with the sum of doses administered (Pfizer + Moderna + Unknown), I found a small difference for Alabama and Colorado. Do you intend to fix that difference?
Thank you for your attention!
I noticed that the two columns people_total and people_total_2nd_dose are missing for California, while CDC vaccination tracker actually has those numbers (link to CDC tracker: https://covid.cdc.gov/covid-data-tracker/#vaccinations). Are there any plans to populate these fields for CA? Thank you for this data source, it is extremely helpful!
cases_conf_probable and tests_combined_total have same values for the past 4 days for New Jersey
It appears the NJ Covid website has some updated data (https://dashboards.doh.nj.gov/views/DailyConfirmedCaseSummary7_22_2020/PCRandAntigenPositives?%3AshowAppBanner=false&%3Adisplay_count=n&%3AshowVizHome=n&%3Aorigin=viz_share_link&%3AisGuestRedirectFromVizportal=y&%3Aembed=y) -- see "PCR and Antigen Positives" tab
Thanks it is resolved
Regards
The entry for Tennessee on 1/9/2021 has an invalid value for 'people_total_2nd_dose'. It seems like it includes the entry for the previous day, a few spaces, plus the correct value.
01/09/2021,Tennessee,TN,Yes,,,,,328500,,,328500,223791,,,223791,214492,4561 8364
For Nevada on 2/11, the people_tested field has some invalid characters ("221343‬")
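Both kinds of bad cell, an embedded space and an invisible character, can be caught with a small validator. This is a sketch under stated assumptions: the function name `check_count` is hypothetical, it expects integer-formatted cells (values written like `8769895.0` would need extra handling), and U+202C is only a guess at the invisible character in the Nevada cell.

```python
import unicodedata


def check_count(cell):
    """Return (value, problems) for a cell expected to hold a non-negative integer.

    value is None when the cell is empty or cannot be parsed safely.
    """
    problems = []
    # Unicode category "Cf" covers invisible format characters such as U+202C.
    hidden = [c for c in cell if unicodedata.category(c) == "Cf"]
    if hidden:
        codes = ", ".join(f"U+{ord(c):04X}" for c in hidden)
        problems.append("invisible characters: " + codes)
    cleaned = "".join(c for c in cell if c not in hidden).strip()
    if cleaned == "":
        return None, problems
    if not (cleaned.isascii() and cleaned.isdigit()):
        problems.append(f"not a plain integer: {cleaned!r}")
        return None, problems
    return int(cleaned), problems


print(check_count("4561 8364"))       # the Tennessee cell: rejected
print(check_count("221343\u202c"))    # parsed, but the hidden character is reported
```

The validator refuses to guess which part of `4561 8364` is correct, so a cell like that still needs a manual fix.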