Giter Club home page Giter Club logo

micb25 / dka Goto Github PK

View Code? Open in Web Editor NEW
92.0 18.0 5.0 703.04 MB

Statistische Analyse und Visualisierung der täglichen Diagnoseschlüssel der deutschen COVID-19 Tracing-App (Corona-Warn-App).

Home Page: https://micb25.github.io/dka/

License: GNU General Public License v3.0

Python 38.08% Gnuplot 35.01% Shell 0.32% HTML 12.07% JavaScript 1.82% Standard ML 12.71%
corona coronavirus-data covid19 covid19-data covid19-tracing covid19-graph covid19-germany corona-warn-app germany exposure-notification

dka's Introduction

Hi there, I’m Michael 👋

I’m Michael Böhme, theoretical chemist 👨🏻‍🔬 and computer enthusiast 👨🏻‍💻 from Jena 💚, Germany. I’m currently working part-time at the Friedrich Schiller University in Jena doing scientific research in computational magnetochemistry. 🧲 You can have a look on my personal website, if you want to know more about my scientific work. 🧪

Besides, I’ve loved to code since I was 10 years old. 👦🏻 I learned to program on an old Commodore C64 👾 with the help of the great C64 User's Guide. 📖 Nowadays, I am mostly working with C++ and Python, sometimes also MatLab, BASH scripts or gnuplot. 📈 I’m currently learning a lot of 3D graphics programming in Vulkan. 🌋 In my spare time 🌞, I’m working on a yet secret project of mine. 😎 Stay tuned! 🖖

Twitter: theochemiker GitHub micb25 GitHub micb25

dka's People

Contributors

kemaeleon avatar micb25 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

dka's Issues

Diagnoseschlüsselveröffentlichung und gemeldete Neuinfektionen

https://micb25.github.io/dka/plots_de/plot_rki_cwa_cases.png

Derzeit werden die veroeffentlichten Diagnoseschluessel aller CWA Benutzer im direkten Verhältnis zu den gemeldeten Neuinfektionen aller Einwohner dargestellt.

Gesucht ist der Anteil derer ist, die nicht ihre positiven Testergebnisse als Warnung freischalten und zwar nicht im Bezug auf die Gesamtbevoelkerung sondern auf das was im Idealfall maximal moeglich ist.

Q = Einwohnerzahl/Appbenutzerzahl (Grössenverhältnis)
Die im Idealfall erreichbare Anzahl an gemeldeten Diagnoseschluesseln (Positivtests) =: X
Gemeldete Neuinfektionen =: N
Tatsächlich gemeldete Diagnoseschluessel =: Y
X = N / Q
"CWA Performance" in % = 100 / (X / Y)

Dies nur als Vorschlag. Gut moeglich dass sich hier noch Rechen oder Denkfehler verstecken!

Collect and parse hourly diagnosis key packages

The diagnosis key statistics should be based on the hourly packages to obtain a higher precision and to minimize problems connected with a padding multiplier change. To achieve this goal a rewrite of the scrapers and parsing scripts is necessary.

Add correction for padding multiplier in "Verteilung TRL"

The plots in "Verteilung Transmission Risk Level (TRL) in Diagnoseschlüsseln" currently use the number of keys transmitted including the padded fake keys afaiu. As long as the padding factor stays the same this shouldn't be a problem. But this factor will change from tomorrow on (the plan is to bring it down to 1 eventually). The changes in the padding multiplier will cause some distortion in those graphs as new data will receive less weight.

My suggestion would be to use the data which has been corrected for this multiplier like in the "Geteilte Diagnoseschlüssel von positiv getesteten Personen" section.
@mh- has introduced an automatic detection for the multiplier used in the data set in his parsing tool: corona-warn-app/cwa-server#620 (comment)

Inconsitent numbers per day and per week

"wöchentl. positiv getestete Personen, die Diagnoseschlüssel teilten", KW34: 486
"täglich positiv getestete Personen, die Diagnoseschlüssel teilten", 17.8.-23.8. Sum 504

Please add a licence

I would like to see if I can make a version for the dutch corona app. Please add a license.

How do you estimate number of users uploading keys

Wie berechnest / schätzt du eigentlich inzwischen die Anzahl der User, die DEK keys hochladen? Oder ist das eine Kennzahl, die das RKI inzwischen veröffentlicht?
Seit cws v 1.9 (?) / den neuen TRL Zuweisungen, wo 'days since symptoms' mit eingerechnet werden stimmt ja der 'naive' Ansatz mit dem TRL=6 nicht mehr?

Flasche Werte für 23.06

Die am 23.06 veröffentlichten Daten enthalten Schlüssel, die via App in der Zeit vom 16.06 bis 23.06 gemeldet wurden.
Deshalb muss man für die % Zahl auch die Summe der vom RKI gemeldeten Infektionen in Relation setzen (4023).
RKI: 345+580+770+601+687+537+503 statt 503
Nach meinen Berechnungen ergeben sich aber auch 44 Personen.
Hier meine Tabelle
https://github.com/Tho-Mat/corona-stuff/blob/master/%C3%BCberblick.xlsx

User:

16-23.06 44      <<< im Laufe des 23.06 wurde der Faktor von 1 auf 10 geändert.
   24.06 11 (incl. 2 Meldungen vom 23.06)
   25.06 19
   26.06 20
   27.06 15 (incl. 1 Meldung vom 26.06)
   28.06 8
   29.06 15 (incl. 1 Meldung vom 28.06)
   30.06 26 (incl. 1 Meldung vom 29.06)
   01.07 18 (incl. 1 Meldung vom 30.06)
   02.07 33 (incl. 1 Meldung vom 01.07)  oder 38 incl. 2 Meldungen vom 01.07 <<< hier wurde der Faktor von 10 auf 5 geändert
   03.07 16
   04.07 11
   05.07 3
   06.07 10

Da nicht immer 140 Schlüssel für die "hour" Dateien zusammenkommen werden 1 bis 2 Meldungen manchmal erst am nächsten Tag veröffentlicht.

Keine Updates mehr?

Kein Update seit den 4.2.2021.

Liefert das RKI keine Daten mehr? Oder werden die Grafiken nicht mehr aufgefrischt?

orthographic mistake

There's a comma missing in one of the graphs' titles. It should spell "Verhältnis zwischen positiv getesteten Personen, die Diagnoseschlüssel teilen, und den gemeldeten Neuinfektionen".

Weekly number of uploading users without teleTAN

Once a week, Telekom releases the total number of requested teleTANs. In the same weekly cadence, we could calculate:
Estimated number of uploading users without teleTAN = people uploading that week - issued teleTANs

Of course not every teleTAN must be used for upload so the actual number of people uploading without teleTAN can be higher.

With this weekly statistic we could better follow the usage of the QR code test retrieval process within CWA.

[Feature request] Provide raw data of app reports and JHU/RKI reports to allow time series analysis

Great dashboard, love it! Although I think it'd be good to highlight one or two really important graphs at the top, and leave the more detailed, less important ones at the bottom. (I think the number of reports per day and the proportion of case reported through app are the most important one - and should be at the top - but that's a separate issue)

I like that you compare app reports with case reports from JHU and RKI. It looks like JHU is better (not surprisingly) but there seems to be a 1 day shift between JHU and app. You call it a correlation, but you're not really doing that (plotting the cases from app and JHU on a scatter plot against each other).

This lends itself perfectly for a proper time series analysis (probably not something you come across as a chemist a lot, but econometricians do that all the time). Could you provide the raw data for your plot so that I can run a time series analysis? I think it'd be great if we could have a stable comparison, the lag of one day makes it look less stable than it could be, but it's best to do this properly and then adjust the plot.

Add parameter visualization for the parameters that are used to calculate the risk level

The corona warn app risk level calculation is parameterized by the RKI, the app downloads this parameters regularly. They are 5 or 6 parameters that define what signal strength and duration of exposures is considered a risk. All are transmitted using protobufs. I would be nice if this parametrization could be plotted as well and be able to see how this correlates with uploaded diagnosis keys over time.
I was planning to do this myself, but did not get to it up to now. Maybe you are interested in doing it.

Verhältnis zu RKI Infektionszahlen/Meldezahlen

Vielen Dank für die initiative. Wäre es sinnvoll, als Basis für das Verhältnis zu den RKI-Zahlen die Zahl der Fälle nach Meldetatum zu nehmen ("Für die Darstellung der neuübermittelten Fälle pro Tag wird das Meldedatum verwendet – das Datum, an dem das lokale Gesundheitsamt Kenntnis über den Fall erlangt und ihn elektronisch erfasst hat."- aus dem RKI-Dashboard Disclaimer)? Das könnte besser passen, als die tägl. Differenz der Gesamtzahl der Infizierten, schleppt aber immer drei, vier Tage nach.

Question on number of reported keys (23.June)

I have a question on your diagram of daily reported keys to corona warning app (plot_keys).
Your very first number on 23.June is 477 keys, but I received 503 keys on 24.June. All your other numbers match exactly with what I received.
Do you have an idea how to explain this difference on the first day? Could these be keys from the first week?

Week numbers seem to be behind by one

We're currently (05.08.) in Week 32, but last bars in the weekly diagrams - which seem to depict aggregate data for the current week - are denoted with "KW 31", shouldn't this be "KW 32" ?

Additionally: I'd actually suggest only displaying weekly bar diagrams for weeks which have concluded. Imho this makes the endpoints in those weekly graphs less confusing in general.

Corana Warn App enthält jetzt auch Schlüssel anderer Europäischer Länder

Hallo mic25,

sehe auf Deinen tollen Dashboards (vielen Dank!) seit letzter Woche einen größeren Anstieg insbesondere bei Personen die Diagnose Schlüssel teilen. Seit letzter Woche werden für die CWA ja auch Keys anderer europäischer Länder (bislang imho noch nicht viele) veröffentlicht. Deine Zahlen beziehen sich aber auf Deutschland. Falls möglich und noch nicht geschehen, solltest Du für Deine Statistiken nur die Deutschen Keys übernehmen.

Grüße

Add graph for keys per user

Each user can at max upload 13 keys. It seems that recently there has been a strong divergence in the number of users uploading keys vs. actual number keys uploaded:
22.08: 86 people vs 639 keys: 7.4 keys/person
20.08: 91 people vs 948 keys: 10.4 keys/person

My guess would be that maybe some ppl who got tested and wanted to receive their result via app only installed it once they got tested and therefore only very few keys will be submitted.

This information might also be interesting in context of the discussion on how to improve CWA efficiency/uptake in the wishlist repo: corona-warn-app/cwa-wishlist#150.

That's why I thought it would be nice to have a graph visualizing the keys/person development 🙂.

Update x-axis of Downloads graph daily

Currently, the x-axis of the "Summe ausgegebener TeleTANs" graph is updated daily, even if there is no new data for a day. This makes it easy to see how recent the latest data point is.

The same should be done for the Downloads graph, i.e. expand the x-axis daily, even though new data points might only be entered once a week.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.