Intoduction
This repository is Robert Cocking's Birkbeck Data Science Project, 2022.
Source Data- where it comes from
Training data initially comes from:
https://public.tableau.com/app/profile/institute.of.physics/viz/Anonymised_Hesa_dataset/Sheet1
Manual steps to download:
1- visit the url and go to Download, Data
2- a popup opens
3- allow download
4- use as required
The data is originally from https://www.hesa.ac.uk/, via https://www.jisc.ac.uk/tailored-datasets, supplied to the Institute of Physics. This data is then rounded and anonymised in line with HESA guidance https://www.hesa.ac.uk/about/regulation/data-protection/rounding-and-suppression-anonymise-statistics and later published to Tableau Public.
The rounding strategy was detailed in the project proposal- the code is available.