This repository contains the code relevant to my bachelor's thesis.
dataio.py
: functions to help with I/Ofigures.py
: figures seen in the Methods sectionpreprocessing.py
: preprocessingprocessing.py
: clustering and regressionvisualization.py
: figures seen in the Results section
To reproduce the results shown in the thesis:
- Create directories for data and figures:
mkdir data/ figures/
- Acquire the original data. Note: the starting data I used is stored as python pickles. Please don't actually download and open them, pickle files can execute arbitrary code. I should convert them to csv for sharing.
- Preprocess, process, and visualize:
python3 preprocessing.py && python3 processing.py && python3 visualization.py
Be warned, the clusterings can take a long time (~2.5 hours on an i5-4690k).