This repository contains a series of python scripts to generate the CPI (City Prosperity Index) dataset for 305 municipalities in Mexico.
The use of a virtual environment is recommended.
python -m venv venv
source venv/bin/activate
- data
- pdfs
pip install -r requirements.txt
This dataset is made from the tables in 305 pdf files. The command to download the files is as follows:
./scraper.py
./extractor.py
./preprocess.py
./fix_final_ds.py
After executing the above command the files CPI_Mex.csv
and CPI_Mex_full.csv
are generated.