Cross-platform Chrome History Analysis
There is a movement of self tracking. Monitoring pulse, heartbeat and so on. But the most important is not being tracked: our online behavior.
Making sure we can self document, we need the following things.
-
Chrome only keeps its history for a max of 90 days, so we need to start saving history.
-
We need to collect HTML data from the pages we visit and keep them cached.
-
(Cleaning up) We need to extract and analyze data from the HTML, such as code snippets, links, microdata, images, events.. anything really.
-
(Not started) Allow plugins per website for example. Extract trip date from airbnb page to connect to photos for example.
To come
-
For now, go to Extensions in Chrome, and "load unpacked". Open the
chromePlugin
in this folder. -
pip install nostalgia
-
To test it out, run
nostalgia serve
. Look at theboot_as_service
on how to runnostalgia
as a service on boot. -
Make sure you do not lose history, run
nostalgia backup_history
every (at most) 90 days.
Chrome history:
- Referral data
- Visit counts
HTML (after adding analyzer)
- DOM
- code snippets
- text
- publication date
- links
- microdata
- images
- events