The US Secretary of State's website publishes an overview of the official international travel of the Secretary of State. This data goes back continuously to 1905. Unfortunately the data is only available as a series of webpages and individual rows may contain multiple cities.
This repository contains a single CSV file of this data. It also includes the Node.js scripts used to process the data. Hopefully this data is useful visualization proejcts, statistical analyses, and other uses for geographic, time-based, historical data.
original_country
: text string identifying country scraped from Secretary Of State websiteoriginal_city
: text string identifying city scraped from Secretary Of State website.original_date
: text string identifying date interval scraped from Secretary Of State website.description
: text string describing the Secretary's travel arrangement.country_modified_for_geo
: text string based on original but modified for geocoding.city_modified_for_geo
: text string based on original but modified for geocoding.date
: unused should removesec_id
: text string identifying the Secretary of Statesec_name
: text string of Secretary's nameid
: chronological integer of destinationsglat
: latitude of location from Googleglon
: longitude of location from Googlegcity
: text string of what Google identifies as the city of the destinationgcountry
: text string of what Google identifies as the country of the destinationisGeocoded
: binary note that the destination was geocodedsplit_added
: binary note that the entry was interpolated from a single Secretary of State entry.original_line
: binary note that the entry generated multiple destinations, which havesplit_added
astrue
start_time
: start time of the destination in milliseconds since epoch.end_time
: end time of the destination in milliseconds since epoch.start_time_form
: start time of the destination in UTC format.end_time_form
: end time of the destination in UTC format.elapsed_days
: number of days at the destination
sec_id
: id of secretary. Same as in/data/destinations.csv
url
: url of the page where destination data was scrapedname
: full name of Secretaryyears
: years the Secretary served
country
: every country from theoriginal_country
fieldid
: rough approximation of world region: A: Americas, AS: Asia, AF: Africa, MD: Middle East, E: Europe
The dataset can be regenerated using four Node.js scripts
scrape-raw.js
: Pull in initial data and generatesecretaries.csv
fix-dates.js
: Transform the date/interval text string to structured formsplit-and-clean-locs.js
: Some original lines contain multiple cities over multiple days. This script splits them into unique destinations as long as the original line's destination lasted more than one day.geocode.js
: Uses Google's geocoding service to geocode the destinations. Requires a free API key.