Name: Environmental Data and Governance Initiative
Type: Organization
Bio: The Environmental Data & Governance Initiative (EDGI) documents, analyzes, and advocates for the federal provision of environmental data and governance.
Blog: www.envirodatagov.org
Environmental Data and Governance Initiative's Projects
ARCHIVED--Jupyter Notebooks investigating EPA quantitative dataset downloading
ARCHIVED--EPA site search scraper
ARCHIVED--Tool to generate sitemap of EPA, see https://github.com/edgi-govdata-archiving/sitemapper for current development
Monetary flows within EDGI and visualizations of how money moves through the organization
🔍 Go diffing service for comparing changes to webpages
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Technical guides for how to preserve and hold data
Next generation of halpy, built on hubot v3 and with all scripts updated to use most vurrent versions of APIs from relevant services
A Heroku buildpack to help bridge chat tools like Slack.
🎈 Start here for current projects, how to get involved, and joining community calls, a resource for new and veteran members
Seeing if we can use Github's PR tools to generate diffs between iterations of permits
Mini website crawler to make sitemap from a website.
Upload Datasets to S3 from the browser
ARCHIVED--Services to create xml, csv and json sitemaps of websites
Based on the EDGI Repo Guidelines at https://github.com/edgi-govdata-archiving/overview/blob/master/repo_guidelines.md
quick url proxy server
ARCHIVED--Bookmarklet to modify UI for Versionista website monitoring
ARCHIVED--A Ruby script that scrapes Versionista's web interface to generate a csv summarizing which websites and pages have had recent changes.
Landing page app with important info that participants can be sent through prior to joining a video call
WARC writing MITM HTTP/S proxy
A Python API to the Internet Archive Wayback Machine
Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")
Example scripts to count what n-grams were added and removed from every page EDGI’s Web Monitoring group is tracking.
An HTTP API for tracking and annotating changes to a set of web pages.
Tools for diffing and comparing web content. Also includes a web server that makes diffs available as an HTTP service.
🔍 Node.js diffing service for the website monitoring project
Documentation and configuration files for EDGI’s deployment of Web Monitoring tools.
Tools for access, "diff"-ing, and analyzing archived web pages
Experimental new tool for generating weekly analyst task sheets for web monitoring
UI to enable analysts to quickly assess changes to monitored government websites