When government data disappears: How to cross-check, preserve, and investigate
- Elizabeth Clemons
- 7 hours ago
- 2 min read

IRE 2025 and AccessFest 2025 Tip Sheet

Anna Massoglia, anna@sunlightresearch.net, X, LinkedIn, BlueSky
Jay Hunter, jay@hunterindex.org, X, LinkedIn, Substack
Michael Nolan, michael@sunlightsearch.net, LinkedIn
Jason Leopold, jleopold15@bloomberg.net, LinkedIn
Elizabeth Clemons, elizabeth@sunlightresearch.net, LinkedIn
This tip sheet can be used to find information on the disappearance of government data and explore what you can do to keep track of data relevant to your research and reporting.
Examples of Government Data and Information Archives
GW’s National Security Archive
Hunter Index (politicians’ personal financial information)
Requests (FOIA)Â
527 Explorer (IRSÂ
Nonprofit Explorer (IRS Form 990s)
Data Store (static, historical)
OpenSecrets.org (campaign finances, lobbying)
Tools for Archiving Content
Perma.cc (free for journalists)
Conifer Webrecorder by Rhizome (more complex websites)
ArchiveBox (self-hosted)
Tools for Archiving Data
Scraping code: https://github.com/m-nolan/doge-scrapeÂ
BLN Updating code: https://github.com/biglocalnews/sync-doge-scrape/
Creating web scrapers
Finding undocumented APIs: https://inspectelement.org/apis.html
Tools for Change Detection
Visualping – Visual/text change tracking with alerts, Chrome extension (freemium)
WebSite-Watcher – Advanced local tool for Windows ($$)
Distill.io – Content tracking with local app, browser, Chrome extension (freemium)
PageCrawl.io – Team-friendly archiving and alerts (freemium)
Wachete – Tracks private/password-protected pages (freemium)
ChangeTower – Keyword and content change alerts (freemium)
Fluxguard – HTML, visual and text tracking, translation (freemium)
Follow That Page – Basic email alerts for text changes
SiteDelta – Firefox-only in-browser tracker
KeyCDN Tools – check HTTP header for when page was last modified
Tips for Archiving
Creating Web Scrapers
Finding undocumented APIs: https://inspectelement.org/apis.html
DOGE Scraper Github Links
Scraping code: https://github.com/m-nolan/doge-scrapeÂ
BLN Updating code: https://github.com/biglocalnews/sync-doge-scrape/
Articles writen with BLN DOGE tables
University of Minnesota: Cuts, halted grant reviews could be ‘absolutely crippling’ to research
Unpacking DOGE’s plan to cut 14 federal office leases across Minnesota
How much federal money flows into Minnesota for health care, education, agriculture?
Evanston woman says her contract is among DOGE website errors
'We got un-DOGE'd': Inaccurate info discovered in claimed taxpayer savings
From DOGE cuts to tariffs, see Trump's first 100 days by the numbers
DEI, Project 2025 and the Constitution: Tracking Trump's impact in his first 100 days
DOGE claims at least $117 million in Bay Area contract cuts, spurring layoffs and uncertainty
EV uncertainty: How Trump's charging station funding freeze in affecting the DC area
Questions or comments about Sunlight's workshops and resources? Contact Elizabeth at elizabeth@sunlightresearch.net.