Government Backup

From Archiveteam
Revision as of 20:26, 29 December 2016 by Yan (talk | contribs) (add some redlinks; will try to create later)

The US Government holds an enormous amount of data, spread across many places. The 2016 elections signaled a sea change in goals and ideals (although previous transitions have always brought such changes). In response, a number of groups and efforts have sprung up to make off-site backups of as much government data as possible.

This page gives an overview of each team's effort.

Internet Archive

Internet Archive has two teams, Wayback and Archive-It (archive-it.org), working through listings of government websites and data stores. They are working internally using Internet Archive's crawlers and environment.

#DATAREFUGE

The Data Refuge project (ppehlab.org/datarefuge) maintains a Google document about climate datasets.

Archive Team FTP Backup

This Archive Team project is backing up 750+ FTP sites hosted on .MIL and .GOV domains. Its two phases can be tracked here (discovery phase) and here (download phase). The results of the download are being sent to this collection.
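
The split into a discovery phase and a download phase can be illustrated with a minimal Python sketch. This is not Archive Team's actual tooling; the function names (discover, ftp_lister, download) are hypothetical, and the ftplib adapter assumes the server supports the MLSD command, which many older FTP servers do not.

```python
from collections import deque
from ftplib import FTP
from typing import Callable, Iterable, List, Tuple

# A lister takes a directory path and yields (entry name, is_directory) pairs.
Lister = Callable[[str], Iterable[Tuple[str, bool]]]


def discover(list_dir: Lister, root: str = "/") -> List[str]:
    """Discovery phase: walk the tree breadth-first and return every file path."""
    files: List[str] = []
    queue = deque([root])
    while queue:
        current = queue.popleft()
        for name, is_dir in list_dir(current):
            path = current.rstrip("/") + "/" + name
            if is_dir:
                queue.append(path)   # visit this directory later
            else:
                files.append(path)   # record the file for the download phase
    return files


def ftp_lister(ftp: FTP) -> Lister:
    """Adapt an ftplib connection to the Lister shape (assumes MLSD support)."""
    def list_dir(path: str):
        for name, facts in ftp.mlsd(path):
            kind = facts.get("type")
            # Skip "cdir"/"pdir" (current/parent directory) and other entry types.
            if kind in ("dir", "file"):
                yield name, kind == "dir"
    return list_dir


def download(ftp: FTP, remote_path: str, local_path: str) -> None:
    """Download phase: fetch one discovered file in binary mode."""
    with open(local_path, "wb") as out:
        ftp.retrbinary("RETR " + remote_path, out.write)
```

Keeping discovery separate from download means the list of files can be saved, reviewed, and divided among many workers before any large transfer begins.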

Archive Team General Websites Download

In addition to the FTP download, Archive Team is doing a general download (where possible) of many crawlable government websites, such as usa.gov.