Difference between revisions of "Frequently Asked Questions"

From Archiveteam
Jump to navigation Jump to search
m (shuffle faqs most likely to be relevant to new users to the top)
(add faq of where saved files go)
Line 1: Line 1:
'''How can I help?'''
'''How can I help?'''


See [[Who We Are]], [[Deathwatch]], and [[:Category:Projects_status]]
See [[Who We Are]], [[Deathwatch]], and [[:Category:Projects_status]]. These pages describe our projects and the things you can do to help.


'''How should I go about backing things up?'''
'''How should I go about backing things up?'''
Line 14: Line 14:


There is a growing number of tools that can manipulate WARC files in [[The WARC Ecosystem]].
There is a growing number of tools that can manipulate WARC files in [[The WARC Ecosystem]].
'''Where do all the saved files go?'''
Files are ultimately uploaded to Internet Archive on the [https://archive.org/details/archiveteam Archive Team collection].


'''Is there a backup of the data on the archiveteam.org website? If so where can I download it?'''
'''Is there a backup of the data on the archiveteam.org website? If so where can I download it?'''

Revision as of 04:09, 4 October 2013

How can I help?

See Who We Are, Deathwatch, and Category:Projects_status. These pages describe our projects and the things you can do to help.

How should I go about backing things up?

What would you like to back up? If you want to mirror/backup a website, the de facto tool is Wget (but there's lots more, see Software!). WARC files are highly recommended as they can be ingested by the Wayback Machine.

If you want to back up your personal files, "List of backup software" at Wikipedia is an extensive list of backup software. See Backup Tips as well!

What are these WARC files in the Internet Archive? How do I extract files from a WARC file?

WARC files are de facto medium of digital preservation of the web. These WARC files are ingested by the Wayback Machine.

There is a growing number of tools that can manipulate WARC files in The WARC Ecosystem.

Where do all the saved files go?

Files are ultimately uploaded to Internet Archive on the Archive Team collection.

Is there a backup of the data on the archiveteam.org website? If so where can I download it?

Two sets of backups of this wiki are available. There are backups done by the hosting provider (several, going back days and weeks as well as hours), which use the storage capability of the shared hosting to keep them automatically (no tape or disk backups being done as most people would think of them). There are similarly copies of the database kept going back months.

Additionally, an XML dump of the Mediawiki database (which can be imported into any MediaWiki software) is accessible at http://www.archiveteam.org/dumps. New backups are currently pushed out once a week (and will be increased if changes on the site require it). All images are also wrapped into a images.tar.gz file, although our entire images directory is available at http://www.archiveteam.org/images.

Is there a mirror of the archiveteam.org website?

There are no mirrors we know of, although we encourage our more paranoid or protective readers to maintain one based on the above dumps.

There is a backup from August 03, 2011 available. The main things that are not included are: Site history, Edit & source of the pages, Special pages and other minor links. (See "Not Crawled.txt") Click here to download.

I went through the wiki and I still have a question! How do I contact the Archive Team?

Join us on IRC! (All channels and info listed here.) For general inquiries, visit #archiveteam on EFNet.