Difference between revisions of "Archive.today"

From Archiveteam
Jump to navigation Jump to search
m (Corrected "TB" to quoted "Tb", also linked to IA query blog post.)
Line 10: Line 10:
Archive.is is a privately funded on-demand archiving site, similar to [[WebCite]]. One key difference is that it stores "Web 2.0" pages better than WebCite; it also supports zip downloads of entire individual webpages and takes a screenshot of the webpage. It does not store PDFs, binary files, Adobe Flash content, videos, or sounds. The maximum size of a webpage it will archive (including images) is 50MB. Additionally, Archive.is forwards your IP address to the submitted website in a X-Forwarded-For header.<ref>http://blog.archive.today/post/111779719291/do-you-preserve-archivers-privacy-e-g-not</ref>
Archive.is is a privately funded on-demand archiving site, similar to [[WebCite]]. One key difference is that it stores "Web 2.0" pages better than WebCite; it also supports zip downloads of entire individual webpages and takes a screenshot of the webpage. It does not store PDFs, binary files, Adobe Flash content, videos, or sounds. The maximum size of a webpage it will archive (including images) is 50MB. Additionally, Archive.is forwards your IP address to the submitted website in a X-Forwarded-For header.<ref>http://blog.archive.today/post/111779719291/do-you-preserve-archivers-privacy-e-g-not</ref>


The website shot up significantly in popularity in the second half of 2014 primarily due to the GamerGate controversy. As of Feb. 2015, the website has archived about [http://blog.archive.today/post/111780063961/how-much-storage-is-archive-today-using-currently 200 TB of data.]
The website shot up significantly in popularity in the second half of 2014 primarily due to the GamerGate controversy. As of Feb. 2015, the website has archived about [http://blog.archive.today/post/111780063961/how-much-storage-is-archive-today-using-currently 200 "Tb" of data.] ''It is likely 200 Terabyte '''TB''', not Terabit '''Tb''' as is quoted. Nonetheless, if accurate, 200Tb ≈ 25TB.''


On April 14, 2014, Archive.is changed its name to Archive.today due to attacks against [http://www.isnic.is/en/ ISNIC].<ref>http://blog.archive.today/post/82775187091/curious-why-the-move-in-domain-names-from-archive-is</ref><ref>https://twitter.com/archiveis/status/455710701948903424</ref>
On April 14, 2014, Archive.is changed its name to Archive.today due to attacks against [http://www.isnic.is/en/ ISNIC].<ref>http://blog.archive.today/post/82775187091/curious-why-the-move-in-domain-names-from-archive-is</ref><ref>https://twitter.com/archiveis/status/455710701948903424</ref>
Line 29: Line 29:


[https://dl.dropboxusercontent.com/u/94483242/archive.is/archive.is_sitemaps.7z All sitemaps] (as of 2014/02/17)
[https://dl.dropboxusercontent.com/u/94483242/archive.is/archive.is_sitemaps.7z All sitemaps] (as of 2014/02/17)
As a side note, the [http://blog.archive.is/post/117445434661/would-you-consider-handing-over-all-the-captured administrator is unsupportive] of [[Internet Archive]]'s [[robots.txt]] policy - which could hinder future backup cooperation.


== Archives ==
== Archives ==

Revision as of 10:41, 12 June 2015

Archive.is
Archive-is 2013-07-02 17-05-40.png
URL archive.is[IAWcite.todayMemWeb]
Status Online!
Archiving status Not saved yet
Archiving type Unknown
IRC channel #archiveteam-bs (on hackint)

Archive.is is a privately funded on-demand archiving site, similar to WebCite. One key difference is that it stores "Web 2.0" pages better than WebCite; it also supports zip downloads of entire individual webpages and takes a screenshot of the webpage. It does not store PDFs, binary files, Adobe Flash content, videos, or sounds. The maximum size of a webpage it will archive (including images) is 50MB. Additionally, Archive.is forwards your IP address to the submitted website in a X-Forwarded-For header.[1]

The website shot up significantly in popularity in the second half of 2014 primarily due to the GamerGate controversy. As of Feb. 2015, the website has archived about 200 "Tb" of data. It is likely 200 Terabyte TB, not Terabit Tb as is quoted. Nonetheless, if accurate, 200Tb ≈ 25TB.

On April 14, 2014, Archive.is changed its name to Archive.today due to attacks against ISNIC.[2][3]

Funding

According to their FAQ:

It is privately funded, there in no complex finance behind it. It may look more or less reliable compared to the startup-style funding or an univercity project, depending on which risks are taken into account. My death can cause interruption of service, but something like new market condition or changing head of a department can not.

Site structure

A list of all domains currently archived is available here.

List of all domains from archive.is/alldomains (as of 2014/02/20) = 7,255,826 domains

Sadly, the url counts from /alldomains are out of date.

All sitemaps (as of 2014/02/17)

As a side note, the administrator is unsupportive of Internet Archive's robots.txt policy - which could hinder future backup cooperation.

Archives

/alldomains Archive

References