Difference between revisions of "Alive... OR ARE THEY"

From Archiveteam
Jump to navigation Jump to search
(Added JSFiddle)
m (Whoops, can't even get the right URL!)
Line 27: Line 27:
* '''[[Internet Archive]]''' ({{url|1=http://www.archive.org/}}) seems stable at the moment but its [https://archive.org/~tracey/mrtg/du.html 16 petabytes] of data aren't mirrored anywhere else, the code for their system isn't open source and generally they're a single point of failure for a large amount of the web's history. Why should there be only 1 internet archive?
* '''[[Internet Archive]]''' ({{url|1=http://www.archive.org/}}) seems stable at the moment but its [https://archive.org/~tracey/mrtg/du.html 16 petabytes] of data aren't mirrored anywhere else, the code for their system isn't open source and generally they're a single point of failure for a large amount of the web's history. Why should there be only 1 internet archive?
** There seems to be a second instance at [http://www.bibalex.org/isis/frontend/archive/archive_web.aspx Bibliotheca Alexandrina] although it's currently broken and out of date.
** There seems to be a second instance at [http://www.bibalex.org/isis/frontend/archive/archive_web.aspx Bibliotheca Alexandrina] although it's currently broken and out of date.
*'''[[JSFiddle]]''' ({{url|1=http://jsfiddle.com/}}) is referenced in many StackOverflow answers, as well as other forums, etc. It shows no signs of going away, but should we archive it just in case?
*'''[[JSFiddle]]''' ({{url|1=http://jsfiddle.net/}}) is referenced in many StackOverflow answers, as well as other forums, etc. It shows no signs of going away, but should we archive it just in case?
*'''[[Know Your Meme]]''' ({{url|1=http://knowyourmeme.com/}}) is at this point the de facto central repository for information on internet memes and culture. It is as popular as ever at the moment, but even with this popularity, former owners Rocketboom had trouble financing it. In the spring of 2011 was sold to Cheezburger Networks, a site which has been known to "reorganize" its properties, sometimes with a detrimental effect on content. Though it was quite a different story, I might remind people what happened to [[Encyclopedia Dramatica]].
*'''[[Know Your Meme]]''' ({{url|1=http://knowyourmeme.com/}}) is at this point the de facto central repository for information on internet memes and culture. It is as popular as ever at the moment, but even with this popularity, former owners Rocketboom had trouble financing it. In the spring of 2011 was sold to Cheezburger Networks, a site which has been known to "reorganize" its properties, sometimes with a detrimental effect on content. Though it was quite a different story, I might remind people what happened to [[Encyclopedia Dramatica]].
* '''[[Last.fm]]''' ({{url|1=http://www.last.fm/}}) is being cloned by free software developers in the form of [http://libre.fm Libre.fm] -- they have a tool, [http://svn.savannah.gnu.org/viewvc/*checkout*/trunk/lastscrape/lastscrape.py?root=librefm Lastscrape] which can get all your listening data out into a tab delimited text file.
* '''[[Last.fm]]''' ({{url|1=http://www.last.fm/}}) is being cloned by free software developers in the form of [http://libre.fm Libre.fm] -- they have a tool, [http://svn.savannah.gnu.org/viewvc/*checkout*/trunk/lastscrape/lastscrape.py?root=librefm Lastscrape] which can get all your listening data out into a tab delimited text file.

Revision as of 12:01, 6 September 2014

Like many sites before them, these places indicate a sunny outlook, a clean bill of health and a total sense of "all systems go". But as we've found out from those many sites before them, fortunes can change overnight.

Archive Team considers these sites specifically of interest because they solicit so much content, contain so many works and projects by a wide group of people, or have the internet particularly dependent on them. Consider this a fire drill.. know what you can do to get your data off these sites and back them off for later.

Sites

Not so alive, rather living deads (owned by Yahoo!):

  • Flickr contains billions of files, hundreds millions of which are under a Creative Commons license or stored there by many museums and other cultural institutions. The site was tumblr-ised in 2013 and has been poorly functional ever since; pro users were removed, so it doesn't yet have a business model. Additionally, it's owned by Yahoo!, need to say more?!

All the others:

In progress

Many of the sites above are too big to randomly start saving them and if we start they must not be so alive, but contingency plans wouldn't harm.

Website User Archiving Status Details Archives Archive Date Archive Format
pastebin [1] User:Arkiver Aborted Archive power can better be used for other websites. COMING 2013-12-14 - 2013-12-17 .warc.gz
User:joepie91 In progress... Downloading newest pastes .warc.gz