Archives

From Archiveteam
See also the Archive Team Collection at the Internet Archive.

These archives are available for download. They were made by Archive Team or by other volunteers and groups. The table is sorted by size.

If you have archived a site, you can add a link to the table by editing this page (or just drop a line in our IRC channel and we will add it).

Available for download

| Title/Download link | Description | Size |
|---|---|---|
| Geocities - The PATCHED Torrent (IA) | The popular web hosting service founded in 1994. It was closed by Yahoo! in 2009 | 641.4 GB |
| URL Shortener Backup Torrent v4 | URLTeam compressed backups of various URL shorteners (README) | 75 GB |
| URL Shortener Backup Torrent v3 (outdated, use v4) | URLTeam compressed backups of various URL shorteners (README) | 50 GB |
| URL Shortener Backup Torrent v2 (outdated, use v4) | URLTeam compressed backups of various URL shorteners (README) | 48 GB |
| URL Shortener Backup Torrent v1 (outdated, use v4) | URLTeam compressed backups of various URL shorteners (README) | 41.1 GB |
| Papers from Philosophical Transactions of the Royal Society | This archive contains 18,592 scientific publications totaling 33 GiB, all from Philosophical Transactions of the Royal Society. They should be available to everyone at no cost, but most have previously been available only at high prices through paywall gatekeepers like JSTOR. | 32.48 GB |
| The May 2011 Calufa Twitter Scrape | 90+ million tweets from more than 6 million users | 14.9 GB |
| Internet Gopher Archive 2007 (IA) | Archive of gopher sites | 14.8 GB |
| Encyclopedia Dramatica January 2010 Mirror | lulz | 11.7 GB |
| The TEXTFILES.COM Time Capsule | This collection comprises all the major text-based sets of the TEXTFILES.COM site | 11 GB |
| Salon Table Talk | Threads of this talk site | +6.0 GB |
| Usenet Archive of UTZOO Tapes | Collection of .TGZ files of very early USENET posted data | 2.0 GB |
| Quux.org Gopher Mirror Collection 2006 (IA) | This is a collection of mirrors maintained by gopher.quux.org. These mirrors were taken offline in 2006 due to bandwidth constraints | 1.5 GB |
| full-history-linux.git.tar | Git repository of the Linux kernel from 1991 to 2010 (details) | 594 MB |
| Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape | Almost 10 million tweets | 425 MB |
| The 2010 Reddit Research Project | Dataset on affinities of 60,000+ Reddit users, recorded in 2010 | ~360 MB |
| Archive Team Starwars.Yahoo.Com Panic Download | This is a panic download of the starwars.yahoo.com forums and profiles, done before Yahoo closed them on December 15, 2009. It includes as many messages, profiles, and pages related to the site as could be easily brought in. | ~250 MB |
| Social Structure of Facebook Networks Facebook Data Scrape | Facebook data scrape related to the paper "The Social Structure of Facebook Networks", by Amanda L. Traud, Peter J. Mucha, Mason A. Porter | 197 MB |
| Archive Team's Etherpad Time Capsule | This archive contains roughly 6,400 Etherpads, in their final state | 125 MB |
| WikiTeam archives | Archives about wikis. See WikiTeam | +100 MB |
| Archive Team | Archive Team.org Site Rip from August 03, 2011 | 75 MB |
| Boing Boing Posts Archive (2000-2011) | Two collections of Boing Boing postings provided by the cultural website boingboing.net on its 5th and 11th anniversaries | 42 MB |
| Archive Team Quotes Database Backup | Amusing snatches of conversation from IRC and other online gathering places | 5 MB |
| Mirror of Revelation Passage Series Website | wget of a small author's website. | ~500 KB |
| Archive Team Powerblogs Shutdown Snapshot | This is a 108-blog snapshot of the final month of Powerblogs, before their shutdown | ? |
| BBC Closing Panic Archives | Some BBC sites | ? |
| stillflying.net | A Firefly fan fiction site that made PDF scripts for the rest of season 1 and for season 2, imagining what would have been if Firefly had not been canceled. | 408.1 MB |
| Google Reader | Text for 46M feeds, per-feed statistics, Reader Directory search results | ~8800 GB |
| Total size | | ~9492 GB |

Archived but not available

See also

External links
