https://wiki.archiveteam.org/api.php?action=feedcontributions&user=Yan&feedformat=atomArchiveteam - User contributions [en]2024-03-28T09:44:26ZUser contributionsMediaWiki 1.37.1https://wiki.archiveteam.org/index.php?title=Template:Navigation_box&diff=28969Template:Navigation box2017-01-26T13:54:48Z<p>Yan: move government backup links</p>
<hr />
<div><br clear="all" /><center><!--<br />
<br />
<br />
<br />
<br />
Rows are in alphabetical order, except "Current events" at the top and "About Archive Team" at the bottom.<br />
Items inside rows are in alphabetical order too.<br />
Easy : )<br />
<br />
<br />
<br />
<br />
--><br />
{| class="mw-collapsible mw-collapsed" style="border: 1px solid #aaa; background-color: #f9f9f9; color: black; margin: 0.5em 0 0.5em 1em; padding: 0.2em; font-size: 100%;"<br />
| colspan=3 align=center style="background: #ccccff;" | <span style="float: right;"><span class="plainlinks">[[{{fullurl:Template:Navigation_box}} view]]&nbsp;&nbsp;[[{{fullurl:Template:Navigation_box|action=edit}} edit]]</span>&nbsp;</span>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;'''[[Archive Team]]'''&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Archiveteam:Current events|Current events]]''' || [[Alive... OR ARE THEY]] {{·}} [[Deathwatch]] {{·}} [[Projects]] || rowspan=5 | [[File:Archiveteam.jpg|right|150px]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Archiving projects]]''' || [[APKMirror]] {{·}} [[Archive.is]] {{·}} [[BetaArchive]] {{·}} [[Government Backup]] ([[DataRefuge|#datarefuge]] {{·}} [[ftp-gov]]) {{·}} [[Gmane]] {{·}} [[Internet Archive]] {{·}} [[It Died]] {{·}} [[Megalodon.jp]] {{·}} [[OldApps.com]] {{·}} [[OldVersion.com]] {{·}} [[OSBetaArchive]] {{·}} [[TEXTFILES.COM]] {{·}} [[The Dead, the Dying & The Damned]] {{·}} [[The Mail Archive]] {{·}} [[UK Web Archive]] {{·}} [[WebCite]] {{·}} [[Vaporwave.me]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Blogging''' || [[Blog.pl]] {{·}} [[Blogger]] {{·}} [[Blogster]] {{·}} [[Blogter.hu]] {{·}} [[Freeblog.hu]] {{·}} [[Fuelmyblog]] {{·}} [[Jux]] {{·}} [[LiveJournal]] {{·}} [[My Opera]] {{·}} [[Nolblog.hu]] {{·}} [[Open Diary]] {{·}} [[ownlog.com]] {{·}} [[Posterous]] {{·}} [[Powerblogs]] {{·}} [[Proust]] {{·}} [[Roon]] {{·}} [[Splinder]] {{·}} [[Tumblr]] {{·}} [[Vox]] {{·}} [[Weblog.nl]] {{·}} [[Windows Live Spaces]] {{·}} [[Wordpress.com]] {{·}} [[Xanga]] {{·}} [[Yahoo! Blog]] {{·}} [[Zapd]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Clown hosting|Cloud hosting]]/file sharing''' || [[ADrive|aDrive]] {{·}} [[AnyHub]] {{·}} [[Box]] {{·}} [[Dropbox]] {{·}} [[Docstoc]] {{·}} [[Google Drive]] {{·}} [[Google Groups Files]] {{·}} [[iCloud]] {{·}} [[Fileplanet]] {{·}} [[LayerVault]] {{·}} [[MediaCrush]] {{·}} [[MediaFire]] {{·}} [[Mega]] {{·}} [[MegaUpload]] {{·}} [[MobileMe]] {{·}} [[OneDrive]] {{·}} [[Pomf.se]] {{·}} [[RapidShare]] {{·}} [[Ubuntu One]] {{·}} [[Yahoo! Briefcase]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[:Category:Corporations|Corporations]]''' || [[Apple]] {{·}} [[IBM]] {{·}} [[Google]] {{·}} [[Lycos Europe]] {{·}} [[Microsoft]] {{·}} [[Yahoo!]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Events''' || [[Arab Spring]] {{·}} [[Great Ape-Snake War]] {{·}} [[Spanish Revolution]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Font Repos''' || [[Google Web Fonts]] {{·}} [[GNU FreeFont]] {{·}} [[Fontspace]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Forums/Message boards''' || colspan=2 | [[4chan]] {{·}} [[Captain Luffy Forums]] {{·}} [[College Confidential]] {{·}} [[DSLReports]] {{·}} [[ESPN Forums]] {{·}} [[forums.starwars.com]] {{·}} [[HeavenGames]] {{·}} [[Invisionfree]] {{·}} [[The Classic Horror Film Board]] {{·}} [[Yahoo! Messages]] {{·}} [[Yahoo! Neighbors]] {{·}} [[Yuku.com]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Games|Gaming]]''' || colspan=2 | [[Atomicgamer]] {{·}} [[City of Heroes]] {{·}} [[Club Nintendo]] {{·}} [[CSGO Lounge|CS:GO Lounge]] {{·}} [[Desura]] {{·}} [[Dota 2 Lounge]] {{·}} [[Emulation Zone]] {{·}} [[GameMaker Sandbox]] {{·}} [[GameTrailers]] {{·}} [[Halo]] {{·}} [[HLTV.org]] {{·}} [[Infinite Crisis]] {{·}} [[Minecraft.net]] {{·}} [[Player.me]] {{·}} [[Playfire]] {{·}} [[Steam]] {{·}} [[SteamDB]] {{·}} [[TF2 Outpost]] {{·}} [[Warhammer]] {{·}} [[Xfire]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Image hosting]]''' || [[500px]] {{·}} [[AOL Pictures]] {{·}} [[Blipfoto]] {{·}} [[Blingee]] {{·}} [[Canv.as]] {{·}} [[Camera+]] {{·}} [[Cameroid]] {{·}} [[DailyBooth]] {{·}} [[Degree Confluence Project]] {{·}} [[deviantART]] {{·}} [[Demotivalo.net]] {{·}} [[Flickr]] {{·}} [[Fotoalbum.hu]] {{·}} [[Fotolog.com]] {{·}} [[Fotopedia]] {{·}} [[Frontback]] {{·}} [[Geograph Britain and Ireland]] {{·}} [[GTF Képhost]] {{·}} [[ImageShack]] {{·}} [[Imgur]] {{·}} [[Inkblazers]] {{·}} [[Instagr.am]] {{·}} [[Kepfeltoltes.hu]] {{·}} [[Kephost.com]] {{·}} [[Kephost.hu]] {{·}} [[Kepkezelo.com]] {{·}} [[Keptarad.hu]] {{·}} [[Madden GIFERATOR]] {{·}} [[MLKSHK]] {{·}} [[Microsoft Clip Art]] {{·}} [[Microsoft Photosynth]] {{·}} [[Nokia Memories]] {{·}} [[noob.hu]] {{·}} [[Odysee]] {{·}} [[Panoramio]] {{·}} [[Photobucket]] {{·}} [[Picasa]] {{·}} [[Picplz]] {{·}} [[Pixiv]] {{·}} [[PSharing]] {{·}} [[Ptch]] {{·}} [[puu.sh]] {{·}} [[Rawporter]] {{·}} [[Relay.im]] {{·}} [[ScreenshotsDatabase.com]] {{·}} [[Snapjoy]] {{·}} [[Streetfiles]] {{·}} [[Tabblo]] {{·}} [[Trovebox]] {{·}} [[TwitPic]] {{·}} [[Wallbase]] {{·}} [[Wallhaven]] {{·}} [[Webshots]] {{·}} [[Wikimedia Commons]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Knowledge/[[Wikis]]''' || colspan=2 | [[arXiv]] {{·}} [[Citizendium]] {{·}} [[Clipboard.com]] {{·}} [[Deletionpedia]] {{·}} [[EditThis]] {{·}} [[Encyclopedia Dramatica]] {{·}} [[Etherpad]] {{·}} [[Everything2]] {{·}} [[infoAnarchy]] {{·}} [[GeoNames]] {{·}} [[GNUPedia]] {{·}} [[Google Books]] ([[Google Books Ngram]]) {{·}} [[Horror Movie Database]] {{·}} [[Insurgency Wiki]] {{·}} [[Knol]] {{·}} [[Library Genesis]] {{·}} [[Lost Media Wiki]] {{·}} [[Neoseeker.com]] {{·}} [[Notepad.cc]] {{·}} [[Nupedia]] {{·}} [[OpenCourseWare]] {{·}} [[OpenStreetMap]] {{·}} [[Orain]] {{·}} [[Pastebin]] {{·}} [[Patch.com]] {{·}} [[Project Gutenberg]] {{·}} [[Puella Magi]] {{·}} [[Referata]] {{·}} [[Resedagboken]] {{·}} [[SongMeanings]] {{·}} [[ShoutWiki]] {{·}} [[The Internet Movie Database]] {{·}} [[TropicalWikis]] {{·}} [[Uncyclopedia]] {{·}} [[Urban Dictionary]] {{·}} [[Webmonkey]] {{·}} [[Wikia]] {{·}} [[Wikidot]] {{·}} [[WikiHow]] {{·}} [[Wikkii]] {{·}} [[WikiLeaks]] {{·}} [[Wikipedia]] ([[Simple English Wikipedia]]) {{·}} [[Wikispaces]] {{·}} [[Wikispot]] {{·}} [[Wik.is]] {{·}} [[Wiki-Site]] {{·}} [[WikiTravel]] {{·}} [[Word Count Journal]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Magazines/Blogs/News''' || colspan=2 | [[Cyberpunkreview.com]] {{·}} [[Game Developer Magazine]] {{·}} [[Gigaom]] {{·}} [[Helium]] {{·}} [[JPG Magazine]] {{·}} [[Polygamia.pl]] {{·}} [[San Fransisco Bay Guardian]] {{·}} [[Scoop]] {{·}} [[Regretsy]] {{·}} [[Yahoo! Voices]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Microblogging]]''' || colspan=2 | [[Heello]] {{·}} [[Identi.ca]] {{·}} [[Jaiku]] {{·}} [[Mommo.hu]] {{·}} [[Plurk]] {{·}} [[Sina Weibo]] {{·}} [[Twitter]] {{·}} [[TwitLonger]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Music/Audio''' || colspan=2 | [[AOL Music]] {{·}} [[Audimated.com]] {{·}} [[Cinch]] {{·}} [[digCCmixter]] {{·}} [[Dogmazic.net]] {{·}} [[Earbits]] {{·}} [[exfm]] {{·}} [[Free Music Archive]] {{·}} [[Gogoyoko]] {{·}} [[Indaba Music]] {{·}} [[Instacast]] {{·}} [[Jamendo]] {{·}} [[Last.fm]] {{·}} [[Music Unlimited]] {{·}} [[MOG]] {{·}} [[PureVolume]] {{·}} [[Reverbnation]] {{·}} [[ShareTheMusic]] {{·}} [[SoundCloud]] {{·}} [[Soundpedia]] {{·}} [[This Is My Jam]] {{·}} [[TuneWiki]] {{·}} [[Twaud.io]] {{·}} [[WinAmp]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''People''' || colspan=2 | [[Aaron Swartz]] {{·}} [[Michael S. Hart]] {{·}} [[Steve Jobs]] {{·}} [[Mark Pilgrim]] {{·}} [[Dennis Ritchie]] {{·}} [[Len Sassaman Project]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Protocols/Infrastructure''' || colspan=2 | [[FTP]] {{·}} [[Gopher]] {{·}} [[IRC]] {{·}} [[Usenet]] {{·}} [[World Wide Web]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Q&A''' || colspan=2 | [[Askville]] {{·}} [[Answerbag]] {{·}} [[Answers.com]] {{·}} [[Ask.com]] {{·}} [[Askalo]] {{·}} [[Baidu Knows]] {{·}} [[Blurtit]] {{·}} [[ChaCha]] {{·}} [[Experts Exchange]] {{·}} [[Formspring]] {{·}} [[GirlsAskGuys]] {{·}} [[Google Answers]] {{·}} [[Google Baraza]] {{·}} [[JustAnswer]] {{·}} [[MetaFilter]] {{·}} [[Quora]] {{·}} [[Retrospring]] {{·}} [[StackExchange]] {{·}} [[The AnswerBank]] {{·}} [[The Internet Oracle]] {{·}} [[Uclue]] {{·}} [[WikiAnswers]] {{·}} [[Yahoo! Answers]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Recipes/Food''' || colspan=2 | [[Allrecipes]] {{·}} [[Epicurious]] {{·}} [[Food.com]] {{·}} [[Foodily]] {{·}} [[Food Network]] {{·}} [[Punchfork]] {{·}} [[ZipList]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Social bookmarking''' || colspan=2 | [[Addinto]] {{·}} [[Backflip]] {{·}} [[Balatarin]] {{·}} [[BibSonomy]] {{·}} [[Bkmrx]] {{·}} [[Blinklist]] {{·}} [[BlogMarks]] {{·}} [[BookmarkSync]] {{·}} [[CiteULike]] {{·}} [[Connotea]] {{·}} [[Delicious]] {{·}} [[Designer News]] {{·}} [[Digg]] {{·}} [[Diigo]] {{·}} [[Dir.eccion.es]] {{·}} [[Evernote]] {{·}} [[Excite Bookmark]] {{·}} [[Faves]] {{·}} [[Favilous]] {{·}} [[folkd]] {{·}} [[Freelish]] {{·}} [[Getboo]] {{·}} [[GiveALink.org]] {{·}} [[Gnolia]] {{·}} [[Google Bookmarks]] {{·}} [[Hacker News]] {{·}} [[HeyStaks]] {{·}} [[IndianPad]] {{·}} [[Kippt]] {{·}} [[Knowledge Plaza]] {{·}} [[Licorize]] {{·}} [[Linkwad]] {{·}} [[Menéame]] {{·}} [[Microsoft Developer Network]] {{·}} [[myVIP]] {{·}} [[Mister Wong]] {{·}} [[My Web]] {{·}} [[Mylink Vault]] {{·}} [[Newsvine]] {{·}} [[Oneview]] {{·}} [[Pearltrees]] {{·}} [[Pinboard]] {{·}} [[Pocket]] {{·}} [[Propeller.com]] {{·}} [[Reddit]] {{·}} [[sabros.us]] {{·}} [[Scloog]] {{·}} [[Scuttle]] {{·}} [[Simpy]] {{·}} [[SiteBar]] {{·}} [[Slashdot]] {{·}} [[Squidoo]] {{·}} [[StumbleUpon]] {{·}} [[Twine]] {{·}} [[Vizited]] {{·}} [[Yummymarks]] {{·}} [[Xmarks]] {{·}} [[Yahoo! Buzz]] {{·}} [[Zootool]] {{·}} [[Zotero]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Social network|Social networks]]''' || colspan=2 | [[Bebo]] {{·}} [[BlackPlanet]] {{·}} [[Classmates.com]] {{·}} [[Cyworld]] {{·}} [[Dogster]] {{·}} [[Dopplr]] {{·}} [[douban]] {{·}} [[Ello]] {{·}} [[Facebook]] {{·}} [[Flixster]] {{·}} [[FriendFeed]] {{·}} [[Friendster]] {{·}} [[Friends Reunited]] {{·}} [[Gaia Online]] {{·}} [[Google+]] {{·}} [[Habbo]] {{·}} [[hi5]] {{·}} [[Hyves]] {{·}} [[iWiW]] {{·}} [[LinkedIn]] {{·}} [[Miiverse]] {{·}} [[mixi]] {{·}} [[MyHeritage]] {{·}} [[MyLife]] {{·}} [[Myspace]] {{·}} [[myVIP]] {{·}} [[Netlog]] {{·}} [[Odnoklassniki]] {{·}} [[Orkut]] {{·}} [[Plaxo]] {{·}} [[Qzone]] {{·}} [[Renren]] {{·}} [[Skyrock]] {{·}} [[Sonico.com]] {{·}} [[Storylane]] {{·}} [[Tagged]] {{·}} [[tvtag]] {{·}} [[Upcoming]] {{·}} [[Viadeo]] {{·}} [[Vine]] {{·}} [[Vkontakte]] {{·}} [[WeeWorld]] {{·}} [[Weibo]] {{·}} [[Wretch]] {{·}} [[Yahoo! Groups]] {{·}} [[Yahoo! Stars India]] {{·}} [[Yahoo! Upcoming]] {{·}} [[Social network|more sites...]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Shopping/Retail''' || colspan=2 | [[Alibaba]] {{·}} [[AliExpress]] {{·}} [[Amazon]] {{·}} [[Apple Store]] {{·}} [[eBay]] {{·}} [[Printfection]] {{·}} [[RadioShack]] {{·}} [[Sears]] {{·}} [[Target]] {{·}} [[The Book Depository]] {{·}} [[ThinkGeek]] {{·}} [[Walmart]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Software/[[Code hosting services|code hosting]]''' || colspan=2 | [[Android Development]] {{·}} [[Alioth]] {{·}} [[Assembla]] {{·}} [[BerliOS]] {{·}} [[Betavine]] {{·}} [[Bitbucket]] {{·}} [[BountySource]] {{·}} [[Codecademy]] {{·}} [[CodePlex]] {{·}} [[Freepository]] {{·}} [[Free Software Foundation]] {{·}} [[GNU Savannah]] {{·}} [[GitHost]] {{·}} [[GitHub]] {{·}} [[GitHub Downloads]] {{·}} [[Gitorious]] {{·}} [[Gna!]] {{·}} [[Google Code]] {{·}} [[ibiblio]] {{·}} [[java.net]] {{·}} [[JavaForge]] {{·}} [[KnowledgeForge]] {{·}} [[Launchpad]] {{·}} [[LuaForge]] {{·}} [[Maemo]] {{·}} [[mozdev]] {{·}} [[OSOR.eu]] {{·}} [[OW2 Consortium]] {{·}} [[Openmoko]] {{·}} [[OpenSolaris]] {{·}} [[Ourproject.org]] {{·}} [[Ovi Store]] {{·}} [[Project Kenai]] {{·}} [[RubyForge]] {{·}} [[SEUL.org]] {{·}} [[SourceForge]] {{·}} [[Stypi]] {{·}} [[TestFlight]] {{·}} [[tigris.org]] {{·}} [[Transifex]] {{·}} [[TuxFamily]] {{·}} [[Yahoo! Downloads]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Torrenting/Piracy''' || colspan=2 | [[ExtraTorrent]] {{·}} [[EZTV]] {{·}} [[isoHunt]] {{·}} [[KickassTorrents]] {{·}} [[The Pirate Bay]] {{·}} [[Torrentz]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Video hosting]]''' || colspan=2 | [[Academic Earth]] {{·}} [[Blip.tv]] {{·}} [[Epic]] {{·}} [[Google Video]] {{·}} [[Justin.tv]] {{·}} [[Niconico]] {{·}} [[Nokia Trailers]] {{·}} [[Qwiki]] {{·}} [[Skillfeed]] {{·}} [[Stickam]] {{·}} [[TED Talks]] {{·}} [[Ticker.tv]] {{·}} [[Twitch.tv]] {{·}} [[Ustream]] {{·}} [[Videoplayer.hu]] {{·}} [[Viddler]] {{·}} [[Viddy]] {{·}} [[Vimeo]] {{·}} [[Vine]] {{·}} [[Vstreamers]] {{·}} [[Yahoo! Video]] {{·}} [[YouTube]] {{·}} [[Famous Internet videos]] ([[Me at the zoo]])<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[List of website hosts|Web hosting]]''' || [[Angelfire]] {{·}} [[Brace.io]] {{·}} [[BT Internet]] {{·}} [[CableAmerica Personal Web Space]] {{·}} [[Claranet Netherlands Personal Web Pages]] {{·}} [[Comcast Personal Web Pages]] {{·}} [[Extra.hu]] {{·}} [[FortuneCity]] {{·}} [[Free ProHosting]] {{·}} [[GeoCities]] ([[GeoCities Torrent Patch|patch]]) {{·}} [[Google Business Sitebuilder]] {{·}} [[Google Sites]] {{·}} [[Internet Centrum]] {{·}} [[MBinternet]] {{·}} [[MSN TV]] {{·}} [[Nwnyet]] {{·}} [[Parodius Networking]] {{·}} [[Prodigy.net]] {{·}} [[Saunalahti Iso G]] {{·}} [[Swipnet]] {{·}} [[Telenor]] {{·}} [[Tripod]] {{·}} [[University of Michigan personal webpages]] {{·}} [[Verizon Mysite]] {{·}} [[Verizon Personal Web Space]] {{·}} [[Webzdarma]] {{·}} [[Virgin Media]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Web applications''' || colspan=2 | [[Mailman]] {{·}} [[MediaWiki]] {{·}} [[phpBB]] {{·}} [[Simple Machines Forum]] {{·}} [[vBulletin]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Other''' || colspan=2 | [[800notes]] {{·}} [[AOL]] {{·}} [[Akoha]] {{·}} [[Ancestry.com]] {{·}} [[April Fools' Day]] {{·}} [[Amplicate]] {{·}} [[AutoAdmit]] {{·}} [[Bre.ad]] {{·}} [[Circavie]] {{·}} [[Cobook]] {{·}} [[Co.mments]] {{·}} [[Countdown]] {{·}} [[Distill]] {{·}} [[Dmoz]] {{·}} [[Easel]] {{·}} [[Eircode]] {{·}} [[Electronic Frontier Foundation]] {{·}} [[FanFiction.Net]] {{·}} [[Feedly]] {{·}} [[Ficlets]] {{·}} [[Forrst]] {{·}} [[FunnyExam.com]] {{·}} [[FurAffinity]] {{·}} [[Google Helpouts]] {{·}} [[Google Moderator]] {{·}} [[Google Reader]] {{·}} [[ICQmail]] {{·}} [[IFTTT]] {{·}} [[Jajah]] {{·}} [[JuniorNet]] {{·}} [[Lulu Poetry]] {{·}} [[Mobile Phone Applications]] {{·}} [[Mochi Media]] {{·}} [[Mozilla Firefox]] {{·}} [[MyBlogLog]] {{·}} [[NBII]] {{·}} [[Neopets]] {{·}} [[Quantcast]] {{·}} [[Quizilla]] {{·}} [[Salon Table Talk]] {{·}} [[Shutdownify]] {{·}} [[Slidecast]] {{·}} [[SOPA blackout pages]] {{·}} [[starwars.yahoo.com]] {{·}} [[TechNet]] {{·}} [[Toshiba Support]] {{·}} [[USA-Gov]] {{·}} [[Volán]] {{·}} [[Widgetbox]] {{·}} [[Windows Technical Preview]] {{·}} [[Wunderlist]] {{·}} [[Zoocasa]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Information''' || colspan=2 | [[A Million Ways to Die on the Web]] {{·}} [[Backup Tips]] {{·}} [[Cheap storage]] {{·}} [[Collecting items randomly]] {{·}} [[Data compression algorithms and tools]] {{·}} [[Dev]] {{·}} [[Discovery Data]] {{·}} [[DOS Floppies]] {{·}} [[Fortress of Solitude]] {{·}} [[Keywords]] {{·}} [[Naughty List]] {{·}} [[Nightmare Projects]] {{·}} [[Rescuing Floppy Disks|Rescuing floppy disks]] {{·}} [[Rescuing optical media]] {{·}} [[Site exploration]] {{·}} [[The WARC Ecosystem]] {{·}} [[Working with ARCHIVE.ORG]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Projects]]''' || colspan=2 | [[ArchiveCorps]] {{·}} [[Audit2014]] {{·}} [[Emularity]] {{·}} [[Faceoff]] {{·}} [[FlickrFckr]] {{·}} [[Froogle]] {{·}} [[INTERNETARCHIVE.BAK]] ([[Internet Archive Census]]) {{·}} [[IRC Quotes]] {{·}} [[Javascript Mess|JSMESS]] {{·}} [[Jsvlc|JSVLC]] {{·}} [[Just Solve the Problem 2012|Just Solve the Problem]] {{·}} [[NewsGrabber]] {{·}} [[Project Newsletter]] {{·}} [[Valhalla]] {{·}} [[Web Roasting]] ([[ISP Hosting]] {{·}} [[University Web Hosting]]) {{·}} [[Woohoo]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Software|Tools]]''' || colspan=2 | [[ArchiveBot]] {{·}} [[ArchiveTeam Warrior]] ([[Tracker]]) {{·}} [[Google Takeout]] {{·}} [[HTTrack options|HTTrack]] {{·}} [[Video|Video downloaders]] {{·}} [[Wget]] ([[Wget with Lua hooks|Lua]] {{·}} [[Wget with WARC output|WARC]])<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Teams''' || colspan=2 | [[Bibliotheca Anonoma]] {{·}} [[LibreTeam]] {{·}} [[URLTeam]] {{·}} [[Yahoo Video Warroom]] {{·}} [[WikiTeam]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''About [[Archive Team]]''' || colspan=2 | [[Introduction]] {{·}} [[Philosophy]] {{·}} [[Who We Are]] {{·}} [[Robots.txt|Our stance on robots.txt]] {{·}} [[Why Back Up?]] {{·}} [[Software]] {{·}} [[Formats]] {{·}} [[Storage Media]] {{·}} [[Recommended Reading]] {{·}} [[Films and documentaries about archiving]] {{·}} [[Talks]] {{·}} [[In The Media]] {{·}} [[Frequently Asked Questions|FAQ]]<br />
|}<br />
</center>[[Category:Archive Team]]<noinclude>[[Category:Templates]]</noinclude></div>Yanhttps://wiki.archiveteam.org/index.php?title=DataRefuge&diff=28968DataRefuge2017-01-24T14:45:49Z<p>Yan: fix year</p>
<hr />
<div>Data Refuge ([http://www.ppehlab.org/datarefuge #datarefuge]) is a project to preserve United States federal climate and environmental data, started in response to the 2016 US presidential election. It is an experiment by the Penn Program in the Environmental Humanities ([http://www.ppehlab.org/ ppehlab.org/]). They collaborate with [https://projectarcc.org/ Project_ARCC] and the University of Michigan Libraries, among others.<br />
<br />
They hold real-life "Data Rescue" events ([http://www.ppehlab.org/datarescue/ ppehlab.org/datarescue/]) and can be followed on their [https://twitter.com/PPEHLab Twitter account].<br />
<br />
== Tools ==<br />
Their tooling is available at [https://envirodatagov.org/event-toolkit/ envirodatagov.org/event-toolkit/], with instructions [https://docs.google.com/document/d/1PeWefW2toThs-Pbw0CMv2us7wxQI0gRrP1LGuwMp_UQ/edit here]. They maintain a list of government climate datasets [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 here].<br />
<br />
== Results ==<br />
They form part of the University of North Texas Libraries' [http://digital2.library.unt.edu/nomination/eth2016/about/ End of Term Presidential Harvest 2016], in collaboration with the [[Library of Congress]] and the [[Internet Archive]]. Data they collect may also be hosted by the organisations holding the Data Rescue events, in a custom repository built using Amazon Web Services integrated with CKAN, or both.<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=DataRefuge&diff=28967DataRefuge2017-01-24T14:43:56Z<p>Yan: fix naming</p>
<hr />
<div>Data Refuge ([http://www.ppehlab.org/datarefuge #datarefuge]) is a project to preserve United States federal climate and environmental data, started in response to the 2016 US presidential election. It is an experiment by the Penn Program in the Environmental Humanities ([http://www.ppehlab.org/ ppehlab.org/]). They collaborate with [https://projectarcc.org/ Project_ARCC] and the University of Michigan Libraries, among others.<br />
<br />
They hold real-life "Data Rescue" events ([http://www.ppehlab.org/datarescue/ ppehlab.org/datarescue/]) and can be followed on their [https://twitter.com/PPEHLab Twitter account].<br />
<br />
== Tools ==<br />
Their tooling is available at [https://envirodatagov.org/event-toolkit/ envirodatagov.org/event-toolkit/], with instructions [https://docs.google.com/document/d/1PeWefW2toThs-Pbw0CMv2us7wxQI0gRrP1LGuwMp_UQ/edit here]. They maintain a list of government climate datasets [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 here].<br />
<br />
== Results ==<br />
They form part of the University of North Texas Libraries' [http://digital2.library.unt.edu/nomination/eth2016/about/ End of Term Presidential Harvest 2016], in collaboration with the [[Library of Congress]] and the [[Internet Archive]]. Data they collect may also be hosted by the organisations holding the Data Rescue events, in a custom repository built using Amazon Web Services integrated with CKAN, or both.<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=DataRefuge&diff=28966DataRefuge2017-01-24T14:43:00Z<p>Yan: add footer</p>
<hr />
<div>Data Refuge ([http://www.ppehlab.org/datarefuge #datarefuge]) is a project to preserve United States federal climate and environmental data, started in response to the 2016 US presidential election. It is an experiment by the Penn Program in the Environmental Humanities ([http://www.ppehlab.org/ ppehlab.org/]). They collaborate with [https://projectarcc.org/ Project_ARCC] and the University of Michigan Libraries, among others.<br />
<br />
They hold real-life "Data Rescue" events ([http://www.ppehlab.org/datarescue/ ppehlab.org/datarescue/]) and can be followed on their [https://twitter.com/PPEHLab Twitter account].<br />
<br />
== Tools ==<br />
Their tooling is available at [https://envirodatagov.org/event-toolkit/ envirodatagov.org/event-toolkit/], with instructions [https://docs.google.com/document/d/1PeWefW2toThs-Pbw0CMv2us7wxQI0gRrP1LGuwMp_UQ/edit here]. They maintain a list of environmental datasets [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 here].<br />
<br />
== Results ==<br />
They form part of the University of North Texas Libraries' [http://digital2.library.unt.edu/nomination/eth2016/about/ End of Term Presidential Harvest 2016], in collaboration with the [[Library of Congress]] and the [[Internet Archive]]. Data they collect may also be hosted by the organisations holding the Data Rescue events, in a custom repository built using Amazon Web Services integrated with CKAN, or both.<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=Template:Navigation_box&diff=28965Template:Navigation box2017-01-24T14:40:14Z<p>Yan: add DataRefuge</p>
<hr />
<div><br clear="all" /><center><!--<br />
<br />
<br />
<br />
<br />
Rows are in alphabetical order, except "Current events" at the top and "About Archive Team" at the bottom.<br />
Items inside rows are in alphabetical order too.<br />
Easy : )<br />
<br />
<br />
<br />
<br />
--><br />
{| class="mw-collapsible mw-collapsed" style="border: 1px solid #aaa; background-color: #f9f9f9; color: black; margin: 0.5em 0 0.5em 1em; padding: 0.2em; font-size: 100%;"<br />
| colspan=3 align=center style="background: #ccccff;" | <span style="float: right;"><span class="plainlinks">[[{{fullurl:Template:Navigation_box}} view]]&nbsp;&nbsp;[[{{fullurl:Template:Navigation_box|action=edit}} edit]]</span>&nbsp;</span>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;'''[[Archive Team]]'''&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Archiveteam:Current events|Current events]]''' || [[Alive... OR ARE THEY]] {{·}} [[Deathwatch]] {{·}} [[Projects]] || rowspan=5 | [[File:Archiveteam.jpg|right|150px]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Archiving projects]]''' || [[APKMirror]] {{·}} [[Archive.is]] {{·}} [[BetaArchive]] {{·}} [[DataRefuge]] {{·}} [[Government Backup]] {{·}} [[Gmane]] {{·}} [[Internet Archive]] {{·}} [[It Died]] {{·}} [[Megalodon.jp]] {{·}} [[OldApps.com]] {{·}} [[OldVersion.com]] {{·}} [[OSBetaArchive]] {{·}} [[TEXTFILES.COM]] {{·}} [[The Dead, the Dying & The Damned]] {{·}} [[The Mail Archive]] {{·}} [[UK Web Archive]] {{·}} [[WebCite]] {{·}} [[Vaporwave.me]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Blogging''' || [[Blog.pl]] {{·}} [[Blogger]] {{·}} [[Blogster]] {{·}} [[Blogter.hu]] {{·}} [[Freeblog.hu]] {{·}} [[Fuelmyblog]] {{·}} [[Jux]] {{·}} [[LiveJournal]] {{·}} [[My Opera]] {{·}} [[Nolblog.hu]] {{·}} [[Open Diary]] {{·}} [[ownlog.com]] {{·}} [[Posterous]] {{·}} [[Powerblogs]] {{·}} [[Proust]] {{·}} [[Roon]] {{·}} [[Splinder]] {{·}} [[Tumblr]] {{·}} [[Vox]] {{·}} [[Weblog.nl]] {{·}} [[Windows Live Spaces]] {{·}} [[Wordpress.com]] {{·}} [[Xanga]] {{·}} [[Yahoo! Blog]] {{·}} [[Zapd]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Clown hosting|Cloud hosting]]/file sharing''' || [[ADrive|aDrive]] {{·}} [[AnyHub]] {{·}} [[Box]] {{·}} [[Dropbox]] {{·}} [[Docstoc]] {{·}} [[Google Drive]] {{·}} [[Google Groups Files]] {{·}} [[iCloud]] {{·}} [[Fileplanet]] {{·}} [[LayerVault]] {{·}} [[MediaCrush]] {{·}} [[MediaFire]] {{·}} [[Mega]] {{·}} [[MegaUpload]] {{·}} [[MobileMe]] {{·}} [[OneDrive]] {{·}} [[Pomf.se]] {{·}} [[RapidShare]] {{·}} [[Ubuntu One]] {{·}} [[Yahoo! Briefcase]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[:Category:Corporations|Corporations]]''' || [[Apple]] {{·}} [[IBM]] {{·}} [[Google]] {{·}} [[Lycos Europe]] {{·}} [[Microsoft]] {{·}} [[Yahoo!]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Events''' || [[Arab Spring]] {{·}} [[Great Ape-Snake War]] {{·}} [[Spanish Revolution]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Font Repos''' || [[Google Web Fonts]] {{·}} [[GNU FreeFont]] {{·}} [[Fontspace]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Forums/Message boards''' || colspan=2 | [[4chan]] {{·}} [[Captain Luffy Forums]] {{·}} [[College Confidential]] {{·}} [[DSLReports]] {{·}} [[ESPN Forums]] {{·}} [[forums.starwars.com]] {{·}} [[HeavenGames]] {{·}} [[Invisionfree]] {{·}} [[The Classic Horror Film Board]] {{·}} [[Yahoo! Messages]] {{·}} [[Yahoo! Neighbors]] {{·}} [[Yuku.com]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Games|Gaming]]''' || colspan=2 | [[Atomicgamer]] {{·}} [[City of Heroes]] {{·}} [[Club Nintendo]] {{·}} [[CSGO Lounge|CS:GO Lounge]] {{·}} [[Desura]] {{·}} [[Dota 2 Lounge]] {{·}} [[Emulation Zone]] {{·}} [[GameMaker Sandbox]] {{·}} [[GameTrailers]] {{·}} [[Halo]] {{·}} [[HLTV.org]] {{·}} [[Infinite Crisis]] {{·}} [[Minecraft.net]] {{·}} [[Player.me]] {{·}} [[Playfire]] {{·}} [[Steam]] {{·}} [[SteamDB]] {{·}} [[TF2 Outpost]] {{·}} [[Warhammer]] {{·}} [[Xfire]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Image hosting]]''' || [[500px]] {{·}} [[AOL Pictures]] {{·}} [[Blipfoto]] {{·}} [[Blingee]] {{·}} [[Canv.as]] {{·}} [[Camera+]] {{·}} [[Cameroid]] {{·}} [[DailyBooth]] {{·}} [[Degree Confluence Project]] {{·}} [[deviantART]] {{·}} [[Demotivalo.net]] {{·}} [[Flickr]] {{·}} [[Fotoalbum.hu]] {{·}} [[Fotolog.com]] {{·}} [[Fotopedia]] {{·}} [[Frontback]] {{·}} [[Geograph Britain and Ireland]] {{·}} [[GTF Képhost]] {{·}} [[ImageShack]] {{·}} [[Imgur]] {{·}} [[Inkblazers]] {{·}} [[Instagr.am]] {{·}} [[Kepfeltoltes.hu]] {{·}} [[Kephost.com]] {{·}} [[Kephost.hu]] {{·}} [[Kepkezelo.com]] {{·}} [[Keptarad.hu]] {{·}} [[Madden GIFERATOR]] {{·}} [[MLKSHK]] {{·}} [[Microsoft Clip Art]] {{·}} [[Microsoft Photosynth]] {{·}} [[Nokia Memories]] {{·}} [[noob.hu]] {{·}} [[Odysee]] {{·}} [[Panoramio]] {{·}} [[Photobucket]] {{·}} [[Picasa]] {{·}} [[Picplz]] {{·}} [[Pixiv]] {{·}} [[PSharing]] {{·}} [[Ptch]] {{·}} [[puu.sh]] {{·}} [[Rawporter]] {{·}} [[Relay.im]] {{·}} [[ScreenshotsDatabase.com]] {{·}} [[Snapjoy]] {{·}} [[Streetfiles]] {{·}} [[Tabblo]] {{·}} [[Trovebox]] {{·}} [[TwitPic]] {{·}} [[Wallbase]] {{·}} [[Wallhaven]] {{·}} [[Webshots]] {{·}} [[Wikimedia Commons]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Knowledge/[[Wikis]]''' || colspan=2 | [[arXiv]] {{·}} [[Citizendium]] {{·}} [[Clipboard.com]] {{·}} [[Deletionpedia]] {{·}} [[EditThis]] {{·}} [[Encyclopedia Dramatica]] {{·}} [[Etherpad]] {{·}} [[Everything2]] {{·}} [[infoAnarchy]] {{·}} [[GeoNames]] {{·}} [[GNUPedia]] {{·}} [[Google Books]] ([[Google Books Ngram]]) {{·}} [[Horror Movie Database]] {{·}} [[Insurgency Wiki]] {{·}} [[Knol]] {{·}} [[Library Genesis]] {{·}} [[Lost Media Wiki]] {{·}} [[Neoseeker.com]] {{·}} [[Notepad.cc]] {{·}} [[Nupedia]] {{·}} [[OpenCourseWare]] {{·}} [[OpenStreetMap]] {{·}} [[Orain]] {{·}} [[Pastebin]] {{·}} [[Patch.com]] {{·}} [[Project Gutenberg]] {{·}} [[Puella Magi]] {{·}} [[Referata]] {{·}} [[Resedagboken]] {{·}} [[SongMeanings]] {{·}} [[ShoutWiki]] {{·}} [[The Internet Movie Database]] {{·}} [[TropicalWikis]] {{·}} [[Uncyclopedia]] {{·}} [[Urban Dictionary]] {{·}} [[Webmonkey]] {{·}} [[Wikia]] {{·}} [[Wikidot]] {{·}} [[WikiHow]] {{·}} [[Wikkii]] {{·}} [[WikiLeaks]] {{·}} [[Wikipedia]] ([[Simple English Wikipedia]]) {{·}} [[Wikispaces]] {{·}} [[Wikispot]] {{·}} [[Wik.is]] {{·}} [[Wiki-Site]] {{·}} [[WikiTravel]] {{·}} [[Word Count Journal]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Magazines/Blogs/News''' || colspan=2 | [[Cyberpunkreview.com]] {{·}} [[Game Developer Magazine]] {{·}} [[Gigaom]] {{·}} [[Helium]] {{·}} [[JPG Magazine]] {{·}} [[Polygamia.pl]] {{·}} [[San Fransisco Bay Guardian]] {{·}} [[Scoop]] {{·}} [[Regretsy]] {{·}} [[Yahoo! Voices]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Microblogging]]''' || colspan=2 | [[Heello]] {{·}} [[Identi.ca]] {{·}} [[Jaiku]] {{·}} [[Mommo.hu]] {{·}} [[Plurk]] {{·}} [[Sina Weibo]] {{·}} [[Twitter]] {{·}} [[TwitLonger]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Music/Audio''' || colspan=2 | [[AOL Music]] {{·}} [[Audimated.com]] {{·}} [[Cinch]] {{·}} [[digCCmixter]] {{·}} [[Dogmazic.net]] {{·}} [[Earbits]] {{·}} [[exfm]] {{·}} [[Free Music Archive]] {{·}} [[Gogoyoko]] {{·}} [[Indaba Music]] {{·}} [[Instacast]] {{·}} [[Jamendo]] {{·}} [[Last.fm]] {{·}} [[Music Unlimited]] {{·}} [[MOG]] {{·}} [[PureVolume]] {{·}} [[Reverbnation]] {{·}} [[ShareTheMusic]] {{·}} [[SoundCloud]] {{·}} [[Soundpedia]] {{·}} [[This Is My Jam]] {{·}} [[TuneWiki]] {{·}} [[Twaud.io]] {{·}} [[WinAmp]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''People''' || colspan=2 | [[Aaron Swartz]] {{·}} [[Michael S. Hart]] {{·}} [[Steve Jobs]] {{·}} [[Mark Pilgrim]] {{·}} [[Dennis Ritchie]] {{·}} [[Len Sassaman Project]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Protocols/Infrastructure''' || colspan=2 | [[FTP]] {{·}} [[Gopher]] {{·}} [[IRC]] {{·}} [[Usenet]] {{·}} [[World Wide Web]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Q&A''' || colspan=2 | [[Askville]] {{·}} [[Answerbag]] {{·}} [[Answers.com]] {{·}} [[Ask.com]] {{·}} [[Askalo]] {{·}} [[Baidu Knows]] {{·}} [[Blurtit]] {{·}} [[ChaCha]] {{·}} [[Experts Exchange]] {{·}} [[Formspring]] {{·}} [[GirlsAskGuys]] {{·}} [[Google Answers]] {{·}} [[Google Baraza]] {{·}} [[JustAnswer]] {{·}} [[MetaFilter]] {{·}} [[Quora]] {{·}} [[Retrospring]] {{·}} [[StackExchange]] {{·}} [[The AnswerBank]] {{·}} [[The Internet Oracle]] {{·}} [[Uclue]] {{·}} [[WikiAnswers]] {{·}} [[Yahoo! Answers]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Recipes/Food''' || colspan=2 | [[Allrecipes]] {{·}} [[Epicurious]] {{·}} [[Food.com]] {{·}} [[Foodily]] {{·}} [[Food Network]] {{·}} [[Punchfork]] {{·}} [[ZipList]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Social bookmarking''' || colspan=2 | [[Addinto]] {{·}} [[Backflip]] {{·}} [[Balatarin]] {{·}} [[BibSonomy]] {{·}} [[Bkmrx]] {{·}} [[Blinklist]] {{·}} [[BlogMarks]] {{·}} [[BookmarkSync]] {{·}} [[CiteULike]] {{·}} [[Connotea]] {{·}} [[Delicious]] {{·}} [[Designer News]] {{·}} [[Digg]] {{·}} [[Diigo]] {{·}} [[Dir.eccion.es]] {{·}} [[Evernote]] {{·}} [[Excite Bookmark]] {{·}} [[Faves]] {{·}} [[Favilous]] {{·}} [[folkd]] {{·}} [[Freelish]] {{·}} [[Getboo]] {{·}} [[GiveALink.org]] {{·}} [[Gnolia]] {{·}} [[Google Bookmarks]] {{·}} [[Hacker News]] {{·}} [[HeyStaks]] {{·}} [[IndianPad]] {{·}} [[Kippt]] {{·}} [[Knowledge Plaza]] {{·}} [[Licorize]] {{·}} [[Linkwad]] {{·}} [[Menéame]] {{·}} [[Microsoft Developer Network]] {{·}} [[myVIP]] {{·}} [[Mister Wong]] {{·}} [[My Web]] {{·}} [[Mylink Vault]] {{·}} [[Newsvine]] {{·}} [[Oneview]] {{·}} [[Pearltrees]] {{·}} [[Pinboard]] {{·}} [[Pocket]] {{·}} [[Propeller.com]] {{·}} [[Reddit]] {{·}} [[sabros.us]] {{·}} [[Scloog]] {{·}} [[Scuttle]] {{·}} [[Simpy]] {{·}} [[SiteBar]] {{·}} [[Slashdot]] {{·}} [[Squidoo]] {{·}} [[StumbleUpon]] {{·}} [[Twine]] {{·}} [[Vizited]] {{·}} [[Yummymarks]] {{·}} [[Xmarks]] {{·}} [[Yahoo! Buzz]] {{·}} [[Zootool]] {{·}} [[Zotero]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Social network|Social networks]]''' || colspan=2 | [[Bebo]] {{·}} [[BlackPlanet]] {{·}} [[Classmates.com]] {{·}} [[Cyworld]] {{·}} [[Dogster]] {{·}} [[Dopplr]] {{·}} [[douban]] {{·}} [[Ello]] {{·}} [[Facebook]] {{·}} [[Flixster]] {{·}} [[FriendFeed]] {{·}} [[Friendster]] {{·}} [[Friends Reunited]] {{·}} [[Gaia Online]] {{·}} [[Google+]] {{·}} [[Habbo]] {{·}} [[hi5]] {{·}} [[Hyves]] {{·}} [[iWiW]] {{·}} [[LinkedIn]] {{·}} [[Miiverse]] {{·}} [[mixi]] {{·}} [[MyHeritage]] {{·}} [[MyLife]] {{·}} [[Myspace]] {{·}} [[myVIP]] {{·}} [[Netlog]] {{·}} [[Odnoklassniki]] {{·}} [[Orkut]] {{·}} [[Plaxo]] {{·}} [[Qzone]] {{·}} [[Renren]] {{·}} [[Skyrock]] {{·}} [[Sonico.com]] {{·}} [[Storylane]] {{·}} [[Tagged]] {{·}} [[tvtag]] {{·}} [[Upcoming]] {{·}} [[Viadeo]] {{·}} [[Vine]] {{·}} [[Vkontakte]] {{·}} [[WeeWorld]] {{·}} [[Weibo]] {{·}} [[Wretch]] {{·}} [[Yahoo! Groups]] {{·}} [[Yahoo! Stars India]] {{·}} [[Yahoo! Upcoming]] {{·}} [[Social network|more sites...]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Shopping/Retail''' || colspan=2 | [[Alibaba]] {{·}} [[AliExpress]] {{·}} [[Amazon]] {{·}} [[Apple Store]] {{·}} [[eBay]] {{·}} [[Printfection]] {{·}} [[RadioShack]] {{·}} [[Sears]] {{·}} [[Target]] {{·}} [[The Book Depository]] {{·}} [[ThinkGeek]] {{·}} [[Walmart]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Software/[[Code hosting services|code hosting]]''' || colspan=2 | [[Android Development]] {{·}} [[Alioth]] {{·}} [[Assembla]] {{·}} [[BerliOS]] {{·}} [[Betavine]] {{·}} [[Bitbucket]] {{·}} [[BountySource]] {{·}} [[Codecademy]] {{·}} [[CodePlex]] {{·}} [[Freepository]] {{·}} [[Free Software Foundation]] {{·}} [[GNU Savannah]] {{·}} [[GitHost]] {{·}} [[GitHub]] {{·}} [[GitHub Downloads]] {{·}} [[Gitorious]] {{·}} [[Gna!]] {{·}} [[Google Code]] {{·}} [[ibiblio]] {{·}} [[java.net]] {{·}} [[JavaForge]] {{·}} [[KnowledgeForge]] {{·}} [[Launchpad]] {{·}} [[LuaForge]] {{·}} [[Maemo]] {{·}} [[mozdev]] {{·}} [[OSOR.eu]] {{·}} [[OW2 Consortium]] {{·}} [[Openmoko]] {{·}} [[OpenSolaris]] {{·}} [[Ourproject.org]] {{·}} [[Ovi Store]] {{·}} [[Project Kenai]] {{·}} [[RubyForge]] {{·}} [[SEUL.org]] {{·}} [[SourceForge]] {{·}} [[Stypi]] {{·}} [[TestFlight]] {{·}} [[tigris.org]] {{·}} [[Transifex]] {{·}} [[TuxFamily]] {{·}} [[Yahoo! Downloads]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Torrenting/Piracy''' || colspan=2 | [[ExtraTorrent]] {{·}} [[EZTV]] {{·}} [[isoHunt]] {{·}} [[KickassTorrents]] {{·}} [[The Pirate Bay]] {{·}} [[Torrentz]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Video hosting]]''' || colspan=2 | [[Academic Earth]] {{·}} [[Blip.tv]] {{·}} [[Epic]] {{·}} [[Google Video]] {{·}} [[Justin.tv]] {{·}} [[Niconico]] {{·}} [[Nokia Trailers]] {{·}} [[Qwiki]] {{·}} [[Skillfeed]] {{·}} [[Stickam]] {{·}} [[TED Talks]] {{·}} [[Ticker.tv]] {{·}} [[Twitch.tv]] {{·}} [[Ustream]] {{·}} [[Videoplayer.hu]] {{·}} [[Viddler]] {{·}} [[Viddy]] {{·}} [[Vimeo]] {{·}} [[Vine]] {{·}} [[Vstreamers]] {{·}} [[Yahoo! Video]] {{·}} [[YouTube]] {{·}} [[Famous Internet videos]] ([[Me at the zoo]])<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[List of website hosts|Web hosting]]''' || [[Angelfire]] {{·}} [[Brace.io]] {{·}} [[BT Internet]] {{·}} [[CableAmerica Personal Web Space]] {{·}} [[Claranet Netherlands Personal Web Pages]] {{·}} [[Comcast Personal Web Pages]] {{·}} [[Extra.hu]] {{·}} [[FortuneCity]] {{·}} [[Free ProHosting]] {{·}} [[GeoCities]] ([[GeoCities Torrent Patch|patch]]) {{·}} [[Google Business Sitebuilder]] {{·}} [[Google Sites]] {{·}} [[Internet Centrum]] {{·}} [[MBinternet]] {{·}} [[MSN TV]] {{·}} [[Nwnyet]] {{·}} [[Parodius Networking]] {{·}} [[Prodigy.net]] {{·}} [[Saunalahti Iso G]] {{·}} [[Swipnet]] {{·}} [[Telenor]] {{·}} [[Tripod]] {{·}} [[University of Michigan personal webpages]] {{·}} [[Verizon Mysite]] {{·}} [[Verizon Personal Web Space]] {{·}} [[Webzdarma]] {{·}} [[Virgin Media]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Web applications''' || colspan=2 | [[Mailman]] {{·}} [[MediaWiki]] {{·}} [[phpBB]] {{·}} [[Simple Machines Forum]] {{·}} [[vBulletin]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Other''' || colspan=2 | [[800notes]] {{·}} [[AOL]] {{·}} [[Akoha]] {{·}} [[Ancestry.com]] {{·}} [[April Fools' Day]] {{·}} [[Amplicate]] {{·}} [[AutoAdmit]] {{·}} [[Bre.ad]] {{·}} [[Circavie]] {{·}} [[Cobook]] {{·}} [[Co.mments]] {{·}} [[Countdown]] {{·}} [[Distill]] {{·}} [[Dmoz]] {{·}} [[Easel]] {{·}} [[Eircode]] {{·}} [[Electronic Frontier Foundation]] {{·}} [[FanFiction.Net]] {{·}} [[Feedly]] {{·}} [[Ficlets]] {{·}} [[Forrst]] {{·}} [[FunnyExam.com]] {{·}} [[FurAffinity]] {{·}} [[Google Helpouts]] {{·}} [[Google Moderator]] {{·}} [[Google Reader]] {{·}} [[ICQmail]] {{·}} [[IFTTT]] {{·}} [[Jajah]] {{·}} [[JuniorNet]] {{·}} [[Lulu Poetry]] {{·}} [[Mobile Phone Applications]] {{·}} [[Mochi Media]] {{·}} [[Mozilla Firefox]] {{·}} [[MyBlogLog]] {{·}} [[NBII]] {{·}} [[Neopets]] {{·}} [[Quantcast]] {{·}} [[Quizilla]] {{·}} [[Salon Table Talk]] {{·}} [[Shutdownify]] {{·}} [[Slidecast]] {{·}} [[SOPA blackout pages]] {{·}} [[starwars.yahoo.com]] {{·}} [[TechNet]] {{·}} [[Toshiba Support]] {{·}} [[USA-Gov]] {{·}} [[Volán]] {{·}} [[Widgetbox]] {{·}} [[Windows Technical Preview]] {{·}} [[Wunderlist]] {{·}} [[Zoocasa]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Information''' || colspan=2 | [[A Million Ways to Die on the Web]] {{·}} [[Backup Tips]] {{·}} [[Cheap storage]] {{·}} [[Collecting items randomly]] {{·}} [[Data compression algorithms and tools]] {{·}} [[Dev]] {{·}} [[Discovery Data]] {{·}} [[DOS Floppies]] {{·}} [[Fortress of Solitude]] {{·}} [[Keywords]] {{·}} [[Naughty List]] {{·}} [[Nightmare Projects]] {{·}} [[Rescuing Floppy Disks|Rescuing floppy disks]] {{·}} [[Rescuing optical media]] {{·}} [[Site exploration]] {{·}} [[The WARC Ecosystem]] {{·}} [[Working with ARCHIVE.ORG]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Projects]]''' || colspan=2 | [[ArchiveCorps]] {{·}} [[Audit2014]] {{·}} [[Emularity]] {{·}} [[Faceoff]] {{·}} [[FlickrFckr]] {{·}} [[Froogle]] {{·}} [[ftp-gov]] {{·}} [[INTERNETARCHIVE.BAK]] ([[Internet Archive Census]]) {{·}} [[IRC Quotes]] {{·}} [[Javascript Mess|JSMESS]] {{·}} [[Jsvlc|JSVLC]] {{·}} [[Just Solve the Problem 2012|Just Solve the Problem]] {{·}} [[NewsGrabber]] {{·}} [[Project Newsletter]] {{·}} [[Valhalla]] {{·}} [[Web Roasting]] ([[ISP Hosting]] {{·}} [[University Web Hosting]]) {{·}} [[Woohoo]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Software|Tools]]''' || colspan=2 | [[ArchiveBot]] {{·}} [[ArchiveTeam Warrior]] ([[Tracker]]) {{·}} [[Google Takeout]] {{·}} [[HTTrack options|HTTrack]] {{·}} [[Video|Video downloaders]] {{·}} [[Wget]] ([[Wget with Lua hooks|Lua]] {{·}} [[Wget with WARC output|WARC]])<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Teams''' || colspan=2 | [[Bibliotheca Anonoma]] {{·}} [[LibreTeam]] {{·}} [[URLTeam]] {{·}} [[Yahoo Video Warroom]] {{·}} [[WikiTeam]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''About [[Archive Team]]''' || colspan=2 | [[Introduction]] {{·}} [[Philosophy]] {{·}} [[Who We Are]] {{·}} [[Robots.txt|Our stance on robots.txt]] {{·}} [[Why Back Up?]] {{·}} [[Software]] {{·}} [[Formats]] {{·}} [[Storage Media]] {{·}} [[Recommended Reading]] {{·}} [[Films and documentaries about archiving]] {{·}} [[Talks]] {{·}} [[In The Media]] {{·}} [[Frequently Asked Questions|FAQ]]<br />
|}<br />
</center>[[Category:Archive Team]]<noinclude>[[Category:Templates]]</noinclude></div>Yanhttps://wiki.archiveteam.org/index.php?title=Government_Backup&diff=28964Government Backup2017-01-24T14:32:43Z<p>Yan: /* #DATAREFUGE */ rm link now there's a dedicated page</p>
<hr />
<div>__NOTOC__<br />
[[Image:Government data.jpg|300px|right]]<br />
<br />
''The US Government has an awful lot of data, and it's in a lot of places.'' The 2016 elections signalled a sea change in goals and ideals (although previous transitions have always involved such changes). In response, a number of groups and efforts have sprung up to ensure that as much government data as possible is backed up off-site.<br />
<br />
'''This page contains overviews of the effort by all the teams.'''<br />
<br />
=== Internet Archive ===<br />
<br />
[[Internet Archive]] has two teams, [[Wayback Machine|Wayback]] and [[Archive-It]] ([https://archive-it.org/ archive-it.org]), working through listings of government websites and data stores. They are working internally using Internet Archive's crawlers and environment.<br />
<br />
The results of these internal efforts are saved in [https://archive.org/details/EndOfTerm2016WebCrawls this collection]. (Note that other efforts, such as Archive Team's, also exist under this collection as sub-collections.)<br />
<br />
=== #DATAREFUGE ===<br />
<br />
The [[DataRefuge|Data Refuge]] project has [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 the following Google document] about climate datasets.<br />
<br />
=== Archive Team FTP Backup ===<br />
<br />
The [[ftp-gov|Archive Team project]] is backing up 750+ FTP sites hosted on .MIL and .GOV domains. Its two phases can be tracked [http://tracker.archiveteam.org/ftpdisco/ here] (discovery phase) and [http://tracker.archiveteam.org/ftp-gov/ here] (download phase). The results of this download are being sent to [https://archive.org/details/archiveteam_ftpgov this collection].<br />
<br />
=== Archive Team General Websites Download ===<br />
<br />
Besides the [[ftp-gov|FTP]] data download, Archive Team is also doing a general download (where possible) of many crawlable government websites, such as [[USA-Gov|usa.gov]].<br />
<br />
== Internet Archive Statements ==<br />
<br />
* [http://blog.archive.org/2016/11/09/us-election-results/ US Election Results] - Surprise at the outcome of the election and a call to keep libraries open.<br />
* [http://blog.archive.org/2016/11/11/contribute-to-the-2016-u-s-presidential-election-web-archive/ Please Help Build the 2016 End-of-Term Archive] - A call for assistance and volunteers<br />
* [http://blog.archive.org/2016/11/29/help-us-keep-the-archive-free-accessible-and-private/ Help Us Keep the Archive Free, Accessible, and Reader Private] - First entry indicating that the election has influenced efforts to back up the archive in Canada.<br />
* [http://blog.archive.org/2016/12/03/faqs-about-the-internet-archive-canada/ FAQs about the Internet Archive Canada] - Much needed clarification about the mirroring in Canada of the Internet Archive.<br />
* [http://blog.archive.org/2016/12/06/internet-archive-canada-and-national-security-letter-in-the-news-roundup/ Internet Archive Canada and National Security Letter in the News] - Roundup of press mentions about the mirroring efforts.<br />
* [http://blog.archive.org/2016/12/15/preserving-u-s-government-websites-and-data-as-the-obama-term-ends/ Preserving U.S. Government Websites and Data as the Obama Term Ends] - Notes by the head of Archive-It about efforts to run the End of Term archiving.<br />
* [http://blog.archive.org/2016/12/17/robots-txt-gov-mil-websites/ Robots.txt Files and Archiving .gov and .mil Websites] - The Internet Archive will no longer follow ROBOTS.TXT directives on .GOV and .MIL sites.<br />
* [http://blog.archive.org/2016/12/20/would-like-to-archive-government-web-services-not-just-web-sites-please-help/ Would Like to Archive Government Web Services, not just Web Sites– Please help] - Additional call to archive Government Web Services, not just Websites.<br />
<br />
== Notable Press Mentions and References ==<br />
<br />
''Note that the story oscillates between "Internet Archive is adding a mirror in Canada" and "Internet Archive is Moving to Canada".'' For anyone coming to this page cold: the Internet Archive has been building a mirror in Canada for a significant period of time and, as of 2016, has had a fully functioning facility there in some form for nearly a decade. The current effort merely sped up an inevitable timetable.<br />
<br />
* [http://www.theverge.com/2016/11/29/13778188/internet-archive-of-canada-backup-trump-surveillance-censorship The Internet Archive is building a Canadian copy to protect itself from Trump], The Verge, November 29, 2016<br />
* [http://www.nbcnews.com/news/us-news/internet-archive-web-s-warehouse-creating-trump-era-copy-canada-n689916 Internet Archive, Web's Warehouse, Creating Trump-Era Copy in Canada], NBC News, November 29, 2016<br />
* [http://www.dailykos.com/story/2016/11/30/1605487/-The-Internet-Archive-is-Moving-to-Canada The Internet Archive is "Moving to Canada"], The Daily Kos, November 30, 2016<br />
* [http://gothamist.com/2016/11/30/even_the_internet_is_getting_ready.php Even The Internet Archive Is Moving To Canada Because Of Trump], Gothamist, November 30, 2016<br />
* [http://www.huffingtonpost.ca/2016/11/30/archive-org-canada-trump_n_13330492.html Archive.org Moving To Canada Over Trump Censorship Fears], Huffington Post Canada, November 30, 2016</div>Yanhttps://wiki.archiveteam.org/index.php?title=DataRefuge&diff=28963DataRefuge2017-01-24T14:31:30Z<p>Yan: ce</p>
<hr />
<div>Data Refuge ([http://www.ppehlab.org/datarefuge #datarefuge]) is a project concerned with saving (United States) federal climate and environmental data, in response to the 2016 US Presidential election. It's an experiment by the Penn Program in the Environmental Humanities ([http://www.ppehlab.org/ ppehlab.org/]). They collaborate with [https://projectarcc.org/ Project_ARCC] and University of Michigan Libraries, among others.<br />
<br />
They hold real life "Data Rescue" events ([http://www.ppehlab.org/datarescue/ ppehlab.org/datarescue/]) and can be followed on their [https://twitter.com/PPEHLab Twitter account]. <br />
<br />
== Tools ==<br />
Their tooling can be found at [https://envirodatagov.org/event-toolkit/ envirodatagov.org/event-toolkit/]. Instructions can be found [https://docs.google.com/document/d/1PeWefW2toThs-Pbw0CMv2us7wxQI0gRrP1LGuwMp_UQ/edit here]. They maintain a list of environmental datasets [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 here].<br />
<br />
== Results ==<br />
They form part of the University of North Texas Libraries' [http://digital2.library.unt.edu/nomination/eth2016/about/ End of Term Presidential Harvest 2016], in collaboration with the [[Library of Congress]] and the [[Internet Archive]]. Data they grab may also be hosted by the organisations holding the Data Rescue events, in a custom repository built using Amazon Web Services integrated with CKAN, or both.</div>Yanhttps://wiki.archiveteam.org/index.php?title=DataRefuge&diff=28962DataRefuge2017-01-24T14:31:05Z<p>Yan: rm comma</p>
<hr />
<div>Data Refuge ([http://www.ppehlab.org/datarefuge #datarefuge]) is a project concerned with saving (United States) federal climate and environmental data, in response to the 2016 US Presidential election. It's an experiment by the Penn Program in the Environmental Humanities ([http://www.ppehlab.org/ ppehlab.org/]). They collaborate with [https://projectarcc.org/ Project_ARCC] and University of Michigan Libraries, among others.<br />
<br />
They hold real life "Data Rescue" events ([http://www.ppehlab.org/datarescue/ ppehlab.org/datarescue/]) and can be followed on their [https://twitter.com/PPEHLab Twitter account]. <br />
<br />
== Tools ==<br />
Their tooling can be found at [https://envirodatagov.org/event-toolkit/ envirodatagov.org/event-toolkit/]. Instructions can be found [https://docs.google.com/document/d/1PeWefW2toThs-Pbw0CMv2us7wxQI0gRrP1LGuwMp_UQ/edit here]. They maintain a list of environmental datasets [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 here].<br />
<br />
== Results ==<br />
They form part of the University of North Texas Libraries [http://digital2.library.unt.edu/nomination/eth2016/about/ End of Term Presidential Harvest 2016], in collaboration with [[Library of Congress]] and the [[Internet Archive]]. Data they grab may apparently also be hosted by either the organisations holding the Data Rescue events and/or in a custom repository built using Amazon Web Services integrated with CKAN.</div>Yanhttps://wiki.archiveteam.org/index.php?title=DataRefuge&diff=28961DataRefuge2017-01-24T14:30:50Z<p>Yan: /* Results */ fix attribution of eth</p>
<hr />
<div>Data Refuge ([http://www.ppehlab.org/datarefuge #datarefuge]) is a project concerned with saving (United States) federal climate and environmental data, in response to the 2016 US Presidential election. It's an experiment by the Penn Program in the Environmental Humanities ([http://www.ppehlab.org/ ppehlab.org/]). They collaborate with [https://projectarcc.org/ Project_ARCC] and University of Michigan Libraries, among others.<br />
<br />
They hold real life "Data Rescue" events ([http://www.ppehlab.org/datarescue/ ppehlab.org/datarescue/]) and can be followed on their [https://twitter.com/PPEHLab Twitter account]. <br />
<br />
== Tools ==<br />
Their tooling can be found at [https://envirodatagov.org/event-toolkit/ envirodatagov.org/event-toolkit/]. Instructions can be found [https://docs.google.com/document/d/1PeWefW2toThs-Pbw0CMv2us7wxQI0gRrP1LGuwMp_UQ/edit here]. They maintain a list of environmental datasets [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 here].<br />
<br />
== Results ==<br />
They form part of the University of North Texas Libraries, [http://digital2.library.unt.edu/nomination/eth2016/about/ End of Term Presidential Harvest 2016], in collaboration with [[Library of Congress]] and the [[Internet Archive]]. Data they grab may apparently also be hosted by either the organisations holding the Data Rescue events and/or in a custom repository built using Amazon Web Services integrated with CKAN.</div>Yanhttps://wiki.archiveteam.org/index.php?title=DataRefuge&diff=28960DataRefuge2017-01-24T14:26:41Z<p>Yan: </p>
<hr />
<div>Data Refuge ([http://www.ppehlab.org/datarefuge #datarefuge]) is a project concerned with saving (United States) federal climate and environmental data, in response to the 2016 US Presidential election. It's an experiment by the Penn Program in the Environmental Humanities ([http://www.ppehlab.org/ ppehlab.org/]). They collaborate with [https://projectarcc.org/ Project_ARCC] and University of Michigan Libraries, among others.<br />
<br />
They hold real life "Data Rescue" events ([http://www.ppehlab.org/datarescue/ ppehlab.org/datarescue/]) and can be followed on their [https://twitter.com/PPEHLab Twitter account]. <br />
<br />
== Tools ==<br />
Their tooling can be found at [https://envirodatagov.org/event-toolkit/ envirodatagov.org/event-toolkit/]. Instructions can be found [https://docs.google.com/document/d/1PeWefW2toThs-Pbw0CMv2us7wxQI0gRrP1LGuwMp_UQ/edit here]. They maintain a list of environmental datasets [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 here].<br />
<br />
== Results ==<br />
They form part of the [[Internet Archive]]'s [http://digital2.library.unt.edu/nomination/eth2016/about/ End of Term Presidential Harvest 2016], but data they grab may apparently be hosted by either the organisations holding the Data Rescue events and/or in a custom repository built using Amazon Web Services integrated with CKAN.</div>Yanhttps://wiki.archiveteam.org/index.php?title=DataRefuge&diff=28959DataRefuge2017-01-24T14:22:19Z<p>Yan: add link</p>
<hr />
<div>Data Refuge ([http://www.ppehlab.org/datarefuge #datarefuge]) is a project concerned with saving (United States) federal climate and environmental data. It's an experiment by the Penn Program in the Environmental Humanities ([http://www.ppehlab.org/ ppehlab.org/]).<br />
<br />
They hold real life "Data Rescue" events ([http://www.ppehlab.org/datarescue/ ppehlab.org/datarescue/]) and can be followed on their [https://twitter.com/PPEHLab Twitter account]. <br />
<br />
== Tools ==<br />
Their tooling can be found at [https://envirodatagov.org/event-toolkit/ envirodatagov.org/event-toolkit/].<br />
<br />
== Results ==<br />
They form part of the [[Internet Archive]]'s [http://digital2.library.unt.edu/nomination/eth2016/about/ End of Term Presidential Harvest 2016], but data they grab may apparently be hosted by either the organisations holding the Data Rescue events and/or in a custom repository built using Amazon Web Services integrated with CKAN.</div>Yanhttps://wiki.archiveteam.org/index.php?title=DataRefuge&diff=28958DataRefuge2017-01-24T14:17:05Z<p>Yan: Created page with "Data Refuge (#datarefuge) is a project concerned with saving (United States) federal climate and environmental data. It's an experiment by the Penn Program in the Environmenta..."</p>
<hr />
<div>Data Refuge (#datarefuge) is a project concerned with saving (United States) federal climate and environmental data. It's an experiment by the Penn Program in the Environmental Humanities ([http://www.ppehlab.org/ ppehlab.org/]).<br />
<br />
They hold real life "Data Rescue" events ([http://www.ppehlab.org/datarescue/ ppehlab.org/datarescue/]) and can be followed on their [https://twitter.com/PPEHLab Twitter account]. <br />
<br />
== Tools ==<br />
Their tooling can be found at [https://envirodatagov.org/event-toolkit/ envirodatagov.org/event-toolkit/].<br />
<br />
== Results ==<br />
They form part of the [[Internet Archive]]'s [http://digital2.library.unt.edu/nomination/eth2016/about/ End of Term Presidential Harvest 2016], but data they grab may apparently be hosted by either the organisations holding the Data Rescue events and/or in a custom repository built using Amazon Web Services integrated with CKAN.</div>Yanhttps://wiki.archiveteam.org/index.php?title=User:Yan/Dev&diff=26761User:Yan/Dev2017-01-03T17:56:30Z<p>Yan: Created page with "{{:Dev}} =Infrastructure overview= {{:Dev/Infrastructure}} =Source code repositories= {{:Dev/Source Code}} =Warrior overview= {{:Dev/Warrior}} =Starting a new project= {{:Dev/..."</p>
<hr />
<div>{{:Dev}}<br />
=Infrastructure overview=<br />
{{:Dev/Infrastructure}}<br />
=Source code repositories=<br />
{{:Dev/Source Code}}<br />
=Warrior overview=<br />
{{:Dev/Warrior}}<br />
=Starting a new project=<br />
{{:Dev/New Project}}<br />
=Writing Seesaw grab scripts=<br />
{{:Dev/Seesaw}}<br />
=Setting up a tracker=<br />
{{:Dev/Tracker}}<br />
=Setting up Rsync and Megawarc Factory=<br />
{{:Dev/Staging}}<br />
=Project management and leadership=<br />
{{:Dev/Project Management}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=NUjij&diff=26758NUjij2017-01-03T14:34:46Z<p>Yan: mark as closed and saved</p>
<hr />
<div>{{Infobox project<br />
| title = NUjij<br />
| logo = nujij-logo.png<br />
| image = nujij_screenshot.png<br />
| URL = http://nujij.nl<br />
| project_status = {{closed}}<br />
| archiving_status = {{rescued}} [https://archive.org/details/archiveteam_nujij archiveteam_nujij]<br />
| tracker = [http://tracker.archiveteam.org/nujij nujij]<br />
| source = [https://github.com/ArchiveTeam/nujij-grab nujij-grab]<br />
}}<br />
<br />
'''NUjij''' is a discussion platform for the Dutch '''NU.nl''' news website.<br />
<br />
It was shut down on September 12, 2016.<br />
<br />
== Announcement ==<br />
<br />
The notice in Dutch can be read on [http://nujij.nl NUjij.nl], in a yellow box on the right.<br />
<br />
{{BilingualBox|Dutch|English<br />
|NU.nl heeft besloten per 12 september te stoppen met open reactieplatform NUjij.nl. We accepteren daarom geen nieuwe accounts meer.<br />
<br />
Vanaf 12 september is het niet meer mogelijk te discussiëren via NUjij. De site zal verdwijnen en als vervanging wordt een functie in de site en apps van NU.nl geïntroduceerd waarmee bezoekers foto's, video's, tips en foutjes kunnen insturen.<br />
<br />
Voor meer informatie kun je terecht op: http://www.nu.nl/nujij-vragen.html<br />
|NU.nl has decided to discontinue the open comment platform NUjij.nl as of September 12. For this reason, we no longer accept new accounts.<br />
<br />
From September 12, it will no longer be possible to hold discussions on NUjij. The site will disappear, and as a replacement a feature will be introduced in the site and apps of NU.nl that visitors can use to submit photos, videos, tips and corrections.<br />
<br />
For more information, please visit http://www.nu.nl/nujij-vragen.html<br />
<br />
<small>(Translation by [[user:joepie91|joepie91]])</small>}}<br />
<br />
== How can I help? ==<br />
<br />
<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and select ''NUjij'' there. (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as ArchiveTeam may at times prioritize another project.)<br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/nujij-grab github.com/ArchiveTeam/nujij-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd nujij-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>nujij-grab</code>. After the updating has finished, re-launch the script.<br />
|}<br />
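The graceful-stop procedure described in the box above can be sketched as a pair of small shell helpers. This is only an illustration: the function names <code>request_stop</code> and <code>clear_stop</code> are made up for this example and are not part of the grab script; the only thing taken from the instructions is that an empty file named STOP in the script's folder asks it to finish its current items and exit.

```shell
#!/bin/sh
# Hypothetical helpers around the STOP-file convention described above.
# The function names are illustrative; only `touch STOP` comes from the page.

# Ask a grab script running in directory $1 to finish its current
# item(s) and then exit, instead of being killed mid-item.
request_stop() {
    touch "$1/STOP"
}

# Remove the STOP file so that the script can be started again.
clear_stop() {
    rm -f "$1/STOP"
}
```

Assuming the script was cloned into <code>nujij-grab</code>, one would call <code>request_stop nujij-grab</code>, wait for the script to exit, and call <code>clear_stop nujij-grab</code> before re-launching it.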
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by the ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and be available – hopefully – forever. However, storing it costs thousands of dollars in the long run. So, if you can afford it, please consider donating to the Internet Archive, so that this piece of history can be kept for us all. http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help in other projects, want to learn more about ArchiveTeam, or even help in development in general, navigate to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=Fotolog.com&diff=26757Fotolog.com2017-01-03T14:33:47Z<p>Yan: mark as saved</p>
<hr />
<div>{{Infobox project<br />
| title = Fotolog.com<br />
| URL = {{url|http://fotolog.com}}<br />
| description = Fotolog – Share photos. Make friends. It's easy!<br />
| logo = Fotolog-logo.png<br />
| image = Fotolog_screenshot.png<br />
| project_status = {{online}}<br />
| archiving_status = {{rescued}} [https://archive.org/details/archiveteam_fotolog archiveteam_fotolog]<br />
| tracker = [http://tracker.archiveteam.org/fotolog fotolog]<br />
| source = [https://github.com/ArchiveTeam/fotolog-grab fotolog-grab]<br />
| irc = fotologout<br />
}}<br />
<br />
'''Fotolog.com''' is an international photo sharing and social networking platform that started in 2002 on fotolog.net and is currently owned by the Brazilian company Doutíssima.<br />
<br />
After a few weeks of downtime, it was announced on the main page on January 26, 2016 that fotolog.com was shutting down on February 20, 2016.<br />
<br />
As of February 2016, fotolog.com has more than 33.5 million users.<br />
<br />
As of July 2016, the site is up and the shutdown notice has disappeared. It seems that it is not shutting down after all, but we are still archiving it.<ref>http://archive.fart.website/bin/irclogger_log/archiveteam?date=2016-04-12,Tue&sel=111#l107</ref><br />
<br />
== Shutdown ==<br />
The shutdown message on the top of the site, written in three languages, reads in English as follows:<br />
<br />
: "Dear members, the Fotolog platform could be permanently unavailable in the upcoming weeks.<br />
: We wanted to inform you of this matter, as hosting provider, so you can retrieve your data as quickly as possible and in any event before February the 20th of 2016.<br />
: We hope you can continue your blogs and your photos sharing on other platforms.<br />
: Do not hesitate to share this information with all other members of the community."<br />
<br />
== How can I help? ==<br />
<br />
<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and select ''Fotolog'' there. (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as ArchiveTeam may at times prioritize another project.)<br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/fotolog-grab github.com/ArchiveTeam/fotolog-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd fotolog-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>fotolog-grab</code>. After the update has finished, re-launch the script.<br />
|}<br />
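<br />
The graceful-stop behaviour described above can be pictured with a minimal Python sketch. This is not the actual fotolog-grab code: the names <code>process_items</code> and <code>handle</code> are hypothetical, and the only detail taken from the text is that a file named STOP in the script's folder makes the script stop once the current item is finished.<br />

```python
from pathlib import Path


def process_items(items, handle, workdir="."):
    """Process items one by one, stopping gracefully when a STOP file exists.

    `handle` is a hypothetical callback that fully processes a single item.
    Returns the list of items that were completed.
    """
    stop_file = Path(workdir) / "STOP"
    done = []
    for item in items:
        # The check happens only between items, never in the middle of one,
        # so no item is left half-finished.
        if stop_file.exists():
            break
        handle(item)
        done.append(item)
    return done
```

In this sketch, creating the STOP file while an item is being handled lets that item complete; removing the file before the next run allows processing to resume.<br />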
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by the ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and be available – hopefully – forever. However, storing it costs thousands of dollars in the long run. So, if you can afford it, please consider donating to the Internet Archive, so that this piece of history can be kept for us all. http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help in other projects, want to learn more about ArchiveTeam, or even help in development in general, navigate to the [[Main Page]] of this wiki. From there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.<br />
<br />
{{Navigation box}}<br />
[[Category: Social networks]]<br />
[[Category: Image hosting services]]</div>Yanhttps://wiki.archiveteam.org/index.php?title=Panoramio&diff=26756Panoramio2017-01-03T14:33:14Z<p>Yan: shorten link</p>
<hr />
<div>{{Infobox project<br />
| title = Panoramio<br />
| logo = Panoramio logo.jpg<br />
| image = Panoramio - Fotos del mundo 1294868654701.png<br />
| description = Panoramio mainpage in 2011-01-12<br />
| URL = http://www.panoramio.com<br />
| project_status = {{offline}}<br />
| archiving_status = {{saved}} [https://archive.org/details/archiveteam_panoramio archiveteam_panoramio]<br />
| irc = paranormio<br />
| tracker = [http://tracker.archiveteam.org/panoramio/ panoramio]<br />
| source = [https://github.com/ArchiveTeam/panoramio-grab panoramio-grab]<br />
}}<br />
<br />
'''Panoramio''' is an image hosting service provided by [[Google]]. The service lets people post pictures tagged with the location where they were taken. Photos can be searched by location.<br />
<br />
== Stats ==<br />
* 134M photos? http://www.panoramio.com/photo/134000000 (it works)<br />
* There are gaps in the IDs, but there are many more than 61M photos (that was the 2011 figure). E.g. http://www.panoramio.com/photo/110000000<br />
<br />
== Vital signs ==<br />
<br />
On September 16, 2014, Google announced they would be "migrating" Panoramio over to Google Maps. During the migration, site features such as comments, favorite photographers, and groups would be deleted.<ref>https://groups.google.com/forum/#!topic/panoramio-questions-support/R5toz0EAB8k</ref> On September 23, the founders of Panoramio launched a [http://www.change.org/p/google-larry-and-sergey-google-keep-the-panoramio-community-alive petition] asking Google not to shut the website down.<br />
<br />
[[commons:User:Panoramio upload bot]] keeps uploading the freely licensed Panoramio images to Wikimedia Commons, for better usage as well as preservation. It finds that about 1% of the IDs correspond to an existing and free image (about 190k total uploads as of October 2016).<br />
<br />
== November 2016 closure ==<br />
<br />
In October 2016, users received the following notification:<br />
<br />
<blockquote><br />
Back in 2014, we announced our intention to retire Panoramio in order to invest our efforts into improving photo-sharing experiences directly inside Google Maps. In response to your feedback, we postponed these plans and worked to add features to Maps that better support the level of engagement that you have enjoyed with Panoramio. Today, with [https://support.google.com/maps/answer/2622947 photo upload tools] in Google Maps and our [https://www.google.com/local/guides/ Local Guides] program, we are providing easy options for you to share your photos with an active and growing community. As such, we’ve decided to now close down Panoramio. To make this transition easier, we’ll provide several options to continue sharing photos through other services. If you choose, you can also export all your data and take it somewhere else.<br />
<br />
Because you have [http://www.panoramio.com/help/gplus-faq linked your Panoramio profile with a Google account], all your Panoramio photos will be copied to the [https://support.google.com/picasa/answer/7008270 Google Album Archive] at full resolution after Panoramio goes away. These copied photos will not use any of your Google storage quota. However, unless you [http://www.panoramio.com/help/gplus-faq upgrade to a Google+ account], your Panoramio photos will stop appearing in Google Maps. This is because Panoramio nicknames will no longer be supported, and all other photos in Maps are attributed to Google+ user names.<br />
<br />
After November 4, 2016, you’ll continue to have access to your photos in Panoramio for a year, but you will no longer be able to add new photos, likes, or comments. Below, we’ve included resources to help you manage or export your data. You can visit your Panoramio profile to see what photos you've added. If you have already [http://www.panoramio.com/help/gplus-faq linked] your profile with a Google account, your Panoramio data will automatically be saved.<br />
<br />
1) Keep your photos in Album Archive<br />
<br />
Because you have a Google account linked with your Panoramio account, we will automatically copy your Panoramio photos to the [https://support.google.com/picasa/answer/7008270 Google Album Archive] when Panoramio is retired in November 2017.<br />
<br />
Copied photos will not use any of your Google storage quota.<br />
If you [http://www.panoramio.com/help/gplus-faq activate Google+] on your account, then eligible images and your View Counts will be transferred into Google Maps and will be visible when you sign in to Maps and [https://www.google.com/maps/contrib/ access the Contributions screen] from the main menu.<br />
<br />
2) Export your photos to a local zip file<br />
<br />
Visit [https://myaccount.google.com/intro/privacy takeout.google.com] and follow the instructions there.<br />
<br />
3) Become a Local Guide to keep contributing <br />
<br />
To keep adding photos to Google Maps and engage with a growing community of photographers, [https://maps.google.com/localguides/signup join the Local Guides] program. You can earn points and unlock rewards for photos submitted with a Google account when they are linked to a point of interest or business. [https://support.google.com/local-guides/answer/6348743 Many of your Panoramio photos may already be counted].<br />
<br />
If you prefer to no longer share your photos with Google Maps, you can delete your Panoramio photos or your entire account at any time. <br />
<br />
You can delete your Panoramio account (and photos) immediately by logging in, opening "Settings", and clicking the "Delete your account" option. You can also navigate to this page directly by using [http://www.panoramio.com/delete_own_account this link].<br />
If you delete any photos (or delete your entire account) before Panoramio is retired in November 2017, that content will not be automatically copied to the Google Album Archive and will stop appearing in Google Maps.<br />
<br />
Please see our [http://www.panoramio.com/maps-faq Help Center article] for more information and the various options you have to save, transfer, or delete your content. We’ve appreciated your contributions over the years and hope you will continue to share amazing photos with the world.<br />
<br />
Thank you,<br />
<br />
The Panoramio team<br />
</blockquote><br />
<br />
== How can I help? ==<br />
<br />
<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and select ''Panoramio'' there. (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as ArchiveTeam may at some point prioritize another project.)<br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/panoramio-grab github.com/ArchiveTeam/panoramio-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd panoramio-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>panoramio-grab</code>. After the update has finished, re-launch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by the ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and be available – hopefully – forever. However, storing it costs thousands of dollars in the long run. So, if you can afford it, please consider donating to the Internet Archive, so that this piece of history can be kept for us all. http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help in other projects, want to learn more about ArchiveTeam, or even help in development in general, navigate to the [[Main Page]] of this wiki. From there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.<br />
<br />
== Site structure ==<br />
* Photo IDs seem to be incremental. The smallest ID is http://www.panoramio.com/photo/15<br />
* Photos are hosted on http://static.panoramio.com<br />
** The original sized photo can be retrieved at http://static.panoramio.com/photos/original/ID.jpg<br />
** The license of the photo can be retrieved at http://www.panoramio.com/photo/ID in &lt;li class="license LICENSEABBR"&gt;HUMAN READABLE VERSION&lt;/li&gt;<br />
** The coordinates can be retrieved at http://www.panoramio.com/photo/ID in &lt;abbr class="latitude or longitude" title="decimal of coordinates"&gt;DMS coordinates&lt;/abbr&gt; and &lt;p id="place"&gt;Human readable place name&lt;/p&gt;<br />
* Comments are paginated on each image (20 comments per page.) For example: http://www.panoramio.com/photo/14748463?comment_page=20<br />
* User profile pages are somewhat incremental. For example, http://www.panoramio.com/user/1 (a co-founder of Panoramio) exists, but http://www.panoramio.com/user/2 doesn't.<br />
* Similarly, groups are somewhat incremental. For example http://www.panoramio.com/group/402000<br />
** A directory of group pages is available at http://www.panoramio.com/groups/directory, but it doesn't seem to include all the groups.<br />
* Favorite photographers are a bit tricky. The list is paginated and loaded via JavaScript. <br />
** The URL for each page is http://www.panoramio.com/user/USER_ID/get_favorite_users?size=16&page=PAGENUMBER&type=user<br />
** Each page has a next button. The very last page also has a next button, but it's disabled with the "ajax_next_link disabled" class.<br />
* Stats<br />
** User: http://www.panoramio.com/user/4120074/stats<br />
** Photo: http://www.panoramio.com/photo/31719645/stats<br />
* API page<br />
** http://www.panoramio.com/api/data/api.html<br />
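<br />
The URL patterns listed above can be collected into a few helpers. This is a minimal Python sketch, not part of panoramio-grab: all function names are made up for illustration, and the last-page check simply looks for the "ajax_next_link disabled" class string mentioned above, which assumes the class appears verbatim in the returned markup.<br />

```python
BASE = "http://www.panoramio.com"
STATIC = "http://static.panoramio.com"


def photo_page_url(photo_id):
    """Photo page; per the notes above it carries license and coordinates."""
    return f"{BASE}/photo/{photo_id}"


def original_photo_url(photo_id):
    """Original-size image on the static host."""
    return f"{STATIC}/photos/original/{photo_id}.jpg"


def comment_page_url(photo_id, page):
    """One page of comments (20 comments per page, per the notes above)."""
    return f"{BASE}/photo/{photo_id}?comment_page={page}"


def favorite_users_url(user_id, page, size=16):
    """One page of a user's favorite photographers."""
    return (f"{BASE}/user/{user_id}/get_favorite_users"
            f"?size={size}&page={page}&type=user")


def is_last_favorites_page(html):
    """Assumption: the final page marks its next button as disabled."""
    return "ajax_next_link disabled" in html
```

A crawler built on these helpers could walk photo IDs upwards and, for each user, request favorite-photographer pages until <code>is_last_favorites_page</code> returns true.<br />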
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
* http://www.panoramio.com<br />
* https://archive.org/details/archiveteam_panoramio<br />
<br />
{{Navigation box}}<br />
<br />
[[Category:Image hosting]]<br />
[[Category:Google]]</div>Yanhttps://wiki.archiveteam.org/index.php?title=Template:Url&diff=26751Template:Url2017-01-03T01:05:50Z<p>Yan: reflect archive.today's URL change</p>
<hr />
<div><span class="plainlinks">[{{{1|http://www.google.com}}} {{{2|{{{1|Google}}}}}}]&nbsp;<sup>[[http://web.archive.org/web/*/{{{1|http://www.google.com}}} IA]]&nbsp;[[http://www.webcitation.org/query?url={{{1|http://www.google.com}}} WebCite]]&nbsp;[[https://archive.is/{{{1|http://www.google.com}}} archive.is]]</sup></span></div>Yanhttps://wiki.archiveteam.org/index.php?title=Template:Navigation_box&diff=26750Template:Navigation box2017-01-02T16:38:11Z<p>Yan: add Government Backup</p>
<hr />
<div><br clear="all" /><center><!--<br />
<br />
<br />
<br />
<br />
Rows are in Alphabetic order. Except "Current events" at the top and "About Archive Team" at the bottom.<br />
Items inside rows are in Alphabetic order too.<br />
Easy : )<br />
<br />
<br />
<br />
<br />
--><br />
{| class="mw-collapsible mw-collapsed" style="border: 1px solid #aaa; background-color: #f9f9f9; color: black; margin: 0.5em 0 0.5em 1em; padding: 0.2em; font-size: 100%;"<br />
| colspan=3 align=center style="background: #ccccff;" | <span style="float: right;"><span class="plainlinks">[[{{fullurl:Template:Navigation_box}} view]]&nbsp;&nbsp;[[{{fullurl:Template:Navigation_box|action=edit}} edit]]</span>&nbsp;</span>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;'''[[Archive Team]]'''&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Archiveteam:Current events|Current events]]''' || [[Alive... OR ARE THEY]] {{·}} [[Deathwatch]] {{·}} [[Projects]] || rowspan=5 | [[File:Archiveteam.jpg|right|150px]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Archiving projects]]''' || [[APKMirror]] {{·}} [[Archive.is]] {{·}} [[BetaArchive]] {{·}} [[Government Backup]] {{·}} [[Gmane]] {{·}} [[Internet Archive]] {{·}} [[It Died]] {{·}} [[Megalodon.jp]] {{·}} [[OldApps.com]] {{·}} [[OldVersion.com]] {{·}} [[OSBetaArchive]] {{·}} [[TEXTFILES.COM]] {{·}} [[The Dead, the Dying & The Damned]] {{·}} [[The Mail Archive]] {{·}} [[UK Web Archive]] {{·}} [[WebCite]] {{·}} [[Vaporwave.me]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Blogging''' || [[Blog.pl]] {{·}} [[Blogger]] {{·}} [[Blogster]] {{·}} [[Blogter.hu]] {{·}} [[Freeblog.hu]] {{·}} [[Fuelmyblog]] {{·}} [[Jux]] {{·}} [[LiveJournal]] {{·}} [[My Opera]] {{·}} [[Nolblog.hu]] {{·}} [[Open Diary]] {{·}} [[ownlog.com]] {{·}} [[Posterous]] {{·}} [[Powerblogs]] {{·}} [[Proust]] {{·}} [[Roon]] {{·}} [[Splinder]] {{·}} [[Tumblr]] {{·}} [[Vox]] {{·}} [[Weblog.nl]] {{·}} [[Windows Live Spaces]] {{·}} [[Wordpress.com]] {{·}} [[Xanga]] {{·}} [[Yahoo! Blog]] {{·}} [[Zapd]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Clown hosting|Cloud hosting]]/file sharing''' || [[ADrive|aDrive]] {{·}} [[AnyHub]] {{·}} [[Box]] {{·}} [[Dropbox]] {{·}} [[Docstoc]] {{·}} [[Google Drive]] {{·}} [[Google Groups Files]] {{·}} [[iCloud]] {{·}} [[Fileplanet]] {{·}} [[LayerVault]] {{·}} [[MediaCrush]] {{·}} [[MediaFire]] {{·}} [[Mega]] {{·}} [[MegaUpload]] {{·}} [[MobileMe]] {{·}} [[OneDrive]] {{·}} [[Pomf.se]] {{·}} [[RapidShare]] {{·}} [[Ubuntu One]] {{·}} [[Yahoo! Briefcase]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[:Category:Corporations|Corporations]]''' || [[Apple]] {{·}} [[IBM]] {{·}} [[Google]] {{·}} [[Lycos Europe]] {{·}} [[Microsoft]] {{·}} [[Yahoo!]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Events''' || [[Arab Spring]] {{·}} [[Occupy movement]] {{·}} [[Spanish Revolution]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Font Repos''' || [[Google Web Fonts]] {{·}} [[GNU FreeFont]] {{·}} [[Fontspace]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Forums/Message boards''' || colspan=2 | [[4chan]] {{·}} [[Captain Luffy Forums]] {{·}} [[College Confidential]] {{·}} [[DSLReports]] {{·}} [[ESPN Forums]] {{·}} [[forums.starwars.com]] {{·}} [[HeavenGames]] {{·}} [[Invisionfree]] {{·}} [[The Classic Horror Film Board]] {{·}} [[Yahoo! Messages]] {{·}} [[Yahoo! Neighbors]] {{·}} [[Yuku.com]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Games|Gaming]]''' || colspan=2 | [[Atomicgamer]] {{·}} [[City of Heroes]] {{·}} [[Club Nintendo]] {{·}} [[CSGO Lounge|CS:GO Lounge]] {{·}} [[Desura]] {{·}} [[Dota 2 Lounge]] {{·}} [[Emulation Zone]] {{·}} [[GameMaker Sandbox]] {{·}} [[GameTrailers]] {{·}} [[Halo]] {{·}} [[HLTV.org]] {{·}} [[Infinite Crisis]] {{·}} [[Minecraft.net]] {{·}} [[Player.me]] {{·}} [[Playfire]] {{·}} [[Steam]] {{·}} [[SteamDB]] {{·}} [[TF2 Outpost]] {{·}} [[Warhammer]] {{·}} [[Xfire]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Image hosting]]''' || [[500px]] {{·}} [[AOL Pictures]] {{·}} [[Blipfoto]] {{·}} [[Blingee]] {{·}} [[Canv.as]] {{·}} [[Camera+]] {{·}} [[Cameroid]] {{·}} [[DailyBooth]] {{·}} [[Degree Confluence Project]] {{·}} [[deviantART]] {{·}} [[Demotivalo.net]] {{·}} [[Flickr]] {{·}} [[Fotoalbum.hu]] {{·}} [[Fotolog.com]] {{·}} [[Fotopedia]] {{·}} [[Frontback]] {{·}} [[Geograph Britain and Ireland]] {{·}} [[GTF Képhost]] {{·}} [[ImageShack]] {{·}} [[Imgur]] {{·}} [[Inkblazers]] {{·}} [[Instagr.am]] {{·}} [[Kepfeltoltes.hu]] {{·}} [[Kephost.com]] {{·}} [[Kephost.hu]] {{·}} [[Kepkezelo.com]] {{·}} [[Keptarad.hu]] {{·}} [[Madden GIFERATOR]] {{·}} [[MLKSHK]] {{·}} [[Microsoft Clip Art]] {{·}} [[Nokia Memories]] {{·}} [[noob.hu]] {{·}} [[Odysee]] {{·}} [[Panoramio]] {{·}} [[Photobucket]] {{·}} [[Picasa]] {{·}} [[Picplz]] {{·}} [[PSharing]] {{·}} [[Ptch]] {{·}} [[puu.sh]] {{·}} [[Rawporter]] {{·}} [[Relay.im]] {{·}} [[ScreenshotsDatabase.com]] {{·}} [[Snapjoy]] {{·}} [[Streetfiles]] {{·}} [[Tabblo]] {{·}} [[Trovebox]] {{·}} [[TwitPic]] {{·}} [[Wallbase]] {{·}} [[Wallhaven]] {{·}} [[Webshots]] {{·}} [[Wikimedia Commons]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Knowledge/[[Wikis]]''' || colspan=2 | [[arXiv]] {{·}} [[Citizendium]] {{·}} [[Clipboard.com]] {{·}} [[Deletionpedia]] {{·}} [[EditThis]] {{·}} [[Encyclopedia Dramatica]] {{·}} [[Etherpad]] {{·}} [[Everything2]] {{·}} [[infoAnarchy]] {{·}} [[GeoNames]] {{·}} [[GNUPedia]] {{·}} [[Google Books]] ([[Google Books Ngram]]) {{·}} [[Horror Movie Database]] {{·}} [[Insurgency Wiki]] {{·}} [[Knol]] {{·}} [[Library Genesis]] {{·}} [[Lost Media Wiki]] {{·}} [[Neoseeker.com]] {{·}} [[Notepad.cc]] {{·}} [[Nupedia]] {{·}} [[OpenCourseWare]] {{·}} [[OpenStreetMap]] {{·}} [[Orain]] {{·}} [[Pastebin]] {{·}} [[Patch.com]] {{·}} [[Project Gutenberg]] {{·}} [[Puella Magi]] {{·}} [[Referata]] {{·}} [[Resedagboken]] {{·}} [[SongMeanings]] {{·}} [[ShoutWiki]] {{·}} [[The Internet Movie Database]] {{·}} [[TropicalWikis]] {{·}} [[Uncyclopedia]] {{·}} [[Urban Dictionary]] {{·}} [[Webmonkey]] {{·}} [[Wikia]] {{·}} [[Wikidot]] {{·}} [[WikiHow]] {{·}} [[Wikkii]] {{·}} [[WikiLeaks]] {{·}} [[Wikipedia]] ([[Simple English Wikipedia]]) {{·}} [[Wikispaces]] {{·}} [[Wikispot]] {{·}} [[Wik.is]] {{·}} [[Wiki-Site]] {{·}} [[WikiTravel]] {{·}} [[Word Count Journal]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Magazines/Blogs/News''' || colspan=2 | [[Cyberpunkreview.com]] {{·}} [[Game Developer Magazine]] {{·}} [[Gigaom]] {{·}} [[Helium]] {{·}} [[JPG Magazine]] {{·}} [[Polygamia.pl]] {{·}} [[San Fransisco Bay Guardian]] {{·}} [[Scoop]] {{·}} [[Regretsy]] {{·}} [[Yahoo! Voices]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Microblogging]]''' || colspan=2 | [[Heello]] {{·}} [[Identi.ca]] {{·}} [[Jaiku]] {{·}} [[Mommo.hu]] {{·}} [[Plurk]] {{·}} [[Sina Weibo]] {{·}} [[Twitter]] {{·}} [[TwitLonger]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Music/Audio''' || colspan=2 | [[AOL Music]] {{·}} [[Audimated.com]] {{·}} [[Cinch]] {{·}} [[digCCmixter]] {{·}} [[Dogmazic.net]] {{·}} [[Earbits]] {{·}} [[exfm]] {{·}} [[Free Music Archive]] {{·}} [[Gogoyoko]] {{·}} [[Indaba Music]] {{·}} [[Instacast]] {{·}} [[Jamendo]] {{·}} [[Last.fm]] {{·}} [[Music Unlimited]] {{·}} [[MOG]] {{·}} [[PureVolume]] {{·}} [[Reverbnation]] {{·}} [[ShareTheMusic]] {{·}} [[SoundCloud]] {{·}} [[Soundpedia]] {{·}} [[This Is My Jam]] {{·}} [[TuneWiki]] {{·}} [[Twaud.io]] {{·}} [[WinAmp]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''People''' || colspan=2 | [[Aaron Swartz]] {{·}} [[Michael S. Hart]] {{·}} [[Steve Jobs]] {{·}} [[Mark Pilgrim]] {{·}} [[Dennis Ritchie]] {{·}} [[Len Sassaman Project]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Protocols/Infrastructure''' || colspan=2 | [[FTP]] {{·}} [[Gopher]] {{·}} [[IRC]] {{·}} [[Usenet]] {{·}} [[World Wide Web]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Q&A''' || colspan=2 | [[Askville]] {{·}} [[Answerbag]] {{·}} [[Answers.com]] {{·}} [[Ask.com]] {{·}} [[Askalo]] {{·}} [[Baidu Knows]] {{·}} [[Blurtit]] {{·}} [[ChaCha]] {{·}} [[Experts Exchange]] {{·}} [[Formspring]] {{·}} [[GirlsAskGuys]] {{·}} [[Google Answers]] {{·}} [[Google Baraza]] {{·}} [[JustAnswer]] {{·}} [[MetaFilter]] {{·}} [[Quora]] {{·}} [[Retrospring]] {{·}} [[StackExchange]] {{·}} [[The AnswerBank]] {{·}} [[The Internet Oracle]] {{·}} [[Uclue]] {{·}} [[WikiAnswers]] {{·}} [[Yahoo! Answers]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Recipes/Food''' || colspan=2 | [[Allrecipes]] {{·}} [[Epicurious]] {{·}} [[Food.com]] {{·}} [[Foodily]] {{·}} [[Food Network]] {{·}} [[Punchfork]] {{·}} [[ZipList]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Social bookmarking''' || colspan=2 | [[Addinto]] {{·}} [[Backflip]] {{·}} [[Balatarin]] {{·}} [[BibSonomy]] {{·}} [[Bkmrx]] {{·}} [[Blinklist]] {{·}} [[BlogMarks]] {{·}} [[BookmarkSync]] {{·}} [[CiteULike]] {{·}} [[Connotea]] {{·}} [[Delicious]] {{·}} [[Designer News]] {{·}} [[Digg]] {{·}} [[Diigo]] {{·}} [[Dir.eccion.es]] {{·}} [[Evernote]] {{·}} [[Excite Bookmark]] {{·}} [[Faves]] {{·}} [[Favilous]] {{·}} [[folkd]] {{·}} [[Freelish]] {{·}} [[Getboo]] {{·}} [[GiveALink.org]] {{·}} [[Gnolia]] {{·}} [[Google Bookmarks]] {{·}} [[Hacker News]] {{·}} [[HeyStaks]] {{·}} [[IndianPad]] {{·}} [[Kippt]] {{·}} [[Knowledge Plaza]] {{·}} [[Licorize]] {{·}} [[Linkwad]] {{·}} [[Menéame]] {{·}} [[Microsoft Developer Network]] {{·}} [[myVIP]] {{·}} [[Mister Wong]] {{·}} [[My Web]] {{·}} [[Mylink Vault]] {{·}} [[Newsvine]] {{·}} [[Oneview]] {{·}} [[Pearltrees]] {{·}} [[Pinboard]] {{·}} [[Pocket]] {{·}} [[Propeller.com]] {{·}} [[Reddit]] {{·}} [[sabros.us]] {{·}} [[Scloog]] {{·}} [[Scuttle]] {{·}} [[Simpy]] {{·}} [[SiteBar]] {{·}} [[Slashdot]] {{·}} [[Squidoo]] {{·}} [[StumbleUpon]] {{·}} [[Twine]] {{·}} [[Vizited]] {{·}} [[Yummymarks]] {{·}} [[Xmarks]] {{·}} [[Yahoo! Buzz]] {{·}} [[Zootool]] {{·}} [[Zotero]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Social network|Social networks]]''' || colspan=2 | [[Bebo]] {{·}} [[BlackPlanet]] {{·}} [[Classmates.com]] {{·}} [[Cyworld]] {{·}} [[Dogster]] {{·}} [[Dopplr]] {{·}} [[douban]] {{·}} [[Ello]] {{·}} [[Facebook]] {{·}} [[Flixster]] {{·}} [[FriendFeed]] {{·}} [[Friendster]] {{·}} [[Friends Reunited]] {{·}} [[Gaia Online]] {{·}} [[Google+]] {{·}} [[Habbo]] {{·}} [[hi5]] {{·}} [[Hyves]] {{·}} [[iWiW]] {{·}} [[LinkedIn]] {{·}} [[Miiverse]] {{·}} [[mixi]] {{·}} [[MyHeritage]] {{·}} [[MyLife]] {{·}} [[Myspace]] {{·}} [[myVIP]] {{·}} [[Netlog]] {{·}} [[Odnoklassniki]] {{·}} [[Orkut]] {{·}} [[Plaxo]] {{·}} [[Qzone]] {{·}} [[Renren]] {{·}} [[Skyrock]] {{·}} [[Sonico.com]] {{·}} [[Storylane]] {{·}} [[Tagged]] {{·}} [[tvtag]] {{·}} [[Upcoming]] {{·}} [[Viadeo]] {{·}} [[Vine]] {{·}} [[Vkontakte]] {{·}} [[WeeWorld]] {{·}} [[Weibo]] {{·}} [[Wretch]] {{·}} [[Yahoo! Groups]] {{·}} [[Yahoo! Stars India]] {{·}} [[Yahoo! Upcoming]] {{·}} [[Social network|more sites...]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Shopping/Retail''' || colspan=2 | [[Alibaba]] {{·}} [[AliExpress]] {{·}} [[Amazon]] {{·}} [[Apple Store]] {{·}} [[eBay]] {{·}} [[Printfection]] {{·}} [[RadioShack]] {{·}} [[Sears]] {{·}} [[Target]] {{·}} [[The Book Depository]] {{·}} [[ThinkGeek]] {{·}} [[Walmart]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Software/[[Code hosting services|code hosting]]''' || colspan=2 | [[Android Development]] {{·}} [[Alioth]] {{·}} [[Assembla]] {{·}} [[BerliOS]] {{·}} [[Betavine]] {{·}} [[Bitbucket]] {{·}} [[BountySource]] {{·}} [[Codecademy]] {{·}} [[CodePlex]] {{·}} [[Freepository]] {{·}} [[Free Software Foundation]] {{·}} [[GNU Savannah]] {{·}} [[GitHost]] {{·}} [[GitHub]] {{·}} [[GitHub Downloads]] {{·}} [[Gitorious]] {{·}} [[Gna!]] {{·}} [[Google Code]] {{·}} [[ibiblio]] {{·}} [[java.net]] {{·}} [[JavaForge]] {{·}} [[KnowledgeForge]] {{·}} [[Launchpad]] {{·}} [[LuaForge]] {{·}} [[Maemo]] {{·}} [[mozdev]] {{·}} [[OSOR.eu]] {{·}} [[OW2 Consortium]] {{·}} [[Openmoko]] {{·}} [[OpenSolaris]] {{·}} [[Ourproject.org]] {{·}} [[Ovi Store]] {{·}} [[Project Kenai]] {{·}} [[RubyForge]] {{·}} [[SEUL.org]] {{·}} [[SourceForge]] {{·}} [[Stypi]] {{·}} [[TestFlight]] {{·}} [[tigris.org]] {{·}} [[Transifex]] {{·}} [[TuxFamily]] {{·}} [[Yahoo! Downloads]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Torrenting/Piracy''' || colspan=2 | [[ExtraTorrent]] {{·}} [[EZTV]] {{·}} [[isoHunt]] {{·}} [[KickassTorrents]] {{·}} [[The Pirate Bay]] {{·}} [[Torrentz]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Video hosting]]''' || colspan=2 | [[Academic Earth]] {{·}} [[Blip.tv]] {{·}} [[Epic]] {{·}} [[Google Video]] {{·}} [[Justin.tv]] {{·}} [[Niconico]] {{·}} [[Nokia Trailers]] {{·}} [[Qwiki]] {{·}} [[Skillfeed]] {{·}} [[Stickam]] {{·}} [[TED Talks]] {{·}} [[Ticker.tv]] {{·}} [[Twitch.tv]] {{·}} [[Ustream]] {{·}} [[Videoplayer.hu]] {{·}} [[Viddler]] {{·}} [[Viddy]] {{·}} [[Vimeo]] {{·}} [[Vine]] {{·}} [[Vstreamers]] {{·}} [[Yahoo! Video]] {{·}} [[YouTube]] {{·}} [[Famous Internet videos]] ([[Me at the zoo]])<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[List of website hosts|Web hosting]]''' || [[Angelfire]] {{·}} [[Brace.io]] {{·}} [[BT Internet]] {{·}} [[CableAmerica Personal Web Space]] {{·}} [[Claranet Netherlands Personal Web Pages]] {{·}} [[Comcast Personal Web Pages]] {{·}} [[Extra.hu]] {{·}} [[FortuneCity]] {{·}} [[Free ProHosting]] {{·}} [[GeoCities]] ([[GeoCities Torrent Patch|patch]]) {{·}} [[Google Business Sitebuilder]] {{·}} [[Google Sites]] {{·}} [[Internet Centrum]] {{·}} [[MBinternet]] {{·}} [[MSN TV]] {{·}} [[Nwnyet]] {{·}} [[Parodius Networking]] {{·}} [[Prodigy.net]] {{·}} [[Saunalahti Iso G]] {{·}} [[Swipnet]] {{·}} [[Telenor]] {{·}} [[Tripod]] {{·}} [[University of Michigan personal webpages]] {{·}} [[Verizon Mysite]] {{·}} [[Verizon Personal Web Space]] {{·}} [[Webzdarma]] {{·}} [[Virgin Media]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Web applications''' || colspan=2 | [[Mailman]] {{·}} [[MediaWiki]] {{·}} [[phpBB]] {{·}} [[Simple Machines Forum]] {{·}} [[vBulletin]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Other''' || colspan=2 | [[800notes]] {{·}} [[AOL]] {{·}} [[Akoha]] {{·}} [[Ancestry.com]] {{·}} [[April Fools' Day]] {{·}} [[Amplicate]] {{·}} [[AutoAdmit]] {{·}} [[Bre.ad]] {{·}} [[Circavie]] {{·}} [[Cobook]] {{·}} [[Co.mments]] {{·}} [[Countdown]] {{·}} [[Distill]] {{·}} [[Dmoz]] {{·}} [[Easel]] {{·}} [[Eircode]] {{·}} [[Electronic Frontier Foundation]] {{·}} [[FanFiction.Net]] {{·}} [[Feedly]] {{·}} [[Ficlets]] {{·}} [[Forrst]] {{·}} [[FunnyExam.com]] {{·}} [[FurAffinity]] {{·}} [[Google Helpouts]] {{·}} [[Google Moderator]] {{·}} [[Google Reader]] {{·}} [[ICQmail]] {{·}} [[IFTTT]] {{·}} [[Jajah]] {{·}} [[JuniorNet]] {{·}} [[Lulu Poetry]] {{·}} [[Mobile Phone Applications]] {{·}} [[Mochi Media]] {{·}} [[Mozilla Firefox]] {{·}} [[MyBlogLog]] {{·}} [[NBII]] {{·}} [[Neopets]] {{·}} [[Quantcast]] {{·}} [[Quizilla]] {{·}} [[Salon Table Talk]] {{·}} [[Shutdownify]] {{·}} [[Slidecast]] {{·}} [[SOPA blackout pages]] {{·}} [[starwars.yahoo.com]] {{·}} [[TechNet]] {{·}} [[Toshiba Support]] {{·}} [[USA-Gov]] {{·}} [[Volán]] {{·}} [[Widgetbox]] {{·}} [[Windows Technical Preview]] {{·}} [[Wunderlist]] {{·}} [[Zoocasa]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Information''' || colspan=2 | [[A Million Ways to Die on the Web]] {{·}} [[Backup Tips]] {{·}} [[Cheap storage]] {{·}} [[Collecting items randomly]] {{·}} [[Data compression algorithms and tools]] {{·}} [[Dev]] {{·}} [[Discovery Data]] {{·}} [[DOS Floppies]] {{·}} [[Fortress of Solitude]] {{·}} [[Keywords]] {{·}} [[Naughty List]] {{·}} [[Nightmare Projects]] {{·}} [[Rescuing Floppy Disks|Rescuing floppy disks]] {{·}} [[Rescuing optical media]] {{·}} [[Site exploration]] {{·}} [[The WARC Ecosystem]] {{·}} [[Working with ARCHIVE.ORG]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Projects]]''' || colspan=2 | [[ArchiveCorps]] {{·}} [[Audit2014]] {{·}} [[Emularity]] {{·}} [[Faceoff]] {{·}} [[FlickrFckr]] {{·}} [[Froogle]] {{·}} [[ftp-gov]] {{·}} [[INTERNETARCHIVE.BAK]] ([[Internet Archive Census]]) {{·}} [[IRC Quotes]] {{·}} [[Javascript Mess|JSMESS]] {{·}} [[Jsvlc|JSVLC]] {{·}} [[Just Solve the Problem 2012|Just Solve the Problem]] {{·}} [[NewsGrabber]] {{·}} [[Project Newsletter]] {{·}} [[Valhalla]] {{·}} [[Web Roasting]] ([[ISP Hosting]] {{·}} [[University Web Hosting]]) {{·}} [[Woohoo]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Software|Tools]]''' || colspan=2 | [[ArchiveBot]] {{·}} [[ArchiveTeam Warrior]] ([[Tracker]]) {{·}} [[Google Takeout]] {{·}} [[HTTrack options|HTTrack]] {{·}} [[Video|Video downloaders]] {{·}} [[Wget]] ([[Wget with Lua hooks|Lua]] {{·}} [[Wget with WARC output|WARC]])<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Teams''' || colspan=2 | [[Bibliotheca Anonoma]] {{·}} [[LibreTeam]] {{·}} [[URLTeam]] {{·}} [[Yahoo Video Warroom]] {{·}} [[WikiTeam]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''About [[Archive Team]]''' || colspan=2 | [[Introduction]] {{·}} [[Philosophy]] {{·}} [[Who We Are]] {{·}} [[Robots.txt|Our stance on robots.txt]] {{·}} [[Why Back Up?]] {{·}} [[Software]] {{·}} [[Formats]] {{·}} [[Storage Media]] {{·}} [[Recommended Reading]] {{·}} [[Films and documentaries about archiving]] {{·}} [[Talks]] {{·}} [[In The Media]] {{·}} [[Frequently Asked Questions|FAQ]]<br />
|}<br />
</center>[[Category:Archive Team]]<noinclude>[[Category:Templates]]</noinclude></div>Yanhttps://wiki.archiveteam.org/index.php?title=Ftp-gov&diff=26749Ftp-gov2017-01-02T16:35:38Z<p>Yan: add footer</p>
<hr />
<div>{{Infobox project<br />
| title = US government FTP sites<br />
| logo = Great Seal of the United States.png<br />
| image = Government data.jpg<br />
| project_status = {{endangered}}<br />
| archiving_status = {{inprogress}}<br />
| irc = cheetoflee<br />
| tracker = [http://tracker.archiveteam.org/ftpdisco/ discovery]<br/>[http://tracker.archiveteam.org/ftp-gov download]<br />
| source = [https://github.com/ArchiveTeam/ftp-gov-grab ftp-gov-grab]<br />
}}<br />
As part of the [[Government Backup|government backup]], we are backing up 750+ FTP sites hosted at .MIL and .GOV sites. The results of this download are being sent to [https://archive.org/details/archiveteam_ftpgov this collection on the Internet Archive].<br />
<br />
== How can I help? ==<br />
At this time no script has been completed. For now, come into the IRC channel {{IRC|cheetoflee}} to be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and select ''ArchiveTeam's Choice'' for now, until the scripts are written (this is going to be our highest priority, so you will run them as soon as they are released!).<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may prioritize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the update has finished, re-launch the script.<br />
|}<br />
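<br />
The graceful-stop and update steps described above can be sketched as a few shell helpers. This is only a minimal sketch and not part of ftp-gov-grab itself: the function names and the <code>GRAB_DIR</code> variable are hypothetical, and it assumes the repository was cloned into a folder named <code>ftp-gov-grab</code>.<br />

```shell
#!/bin/sh
# Hypothetical helpers (not shipped with ftp-gov-grab).
# GRAB_DIR is an assumed variable pointing at the script's folder.
GRAB_DIR="${GRAB_DIR:-ftp-gov-grab}"

# Ask the running script to finish its current item(s) and then exit.
request_stop() {
    touch "$GRAB_DIR/STOP"
}

# Remove the marker so the script can be started again.
clear_stop() {
    rm -f "$GRAB_DIR/STOP"
}

# Fetch the latest code after a "Project code is out of date" message.
update_code() {
    (cd "$GRAB_DIR" && git pull https://github.com/ArchiveTeam/ftp-gov-grab)
}
```

A typical sequence would be to call <code>request_stop</code>, wait for the script to exit on its own, then run <code>update_code</code> if needed and <code>clear_stop</code> before launching the script again.<br />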
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by the ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and be available – hopefully – forever. However, storing it costs thousands of dollars in the long run. So, if you can afford it, please consider donating to the Internet Archive, so that this piece of history can be kept for us all. http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help with other projects, want to learn more about ArchiveTeam, or even help with development in general, navigate to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=Template:Navigation_box&diff=26748Template:Navigation box2017-01-02T16:35:35Z<p>Yan: link ftp-gov and USA-Gov</p>
<hr />
<div><br clear="all" /><center><!--<br />
<br />
<br />
<br />
<br />
Rows are in Alphabetic order. Except "Current events" at the top and "About Archive Team" at the bottom.<br />
Items inside rows are in Alphabetic order too.<br />
Easy : )<br />
<br />
<br />
<br />
<br />
--><br />
{| class="mw-collapsible mw-collapsed" style="border: 1px solid #aaa; background-color: #f9f9f9; color: black; margin: 0.5em 0 0.5em 1em; padding: 0.2em; font-size: 100%;"<br />
| colspan=3 align=center style="background: #ccccff;" | <span style="float: right;"><span class="plainlinks">[[{{fullurl:Template:Navigation_box}} view]]&nbsp;&nbsp;[[{{fullurl:Template:Navigation_box|action=edit}} edit]]</span>&nbsp;</span>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;'''[[Archive Team]]'''&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Archiveteam:Current events|Current events]]''' || [[Alive... OR ARE THEY]] {{·}} [[Deathwatch]] {{·}} [[Projects]] || rowspan=5 | [[File:Archiveteam.jpg|right|150px]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Archiving projects]]''' || [[APKMirror]] {{·}} [[Archive.is]] {{·}} [[BetaArchive]] {{·}} [[Gmane]] {{·}} [[Internet Archive]] {{·}} [[It Died]] {{·}} [[Megalodon.jp]] {{·}} [[OldApps.com]] {{·}} [[OldVersion.com]] {{·}} [[OSBetaArchive]] {{·}} [[TEXTFILES.COM]] {{·}} [[The Dead, the Dying & The Damned]] {{·}} [[The Mail Archive]] {{·}} [[UK Web Archive]] {{·}} [[WebCite]] {{·}} [[Vaporwave.me]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Blogging''' || [[Blog.pl]] {{·}} [[Blogger]] {{·}} [[Blogster]] {{·}} [[Blogter.hu]] {{·}} [[Freeblog.hu]] {{·}} [[Fuelmyblog]] {{·}} [[Jux]] {{·}} [[LiveJournal]] {{·}} [[My Opera]] {{·}} [[Nolblog.hu]] {{·}} [[Open Diary]] {{·}} [[ownlog.com]] {{·}} [[Posterous]] {{·}} [[Powerblogs]] {{·}} [[Proust]] {{·}} [[Roon]] {{·}} [[Splinder]] {{·}} [[Tumblr]] {{·}} [[Vox]] {{·}} [[Weblog.nl]] {{·}} [[Windows Live Spaces]] {{·}} [[Wordpress.com]] {{·}} [[Xanga]] {{·}} [[Yahoo! Blog]] {{·}} [[Zapd]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Clown hosting|Cloud hosting]]/file sharing''' || [[ADrive|aDrive]] {{·}} [[AnyHub]] {{·}} [[Box]] {{·}} [[Dropbox]] {{·}} [[Docstoc]] {{·}} [[Google Drive]] {{·}} [[Google Groups Files]] {{·}} [[iCloud]] {{·}} [[Fileplanet]] {{·}} [[LayerVault]] {{·}} [[MediaCrush]] {{·}} [[MediaFire]] {{·}} [[Mega]] {{·}} [[MegaUpload]] {{·}} [[MobileMe]] {{·}} [[OneDrive]] {{·}} [[Pomf.se]] {{·}} [[RapidShare]] {{·}} [[Ubuntu One]] {{·}} [[Yahoo! Briefcase]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[:Category:Corporations|Corporations]]''' || [[Apple]] {{·}} [[IBM]] {{·}} [[Google]] {{·}} [[Lycos Europe]] {{·}} [[Microsoft]] {{·}} [[Yahoo!]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Events''' || [[Arab Spring]] {{·}} [[Occupy movement]] {{·}} [[Spanish Revolution]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Font Repos''' || [[Google Web Fonts]] {{·}} [[GNU FreeFont]] {{·}} [[Fontspace]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Forums/Message boards''' || colspan=2 | [[4chan]] {{·}} [[Captain Luffy Forums]] {{·}} [[College Confidential]] {{·}} [[DSLReports]] {{·}} [[ESPN Forums]] {{·}} [[forums.starwars.com]] {{·}} [[HeavenGames]] {{·}} [[Invisionfree]] {{·}} [[The Classic Horror Film Board]] {{·}} [[Yahoo! Messages]] {{·}} [[Yahoo! Neighbors]] {{·}} [[Yuku.com]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Games|Gaming]]''' || colspan=2 | [[Atomicgamer]] {{·}} [[City of Heroes]] {{·}} [[Club Nintendo]] {{·}} [[CSGO Lounge|CS:GO Lounge]] {{·}} [[Desura]] {{·}} [[Dota 2 Lounge]] {{·}} [[Emulation Zone]] {{·}} [[GameMaker Sandbox]] {{·}} [[GameTrailers]] {{·}} [[Halo]] {{·}} [[HLTV.org]] {{·}} [[Infinite Crisis]] {{·}} [[Minecraft.net]] {{·}} [[Player.me]] {{·}} [[Playfire]] {{·}} [[Steam]] {{·}} [[SteamDB]] {{·}} [[TF2 Outpost]] {{·}} [[Warhammer]] {{·}} [[Xfire]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Image hosting]]''' || [[500px]] {{·}} [[AOL Pictures]] {{·}} [[Blipfoto]] {{·}} [[Blingee]] {{·}} [[Canv.as]] {{·}} [[Camera+]] {{·}} [[Cameroid]] {{·}} [[DailyBooth]] {{·}} [[Degree Confluence Project]] {{·}} [[deviantART]] {{·}} [[Demotivalo.net]] {{·}} [[Flickr]] {{·}} [[Fotoalbum.hu]] {{·}} [[Fotolog.com]] {{·}} [[Fotopedia]] {{·}} [[Frontback]] {{·}} [[Geograph Britain and Ireland]] {{·}} [[GTF Képhost]] {{·}} [[ImageShack]] {{·}} [[Imgur]] {{·}} [[Inkblazers]] {{·}} [[Instagr.am]] {{·}} [[Kepfeltoltes.hu]] {{·}} [[Kephost.com]] {{·}} [[Kephost.hu]] {{·}} [[Kepkezelo.com]] {{·}} [[Keptarad.hu]] {{·}} [[Madden GIFERATOR]] {{·}} [[MLKSHK]] {{·}} [[Microsoft Clip Art]] {{·}} [[Nokia Memories]] {{·}} [[noob.hu]] {{·}} [[Odysee]] {{·}} [[Panoramio]] {{·}} [[Photobucket]] {{·}} [[Picasa]] {{·}} [[Picplz]] {{·}} [[PSharing]] {{·}} [[Ptch]] {{·}} [[puu.sh]] {{·}} [[Rawporter]] {{·}} [[Relay.im]] {{·}} [[ScreenshotsDatabase.com]] {{·}} [[Snapjoy]] {{·}} [[Streetfiles]] {{·}} [[Tabblo]] {{·}} [[Trovebox]] {{·}} [[TwitPic]] {{·}} [[Wallbase]] {{·}} [[Wallhaven]] {{·}} [[Webshots]] {{·}} [[Wikimedia Commons]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Knowledge/[[Wikis]]''' || colspan=2 | [[arXiv]] {{·}} [[Citizendium]] {{·}} [[Clipboard.com]] {{·}} [[Deletionpedia]] {{·}} [[EditThis]] {{·}} [[Encyclopedia Dramatica]] {{·}} [[Etherpad]] {{·}} [[Everything2]] {{·}} [[infoAnarchy]] {{·}} [[GeoNames]] {{·}} [[GNUPedia]] {{·}} [[Google Books]] ([[Google Books Ngram]]) {{·}} [[Horror Movie Database]] {{·}} [[Insurgency Wiki]] {{·}} [[Knol]] {{·}} [[Library Genesis]] {{·}} [[Lost Media Wiki]] {{·}} [[Neoseeker.com]] {{·}} [[Notepad.cc]] {{·}} [[Nupedia]] {{·}} [[OpenCourseWare]] {{·}} [[OpenStreetMap]] {{·}} [[Orain]] {{·}} [[Pastebin]] {{·}} [[Patch.com]] {{·}} [[Project Gutenberg]] {{·}} [[Puella Magi]] {{·}} [[Referata]] {{·}} [[Resedagboken]] {{·}} [[SongMeanings]] {{·}} [[ShoutWiki]] {{·}} [[The Internet Movie Database]] {{·}} [[TropicalWikis]] {{·}} [[Uncyclopedia]] {{·}} [[Urban Dictionary]] {{·}} [[Webmonkey]] {{·}} [[Wikia]] {{·}} [[Wikidot]] {{·}} [[WikiHow]] {{·}} [[Wikkii]] {{·}} [[WikiLeaks]] {{·}} [[Wikipedia]] ([[Simple English Wikipedia]]) {{·}} [[Wikispaces]] {{·}} [[Wikispot]] {{·}} [[Wik.is]] {{·}} [[Wiki-Site]] {{·}} [[WikiTravel]] {{·}} [[Word Count Journal]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Magazines/Blogs/News''' || colspan=2 | [[Cyberpunkreview.com]] {{·}} [[Game Developer Magazine]] {{·}} [[Gigaom]] {{·}} [[Helium]] {{·}} [[JPG Magazine]] {{·}} [[Polygamia.pl]] {{·}} [[San Fransisco Bay Guardian]] {{·}} [[Scoop]] {{·}} [[Regretsy]] {{·}} [[Yahoo! Voices]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Microblogging]]''' || colspan=2 | [[Heello]] {{·}} [[Identi.ca]] {{·}} [[Jaiku]] {{·}} [[Mommo.hu]] {{·}} [[Plurk]] {{·}} [[Sina Weibo]] {{·}} [[Twitter]] {{·}} [[TwitLonger]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Music/Audio''' || colspan=2 | [[AOL Music]] {{·}} [[Audimated.com]] {{·}} [[Cinch]] {{·}} [[digCCmixter]] {{·}} [[Dogmazic.net]] {{·}} [[Earbits]] {{·}} [[exfm]] {{·}} [[Free Music Archive]] {{·}} [[Gogoyoko]] {{·}} [[Indaba Music]] {{·}} [[Instacast]] {{·}} [[Jamendo]] {{·}} [[Last.fm]] {{·}} [[Music Unlimited]] {{·}} [[MOG]] {{·}} [[PureVolume]] {{·}} [[Reverbnation]] {{·}} [[ShareTheMusic]] {{·}} [[SoundCloud]] {{·}} [[Soundpedia]] {{·}} [[This Is My Jam]] {{·}} [[TuneWiki]] {{·}} [[Twaud.io]] {{·}} [[WinAmp]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''People''' || colspan=2 | [[Aaron Swartz]] {{·}} [[Michael S. Hart]] {{·}} [[Steve Jobs]] {{·}} [[Mark Pilgrim]] {{·}} [[Dennis Ritchie]] {{·}} [[Len Sassaman Project]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Protocols/Infrastructure''' || colspan=2 | [[FTP]] {{·}} [[Gopher]] {{·}} [[IRC]] {{·}} [[Usenet]] {{·}} [[World Wide Web]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Q&A''' || colspan=2 | [[Askville]] {{·}} [[Answerbag]] {{·}} [[Answers.com]] {{·}} [[Ask.com]] {{·}} [[Askalo]] {{·}} [[Baidu Knows]] {{·}} [[Blurtit]] {{·}} [[ChaCha]] {{·}} [[Experts Exchange]] {{·}} [[Formspring]] {{·}} [[GirlsAskGuys]] {{·}} [[Google Answers]] {{·}} [[Google Baraza]] {{·}} [[JustAnswer]] {{·}} [[MetaFilter]] {{·}} [[Quora]] {{·}} [[Retrospring]] {{·}} [[StackExchange]] {{·}} [[The AnswerBank]] {{·}} [[The Internet Oracle]] {{·}} [[Uclue]] {{·}} [[WikiAnswers]] {{·}} [[Yahoo! Answers]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Recipes/Food''' || colspan=2 | [[Allrecipes]] {{·}} [[Epicurious]] {{·}} [[Food.com]] {{·}} [[Foodily]] {{·}} [[Food Network]] {{·}} [[Punchfork]] {{·}} [[ZipList]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Social bookmarking''' || colspan=2 | [[Addinto]] {{·}} [[Backflip]] {{·}} [[Balatarin]] {{·}} [[BibSonomy]] {{·}} [[Bkmrx]] {{·}} [[Blinklist]] {{·}} [[BlogMarks]] {{·}} [[BookmarkSync]] {{·}} [[CiteULike]] {{·}} [[Connotea]] {{·}} [[Delicious]] {{·}} [[Designer News]] {{·}} [[Digg]] {{·}} [[Diigo]] {{·}} [[Dir.eccion.es]] {{·}} [[Evernote]] {{·}} [[Excite Bookmark]] {{·}} [[Faves]] {{·}} [[Favilous]] {{·}} [[folkd]] {{·}} [[Freelish]] {{·}} [[Getboo]] {{·}} [[GiveALink.org]] {{·}} [[Gnolia]] {{·}} [[Google Bookmarks]] {{·}} [[Hacker News]] {{·}} [[HeyStaks]] {{·}} [[IndianPad]] {{·}} [[Kippt]] {{·}} [[Knowledge Plaza]] {{·}} [[Licorize]] {{·}} [[Linkwad]] {{·}} [[Menéame]] {{·}} [[Microsoft Developer Network]] {{·}} [[myVIP]] {{·}} [[Mister Wong]] {{·}} [[My Web]] {{·}} [[Mylink Vault]] {{·}} [[Newsvine]] {{·}} [[Oneview]] {{·}} [[Pearltrees]] {{·}} [[Pinboard]] {{·}} [[Pocket]] {{·}} [[Propeller.com]] {{·}} [[Reddit]] {{·}} [[sabros.us]] {{·}} [[Scloog]] {{·}} [[Scuttle]] {{·}} [[Simpy]] {{·}} [[SiteBar]] {{·}} [[Slashdot]] {{·}} [[Squidoo]] {{·}} [[StumbleUpon]] {{·}} [[Twine]] {{·}} [[Vizited]] {{·}} [[Yummymarks]] {{·}} [[Xmarks]] {{·}} [[Yahoo! Buzz]] {{·}} [[Zootool]] {{·}} [[Zotero]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Social network|Social networks]]''' || colspan=2 | [[Bebo]] {{·}} [[BlackPlanet]] {{·}} [[Classmates.com]] {{·}} [[Cyworld]] {{·}} [[Dogster]] {{·}} [[Dopplr]] {{·}} [[douban]] {{·}} [[Ello]] {{·}} [[Facebook]] {{·}} [[Flixster]] {{·}} [[FriendFeed]] {{·}} [[Friendster]] {{·}} [[Friends Reunited]] {{·}} [[Gaia Online]] {{·}} [[Google+]] {{·}} [[Habbo]] {{·}} [[hi5]] {{·}} [[Hyves]] {{·}} [[iWiW]] {{·}} [[LinkedIn]] {{·}} [[Miiverse]] {{·}} [[mixi]] {{·}} [[MyHeritage]] {{·}} [[MyLife]] {{·}} [[Myspace]] {{·}} [[myVIP]] {{·}} [[Netlog]] {{·}} [[Odnoklassniki]] {{·}} [[Orkut]] {{·}} [[Plaxo]] {{·}} [[Qzone]] {{·}} [[Renren]] {{·}} [[Skyrock]] {{·}} [[Sonico.com]] {{·}} [[Storylane]] {{·}} [[Tagged]] {{·}} [[tvtag]] {{·}} [[Upcoming]] {{·}} [[Viadeo]] {{·}} [[Vine]] {{·}} [[Vkontakte]] {{·}} [[WeeWorld]] {{·}} [[Weibo]] {{·}} [[Wretch]] {{·}} [[Yahoo! Groups]] {{·}} [[Yahoo! Stars India]] {{·}} [[Yahoo! Upcoming]] {{·}} [[Social network|more sites...]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Shopping/Retail''' || colspan=2 | [[Alibaba]] {{·}} [[AliExpress]] {{·}} [[Amazon]] {{·}} [[Apple Store]] {{·}} [[eBay]] {{·}} [[Printfection]] {{·}} [[RadioShack]] {{·}} [[Sears]] {{·}} [[Target]] {{·}} [[The Book Depository]] {{·}} [[ThinkGeek]] {{·}} [[Walmart]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Software/[[Code hosting services|code hosting]]''' || colspan=2 | [[Android Development]] {{·}} [[Alioth]] {{·}} [[Assembla]] {{·}} [[BerliOS]] {{·}} [[Betavine]] {{·}} [[Bitbucket]] {{·}} [[BountySource]] {{·}} [[Codecademy]] {{·}} [[CodePlex]] {{·}} [[Freepository]] {{·}} [[Free Software Foundation]] {{·}} [[GNU Savannah]] {{·}} [[GitHost]] {{·}} [[GitHub]] {{·}} [[GitHub Downloads]] {{·}} [[Gitorious]] {{·}} [[Gna!]] {{·}} [[Google Code]] {{·}} [[ibiblio]] {{·}} [[java.net]] {{·}} [[JavaForge]] {{·}} [[KnowledgeForge]] {{·}} [[Launchpad]] {{·}} [[LuaForge]] {{·}} [[Maemo]] {{·}} [[mozdev]] {{·}} [[OSOR.eu]] {{·}} [[OW2 Consortium]] {{·}} [[Openmoko]] {{·}} [[OpenSolaris]] {{·}} [[Ourproject.org]] {{·}} [[Ovi Store]] {{·}} [[Project Kenai]] {{·}} [[RubyForge]] {{·}} [[SEUL.org]] {{·}} [[SourceForge]] {{·}} [[Stypi]] {{·}} [[TestFlight]] {{·}} [[tigris.org]] {{·}} [[Transifex]] {{·}} [[TuxFamily]] {{·}} [[Yahoo! Downloads]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Torrenting/Piracy''' || colspan=2 | [[ExtraTorrent]] {{·}} [[EZTV]] {{·}} [[isoHunt]] {{·}} [[KickassTorrents]] {{·}} [[The Pirate Bay]] {{·}} [[Torrentz]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Video hosting]]''' || colspan=2 | [[Academic Earth]] {{·}} [[Blip.tv]] {{·}} [[Epic]] {{·}} [[Google Video]] {{·}} [[Justin.tv]] {{·}} [[Niconico]] {{·}} [[Nokia Trailers]] {{·}} [[Qwiki]] {{·}} [[Skillfeed]] {{·}} [[Stickam]] {{·}} [[TED Talks]] {{·}} [[Ticker.tv]] {{·}} [[Twitch.tv]] {{·}} [[Ustream]] {{·}} [[Videoplayer.hu]] {{·}} [[Viddler]] {{·}} [[Viddy]] {{·}} [[Vimeo]] {{·}} [[Vine]] {{·}} [[Vstreamers]] {{·}} [[Yahoo! Video]] {{·}} [[YouTube]] {{·}} [[Famous Internet videos]] ([[Me at the zoo]])<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[List of website hosts|Web hosting]]''' || [[Angelfire]] {{·}} [[Brace.io]] {{·}} [[BT Internet]] {{·}} [[CableAmerica Personal Web Space]] {{·}} [[Claranet Netherlands Personal Web Pages]] {{·}} [[Comcast Personal Web Pages]] {{·}} [[Extra.hu]] {{·}} [[FortuneCity]] {{·}} [[Free ProHosting]] {{·}} [[GeoCities]] ([[GeoCities Torrent Patch|patch]]) {{·}} [[Google Business Sitebuilder]] {{·}} [[Google Sites]] {{·}} [[Internet Centrum]] {{·}} [[MBinternet]] {{·}} [[MSN TV]] {{·}} [[Nwnyet]] {{·}} [[Parodius Networking]] {{·}} [[Prodigy.net]] {{·}} [[Saunalahti Iso G]] {{·}} [[Swipnet]] {{·}} [[Telenor]] {{·}} [[Tripod]] {{·}} [[University of Michigan personal webpages]] {{·}} [[Verizon Mysite]] {{·}} [[Verizon Personal Web Space]] {{·}} [[Webzdarma]] {{·}} [[Virgin Media]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Web applications''' || colspan=2 | [[Mailman]] {{·}} [[MediaWiki]] {{·}} [[phpBB]] {{·}} [[Simple Machines Forum]] {{·}} [[vBulletin]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Other''' || colspan=2 | [[800notes]] {{·}} [[AOL]] {{·}} [[Akoha]] {{·}} [[Ancestry.com]] {{·}} [[April Fools' Day]] {{·}} [[Amplicate]] {{·}} [[AutoAdmit]] {{·}} [[Bre.ad]] {{·}} [[Circavie]] {{·}} [[Cobook]] {{·}} [[Co.mments]] {{·}} [[Countdown]] {{·}} [[Distill]] {{·}} [[Dmoz]] {{·}} [[Easel]] {{·}} [[Eircode]] {{·}} [[Electronic Frontier Foundation]] {{·}} [[FanFiction.Net]] {{·}} [[Feedly]] {{·}} [[Ficlets]] {{·}} [[Forrst]] {{·}} [[FunnyExam.com]] {{·}} [[FurAffinity]] {{·}} [[Google Helpouts]] {{·}} [[Google Moderator]] {{·}} [[Google Reader]] {{·}} [[ICQmail]] {{·}} [[IFTTT]] {{·}} [[Jajah]] {{·}} [[JuniorNet]] {{·}} [[Lulu Poetry]] {{·}} [[Mobile Phone Applications]] {{·}} [[Mochi Media]] {{·}} [[Mozilla Firefox]] {{·}} [[MyBlogLog]] {{·}} [[NBII]] {{·}} [[Neopets]] {{·}} [[Quantcast]] {{·}} [[Quizilla]] {{·}} [[Salon Table Talk]] {{·}} [[Shutdownify]] {{·}} [[Slidecast]] {{·}} [[SOPA blackout pages]] {{·}} [[starwars.yahoo.com]] {{·}} [[TechNet]] {{·}} [[Toshiba Support]] {{·}} [[USA-Gov]] {{·}} [[Volán]] {{·}} [[Widgetbox]] {{·}} [[Windows Technical Preview]] {{·}} [[Wunderlist]] {{·}} [[Zoocasa]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Information''' || colspan=2 | [[A Million Ways to Die on the Web]] {{·}} [[Backup Tips]] {{·}} [[Cheap storage]] {{·}} [[Collecting items randomly]] {{·}} [[Data compression algorithms and tools]] {{·}} [[Dev]] {{·}} [[Discovery Data]] {{·}} [[DOS Floppies]] {{·}} [[Fortress of Solitude]] {{·}} [[Keywords]] {{·}} [[Naughty List]] {{·}} [[Nightmare Projects]] {{·}} [[Rescuing Floppy Disks|Rescuing floppy disks]] {{·}} [[Rescuing optical media]] {{·}} [[Site exploration]] {{·}} [[The WARC Ecosystem]] {{·}} [[Working with ARCHIVE.ORG]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Projects]]''' || colspan=2 | [[ArchiveCorps]] {{·}} [[Audit2014]] {{·}} [[Emularity]] {{·}} [[Faceoff]] {{·}} [[FlickrFckr]] {{·}} [[Froogle]] {{·}} [[ftp-gov]] {{·}} [[INTERNETARCHIVE.BAK]] ([[Internet Archive Census]]) {{·}} [[IRC Quotes]] {{·}} [[Javascript Mess|JSMESS]] {{·}} [[Jsvlc|JSVLC]] {{·}} [[Just Solve the Problem 2012|Just Solve the Problem]] {{·}} [[NewsGrabber]] {{·}} [[Project Newsletter]] {{·}} [[Valhalla]] {{·}} [[Web Roasting]] ([[ISP Hosting]] {{·}} [[University Web Hosting]]) {{·}} [[Woohoo]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''[[Software|Tools]]''' || colspan=2 | [[ArchiveBot]] {{·}} [[ArchiveTeam Warrior]] ([[Tracker]]) {{·}} [[Google Takeout]] {{·}} [[HTTrack options|HTTrack]] {{·}} [[Video|Video downloaders]] {{·}} [[Wget]] ([[Wget with Lua hooks|Lua]] {{·}} [[Wget with WARC output|WARC]])<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''Teams''' || colspan=2 | [[Bibliotheca Anonoma]] {{·}} [[LibreTeam]] {{·}} [[URLTeam]] {{·}} [[Yahoo Video Warroom]] {{·}} [[WikiTeam]]<br />
|-<br />
| align=center width=150px style="background: #ddddff;" | '''About [[Archive Team]]''' || colspan=2 | [[Introduction]] {{·}} [[Philosophy]] {{·}} [[Who We Are]] {{·}} [[Robots.txt|Our stance on robots.txt]] {{·}} [[Why Back Up?]] {{·}} [[Software]] {{·}} [[Formats]] {{·}} [[Storage Media]] {{·}} [[Recommended Reading]] {{·}} [[Films and documentaries about archiving]] {{·}} [[Talks]] {{·}} [[In The Media]] {{·}} [[Frequently Asked Questions|FAQ]]<br />
|}<br />
</center>[[Category:Archive Team]]<noinclude>[[Category:Templates]]</noinclude></div>Yanhttps://wiki.archiveteam.org/index.php?title=Current_Projects&diff=26747Current Projects2017-01-02T16:32:36Z<p>Yan: /* Warrior based projects */ link new page</p>
<hr />
<div>__NOTOC__<br />
== Archive Team recruiting ==<br />
* [[Dev|Want to code for Archive Team? Here's a starting point.]]<br />
<br />
== Warrior based projects ==<br />
* [[ftp-gov]]: Oh, just everything the United States government hosts on FTP. Part of the wider [[Government Backup]] efforts. '''IRC Channel {{IRC|cheetoflee}}'''.<br />
* [[NUjij]]: Dutch discussion platform being killed off on September 12, 2016. '''IRC Channel {{IRC|archiveteam}}'''.<br />
* [[Yahoo! Answers]]: It's Yahoo. It's been acquired. More questions? '''IRC Channel {{IRC|noanswers}}'''.<br />
* [[Bayimg]]: [[The Pirate Bay|TPB]]'s image hosting is also being flooded. '''IRC Channel {{IRC|archiveteam}}'''.<br />
* [[Fotolog.com]]: Photo sharing and social networking site <s>closing on February 20, 2016</s>. Still saving. '''IRC Channel {{IRC|fotologout}}.'''<br />
* [[Google Code]]: Google likes [[Github]] more. Shut down on January 26, 2016, but we got a chance to go on saving. '''IRC Channel {{IRC|googlecodeblue}}'''.<br />
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.<br />
* [[PDF 2016]]: We've been given a long list of PDF files. We'll get'em. '''IRC Channel {{IRC|pdflush}}.'''<br />
* [[yuku]]: Lately yuku is very unstable and hosting thousands of forums. '''IRC Channel {{IRC|archiveteam}}'''.<br />
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.<br />
* [[WikiTeam]] ([[WARC]] format): Saving wikis and their external links for the Wayback Machine. '''IRC Channel {{IRC|wikiteam}}'''.<br />
<br />
=== Scripts only ===<br />
* [[FTP]]: Download all the FTP sites! '''IRC Channel {{IRC|effteepee}}'''.<br />
<br />
Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br><br />
What's on: [http://tracker.archiveteam.org/ online tracker].<br><br />
<!--Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].--><br />
<br />
== Manual projects ==<br />
* [[AOL]]: Climbing into the decaying walled garden. '''IRC Channel {{IRC|aohell}}'''<br />
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.<br />
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''.<br />
* [[Froogle]]: Let's do a census of all of Google's products. '''IRC Channel {{IRC|froogle}}'''.<br />
* [[FTP]]: Help us find all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.<br />
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.<br />
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.<br />
* [[NewsGrabber]]: Saving all news articles. Help with server power or by finding more news sites. '''IRC Channel {{IRC|newsgrabber}}'''.<br />
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.<br />
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.<br />
* [[WikiTeam]] (XML dumps): Exporting Mediawiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.<br />
* [[Woohoo]]: Yahoo is untrustworthy, let's do a census of all their products. '''IRC Channel {{IRC|woohoo}}'''.<br />
<br />
== Upcoming projects ==<br />
<!-- Top priority: could disappear anytime now --><br />
<!-- Shutting down, definite deadline given --><br />
<!-- Shutting down, vague deadline given --><br />
<!-- Shutting down, no deadline given --><br />
<!-- Archiving the archives --><br />
* [[Vine]]: [[Twitter]] killed the app, but videos likely won't last forever. '''IRC Channel {{IRC|vinewhine}}'''.<br />
<!-- Misc. projects (unmaintained sites, distrust in owners) --><br />
* [[Flickr]]: [[Yahoo!]] decided to kill it, now Yahoo has been acquired. '''IRC Channel {{IRC|flickrfckr}}'''.<br />
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired. '''IRC Channel {{IRC|stumblr}}'''.<br />
<br />
== Proposed projects ==<br />
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). --><br />
* [[PhotoSynth]]: Microsoft is [http://www.winbeta.org/news/photosynth-to-shut-down-in-february-2017 shutting down] their panorama sharing platform in Feb. 2017<br />
* [[DevPort]]: This [http://developerportfolio.com/ portfolio SaaS provider] has [http://www.lowendtalk.com/discussion/65135/need-some-help-saas-provider-is-dead-but-my-site-is-still-up-how-should-i-grab-it reportedly] been having infrastructure issues, and removed their social media accounts. Possible impending shutdown.<br />
* [[Picasa|Picasa Web Albums]]: Merging into Google Photos on May 1, 2016. '''IRC Channel {{IRC|picasso}}'''.<br />
* [[Google Drive]] web hosting to be discontinued on August 31, 2016.<br />
* [[RadioShack]]: RadioShack is going bankrupt. '''IRC Channel {{IRC|unshackled}}'''.<br />
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.<br />
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07). <br />
* [[SoundCloud]]: Might get blown away by Hurricane Debt and Hurricane Copyright. '''IRC Channel {{IRC|soundclown}}'''.<br />
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.<br />
* [[Blipfoto]]: <s>Went into liquidation on March 11, 2015, future uncertain.</s> Acquired on March 25, 2015, we're still grabbing it anyway. '''IRC Channel {{IRC|fotofinish}}'''.<br />
<br />
== Recently finished ==<br />
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people --><br />
* [[Panoramio]]: [[Google]] is taking one last picture of Panoramio getting executed behind the shed on November 4, 2016. '''IRC Channel {{IRC|paranormio}}'''.<br />
<br />
<!--== Hiatus / Missed the Mark ==<br />
--><br />
<small>ArchiveTeam uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[IRC|More info]]</small></div>Yanhttps://wiki.archiveteam.org/index.php?title=Government_Backup&diff=26746Government Backup2017-01-02T16:30:53Z<p>Yan: /* Archive Team General Websites Download */ link FTP</p>
<hr />
<div>__NOTOC__<br />
[[Image:Government data.jpg|300px|right]]<br />
<br />
''The US Government has an awful lot of data, and it's in a lot of places.'' In 2016, elections were held that indicated deep sea changes in goals and ideals (although previous transitions have always contained such changes). Inspired by this, a number of groups and efforts have risen up to ensure backups of all government data possible are made off-site.<br />
<br />
'''This page contains overviews of the effort by all the teams.'''<br />
<br />
=== Internet Archive ===<br />
<br />
[[Internet Archive]] has two teams, [[Wayback Machine|Wayback]] and [[Archive-It]] ([https://archive-it.org/ archive-it.org]), working through listings of government websites and data stores. They are working internally using Internet Archive's crawlers and environment.<br />
<br />
The results of these internal efforts are saved in [https://archive.org/details/EndOfTerm2016WebCrawls this collection]. (Note that other efforts exist under this collection as Sub-Collections, such as Archive Team efforts.)<br />
<br />
=== #DATAREFUGE ===<br />
<br />
The [[DataRefuge|Data Refuge]] project ([http://www.ppehlab.org/datarefuge ppehlab.org/datarefuge]) has [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 the following Google document] about climate datasets.<br />
<br />
=== Archive Team FTP Backup ===<br />
<br />
The [[ftp-gov|Archive Team project]] is backing up 750+ FTP sites hosted at .MIL and .GOV sites. Its two phases can be tracked [http://tracker.archiveteam.org/ftpdisco/ here] (discovery phase) and [http://tracker.archiveteam.org/ftp-gov/ here] (download phase). The results of this download are being sent to [https://archive.org/details/archiveteam_ftpgov this collection].<br />
<br />
=== Archive Team General Websites Download ===<br />
<br />
Besides the [[ftp-gov|FTP]] data download, Archive Team is also doing a general download (where possible) of many crawlable government websites, such as [[USA-Gov|usa.gov]].<br />
<br />
== Internet Archive Statements ==<br />
<br />
* [http://blog.archive.org/2016/11/09/us-election-results/ US Election Results] - Surprise at the outcome of the election and a call to keep libraries open.<br />
* [http://blog.archive.org/2016/11/11/contribute-to-the-2016-u-s-presidential-election-web-archive/ Please Help Build the 2016 End-of-Term Archive] - A call for assistance and volunteers.<br />
* [http://blog.archive.org/2016/11/29/help-us-keep-the-archive-free-accessible-and-private/ Help Us Keep the Archive Free, Accessible, and Reader Private] - First entry indicating that the election has influenced efforts to back up the archive in Canada.<br />
* [http://blog.archive.org/2016/12/03/faqs-about-the-internet-archive-canada/ FAQs about the Internet Archive Canada] - Much needed clarification about the mirroring in Canada of the Internet Archive.<br />
* [http://blog.archive.org/2016/12/06/internet-archive-canada-and-national-security-letter-in-the-news-roundup/ Internet Archive Canada and National Security Letter in the News] - Roundup of press mentions about the mirroring efforts.<br />
* [http://blog.archive.org/2016/12/15/preserving-u-s-government-websites-and-data-as-the-obama-term-ends/ Preserving U.S. Government Websites and Data as the Obama Term Ends] - Notes by the head of Archive-It about efforts to run the End of Term archiving.<br />
* [http://blog.archive.org/2016/12/17/robots-txt-gov-mil-websites/ Robots.txt Files and Archiving .gov and .mil Websites] - The Internet Archive will no longer follow ROBOTS.TXT directives on .GOV and .MIL.<br />
* [http://blog.archive.org/2016/12/20/would-like-to-archive-government-web-services-not-just-web-sites-please-help/ Would Like to Archive Government Web Services, not just Web Sites– Please help] - Additional call to archive Government Web Services, not just Websites.<br />
<br />
== Notable Press Mentions and References ==<br />
<br />
''Note that the story oscillates between "Internet Archive is adding a mirror in Canada" and "Internet Archive is Moving to Canada".'' The actuality, for anyone viewing this page coming in cold, is that the Internet Archive has been building a mirror in Canada for a significant period of time and has a fully-functioning facility in Canada that has been a presence of some sort for nearly a decade as of 2016. The current effort was merely a speeding up of an inevitable timetable.<br />
<br />
* [http://www.theverge.com/2016/11/29/13778188/internet-archive-of-canada-backup-trump-surveillance-censorship The Internet Archive is building a Canadian copy to protect itself from Trump], The Verge, November 29, 2016<br />
* [http://www.nbcnews.com/news/us-news/internet-archive-web-s-warehouse-creating-trump-era-copy-canada-n689916 Internet Archive, Web's Warehouse, Creating Trump-Era Copy in Canada], NBC News, November 29, 2016<br />
* [http://www.dailykos.com/story/2016/11/30/1605487/-The-Internet-Archive-is-Moving-to-Canada The Internet Archive is "Moving to Canada"], The Daily Kos, November 30, 2016<br />
* [http://gothamist.com/2016/11/30/even_the_internet_is_getting_ready.php Even The Internet Archive Is Moving To Canada Because Of Trump], Gothamist, November 30, 2016<br />
* [http://www.huffingtonpost.ca/2016/11/30/archive-org-canada-trump_n_13330492.html Archive.org Moving To Canada Over Trump Censorship Fears], Huffington Post Canada, November 30, 2016</div>Yanhttps://wiki.archiveteam.org/index.php?title=Talk:Ftp-gov&diff=26745Talk:Ftp-gov2017-01-02T16:29:00Z<p>Yan: add TODO</p>
<hr />
<div>TODO: link this page on the various pages/templates listing current projects. --[[User:Yan|Yan]] ([[User talk:Yan|talk]]) 11:28, 2 January 2017 (EST)</div>Yanhttps://wiki.archiveteam.org/index.php?title=Ftp-gov&diff=26744Ftp-gov2017-01-02T16:27:56Z<p>Yan: name IA</p>
<hr />
<div>{{Infobox project<br />
| title = US government FTP sites<br />
| logo = Great Seal of the United States.png<br />
| image = Government data.jpg<br />
| project_status = {{endangered}}<br />
| archiving_status = {{inprogress}}<br />
| irc = cheetoflee<br />
| tracker = [http://tracker.archiveteam.org/ftpdisco/ discovery]<br/>[http://tracker.archiveteam.org/ftp-gov download]<br />
| source = [https://github.com/ArchiveTeam/ftp-gov-grab ftp-gov-grab]<br />
}}<br />
As part of the [[Government Backup|government backup]], we are backing up 750+ FTP sites hosted at .MIL and .GOV sites. The results of this download are being sent to [https://archive.org/details/archiveteam_ftpgov this collection on the Internet Archive].<br />
<br />
== How can I help? ==<br />
No script has been completed yet. For now, join the IRC channel {{IRC|cheetoflee}} to be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
Until the scripts are written, you can start up a [[Warrior]] and select ''ArchiveTeam's Choice''. This project is going to be our highest priority, so your Warrior will run the scripts as soon as they are released.<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may prioritize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and run <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the update has finished, relaunch the script.<br />
|}<br />
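
The stop and update steps described above can be wrapped in a few shell helpers. This is a minimal sketch, not part of ftp-gov-grab itself: the function names and the <code>GRAB_DIR</code> variable are hypothetical, and it assumes the repository was cloned into a folder called <code>ftp-gov-grab</code>.

```shell
#!/bin/sh
# Sketch only: GRAB_DIR is an assumed location of the ftp-gov-grab checkout.
GRAB_DIR="${GRAB_DIR:-ftp-gov-grab}"

# Ask the running script to finish its current item(s) and then exit.
request_stop() {
    touch "$GRAB_DIR/STOP"
}

# Remove the marker so the next launch does not stop straight away.
clear_stop() {
    rm -f "$GRAB_DIR/STOP"
}

# Refresh the code after a "Project code is out of date" message.
update_code() {
    (cd "$GRAB_DIR" && git pull https://github.com/ArchiveTeam/ftp-gov-grab)
}
```

A typical sequence would be <code>request_stop</code>, waiting for the script to exit on its own, then <code>update_code</code> and <code>clear_stop</code> before launching the script again.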
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and, hopefully, remain available forever. However, storing it costs thousands of dollars in the long run. If you can afford it, please consider donating to the Internet Archive so that this piece of history can be kept for us all: http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help with other projects, learn more about ArchiveTeam, or help with development in general, go to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.</div>Yanhttps://wiki.archiveteam.org/index.php?title=Ftp-gov&diff=26743Ftp-gov2017-01-02T16:27:10Z<p>Yan: copy line from Government backup</p>
<hr />
<div>{{Infobox project<br />
| title = US government FTP sites<br />
| logo = Great Seal of the United States.png<br />
| image = Government data.jpg<br />
| project_status = {{endangered}}<br />
| archiving_status = {{inprogress}}<br />
| irc = cheetoflee<br />
| tracker = [http://tracker.archiveteam.org/ftpdisco/ discovery]<br/>[http://tracker.archiveteam.org/ftp-gov download]<br />
| source = [https://github.com/ArchiveTeam/ftp-gov-grab ftp-gov-grab]<br />
}}<br />
As part of the [[Government Backup|government backup]], we are backing up 750+ FTP sites hosted at .MIL and .GOV sites. The results of this download are being sent to [https://archive.org/details/archiveteam_ftpgov this collection on archive.org].<br />
<br />
== How can I help? ==<br />
No script has been completed yet. For now, join the IRC channel {{IRC|cheetoflee}} to be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
Until the scripts are written, you can start up a [[Warrior]] and select ''ArchiveTeam's Choice''. This project is going to be our highest priority, so your Warrior will run the scripts as soon as they are released.<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may prioritize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and run <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the update has finished, relaunch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and, hopefully, remain available forever. However, storing it costs thousands of dollars in the long run. If you can afford it, please consider donating to the Internet Archive so that this piece of history can be kept for us all: http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help with other projects, learn more about ArchiveTeam, or help with development in general, go to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.</div>Yanhttps://wiki.archiveteam.org/index.php?title=Ftp-gov&diff=26742Ftp-gov2017-01-02T16:25:22Z<p>Yan: fix infobox</p>
<hr />
<div>{{Infobox project<br />
| title = US government FTP sites<br />
| logo = Great Seal of the United States.png<br />
| image = Government data.jpg<br />
| project_status = {{endangered}}<br />
| archiving_status = {{inprogress}}<br />
| irc = cheetoflee<br />
| tracker = [http://tracker.archiveteam.org/ftpdisco/ discovery]<br/>[http://tracker.archiveteam.org/ftp-gov download]<br />
| source = [https://github.com/ArchiveTeam/ftp-gov-grab ftp-gov-grab]<br />
}}<br />
We are backing up 750+ FTP sites hosted at .MIL and .GOV sites.<br />
<br />
== How can I help? ==<br />
No script has been completed yet. For now, join the IRC channel {{IRC|cheetoflee}} to be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
Until the scripts are written, you can start up a [[Warrior]] and select ''ArchiveTeam's Choice''. This project is going to be our highest priority, so your Warrior will run the scripts as soon as they are released.<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may prioritize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and run <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the update has finished, relaunch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and, hopefully, remain available forever. However, storing it costs thousands of dollars in the long run. If you can afford it, please consider donating to the Internet Archive so that this piece of history can be kept for us all: http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help with other projects, learn more about ArchiveTeam, or help with development in general, go to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.</div>Yanhttps://wiki.archiveteam.org/index.php?title=FTP_GOV&diff=26741FTP GOV2017-01-02T16:21:19Z<p>Yan: add redirect</p>
<hr />
<div>#REDIRECT [[ftp-gov]]</div>Yanhttps://wiki.archiveteam.org/index.php?title=Ftp-gov&diff=26740Ftp-gov2017-01-02T16:19:14Z<p>Yan: add infobox</p>
<hr />
<div>{{Infobox project<br />
| title = US government FTP sites<br />
| logo = Great Seal of the United States.png<br />
| image = Government data.jpg<br />
| project_status = {{Endangered}}<br />
| archiving_status = {{Upcoming}}<br />
| irc = cheetoflee<br />
| tracker = [http://tracker.archiveteam.org/cheetoflee cheetoflee]<br />
| source = [https://github.com/ArchiveTeam/usa-gov-grab ftp-gov-grab]<br />
}}<br />
We are backing up 750+ FTP sites hosted at .MIL and .GOV sites.<br />
<br />
== How can I help? ==<br />
No script has been completed yet. For now, join the IRC channel {{IRC|cheetoflee}} to be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
Until the scripts are written, you can start up a [[Warrior]] and select ''ArchiveTeam's Choice''. This project is going to be our highest priority, so your Warrior will run the scripts as soon as they are released.<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may prioritize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and run <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the update has finished, relaunch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and, hopefully, remain available forever. However, storing it costs thousands of dollars in the long run. If you can afford it, please consider donating to the Internet Archive so that this piece of history can be kept for us all: http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help with other projects, learn more about ArchiveTeam, or help with development in general, go to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.</div>Yanhttps://wiki.archiveteam.org/index.php?title=Ftp-gov&diff=26739Ftp-gov2017-01-02T16:16:37Z<p>Yan: link IRC</p>
<hr />
<div>We are backing up 750+ FTP sites hosted at .MIL and .GOV sites.<br />
<br />
== How can I help? ==<br />
No script has been completed yet. For now, join the IRC channel {{IRC|cheetoflee}} to be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
Until the scripts are written, you can start up a [[Warrior]] and select ''ArchiveTeam's Choice''. This project is going to be our highest priority, so your Warrior will run the scripts as soon as they are released.<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may prioritize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and run <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the update has finished, relaunch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and, hopefully, remain available forever. However, storing it costs thousands of dollars in the long run. If you can afford it, please consider donating to the Internet Archive so that this piece of history can be kept for us all: http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help with other projects, learn more about ArchiveTeam, or help with development in general, go to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.</div>Yanhttps://wiki.archiveteam.org/index.php?title=USA-Gov&diff=26738USA-Gov2017-01-02T16:15:57Z<p>Yan: make this a generic site infobox</p>
<hr />
<div>{{Infobox project<br />
| title = USA.gov<br />
| image = usa_gov_screenshot.png<br />
| logo = usa_gov_logo.png<br />
| URL = http://www.usa.gov<br />
| project_status = {{Endangered}}<br />
| archiving_status = {{Upcoming}}<br />
}}<br />
'''USA.gov''' is the official website of the United States government. <s>It is unlikely to go down soon, but</s> given the upcoming presidential change in 2017, it is a good idea to create an archive before it all changes.<br />
<br />
== Site Areas ==<br />
The following sites and areas have been identified so far; discovery is under way and jobs are being created for them.<br />
<br />
=== Index ===<br />
<br />
USA.gov offers an index of more than 10,000 links to official government information. The index is categorized by services and common topics, and can be accessed through five audience gateways: Businesses and Nonprofits, Citizens, Federal Employees, Government to Government (for state, local, and tribal governments), and Visitors to the U.S.<br />
<br />
=== Frequently Asked Questions ===<br />
USA.gov's Frequently Asked Questions (FAQs) database contains thousands of answers to the questions the public asks most via USA.gov or the contact center at 1 (800) FED-INFO. For more than 30 years, the contact center has been a source for answers to questions about consumer problems and government services.<br />
<br />
=== URL Shortening ===<br />
A URL shortening service, go.USA.gov, is available to users who have a .gov email address (only .gov URLs may be submitted for shortening through this service). The service generates a random string following go.USA.gov/ which redirects the user to the longer .gov URL stored in the system.<br />
<br />
=== Gobierno ===<br />
<br />
A part of USA.gov, GobiernoUSA.gov pulls together all of the U.S. government’s Spanish-language websites and makes them easily accessible to the public in one central location. <br />
GobiernoUSA.gov features more than 900 external links and provides access to more than 125,000 Government pages in Spanish. Although most of the resources are federal, the site also links to Spanish-language content provided by 42 states, the District of Columbia, the Commonwealth of Puerto Rico, and local government websites.<br />
<br />
== Discovery ==<br />
<br />
It is still to be decided what is in and out of scope.<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=Ftp-gov&diff=26737Ftp-gov2017-01-02T16:15:15Z<p>Yan: add lede</p>
<hr />
<div>We are backing up 750+ FTP sites hosted at .MIL and .GOV sites.<br />
<br />
== How can I help? ==<br />
No script has been completed yet. For now, join the IRC channel #cheetoflee to be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
Until the scripts are written, you can start up a [[Warrior]] and select ''ArchiveTeam's Choice''. This project is going to be our highest priority, so your Warrior will run the scripts as soon as they are released.<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may prioritize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and run <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the update has finished, relaunch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and, hopefully, remain available forever. However, storing it costs thousands of dollars in the long run. If you can afford it, please consider donating to the Internet Archive so that this piece of history can be kept for us all: http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help with other projects, learn more about ArchiveTeam, or help with development in general, go to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.</div>Yanhttps://wiki.archiveteam.org/index.php?title=Ftp-gov&diff=26736Ftp-gov2017-01-02T16:13:14Z<p>Yan: moved ftp-gov specific text from USA-Gov</p>
<hr />
<div>== How can I help? ==<br />
No script has been completed yet. For now, join the IRC channel #cheetoflee to be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
Until the scripts are written, you can start up a [[Warrior]] and select ''ArchiveTeam's Choice''. This project is going to be our highest priority, so your Warrior will run the scripts as soon as they are released.<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may prioritize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and run <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the update has finished, relaunch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and, hopefully, remain available forever. However, storing it costs thousands of dollars in the long run. If you can afford it, please consider donating to the Internet Archive so that this piece of history can be kept for us all: http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help with other projects, learn more about ArchiveTeam, or help with development in general, go to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.</div>Yanhttps://wiki.archiveteam.org/index.php?title=Government_Backup&diff=26735Government Backup2017-01-02T16:12:49Z<p>Yan: /* Archive Team FTP Backup */ redlink ftp-gov project page</p>
<hr />
<div>__NOTOC__<br />
[[Image:Government data.jpg|300px|right]]<br />
<br />
''The US Government has an awful lot of data, and it's in a lot of places.'' In 2016, elections were held that indicated deep sea changes in goals and ideals (although previous transitions have always contained such changes). Inspired by this, a number of groups and efforts have risen up to ensure backups of all government data possible are made off-site.<br />
<br />
'''This page contains overviews of the effort by all the teams.'''<br />
<br />
=== Internet Archive ===<br />
<br />
[[Internet Archive]] has two teams, [[Wayback Machine|Wayback]] and [[Archive-It]] ([https://archive-it.org/ archive-it.org]), working through listings of government websites and data stores. They are working internally using Internet Archive's crawlers and environment.<br />
<br />
The results of these internal efforts are saved in [https://archive.org/details/EndOfTerm2016WebCrawls this collection]. (Note that other efforts, such as Archive Team's, exist under this collection as Sub-Collections.)<br />
<br />
=== #DATAREFUGE ===<br />
<br />
The [[DataRefuge|Data Refuge]] project ([http://www.ppehlab.org/datarefuge ppehlab.org/datarefuge]) maintains [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 a Google spreadsheet] of climate datasets.<br />
<br />
=== Archive Team FTP Backup ===<br />
<br />
The [[ftp-gov|Archive Team project]] is backing up 750+ FTP sites hosted on .MIL and .GOV domains. Progress can be tracked [http://tracker.archiveteam.org/ftpdisco/ here] (discovery phase) and [http://tracker.archiveteam.org/ftp-gov/ here] (download phase). The results of this download are being sent to [https://archive.org/details/archiveteam_ftpgov this collection].<br />
<br />
=== Archive Team General Websites Download ===<br />
<br />
Besides the FTP data download, Archive Team is also doing a general download (where possible) of many crawlable government websites, such as [[USA-Gov|usa.gov]].<br />
<br />
<br />
<br />
<br />
== Internet Archive Statements ==<br />
<br />
* [http://blog.archive.org/2016/11/09/us-election-results/ US Election Results] - Surprise at the outcome of the election and a call to keep libraries open.<br />
* [http://blog.archive.org/2016/11/11/contribute-to-the-2016-u-s-presidential-election-web-archive/ Please Help Build the 2016 End-of-Term Archive] - A call for assistance and volunteers<br />
* [http://blog.archive.org/2016/11/29/help-us-keep-the-archive-free-accessible-and-private/ Help Us Keep the Archive Free, Accessible, and Reader Private] - First entry indicating that the election has influenced efforts to back up the archive in Canada.<br />
* [http://blog.archive.org/2016/12/03/faqs-about-the-internet-archive-canada/ FAQs about the Internet Archive Canada] - Much needed clarification about the mirroring in Canada of the Internet Archive.<br />
* [http://blog.archive.org/2016/12/06/internet-archive-canada-and-national-security-letter-in-the-news-roundup/ Internet Archive Canada and National Security Letter in the News] - Roundup of press mentions about the mirroring efforts.<br />
* [http://blog.archive.org/2016/12/15/preserving-u-s-government-websites-and-data-as-the-obama-term-ends/ Preserving U.S. Government Websites and Data as the Obama Term Ends] - Notes by the head of Archive-It about efforts to run the End of Term archiving.<br />
* [http://blog.archive.org/2016/12/17/robots-txt-gov-mil-websites/ Robots.txt Files and Archiving .gov and .mil Websites] - The Internet Archive will no longer follow robots.txt directives on .gov and .mil sites.<br />
* [http://blog.archive.org/2016/12/20/would-like-to-archive-government-web-services-not-just-web-sites-please-help/ Would Like to Archive Government Web Services, not just Web Sites– Please help] - Additional call to archive Government Web Services, not just Websites.<br />
<br />
== Notable Press Mentions and References ==<br />
<br />
''Note that the story oscillates between "Internet Archive is adding a mirror in Canada" and "Internet Archive is Moving to Canada".'' The actuality, for anyone viewing this page coming in cold, is that the Internet Archive has been building a mirror in Canada for a significant period of time and has a fully-functioning facility in Canada that has been a presence of some sort for nearly a decade as of 2016. The current effort was merely a speeding up of an inevitable timetable.<br />
<br />
* [http://www.theverge.com/2016/11/29/13778188/internet-archive-of-canada-backup-trump-surveillance-censorship The Internet Archive is building a Canadian copy to protect itself from Trump], The Verge, November 29, 2016<br />
* [http://www.nbcnews.com/news/us-news/internet-archive-web-s-warehouse-creating-trump-era-copy-canada-n689916 Internet Archive, Web's Warehouse, Creating Trump-Era Copy in Canada], NBC News, November 29, 2016<br />
* [http://www.dailykos.com/story/2016/11/30/1605487/-The-Internet-Archive-is-Moving-to-Canada The Internet Archive is "Moving to Canada"], The Daily Kos, November 30, 2016<br />
* [http://gothamist.com/2016/11/30/even_the_internet_is_getting_ready.php Even The Internet Archive Is Moving To Canada Because Of Trump], Gothamist, November 30, 2016<br />
* [http://www.huffingtonpost.ca/2016/11/30/archive-org-canada-trump_n_13330492.html Archive.org Moving To Canada Over Trump Censorship Fears], Huffington Post Canada, November 30, 2016</div>Yanhttps://wiki.archiveteam.org/index.php?title=USA-Gov&diff=26734USA-Gov2017-01-02T16:10:53Z<p>Yan: rm incorrect cats</p>
<hr />
<div>{{Infobox project<br />
| title = USA Government<br />
| image = usa_gov_screenshot.png<br />
| logo = usa_gov_logo.png<br />
| URL = http://www.usa.gov<br />
| project_status = {{Endangered}}<br />
| archiving_status = {{Upcoming}}<br />
| irc = cheetoflee<br />
| tracker = [http://tracker.archiveteam.org/cheetoflee cheetoflee]<br />
| source = [https://github.com/ArchiveTeam/usa-gov-grab usa-gov-grab]<br />
}}<br />
'''USA.gov''' is the official website of the United States government. <s>It is unlikely to go down soon, but</s> given the upcoming presidential change in 2017, it is a good idea to create an archive before it all changes.<br />
<br />
== Site Areas ==<br />
The following sites and areas have been identified so far; they are still being discovered, and jobs are being created for them.<br />
<br />
=== Index ===<br />
<br />
USA.gov offers an index of more than 10,000 links to official government information. The index is categorized by services and common topics, and can be accessed through five audience gateways: Businesses and Nonprofits, Citizens, Federal Employees, Government to Government (for state, local, and tribal governments), and Visitors to the U.S.<br />
<br />
=== Frequently Asked Questions ===<br />
USA.gov's Frequently Asked Questions (FAQs) database contains thousands of answers to the questions the public asks most via USA.gov or the contact center at 1 (800) FED-INFO. For more than 30 years, the contact center has been a source for answers to questions about consumer problems and government services.<br />
<br />
=== URL Shortening ===<br />
A URL shortening service, go.USA.gov, is available to users who have a .gov email address (only .gov URLs may be submitted for shortening through this service). The service generates a random string following go.USA.gov/, which redirects the user to the longer .gov URL stored in the system.<br />
<br />
=== Gobierno ===<br />
<br />
A part of USA.gov, GobiernoUSA.gov pulls together all of the U.S. government’s Spanish-language websites and makes them easily accessible to the public in one central location. <br />
GobiernoUSA.gov features more than 900 external links and provides access to more than 125,000 Government pages in Spanish. Although most of the resources are federal, the site also links to Spanish-language content provided by 42 states, the District of Columbia, the Commonwealth of Puerto Rico, and local government websites.<br />
<br />
== Discovery ==<br />
<br />
It is still to be decided what is in and out of scope.<br />
<br />
== How can I help? ==<br />
At this time no script has been completed. For now, come into the IRC channel #cheetoflee and be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and select ''ArchiveTeam's Choice'' for now, until the scripts are written (this is going to be our highest priority, so you will run them as soon as they are released!).<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may priorize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the updating has finished, re-launch the script.<br />
|}<br />
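<br />
The graceful-stop and update steps described in the box above can be sketched as a few shell helpers. This is only a minimal sketch, not part of the project's own scripts: the function names and the <code>GRAB_DIR</code> variable are made up for illustration, and it assumes the repository was cloned into a folder named ftp-gov-grab.<br />

```shell
#!/bin/sh
# Hypothetical helpers; GRAB_DIR and the function names are assumptions,
# not something defined by the ftp-gov-grab project.
GRAB_DIR="${GRAB_DIR:-ftp-gov-grab}"

request_stop() {
    # Create the empty STOP file so the script finishes its current
    # item(s) and then exits on its own.
    touch "$GRAB_DIR/STOP"
}

clear_stop() {
    # Remove the STOP file before starting the script again.
    rm -f "$GRAB_DIR/STOP"
}

update_code() {
    # Fetch the latest project code; run this only once the script
    # is no longer running.
    (cd "$GRAB_DIR" && git pull https://github.com/ArchiveTeam/ftp-gov-grab)
}
```

A typical sequence would be <code>request_stop</code>, waiting for the script to exit, then <code>update_code</code> and <code>clear_stop</code> before re-launching it.<br />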
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by the ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and be available – hopefully – forever. However, storing it costs thousands of dollars in the long run. So, if you can afford it, please consider donating to the Internet Archive, so that this piece of history can be kept for us all. http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help in other projects, want to learn more about ArchiveTeam, or even help in development in general, navigate to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=User:Yan&diff=26714User:Yan2016-12-29T20:56:55Z<p>Yan: zero page</p>
<hr />
<div></div>Yanhttps://wiki.archiveteam.org/index.php?title=Talk:Government_Backup&diff=26713Talk:Government Backup2016-12-29T20:56:37Z<p>Yan: Created page with "=== USA-Gov === We should probably split up USA-Gov and move the bottom half to e.g. ftp-gov, since it's about the archiving of government FTP sites, while the top hal..."</p>
<hr />
<div>=== USA-Gov ===<br />
We should probably split up [[USA-Gov]] and move the bottom half to e.g. [[ftp-gov]], since it's about the archiving of government FTP sites, while the top half describes just [https://www.usa.gov/ usa.gov]. Any objections? --[[User:Yan|Yan]] ([[User talk:Yan|talk]]) 15:56, 29 December 2016 (EST)</div>Yanhttps://wiki.archiveteam.org/index.php?title=Archiving_projects&diff=26712Archiving projects2016-12-29T20:34:15Z<p>Yan: add 2 redlinks</p>
<hr />
<div>List of '''archiving projects''':<br />
* [[archive.is]] (formerly [[archive.today]]) http://archive.is/<br />
* European Archive http://www.europarchive.org/<br />
* Federal Web Harvests http://www.webharvest.gov/collections/<br />
* [[Internet Archive]]<br />
* Internet Memory Foundation http://internetmemory.org/<br />
* Newsletter Archive http://newsletterarchive.org/ offline<br />
* Pandora Archive http://pandora.nla.gov.au/<br />
* [[Perma.cc]] https://perma.cc/<br />
* [[TEXTFILES]]<br />
* [http://thearchivers.blogspot.com/ THE ARCHiVERS]<br />
* UK Web Archive<br />
* [[WebCitation]]<br />
* Webdesign Timeline http://www.designtimeline.org<br />
<br />
== External links ==<br />
* {{url|1=http://www.dmoz.org/Computers/Internet/History/Archives/}}<br />
* http://en.wikipedia.org/wiki/List_of_Web_Archiving_Initiatives<br />
* {{url|1=http://www.ifs.tuwien.ac.at/~aola/links/WebArchiving.html}}<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=Government_Backup&diff=26711Government Backup2016-12-29T20:26:53Z<p>Yan: add some redlinks; will try to create later</p>
<hr />
<div>[[Image:Government data.jpg|300px|right]]<br />
<br />
''The US Government has an awful lot of data, and it's in a lot of places.'' In 2016, elections were held that signaled a sea change in goals and ideals (although previous transitions have always involved such changes). Inspired by this, a number of groups and efforts have sprung up to ensure that off-site backups are made of as much government data as possible.<br />
<br />
'''This page contains overviews of the effort by all the teams.'''<br />
<br />
=== Internet Archive ===<br />
<br />
[[Internet Archive]] has two teams, [[Wayback Machine|Wayback]] and [[Archive-It]] ([https://archive-it.org/ archive-it.org]), working through listings of government websites and data stores. They are working internally using Internet Archive's crawlers and environment.<br />
<br />
=== #DATAREFUGE ===<br />
<br />
The [[DataRefuge|Data Refuge]] project ([http://www.ppehlab.org/datarefuge ppehlab.org/datarefuge]) has [https://docs.google.com/spreadsheets/d/12-__RqTqQxuxHNOln3H5ciVztsDMJcZ2SVs1BrfqYCc/edit#gid=0 the following Google document] about climate datasets.<br />
<br />
=== Archive Team FTP Backup ===<br />
<br />
The [[USA-Gov|Archive Team project]] is backing up 750+ FTP sites hosted on .MIL and .GOV domains. Its two phases can be tracked [http://tracker.archiveteam.org/ftpdisco/ here] (discovery phase) and [http://tracker.archiveteam.org/ftp-gov/ here] (download phase). The results of this download are being sent to [https://archive.org/details/archiveteam_ftpgov this collection].<br />
<br />
=== Archive Team General Websites Download ===<br />
<br />
Besides the FTP data download, Archive Team is also doing a general download (where possible) of many crawlable government websites, such as [[USA-Gov|usa.gov]].</div>Yanhttps://wiki.archiveteam.org/index.php?title=Current_Projects&diff=26710Current Projects2016-12-29T19:47:28Z<p>Yan: /* Warrior based projects */ add ftp-gov pages</p>
<hr />
<div>__NOTOC__<br />
== Archive Team recruiting ==<br />
* [[Dev|Want to code for Archive Team? Here's a starting point.]]<br />
<br />
== Warrior based projects ==<br />
* [[Government Backup]] / [[USA-Gov]]: Oh, just everything the United States government hosts on FTP. '''IRC Channel {{IRC|cheetoflee}}'''.<br />
* [[NUjij]]: Dutch discussion platform being killed off on September 12, 2016. '''IRC Channel {{IRC|archiveteam}}'''.<br />
* [[Yahoo! Answers]]: It's Yahoo. It's been acquired. More questions? '''IRC Channel {{IRC|noanswers}}'''.<br />
* [[Bayimg]]: [[The Pirate Bay|TPB]]'s image hosting is also being flooded. '''IRC Channel {{IRC|archiveteam}}'''.<br />
* [[Fotolog.com]]: Photo sharing and social networking site <s>closing on February 20, 2016</s>. Still saving. '''IRC Channel {{IRC|fotologout}}.'''<br />
* [[Google Code]]: Google likes [[Github]] more. Shut down on January 26, 2016, but we got a chance to go on saving. '''IRC Channel {{IRC|googlecodeblue}}'''.<br />
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.<br />
* [[PDF 2016]]: We've been given a long list of PDF files. We'll get'em. '''IRC Channel {{IRC|pdflush}}.'''<br />
* [[yuku]]: Lately yuku is very unstable and hosting thousands of forums. '''IRC Channel {{IRC|archiveteam}}'''.<br />
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.<br />
* [[WikiTeam]] ([[WARC]] format): Saving wikis and their external links for the Wayback Machine. '''IRC Channel {{IRC|wikiteam}}'''.<br />
<br />
=== Scripts only ===<br />
* [[FTP]]: Download all the FTP sites! '''IRC Channel {{IRC|effteepee}}'''.<br />
<br />
Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br><br />
What's on: [http://tracker.archiveteam.org/ online tracker].<br><br />
<!--Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].--><br />
<br />
== Manual projects ==<br />
* [[AOL]]: Climbing into the decaying walled garden. '''IRC Channel {{IRC|aohell}}'''<br />
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.<br />
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''.<br />
* [[Froogle]]: Let's do a census of all of Google's products. '''IRC Channel {{IRC|froogle}}'''.<br />
* [[FTP]]: Help us find all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.<br />
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.<br />
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.<br />
* [[NewsGrabber]]: Saving all news articles. Help with server power or by finding more news sites. '''IRC Channel {{IRC|newsgrabber}}'''.<br />
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.<br />
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.<br />
* [[WikiTeam]] (XML dumps): Exporting Mediawiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.<br />
* [[Woohoo]]: Yahoo is untrustworthy, let's do a census of all their products. '''IRC Channel {{IRC|woohoo}}'''.<br />
<br />
== Upcoming projects ==<br />
<!-- Top priority: could disappear anytime now --><br />
<!-- Shutting down, definite deadline given --><br />
<!-- Shutting down, vague deadline given --><br />
<!-- Shutting down, no deadline given --><br />
<!-- Archiving the archives --><br />
* [[Vine]]: [[Twitter]] killed the app, but videos likely won't last forever. '''IRC Channel {{IRC|vinewhine}}'''.<br />
<!-- Misc. projects (unmaintained sites, distrust in owners) --><br />
* [[Flickr]]: [[Yahoo!]] decided to kill it, now Yahoo has been acquired. '''IRC Channel {{IRC|flickrfckr}}'''.<br />
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired. '''IRC Channel {{IRC|stumblr}}'''.<br />
<br />
== Proposed projects ==<br />
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). --><br />
* [[PhotoSynth]]: Microsoft is [http://www.winbeta.org/news/photosynth-to-shut-down-in-february-2017 shutting down] their panorama sharing platform in Feb. 2017<br />
* [[DevPort]]: This [http://developerportfolio.com/ portfolio SaaS provider] has [http://www.lowendtalk.com/discussion/65135/need-some-help-saas-provider-is-dead-but-my-site-is-still-up-how-should-i-grab-it reportedly] been having infrastructure issues, and removed their social media accounts. Possible impending shutdown.<br />
* [[Picasa|Picasa Web Albums]]: Merging into Google Photos on May 1, 2016. '''IRC Channel {{IRC|picasso}}'''.<br />
* [[Google Drive]] web hosting to be discontinued on August 31, 2016.<br />
* [[RadioShack]]: RadioShack is going bankrupt. '''IRC Channel {{IRC|unshackled}}'''.<br />
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.<br />
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07). <br />
* [[SoundCloud]]: Might get blown away by Hurricane Debt and Hurricane Copyright. '''IRC Channel {{IRC|soundclown}}'''.<br />
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.<br />
* [[Blipfoto]]: <s>Went into liquidation on March 11, 2015, future uncertain.</s> Acquired on March 25, 2015, we're still grabbing it anyway. '''IRC Channel {{IRC|fotofinish}}'''.<br />
<br />
== Recently finished ==<br />
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people --><br />
* [[Panoramio]]: [[Google]] is taking one last picture of Panoramio getting executed behind the shed on November 4, 2016. '''IRC Channel {{IRC|paranormio}}'''.<br />
<br />
<!--== Hiatus / Missed the Mark ==<br />
--><br />
<small>ArchiveTeam uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[IRC|More info]]</small></div>Yanhttps://wiki.archiveteam.org/index.php?title=USA-Gov&diff=26699USA-Gov2016-12-28T18:19:01Z<p>Yan: /* Running the script manually */ fix github link</p>
<hr />
<div>{{Infobox project<br />
| title = USA Government<br />
| image = usa_gov_screenshot.png<br />
| logo = usa_gov_logo.png<br />
| URL = http://www.usa.gov<br />
| project_status = {{Endangered}}<br />
| archiving_status = {{Upcoming}}<br />
| irc = cheetoflee<br />
| tracker = [http://tracker.archiveteam.org/cheetoflee cheetoflee]<br />
| source = [https://github.com/ArchiveTeam/usa-gov-grab usa-gov-grab]<br />
}}<br />
'''USA.gov''' is the official website of the United States government. <s>It is unlikely to go down soon, but</s> given the upcoming presidential change in 2017, it is a good idea to create an archive before it all changes.<br />
<br />
== Site Areas ==<br />
The following sites and areas have been identified so far; they are still being discovered, and jobs are being created for them.<br />
<br />
=== Index ===<br />
<br />
USA.gov offers an index of more than 10,000 links to official government information. The index is categorized by services and common topics, and can be accessed through five audience gateways: Businesses and Nonprofits, Citizens, Federal Employees, Government to Government (for state, local, and tribal governments), and Visitors to the U.S.<br />
<br />
=== Frequently Asked Questions ===<br />
USA.gov's Frequently Asked Questions (FAQs) database contains thousands of answers to the questions the public asks most via USA.gov or the contact center at 1 (800) FED-INFO. For more than 30 years, the contact center has been a source for answers to questions about consumer problems and government services.<br />
<br />
=== URL Shortening ===<br />
A URL shortening service, go.USA.gov, is available to users who have a .gov email address (only .gov URLs may be submitted for shortening through this service). The service generates a random string following go.USA.gov/, which redirects the user to the longer .gov URL stored in the system.<br />
<br />
=== Gobierno ===<br />
<br />
A part of USA.gov, GobiernoUSA.gov pulls together all of the U.S. government’s Spanish-language websites and makes them easily accessible to the public in one central location. <br />
GobiernoUSA.gov features more than 900 external links and provides access to more than 125,000 Government pages in Spanish. Although most of the resources are federal, the site also links to Spanish-language content provided by 42 states, the District of Columbia, the Commonwealth of Puerto Rico, and local government websites.<br />
<br />
== Discovery ==<br />
<br />
It is still to be decided what is in and out of scope.<br />
<br />
== How can I help? ==<br />
At this time no script has been completed. For now, come into the IRC channel #cheetoflee and be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and select ''ArchiveTeam's Choice'' for now, until the scripts are written (this is going to be our highest priority, so you will run them as soon as they are released!).<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may priorize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/ftp-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd ftp-gov-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>ftp-gov-grab</code>. After the updating has finished, re-launch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by the ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and be available – hopefully – forever. However, storing it costs thousands of dollars in the long run. So, if you can afford it, please consider donating to the Internet Archive, so that this piece of history can be kept for us all. http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help in other projects, want to learn more about ArchiveTeam, or even help in development in general, navigate to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.<br />
<br />
{{Navigation box}}<br />
<br />
[[Category:Q&A]]<br />
[[Category:Yahoo!]]</div>Yanhttps://wiki.archiveteam.org/index.php?title=USA-Gov&diff=26698USA-Gov2016-12-28T18:18:20Z<p>Yan: /* Running the script manually */ fix github link</p>
<hr />
<div>{{Infobox project<br />
| title = USA Government<br />
| image = usa_gov_screenshot.png<br />
| logo = usa_gov_logo.png<br />
| URL = http://www.usa.gov<br />
| project_status = {{Endangered}}<br />
| archiving_status = {{Upcoming}}<br />
| irc = cheetoflee<br />
| tracker = [http://tracker.archiveteam.org/cheetoflee cheetoflee]<br />
| source = [https://github.com/ArchiveTeam/usa-gov-grab usa-gov-grab]<br />
}}<br />
'''USA.gov''' is the official website of the United States government. <s>It is unlikely to go down soon, but</s> given the upcoming presidential change in 2017, it is a good idea to create an archive before it all changes.<br />
<br />
== Site Areas ==<br />
The following sites and areas have been identified so far; they are still being discovered, and jobs are being created for them.<br />
<br />
=== Index ===<br />
<br />
USA.gov offers an index of more than 10,000 links to official government information. The index is categorized by services and common topics, and can be accessed through five audience gateways: Businesses and Nonprofits, Citizens, Federal Employees, Government to Government (for state, local, and tribal governments), and Visitors to the U.S.<br />
<br />
=== Frequently Asked Questions ===<br />
USA.gov's Frequently Asked Questions (FAQs) database contains thousands of answers to the questions the public asks most via USA.gov or the contact center at 1 (800) FED-INFO. For more than 30 years, the contact center has been a source for answers to questions about consumer problems and government services.<br />
<br />
=== URL Shortening ===<br />
A URL shortening service, go.USA.gov, is available to users who have a .gov email address (only .gov URLs may be submitted for shortening through this service). The service generates a random string following go.USA.gov/, which redirects the user to the longer .gov URL stored in the system.<br />
<br />
=== Gobierno ===<br />
<br />
A part of USA.gov, GobiernoUSA.gov pulls together all of the U.S. government’s Spanish-language websites and makes them easily accessible to the public in one central location. <br />
GobiernoUSA.gov features more than 900 external links and provides access to more than 125,000 Government pages in Spanish. Although most of the resources are federal, the site also links to Spanish-language content provided by 42 states, the District of Columbia, the Commonwealth of Puerto Rico, and local government websites.<br />
<br />
== Discovery ==<br />
<br />
It is still to be decided what is in and out of scope.<br />
<br />
== How can I help? ==<br />
At this time no script has been completed. For now, come into the IRC channel #cheetoflee and be the first to know how you can get involved!<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and select ''ArchiveTeam's Choice'' for now, until the scripts are written (this is going to be our highest priority, so you will run them as soon as they are released!).<!-- (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may priorize another project.)--><br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/usa-gov-grab github.com/ArchiveTeam/ftp-gov-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd usa-gov-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>usa-gov-grab</code>. After the updating has finished, re-launch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by the ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and be available – hopefully – forever. However, storing it costs thousands of dollars in the long run. So, if you can afford it, please consider donating to the Internet Archive, so that this piece of history can be kept for us all. http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help in other projects, want to learn more about ArchiveTeam, or even help in development in general, navigate to the [[Main Page]] of this wiki; from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.<br />
<br />
{{Navigation box}}<br />
<br />
[[Category:Q&A]]<br />
[[Category:Yahoo!]]</div>Yanhttps://wiki.archiveteam.org/index.php?title=TwitPic&diff=20462TwitPic2014-10-18T15:57:26Z<p>Yan: /* Export Tool Bugs */ name link</p>
<hr />
<div>{{Infobox project<br />
| title = TwitPic<br />
| logo = Twitpic-logo.png<br />
| image = Twitpic - Share photos on Twitter 1294869067903.png<br />
| description = TwitPic main page on 2011-01-12<br />
| URL = http://twitpic.com<br />
| project_status = {{closing}}<br />
| archiving_status = {{in progress}}<br />
| irc = quitpic<br />
| tracker = [http://tracker.archiveteam.org/twitpicdisco twitpicdisco], [http://tracker.archiveteam.org/twitpic twitpic], [http://tracker.archiveteam.org/twitpic-cloudfront twitpic-cloudfront]<br />
| source = [https://github.com/ArchiveTeam/twitpic-discovery twitpic-discovery], [https://github.com/ArchiveTeam/twitpic-grab twitpic-grab], [https://github.com/ArchiveTeam/twitpic-items twitpic-items], [https://github.com/ArchiveTeam/twitpic-cloudfront-grab twitpic-cloudfront-grab]<br />
}}<br />
<br />
'''TwitPic''' is an image hosting service. The service is designed mainly for Twitter users: images uploaded to the service are given short URLs for use in Twitter posts. Twitter has a 140-character post limit, and the average Twitpic URL is 25 or 26 characters long.<br />
<br />
On September 4, 2014 TwitPic [http://blog.twitpic.com/2014/09/twitpic-is-shutting-down/ announced] they were shutting down on September 25. On September 18, 2014, TwitPic [https://twitter.com/TwitPic/status/512705809696837632 announced] that they'd been acquired and would "live on". However, on October 16, 2014, Twitpic announced that "agreeable terms could not be met" and that the service would be [http://techcrunch.com/2014/10/16/twitpic-couldnt-find-an-acquirer-will-shut-down-after-all-on-oct-25th/ shutting down on October 25th].<br />
<br />
== Shutdown ==<br />
<br />
[[File:Twitpic-2014-10-16-at-8-25-08.png|thumb|right|400px|Twitpic's erratic demise]]<br />
<br />
''Posted on September 4, 2014 by Noah Everett on [http://blog.twitpic.com blog.twitpic.com]:''<br />
<br />
"Twitpic will be shutting down September 25th. You will be able to export all your photos and videos. We’ll let everyone know when this feature is live in the next few days.<br />
<br />
This is an unexpected and hard announcement for us to make and we want to lay out what led us to this decision.<br />
<br />
A few weeks ago Twitter contacted our legal demanding that we abandon our trademark application or risk losing access to their API. This came as a shock to us since Twitpic has been around since early 2008, and our trademark application has been in the USPTO since 2009.<br />
<br />
Here is some backstory on the history of our trademark:<br />
<br />
We originally filed for our trademark in 2009 and our first use in commerce dates back to February 2008 when we launched. We encountered several hurdles and difficulties in getting our trademark approved even though our first use in commerce predated other applications, but we worked through each challenge and in fact had just recently finished the last one. During the “published for opposition” phase of the trademark is when Twitter reached out to our counsel and implied we could be denied access to their API if we did not give up our mark.<br />
<br />
Unfortunately we do not have the resources to fend off a large company like Twitter to maintain our mark which we believe whole heartedly is rightfully ours. Therefore, we have decided to shut down Twitpic.<br />
<br />
On a personal note I (@noaheverett) want to thank you for letting us be a part of your life and helping you share your experiences over the past 6 years, it’s truly been an honor. I have learned so much through running Twitpic over the years. Through the many mistakes I’ve made and lessons learned, to the bad days and the great days. Thank you again everyone…I will miss and cherish the days of Twitpic we had together."<br />
<br />
=== Won't (?) shut down ===<br />
<br />
Twitpic [https://twitter.com/TwitPic/status/512705809696837632 writes] on Twitter, on September 18, 2014:<br />
<br />
:"We're happy to announce we've been acquired and Twitpic will live on! We will post more details as we can disclose them"<br />
<br />
However, ArchiveTeam goes on downloading TwitPic, for safety.<br />
<br />
=== IT INDEED WILL ===<br />
<br />
'''UPDATE''' on [http://blog.twitpic.com blog.twitpic.com]:<br />
<br />
:"It’s with a heavy heart that I announce again that Twitpic will be shutting down on October 25th. We worked through a handful of potential acquirers and exhausted all potential options. We were almost certain we had found a new home for Twitpic (hence our previous tweet), but agreeable terms could not be met. Normally we wouldn’t announce something like that prematurely but we were hoping to let our users know as soon as possible that Twitpic was living on.<br />
<br />
:I’m sincerely sorry (and embarrassed) for the circumstances leading up to this, from our initial shutdown announcement to an acquisition false alarm.<br />
<br />
:You can export your data and photos at: http://twitpic.com/account/settings "<br />
<br />
=== But wait! There's more! ===<br />
<br />
On October 17th, 2014, Twitpic began blocking public access to images,<ref>https://twitter.com/textfiles/status/523160196672409600</ref> replacing them with a shutdown notice. Comments are still available for now, but the images are not.<br />
<br />
== Site structure ==<br />
<br />
Image page urls:<br />
<br />
http://twitpic.com/******<br />
http://twitpic.com/*****<br />
http://twitpic.com/****<br />
http://twitpic.com/***<br />
http://twitpic.com/**<br />
http://twitpic.com/*<br />
where * = 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, a, b, c, d, e, f, g, h, i, j, k, l, m, n, o, p, q, r, s, t, u, v, w, x, y, z<br />
<br />
where ****** consists of up to 6 alphanumeric characters. Leading zeros are irrelevant, e.g.: /000joe = /0joe = /joe. The IDs behave like incrementing numbers in the base-36 numeral system.<br />
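<br />
The ID scheme above can be sketched in Python. This is only an illustration, assuming the IDs really are plain base-36 numbers as the description suggests; the function names are made up for this example and are not part of any ArchiveTeam tool.<br />

```python
import string

# Digits 0-9 followed by a-z: the 36 symbols listed above.
ALPHABET = string.digits + string.ascii_lowercase


def id_to_int(image_id):
    """Interpret an image ID as a base-36 number (leading zeros are ignored)."""
    # Note: int() is case-insensitive here; the page only lists lowercase letters.
    return int(image_id, 36)


def int_to_id(number):
    """Render a non-negative integer as a base-36 ID without leading zeros."""
    if number < 0:
        raise ValueError("image IDs cannot be negative")
    if number == 0:
        return "0"
    digits = []
    while number:
        number, remainder = divmod(number, 36)
        digits.append(ALPHABET[remainder])
    return "".join(reversed(digits))


def normalize_id(image_id):
    """Strip irrelevant leading zeros by round-tripping through an integer."""
    return int_to_id(id_to_int(image_id))
```

Under these assumptions, <code>normalize_id("000joe")</code> and <code>normalize_id("0joe")</code> both give <code>joe</code>, and <code>int_to_id(id_to_int("joe") + 1)</code> gives the next ID in sequence, <code>jof</code>.<br />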
<br />
== Progress ==<br />
<br />
=== Phase 1: content discovery ===<br />
<br />
From September 5 to 6, until ArchiveTeam got banned, ~41 million of the possible ~900 million urls were discovered. The discovery was suspended.<br />
<br />
On September 6th, someone claiming to be Noah Everett showed up in #quitpic<ref>http://paste.archivingyoursh.it/raw/xesequhogi</ref>:<br />
<br />
<pre><br />
[16:21:14] <n00b957> hey guys<br />
[16:21:16] <n00b957> Noah Everett here<br />
[16:21:26] <n00b957> noticed the site was really bogging down due to ArchiveTeam requersts<br />
[16:21:30] <n00b957> *requests<br />
[16:21:55] <n00b957> didn't know what it was at first so we blocked it to continue normal site operations and users can get their data easily<br />
[16:22:27] <n00b957> just wanted to give a heads up so you don't think we are trying to be malicious<br />
[16:23:00] <n00b957> we're working on getting our export tool out the door right now<br />
[16:23:14] <n00b957> I'd like to let our users get their data off the site via that first as quickly as possible<br />
</pre><br />
<br />
Unfortunately, he left #quitpic shortly afterwards and has not responded to any of Archive Team's repeated inquiries about archiving Twitpic.<br />
<br />
=== Phase 2: content grab ===<br />
<br />
After some testing, actual content grab began on September 14. Its progress can be followed on the [http://tracker.archiveteam.org/twitpic tracker]. (One item contains 36 images and/or other elements of the image pages.)<br />
<br />
== How can I help? ==<br />
<br />
'''Important notice''': TwitPic staff may ban ArchiveTeam members' access to their site through AT tools, or completely (IP address ban), and for a long time. If you want to use TwitPic outside ArchiveTeam tools (e.g. if you have an account there and you want to access it), consider running the Warrior/script with '''low''' concurrency, or, if you're paranoid, not running it at all.<br />
<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and select ''TwitPic Phase 2'' there. (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may prioritize another project.)<br />
<br />
If you see "Project code is out of date", simply restart the warrior.<br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/twitpic-grab twitpic-grab].<br />
<br />
Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no point in increasing the concurrency. '''Note''': the higher the concurrency, the greater the chance of being banned by TwitPic staff.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
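<br />
For those who drive the script from their own tooling, the STOP-file handling can be sketched in Python. This is a minimal illustration, not part of the grab script: the helper names are hypothetical, and only the file name <code>STOP</code> and its location in the script's folder come from the instructions above.<br />

```python
from pathlib import Path

STOP_NAME = "STOP"  # file name given in the instructions above


def request_stop(script_dir):
    """Create an empty STOP file, like running `touch STOP` in script_dir."""
    stop_file = Path(script_dir) / STOP_NAME
    stop_file.touch()
    return stop_file


def stop_requested(script_dir):
    """Report whether a STOP file is currently present in script_dir."""
    return (Path(script_dir) / STOP_NAME).is_file()


def clear_stop(script_dir):
    """Remove the STOP file before the next run; do nothing if it is absent."""
    (Path(script_dir) / STOP_NAME).unlink(missing_ok=True)
```

Calling <code>clear_stop</code> before every launch avoids the common mistake of restarting the script while an old STOP file is still in place. (<code>missing_ok</code> needs Python 3.8 or newer.)<br />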
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd twitpic-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/twitpic-grab</nowiki></code>. After the update has finished, re-launch the script.<br />
<br />
=== For both Warrior and script ===<br />
<br />
If you see 403 error codes in the output of the script about files '''not on twitpic.com''' (e.g. on twimg.com or elsewhere), don't worry. That is normal and the script handles the problem. However, if the script receives 403s from twitpic.com, your script (or even your IP address) has possibly been blocked. You can retry later, but you may be banned for a long time.<br />
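<br />
The rule of thumb above can be written down as a small check, for example when skimming saved output. This is a sketch under assumptions: the function name is hypothetical, and counting subdomains of twitpic.com as "on twitpic.com" is a guess that the text above does not confirm.<br />

```python
from urllib.parse import urlsplit


def is_worrying_403(url, status):
    """Return True only for a 403 that came from twitpic.com itself."""
    if status != 403:
        return False
    host = (urlsplit(url).hostname or "").lower()
    # Assumption: subdomains of twitpic.com count as twitpic.com as well.
    return host == "twitpic.com" or host.endswith(".twitpic.com")
```

A 403 for a file on twimg.com returns <code>False</code> here (nothing to worry about), while a 403 for a twitpic.com page returns <code>True</code> (a possible block).<br />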
<br />
=== Joining us on IRC ===<br />
<br />
Whether you run the Warrior or the script, you should join our IRC channel '''#quitpic''' to catch the latest news about the project and its progress. You can also ask questions there if something doesn't work. You can use the web interface at http://chat.efnet.org:9090, or if you use a standalone IRC client, connect to irc://irc.efnet.org.<br />
<br />
=== Spread the word! ===<br />
<br />
Time is short and we must grab a lot of stuff. Furthermore, it seems that many users running a few threads each is better than one user running a lot. So, please try to get some more people to work on this project! Speak and write about it, and tell your friends. This is definitely an urgent and important project.<br />
<br />
=== Donate to IA ===<br />
<br />
Contents saved from TwitPic will be given to and stored by the [[Internet Archive]]. The amount of data is tens of terabytes. This will cost thousands of dollars to store in the long run, so if you can afford it, please donate to the Internet Archive so that the contents of TwitPic can be available forever. http://archive.org/donate<br />
<br />
== Archives ==<br />
<br />
<font color="lightgray">As TwitPic is probably not shutting down, it seems to be needless to store the downloaded data (more than 100 terabytes). However, TwitPic cannot be considered a reliable service anymore. So, archives should be stored somewhere. But the [[Internet Archive]] will probably not be willing to ingest this amount of data also present on the internet.<br />
<br />
To solve this problem, to decide where to store the data grabbed from TwitPic (and from sites in similar situation), '''Project [[Valhalla]]''' has been established. Read more about it on the linked wiki page.</font><br />
<br />
Archives of TwitPic will be stored by the [[Internet Archive]]. Please [http://archive.org/donate donate] if you can so that the costs of storing can be covered in the long run.<br />
<br />
== Download Your Data ==<br />
<br />
<font color="lightgray">User ''elise81'' [https://www.reddit.com/r/socialmedia/comments/2gdh6t/reminder_export_your_twitpic_photos_before_925/ writes] on Reddit:<br />
<br />
:"Log into Twitpic.com, click settings, scroll to bottom and click the request your data button. It takes a little while, but you'll eventually get a .zip file of all of your data."</font><br />
<br />
'''"You can export your data and photos at: http://twitpic.com/account/settings"'''<br />
<br />
When it's about ''your'' content, don't rely on ArchiveTeam's archives, as they may be incomplete, and are not made in a way that a single user's content can be extracted from them. '''Use the export-tool!'''<br />
<br />
=== Export Tool Bugs ===<br />
<br />
Twitpic's export tool is buggy, handing out seemingly empty zip files<ref>https://news.ycombinator.com/item?id=8473393</ref> and 503 errors.<ref>https://twitter.com/textfiles/status/522837349676236801</ref> The empty zip file problem can sometimes be fixed:<br />
<br />
The cause depends on the platform. On a non-Windows computer, it is probably a corrupted download (which happens far too often). On Windows, the built-in zip file handler cannot reliably handle these zip files. [http://www.7-zip.org/ 7-zip] seems to have the most success with the zip file, but other tools have worked as well.<br />
<br />
General process to follow:<br />
<br />
<ol><br />
<li>Download and install 7z</li><br />
<li>Download the zip file and rename it something short.</li><br />
<li>Open a command prompt.</li><br />
<li>Run the command <code>7z t zipfilename.zip</code> to test the archive.</li><br />
<li>If the test succeeds, run <code>7z x zipfilename.zip</code> to extract it.</li><br />
<li>Browse to the photo directory.</li><br />
<li>The pictures should be visible, along with a text file containing the metadata.</li><br />
</ol><br />
<br />
== Downloaders ==<br />
* [http://code.google.com/p/emijrp/source/browse/trunk/scrapers/twitpic.py Downloader by tag] (it saves the full resolution image and metadata: uploader, date and description)<br />
<br />
== References ==<br />
<references/><br />
<br />
== External links ==<br />
* http://twitpic.com<br />
* [http://www.geekwire.com/2014/archive-team-twitpic-blocking-us-downloading-photos-shutdown/ "Lost forever? Archive Team says TwitPic is blocking photo downloads before shutdown"]<br />
* [https://twitter.com/TwitPic/status/512705809696837632 "We're happy to announce we've been acquired and Twitpic will live on! ..."]<br />
* [http://techcrunch.com/2014/10/16/twitpic-couldnt-find-an-acquirer-will-shut-down-after-all-on-oct-25th/ Twitpic Couldn’t Find An Acquirer, Will Shut Down After All On Oct 25th]<br />
<br />
{{Navigation box}}<br />
<br />
[[Category:Image hosting services]]</div>Yanhttps://wiki.archiveteam.org/index.php?title=Deathwatch&diff=19995Deathwatch2014-09-04T18:04:24Z<p>Yan: /* Pining for the Fjords (Dying) */ add twitpic</p>
<hr />
<div>The '''Deathwatch''' is a central indicator of websites and networks that are shutting down and serves as an indicator of what happened to particular sites that shut down quickly.<br />
<br />
New sites should be added in chronological order, newest death date first. Forward-looking death dates should be added to the first list only. Sites large enough to warrant additional information will receive a dedicated page, linked from here and on [[:Category:Closing projects]].<br />
<br />
== Watchlist ==<br />
<br />
=== Getting things done ===<br />
<br />
[[Current Projects]] contains the up-to-date projects that are in progress. This small table keeps track of smaller projects by individual members.<br />
<br />
{| class="wikitable"<br />
! Website<br />
! Closing date<br />
! Project status<br />
! User<br />
! Archiving Status<br />
! Details<br />
! Archives<br />
! Archive Date<br />
! Archive Format<br />
|-<br />
| [[ArchiveBot]]<br />
| {{green|Saved}}<br />
| Downloaded website, dev and blog subdomains<br />
|<br />
|<br />
|<br />
|<br />
|<br />
| .warc.gz<br />
|-<br />
| Various<br />
| {{green|Saved}}<br />
| Downloaded website, forums, skins/plugins<br />
| [https://archive.org/search.php?query=winamp+warc]<br />
| 2013-11<br />
|<br />
|<br />
|<br />
| .warc.gz<br />
|- <br />
| [[Quick.io]] [http://www.quik.io/]<br />
| 2013-12-31<br />
| Closing<br />
| [[User:Arkiver]]<br />
| {{green|Saved}}<br />
| Downloaded the main website and the subdomains of the main website<br />
| COMING<br />
| 2013-12-13<br />
| .warc.gz<br />
|-<br />
| [[widgetbox]] [http://www.widgetbox.com/] [http://support.widgetbox.com/] [http://blog.widgetbox.com/] [http://cdn.widgetbox.com/] [http://help.widgetbox.com/] [http://pub.widgetbox.com/] [http://files.widgetbox.com/]<br />
| 2014-03-28<br />
| Closing<br />
| [[User:Arkiver]]<br />
| {{orange|In progress...}}<br />
| Downloading all the websites<br />
|<br />
| 2013-12-19 - present<br />
| .warc.gz<br />
|-<br />
| [[TechNet]] [http://technet.microsoft.com/]<br />
| 2014-09-30<br />
| Closing<br />
| [[User:Arkiver]]<br />
| {{orange|In progress...}}<br />
| Downloading full website<br />
|<br />
|<br />
| .warc.gz<br />
|}<br />
<br />
=== Pining for the Fjords (Dying) ===<br />
<br />
* [[TwitPic]] shuts down [http://blog.twitpic.com/2014/09/twitpic-is-shutting-down/ September 25, 2014].<br />
<br />
* 20 newspapers in Quebec will shut down in the coming weeks. Here's a list [http://pastebin.com/Xwt19JFQ] of those still up that need to be archived ASAP.<br />
<br />
* Heello [https://heello.com/heello/26633887 shuts down] August 15, 2014.<br />
<br />
* Rue Frontenac was a website created during a newspaper lockout in Canada back in 2009. It was saved [http://exruefrontenac.com/ here] , but I'm not sure if anybody is maintaining it. Copy ?<br />
<br />
* [http://www.thegridto.com The Grid] (magazine in Toronto) printed its last issue on July 3rd 2014 ([https://twitter.com/TheGridTO/status/484352888635129856 see here]); it is not clear how long the site will stay up.<br />
<br />
* Yahoo! is back to destroy more of the internet. [[Yahoo! Shine]] and [[Yahoo! Voices]] will shut down on July 31, 2014. The [[Yahoo! Contributor Network]] dies entirely at the end of August.<br />
<br />
* [[Svpply]] and Want by Svpply will shut down on August 31, 2014.<br />
<br />
* [[Shortmail]] will get shut down on July 31, 2014.<br />
<br />
* [[Snapdisk]] gets snapped on July 31, 2014.<br />
<br />
* [[Orkut]] will be shutting down on September 30, 2014.<br />
<br />
* [[Blip.tv]] will be removing accounts/videos on September 1st, 2014.<br />
<br />
* The Hungarian [http://iwiw.hu IWIW] (Facebook like social-network portal) will die on June 30, 2014.<br />
<br />
* The [[National Atlas]] [http://www.nationalatlas.gov/status.html will die] on September 30, 2014.<br />
<br />
* [[Epinions]] locking out its users on March 25, 2014.<br />
<br />
* [[Full Disclosure]] (http://seclists.org/fulldisclosure/) is a security/hacker mailing list that was [http://seclists.org/fulldisclosure/2014/Mar/332 suddenly suspended] as of March 19, 2014.<br />
<br />
* [[Nakido]] ([http://www.nakido.com/ site]) claims to be a "time capsule" that will "host your files for decades" - except it's a commercial enterprise selling premium accounts, and uses a proprietary P2P platform for delivery. What could possibly go wrong?<br />
<br />
* [[Gatsby]], not sure whether to file this here or under "Dead as a Doornail". [http://gatsby.im/ Frontpage] says that it's dead, but it's unclear whether hosted content is still available. Awaiting [https://vpsboard.com/topic/3475-gatsbyim-discontinued/#entry52574 response] as to what happened to the data.<br />
<br />
* LEGO has a bad habit of deleting Flash games and other materials from their sites. Some of them still lie in pieces on cache.lego.com, awaiting their deletion. Fortunately, some games are still available to play on [http://biomediaproject.com/bmp/games/ BioMediaProject] or [http://4t2portfolio.co.uk 4T2 Portfolio].<br />
<br />
* Nintendo shut down [http://www.fullscreenmario.com Full Screen Mario]. Its [https://github.com/Diogenesthecynic/FullScreenMario GitHub repository] should be archived in case it goes down.<br />
<br />
* [[Yahoo! China]] appears to be in the process of [http://wayback.archive.org/web/201429000000/http://www.bbc.co.uk/news/technology-23929002/ completely shutting itself down].<br />
<br />
* [[Yahoo!]] [http://www.yqlblog.net/blog/2013/11/11/y-ahoo-it-url-shortener-end-of-life-announcement/ retired] the y.ahoo.it [[URLTeam|URL shortener]] November 20th 2013 but the shortener is still active.<br />
<br />
* WordChamp was supposed to have shut down on June 30, 2013, later changed to September 15, 2013, but is still up and running.<br />
<br />
* [[TuneWiki]] (not a wiki)<br />
<br />
* [[Readmill]], a social e-reader thing, is closing its doors on July 1, 2014. They have quite a few user pages documenting who read what, who says what and what people think of books.<br />
<br />
* These sites are getting an update in the next few months:<br />
** [http://www.lincsfm.co.uk Lincs FM], [http://www.traxfm.co.uk Trax FM], Rutland Radio [www rutlandradio.co.uk - spam filter on here blocked this url], [http://www.dearnefm.co.uk Dearne FM], [http://www.rotherfm.co.uk Rother FM], [http://www.compassfm.co.uk Compass FM], [http://www.kcfm.co.uk KCFM 99.8], [http://www.ridingsfm.co.uk Ridings FM]. All are getting an update, so you might want to back these up; not sure what the best means are, but making a mirror of Lincs FM Group websites is good for historical reasons.<br />
<br />
* [http://pic2.piczo.com/go/home Piczo], a social network for teens, has announced that it's shutting down.<br />
<br />
* '''[http://1up.com 1up.com]''', [http://www.ugo.com/ ugo.com], and [http://www.gamespy.com/ gamespy.com], a collection of video game, news, and fan sites with lots of user-generated content, were purchased by Ziff Davis in February. CEO Vivek Shah announced on February 21st, 2013 that the company will be [http://www.polygon.com/2013/2/21/4014196/ign-layoffs-1up-ugo-and-gamespy-shutting-down "winding down 1UP.com, UGO and Gamespy"].<br />
<br />
* The '''Centralstation Community''' [http://community.thisiscentralstation.com/_Central-Station-v2-Q38As/blog/5449967/126249.html has closed]. The site is a UK-based social network for artists and creatives that provides hosting for content and portfolio. Users are being advised to back up their work as the new version of their platform will rely on existing media hosting sites like Flickr, Vimeo, and Soundcloud.<br />
<br />
* '''[http://www.groklaw.net/article.php?story=20130818120421175 Groklaw]''' will no longer be posting new articles, "due to government monitoring of the internet, particularly e-mail." Whether or not its archives will remain online is unclear, although it does seem rather unlikely it will 100% disappear. OTOH, better safe than sorry.<br />
<br />
* '''[[Webmonkey]]''' won't be posting new content anymore. It probably won't disappear overnight, but [http://longhandpixels.net/blog/2013/sep/20/whatever-happened-to-webmonkey/ "it wouldn’t hurt to create a backup."]<br />
<br />
=== Pre-emptive Alarmbells (Likely To Die) ===<br />
<br />
* Archive Team officially proclaims '''[[Yahoo!]]''' the least trustworthy host and its arch-enemy. Prove us wrong, Yahooligans. Or... don't. Expect anything in [http://en.wikipedia.org/wiki/List_of_mergers_and_acquisitions_by_Yahoo! this list] and [http://en.wikipedia.org/wiki/List_of_Yahoo!-owned_sites_and_services this list] to shut down (if it hasn't already).<br />
** Please follow the feeds! [https://twitter.com/YahooVictims] [http://www.google.com/alerts/feeds/03733117766037168292/11115209096644139952]<br />
<br />
* [[Google]] has [http://www.seopedia.org/internet-marketing-and-seo/googles-secret-andor-forgotten-places/ quite] [http://www.seopedia.org/seo-news/google-2/googles-56-forgotten-secret-pages-part-two/ a few] old pages on their servers which haven't been updated in a long time. Might be a good idea to save these before they disappear.<br />
<br />
* Like Google, Nintendo of Japan has its share of ancient pages, like [http://www.nintendo.co.jp/n02/dmg/mla/index.html this one].<br />
<br />
* '''[[cyberpunkreview.com]]''': 80s science fiction fansite and community {{url|1=http://cyberpunkreview.com/}} hasn't seen much staff activity in a long time, although the forums are going strong. UPDATE: Looking active again. [[User:Aggroskater|Aggroskater]] 08:26, 19 March 2012 (EDT)<br />
<br />
* '''[[WikiLeaks]]''' ({{url|1=http://wikileaks.org/}}) has an uncertain financial situation, and the site was inaccessible for some time in 2010.<br />
<br />
* '''[[FriendFeed]]''' ({{url|1=http://friendfeed.com/}}) has been purchased by [[Facebook]], leaving FriendFeed users uncertain as to its future and mostly unsupported. The Twitter bridge, for instance, has not worked for years now.<br />
<br />
* '''[[The Pirate Bay]]''' ({{url|1=http://www.thepiratebay.org/}}) still having persistent legal problems. The tracker went down in November, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB on '''THIS VERY PAGE''', this is pretty dang important. Thankfully, the magnet links and entire siterips have now been made, though keeping them updated is sure to be a pain.<br />
<br />
* '''[http://www.ning.com/ Ning]''' laid off 40% of its staff in 2010 and seems to be running out of money [http://techcrunch.com/2010/04/15/nings-bubble-bursts-no-more-free-networks-cuts-40-of-staff/]. There are certainly some networks worth archiving among the 2 million networks[http://blog.ning.com/2010/01/2-million-ning-networks.html] they host. Grouply[http://blog.grouply.com/grouply-welcomes-ning-networks/] and Posterous[http://blog.posterous.com/posterous-commits-to-building-a-ning-blog-imp] say they are going to offer migration tools.<br />
<br />
* '''[http://debates.oireachtas.ie/ debates.oireachtas.ie]''': on September 18th, 2012 the Houses of the Oireachtas website [http://www.kildarestreet.com/statement2012/ announced] that it would no longer be updating its XML data for Irish parliamentary debates (1919-2012). Access to pre-existing data is still available, but is likely to disappear if the current trend continues. It would be useful to at least capture the XML data that is there, while it is still available.<br />
<br />
* As of 2014, ScraperWiki Classic is now read-only. But don’t worry! You can transfer this scraper to Morph.io if you want to continue editing it.<br />
<br />
=== Other endangered species and misc ideas ===<br />
<br />
We have even more small tidbits of information at [[Deathwatch/Misc]].<br />
<br />
=== Just When You Least Expect It ===<br />
<br />
Archive Team keeps a list of [[Fire_Drill|healthy sites]] that could be fine today and not so hot tomorrow. We focus on ways to back your personal data off these sites so you don't put yourself at unnecessary risk.<br />
<br />
== Dead as a Doornail ==<br />
<br />
=== Because we know better ===<br />
* [[Fileplanet]] [http://www.fileplanet.com/]. Already fully archived.<br />
<br />
===2014===<br />
* August 10: '''[[Fotopedia]]''' leaves a photo finish.<br />
* August 5: '''[[Justin.tv]]''' shuts down completely.<br />
* August 1: '''[[Yahoo! Voices]]''', formerly Associated Content, is shut up by [[Yahoo!]].<br />
* July : Potential massive Quebec newspaper shutdown around August 2, 74 newspapers were bought by [http://blog.fagstein.com/2014/05/28/competition-bureau-quebecor-tc-newspapers/ Transcontinental].<br />
* July 31: Pinterest [http://www.iol.co.za/scitech/technology/business/pinterest-buys-startup-icebergs-1.1728612 acquires] Icebergs.com<br />
* June 30: Hungarian [[iWiW]] social network closes; data not available from this date at all.<br />
* June 15: [[Rawporter]] enters "into an exclusive business partnership", deletes user photos and videos (which we rescue.)<br />
* June 1: [[Ubuntu One]] shuts down, gives its users until July 31 to grab their data.<br />
* May 20: Nintendo shut down [[Nintendo Wi-Fi Connection]] (except for the Wii and DSi Shop Channels).<br />
* May 6: [[Userscripts.org]] mysteriously vanished. [http://userscripts-mirror.org A mirror] popped up not long after.<br />
* April: [http://jderef.com/ JDEREF.com] is served a takedown notice by Oracle.<br />
* April: [[Vizify]].com has been acquired by [[Yahoo!]]. Bios to be deleted on April 7, 2014. (Users can opt in to extend the date to September 4, 2014.)<br />
* April 30: [[qik]].com is shutting down on 30 April, 2014.<br />
* April 18: [[Twitter Music]] shuts down.<br />
* April 15: Beats shuts down [[MOG]].<br />
* March 31: [[IntoNow]], a [[Yahoo!]] acquisition, ceases to function.<br />
* March 31: [[Mochi Media]] realizes Flash is dead and the game is over.<br />
* March 17: [[doo]] shut down.<br />
* March 11: [[Intel AppUp]] is shutting down.<br />
* March 3: [[My Opera]] closes its member profiles.<br />
* February: [[Videogum]] is [http://www.videogum.com/800151/hey-guys-we-have-to-talk-to-you-about-something/letter-from-the-editor/ shutting down].<br />
* February 28: [[Outbox]] shuts down to [http://blog.outboxmail.com/post/74086768959/outbox-is-shutting-down-a-note-of-gratitude rebuild itself].<br />
* February 21: [[Yahoo!]] crashed [[Cloud Party]].<br />
* February 7: Schemer.com shut down by Google. (Time of death: 2014-02-08 00:13:52,184 EST.)<br />
* January 21: DrawQuest and [[Canv.as|Canvas]] shut down. moot writes his [http://chrishateswriting.com/post/74083032842/today-my-startup-failed shutdown notice.]<br />
* ??? '''[[dl.tv]]''' [http://dl.tv] There has been no new tech podcast here for over a year. It would be a good idea to start backing up all podcasts on this site. The same goes for Crankygeeks. [http://www.crankygeeks.com/]<br />
<br />
===2013===<br />
<br />
* December 26: [[Wretch]] and [[Yahoo! Blog]] are closed by Yahoo!.<br />
* December 21: [https://web.archive.org/web/20131222233044/http://clanbase.org/ ClanBase] is no more. The company that bought the website in 2004, Global Gaming League, decided to "move on" after basically running the website into the ground.<br />
* December 20: '''[[WinAmp]]''', home of the Winamp media player, shuts down.<br />
* December 18: [[Warhammer|Warhammer Online: Age of Reckoning]] closes.<br />
* December 15: [[Everpix]], a photo-sharing service, shuts down.[http://www.everpix.com/] [http://www.theverge.com/2013/11/5/5039216/everpix-life-and-death-inside-the-worlds-best-photo-startup] [https://github.com/everpix/Everpix-Intelligence], rest lost<br />
* December 12: [[Hyves]] closes its social network, but it's now got games!<br />
* November 11: [[Bre.ad]] is dead.<br />
* November 7: [[Dopplr]] drops out from the web.<br />
* November 1: [[Zapd]] deletes its user data from the website.<br />
* November 1: [[iGoogle]] shuts down.<br />
* November: [[Bitmit]], a Bitcoin marketplace, shut down.<br />
* November: Going to call this one before it even starts, friends: '''[https://www.legacylocker.com/ Legacy Locker]''' promises lifetime control of your data and return of your data to loved ones for just $300 for "lifetime", or $30/year. [http://www.washingtonpost.com/wp-dyn/content/article/2009/03/10/AR2009031001211.html] Archive Team says to just say [https://web.archive.org/web/20131121055401/http://legacylocker.com/ No].<br />
* October 21: [[isoHunt]] was always going to shut down after an MPAA settlement. However, it did so earlier than expected to prevent archival efforts, claiming that 95% of torrents were available elsewhere. No mention of the metadata though.<br />
* September 30: [[OMGPOP]] shut down, and now redirects to Zynga's main site. There was a [https://www.facebook.com/SaveOMGPOP petition] to stop it from closing, which did not gain much traction.<br />
* September 30: [[MSN TV]], aka WebTV, no longer accessible.<br />
* September 1st: [https://torrentfreak.com/major-tv-torrent-site-thebox-bz-calls-it-quits-130829/ Thebox.bz], a TV torrents tracker/site.<br />
* September: [[Freeblog.hu]] closes without notifying its users. Unknown number of blogs lost.<br />
* August-September: [http://fileden.com/ FileDen.com], a file hosting website, suddenly shuts down, giving their users little to no warning.<br />
* August 31: [[Rockmelt]] shuts down after being acquired by [[Yahoo!]].<br />
* August 21: [[Amplicate]] vanishes, leaves behind 502 Bad Gateway errors.<br />
* August 20: [[Catch]] closes its doors.<br />
* August 19: [http://wowcoolfactsaboutgaming.com Wow! Cool Facts About Gaming] shuts down, thankfully leaves everything up.<br />
* August 9: [[Google Latitude]] shut down.<br />
* August 5: [[Astrid]] is shut down after being acquired by [[Yahoo!]].<br />
* July 31: All third party downloads disappear from [[Yahoo! Downloads]].<br />
* July 25: [[Yahoo! Stars India]] is shut down.<br />
* July 24: [[Snapjoy]], acquired by Dropbox in December 2012, is shut down.<br />
* July 19: [[Google]] shuts down Alfred.<br />
* July 9: [[Yahoo! Neighbors]] shuts down a day after it was supposed to.<br />
* July 8: AltaVista, one of the oldest search engines, shuts down.<br />
* July 1: FoxyTunes and Yahoo! RSS Alerts disappear from the web.<br />
* June 30: [[Yahoo!]] demolishes Yahoo! WebPlayer.<br />
* June 28: [[Yahoo!]] shuts down Axis, Browser Plus and Citizen Sports.<br />
* June 28: Nintendo shuts down all of its WiiConnect24 services, except for the Mii Channel, Wii Shop Channel, Mario Kart Wii Channel, and the Wii Speak Channel.<br />
* June 4: '''Adrenaline Vault''', a video game review site, has this posted on their Facebook profile: "Over the past weekend hackers hit the site with a DoS attack. Everything had been wiped and with no backups, everything was lost. It has been decided that Avault will remain closed. Rest in Peace, Avault."<br />
* June: '''[http://ompldr.org/ Omploader]''', an anonymous file upload site, has announced that they are about $2500 in the hole on hosting costs, and that there is possibility of their shutting down if donations do not improve. It stands to reason that there are some files among their database that are worth saving. An attempt to contact the administrator for more information and to be given a dump of the site was made, and he responded saying he'd be happy to rsync a copy of the data after some legal issues have been settled.<br />
* April 30th '''[[Posterous]]''', a blogging and life streaming platform, shut down its "Posterous Spaces" to focus on Twitter.<br />
* April 30th '''Circalit''' decides in March that [http://circalit.createsend4.com/t/ViewEmail/r/26E73577D4220DBD/4433C195741969884AB3169DA1FD82E9 deleting is easier than migrating] its prose-writing users.<br />
* April 20: '''Microsoft Collection Book''', a site dedicated to collecting information about Windows betas, shuts down due to a C&D from Microsoft. It reopened on May 5 as '''The Collection Book'''.<br />
* March 31st shutdown. '''Zug.com''', a comedy website running since 1995, closed down and replaced all its pages with a goodbye image.<br />
* March 29th shutdown. [http://wrathofheroes.warhammeronline.com/ Play 4 Free Warhammer Online: Wrath of Heroes (WOH)]<br />
* March 24th shutdown. '''[http://hub.opensolaris.org/bin/view/Main/ The OpenSolaris Hub]''' and '''all sites under opensolaris.org''', including the site hosting the OpenSolaris source code, are being decommissioned by Oracle. OpenSolaris is an open source computer operating system based on Solaris and originally created by Sun Microsystems. After the acquisition of Sun Microsystems in 2010, Oracle decided to discontinue open development of the core software, and replaced the OpenSolaris distribution model with the proprietary Solaris Express.<br />
* February 28th '''[http://www.stickam.com Stickam]''', a major video chat service, shut down. Users were emailed and given the ability to download any recorded videos for 3 weeks in advance of the closing date.<br />
* February: [[Regretsy]] shuts down.<br />
* January 31: [[Do.com]] shuts down.<br />
* January: <nowiki>http://</nowiki>go.to, a [[URLTeam|URL shortener]], has all of its domains on sale on Sedo. No official word just yet, though.<br />
<br />
===2012===<br />
* The Polish social network '''[https://en.wikipedia.org/wiki/Grono.net Grono.net]''' has disappeared, replaced by a file hosting service '''grono.net.pl''' on July 1, 2012. Most content from the old site was supposed to be migrated, but, according to a message on the main page, technical difficulties have delayed the migration by one or two weeks. It's getting increasingly late...<br />
* '''Ponibooru''', a famous My Little Pony-related imageboard, [http://www.equestriadaily.com/2012/06/ponibooru-shuts-down.html shut down] by August 17. All of the images themselves (but not the comments) were available to download via torrents, though it is unknown if the torrents are still available. Currently the most popular/upvoted images are available via another imageboard, Derpibooru, but their copy is incomplete.<br />
* [[Parodius Networking]], which hosts numerous web sites related to classic video game platforms, died in August 2012.<br />
* '''[http://kasabi.com Kasabi]''', a data publishing platform created by [http://talis.com Talis] was [http://blog.kasabi.com/2012/07/09/shutting-down-kasabi/ announced] to be closing on July 30, 2012. While the service has only been around for ~2 years it represents a unique look at services for Linked Data, and contains a variety of datasets. Kasabi has a [http://blog.kasabi.com/2012/07/16/archive-of-datasets/ blog post] that announces the availability of datasets contained in Kasabi to ease archiving.<br />
* '''[http://gamecorner.pl Gamecorner.pl]''', a Polish video game news portal, was closed in May, and later wiped entirely on October 29. The articles have been retained at the publisher's other video game portal, Polygamia.pl, but the article comments and the forums are gone (It also had user blogs, but they seemed to have been erased much earlier.)<br />
* "[[convore.com]]" [http://blog.convore.com/post/17951919109/convore-shutting-down-april-1st shut down in April 2012]. The site hosted IRC conversations, and involved a lot of JavaScript.<br />
* Google Wave shut down on April 30th.<br />
* The popular file hosting service '''Megaupload''' was shut down in January 2012; with it, '''Megavideo''' is gone too. It was mainly used for copyright infringement, but lots of perfectly regular files were hosted on it.<br />
* '''Apple''' '''[[MobileMe]]''', '''[[MobileMe#iDisk | iDisk]]''', '''[[MobileMe#web.me.com / iWeb | iWeb]]''', and included services. This major website and these services will shut down in [http://support.apple.com/kb/HT4597 2012], simply because web hosting is boring and they want to focus on the exciting "iCloud".[http://www.apple.com/mobileme/transition.html][http://support.apple.com/kb/HT4597]<br />
* Hungarian free hosting provider [http://eplanet.hu Eplanet] stopped its free service as of February 2012; an unknown number of pages disappeared and were probably deleted.<br />
<br />
===2011===<br />
<br />
* The [[Insurgency Wiki]] is a wiki with a community that created multiple guides and raids for Anonymous, in a similar manner to [[Encyclopedia Dramatica]]. Its status has always been unclear, with many mirrors coming and going. But as of Feb. 22, 2012, the last mirror, Partyvan.info, looks like it has some damning database error. Just in case, the Bibliotheca Anonoma has made a full backup, including all available images.<br />
<br />
* The closure of '''Google Buzz''' was announced in October. Luckily Google released a tool to download your content from it called [[Google Takeout]]: https://www.google.com/takeout/ Besides Buzz, Google shut down many of its other minor services, such as Aardvark, Sidewiki, and others: http://googleblog.blogspot.com/2011/09/fall-spring-clean.html<br />
* '''[[Google Labs]]''' (http://labs.google.com) closed (sometime in August/September?) http://www.pcmag.com/article2/0,2817,2388881,00.asp#fbid=7kZ39-1XQUH and many great, experimental, one-of-a-kind tools vanished. Among many others were "Google Sets", which had been around for a long time; "City Tours"; and the exciting "Google Squared" http://4.bp.blogspot.com/_ZaGO7GjCqAI/SibWbewOy5I/AAAAAAAAQBM/8lb7UA6AWPY/s640/google-squared-species.png, an attempt to pass more artificial intelligence to the user than conventional search engines do (comparable to Wolfram Alpha, but seemingly based on a vast pool of data, like standard Google search results). Some of these tools included user-generated content. Since there is hardly any obvious rationale(?) for closing down Google Labs, the closure pictures Google as either less supportive of, or even hostile to, new inventions, less responsible with user-generated content, or more secretive about its ongoing projects than one might have thought, or than Google used to be. 1 Jan 2011 [[User:Whatsgoingonwithgoogle|Whatsgoingonwithgoogle]] 18:06, 17 October 2011 (UTC)<br />
<br />
* The wiki hosting site '''wik.is''', hosted by MindTouch, shut down in the first week of January 2011; the explanation being that "in order to continue to support the growing needs of our MindTouch Express users, we are offering MindTouch Cloud", which "opens up additional features and functionality that are not available in Wik.is.". The only way you'd know all that is if you received a warning e-mail from MindTouch. They offer to keep your site running by "upgrading to our paid Cloud version by [http://campaigns.mindtouch.com/Wik.isDecomissioningMigrationInterest.html filling out this short form.]"<br />
<br />
* '''[[ProHosting]]''' (http://free.prohosting.com) closed hosted sites on 1 Jan 2011<br />
<br />
* '''The Sims Carnival''': January 17th, 2011[http://www.simscarnival.com/games/CarnivalMonkey/35068/The-Sims-Carnival-Says-Goodbye]<br />
<br />
* '''[http://team.gaia.com/blog/2010/3/important-gaia-announcement Gaia Community]''' shut down at the end of March. <br />
<br />
* '''[http://ghost.cc/ Ghost Cloud Computing]''' became a ghost of itself[http://ghost.cc/home/SignUp.jsp].<br />
<br />
* '''Microsoft''' closed '''Windows Live Spaces''' on March 16, 2011. Spaces owners had the option to migrate their blogs to '''WordPress''' or to make copies. As of January 4, 2011, they could no longer edit their existing Spaces.[http://explore.live.com/windows-live-spaces-help-center]<br />
<br />
* '''[[Yahoo! Video]]''' shut down on March 31st, 2011 and was reborn as a video portal.<br />
<br />
* '''[[Encyclopedia Dramatica]]''' shut down on April 16, 2011 without warning. Reconstruction efforts are ongoing. A lot of images and articles are probably lost. (The replacement, OhInternet, is a very strongly sanitized version of ED.) <s>ED is claiming that they are in danger of shutting down. Despite the controversial nature of many articles hosted on the wiki, this would be a big loss of historical records.</s><br /><font color="red">A lot of the images and pages are still missing. Help appreciated.</font><br />
<br />
* [[Yahoo]]! has {{url|1=http://techcrunch.com/2011/02/24/yahoo-to-shut-down-mybloglog-on-may-24/|2=announced}} that '''[[MyBlogLog]]''' will be closed on 24 May 2011. '''UPDATE:''' Yup.<br />
<br />
* '''[[Prodigy Pages]]''' shut down on June 1, 2011.<br />
<br />
* '''[[Forums.starwars.com]]''': {{url|1=http://www.starwars.com|2=StarWars.Com}} {{url|1=http://forums.starwars.com/ann.jspa?annID=3|2=announced}} the closure of their {{url|1=http://forums.starwars.com|2=forums}} on June 3, 2011. (Forum will lock on 29 April 2011) {{url|1=http://theforce.net/latestnews/story/StarWarscom_Forums_Shutting_Down_In_June_137497.asp|2=tf.n report}}<br />
<br />
===2010===<br />
* '''[http://machinima.com Machinima.com]''' was reworked in December 2010, and by "reworked" we mean massacred. Most notably, the forums were deleted, as well as tons of older articles. <br />
* The '''[http://www.symbian.org/ Symbian Foundation]''' will shut down its websites, Twitter account, Facebook page, bug trackers and remove access to its source code on 17 Dec 2010[http://www.engadget.com/2010/11/27/symbian-foundation-axing-websites-on-december-17th-source-repos/][http://developer.symbian.org/wiki/Symbian_Foundation_web_sites_to_shut_down].<br />
* '''[http://itdied.com/ It Died]''' by Glenn Fleishman, a site dedicated to indicating sites that have died, itself died. (Keep the [http://itdied.com/atom.xml RSS Feed] around in case that changes, though).<br />
* '''isweb lite''', the Japanese Geocities, shut down on October 31. Thousands of personal homepages of artists and illustrators were deleted forever. A tiny sample of the pages deleted: [http://togetter.com/li/64058] '''isweb''' itself (paid hosting!) will shut down in May 2012. [http://portal.faq.rakuten.co.jp/app/answers/detail/a_id/15387/]<br />
* '''[http://closing.vox.com/ Vox]''' shut down at the end of September 2010.<br />
* '''[http://storytlr.com/ Storytlr]''', a lifestreaming site, stopped hosting March 1st 2010.<br />
* '''[http://platinum.ac Platinum]''', once a popular Finnish web site associated with electronic dance music, clubbing/raving, and other related things, was closed in March after running for years. All the content posted to the forums of the site was, however, obtained and made available by [http://klubitus.org Klubitus], another related portal popular in Finland.<br />
* '''[http://www.kidradd.com Kid Radd]''' was a notable and quite popular webcomic which vanished when AT&T discontinued their Worldnet service. Thankfully, an archive is available, e.g. [http://tangent128.name/depot/kid_radd.zip here].<br />
* '''[http://www.brightfuse.net BrightFuse]''' was a small social network started as a side venture by CareerBuilder.com in August 2009. It was quietly shut down in November 2010 without much fanfare. At its height it had 100k users.<br />
* '''[http://extra.hu Extra.hu]''', the largest Hungarian free hosting provider, goes paid-only; deletes unknown number of free sites on 31 March 2010.<br />
<br />
===2009===<br />
<br />
* Google acquired '''[http://etherpad.com Etherpad]''' on 4th December, 2009 and immediately [http://etherpad.com/ep/blog/posts/google-acquires-appjet announced] a March 2010 content deletion date. After community pressure, Google decided to [http://etherpad.com/ep/blog/posts/etherpad-back-online-until-open-sourced open source the Etherpad codebase], keeping the service alive until then. The site closed down shortly after. Fortunately there are now [http://www.google.com/search?q=etherpad+alternatives numerous] [http://www.google.com/search?q=etherpad+clone alternatives].<br />
<br />
* '''favrd''', a website that aggregated favorite tweets from Twitter, abruptly shut down on '''December 6, 2009''' with absolutely no warning, killing off thousands of highlighted entries added by group-consensus over significant months. As a reward for their efforts, founder Dean Allen wrote this helpful message: ''"Alas, stars on Twitter have become mere take-out menus hung on the doors of other restaurants. There are still lots of clever and funny things to read every day, but finding these is no longer a challenge – you already follow your sources. Sites like this one now serve mainly as fuel for emotional up-fuckedness in the guise of a game. Just an idea: next time you see something you like, write the person who made it a note telling them so. Even better, explain why. Take care!"'' Advice to people who want to work with Dean Allen's projects in the future: don't.<br />
* '''here.is''' seems to be permanently offline. It stopped redirecting email some time ago, and as of 11-23-09 it no longer redirects even URLs.<br />
[[Image:Encarta.jpg|right|300px|Discontinuedpedia]] <br />
* '''Microsoft Encarta''', the online encyclopedia with a 15+ year history, is being shut down. The US version will shut down on October 31, 2009 and the Japanese version on December 31, 2009. [http://www.reuters.com/article/CMPTRS/idUSLV28230720090331]<br />
* '''[[GeoCities]]''': Shock! Repeat Offender '''[[Yahoo]]''' announced that it would close GeoCities "later this year...We'll send you more details this summer." [http://help.yahoo.com/l/us/yahoo/geocities/geocities-05.html]. The plug was pulled on October 26th 2009. See the [[Geocities]] project page for more details.<br />
* '''Microsoft's SoapBox''' has announced it is getting off said soapbox on August 31, 2009. [http://arstechnica.com/microsoft/news/2009/07/soapbox-microsofts-youtube-dies-on-august-31-2009.ars]. <br />
* '''ArchNacho's & TortillaGodzilla's Quality ROMs''', a site that hosted ROMs for NES, SNES, and Genesis games, which announced its effective death back in January of 2006, is now finally completely inaccessible, both on its original domain (http://www.qualityroms.com), and on the site that the domain masked (http://home.no.net/qualrom/). Archive.org has [http://web.archive.org/web/*/http://qualityroms.com mirrors] of the site up through August 30, 2007, which is after all updates to the site ceased. All ROMs hosted on QualityRoms are included in the mirror and can be downloaded from there.<br />
* '''Microsoft's Popfly''' [http://popflyteam.spaces.live.com/blog/cns!51018025071FD37F!336.entry] pops off into nowhere on August 24, 2009.<br />
* '''Yahoo! 360''' announces [http://blog.360.yahoo.com/blog-1qCkw2Ehaak.hdNZkEAzDrpa4Q--?cq=1] that they are closing up shop on July 13, 2009. Of course, you can still register an account but that's the first thing you're told.<br />
* '''Imeem''', a site for sharing music and convincing yourself that what you're hearing is good, [http://blog.imeem.com/2009/06/25/simplifying-imeem/ announced] on June 25, 2009 that they were "simplifying" things and deleting all photos and videos uploaded by users. They gave everyone '''five days''' to get their photos off, and then extended it to ''twenty days'' after the ensuing hue and cry. There was no way to extract the uploaded videos.<br />
* '''Rejaw''', a microblogging platform, has announced that it will be shutting down on May 31 2009 [http://rejaw.com/rejaw/shout/OOfs2wUaLql]. It's gone.<br />
* '''[http://www.jumpcut.com Jumpcut.com]''' became the latest example of Yahoo!'s awesome respect for history and data, announcing the closure of the video hosting and editing site, for June 15, 2009. A software utility has been released to allow you to download the movies from Jumpcut. Otherwise, you are not in great shape - Yahoo says you can move your videos to Flickr, but Flickr cuts off at 90 seconds. A lot of homemade video is going to disappear.<br />
* '''MSN QnA Beta''' closed on May 21 [http://liveqna.spaces.live.com/blog/cns!2933A3E375F68349!2244.entry]<br />
* '''[http://www.coghead.com Coghead]''', "a web-based service for building and hosting custom online database applications and a software as a platform 'utility computing' company", announced it had closed up on February 20, 2009, and that the site would go down permanently on April 20, 2009. [http://blogs.zdnet.com/collaboration/?p=349]. It did.<br />
* '''[http://furl.net/ Furl]''' was a social bookmarking service that had been around since 2004. It was acquired by [http://diigo.com/ Diigo] (announced on March 9), allowed people to opt into transferring their bookmarks to Diigo, and shut down on April 17. [http://blog.diigo.com/2009/03/16/welcome-furl-users/ Diigo blog post]; [http://www.techcrunch.com/2009/03/09/diigo-buys-web-page-clipping-service-furl-away-from-looksmart/ Techcrunch post].<br />
* '''[http://www.spiralfrog.com Spiralfrog]''', "a FREE service that lets you download over 3 million songs and videos, legally and safely", pulled up stakes in the night and completely shut down on March 20, 2009. [http://arstechnica.com/web/news/2009/03/ad-based-music-service-spiralfrog-croaks.ars] Things looked so promising in 2006: [http://arstechnica.com/old/content/2006/08/7611.ars] Oh, and sadly, all your music you downloaded from them will stop working within 30 days or less. [http://arstechnica.com/old/content/2007/09/spiralfrog-debuts-with-free-ad-supported-music-downloads.ars]<br />
[[Image:HP upline goes offline.jpg|right|300px|Did we say upline? We meant offline.]]<br />
* It doesn't get more ironic than this: '''[https://www.upline.com/ Upline]''', a HP-owned online backup service, is being shut down.[http://news.cnet.com/8301-17939_109-10173136-2.html?part=rss&subj=news&tag=2547-1_3-0-5] ''They almost immediately turned off the backup process,'' and then announced all your restorable data would go offline on March 31, roughly 30 days after announcement. Surprise!<br />
* '''[[Yahoo_Briefcase|Yahoo Briefcase]]''', a positively ancient site run by Yahoo that provided you with 25 free megabytes of storage space for your junk, sent a mail to what were likely years-old contact addresses to tell them they had a little more than a month to get their files out, March 30, 2009. After that, the files would be deleted. What, Yahoo doesn't have a spare memory stick to store what must be the amount of files in this service for the next year?<br />
* '''Yahoo! Farechase''', an airline fare aggregation and searching site, was shut down on March 25, 2009. It had previously been its own company, founded in 1999, and purchased by Yahoo! in 2004. [http://news.cnet.com/Yahoo-buys-travel-company/2100-1032_3-5300561.html]<br />
* '''[http://seattlepi.nwsource.com/ The Seattle Post-Intelligencer]''' was [http://seattlepi.nwsource.com/business/395463_newspapersale10.html put up for sale], but found no buyer, and the print edition stopped on March 17th 2009 after 146 years. [http://www.thenewstribune.com/news/columnists/zeeck/story/591181.html] Initially, reports indicated it would shut down the website as well as the paper, but a plan was apparently in place to run a "skeleton crew" on an internet-only site, which continues to operate.<br />
* '''[http://www.videosift.com Videosift]''' had a combination database and backup failure, losing: "All votes, ever. All member usernames who registered later than around 12 months ago. All member rankings. Your member profile info (e.g., bio, favorite sift, etc.), if any. All activity that happened on the site yesterday, March 11." This is unlikely to kill the site, but an awful lot of data was lost.<br />
* '''[http://www.scoopt.com/ Scoopt]''', a "citizen journalism" site run by Getty Images to allow the uploading of images by citizen journalists and the chance to be licensed to news organizations, announced they would no longer take any new imagery after February 6, 2009, and will shut down completely on March 6, 2009. Some content uploaders "may" be contacted about being absorbed into the main Getty site.<br />
[[Image:20090227.jpg|right|300px]]<br />
* '''The [http://www.rockymountainnews.com/ Rocky Mountain News]''' has shut down as of February 27, 2009. [http://www.rockymountainnews.com/news/2009/feb/26/rocky-mountain-news-closes-friday-final-edition/] We're watching to see what happens with the website (and the material, and the newspaper itself). With a 150 year history, there's a lot of backstory, and how this chronicler of history will end up, so too will many others. There is an excellent documentary about the last days of the Rocky Mountain News [http://www.vimeo.com/3390739 here].<br />
*'''Electronic Gaming Monthly''' has recently shut its doors. [http://multiplayerblog.mtv.com/2009/01/06/egm-closed-ziff-lays-off-30/]<br />
*'''[http://culture11.com/home Culture11]''' ran out of money.[http://www.patrolmag.com/scanner/1263/culture11-is-over]<br />
* '''[[Lycos Europe]]''' shut down their '''Tripod''' hosting service on February 28, 2009. [http://www.washingtonpost.com/wp-dyn/content/article/2009/01/18/AR2009011800224.html] [http://www.paidcontent.co.uk/entry/419-lycos-europe-killing-tripod-customers-warned-to-back-up/] Note that Lycos Europe are distinct from Lycos.com. '''[[Lycos Europe]]''' is also shuttering the social networking site '''Jubii''' as of February 15, 2009. [http://www.techcrunch.com/2009/01/18/lycos-kills-jubii-while-theyre-at-it/] A Danish version of the site will remain open for the time being.<br />
* '''Windows Live''' shut down the '''MSN Groups''' on February 23. They extended their original date from February 21st to give Group owners the weekend to prepare. [http://windowslivewire.spaces.live.com/Blog/cns!2F7EB29B42641D59!34861.entry?sa=503427140]<br />
* '''[http://ma.gnolia.com/ ma.gnolia.com]''' had a catastrophic disk corruption/failure on January 31, 2009. From the message on the main site: ''"As I evaluate recovery options, I can't provide a certain timeline or prognosis as to to when or to what degree Ma.gnolia or your bookmarks will return; only that this process will take days, not hours."'' Ma.gnolia had an excellent export feature... hope you used it and did the backups they didn't!<br />
* '''[http://dominomag.com/ Domino Magazine]''', a style/interior design magazine, announced that they were shutting down on January 28, 2009. [http://mydecofile.dominomag.com/ My Deco File], one of the site's heavily used social bookmarking features (somewhat like delicious for images) will remain up for a few weeks to allow users to save their stuff.<br />
* '''Yahoo Pets''' was shut down and redirected with absolutely no notice around January 27, 2009. [http://blog.dogster.com/2009/01/28/yahoo-quietly-shutters-yahoo-pets-grin/]<br />
* '''[[totse]].com''' [http://www.totse.com/ closed its doors] on January 17, 2009. As of Jan 20th, a mirror [http://totse.danladds.com/ exists], alongside a [http://totse.danladds.com/text/ repository of the totse text files].<br />
* '''[[Ficlets]].com''' (owned by AOL) has announced they are closing on January 15, 2009. [http://www.peopleconnectionblog.com/2008/12/02/ficlets-will-be-shut-down-permanently/]<br />
* '''[[Circavie]].com''' (owned by AOL) has announced they are closing on January 15, 2009. [http://www.peopleconnectionblog.com/2008/12/03/circavie-will-be-shut-down-permanently/]<br />
* '''Several Google services''' have shut down. [http://www.readwriteweb.com/archives/google_giveth_and_it_taketh_away.php] Most importantly, Google Video stopped accepting new uploads (to avoid competition with Google-owned YouTube), and Google Catalog Search was erased.<br />
* '''[[Co.mments]].com''' closed down on January 11, 2009.<br />
* '''[[AOL_Pictures|AOL Pictures]]''' said so long on January 9, 2009. To their credit, you can still yank your stuff into other photo services until June of 2009. (At least, according to their goodbye letter.)<br />
<br />
===2008===<br />
<br />
* [http://blogs.zdnet.com/BTL/?p=11227 Overview of 2008 Technology News]<br />
<br />
''Biggest Botched Shutdowns of 2008''<br />
* '''[http://www.peopleconnectionblog.com/2008/11/06/hometown-has-been-shutdown AOL Hometown]''' (owned by AOL) was officially killed on October 31, 2008. [http://ascii.textfiles.com/archives/1617 Jason wrote about it.]<br />
[[Image:Stayclassyaol.png|thumb|right|470px|The full extent of warning AOL gave about shutting down Hometown.]]<br />
* '''Digitalrailroad.net''', a photo hosting site, gave their users a 24-hour eviction notice on October 27, 2008. They shut down 10 hours after the 24-hour notice. [http://news.cnet.com/8301-17939_109-10078042-2.html]<br />
<br />
''Other deaths of 2008''<br />
<br />
* '''[http://www.lively.com/goodbye.html Lively]''', a 3D Avatar space experiment, was killed in a really crappy way by Google on December 31, 2008.<br />
* '''[http://pingmag.jp/ Pingmag]''', the magazine from Tokyo about "Designing and Making things," simultaneously rang in the new year and checked out of existence on December 31, 2008.<br />
* '''[http://blog.mixwit.com/ Mixwit]''' said goodbye on December 27, 2008. [http://news.cnet.com/8301-17939_109-10126057-2.html]<br />
* '''[http://www.castlecops.com/ Castle Cops]''' put away their badges on December 23, 2008. [http://www.idf50.co.uk/clubhouse/computer-room/15996-castle-cops-closed-down.html]<br />
* '''[[Google Research Datasets]]''', shut down on December 19(?), 2008. [http://blog.wired.com/wiredscience/2008/12/googlescienceda.html]<br />
[[Image:Final image 01.png|400px|right|thumb|The last person at Yahoo! Kickstart turning off the lights.]]<br />
* '''Yahoo! Kickstart''', a social network for college students revealed in 2007 [http://mashable.com/2007/08/30/yahoo-kickstart/] got expelled on about December 18, 2008. [http://www.techpluto.com/yahoo-kickstart-shutdown/]<br />
* '''Flip.com''', a social network for teenage girls, shut down on December 16, 2008. Users were advised to print out their digital scrapbooks as backups. [http://news.cnet.com/8301-1023_3-10112021-93.html]<br />
* '''[http://pownce.com/ Pownce]''' was closed on December 15, 2008.<br />
* '''[http://getsatisfaction.com/iwantsandy/topics/a_fork_in_the_road_an_important_announcement_about_i_want_sandy I Want Sandy]''' [http://www.webcitation.org/5eFA58kqN (WEBCITE)] was shut down on December 8, 2008. A lot of people complained about this one, while others thanked the site for shutting down and wished the founder well! <br />
* '''[http://live.yahoo.com/ Yahoo Live!]''' died on December 3, 2008. [http://news.cnet.com/8301-13515_3-10081486-26.html]<br />
* '''[http://ourworld.cs.com/sfrederick2/index.htm?f=fs Compuserve OurWorld]''' slipped into history on October 31, 2008.<br />
* '''[http://blogrush.com BlogRush.com]''' failed to provide bloggers with the traffic they so desperately desired, and the creator admitted on October 29, 2008 that his 4AM idea may not have been so brilliant. [http://mashable.com/2008/10/29/blogrush-shutdown/]<br />
* '''[http://wallop.com/ Wallop]''', Microsoft's attempt at starting a social network, died on September 18, 2008. All that remains is a few Facebook apps. [http://news.cnet.com/8301-13577_3-10041856-36.html] [http://www.techcrunch.com/2008/09/15/wallop-takes-a-leap-into-the-deadpool/]<br />
* '''Yahoo! Mash''', a social networking site, became mush on September 28, 2008, after 30 days warning. [http://mashable.com/2008/08/28/yahoo-mash-has-been-quashed/] <br />
* '''ScribbleWiki''' wikis go offline.<br />
* '''Virtual Magic Kingdom''' [http://www.intercot.com/discussion/showthread.php?t=130548 closed its gates] on May 21, 2008. [http://www.virtualworldsnews.com/2008/04/disneys-virtual.html] The amount of broken hearts and anguish over this move was amazing, and a warning sign to any family-oriented site that encourages families to join up.<br />
** Some of the more anguished fans have gotten together in various forms to recreate VMK, including [http://game.myvmk.com/ MyVMK], [http://www.vmkrevisited.com/ VMKRevisited] (a memorial site), and [http://www.openvmk.com/ OpenVMK] (although OpenVMK shut down due to [https://docs.google.com/document/d/1qTdpgcLUd-Hg6-FZzkVEju4x1aMlhbMNX6snlUNgnOM/ internal squabbles]).<br />
* '''[http://en.wikipedia.org/wiki/Think_Secret Think Secret]''' was killed by Apple and shut down on February 14, 2008. [http://blog.wired.com/business/2007/12/apple-and-think.html]<br />
* '''Uber.com''' was a social blog site that died. [http://news.cnet.com/8301-13577_3-10052301-36.html]<br />
* '''Social.fm''' couldn't stand up to Last.fm, and died. [http://news.cnet.com/8301-13577_3-10005554-36.html]<br />
* '''Brijit.com''', a news aggregation site, closed on May 15, 2008. It might be closed for good. [http://news.cnet.com/8301-13577_3-9945059-36.html]<br />
* '''Yahoo! Design''', a showcase of designing and information aesthetics related to the Yahoo! properties, got revised into oblivion in February, 2008 as part of a 1,000 employee layoff. [http://infosthetics.com/archives/2008/02/rip_yahoo_design_closed_down.html]<br />
<br />
===2007===<br />
<br />
* '''Yahoo! Podcasts''', a Podcast searching site founded in October 2005 [http://www.ysearchblog.com/2005/10/09/listen-to-the-internet-with-yahoo-podcasts/], was closed with no explanation on October 31, 2007. [http://searchengineland.com/yahoo-podcasts-to-close-the-sorry-state-of-podcast-search-12288]<br />
* '''[http://oink.cd/ OiNK's Pink Palace]''', a music BitTorrent tracker site with a huge user community that cared greatly about digital content and music. It would have been a great resource for the industry to research. Shut down on October 23, 2007. [http://www.wired.com/entertainment/music/news/2007/10/oink]<br />
* '''[http://jam.bbc.co.uk/ BBC Jam]''' was [http://news.bbc.co.uk/2/hi/uk_news/education/6449619.stm suspended] March 20, 2007 and [http://www.guardian.co.uk/media/2008/feb/28/bbc.digitalmedia will not be coming back].<br />
* '''Yahoo! Photos''', a photo sharing service by Yahoo!. Tools: [http://smart-techie.com/yahoo/ Download Hi Resolution Yahoo! Photos] by [http://smart-techie.com/web/ Rohit Sud], [http://kentbrewster.com/download-yahoo-photos/ Download Yahoo! Photos] by [[Kent Brewster]], and [http://yandao.com/yahoograb/ Yahoo! Photos Grabber] by [http://yandao.com Yandao.com]<br />
<br />
===2006===<br />
<br />
===2005===<br />
<br />
* http://IUMA.COM (Internet Underground Music Archive), of Santa Cruz, California, the actual first website to offer free hosting of bands including MP3 files of music offered by the bands, was mostly archived by John Gilmore before going down. At least one IUMA founder now has a copy of that archive. This ~800GB collection has been uploaded to an archiveteam staging server.<br />
<br />
===2004===<br />
<br />
===2003===<br />
<br />
* http://mp3.com went down. Much of it was archived by John Gilmore.<br />
<br />
===2002===<br />
<br />
===2001===<br />
* '''SixDegrees.com''', a social network service website that lasted from 1997 to 2001<br />
* '''The Useless Pages''' (at [http://replay.web.archive.org/20000612123540/http://www.go2net.com/useless/index.html IA])<br />
<br />
== Eleventh Hour Reprieves and Reanimations ==<br />
<br />
* Video host '''[[Viddler]]''' announces in an [http://mad.ly/5ae274?pact=20251445218&fe=1 e-mail newsletter] that they're shutting down free accounts on March 11, 2014. But Archive Team kicked in and began to suck up the place until the owners told us to stop. Videos won't be permanently deleted.<br />
* '''[[4chan|Chanarchive.org]]''' - A site dedicated to saving select quality threads from 4chan, running since 2006 and containing 500 GB of important material. It has shut down entirely, as the owner was banned from PayPal and has no means of paying for the site in its current state. In a [https://boards.4chan.org/q/res/264159 4chan thread], the owner explains that backups will be made available, but there is no guarantee of who will host them, where, or for how long.<br />
* '''[[Berlios.de]]''' will [http://www.berlios.de/ shut down at end of 2011]. The site hosts thousands of open source software projects (git, svn, bzr, mailing lists, bug tracking, etc). [http://developer.berlios.de/docman/display_doc.php?docid=2056&group_id=2 Instructions for exporting a project.] Berlios is still open and they are now [http://joinup.ec.europa.eu/news/german-open-source-development-site-berlios-joins-sourceforge partnered with sourceforge] to keep things running.<br />
* '''[[Citizendium]]''''s finances {{url|1=http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2010-11-08/News_and_notes|2=constantly cry for money}}. Running a MediaWiki site is cheap and Sanger is not homeless, hence it's expected to survive. [[WikiTeam]] archives it on a regular basis.<br />
* '''[[Delicious]]''' [http://www.delicious.com] will be [http://daringfireball.net/linked/2010/12/16/delicious.php shutting down soon]. The whole team was let go on 15 December 2010 ([http://tech.slashdot.org/story/10/12/16/2220225/Yahoo-To-Close-Delicious Slashdot link]). AVOS acquired Delicious from Yahoo! in early 2011; however, all the prior content is gone.<br />
* '''Cli.gs''', another URL shortening service, announced closure: "On Sunday, 25 Oct 2009 at 12:00:00 GMT, the service will stop accepting new short URLs and will stop logging analytics."[http://blog.cli.gs/news/cligs-shutting-down] In December 2009, it was announced that the "social bookmarking" site Mister Wong had acquired cli.gs and was keeping it running.[http://blog.cli.gs/news/mister-wong-acquires-cligs] All aboard the [[TinyURL]] project.<br />
* '''[https://duck.co/topic/soft-launch-of-the-new-forum Duck.co]''', the official DuckDuckGo community forums, transitioned to their [https://dukgo.com own platform] and moved all posts over from their old Zoho forum.<br />
* '''[[Earbits]]''' bites the dust on June 16, 2014, but comes back to life on June 19th, 2014. In between, we grabbed ~130GB of images and ~130k MP3s.<br />
*'''Filefront.com''' is closing up shop [http://farewell.filefront.com/]. The site will be suspended on March 30, 2009. 1.5 Million files and 48+ TB of space gone just like that. '''UPDATE''' As of April 2, 2009, it looks like there may have been an 11th hour reprieve for Filefront. According to a message reportedly from the original founders of the service [http://welcome.filefront.com/], the site has been re-acquired by them in order to prevent its proposed shuttering.<br />
* '''[[Formspring]]''' (now called '''spring.me''') announced they'd be [http://formspring.wordpress.com/2013/03/15/formspring-is-shutting-down/ shutting down] on April 15, 2013. It was, however, acquired by new management on May 8, 2013, and saved from being shut down.<br />
* '''[[Google Video]]''' threatened to remove all hosted videos with two weeks' notice in April 2011. It backed down after criticism and an archive effort by the Archive Team.<br />
* '''Home of the Underdogs''' went under on Feb 9th, 2009[http://flashofsteel.com/index.php/2009/02/13/rip-hotu/]. Word was passed along from the site's owner, now working at an NGO, that an attempt to bring it back might happen. (She definitely has backups of the site.) A community-driven effort to revive the site followed [http://www.hotud.org]. Backups were restored, and the remaining files (1,000+) were collected from the community. As of Jan 4th 2010, HOTU is reporting that files are back online [http://www.hotud.org/component/content/article/25133-files-online-and-import-done]<br />
* '''[[JPG Magazine]]''' announced it would shut down on January 5, 2009 [http://jpgmag.com/blog/2009/01/jpg_magazine_says_goodbye.html], but the site [http://jpgmag.com/blog/2009/02/an_exciting_future_for_jpg.html lives on under new ownership]. Feel free to download the [http://thepiratebay.org/torrent/4624703/ torrent].<br />
* [[Jux]] announced that they would be shutting down on August 31, 2013. '''UPDATE''' On July 17, 2013, Jux announced that they would not shut down, apparently due to financial support from one of their members.<br />
* [[KeygenJukebox]], which shut down in 2014, has recently popped back up.<br />
* '''[[MobyGames]]''', the largest database of old game releases on the net, with a huge amount of content found nowhere else. It was bought by [[GameFly]] in 2010 and received a new site design in September 2013, which made almost all contributors leave. Many site features disappeared or broke in the new design, and the large database of cover art and screenshots had problems loading. In December 2013, the site was bought by [http://blueflamelabs.com/ Blue Flame Labs], who have restored the old site design and managed to draw back pretty much all the contributors. It seems that the site is returning to full health.<br />
* '''[[WebCite]]''' &ndash; Has a habit of crying for money, threatening it will stop accepting submissions. Since October 2013, the Wayback Machine archives pages on demand, hence there's no reason to use a site like WebCite that declares itself at risk. It's expected that they'll send their data to Internet Archive if they ever really have to shut down.<br />
* '''[[Word Count Journal]]''' ({{url|1=http://www.wordcountjournal.com/about}}) is shutting down on June 11, 2011 '''UPDATE''' The site is fully up and running. (checked on October 21, 2011) '''UPDATE2''': Non-functional, but the website is up with this notice "Word Count Journal is no longer being supported." (checked on January 26th, 2012)<br />
<br />
== Links ==<br />
<br />
=== Other Sites Remember the Dead ===<br />
<br />
* [http://www.disobey.com/ghostsites/ Ghost Sites of the Web] by Steve Baldwin. [http://www.disobey.com/ghostsites/atom.xml RSS Feed]<br />
* [http://www.techcrunch.com/tag/deadpool/ Techcrunch's Deadpool] is an excellent archive of stories about site closings.<br />
* [http://deletionpedia.dbatley.com/w/index.php?title=Main_Page Deletionpedia] saved the articles deleted from Wikipedia in 2008, and [http://wikidumper.blogspot.com/ Wikidumper] preserves a selection of them.<br />
<br />
=== Tragic ===<br />
<br />
* [http://news.cnet.com/8301-13578_3-10029798-38.html "Russia Web site owner killed after arrest" - article at CNET News]<br />
<br />
=== Humorous ===<br />
<br />
* [http://www.nzherald.co.nz/lifestyle/news/article.cfm?c_id=6&objectid=10448650 "Dating website's miscalculated publicity attempt" - article at New Zealand Herald]<br />
<br />
{{Navigation pager<br />
| previous = Who We Are<br />
| next = Fire Drill<br />
}}<br />
{{Navigation box}}<br />
<br />
[[Category:Archive Team]]</div>Yanhttps://wiki.archiveteam.org/index.php?title=TwitPic&diff=19994TwitPic2014-09-04T18:01:22Z<p>Yan: twitpic is shutting down</p>
<hr />
<div>{{Infobox project<br />
| title = TwitPic<br />
| image = Twitpic - Share photos on Twitter 1294869067903.png<br />
| description = TwitPic mainpage in 2011-01-12<br />
| URL = http://twitpic.com<br />
| project_status = {{closing}}<br />
| archiving_status = {{notsavedyet}}<br />
}}<br />
<br />
'''TwitPic''' is an image hosting service. The service is designed mainly for Twitter users: images uploaded to the service are given short URLs for use in Twitter posts. Twitter carries a 140-character post limit; the average TwitPic URL is 25 or 26 characters long.<br />
<br />
On September 4, 2014, TwitPic [http://blog.twitpic.com/2014/09/twitpic-is-shutting-down/ announced] they were shutting down.<br />
<br />
== Downloaders ==<br />
* [http://code.google.com/p/emijrp/source/browse/trunk/scrapers/twitpic.py Downloader by tag] (it saves the full resolution image and metadata: uploader, date and description)<br />
<br />
== External links ==<br />
* http://twitpic.com<br />
<br />
{{Navigation box}}<br />
<br />
[[Category:Image hosting services]]</div>Yanhttps://wiki.archiveteam.org/index.php?title=TropicalWikis&diff=16720TropicalWikis2013-05-20T13:21:05Z<p>Yan: add image</p>
<hr />
<div>{{Infobox project<br />
| title = TropicalWikis<br />
| logo = Tropicalwikistwitter.png<br />
| image = TropicalWikis-20130520.png<br />
| description = Page listing the wikis hosted on 2013-05-20.<br />
| URL = http://www.tropicalwikis.com<br />
| project_status = {{online}}<br />
| archiving_status = {{nosavedyet}}<br />
| irc = wikiteam<br />
}}<br />
<br />
'''TropicalWikis''' is a [[wikifarm]].<br />
<br />
Some wikis have vanished. Status is not clear.<br />
<br />
== Backups ==<br />
* No known backup for this wikifarm<br />
* [[TropicalWikis/Twitter account]] grab ([https://twitter.com/#!/TropicalWikis @TropicalWikis])<br />
<br />
== See also ==<br />
* [[List of wikifarms]]<br />
<br />
== External links ==<br />
* http://www.tropicalwikis.com<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=File:TropicalWikis-20130520.png&diff=16719File:TropicalWikis-20130520.png2013-05-20T13:20:10Z<p>Yan: Screenshot of http://www.tropicalwikis.com/wiki/Special:Farmer/list on 2013-05-20.</p>
<hr />
<div>Screenshot of http://www.tropicalwikis.com/wiki/Special:Farmer/list on 2013-05-20.</div>Yanhttps://wiki.archiveteam.org/index.php?title=Working_with_ARCHIVE.ORG&diff=16717Working with ARCHIVE.ORG2013-05-19T17:30:25Z<p>Yan: add wikilink + grammar & dash</p>
<hr />
<div>[[Image:Archivesneedlove.jpg.jpg]]<br />
<br />
<br />
The [[Internet Archive]] has enormous resources at its disposal, and shares many parallel goals with Archive Team. There are advantages to making as many of Archive Team's data saves as possible available to ARCHIVE.ORG, especially for the [http://wayback.archive.org/web/ Wayback Machine], a mechanism for browsing older web materials going back years. Where possible, Archive Team should try to work with ARCHIVE.ORG, and get the love going.<br />
__TOC__<br />
== Headers and Logs ==<br />
<br />
The most striking difference between Archive Team and ARCHIVE.ORG is that while Archive Team traditionally considers header and logging information to be a nice hat trick if you have the time, ARCHIVE.ORG absolutely needs it to import into their Wayback Machine. With Wget, the best way to do this is to save the log files, and use ''--save-headers'' to include the HTTP response headers at the top of each downloaded file.<br />
<br />
(More research needs to be done to ensure these options are in scripts.)<br />
<br />
Unfortunately, files saved this way cannot simply be dumped into a new location and served as-is, because each one starts with its HTTP headers. Either keep two copies (one with headers, one without), or use a script that strips the headers out of the files for later use.<br />
<br />
== Stripping the Headers ==<br />
As an example, let's download this page:<br />
<pre>wget -O headers.raw.html --save-headers "http://www.archiveteam.org/index.php?title=Working_with_ARCHIVE.ORG"</pre><br />
<br />
If we want to strip the headers, then we run:<br />
<pre>cat headers.raw.html | perl -ne 'unless ($out) { $out=1 if $_ eq "\r\n"; next } print' > noheaders.raw.html</pre><br />
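<br />
The same idea can be tried without touching the network. The following is a minimal sketch, not the method used by any Archive Team script: it builds a small sample file shaped like <code>--save-headers</code> output and strips the headers with awk instead of perl. The file names and the sample content are made up for illustration.<br />

```shell
#!/bin/sh
# Work in a throwaway directory (hypothetical file names throughout).
tmp=$(mktemp -d)

# Build a sample shaped like `wget --save-headers` output:
# CRLF-terminated header lines, one empty CRLF line, then the body.
printf 'HTTP/1.1 200 OK\r\nContent-Type: text/html\r\n\r\n<html>\n<body>hello</body>\n</html>\n' \
  > "$tmp/sample.raw.html"

# Once the first empty line has been seen, print every following line.
# Everything before it (the headers) and the empty line itself are dropped.
awk 'body { print; next } /^\r?$/ { body = 1 }' "$tmp/sample.raw.html" \
  > "$tmp/sample.noheaders.html"

cat "$tmp/sample.noheaders.html"
```

Because the check for "already in the body" comes first, empty lines inside the HTML itself are kept; only the header block and its terminating blank line are removed.<br />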
<br />
== The Archive.org "Archiveteam" Collection ==<br />
<br />
Archive.org currently has an [http://www.archive.org/details/archiveteam Archive Team Collection], which consists of site-grabs, older archives, and various rips/mass downloads from websites and Archive Team activities. Right now, that collection is primarily administered by Jason Scott &mdash; sending him e-mail at jason@textfiles.com with something you think that collection should have is probably the best way to go about things.</div>Yanhttps://wiki.archiveteam.org/index.php?title=Internet_Archive&diff=16716Internet Archive2013-05-19T17:26:38Z<p>Yan: interlink</p>
<hr />
<div>{{Infobox project<br />
| title = Internet Archive<br />
| image = Internet Archive- Digital Library of Free Books, Movies, Music & Wayback Machine 1292930995846.png<br />
| description = Internet Archive mainpage in 2010-12-21<br />
| URL = {{url|1=http://www.archive.org}}<br />
| project_status = {{online}}<br />
| archiving_status = {{nosavedyet}}<br />
}}<br />
The '''Internet Archive''' is a non-profit digital library with the stated mission/motto: "universal access to all knowledge". The Internet Archive stores several billion webpages from different dates and times for historical purposes that are available through the Wayback Machine, arguably an archivist's wet dream. The Archive.org website also archives books, music and videos.<br />
<br />
== Mirrors ==<br />
<br />
There are currently two mirrors of the Internet Archive collection - the official mirror available at archive.org, and a second mirror at Bibliotheca Alexandrina. Both seem to be up and stable.<br />
<br />
== Raw Numbers as of December 2010 ==<br />
<br />
* 4 data centers, 1,300 nodes, 11,000 spinning disks<br />
* Wayback Machine: 2.4 PetaBytes<br />
* Books/Music/Video Collections: 1.7 PetaBytes<br />
* Total used storage: 5.8 PetaBytes<br />
<br />
== See also ==<br />
* [[Working with ARCHIVE.ORG]]<br />
<br />
== External links ==<br />
* {{url|1=http://www.archive.org}}<br />
* {{url|1=http://archive.bibalex.org|2=Bibliotheca Alexandrina mirror}}<br />
* {{url|1=http://www.archive.org/web/petabox.php|2=Petabox details}}<br />
<br />
{{Navigation box}}</div>Yanhttps://wiki.archiveteam.org/index.php?title=Wget_with_WARC_output&diff=16715Wget with WARC output2013-05-19T17:21:56Z<p>Yan: /* Options */ missing named parameter?</p>
<hr />
<div>From the discussion about [[Working with ARCHIVE.ORG]], we learn that it is important to save not just files but also HTTP headers. With Wget, that's difficult. With a few tricks you can keep the response headers, but there is no option to save the request headers. You also lose the response headers that don't produce an HTML page: Wget doesn't save redirects and 404 responses.<br />
<br />
The [http://savannah.gnu.org/bzr/?group=wget development version of Wget] can write its results to a [http://www.digitalpreservation.gov/formats/fdd/fdd000236.shtml WARC] (Web ARChive file format) file, just like Heritrix and other archiving tools. With the WARC format, it's possible to save both the request and the response headers. It also provides a clean way to store redirects and 404 responses.<br />
<br />
There is an additional advantage: if Wget writes these headers to a WARC file, it is no longer necessary to use the <code>--save-headers</code> option to save them at the top of each downloaded file. There is no need to remove these headers afterwards to produce a clean copy: the mirror produced by Wget is usable without post-processing.<br />
<br />
Note that this work has been accepted into the Wget codebase, and [https://twitter.com/anarchivist/statuses/232550155394641920 as of version 1.14 wget supports WARC output out of the box].<br />
<br />
== Compiling ==<br />
<br />
<pre><br />
bzr branch bzr://bzr.savannah.gnu.org/wget/trunk<br />
cd trunk<br />
./bootstrap<br />
./configure && make<br />
</pre><br />
<br />
== Usage ==<br />
<br />
To download a file and save the request and response data to a WARC file, run this:<br />
<br />
<pre><br />
src/wget "http://www.archiveteam.org/" --warc-file="at"<br />
</pre><br />
<br />
This will download the file to <code>index.html</code>, but it will also create a file <code>at-00000.warc.gz</code>. This is a gzipped WARC file that contains the request and response headers (of the initial redirect and of the Wiki homepage) and the html data.<br />
<br />
If you want to have a non-compressed WARC file, use the <code>--no-warc-compression</code> option:<br />
<br />
<pre><br />
src/wget "http://www.archiveteam.org/" --warc-file="at" --no-warc-compression<br />
</pre><br />
<br />
Saving one file is nice, but the <code>--warc-file</code> option becomes even more powerful if you combine it with Wget's <code>--mirror</code> option. (You may want to try this with a smaller site than the AT wiki.)<br />
<br />
<pre><br />
src/wget "http://www.archiveteam.org/" --mirror --warc-file="at"<br />
</pre><br />
<br />
If you uncompress <code>at-00000.warc.gz</code> and look at it, you'll see that it contains WARC records for every request and response: it is a complete copy of the mirrored site, while at the same time Wget also created the normal mirror of the site.<br />
<br />
== Options ==<br />
<br />
<code>--warc-file=FILENAME</code> enables the WARC export. WARC files will be based on FILENAME: FILENAME-00000.warc.gz, FILENAME-00001.warc.gz et cetera.<br />
<br />
<code>--warc-max-size=NUMBER</code> defines the maximum size of the WARC files. The default is no limit ("inf"). If you download a large site, the recommended limit is 1GB; set the option to 1G to enable this limit. Note that this is a soft limit: files can get slightly larger than this, depending on the files you download.<br />
<br />
<code>--warc-header=STRING</code> adds STRING as a custom header to the warcinfo record, e.g. "operator: Archive Team". This option can be used multiple times.<br />
<br />
<code>--warc-cdx</code> writes a CDX index file to FILENAME.cdx, where FILENAME is the name given to <code>--warc-file</code>. The option itself takes no argument. The CDX file will contain a list of the records and their locations in the WARC files.<br />
<br />
<code>--warc-dedup=FILENAME</code> can be used to reduce the size of WARC files generated by a recrawl. FILENAME should point to a CDX file, generated with <code>--warc-cdx</code> in a previous run. For each file it downloads, Wget will check the CDX file to see if the response is listed there. If the exact file already exists, a "revisit" record with a reference to the previous record will be added to the WARC file, instead of a duplicate "response" record. Duplicate records are detected by comparing the SHA-1 digest of the payload of the response.<br />
<br />
<code>--no-warc-compression</code> will write uncompressed WARC files. Compression is enabled by default. It is better to use the built-in compression than to compress the WARC files afterwards. The built-in compression will compress each record as an individual GZIP block, which allows other utilities to extract single records from the file.<br />
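<br />
Why per-record compression matters can be shown with plain gzip, independent of Wget. This is only a sketch: the two "records" are stand-in strings, not real WARC records, and the file names are hypothetical. It shows that a file made of several gzip members still decompresses as one stream, and that a single member can be cut out and decompressed by itself.<br />

```shell
#!/bin/sh
tmp=$(mktemp -d)

# Compress the first stand-in record as its own gzip member.
printf 'record one\n' | gzip -c > "$tmp/demo.gz"
# Remember where the first member ends (its length in bytes).
first_len=$(wc -c < "$tmp/demo.gz" | tr -d ' ')

# Append a second record as a separate gzip member.
printf 'record two\n' | gzip -c >> "$tmp/demo.gz"

# The whole file decompresses as one continuous stream...
gzip -dc "$tmp/demo.gz"

# ...and the first member alone is also a valid gzip stream.
head -c "$first_len" "$tmp/demo.gz" | gzip -dc
```

If the records had been concatenated first and compressed afterwards as one block, a tool would have to decompress from the start of the file to reach a later record. With one member per record, a byte offset (such as those listed in a CDX file) is enough.<br />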
<br />
<code>--no-warc-digests</code> disables the SHA-1 digests. By default, SHA-1 digests will be calculated for the whole response block and the response payload. If you really need to, you can disable that.<br />
<br />
<code>--no-warc-keep-log</code> can be set if you don't want the Wget log in the WARC file. By default, Wget will add the log file as a separate record to the WARC file.<br />
<br />
<code>--warc-tempdir=DIRECTORY</code> sets the temporary directory used by the WARC writer. The system tempdir will be used by default.<br />
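<br />
As a rough illustration of how the options above combine, here is a sketch of a mirror run that writes size-limited WARC files, a CDX index and a custom warcinfo header. <code>SITE</code>, <code>NAME</code> and the <code>RUN</code> switch are assumptions made for this example, not part of Wget; by default the script only prints the command so that it can be checked before a crawl is started.<br />

```shell
#!/bin/sh
# Hypothetical placeholders: replace with the real target and base name.
SITE="http://www.example.com/"
NAME="example"

# Assemble the command as positional parameters so that arguments
# containing spaces (the warcinfo header) stay intact.
# --warc-cdx is given without a value; the index goes to "$NAME.cdx".
set -- wget "$SITE" --mirror \
  --warc-file="$NAME" \
  --warc-max-size=1G \
  --warc-cdx \
  --warc-header="operator: Archive Team"

# Print the command by default; run it only when RUN=1 is set.
if [ "${RUN:-0}" = "1" ]; then
  "$@"
else
  printf '%s\n' "$*"
fi
```

Running it as <code>RUN=1 sh grab.sh</code> (the script name is arbitrary) would start the actual download, assuming a Wget build with WARC support is installed.<br />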
<br />
== WARC file format ==<br />
<br />
The WARC file format is an ISO standard. The official specification of [http://www.iso.org/iso/catalogue_detail.htm?csnumber=44717 ISO 28500:2009] is not available for free. However, the [http://bibnum.bnf.fr/WARC/WARC_ISO_28500_version1_latestdraft.pdf final draft] is free, and is supposed to be technically equivalent to the official standard.<br />
<br />
The WARC usage task force has published [http://netpreserve.org/sites/default/files/resources/WARC_Guidelines_v1.pdf WARC implementation guidelines] with additional recommendations.<br />
<br />
[[Category:Tools]]</div>Yanhttps://wiki.archiveteam.org/index.php?title=Wget_with_WARC_output&diff=16714Wget with WARC output2013-05-19T17:21:09Z<p>Yan: /* WARC file format */ document moved?</p>
<hr />
<div>From the discussion about [[Working with ARCHIVE.ORG]], we learn that it is important to save not just files but also HTTP headers. With Wget, that's difficult. With a few tricks you can keep the response headers, but there is no option to save the request headers. You also lose the response headers that don't produce an HTML page: Wget doesn't save redirects and 404 responses.<br />
<br />
The [http://savannah.gnu.org/bzr/?group=wget development version of Wget] can write its results to a [http://www.digitalpreservation.gov/formats/fdd/fdd000236.shtml WARC] (Web ARChive file format) file, just like Heritrix and other archiving tools. With the WARC format, it's possible to save both the request and the response headers. It also provides a clean way to store redirects and 404 responses.<br />
<br />
There is an additional advantage: if Wget writes these headers to a WARC file, it is no longer necessary to use the <code>--save-headers</code> option to save them at the top of each downloaded file. There is no need to remove these headers afterwards to produce a clean copy: the mirror produced by Wget is usable without post-processing.<br />
<br />
Note that this work has been accepted into the Wget codebase, and [https://twitter.com/anarchivist/statuses/232550155394641920 as of version 1.14 wget supports WARC output out of the box].<br />
<br />
== Compiling ==<br />
<br />
<pre><br />
bzr branch bzr://bzr.savannah.gnu.org/wget/trunk<br />
cd trunk<br />
./bootstrap<br />
./configure && make<br />
</pre><br />
<br />
== Usage ==<br />
<br />
To download a file and save the request and response data to a WARC file, run this:<br />
<br />
<pre><br />
src/wget "http://www.archiveteam.org/" --warc-file="at"<br />
</pre><br />
<br />
This will download the file to <code>index.html</code>, but it will also create a file <code>at-00000.warc.gz</code>. This is a gzipped WARC file that contains the request and response headers (of the initial redirect and of the Wiki homepage) and the html data.<br />
<br />
If you want to have a non-compressed WARC file, use the <code>--no-warc-compression</code> option:<br />
<br />
<pre><br />
src/wget "http://www.archiveteam.org/" --warc-file="at" --no-warc-compression<br />
</pre><br />
<br />
Saving one file is nice, but the <code>--warc-file</code> option becomes even more powerful if you combine it with Wget's <code>--mirror</code> option. (You may want to try this with a smaller site than the AT wiki.)<br />
<br />
<pre><br />
src/wget "http://www.archiveteam.org/" --mirror --warc-file="at"<br />
</pre><br />
<br />
If you uncompress <code>at-00000.warc.gz</code> and look at it, you'll see that it contains WARC records for every request and response: it is a complete copy of the mirrored site, while at the same time Wget also created the normal mirror of the site.<br />
<br />
== Options ==<br />
<br />
<code>--warc-file=FILENAME</code> enables the WARC export. WARC files will be based on FILENAME: FILENAME-00000.warc.gz, FILENAME-00001.warc.gz et cetera.<br />
<br />
<code>--warc-max-size=NUMBER</code> defines the maximum size of the WARC files. The default is no limit ("inf"). If you download a large site, the recommended limit is 1GB; set the option to 1G to enable this limit. Note that this is a soft limit: files can get slightly larger than this, depending on the files you download.<br />
<br />
<code>--warc-header=STRING</code> adds STRING as a custom header to the warcinfo record, e.g. "operator: Archive Team". This option can be used multiple times.<br />
<br />
<code>--warc-cdx</code> writes a CDX index file to FILENAME.cdx, where FILENAME is the name given to <code>--warc-file</code>. The CDX file will contain a list of the records and their locations in the WARC files.<br />
<br />
<code>--warc-dedup=FILENAME</code> can be used to reduce the size of WARC files generated by a recrawl. FILENAME should point to a CDX file, generated with <code>--warc-cdx</code> in a previous run. For each file it downloads, Wget will check the CDX file to see if the response is listed there. If the exact file already exists, a "revisit" record with a reference to the previous record will be added to the WARC file, instead of a duplicate "response" record. Duplicate records are detected by comparing the SHA-1 digest of the payload of the response.<br />
<br />
<code>--no-warc-compression</code> will write uncompressed WARC files. Compression is enabled by default. It is better to use the built-in compression than to compress the WARC files afterwards. The built-in compression will compress each record as an individual GZIP block, which allows other utilities to extract single records from the file.<br />
<br />
<code>--no-warc-digests</code> disables the SHA-1 digests. By default, SHA-1 digests will be calculated for the whole response block and the response payload. If you really need to, you can disable that.<br />
<br />
<code>--no-warc-keep-log</code> can be set if you don't want the Wget log in the WARC file. By default, Wget will add the log file as a separate record to the WARC file.<br />
<br />
<code>--warc-tempdir=DIRECTORY</code> sets the temporary directory used by the WARC writer. The system tempdir will be used by default.<br />
<br />
== WARC file format ==<br />
<br />
The WARC file format is an ISO standard. The official specification of [http://www.iso.org/iso/catalogue_detail.htm?csnumber=44717 ISO 28500:2009] is not available for free. However, the [http://bibnum.bnf.fr/WARC/WARC_ISO_28500_version1_latestdraft.pdf final draft] is free, and is supposed to be technically equivalent to the official standard.<br />
<br />
The WARC usage task force has published [http://netpreserve.org/sites/default/files/resources/WARC_Guidelines_v1.pdf WARC implementation guidelines] with additional recommendations.<br />
<br />
[[Category:Tools]]</div>Yan