Difference between revisions of "Current Projects"

From Archiveteam
Jump to navigation Jump to search
(MediaFire to scripts only)
(add Tor URLs skeleton)
 
(138 intermediate revisions by 16 users not shown)
Line 1: Line 1:
__NOTOC__
__NOTOC__
== Archive Team recruiting ==
== Archive Team recruiting ==
* Help us: '''[[ArchiveTeam_Warrior|☞ Download and run your warrior ☜]]'''.
* What's on: [https://tracker.archiveteam.org/ online tracker].
* [[Donate|Donate to keep our projects going]].
* [[Dev|Want to code for Archive Team? Here's a starting point.]]
* [[Dev|Want to code for Archive Team? Here's a starting point.]]
* Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br>
* What's on: [https://tracker.archiveteam.org/ online tracker].<br>
<!--Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].-->


== Warrior-based projects ==
== Warrior-based projects ==
{{:CurrentWarriorProject}}
{{:CurrentWarriorProject}}


=== Short-term, urgent projects ===
<!-- Urgent projects -->
<!-- Urgent projects -->
<!-- Long-term projects -->
<!-- sorted by deadline (soonest on top) -->
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam|network=hackint}}'''.
* [[DeviantArt]]: Archiving custom widgets, favorites, group affiliations, countdown timers, admin forums, and admin announcements. '''IRC Channel {{IRC|devianttart}}'''


''There will be fewer Warrior projects than usual due to the virtual appliance being unable to run many newer projects that utilize wget-at. It will take a little bit of time before an updated version is available that can run it.''
=== Medium-term projects ===
<!-- sorted alphabetically -->


=== Scripts only ===
(none currently)
* [[MediaFire]]: [https://twitter.com/textfiles/status/1339625133363912706 Setting fire to accounts and files it deems "abandoned" starting in January 2021.] '''IRC Channel {{IRC|mediaonfire|network=hackint}}'''.
 
* .EU domains: The Brexit deal is done, and with that comes a purge of UK-based sites no longer eligible to use the .EU domain as of 2021. '''IRC Channel {{IRC|noteurdomain|network=hackint}}'''.
=== Long-term projects ===
* [[Endomondo]]: GPS workout tracker with optional social networking features, shutting down 2020-12-31. '''IRC Channel {{IRC|findelmundo|network=hackint}}'''.
<!-- sorted alphabetically -->
* [[Flash]] domains: An effort to preserve what remains of a storied legacy of websites hosting Adobe Flash Player content before the web industry takes it behind the shed. '''IRC Channel {{IRC|flashbang|network=hackint}}'''.
* [[Blogger]]: Grabbing inactive Blogger blogs since Google began a mass purge of inactive Google accounts on or after {{datetime|2023-12-01}}. '''IRC Channel {{IRC|frogger}}'''.
* Classic [[Google Sites]]: Making more sites inaccessible to the public starting September 1, 2021. '''IRC Channel {{IRC|nearlylostmygoogles|network=hackint}}'''.
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Currently grabbing ''new'' material. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.
* [[Imgur]]: Unregistered users' "old" and "inactive" images will be purged, and all NSFW content is being shown the door on {{datetime|2023-05-15}}. '''IRC Channel {{IRC|imgone}}'''.
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud|network=hackint}}'''.
* [[MediaFire]]: [https://twitter.com/textfiles/status/1349516443654758401 Not 'at-risk' but grabbing speculatively to save historic files] '''IRC Channel {{IRC|mediaonfire}}'''.
* [[Pastebin]]: Archiving the pastas. '''IRC Channel {{IRC|pastalavista}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on {{datetime|2023-06-19}}. '''IRC Channel {{IRC|shreddit}}'''.
* [[Telegram]]: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. '''IRC Channel {{IRC|telegrab}}'''.
* [[URLs|Tor URLs]]: A random collection of stuff, but from Tor <code>.onion</code> URLs. '''IRC Channel {{IRC|//}}'''.
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.
* [[URLs]]: A random collection of stuff. '''IRC Channel {{IRC|//}}'''.
* [[YouTube]]: Archiving [[YouTube#Scope|selected videos]]. '''IRC Channel {{IRC|down-the-tube}}'''.
 
''An [[ArchiveTeam_Warrior|updated Warrior virtual appliance (v3.2)]] is now available with better support for newer projects that utilize wget-at.''


== Manual projects ==
== Manual projects ==
* [[Coronavirus|2019-2020 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive|network=hackint}}'''
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot|network=hackint}}'''.
* [[Codearchiver]]: Dumping and archival of source code repositories and associated version control systems. '''IRC Channel {{IRC|codearchiver}}'''.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam|network=hackint}}'''.
* Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. '''IRC Channel {{IRC|archiveteam}}'''
* [[MP3.com]]: Digging through the WayBack Machine's archives to build a database of all the DAM CDs made available through the site.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.


== Upcoming & proposed projects ==
== Upcoming & proposed projects ==
Line 34: Line 45:
<!-- Top priority: could disappear anytime now -->
<!-- Top priority: could disappear anytime now -->
<!-- Shutting down, definite deadline given -->
<!-- Shutting down, definite deadline given -->
* [[Halo]]: Back to finishing off unfinished business before Bungie kills the original website on February 9, 2021. '''IRC Channel {{IRC|yolohalo|network=hackint}}'''.
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between {{datetime|2021-12-01}} and 2025. '''IRC Channel {{IRC|chromeweblore}}'''.
* [[Webs]]: Vistaprint is killing off the Freewebs you knew from the 2000s on March 31, 2021, unless you pay up. '''IRC Channel {{IRC|webbed|network=hackint}}'''.
* [[Periscope]]: Another Twitter acquisition, another shutdown. This time, its live-streamer gets to join Vine in the bin at the end of March. '''IRC Channel {{IRC|microscope|network=hackint}}'''.
* [[Google Poly]]: A 3D art repository that Google will send to the trash compactors on June 30, 2021. New uploads cease April 30. '''IRC Channel {{IRC|polygone|network=hackint}}'''.
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between December 1, 2020 and June 2022. '''IRC Channel {{IRC|chromeweblore|network=hackint}}'''.
<!-- Shutting down, vague deadline given -->
<!-- Shutting down, vague deadline given -->
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago|network=hackint}}'''.
* [[Photobucket]]: Finally following through on over a year of email threats that free accounts are going to be mass deactivated if they don't pay up. '''IRC Channel {{IRC|photosucket}}'''.
* [[Twitter]]: Deleting inactive accounts <s>2019-12-11</s> sometime. '''IRC Channel {{IRC|twitterdead}}'''.
* Abandoned iOS App Store & Google Play apps: Both Apple and Google are slimming down on abandoned apps, [https://www.cnet.com/tech/mobile/one-third-of-apple-and-google-apps-are-so-outdated-they-could-get-removed/ with an estimated ~1.5M of them at risk]. '''IRC Channel {{IRC|appocalypse}}'''.
* [[Twitter]]: General instability; deleting inactive accounts <s>{{datetime|2019-12-11}}</s> sometime. '''IRC Channel {{IRC|twitterdead|EFnet|abandoned}}'''.
<!-- Shutting down, no deadline given -->
<!-- Shutting down, no deadline given -->
<!-- Archiving the archives -->
<!-- Archiving the archives -->
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
* [[Imgur]]: Image hoster decided that using it for hosting images is not permitted. '''IRC Channel {{IRC|imgone}}'''.
* [[VKontakte]]: A Russian equivalent of Facebook carries the risk of tumbling down under the weight of sanctions as a result of the government's invasion of Ukraine. '''IRC Channel {{IRC|lostkontakt}}'''.
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''.
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums|EFnet|abandoned}}'''.
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal|EFnet|abandoned}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee|EFnet|abandoned}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
* [[Giphy]]: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy
* [[Giphy]]: Bought by <s>Facebook</s>Shutterstock, to be "integrated" (assimilated) into <s>Instagram</s> https://news.knowyourmeme.com/news/facebook-to-buy-giphy


== Recently finished projects ==
== Recently finished projects ==
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
<!-- projects that have finished in the last 30 days go here in reverse-chronogical order to be found easily and showcase recent work. additionally, keep projects here that are still in the tracker but not yet deleted so it won't confuse people. -->
* [[SmackJeeves]]: Webcomics host being tossed into the incinerator on 2020-12-31. '''IRC Channel {{IRC|archiveteam-bs|network=hackint}}'''.
* [[Taringa!]]: Shut down on {{datetime|2024-03-24}} with barely two weeks lead time. '''IRC Channel {{IRC|mataringa}}'''. <!-- finished: 2024-03-25 -->
* [[Voat]]: A reddit competitor from the Ellen Pao days gives its users a Christmas present: it's fucking dead! '''IRC Channel {{IRC|scrapevoat|network=hackint}}'''.
 


== Hiatus / Missed the Mark ==
== On Hiatus ==
* [[Fast.io]]: A CDN for cloud storage services which will evaporate completely on 2021-01-15. '''IRC Channel {{IRC|slowio|network=hackint}}'''.
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''.
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr|network=hackint}}'''.
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr}}'''.
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee|network=hackint}}'''.
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07).
* [[Google Drive]]: Same as MediaFire. '''IRC Channel {{IRC|googlecrash}}'''. Currently on haitus.
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], {{datetime|2016-06-07}}).
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''.
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''.
* [[DevPort]]: This [http://developerportfolio.com/ portfolio SaaS provider] has [http://www.lowendtalk.com/discussion/65135/need-some-help-saas-provider-is-dead-but-my-site-is-still-up-how-should-i-grab-it reportedly] been having infrastructure issues, and removed their social media accounts. Possible impending shutdown.
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting|network=hackint}}'''.
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
* [[NewsGrabber]]: Saving all news articles. <!-- Help with server power or by finding more news sites.-->Currently paused. '''IRC Channel {{IRC|newsgrabber|network=hackint}}'''.
* [[Miraheze]]: <s>Shutting down sometime between {{datetime|2023-09-01}} and {{datetime|2023-10-31}}.</s> Rescued by new volunteers!
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
* [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''.
* [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''.
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. '''IRC Channel {{IRC|tumbledown|network=hackint}}'''.
* [[Tinkercad]]: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around {{datetime|2021-05-24}}. '''IRC Channel {{IRC|tinkerhad}}'''.
* [[yuku]]: Lately yuku is very unstable and hosting thousands of forums. Project currently paused. '''IRC Channel {{IRC|archiveteam|network=hackint}}'''.
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. '''IRC Channel {{IRC|tumbledown}}'''.


<small>ArchiveTeam primarily uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/ – [[Archiveteam:IRC|More info]]
<small>ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/ – [[Archiveteam:IRC|More info]]</small>
<small>ArchiveTeam also has some channels left on the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[Archiveteam:IRC|More info]]</small><br>

Latest revision as of 20:03, 27 March 2024

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Telegram

Short-term, urgent projects

  • DeviantArt: Archiving custom widgets, favorites, group affiliations, countdown timers, admin forums, and admin announcements. IRC Channel #devianttart (on hackint)

Medium-term projects

(none currently)

Long-term projects

An updated Warrior virtual appliance (v3.2) is now available with better support for newer projects that utilize wget-at.

Manual projects

  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • Codearchiver: Dumping and archival of source code repositories and associated version control systems. IRC Channel #codearchiver (on hackint).
  • Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. IRC Channel #archiveteam (on hackint)
  • WikiTeam: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).

Upcoming & proposed projects

Recently finished projects

  • Taringa!: Shut down on 2024-03-24 with barely two weeks lead time. IRC Channel #mataringa (on hackint).


On Hiatus

ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/More info