Difference between revisions of "Current Projects"

From Archiveteam
Jump to navigation Jump to search
m (→‎Warrior-based projects: Switch plays.tv to active 2/2)
(Mixer to recently finished, still working on the last of the Mercurial repos at Bitbucket)
(22 intermediate revisions by 9 users not shown)
Line 10: Line 10:


<!-- Urgent projects -->
<!-- Urgent projects -->
* [[Plays.tv]]: Stopping.tv on 2019-12-15. '''IRC Channel {{IRC|stops.tv|network=hackint}}'''.
* [[Yahoo! Groups]]: Years of internet threads, soon to go private-only. '''IRC Channel {{IRC|yahoosucks}}'''
<!-- Long-term projects -->
<!-- Long-term projects -->
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.


=== Scripts only ===
=== Scripts only ===
* [[NewsGrabber]]: Saving all news articles. Help with server power or by finding more news sites. '''IRC Channel {{IRC|newsgrabber}}'''.
* [[Bitbucket]]: Kicking the bucket on Mercurial repositories by July 1 2020 to worship Git instead. '''IRC Channel {{IRC|kickthebucket|network=hackint}}'''.


== Manual projects ==
== Manual projects ==
* [https://github.com/davidferguson/yahoogroups-joiner Yahoogroups-joiner] Filling out captchas to archive Yahoo Groups. '''IRC Channel {{IRC|yahoosucks}}'''.
* [[Coronavirus|2019-2020 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive}}'''
* [[Yahoo! Groups]]: Years of internet threads, soon to go private-only. '''IRC Channel {{IRC|yahoosucks|network=hackint}}'''
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.
* [[MP3.com]]: Digging through the WayBack Machine's archives to build a database of all the DAM CDs made available through the site.
* [[MP3.com]]: Digging through the WayBack Machine's archives to build a database of all the DAM CDs made available through the site.


== Active Projects Not Yet Ingesting ==
== Upcoming & proposed projects ==
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). -->
<!-- Top priority: could disappear anytime now -->
<!-- Top priority: could disappear anytime now -->
<!-- Shutting down, definite deadline given -->
<!-- Shutting down, definite deadline given -->
<!-- Other -->
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit}}'''.
== Proposed projects ==
* [[YouTube]]: Making playlists of liked videos private on 2019-12-05. '''IRC Channel {{IRC|down-the-tube|network=hackint}}'''.
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). -->
<!-- Shutting down, vague deadline given -->
<!-- Shutting down, vague deadline given -->
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago}}'''.
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago}}'''.
Line 39: Line 33:
<!-- Archiving the archives -->
<!-- Archiving the archives -->
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|getgit}}'''.
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud|network=hackint}}'''.
* [[Imgur]]: Image hoster decided that using it for hosting images is not permitted. '''IRC Channel {{IRC|imgone}}'''.
* [[Imgur]]: Image hoster decided that using it for hosting images is not permitted. '''IRC Channel {{IRC|imgone}}'''.
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''.
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''.
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
* [[Giphy]]: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy


== Recently finished projects ==
== Recently finished projects ==
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
* [[Drawr]]: Doodle repository getting sharpied out on December 2nd (JST?). '''IRC Channel {{IRC|drawrnomore}}'''.
* [[Mixer]]: Video game streaming network shutting down 2020-07-23. '''IRC Channel {{IRC|mixdown|network=hackint}}'''
* [[Gfycat]]: Deleting old anonymous uploads on or after 2019-11-18. '''IRC Channel {{IRC|deadcat|network=hackint}}'''.
* [[Google Fusion Tables]]: Shutting down on 2019-12-03. '''IRC Channel {{IRC|FuslOnTable|network=hackint}}'''.


== Hiatus / Missed the Mark ==
== Hiatus / Missed the Mark ==
Line 64: Line 58:
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
* [[NewsGrabber]]: Saving all news articles. <!-- Help with server power or by finding more news sites.-->Currently paused. '''IRC Channel {{IRC|newsgrabber}}'''.
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
* [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''.
* [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''.
Line 69: Line 64:
* [[yuku]]: Lately yuku is very unstable and hosting thousands of forums. Project currently paused. '''IRC Channel {{IRC|archiveteam}}'''.
* [[yuku]]: Lately yuku is very unstable and hosting thousands of forums. Project currently paused. '''IRC Channel {{IRC|archiveteam}}'''.


<small>ArchiveTeam uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[IRC|More info]]
<small>ArchiveTeam primarily uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[Archiveteam:IRC|More info]]</small><br>
<small>ArchiveTeam also uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/ – [[Archiveteam:IRC|More info]]

Revision as of 22:51, 8 August 2020

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Telegram
  • URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).

Scripts only

  • Bitbucket: Kicking the bucket on Mercurial repositories by July 1 2020 to worship Git instead. IRC Channel #kickthebucket (on hackint).

Manual projects

  • 2019-2020 coronavirus outbreak: Documenting and preserving data, events, and impacts of the virus on society. IRC Channel #coronarchive (on hackint)
  • Yahoo! Groups: Years of internet threads, soon to go private-only. IRC Channel #yahoosucks (on hackint)
  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • WikiTeam: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).
  • MP3.com: Digging through the WayBack Machine's archives to build a database of all the DAM CDs made available through the site.

Upcoming & proposed projects

Recently finished projects

  • Mixer: Video game streaming network shutting down 2020-07-23. IRC Channel #mixdown (on hackint)

Hiatus / Missed the Mark

ArchiveTeam primarily uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090More info
ArchiveTeam also uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/More info