Difference between revisions of "Current Projects"

From Archiveteam
Jump to navigation Jump to search
(Add note about increasing hackint usage)
(Remove "active not yet ingesting" section which is confusing, add 8tracks, move projects to the right sections)
Line 10: Line 10:


<!-- Urgent projects -->
<!-- Urgent projects -->
* [[Yahoo! Groups]]: Years of internet threads, soon to go private-only. '''IRC Channel {{IRC|yahoosucks}}'''
<!-- Long-term projects -->
<!-- Long-term projects -->
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.


=== Scripts only ===
=== Scripts only ===
* [[NewsGrabber]]: Saving all news articles. Help with server power or by finding more news sites. '''IRC Channel {{IRC|newsgrabber}}'''.


== Manual projects ==
== Manual projects ==
* [https://github.com/davidferguson/yahoogroups-joiner Yahoogroups-joiner] Filling out captchas to archive Yahoo Groups. '''IRC Channel {{IRC|yahoosucks}}'''.
* [[Yahoo! Groups]]: Years of internet threads, soon to go private-only. '''IRC Channel {{IRC|yahoosucks}}'''
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.
* [[MP3.com]]: Digging through the WayBack Machine's archives to build a database of all the DAM CDs made available through the site.
* [[MP3.com]]: Digging through the WayBack Machine's archives to build a database of all the DAM CDs made available through the site.


== Active Projects Not Yet Ingesting ==
== Upcoming & proposed projects ==
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). -->
<!-- Top priority: could disappear anytime now -->
<!-- Top priority: could disappear anytime now -->
<!-- Shutting down, definite deadline given -->
<!-- Shutting down, definite deadline given -->
<!-- Other -->
* [[8tracks]]: social network around audio streaming and creating playlists, shutting down 2019-12-31. '''IRC Channel {{IRC|8ball|network=hackint}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit}}'''.
 
== Proposed projects ==
* [[YouTube]]: Making playlists of liked videos private on 2019-12-05. '''IRC Channel {{IRC|down-the-tube|network=hackint}}'''.
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). -->
<!-- Shutting down, vague deadline given -->
<!-- Shutting down, vague deadline given -->
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago}}'''.
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago}}'''.
Line 43: Line 37:
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
Line 52: Line 47:
* [[Gfycat]]: Deleting old anonymous uploads on or after 2019-11-18. '''IRC Channel {{IRC|deadcat|network=hackint}}'''.
* [[Gfycat]]: Deleting old anonymous uploads on or after 2019-11-18. '''IRC Channel {{IRC|deadcat|network=hackint}}'''.
* [[Google Fusion Tables]]: Shutting down on 2019-12-03. '''IRC Channel {{IRC|FuslOnTable|network=hackint}}'''.
* [[Google Fusion Tables]]: Shutting down on 2019-12-03. '''IRC Channel {{IRC|FuslOnTable|network=hackint}}'''.
* [[YouTube]]: Making playlists of liked videos private on 2019-12-05. '''IRC Channel {{IRC|down-the-tube|network=hackint}}'''.


== Hiatus / Missed the Mark ==
== Hiatus / Missed the Mark ==
Line 64: Line 60:
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
* [[NewsGrabber]]: Saving all news articles. <!-- Help with server power or by finding more news sites.-->Currently paused. '''IRC Channel {{IRC|newsgrabber}}'''.
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
* [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''.
* [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''.

Revision as of 02:56, 28 December 2019

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Telegram
  • URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).

Scripts only

Manual projects

  • Yahoo! Groups: Years of internet threads, soon to go private-only. IRC Channel #yahoosucks (on hackint)
  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • WikiTeam: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).
  • MP3.com: Digging through the WayBack Machine's archives to build a database of all the DAM CDs made available through the site.

Upcoming & proposed projects

  • 8tracks: social network around audio streaming and creating playlists, shutting down 2019-12-31. IRC Channel #8ball (on hackint).
  • Kinja: Deleting all user pages, maybe? IRC Channel #gokinjagokinjago (on hackint).
  • Twitter: Deleting inactive accounts 2019-12-11 sometime. IRC Channel #twitterdead (on hackint).
  • GitHub: Embraced-uh, I mean, bought by Microsoft. IRC Channel #getgit (on hackint).
  • Imgur: Image hoster decided that using it for hosting images is not permitted. IRC Channel #imgone (on hackint).
  • JamiiForums: the Tanzanian government would like this gone. IRC Channel #jammedforums (on hackint).
  • LiveJournal: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. IRC Channel #recordedjournal (on hackint).
  • Ownlog: Ownlog is losing popularity and support from its owners. IRC Channel #pwnlog (on hackint).
  • Reddit: Banning communities that generate bad PR for Reddit Inc. IRC Channel #shreddit (on hackint).
  • The Pirate Bay: Recently came back up, grabbing an archive for sanity's sake. IRC Channel #yarharfiddlededee (on hackint).
  • Valhalla: Where to store what even the Internet Archive doesn't have space for? IRC Channel #huntinggrounds (on hackint).

Recently finished projects

Hiatus / Missed the Mark

ArchiveTeam primarily uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090More info
ArchiveTeam also uses the hackint IRC network – irc://irc.hackint.org (TLS required) – webchat: https://webirc.hackint.org/More info