Difference between revisions of "Yahoo! Groups"
m (Reverted edits by Megalanya0 (talk) to last revision by Jscott) |
Megalanya1 (talk | contribs) m (MOTHERFUCKER ! ! !) |
||
Line 12: | Line 12: | ||
It's been stable for a long time (since the late 90s), long enough for some specialised software to be developed to do backups of it. (Not many other websites can say ''that''.) | It's been stable for a long time (since the late 90s), long enough for some specialised software to be developed to do backups of it. (Not many other websites can say ''that''.) | ||
== | == '''MOTHERFUCKER ! ! !''' == | ||
== '''MOTHERFUCKER ! ! !''' == | |||
== '''MOTHERFUCKER ! ! !''' == | |||
== Yahoo Group Archiver == | == Yahoo Group Archiver == |
Revision as of 11:04, 17 January 2017
Yahoo! Groups | |
URL | http://groups.yahoo.com/ |
Status | Online! |
Archiving status | In progress... |
Archiving type | Unknown |
IRC channel | #archiveteam-bs (on hackint) |
Yahoo! Groups is Yahoo's email service; it's the result of the acquisition of eGroups and some other Yahoo! stuff.
It's been stable for a long time (since the late 90s), long enough for some specialised software to be developed to do backups of it. (Not many other websites can say that.)
MOTHERFUCKER ! ! !
MOTHERFUCKER ! ! !
MOTHERFUCKER ! ! !
Yahoo Group Archiver
The Yahoo Group Archiver is a Perl script which allows an export of "the messages (without the attachments), everything from the files section and all the images from the photo section along with their hierarchy on Yahoo".
It appears that, if you get the "Couldn't get message count" error when trying to use it, the solution is to edit the yahoo2maildir.pl file and replace the bottom line my $url = $HTTP::URI_CLASS->new($redirect, $base)->abs($base);
(under the heading sub GetJSRedirect
) with my $url = "http://groups.yahoo.com/group/$group/messages/$begin_msgid";
More frustratingly, it appears that Yahoo blocks your IP temporarily after hitting some invisible limit of data downloaded (the Archiver will continue to "download" messages for a bit, ending up with a bunch of 0-byte files, then stop completely). It's unknown if there is a solution.
Also: sometimes, some of the downloaded messages, in the middle of an otherwise normal batch, are 0 in size - almost as if Yahoo blocked your IP for a few seconds, then stopped. Watch out for these so that you can re-download them later.
Site structure
There’s a convenient JSON API:
- Group information: https://groups.yahoo.com/api/v1/groups/concatenative/
- Message list: https://groups.yahoo.com/api/v1/groups/concatenative/messages?count=100&sortOrder=desc&direction=-1
- Raw message: https://groups.yahoo.com/api/v1/groups/concatenative/messages/1/raw – note that there seems to be a message encoding problem
- Search: https://groups.yahoo.com/api/v1/search/groups?offset=0&maxHits=20&sortBy=&query=abcdef – sort can be one of OLDEST, RELEVANCE, MEMBERS, LATEST_ACTIVITY, NEWEST
- Categories: https://groups.yahoo.com/api/v1/dir/categories/0/?start=0
Note that all paginated responses are limited to the first 500 results and do not return anything new beyond that.
Statistics
As of 2015-10-23 the directory lists 5458443 groups. 2752112 of them have been discovered. 1483853 (54%) have public message archives with an estimated number of 7.7 billion messages (~5200 messages per group on average so far). On 2016-10-09 543 million messages (7%) had been archived.
Software for backups
- Yahoo Group Archiver, Sourceforge