Difference between revisions of "Wikipedia"

From Archiveteam
(→‎Criteria for speedy deletions: Added link to the G13 deletion backlog and Eventualism article on WikiMedia Meta-Wiki)
(40 intermediate revisions by 13 users not shown)
{{Infobox project
| title = Wikipedia
| logo = Wikipedia.png
| url = http://www.wikipedia.org/
| project_status = {{online}}
| archiving_status = {{saved}}
| irc = wikiteam
}}


'''Wikipedia''' is the largest [[wiki]] on the planet, with several million articles available in English and several million more in dozens of available languages.


[[File:Wikipedia nostalgia.png|thumb|right|[http://nostalgia.wikipedia.org Wikipedia nostalgia], a frozen version of Wikipedia from 2001]]
[[File:Wikipedia, the free encyclopedia april fools day 2010.png|thumb|right|April Fools Day 2010]]
<center>'''No more [[Library of Alexandria|Libraries of Alexandria]] destroyed.'''</center>


[[File:Size of English Wikipedia in August 2010 (L).png|thumb|right|700px|English Wikipedia in August 2010, if printed.]]
 
For once, a site that recognizes the importance of third-party backups! They have a [http://dumps.wikimedia.org/ main downloads page] from which you can get XML dumps from individual wikis (Wikimedia Foundation hosts more than 800 wikis: Wikipedias, Wiktionaries, Wikinews, Wikisources, Wikibooks, Wikiquotes, Wikiversities, Wikispecies, Wikimedia Commons, Wikivoyage, Wikidata).
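
As a rough illustration of working with such an XML dump, the sketch below streams page titles without loading the whole file into memory. It assumes the dump follows the usual MediaWiki export layout (<code>page</code> elements containing a <code>title</code>); the sample document, the function names and the file path argument are made up for the example and are not part of any official tool.

```python
import bz2
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_page_titles(stream):
    """Yield the <title> of every <page> element in an export-style XML stream."""
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local_name(elem.tag) == "page":
            for child in elem:
                if local_name(child.tag) == "title":
                    yield child.text
                    break
            elem.clear()  # drop the page's children so memory use stays bounded


def iter_titles_from_bz2(path):
    """Stream titles from a .xml.bz2 dump; 'path' is a hypothetical local file."""
    with bz2.open(path, "rb") as fh:
        yield from iter_page_titles(fh)


# Tiny hand-written sample standing in for a real dump (assumed schema).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page><title>Library of Alexandria</title><ns>0</ns><id>1</id></page>
  <page><title>Nupedia</title><ns>0</ns><id>2</id></page>
</mediawiki>"""

titles = list(iter_page_titles(io.BytesIO(SAMPLE)))
```

Matching on the local tag name rather than a fixed namespace string is a deliberate choice here, since the namespace version may differ between dumps.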
 
== Criteria for speedy deletions ==
While articles nominated for deletion are usually given a one-week discussion period before any deletion, under certain conditions articles may be deleted at any time, regardless of the quality of their content.<ref name=CSD>[https://en.wikipedia.org/wiki/Wikipedia:Criteria_for_speedy_deletion ''Criteria for speedy deletion'' on Wikipedia]</ref>
 
These conditions include:
 
* Draft articles unedited for at least half a year (criterion ''G13'')
* Request by original author, if not substantially edited by other users (criterion ''G7'')
* Discussion pages of deleted articles (criterion ''G8'')
* Recreations of previously deleted pages (criterion ''G4'')
* Request by original author in user space (criterion ''U1'')
* Underpopulated Wikipedia ''portal'' (criterion ''P2'')
 
The backlog of draft articles facing imminent deletion under criterion ''G13'' can be found at [[:Wikipedia: Category:AfC G13 eligible soon submissions]].
 
== Tools ==
* [https://github.com/WikiTeam/wikiteam/blob/master/wikipediadownloader.py WikiTeam script] to download Wikipedia dumps from download.wikimedia.org
 
== Backups ==
As of 19:07, 10 July 2016 (EDT), dumps.wikimedia.org only has about 10 earlier versions of dumps for each wiki, generally going back to around October 2015. They don't seem to be linked, but they are accessible via http://dumps.wikimedia.org/''wikiname''/ (where ''wikiname'' is listed on the index page).
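
The URL pattern above can be sketched as a pair of small helpers. This is only an illustration: the function names are invented, and the assumption that the per-wiki page links each dump as a <code>href="YYYYMMDD/"</code> directory is a guess about the listing format, not something confirmed here.

```python
import re

DUMPS_BASE = "http://dumps.wikimedia.org"  # host named in the text above


def dump_dir_url(wikiname):
    """Build the per-wiki directory URL, e.g. for 'enwiki'."""
    return f"{DUMPS_BASE}/{wikiname}/"


def extract_dump_dates(listing_html):
    """Return sorted YYYYMMDD directory names found in a listing page.

    Assumption: each dump is linked as href="YYYYMMDD/".
    """
    return sorted(set(re.findall(r'href="(\d{8})/"', listing_html)))


# Hand-written stand-in for a directory listing (not real server output).
SAMPLE_LISTING = (
    '<a href="20151020/">20151020/</a>\n'
    '<a href="20160701/">20160701/</a>\n'
    '<a href="latest/">latest/</a>'
)

url = dump_dir_url("enwiki")
dates = extract_dump_dates(SAMPLE_LISTING)
```

Fetching the listing itself is left out on purpose, so the helpers can be checked offline against saved copies of the page.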
 
There's an old article dump (2008/03/12) [http://thepiratebay.org/torrent/4794236/enwiki-20080312-pages-articles.xml.bz2 up on The Pirate Bay] [magnet:?xt=urn:btih:5dc4df42109c8d1dbc759276d62225223ca69c53&dn=enwiki-20080312-pages-articles.xml.bz2&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Fopen.demonii.com%3A1337&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Fexodus.desync.com%3A6969 magnet], from the [http://thepiratebay.org/user/archiveteam/ ArchiveTeam TPB account], although it has no seeders as of 19:07, 10 July 2016 (EDT).
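
For anyone wanting to check that magnet link against another source, its parts can be pulled out with the Python standard library. The sketch below is a minimal example; <code>parse_magnet</code> is a made-up helper name, and the URI is the one given above.

```python
from urllib.parse import parse_qs

MAGNET = (
    "magnet:?xt=urn:btih:5dc4df42109c8d1dbc759276d62225223ca69c53"
    "&dn=enwiki-20080312-pages-articles.xml.bz2"
    "&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80"
    "&tr=udp%3A%2F%2Fopen.demonii.com%3A1337"
    "&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969"
    "&tr=udp%3A%2F%2Fexodus.desync.com%3A6969"
)


def parse_magnet(uri):
    """Split a BitTorrent magnet URI into info hash, display name and trackers."""
    scheme, _, query = uri.partition("?")
    if scheme != "magnet:":
        raise ValueError("not a magnet URI")
    params = parse_qs(query)  # also percent-decodes the tracker URLs
    xt = params["xt"][0]
    prefix = "urn:btih:"
    if not xt.startswith(prefix):
        raise ValueError("unsupported xt value")
    return {
        "info_hash": xt[len(prefix):],
        "name": params.get("dn", [None])[0],
        "trackers": params.get("tr", []),
    }


info = parse_magnet(MAGNET)
```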
 
There is no current public backup of the images uploaded to [[Wikimedia Commons]], which hosts about 32 million images and other media files as of 19:07, 10 July 2016 (EDT).
 
Links:
* [http://download.wikipedia.org/ official backups site]
* http://download.wikimedia.org/archive/ - about a dozen older dumps, including [http://dumps.wikimedia.org/archive/enwiki/20060816/ one from 2006], as well as 2 from [https://dumps.wikimedia.org/archive/2001/ 2001].
* {{url|http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z|old wikipedia backups discovered}}
** [https://web.archive.org/web/20130522000621/http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z Direct Wayback link]
** {{url|http://lists.wikimedia.org/pipermail/foundation-l/2010-December/063088.html|announcement on foundation-l}}
** {{url|https://web.archive.org/web/20120306052415/http://grey.colorado.edu/wikipedia_2001/|script for parsing them}}
 
* Internet Archive results: http://www.archive.org/search.php?query=wikipedia%20dumps (223,142 results as of 20:25, 10 July 2016 (EDT))
** {{IA id|wikimediadownloads}} - Primary collection, managed by Hydriz
*** 915,108 items, with archivedates from Nov 10, 2005 through Jul 10, 2016 as of 20:34, 10 July 2016 (EDT)
** {{IA id|wikipediadumps}} - Older, somewhat forgotten collection
*** 810 items, with archivedates from April 9, 2010 through Aug 13, 2014 as of 20:25, 10 July 2016 (EDT)
*** Three sets of all or most of the different language editions of Wikipedia, from 2010-04-08, 2010-06-10 and 2011-08-08.
**** 2010-04 has an underscore between the wiki name and the date, and is missing ltwiki (Lithuanian) presumably because it was created between then and June 2010.
**** 2010-06 has the same identifier format, and contains one edition that is missing from the other two: emwiki (which appears to be the [[wikipedia:Emilian-Romagnol]] edition).
**** 2011-08 has a dash (rather than an underscore) both before and after "wiki", and is missing 7 editions that are present in the other two (ace, ckb, hu, krc, mwl, pcd, pnb) and contains 7 missing from them (ak, be_x_old, eml, fj, hz, ng, tokipona).
*** There are also 12 other misc dumps:
**** {{IA id|arwiki20110112}}
**** {{IA id|de_labswikimedia-20110904}}
**** {{IA id|de_labswikimedia-20111013}}
**** {{IA id|en_labswikimedia-20110906}}
**** {{IA id|en_labswikimedia-20111015}}
**** {{IA id|enwiki-20110620-item-1-of-2}}
**** {{IA id|enwiki-20110620-item-2-of-2}}
**** {{IA id|flaggedrevs_labswikimedia-20110907}}
**** {{IA id|flaggedrevs_labswikimedia-20111016}}
**** {{IA id|idwiki20101106}}
**** {{IA id|readerfeedback_labswikimedia-20110907}}
**** {{IA id|readerfeedback_labswikimedia-20111016}}
 
* [http://en.wikipedia.org/wiki/User:Emijrp/Wikipedia_Archive Compilation of links to Wikipedia archives]
* [http://nostalgia.wikipedia.org/wiki/HomePage A backup of Wikipedia as of Thursday, December 20, 2001]
 
=== Transferring to IA ===
[[User:Hydriz|Hydriz]] is currently transferring the dumps of all Wikimedia projects into the Internet Archive, using resources that Wikimedia itself has provided for the purpose. The results are in the {{IA id|wikimediadownloads}} collection, which is still being kept up to date as of 20:38, 10 July 2016 (EDT).
 
== Vital signs ==
 
Stable, but they lean heavily on a wide range of tactics to solicit donations.
 
== Offline readers ==
* [http://www.okawix.com/ Okawix] ([http://www.okawix.com/zenos/ files])
* [http://www.kiwix.org Kiwix] ([http://download.kiwix.org/zim/ files])


== See also ==
* [[Wikimedia Commons]]
* [[Wikia]]
* [[Wikis]]
* [[Nupedia]]
* [[GNUPedia]]
* [[Citizendium]]
* [[WikiTravel]] - Not a Wikimedia project, but its content was forked to create WMF-hosted rival Wikivoyage.
* [[WikiTeam]]
== External links ==
* http://www.wikipedia.org
* http://www.wikimedia.org
* https://en.wikipedia.org/wiki/User:Emijrp/Wikipedia_Archive
* https://en.wikipedia.org/wiki/User:Emijrp/All_Human_Knowledge
* [https://meta.wikimedia.org/wiki/Deletionism ''Deletionism'' article on WikiMedia Meta-Wiki]
* [https://meta.wikimedia.org/wiki/Eventualism ''Eventualism'' article on WikiMedia Meta-Wiki]
{{Navigation box}}


[[Category:Wikis]]

Revision as of 10:37, 3 October 2020

Wikipedia
Wikipedia logo
URL http://www.wikipedia.org/
Status Online!
Archiving status Saved!
Archiving type Unknown
IRC channel #wikiteam (on hackint)
