Difference between revisions of "Wikipedia"
Revision as of 12:13, 5 December 2011
Wikipedia is the largest wiki on the planet, with several million articles available in English and several million more in dozens of other languages.
For once, a site that recognizes the importance of third-party backups! They have a main downloads page (http://download.wikipedia.org/) from which you can get XML dumps of individual wikis (http://download.wikipedia.org/backup-index.html). The Wikimedia Foundation hosts more than 700 wikis: Wikipedias, Wiktionaries, Wikinews, Wikisources, Wikibooks, Wikiquotes, Wikiversities, Wikispecies, Wikimedia Commons.
Tools
- WikiTeam script to download Wikipedia dumps from download.wikimedia.org: http://code.google.com/p/wikiteam/source/browse/trunk/wikipediadownloader.py
Backups
You can download all the articles of the English Wikipedia (with complete edit history) as a single file (compressed in 7zip format) from http://download.wikimedia.org/enwiki/20100130/ (ATTENTION: 31 GB! Unpacked, it expands to as much as 5.2 TB. Direct link: http://download.wikimedia.org/enwiki/20100130/enwiki-20100130-pages-meta-history.xml.7z).
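As a rough illustration of fetching such a dump, here is a minimal Python sketch. It assumes that other wikis and dates follow the same URL layout as the single English Wikipedia link above, which this page does not confirm. The names `dump_url` and `download` are hypothetical helpers, not part of the WikiTeam script, and the host may have changed since this revision.

```python
import shutil
import urllib.request

# Host as given in the text above; assumed to still serve dumps.
BASE_URL = "http://download.wikimedia.org"


def dump_url(wiki: str, date: str, kind: str = "pages-meta-history.xml.7z") -> str:
    """Build a dump URL, assuming the layout <host>/<wiki>/<date>/<wiki>-<date>-<kind>."""
    return f"{BASE_URL}/{wiki}/{date}/{wiki}-{date}-{kind}"


def download(url: str, dest: str, chunk_size: int = 1 << 20) -> None:
    """Stream the file to disk in 1 MiB chunks so it is never held in memory."""
    with urllib.request.urlopen(url) as response, open(dest, "wb") as out:
        shutil.copyfileobj(response, out, chunk_size)
```

Calling `dump_url("enwiki", "20100130")` reproduces the direct link given above. Passing that URL to `download` would start the 31 GB transfer, so check free disk space first.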
There's an old article dump (2008/03/12) up on The Pirate Bay, from the ArchiveTeam TPB account. Also, a dump from 2006 is available.
Some Wikipedia dumps are also in the Internet Archive.
There is no current public backup for images uploaded to Wikimedia Commons, which hosts about 10 million images and other media files.
Links:
- official backups site: http://download.wikipedia.org/
- some incomplete old dumps, English Wikipedia mainly: http://download.wikimedia.org/archive/
- old Wikipedia backups discovered (announcement)
- Internet Archive results: http://www.archive.org/search.php?query=wikipedia%20dumps
Vital signs
Stable, but they use a lot of tactics to solicit donations.