Difference between revisions of "Software"

From Archiveteam
Jump to: navigation, search
(Site-Specific)
(General Tools: Added WiLiSe)
Line 12: Line 12:
 
* [http://scrapy.org/ Scrapy] - Fast python library for web scraping
 
* [http://scrapy.org/ Scrapy] - Fast python library for web scraping
 
* [http://splinter.cobrateam.info/ Splinter] - Web app acceptance testing library for Python -- could be used along with a scraping lib to extract data from hard-to-reach places
 
* [http://splinter.cobrateam.info/ Splinter] - Web app acceptance testing library for Python -- could be used along with a scraping lib to extract data from hard-to-reach places
 +
* [http://sourceforge.net/projects/wilise/ WiLiSe] '''Wi'''ki'''Li'''nk '''Se'''arch - Python script to get links to specific pages of a site through the search in a Wiki ([[wikipedia:MediaWiki|MediaWiki]]-type) has the [http://www.mediawiki.org/wiki/Api.php api.php] accessible or [http://www.mediawiki.org/wiki/Extension:LinkSearch extension LinkSearch] enabled (the project is still very immature and at the moment the code is only available in [http://sourceforge.net/p/wilise/code/1/tree/code/trunk/ this SVN repository]).
  
 
== Hosted tools ==
 
== Hosted tools ==

Revision as of 06:35, 27 July 2011

General Tools

Hosted tools

Pinboard is a convenient social bookmarking service that will archive copies of all your bookmarks for online viewing. The catch is that it costs $9.25 just to join, plus $25/year for the archival feature and you can only download archives of your 25 most recent bookmarks in a particular category. This may pose problems if you ever need to get your data out in a hurry.

Site-Specific

Format Specific