Difference between revisions of "User:Start"

From Archiveteam
Jump to navigation Jump to search
m
Line 92: Line 92:


=== Items ===
=== Items ===
* TODO: Scrape Google Search
* TODO: Scrape Google
* TODO: Scrape Bing
* TODO: Scrape Bing
* TODO: Scrape Twitter
* TODO: Scrape Twitter
* TODO: Scrape Reddit
* TODO: Scrape links from MediaWiki wikis
* TODO: Scrape links from MediaWiki wikis
* TODO: Scrape the Open Directory Project
* TODO: Scrape the Open Directory Project
* TODO: Scrape the Common Crawl Index
* TODO: Scrape the Common Crawl Index
* TODO: Scrape the Wayback Machine
* TODO: Scrape URLTeam dumps
* TODO: Scrape URLTeam dumps

Revision as of 18:33, 15 March 2015

I like preserving the web.

I also go by Start+Select and Pressstart.

Archives

Website Crawls

Public HTTP/FTP Server List

Searching intitle:"index of /" inurl:"ftp" on Google gives millions of results.

blah blah blah ignore

Items

  • TODO: Scrape Google
  • TODO: Scrape Bing
  • TODO: Scrape Twitter
  • TODO: Scrape Reddit
  • TODO: Scrape links from MediaWiki wikis
  • TODO: Scrape the Open Directory Project
  • TODO: Scrape the Common Crawl Index
  • TODO: Scrape the Wayback Machine
  • TODO: Scrape URLTeam dumps