Difference between revisions of "User:Start"

From Archiveteam
Jump to navigation Jump to search
m
Line 102: Line 102:
* TODO: Scrape the Wayback Machine
* TODO: Scrape the Wayback Machine
* TODO: Scrape URLTeam dumps
* TODO: Scrape URLTeam dumps
* TODO: Scrape DNSdumpster.com
* TODO: Scrape a list of subdomains from DNSdumpster.com (if applicable)

Revision as of 22:33, 6 May 2015

I like preserving the web.

I also go by Start+Select and Pressstart.

Archives

Website Crawls

Public HTTP/FTP Server List

Searching intitle:"index of /" inurl:"ftp" on Google gives millions of results.

blah blah blah ignore

Items

  • TODO: Scrape Google
  • TODO: Scrape Bing
  • TODO: Scrape DuckDuckGo
  • TODO: Scrape Twitter
  • TODO: Scrape Reddit
  • TODO: Scrape links from MediaWiki wikis
  • TODO: Scrape the Open Directory Project
  • TODO: Scrape the Common Crawl Index
  • TODO: Scrape the Wayback Machine
  • TODO: Scrape URLTeam dumps
  • TODO: Scrape a list of subdomains from DNSdumpster.com (if applicable)