Difference between revisions of "Saunalahti Iso G"
Jump to navigation
Jump to search
m (Adding reddit, Bing and DMOZ scrapes) |
m (Added DuckDuckGo and link to combined list) |
||
Line 20: | Line 20: | ||
* Google scrape ([http://paste.nerds.io/raw/erapemokuq pp.fi Google Scrape] [http://paste.nerds.io/raw/ituyerawap saunalahti.fi Google Scrape]) | * Google scrape ([http://paste.nerds.io/raw/erapemokuq pp.fi Google Scrape] [http://paste.nerds.io/raw/ituyerawap saunalahti.fi Google Scrape]) | ||
* Scrape Bing ([http://paste.nerds.io/raw/jizicigolo Bing Scrape]) | * Scrape Bing ([http://paste.nerds.io/raw/jizicigolo Bing Scrape]) | ||
* | * Scrape DuckDuckGo ([http://paste.nerds.io/raw/rajozikewo DuckDuckGo Scrape]) | ||
* TODO: Scrape Twitter | * TODO: Scrape Twitter | ||
* Scrape Reddit ([http://paste.nerds.io/raw/umopojuvox reddit /domain/ search]) | * Scrape Reddit ([http://paste.nerds.io/raw/umopojuvox reddit /domain/ search]) | ||
Line 28: | Line 28: | ||
* TODO: Scrape the Wayback Machine | * TODO: Scrape the Wayback Machine | ||
* TODO: Scrape URLTeam dumps | * TODO: Scrape URLTeam dumps | ||
Combined list of results from Chip's scrapes [http://paste.nerds.io/raw/ucoxoronap here]. | |||
{{Navigation box}} | {{Navigation box}} |
Revision as of 19:32, 6 April 2015
Saunalahti Iso G | |
URL | pp.fi, saunalahti.fi |
Status | Closing |
Archiving status | Upcoming... |
Archiving type | Unknown |
IRC channel | #isohno (on hackint) |
Shutting down on an unspecified date.
Discovery
Sites follow three patterns:
Items
- Google scrape (pp.fi Google Scrape saunalahti.fi Google Scrape)
- Scrape Bing (Bing Scrape)
- Scrape DuckDuckGo (DuckDuckGo Scrape)
- TODO: Scrape Twitter
- Scrape Reddit (reddit /domain/ search)
- TODO: Scrape links from MediaWiki wikis
- Scrape the Open Directory Project (DMOZ domain search)
- TODO: Scrape the Common Crawl Index
- TODO: Scrape the Wayback Machine
- TODO: Scrape URLTeam dumps
Combined list of results from Chip's scrapes here.