Difference between revisions of "List of websites excluded from the Wayback Machine"

From Archiveteam
Jump to navigation Jump to search
(10 intermediate revisions by 3 users not shown)
Line 1: Line 1:
This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.
This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.


<!-- atwikibot:urlCount -->This list currently contains 1591 URLs.<!-- /atwikibot:urlCount -->
<!-- atwikibot:urlCount -->This list currently contains 1597 URLs.<!-- /atwikibot:urlCount -->


* http://0x000000.com/
* http://0x000000.com/
Line 536: Line 536:
* https://dreammarketdrugs.com/ <!--dead (Jun 13 2019)-->
* https://dreammarketdrugs.com/ <!--dead (Jun 13 2019)-->
* http://dreams.grimuar.info/ <!--dead (Sep 21 2019)-->
* http://dreams.grimuar.info/ <!--dead (Sep 21 2019)-->
* http://drivehq.com/
* http://drunkenpeasants.wiki/ <!--redirect to https://discordapp.com/ (Mar 10 2019)-->
* http://drunkenpeasants.wiki/ <!--redirect to https://discordapp.com/ (Mar 10 2019)-->
* http://dslabo.info/
* http://dslabo.info/
Line 567: Line 568:
* https://www.emmerdale.me.uk/
* https://www.emmerdale.me.uk/
* http://empowernation.net/ <!--dead (Jul 27 2019)-->
* http://empowernation.net/ <!--dead (Jul 27 2019)-->
* https://en.luxuretv.com/
* http://www.enterprise-logic.com/
* http://www.enterprise-logic.com/
* https://www.eobot.com/
* https://www.eobot.com/
Line 685: Line 687:
* http://ggt.com/
* http://ggt.com/
* https://gifer.com/
* https://gifer.com/
* https://girlfuckshorse.net/
* https://www.gite-la-palme.fr/
* https://www.gite-la-palme.fr/
* http://giuseppemacario.info/
* http://giuseppemacario.info/
Line 742: Line 745:
* https://healthengine.com.au/
* https://healthengine.com.au/
* http://heavenfire.top/ <!--dead (Jul 31 2019)-->
* http://heavenfire.top/ <!--dead (Jul 31 2019)-->
* http://heavy-r.com/
* https://www.heavy-r.com/
* https://www.heavy-r.com/
* http://hee.com/
* http://hee.com/
Line 921: Line 925:
* http://lukebozier.co.uk/ <!--dead (Apr 7 2019)-->
* http://lukebozier.co.uk/ <!--dead (Apr 7 2019)-->
* http://lurkmore.to/
* http://lurkmore.to/
* https://luxuretv.com/
* https://www.lyonlabs.org/
* https://www.lyonlabs.org/
* http://m-h.tel/ <!--dead (May 26 2019)-->
* http://m-h.tel/ <!--dead (May 26 2019)-->
Line 1,151: Line 1,156:
* http://www.populartechnology.net/
* http://www.populartechnology.net/
* http://porco.ru/
* http://porco.ru/
* https://www.pornsocket.com/
* https://www.pornwikileaks.com/
* https://www.pornwikileaks.com/
* https://www.portclintonnewsherald.com/
* https://www.portclintonnewsherald.com/

Revision as of 00:00, 26 June 2020

This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.

This list currently contains 1597 URLs.