Difference between revisions of "List of websites excluded from the Wayback Machine"

From Archiveteam
Jump to navigation Jump to search
(Add several more, remove some duplicates, clarify situation with subdomains)
Line 1: Line 1:
This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.
This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This applies to all subdomains as well. This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.


This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the [[/Partial exclusions]] subpage.
This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the [[/Partial exclusions]] subpage.
Line 92: Line 92:
* http://4608.info/ <!--dead (Jul 22 2019)-->
* http://4608.info/ <!--dead (Jul 22 2019)-->
* http://4chan.org/
* http://4chan.org/
* http://www.4chan.org/
* http://www.4channel.org/
* http://www.4channel.org/
* http://4digits.net/ <!--dead (Sep 22 2019)-->
* http://4digits.net/ <!--dead (Sep 22 2019)-->
Line 884: Line 883:
* https://www.iguanadons.net/
* https://www.iguanadons.net/
* http://illpost.top/
* http://illpost.top/
* http://images.neopets.com/
* https://neopets.com/
* http://imdi.com/ <!--dead (Mar 19 2019)-->
* http://imdi.com/ <!--dead (Mar 19 2019)-->
* https://imgbin.com/
* https://imgbin.com/
Line 1,093: Line 1,092:
* https://medapplications.com/
* https://medapplications.com/
* https://medcitynews.com/
* https://medcitynews.com/
* http://media.8ch.net/
* https://media.8ch.net/
* https://media.8ch.net/
* http://mediatakeout.com/ <!--redirect to https://mtonews.com/ (Jun 13 2019)-->
* http://mediatakeout.com/ <!--redirect to https://mtonews.com/ (Jun 13 2019)-->
Line 1,187: Line 1,185:
* https://neonnettle.com/
* https://neonnettle.com/
* http://www.neontrend.de/ <!--dead (Apr 8 2019)-->
* http://www.neontrend.de/ <!--dead (Apr 8 2019)-->
* http://www.neopets.com/
* https://network-tools.com/
* https://network-tools.com/
* http://netzwissenschaft.de/
* http://netzwissenschaft.de/
Line 1,218: Line 1,215:
* https://www.nobonuscasino.com/
* https://www.nobonuscasino.com/
* http://nogst.club/ <!--redirect to https://www.buybitcoinworldwide.com/ (Jun 6 2019)-->
* http://nogst.club/ <!--redirect to https://www.buybitcoinworldwide.com/ (Jun 6 2019)-->
* https://www.nohomers.net/
* https://nohomers.net/
* http://nordish.net/ <!--dead (Mar 7 2019)-->
* http://nordish.net/ <!--dead (Mar 7 2019)-->
* https://nordvpn.com/
* https://nordvpn.com/
Line 1,291: Line 1,288:
* http://peos.crane.navy.mil/  <!--dead (Aug 14 2019)-->
* http://peos.crane.navy.mil/  <!--dead (Aug 14 2019)-->
* http://personal.ecu.edu/
* http://personal.ecu.edu/
* http://www.perunamaa.net/
* http://perunamaa.net/
* http://petardas.com/
* http://petardas.com/
* http://petitcolas.net/
* http://petitcolas.net/
Line 1,557: Line 1,554:
* https://stevespages.com/
* https://stevespages.com/
* http://stickdeath.com/ <!--seemingly empty (Mar 6 2019)-->
* http://stickdeath.com/ <!--seemingly empty (Mar 6 2019)-->
* https://www.stileproject.com/
* https://stileproject.com/
* http://stopcirc.com/ <!--redirect to https://copyright.gov/ (Mar 10 2019)-->
* http://stopcirc.com/ <!--redirect to https://copyright.gov/ (Mar 10 2019)-->
* https://www.strabismus.org/
* https://www.strabismus.org/
Line 1,589: Line 1,586:
* http://www.survey-calls.com/
* http://www.survey-calls.com/
* https://swedenborgdigitallibrary.org/
* https://swedenborgdigitallibrary.org/
* http://swf.neopets.com/
* http://swharden.com/
* http://swharden.com/
* https://www.swl.usace.army.mil/
* https://www.swl.usace.army.mil/
Line 1,744: Line 1,740:
* https://warondrugsmedal.org/
* https://warondrugsmedal.org/
* http://watchout.com/
* http://watchout.com/
* https://www.wattpad.com/
* https://wattpad.com/
* https://www.wausaudailyherald.com/
* https://www.wausaudailyherald.com/
* https://www.wbir.com/
* https://www.wbir.com/
Line 1,822: Line 1,818:
* http://zpetny-odkaz.info/ <!--dead (May 16 2019)-->
* http://zpetny-odkaz.info/ <!--dead (May 16 2019)-->
* http://zpetnyodkaz.info/ <!--dead (May 16 2019)-->
* http://zpetnyodkaz.info/ <!--dead (May 16 2019)-->
* https://topspeed.com/
* https://eda-land.ru/
* https://mercola.com/
* http://stormbot.org/
* http://stormbot.net/
* https://dave.dk/
* https://egoiste1.net/
* https://biancawalraven.nl/
* http://middlecenter.com/
* http://houbysoft.com/
* https://zhina.wiki/
* http://zhina.red/
* http://reclusiveandunzipped.com/
* http://crypto-games.net/
* https://booklog.jp/
* https://bestgore.com/
* http://geocities.com/


[[Category:Lists]]
[[Category:Lists]]
[[Category:Endangered projects]] <!-- Because anything that excludes the Wayback Machine can be seen as endangered. As it can be seen, some of the websites listed above have already perished. -->
[[Category:Endangered projects]] <!-- Because anything that excludes the Wayback Machine can be seen as endangered. As it can be seen, some of the websites listed above have already perished. -->

Revision as of 09:55, 22 November 2023

This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This applies to all subdomains as well. This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.

This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the /Partial exclusions subpage.

This list currently contains 1812 URLs.