Difference between revisions of "List of websites excluded from the Wayback Machine"

From Archiveteam
(mydate.de - was a scam website which wanted to hide its traces. gmail-is-too-creepy.com - Daniel Brandt. FilmMusic.io. Adding at the end. The bot will sort it anyway.)
Line 5: Line 5:
Past exclusions that are no longer active are tracked on the [[/Former exclusions]] subpage.
Past exclusions that are no longer active are tracked on the [[/Former exclusions]] subpage.


<!-- atwikibot:urlCount -->This list currently contains 1824 URLs.<!-- /atwikibot:urlCount -->
<!-- atwikibot:urlCount -->This list currently contains 1828 URLs.<!-- /atwikibot:urlCount -->
<!--
<!--
Editing notes:
Editing notes:
Line 106: Line 106:
* http://www.9kw.eu/
* http://www.9kw.eu/
* https://www.9news.com/
* https://www.9news.com/
* https://FilmMusic.io/ ([[FilmMusic.io]])
* http://www.a-c-p-m.org/
* http://www.a-c-p-m.org/
* https://www.a-concept.org/
* https://www.a-concept.org/
Line 779: Line 780:
* https://glomardisclosure.com/ <!--dead (Mar 10 2019)-->
* https://glomardisclosure.com/ <!--dead (Mar 10 2019)-->
* https://glutenfreerecipebox.com/
* https://glutenfreerecipebox.com/
* http://gmail-is-too-creepy.com/ <!-- Daniel Brandt -->
* http://gmx.de/ <!--redirect to https://www.gmx.net/ (Mar 8 2019)-->
* http://gmx.de/ <!--redirect to https://www.gmx.net/ (Mar 8 2019)-->
* http://go.ezboard.com/
* http://go.ezboard.com/
Line 1,162: Line 1,164:
* http://mybrandpartner.biz/
* http://mybrandpartner.biz/
* https://www.mycams.com/
* https://www.mycams.com/
* http://mydate.de/ <!-- scam website which wanted to hide its traces -->
* http://www.myex.com/ <!--dead (Mar 8 2019)-->
* http://www.myex.com/ <!--dead (Mar 8 2019)-->
* http://www.myparentime.com/
* http://www.myparentime.com/
Line 1,629: Line 1,632:
* http://www.the-niceguy.com/
* http://www.the-niceguy.com/
* https://theaverageguy.tv/
* https://theaverageguy.tv/
* http://theberry.com <!-- also belongs to the fake archive theCHIVE.com -->
* https://thecharnelhouse.org/
* https://thecharnelhouse.org/
* https://thechive.com/
* https://thechive.com/
Line 1,836: Line 1,840:
* http://zpetny-odkaz.info/ <!--dead (May 16 2019)-->
* http://zpetny-odkaz.info/ <!--dead (May 16 2019)-->
* http://zpetnyodkaz.info/ <!--dead (May 16 2019)-->
* http://zpetnyodkaz.info/ <!--dead (May 16 2019)-->
* http://mydate.de/ <!-- scam website which wanted to hide its traces -->
* http://gmail-is-too-creepy.com/ <!-- Daniel Brandt -->
* http://theberry.com <!-- also belongs to the fake archive theCHIVE.com -->
* https://FilmMusic.io/ ([[FilmMusic.io]])


[[Category:Lists]]
[[Category:Lists]]
[[Category:Endangered projects]] <!-- Because anything that excludes the Wayback Machine can be seen as endangered. As it can be seen, some of the websites listed above have already perished. -->
[[Category:Endangered projects]] <!-- Because anything that excludes the Wayback Machine can be seen as endangered. As it can be seen, some of the websites listed above have already perished. -->

Revision as of 00:00, 26 November 2023

This page collects sites that are manually excluded from the Wayback Machine. Attempting to access a manually excluded site returns the error "This URL has been excluded from the Wayback Machine". The exclusion applies to all subdomains of the site as well. This page does not track websites that only disallow IA crawlers in their robots.txt file or otherwise block them. This list is not provided by the Internet Archive.
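
Because an exclusion covers all subdomains, checking a URL against this list means comparing its hostname and every parent domain with the listed hosts. The following is a minimal sketch of that check, not the Internet Archive's implementation; the function names `host_of` and `is_excluded` and the idea of holding the list as a set of lowercase hostnames are assumptions made for illustration.

```python
from urllib.parse import urlsplit


def host_of(url):
    """Return the lowercase hostname of a URL, or '' if it has none."""
    return (urlsplit(url).hostname or "").lower()


def is_excluded(url, excluded_hosts):
    """Return True if the URL's host, or any parent domain of it,
    appears in excluded_hosts (a set of lowercase hostnames).

    Hypothetical helper: it only models the "applies to all subdomains"
    rule described above and does not query the Wayback Machine.
    """
    host = host_of(url)
    if not host:
        return False
    labels = host.split(".")
    # Test "a.b.example.com", then "b.example.com", "example.com", "com".
    return any(".".join(labels[i:]) in excluded_hosts
               for i in range(len(labels)))
```

With `excluded_hosts = {"mydate.de"}`, a URL on `www.mydate.de` matches, while one on `notmydate.de` does not, because the comparison is made on whole domain labels rather than on string suffixes.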

This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the /Partial exclusions subpage.

Past exclusions that are no longer active are tracked on the /Former exclusions subpage.

This list currently contains 1828 URLs.
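
The URL count above is maintained by a bot between the `atwikibot:urlCount` markers. How that bot works is not shown on this page; the sketch below only illustrates how entries in the list's format (`* URL`, optionally followed by an HTML comment or a wiki link) could be parsed and counted. The names `ENTRY_RE` and `parse_entries` are hypothetical.

```python
import re

# One list entry: "* <url>", optionally followed by "<!-- note -->".
# Assumes the URL is separated from any trailing note by whitespace.
ENTRY_RE = re.compile(r"^\*\s+(https?://\S+)\s*(?:<!--\s*(.*?)\s*-->)?")


def parse_entries(wikitext):
    """Return a list of (url, note) tuples for each list entry.

    note is the text of a trailing HTML comment, or None if there is none.
    Lines that are not list entries are ignored.
    """
    entries = []
    for line in wikitext.splitlines():
        match = ENTRY_RE.match(line)
        if match:
            entries.append((match.group(1), match.group(2)))
    return entries


sample = """\
* http://mydate.de/ <!-- scam website which wanted to hide its traces -->
* http://gmx.de/ <!--redirect to https://www.gmx.net/ (Mar 8 2019)-->
* https://FilmMusic.io/ ([[FilmMusic.io]])
[[Category:Lists]]
"""

entries = parse_entries(sample)
print(f"This list currently contains {len(entries)} URLs.")
```

Keeping the note alongside each URL preserves annotations such as "dead" or "redirect to", which the list uses to record the status of individual sites.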