Difference between revisions of "Tripod"

From Archiveteam
Jump to navigation Jump to search
m (Reverted edits by Megalanya1 (talk) to last revision by Start)
(Add info on domain discovery and downloading in bulk)
 
Line 4: Line 4:
| description =  
| description =  
| URL = http://www.tripod.lycos.com/
| URL = http://www.tripod.lycos.com/
| project_status = {{online}}
| project_status = {{endangered}}
| archiving_status = {{nosavedyet}}
| archiving_status = {{nosavedyet}}
}}
}}


'''Tripod''' is a web hosting service.
'''Tripod''' is a web hosting service. It allows users to use a subdomain or their own custom domain. It is not clear if Lycos is turning a profit. After the incidents of [[Lycos Europe]] it seems that the website is at risk of being shutdown.
 
== Discovery & Downloading ==
Through basic subdomain scanning using [https://github.com/guelfoweb/knock Knockpy] <code>python knockpy.py tripod.com</code> we can locate approximately 30,000 subdomains with around 17,500 returning code 200.
 
After we single out all active subdomains we can use a command like this to create a WARC of each domain.
 
<code>
parallel -a domains.txt --jobs 10 wget -e robots=off -m --page-requisites --warc-file="{/}" {}
</code>
 
Finally, we'd just have to combine all the WARC files using something like [https://github.com/alard/megawarc MegaWARC] or [https://github.com/ArchiveTeam/archiveteam-megawarc-factory MegaWARC Factory].


== External links ==
== External links ==

Latest revision as of 03:56, 11 February 2023

Tripod
Tripod.png
URL http://www.tripod.lycos.com/
Status Endangered
Archiving status Not saved yet
Archiving type Unknown
IRC channel #archiveteam-bs (on hackint)

Tripod is a web hosting service. It allows users to use a subdomain or their own custom domain. It is not clear if Lycos is turning a profit. After the incidents of Lycos Europe it seems that the website is at risk of being shutdown.

Discovery & Downloading

Through basic subdomain scanning using Knockpy python knockpy.py tripod.com we can locate approximately 30,000 subdomains with around 17,500 returning code 200.

After we single out all active subdomains we can use a command like this to create a WARC of each domain.

parallel -a domains.txt --jobs 10 wget -e robots=off -m --page-requisites --warc-file="{/}" {}

Finally, we'd just have to combine all the WARC files using something like MegaWARC or MegaWARC Factory.

External links