Difference between revisions of "Yuku.com"

From Archiveteam
(update status)
m
 
(15 intermediate revisions by 12 users not shown)
Line 1: Line 1:
{{Infobox project
{{Infobox project
| title = Yuku.com
| logo = Yuku-logo.png
| logo =
| image = Www.yuku.com_screencapture.png
| image = Www.yuku.com_screencapture.png
| description =
| URL = http://yuku.com
| URL = http://yuku.com
| project_status = {{online}}
| project_status = {{offline}}
| archiving_status = {{inprogress}}
| archiving_status = {{partiallysaved}}
| source = [https://github.com/ArchiveTeam/yuku-grab yuku-grab]
| source = [https://github.com/ArchiveTeam/yuku-grab yuku-grab]
| tracker = [http://tracker.archiveteam.org/yuku yuku]
| tracker = [http://tracker.archiveteam.org/yuku yuku]
| irc = archiveteam
| data = {{IA id|archiveteam_yuku}}
}}
}}


Yuku is an Internet forum site that allows users to generate forums that are a subdomain of yuku.com. Originally brought to ArchiveTeam's attention with [[The Classic Horror Film Board]].  
Yuku was an Internet forum site that allowed users to create forums, each hosted on a subdomain of yuku.com. It was originally brought to ArchiveTeam's attention through [[The Classic Horror Film Board]]. Like [[FreeForums.org]], Yuku was owned by the company [[CrowdGather]]. In 2016, it was acquired by [[Tapatalk]], and in July 2017, its forums were forcibly migrated to Tapatalk's mobile-centric forum platform, which is known not only to severely cripple the boards it hosts, but also to kill them off after three months of inactivity ([https://www.tapatalk.com/groups/tapatalksupport/terms-of-use-website-t37212.html the staff] makes it sound ambiguous as hell, saying the forums will be locked, not removed, but [https://docs.google.com/document/d/1Bx1Bi55vuB_K4cu6EAU4bz8xZfdijwx57vvKhlIpdRI/edit TT's Terms of Service] say otherwise). Apparently, the Tapatalk staff are also known to monitor every single forum they can and delete posts on sight if they violate the ToS, even metaphorically. As of {{#formatdate:2023-08-03}}, all spot-checked forums on subdomains of yuku.com simply show [[Tapatalk]]'s homepage.


==Structure==
==Structure==
Line 26: Line 24:
=== Items ===
=== Items ===
* Scrape Google [https://raw.githubusercontent.com/chpwssn/yuku-discovery/master/googlescrape-parsed.txt Parsed] [https://raw.githubusercontent.com/chpwssn/yuku-discovery/master/googlescrape-raw.txt Raw]
* Scrape Google [https://raw.githubusercontent.com/chpwssn/yuku-discovery/master/googlescrape-parsed.txt Parsed] [https://raw.githubusercontent.com/chpwssn/yuku-discovery/master/googlescrape-raw.txt Raw]
* TODO: Scrape Bing
* Scrape Bing [https://raw.githubusercontent.com/mutoso/yuku-scrape/master/bing/bingscrape-parsed.txt Parsed] [https://raw.githubusercontent.com/mutoso/yuku-scrape/master/bing/bingscrape-raw.txt Raw] (Extracted from the first 20000 pages)
* TODO: Scrape DuckDuckGo
* Scrape Rapid7's Sonar data for all possible subdomains related to Yuku. [https://archive.org/details/yuku_com_subdomains Parsed] & [https://www.archiveteam.org/index.php?title=User:Trumad Instructions for grabbing an updated list].
* TODO: Scrape DuckDuckGo [https://raw.githubusercontent.com/mutoso/yuku-scrape/master/ddg/ddgscrape-parsed.txt Parsed] [https://raw.githubusercontent.com/mutoso/yuku-scrape/master/ddg/ddgscrape-raw.txt Raw] ([https://duckduckgo.com/?q=yuku.com Didn't return many results])
* TODO: Scrape Twitter
* TODO: Scrape Twitter
* TODO: Scrape Reddit
* TODO: Scrape Reddit

Latest revision as of 09:47, 4 August 2023

Yuku.com
Yuku.com logo
Www.yuku.com screencapture.png
URL http://yuku.com
Status Offline
Archiving status Partially saved
Archiving type Unknown
Project source yuku-grab
Project tracker yuku
IRC channel #archiveteam-bs (on hackint)
Data[how to use] archiveteam_yuku

Yuku was an Internet forum site that allowed users to create forums, each hosted on a subdomain of yuku.com. It was originally brought to ArchiveTeam's attention through The Classic Horror Film Board. Like FreeForums.org, Yuku was owned by the company CrowdGather. In 2016, it was acquired by Tapatalk, and in July 2017, its forums were forcibly migrated to Tapatalk's mobile-centric forum platform, which is known not only to severely cripple the boards it hosts, but also to kill them off after three months of inactivity (the staff makes it sound ambiguous as hell, saying the forums will be locked, not removed, but TT's Terms of Service say otherwise). Apparently, the Tapatalk staff are also known to monitor every single forum they can and delete posts on sight if they violate the ToS, even metaphorically. As of 2023-08-03, all spot-checked forums on subdomains of yuku.com simply show Tapatalk's homepage.

Structure

Forums

Forums are separated by subdomain (example.yuku.com). Subforums are numbered sequentially and accessible via example.yuku.com/forums/<forum number>; topics are also sequential and can be accessed via example.yuku.com/topic/<topic number>. Pages within a topic are indicated like example.yuku.com/topic/56108/?page=2. Topics can also be accessed via RSS through example.yuku.com/feed/get/type/rss/source/lead/id/<topic number>.
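
The URL patterns described here can be sketched as a few helper functions. This is only an illustration and not code from yuku-grab: the function names are made up, the http scheme is an assumption taken from the infobox URL, and omitting the page parameter for the first page is a guess.

```python
from urllib.parse import urlencode

BASE_DOMAIN = "yuku.com"


def forum_root(forum: str) -> str:
    """Root URL of a forum, e.g. http://example.yuku.com (http is assumed)."""
    return f"http://{forum}.{BASE_DOMAIN}"


def subforum_url(forum: str, forum_number: int) -> str:
    """Subforums are sequential: example.yuku.com/forums/<forum number>."""
    return f"{forum_root(forum)}/forums/{forum_number}"


def topic_url(forum: str, topic_number: int, page: int = 1) -> str:
    """Topics are sequential; later pages use the ?page=N query string."""
    url = f"{forum_root(forum)}/topic/{topic_number}"
    if page > 1:
        # Assumption: page 1 is the bare topic URL without a page parameter.
        url += "/?" + urlencode({"page": page})
    return url


def topic_rss_url(forum: str, topic_number: int) -> str:
    """RSS feed for a single topic."""
    return f"{forum_root(forum)}/feed/get/type/rss/source/lead/id/{topic_number}"


if __name__ == "__main__":
    print(subforum_url("example", 3))
    print(topic_url("example", 56108, page=2))
    print(topic_rss_url("example", 56108))
```

Because subforum and topic numbers are sequential, a discovery pass could in principle iterate over integer ranges with these helpers, although the upper bounds per forum are not documented here.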

Users

Users can choose their own name. Profiles are viewed via <username>.u.yuku.com or <username>.<forum name>.yuku.com. The two may differ depending on how the user registered or whether the registration system changed.
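
A rough sketch of how a hostname might be sorted into "forum", "global profile" or "per-forum profile" based on the two patterns described here. The function name and return shape are hypothetical, and the sketch assumes that no forum is itself named "u" and that no deeper subdomain nesting exists.

```python
from typing import Optional, Tuple

SUFFIX = ".yuku.com"


def classify_host(host: str) -> Optional[Tuple[str, ...]]:
    """Classify a yuku.com hostname.

    Returns ("forum", forum), ("global_profile", username),
    ("forum_profile", username, forum), or None if the host
    does not fit any of the documented patterns.
    """
    host = host.lower().rstrip(".")
    if not host.endswith(SUFFIX):
        return None
    labels = host[: -len(SUFFIX)].split(".")
    if not all(labels):
        return None  # empty label, e.g. "a..yuku.com"
    if len(labels) == 1:
        return ("forum", labels[0])
    if len(labels) == 2:
        username, second = labels
        if second == "u":
            # <username>.u.yuku.com
            return ("global_profile", username)
        # <username>.<forum name>.yuku.com
        return ("forum_profile", username, second)
    return None


if __name__ == "__main__":
    for h in ("example.yuku.com", "alice.u.yuku.com", "alice.example.yuku.com"):
        print(h, "->", classify_host(h))
```

Reserved names such as www are not filtered out in this sketch, so www.yuku.com would be reported as a forum.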

Images

All images hosted by Yuku appear in its S3 bucket listing, and a complete scrape with object key and size can be found here.
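
As an illustration of how object keys and sizes can be pulled out of an S3 bucket listing, here is a minimal sketch that parses the standard ListBucketResult XML. The sample document, bucket name and keys are invented for the example; the actual Yuku bucket and the format of the existing scrape are not specified here.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

# XML namespace used by S3 bucket listings.
S3_NS = {"s3": "http://s3.amazonaws.com/doc/2006-03-01/"}

# Made-up sample in the shape of an S3 ListBucketResult response.
SAMPLE_LISTING = """<ListBucketResult xmlns="http://s3.amazonaws.com/doc/2006-03-01/">
  <Name>example-bucket</Name>
  <IsTruncated>false</IsTruncated>
  <Contents><Key>images/a.png</Key><Size>1024</Size></Contents>
  <Contents><Key>images/b.jpg</Key><Size>2048</Size></Contents>
</ListBucketResult>"""


def parse_listing(xml_text: str) -> Tuple[List[Tuple[str, int]], bool]:
    """Return ([(key, size_in_bytes), ...], is_truncated) for one listing page."""
    root = ET.fromstring(xml_text)
    objects = []
    for item in root.findall("s3:Contents", S3_NS):
        key = item.findtext("s3:Key", namespaces=S3_NS)
        size = int(item.findtext("s3:Size", default="0", namespaces=S3_NS))
        objects.append((key, size))
    truncated = root.findtext("s3:IsTruncated", default="false", namespaces=S3_NS)
    return objects, truncated.strip().lower() == "true"


if __name__ == "__main__":
    objs, more = parse_listing(SAMPLE_LISTING)
    for key, size in objs:
        print(f"{size:>8}  {key}")
    print("more pages:", more)
```

When a listing is truncated, a full scrape would need to request further pages, continuing from the last key seen.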

Items

  • Scrape Google Parsed Raw
  • Scrape Bing Parsed Raw (Extracted from the first 20000 pages)
  • Scrape Rapid7's Sonar data for all possible subdomains related to Yuku. Parsed & Instructions for grabbing an updated list.
  • TODO: Scrape DuckDuckGo Parsed Raw (Didn't return many results)
  • TODO: Scrape Twitter
  • TODO: Scrape Reddit
  • TODO: Scrape links from MediaWiki wikis
  • TODO: Scrape the Open Directory Project
  • TODO: Scrape the Common Crawl Index
  • TODO: Scrape the Wayback Machine
  • TODO: Scrape URLTeam dumps
  • TODO: Scrape a list of subdomains from DNSdumpster.com (if applicable)
  • pentest-tools.com Subdomain search Parsed
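
The "Raw" to "Parsed" step for the discovery sources above could look roughly like the following. This is a sketch under assumptions, not the script that produced the linked lists: the regular expression, the excluded labels (www and u) and the function name are all illustrative.

```python
import re
from typing import List

# Capture the label directly before ".yuku.com", so that both
# example.yuku.com and alice.example.yuku.com yield "example".
FORUM_RE = re.compile(
    r"(?<![a-z0-9-])([a-z0-9][a-z0-9-]*)\.yuku\.com(?![a-z0-9-])",
    re.IGNORECASE,
)

# Assumption: these labels are not forums ("u" hosts user profiles).
EXCLUDED = {"www", "u"}


def extract_forums(raw_text: str) -> List[str]:
    """Return a sorted, de-duplicated list of forum subdomain names."""
    found = {m.group(1).lower() for m in FORUM_RE.finditer(raw_text)}
    return sorted(found - EXCLUDED)


if __name__ == "__main__":
    sample = (
        "See http://example.yuku.com/topic/1 and https://Alice.u.yuku.com, "
        "also www.yuku.com and bob.another-forum.yuku.com/forums/2"
    )
    print(extract_forums(sample))
```

The same function can be run over search-engine result dumps, DNS datasets or Wayback Machine URL lists, and the outputs merged into one set before queueing items.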