Difference between revisions of "User talk:Bzc6p"

From Archiveteam
Jump to navigation Jump to search
m (→‎FTP Sites: Minor fix. Apologies for all these edits)
(Add IA item with yt videos)
 
(12 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{DISPLAYTITLE:User talk:bzc6p}}
{{DISPLAYTITLE:User talk:bzc6p}}


== Re: Some friendly words ==
<span style="background-color:lightgray; border: 2px gray solid; padding:5px">[[user_talk:bzc6p/Archive1|Archive 1]]</span>


Thanks for appreciating my efforts and explaining the ArchiveTeam to me. I thought "#archiveteam-bs" was for off-topic conversation, though. :/ And of course I didn't give up on archiving. Why would I? I'm getting 24 Blu-ray M-Discs next month, in fact. :) Would you willing to explain to the other users about the situation? I'm willing to forgive them if they accept it & apologize for my trolling. I'm just glad someone, by the very least, understood my situation and took the time to write to me.


And I looked at your userpage. I'll see if I can track down some Hungarian sites. You can always use the Google operator "site:.hu" to filter just Hungarian sites. There is, however, [http://donkeykong.gportal.hu/ this site]. I have a backup of it, but not in .warc.gz format. Even worse, Yahoo is stupid enough to be shutting down their first service: dir.yahoo.com (Yahoo! Dir), on 12/31/2014. Stupid Yahoo...
== MyVIP ==


And by the way, SketchCow disliked the fact that I "asked too many questions". [[User:Archive Maniac|Archive Maniac]] 13:25, 19 October 2014 (EDT)
Hi, can you let me know when you're online in #archiveteam and have some time to talk with me about MyVIP?


:I'm waiting for Wpull to have a Windows release or a Python 2 release. I also stink at Python big time... [[User:Archive Maniac|Archive Maniac]] 17:47, 19 October 2014 (EDT)
----
::Python3 unfortunately gets mixed up with Python 2 in the Command Prompt (e.g. python3 is not recognized as a command). That's why I've stuck to Python 2, because I use the wiki dump tool with that version. Aside from that, I always get errors when attempting installation, like vcvarsbatall.bat or something error, couldn't find seesaw kit, etc. Python is so user-unfriendly... [[User:Archive Maniac|Archive Maniac]] 17:42, 20 October 2014 (EDT)
Hi, can you please give me a list of example links for a myvip profile? --[[User:Arkiver|Arkiver]] 04:31, 28 December 2015 (EST)


== Any Help on Chat? ==
What's your IRC username? I want help coming back on the ArchiveBot & archiveteam-bs channel. And please tell me what discussions are appropriate for the latter; you do have a way with words. :P [[User:Archive Maniac|Archive Maniac]] 20:59, 21 October 2014 (EDT)


== ArchiveBot Requests ==
== Chelyabinsk meteor ==


Hey, Bzc6p. Are you willing to take ArchiveBot requests from me? I also like your Hungarian site archiving. I recently archived smb.gportal.hu on my computer. [[User:Archive Maniac|Archive Maniac]] 18:55, 18 November 2014 (EST)
Hi Bzc6p, could you help create a project to save videos and pictures of Chelyabinsk meteor 2013 event to archive.org?  
:I have two more questions (the thing that made users upset at me):


#I like archiving stuff. What archiving tools do you know of and recommend?
It is a unique event that has occurred only once every several hundred years, and the original pictures and videos of the event are slowly disappearing from the YouTube and other video sites.
#Is there a way that I can save whole sites to the Wayback Machine without using the ArchiveBot channel? I probably don't think so, but there still might be a chance.
#Why doesn't the ArchiveTeam make C++ ports of their Python tools?
#When I try to use Wget, I get this error in the command prompt: ''Connecting to SITENAME (SITENAME)|IP|:PORT... failed: Bad file descriptor.'' Do you know how to fix this problem?


I hope you're not too annoyed by these questions, like the others would probably be. [[User:Archive Maniac|Archive Maniac]] 12:01, 20 November 2014 (EST)
[https://en.wikipedia.org/wiki/Chelyabinsk_meteor Chelyabinsk meteor on Wikipedia]
:Thanks for the info. And what's been a problem is that I've tried to set ArchiveBot or wpull up a few times, but never had proper 100% cannot fail step-by-step instructions on how to set both up. If you have the time, could you please write a more specific tutorial than the existing one? I preferably want a tutorial on the former [wpull]. [[User:Archive Maniac|Archive Maniac]] 11:45, 22 November 2014 (EST)


== Blank CD Question ==
I'm no developer but I've made efforts to locate the best available references:


Hi Bzc6p, I am wondering how long CD-R's and DVD-R's last with a .iso image burned on to it. Is it just as long as the estimated shelf life? More importantly: what do you recommend for long-term backup solutions? [[User:Archive Maniac|Archive Maniac]] 14:51, 29 November 2014 (EST)
So far I've found 3 good lists of video URLs. Unfortunately some of the videos are already inaccessible and lost for eternity :(


== Blogter.hu's Unexpected Downfall ==
~1000 videos URLs here:
* http://meteor.asu.cas.cz/~lenka/videa/list.phtml?reset=1


Hi Bzc6p. You know how Blogter unexpectedly shut down in December in spite of its popularity? That goes to show that anything, and I mean anything, can happen to web sites that seem okay but actually are in limbo (i.e. extinction). That's why I suggested you archive gportal.hu. I already archived the Mario and DK sites. [[User:Archive Maniac|Archive Maniac]] 19:46, 7 December 2014 (EST)
~728 video URLs here:
* http://vk.com/chelyabinskfall


== What I'm Currently Doing ==
~1000 photos here:
* https://fotki.yandex dot ru/users/chelyabinskfall


Hi Bzc6p, it's been a little bit since I last talked to you. If you want to know what I'm currently doing, it's that I'm searching the depths of the Internet for links and saving them on to the Wayback Machine. I'm also uploading [https://archive.org/search.php?query=subject%3A%22dec3199%22 my own collections to the Internet Archive]. There's some stuff in there which you'll probably enjoy. :)
I hope this helps, tell me if there is anything else I can help with.


And the icing on the cake is that I'm editing a few wikis, cleaning them up and trying to make them more informative.
:Thanks for your help. I noticed that the xls/doc file from the second link (vk.com) contains not just URLs to videos, but also URLs to user galleries containing high resolution pictures. These pictures may need to be downloaded manually (there are not too many)


P.S. Do you forgive me and understand why I went into a very mad rage here those few times (which I shouldn't have)? I know the experience is over, but I feel embarrassed around you, given my extremely vulgar actions and how you're aware of it.
=== archive.org item ===
 
I've created an item with all youtube videos I could download, it's in a tarball format, about 600 videos inside, ~10GiB. [https://archive.org/details/ChME-2013-02-15 ChME-2013-02-15]  --[[User:Vitzli|Vitzli]] ([[User talk:Vitzli|talk]]) 08:58, 9 April 2016 (EDT)
Anyway, nice to message you again. Good luck saving Hungarian sites! :) [[User:Archive Maniac|Archive Maniac]] 21:15, 5 January 2015 (EST)
 
:Thanks for replying. :) Shortly after I messaged you, somebody on a forum site taught me how to properly burn files to an M-Disc. And it was a success! A good, long offline backup for me! :D
 
And it's a shame [[extra.hu]] is gone... It looked like an excellent web host...
 
 
I also have issues with using wikiadownloader.py. It gives me this error:
 
<pre>
Traceback (most recent call last):
  File "wikiadownloader.py", line 41, in <module>
    f = open('wikia.com', 'r')
IOError: [Errno 2] No such file or directory: 'wikia.com'
</pre>
 
 
Do you know what that is? [[User:Archive Maniac|Archive Maniac]] 12:25, 6 January 2015 (EST)
 
== View Archive.org Directories as Text Only ==
 
Hi Bzc6p, I remember someone on the ArchiveTeam taught me how to view archive.org site directories (e.g. like these: http://web.archive.org/*/media.nintendo-europe.com/* ) as text-only in the browser. I forgot how to do it, so I've come to ask you how to do it. Do you know how? [[User:Archive Maniac|Archive Maniac]] 18:56, 22 January 2015 (EST)
:I literally meant what I said. The link I gave you lists all of the URLs on the Internet Archive. I asked how to view it as text-only. (By the way, it was taught to me on the #archivebot channel, which isn't on BadCheese). [[User:Archive Maniac|Archive Maniac]] 18:32, 23 January 2015 (EST)
::Ah, yes. That's what they mentioned. Thanks, bzc6p. I also have a bit of a problem—see, I want to access a site (http://eecad.sogang.ac.kr/~chang/games/dkc2/) on the Wayback Machine, but it's blocked by robots.txt... Also, many of Nintendo Europe's sites (e.g. nintendo.co.uk, nintendo.es, nintendo.fr) are excluded from the Wayback Machine entirely. Is there any way for me to access them? I mean, J.Scott's obviously not going to help out here. [[User:Archive Maniac|Archive Maniac]] 14:46, 24 January 2015 (EST)
:::Wow, he is not nice. Just look how he talks about the people on the IA Forums on IRC. He's also gloating about having access to everything on the Internet Archive. (And I saved your email in case I get banned for voicing my opinion, which really is true...) [[User:Archive Maniac|Archive Maniac]] 17:30, 24 January 2015 (EST)
:::Add a period in front of the domain name, e.g. https://web.archive.org/web/20011211041409/http://.eecad.sogang.ac.kr/~chang/games/dkc2/ (note that you need to do this for all links too) [[User:PiRSquared|PiRSquared]] 23:59, 25 January 2015 (EST)
::::Thanks PiR. (and sorry for what I said above; I was upset about something on IRC). Oh yeah, and I should probably not tell anyone else about it, which I will do. [[User:Archive Maniac|Archive Maniac]] 14:14, 26 January 2015 (EST)
 
== FTP Sites ==
 
Hey, bzc6p, have you ever considered trying to crawl FTP sites (see [[FTP]] article]])? As of now, I uploaded two on to the Internet Archive. By the way, I figured out that you can save tons of urls on the Wayback Machine if you crawl/mirror a site using Wget (url should be  http://web.archive.org/save/urlgoeshere ). In total, I do: <pre>Wget http://web.archive.org/save/http://exampleurl.com -m -p -np -e robots=off</pre> Hope this helps. It's sort of an ArchiveBot alternative.
 
Or if you don't want to save files on to your computer and delete them every time you crawl a site:
 
<pre>wget http://web.archive.org/save/urlgoeshere.com -r --spider -np -e robots=off</pre> [[User:Archive Maniac|Archive Maniac]] 14:15, 1 February 2015 (EST)

Latest revision as of 12:58, 9 April 2016


Archive 1


MyVIP

Hi, can you let me know when you're online in #archiveteam and have some time to talk with me about MyVIP?


Hi, can you please give me a list of example links for a myvip profile? --Arkiver 04:31, 28 December 2015 (EST)


Chelyabinsk meteor

Hi Bzc6p, could you help create a project to save videos and pictures of Chelyabinsk meteor 2013 event to archive.org?

It is a unique event that has occurred only once every several hundred years, and the original pictures and videos of the event are slowly disappearing from the YouTube and other video sites.

Chelyabinsk meteor on Wikipedia

I'm no developer but I've made efforts to locate the best available references:

So far I've found 3 good lists of video URLs. Unfortunately some of the videos are already inaccessible and lost for eternity :(

~1000 videos URLs here:

~728 video URLs here:

~1000 photos here:

I hope this helps, tell me if there is anything else I can help with.

Thanks for your help. I noticed that the xls/doc file from the second link (vk.com) contains not just URLs to videos, but also URLs to user galleries containing high resolution pictures. These pictures may need to be downloaded manually (there are not too many)

archive.org item

I've created an item with all youtube videos I could download, it's in a tarball format, about 600 videos inside, ~10GiB. ChME-2013-02-15 --Vitzli (talk) 08:58, 9 April 2016 (EDT)