Windows Live Spaces
On March 16, 2011 will definitely shut down the platform of Windows Live Spaces and since September last year Microsoft has been notifying every user who had a Space active to migrate it to Wordpress using your Windows Live ID, save it to your hard drive or remove. Still, there are many long-abandoned blogs that have not yet been migrated so that they may not survive. For these reasons I decided to create this tutorial to know who wants to save some Spaces
HTTrack (graphic version)
I will explain what is the procedure to download one or more Spaces using HTTrack graphic version (WinHTTrack in Windows and in Linux is called WebHTTrack).
I assume that the reader should be familiarized with the use of WinHTTrack (or WebHTTrack) so I'll just explain that you need configure (in the Option Panel of the program) to download a Space of Windows Live Spaces.Cite error: Invalid
refs with no name must have content
In the section "Scan Rules" must be added the following lines:
+*.css +*.js -ad.doubleclick.net/* -mime:application/foobar +*.7z +*.pdf +*.doc +*.mid +*.3gp +*.djvu +*.amr +*.mp4 +*.ogg +*.ogv +*.ogm +*.mov +*.mpg +*.mpeg +*.avi +*.asf +*.mp3 +*.mp2 +*.rm +*.wav +*.vob +*.qt +*.vid +*.ac3 +*.wma +*.wmv +*.zip +*.tar +*.tgz +*.gz +*.rar +*.z +*.arj +*.dar +*.lzh +*.lz +*.lza +*.arc +*.gif +*.jpg +*.png +*.tif +*.bmp -*.entry#comment +*.profile.live.com/Lists/* +*.byfiles.storage.live.com/* +*.photos.live.com +*.spaces.live.com
Line 1 to 7 indicate what types of files are downloaded from a Space (if the program finds one these and this lines can be modified to suit the user), the line 8 is because the program tries to capture the comments any post of a blog on Windows Live Spaces and this action generates errors (in addition to a waste of time when exploring a site), line 9 and 12 are used to capturing Spaces of the list of "friends" who might have the Space user which is capturing at that time (these lines are optional), and lines 10 and 11 are to capture the files and photosCite error: Invalid
refs with no name must have content that the user can have uploaded there.
Finally add in the field Browser "Identity" (from the section Browser ID) the following User Agent:
Googlebot/2.1 (+ http://www.googlebot.com/bot.html)