Difference between revisions of "Tumblr"

From Archiveteam
Jump to navigation Jump to search
m (→‎top: typos fixed: from it's → from its)
(→‎Lists of Tumblr blogs: https://kinsta.com/blog/import-tumblr-to-wordpress/)
(22 intermediate revisions by 11 users not shown)
Line 2: Line 2:
| title = Tumblr
| title = Tumblr
| logo = Tumblr on white.png
| logo = Tumblr on white.png
| image = Tumblr_staff_blog.png
| URL = https://www.tumblr.com/
| URL = <nowiki>http://www.tumblr.com/</nowiki>
| project_status = {{online}}
| project_status = {{online}}
| archiving_status = {{nosavedyet}}
| archiving_status = {{partiallysaved}}
| source = https://github.com/ArchiveTeam/tumblr-grab
| source = https://github.com/ArchiveTeam/tumblr-grab
| irc = tumbledown
| irc = tumbledown
| tracker = https://tracker.archiveteam.org/tumblr/
}}
}}


Line 14: Line 14:
'''Tumblr''' is a social networking microblog.
'''Tumblr''' is a social networking microblog.


[[Yahoo!]] has purchased Tumblr for 1.1 billion dollars. Tumblr allegedly [http://blogs.wsj.com/digits/2014/10/21/yahoo-tumblr-to-make-over-100-million-in-revenue-next-year/ doubled in number of blogs in 2014] will become profitable in 2015.
As of August 2019, Verizon acknowledges its failure and Tumblr is up for sale, [https://www.verizon.com/about/news/verizon-media-announces-sale-tumblr-automattic possibly to Automattic], the company behind [[WordPress]].


In December 2015, Yahoo put their Tumblr service into the "decide on" category in their Action Plan, according to their [http://www.wsj.com/public/resources/documents/yahoopresentation.pdf 2015 shareholder presentation].
== Quirks ==
Users can change their account names into the format used for deleted accounts. Specifically, USERNAME-deactivated-[Any amount of digits, 0-9]. Users who do this are inaccessible via their main account page, or directly linked to posts. Their posts will still show up in searches, and their "archive" URL will work. This doesn't seem to have an effect on the API, and tumblr-utils will still work just fine. For an example of this tomfoolery, see [https://diediedie3344-deactivated-204913.tumblr.com/archive the archive page of user "diediedie3344-deactivated-204913"].
 
Another quirk is that Tumblr accounts that appear to be on a different domain name are still accessible at, and show up in searches as, their account name. Trying to go to any page on the accountname.tumblr.com end redirects you to the same page on the custom-url-here.com page. For an example of this behavior, see [https://homosethsual.tumblr.com user "homosethsual"] which redirects to [http://ranpos.star.is/ ranpos.star.is]
 
== History ==
 
[[Yahoo!]] has purchased Tumblr for 1.1 billion dollars. Tumblr allegedly [https://blogs.wsj.com/digits/2014/10/21/yahoo-tumblr-to-make-over-100-million-in-revenue-next-year/ doubled in number of blogs in 2014] were supposed to become profitable in 2015.
 
In December 2015, Yahoo put their Tumblr service into the "decide on" category in their Action Plan, according to their [https://www.wsj.com/public/resources/documents/yahoopresentation.pdf 2015 shareholder presentation].
 
In June 2017, Tumblr tightened up "Safe mode", which limits "sensitive content" to all users below 18 years old and the viewing of blogs marked as explicit, potentially causing a major move-away from Tumblr due to Internet Backdraft from its users. Given Yahoo's tendency to ax things that become less popular than expected, it might be important to keep an eye out for it.
 
As of 30th of July 2017, <s>it is no longer possible to access NSFW accounts outside of https://tumblr.com/blog/<name> URLs. Attempting to access an NSFW account normally will now cause infinite redirecting</s>. NSFW marked Tumblrs are inaccessible to signed out users.


In June 2017, Tumblr tightened up "Safe mode", which limits "sensitive content" to all users below 18 years old and the viewing of blogs marked as explicit, potentially causing a major moveaway from Tumblr due to Internet Backdraft from its users. Given Yahoo's tedency to ax things that become less popular than expected, it might be important to keep an eye out for it.
On 3 Dec 2018, [https://tumblr.zendesk.com/hc/en-us/articles/231885248-Sensitive-content Tumblr announced] that all NSFW content will be removed on 17 Dec 2018, with [https://www.eff.org/deeplinks/2018/12/dear-tumblr-banning-adult-content-wont-make-your-site-better-it-will-harm-sex plenty of misclassifications detected].


== Quirks ==
== Lists of Tumblr blogs ==
Users can change their account names into the format used for deleted accounts. Specifically, USERNAME-deactivated-[Any amount of digits, 0-9]. Users who do this are unaccessible via their main account page, or directly linked to posts. Their posts will still show up in searches, and their "archive" url will work. This doesn't seem to have an effect on the API, and tumblr-utils will still work just fine. For an example of this tomfoolery, see [http://diediedie3344-deactivated-204913.tumblr.com/archive the archive page of user "diediedie3344-deactivated-204913"].
* {{URL|https://transfer.sh/13Aa3n/tumblr.com.txt}} (2.6 million; [[Project Sonar]] 2018-10-26 FDNS data)
* {{URL|https://files.catbox.moe/o1di6l.xz}} (~7 million, scraped in april 2018, csv formatted with additional metadata. blog,url,post count,likes count, ... 5th csv field should be the is_nsfw indicator)
* {{URL|https://files.catbox.moe/ve5hb4.xz}} (~12 million, scraped in april 2018, contains everything from above and more. schema as stated in first line of the file: tumblelog,url,posts,likes,adult,nsfw,groupchan)
* {{URL|https://transfer.sh/zO7YT/tumblr-adult-bing}} (96.7k; scraped from Bing on 2018-12-05 with the adult keyword list at {{URL|https://pastebin.com/TgirZdRB}})
 
== Migrate your Tumblr ==
 
If you have a blog on Tumblr, you may want to migrate its content to Wordpress. You can then easily export all the content in a usable format, keep the Wordpress blog as a backup, or even jump ship. An easy and comprehensive tutorial is available at https://kinsta.com/blog/import-tumblr-to-wordpress/
 
== Archiving ==
 
We still need more archivers! As usual, follow instructions on the [https://tracker.archiveteam.org/tumblr/ tracker] and the [https://github.com/ArchiveTeam/tumblr-grab tumblr-grab repository].
 
As you probably heard, Tumblr is [https://www.techdirt.com/articles/20181216/22420341242/as-final-fuck-you-to-free-speech-tumblr-verizon-blocked-archivists.shtml fighting users and archivists]. '''Please limit your concurrency to 1 per machine/IP address'''.
 
As the sources above are incomplete, blogs to be archived can also be submitted using [https://docs.google.com/forms/d/e/1FAIpQLSdoYnlweKF-5iQ2G0FB9s7pDV_Le61dDU-gMMDsc8CQ50YBjQ/viewform?hl=en this form].
 
== Internet Archive collections ==


Another quirk is that tumblr accounts that appear to be on a different domain name are still accessible at, and show up in searches as, their account name. Trying to go to any page on the accountname.tumblr.com end redirects you to the same page on the custom-url-here.com page. For an example of this behavior, see [http://homosethsual.tumblr.com user "homosethsual"] which redirects to [http://ranpos.star.is/ ranpos.star.is]
Tens of thousands subdomains from the December 2018 emergency archival are flowing to https://archive.org/details/archiveteam_tumblr


As of 30th of July 2017<s>,it is no longer possible to access NSFW accounts outside of http://tumblr.com/blog/<name> URLs. Attempting to access an NSFW account normally will now cause infinite redirecting.</s>  NSFW marked Tumblrs are inaccessible to signed out users.
Over 800 domains were archived in 2015: [https://archive.org/search.php?query=WildArchives-Tumblr WildArchives-Tumblr].


== See also ==
== See also ==
* [http://sourceforge.net/projects/gettumblrpics/ gettumblrpics], simple script to download images from a tumblr feed as they appear in it
* [https://github.com/ArchiveTeam/tumblr-grab/blob/master/FAQ.md FAQ on tumblr-grab]
* [https://github.com/bbolli/tumblr-utils/ tumblr-utils], tumblr_backup.py can make a local backup of posts (XML default), video, audio and images.
* [https://sourceforge.net/projects/gettumblrpics/ gettumblrpics], simple script to download images from a Tumblr feed as they appear in it
* [https://github.com/woodenphone/tumblrsagi Tumblrsagi], Code to grab blogs from the API and stuff them into a database for rehosting, used by [https://t.archive.horse/blogs this tumblr archive]
* [https://github.com/bbolli/tumblr-utils tumblr-utils], tumblr_backup.py can make a local backup of posts (XML default), video, audio and images. Uses APIv2
* [http://soup.io] can automatically mirror the contents of a tumblr blog as they are posted, which may be useful for maintaining an offsite-copy which can be archived later.
* [https://github.com/woodenphone/tumblrsagi Tumblrsagi], Code to grab blogs from the API and stuff them into a database for rehosting, used by [https://t.archive.horse/blogs this Tumblr archive]
* [http://www.soup.io] can automatically mirror the contents of a Tumblr blog as they are posted, which may be useful for maintaining an offsite-copy which can be archived later.
* [https://www.jzab.de/content/tumblthree TumblThree], Can archive an entire blog by feeding it an URL, including asks, text posts and reblogs to XML format and can download all images. [https://github.com/johanneszab/TumblThree/releases/latest Downloadable here.] Windows only until the dev implements mono support.
* [https://www.jzab.de/content/tumblthree TumblThree], Can archive an entire blog by feeding it an URL, including asks, text posts and reblogs to XML format and can download all images. [https://github.com/johanneszab/TumblThree/releases/latest Downloadable here.] Windows only until the dev implements mono support.



Revision as of 21:04, 17 April 2020

Tumblr
Tumblr logo
URL https://www.tumblr.com/
Status Online!
Archiving status Partially saved
Archiving type Unknown
Project source https://github.com/ArchiveTeam/tumblr-grab
Project tracker https://tracker.archiveteam.org/tumblr/
IRC channel #tumbledown (on hackint)

Yahoobuystumblr.gif

Tumblr is a social networking microblog.

As of August 2019, Verizon acknowledges its failure and Tumblr is up for sale, possibly to Automattic, the company behind WordPress.

Quirks

Users can change their account names into the format used for deleted accounts. Specifically, USERNAME-deactivated-[Any amount of digits, 0-9]. Users who do this are inaccessible via their main account page, or directly linked to posts. Their posts will still show up in searches, and their "archive" URL will work. This doesn't seem to have an effect on the API, and tumblr-utils will still work just fine. For an example of this tomfoolery, see the archive page of user "diediedie3344-deactivated-204913".

Another quirk is that Tumblr accounts that appear to be on a different domain name are still accessible at, and show up in searches as, their account name. Trying to go to any page on the accountname.tumblr.com end redirects you to the same page on the custom-url-here.com page. For an example of this behavior, see user "homosethsual" which redirects to ranpos.star.is

History

Yahoo! has purchased Tumblr for 1.1 billion dollars. Tumblr allegedly doubled in number of blogs in 2014 were supposed to become profitable in 2015.

In December 2015, Yahoo put their Tumblr service into the "decide on" category in their Action Plan, according to their 2015 shareholder presentation.

In June 2017, Tumblr tightened up "Safe mode", which limits "sensitive content" to all users below 18 years old and the viewing of blogs marked as explicit, potentially causing a major move-away from Tumblr due to Internet Backdraft from its users. Given Yahoo's tendency to ax things that become less popular than expected, it might be important to keep an eye out for it.

As of 30th of July 2017, it is no longer possible to access NSFW accounts outside of https://tumblr.com/blog/<name> URLs. Attempting to access an NSFW account normally will now cause infinite redirecting. NSFW marked Tumblrs are inaccessible to signed out users.

On 3 Dec 2018, Tumblr announced that all NSFW content will be removed on 17 Dec 2018, with plenty of misclassifications detected.

Lists of Tumblr blogs

Migrate your Tumblr

If you have a blog on Tumblr, you may want to migrate its content to Wordpress. You can then easily export all the content in a usable format, keep the Wordpress blog as a backup, or even jump ship. An easy and comprehensive tutorial is available at https://kinsta.com/blog/import-tumblr-to-wordpress/

Archiving

We still need more archivers! As usual, follow instructions on the tracker and the tumblr-grab repository.

As you probably heard, Tumblr is fighting users and archivists. Please limit your concurrency to 1 per machine/IP address.

As the sources above are incomplete, blogs to be archived can also be submitted using this form.

Internet Archive collections

Tens of thousands subdomains from the December 2018 emergency archival are flowing to https://archive.org/details/archiveteam_tumblr

Over 800 domains were archived in 2015: WildArchives-Tumblr.

See also

  • FAQ on tumblr-grab
  • gettumblrpics, simple script to download images from a Tumblr feed as they appear in it
  • tumblr-utils, tumblr_backup.py can make a local backup of posts (XML default), video, audio and images. Uses APIv2
  • Tumblrsagi, Code to grab blogs from the API and stuff them into a database for rehosting, used by this Tumblr archive
  • [1] can automatically mirror the contents of a Tumblr blog as they are posted, which may be useful for maintaining an offsite-copy which can be archived later.
  • TumblThree, Can archive an entire blog by feeding it an URL, including asks, text posts and reblogs to XML format and can download all images. Downloadable here. Windows only until the dev implements mono support.