Bitbucket

From Archiveteam
Revision as of 16:23, 3 September 2020 by JustAnotherArchivist (talk | contribs) (Update and cleanup/slight restructuring, move actual repositories to separate Mercurial page)
Bitbucket
URL https://bitbucket.org/
Status Online!
Archiving status Not saved yet
Archiving type Unknown
IRC channel #kickthebucket (on hackint)

Bitbucket is a version control repository hosting service, marketed mostly towards proprietary and enterprise software but with a substantial FLOSS presence.

Mercurial repositories

Bitbucket announced on 20 August 2019 that it would end Mercurial support to focus exclusively on Git.[1] Creation of new Mercurial repositories was disabled on 1 February 2020, and all Mercurial repositories and the corresponding API were to be removed on 1 July 2020.[2] The actual removal happened in mid-August 2020.

Archival

Bitbucket Mercurial web content
Status Offline
Archiving status Saved!
Archiving type Unknown
Project source bitbucket-grab
Project tracker bitbucket
IRC channel #kickthebucket (on hackint)

Our archival was based on an enumeration of Mercurial repositories from Bitbucket's search API, taken after the February lockdown. Repositories remained writable until they were made read-only in early July. Although the API returns an updated-on field, it is not clear whether this refers to the last change to the repository contents or to its metadata.
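The enumeration step can be illustrated with a minimal sketch. This is not the code the project actually used. It assumes a paginated JSON API in which each page carries a "values" list and an optional "next" URL, and in which each record has an "scm" field; the function name iter_hg_repositories and the injected fetch_page callable are hypothetical and exist only for illustration.

```python
from typing import Callable, Dict, Iterator, Optional


def iter_hg_repositories(
    first_url: str,
    fetch_page: Callable[[str], Dict],
) -> Iterator[Dict]:
    """Yield Mercurial repository records from a paginated listing.

    Assumptions (not taken from the project): each page is a dict with a
    "values" list and an optional "next" URL, and each record has an "scm"
    field. fetch_page is a hypothetical callable that turns a URL into the
    decoded JSON page; injecting it keeps the sketch testable offline.
    """
    url: Optional[str] = first_url
    while url:
        page = fetch_page(url)
        for repo in page.get("values", []):
            # Keep only Mercurial repositories; skip Git ones.
            if repo.get("scm") == "hg":
                yield repo
        # A missing "next" key ends the enumeration.
        url = page.get("next")
```

Passing the fetcher in as an argument means authentication and rate limiting can be handled in one place, separately from the pagination logic.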

The project was split into two parts: the actual hg repositories were retrieved through the Mercurial project (developed for this purpose but reusable for hg repositories in general), while the bitbucket project covered the web interface (issues, pull requests, wikis, etc.). Apart from around 200 repositories, we managed to archive everything successfully.

Statistics

  • Total repos online: 245,068
  • Total reported size (fairly accurate): 5.23 TiB (it is unclear whether this includes hg compression)
  • Mean reported size: 22.4 MiB
  • Median reported size: 205 KiB
  • Maximum reported size: 14.4 GiB
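As a quick consistency check on these figures, the mean can be recomputed from the total size and the repository count. The sketch below assumes binary units (1 TiB = 1024 × 1024 MiB); the helper name mean_size_mib is hypothetical.

```python
MIB_PER_TIB = 1024 * 1024  # binary units: 1 TiB = 1024 GiB = 1024 * 1024 MiB


def mean_size_mib(total_tib: float, repo_count: int) -> float:
    """Return the mean repository size in MiB for a total given in TiB."""
    return total_tib * MIB_PER_TIB / repo_count


mean = mean_size_mib(5.23, 245_068)
print(f"{mean:.1f} MiB")  # prints "22.4 MiB"
```

The result agrees with the reported mean. The median (205 KiB) is far below the mean, which indicates that a small number of very large repositories account for most of the total.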

Existing discussion and tooling

Site structure

Some API endpoints require authentication; others do not. Rate limits are documented by Bitbucket.

References

  1. https://bitbucket.org/blog/sunsetting-mercurial-support-in-bitbucket
  2. The original sunset date was 1 June 2020, but on 21 April 2020 this was pushed back because of the coronavirus pandemic.