Difference between revisions of "Jira"

From Archiveteam
Jump to navigation Jump to search
(issues.apache.org failed, will need to update this further later)
(partially update list, going to need to do more work on this though)
Line 11: Line 11:
Issue comments are loaded via JS and are not properly saved. However, all comments are included when exporting an issue in "printable" form, as well as in the XML and DOC export formats. As such, these export links are worth saving (though they either need to be generated as a list or extracted from the job's database). <code><nowiki>https://example.com/si/jira.issueviews:issue-html/PROJ-12345/PROJ-12345.html</nowiki></code>
Issue comments are loaded via JS and are not properly saved. However, all comments are included when exporting an issue in "printable" form, as well as in the XML and DOC export formats. As such, these export links are worth saving (though they either need to be generated as a list or extracted from the job's database). <code><nowiki>https://example.com/si/jira.issueviews:issue-html/PROJ-12345/PROJ-12345.html</nowiki></code>


{{URL|https://transfer.archivete.am/qMgJa/jira_url_list.py}} generates a list of URLs, also using the {{URL|https://docs.atlassian.com/software/jira/docs/api/REST/9.13.0/|Jira API}} to determine the maximum issue for each project.
{{URL|https://transfer.archivete.am/qMgJa/jira_url_list.py}} generates a list of URLs, also using the {{URL|https://docs.atlassian.com/software/jira/docs/api/REST/9.13.0/|Jira API}} to determine the maximum issue for each project. {{URL|https://transfer.archivete.am/6owk9/jira_process_db.py}} extracts attachment URLs from databases. (Offsite links aren't done yet.)
 
A few sites have attachments that redirected to a login page, e.g. https://hub.spigotmc.org/jira/secure/attachment/10005/daf45dfd-da9b-498b-96e4-5782bbac11c7.dat from https://hub.spigotmc.org/jira/browse/SPIGOT-31. These are simply broken, and logging in results in a 404 page for those attachments. The attachment zip for those issues also gives a 500.
 
{{URL|https://transfer.archivete.am/6rcal/jira_url_list_delux.py}} was used to generate lists of tickets from the API, as well as the API calls themselves, for two sites, though in both cases the script needed to be modified and using 1000 entries/request does not work well (100 entries/request seems much more stable). This was needed due to large numbers of nonexistent tickets due to either importing from another issue tracker or large amounts of deleted spam.


== Status ==
== Status ==


{| class="wikitable sortable plainlinks"
{| class="wikitable sortable plainlinks"
! rowspan=2 | API !! rowspan=2 | Status !! colspan=7 | Archive details
! rowspan=2 | Site !! rowspan=2 | Status !! colspan=7 | Archive details
|-
|-
! Tool !! URL list !! Job !! Date !! Size !! Objects !! Notes
! Tool !! URL list !! Job !! Date !! Size !! Objects !! Notes
Line 29: Line 33:
| {{saved}} || [[ArchiveBot]] (!ao < list) || {{URLAB|https://transfer.archivete.am/tEkTT/bugs.mojang.com_missed_urls_other_than_attachments.txt | bugs.mojang.com_missed_urls_other_than_attachments.txt}}<!-- zst --> || [https://archive.fart.website/archivebot/viewer/job/20231004220509e7bzh e7bzh] || 2023-10-04 || || 3 warcs || XML/HTML/DOC versions (included in the seed list for later jobs) as well as project descriptions (not included)
| {{saved}} || [[ArchiveBot]] (!ao < list) || {{URLAB|https://transfer.archivete.am/tEkTT/bugs.mojang.com_missed_urls_other_than_attachments.txt | bugs.mojang.com_missed_urls_other_than_attachments.txt}}<!-- zst --> || [https://archive.fart.website/archivebot/viewer/job/20231004220509e7bzh e7bzh] || 2023-10-04 || || 3 warcs || XML/HTML/DOC versions (included in the seed list for later jobs) as well as project descriptions (not included)
|-
|-
|rowspan="2"| https://hub.spigotmc.org/jira/
|rowspan="3"| https://hub.spigotmc.org/jira/
| {{saved}} || [[ArchiveBot]] (!a < list) || {{URLAB|https://transfer.archivete.am/GDkEA/hub.spigotmc.org_jira_seed_urls.txt | hub.spigotmc.org_jira_seed_urls.txt}}<!-- not zst --> || [https://archive.fart.website/archivebot/viewer/job/202310042150482nxku 2nxku] || 2023-10-04 || || 1 warc || includes offsite links, does not include XML/HTML/DOC
| {{saved}} || [[ArchiveBot]] (!a < list) || {{URLAB|https://transfer.archivete.am/GDkEA/hub.spigotmc.org_jira_seed_urls.txt | hub.spigotmc.org_jira_seed_urls.txt}}<!-- not zst --> || [https://archive.fart.website/archivebot/viewer/job/202310042150482nxku 2nxku] || 2023-10-04 || || 1 warc || includes offsite links, does not include XML/HTML/DOC
|-
|-
| {{notsavedyet}} || || || || || || || attachments, XML/HTML/DOC
| {{saved}} || [[ArchiveBot]] (!ao < list) || {{URLAB|https://transfer.archivete.am/10RVuY/hub.spigotmc.org_attachments.txt}}<!-- not zst --> || [https://archive.fart.website/archivebot/viewer/job/202402140438077gnzz 7gnzz] || 2024-02-14 || || 1 warc || attachments, XML/HTML/DOC
|-
| {{upcoming}} || || || || || || || XML/HTML/DOC
|-
|-
| https://issues.asterisk.org/ || {{lost}}/{{selfsaved}} || || || [https://archive.fart.website/archivebot/viewer/job/20231212002710aehw8 aehw8] || 2023-12-12 || || || redirect to GitHub
| https://issues.asterisk.org/ || {{lost}}/{{selfsaved}} || || || [https://archive.fart.website/archivebot/viewer/job/20231212002710aehw8 aehw8] || 2023-12-12 || || || redirect to GitHub
Line 116: Line 122:
|-
|-
| {{notsavedyet}} || || || || || || || offsite links
| {{notsavedyet}} || || || || || || || offsite links
|-
|rowspan="2"| https://jira.mariadb.org/
| {{inprogress}} || [[ArchiveBot]] (!a < list) || {{URLAB|https://transfer.archivete.am/q76ET/jira.mariadb.org.txt | jira.mariadb.org.txt}}<!-- zst --> || [https://archive.fart.website/archivebot/viewer/job/da3z7 da3z7] || 2024-02-08 || || || includes offsite links and XML/HTML/DOC
|-
| {{notsavedyet}} || || || || || || || attachments
|-
|-
|rowspan="4"| https://issues.apache.org/jira/browse/
|rowspan="4"| https://issues.apache.org/jira/browse/
Line 126: Line 137:
| {{notsavedyet}} || || || || || || || offsite links
| {{notsavedyet}} || || || || || || || offsite links
|-
|-
|rowspan="2"| https://jira.mariadb.org/
|rowspan="3"| https://bugs.openjdk.org/
| {{inprogress}} || [[ArchiveBot]] (!a < list) || {{URLAB|https://transfer.archivete.am/q76ET/jira.mariadb.org.txt | jira.mariadb.org.txt}}<!-- zst --> || [https://archive.fart.website/archivebot/viewer/job/da3z7 da3z7] || 2024-02-08 || || || includes offsite links and XML/HTML/DOC
| {{upcoming}} || || || || || || || something's wrong here; https://bugs.openjdk.org/browse/CCC-8178916?jql=project%20%3D%20CCC%20ORDER%20BY%20key%20desc gives 91 issues between CCC-4244499 and CCC-8178916
|-
|-
| {{notsavedyet}} || || || || || || || attachments
| {{notsavedyet}} || || || || || || || attachments
|-
|-
|rowspan="2"| https://bugs.openjdk.org/
| {{notsavedyet}} || || || || || || || offsite links
| {{upcoming}} || || || || || || || something's wrong here; https://bugs.openjdk.org/browse/CCC-8178916?jql=project%20%3D%20CCC%20ORDER%20BY%20key%20desc gives 91 issues between CCC-4244499 and CCC-8178916
|-
| {{notsavedyet}} || || || || || || || attachments
|-
|-
|More... || || || || || || || || mostly saved, to be entered soon
|}
|}


== References ==
== References ==
{{References}}
{{References}}

Revision as of 06:20, 15 February 2024

Jira is a bug tracking system developed by Atlassian. They previously offered a self-hosted version (Jira Server), but have ended sales on Feb 15, 2023[1] and support will fully end on Feb 15, 2024,[2] leaving the only options as their cloud version[3] or the comically expensive data center version.[4]

Strategy

An ArchiveBot !a command will not properly discover all issues.

The approach I've used is to find the highest-numbered ticket in each project (use the "all issues" filter and order by "key"), and then generated a text file listing all of the ticket URLs, which then can be ran (by an op) as an !a < list job. However, this will not save issue attachments due to the no-parent rule, so in addition the job's database needs to be manually saved and the relevant links need to be extracted from the database, and can be run as a separate job.

Attachment mime types do get detected as a relative URL, and will need to be ignored, along the lines of https?://{primary_netloc}/browse/(application|text|image|video)/. Not all attachments are found like this due to a wpull quirk, so this is insufficient for getting a list of attachments.[5] Additionally, attachment sorting is not useful to save, so [?&]attachment(Order|ViewMode|SortBy)= can be ignored.

Issue comments are loaded via JS and are not properly saved. However, all comments are included when exporting an issue in "printable" form, as well as in the XML and DOC export formats. As such, these export links are worth saving (though they either need to be generated as a list or extracted from the job's database). https://example.com/si/jira.issueviews:issue-html/PROJ-12345/PROJ-12345.html

https://transfer.archivete.am/qMgJa/jira_url_list.py[IAWcite.todayMemWeb] generates a list of URLs, also using the Jira API[IAWcite.todayMemWeb] to determine the maximum issue for each project. https://transfer.archivete.am/6owk9/jira_process_db.py[IAWcite.todayMemWeb] extracts attachment URLs from databases. (Offsite links aren't done yet.)

A few sites have attachments that redirected to a login page, e.g. https://hub.spigotmc.org/jira/secure/attachment/10005/daf45dfd-da9b-498b-96e4-5782bbac11c7.dat from https://hub.spigotmc.org/jira/browse/SPIGOT-31. These are simply broken, and logging in results in a 404 page for those attachments. The attachment zip for those issues also gives a 500.

https://transfer.archivete.am/6rcal/jira_url_list_delux.py[IAWcite.todayMemWeb] was used to generate lists of tickets from the API, as well as the API calls themselves, for two sites, though in both cases the script needed to be modified and using 1000 entries/request does not work well (100 entries/request seems much more stable). This was needed due to large numbers of nonexistent tickets due to either importing from another issue tracker or large amounts of deleted spam.

Status

References