Difference between revisions of "Talk:SourceForge"
Latest revision as of 11:50, 27 May 2022
BerliOS seems to still be online (?) its_notjack 06:50, 14 November 2015 (EST)
Preparation work, 2015
Script to download files: https://github.com/SpiritQuaddicted/sourceforge-file-download
Some discovery of the various parts and software packages a project can have. The URLs below use whichever project the pattern was found on; imagine $projectname in its place.
Issue trackers can have multiple, arbitrary names:
http://sourceforge.net/p/scummvm/bugs/
http://sourceforge.net/p/scummvm/feature-requests/
http://sourceforge.net/p/dungeonsofdecay/tickets/
- The admin interface lets you create an arbitrary number of issue trackers (called "tickets" in the interface) with arbitrary names. We'll need to find them by parsing the summary page ( http://sourceforge.net/p/scummvm )
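A minimal sketch of that discovery step, assuming the summary page links to each tool with an href of the form /p/$projectname/<name>/ (the actual markup has not been checked). It only lists the first path segment of every such link; working out which of those names are issue trackers is left open. `ToolLinkParser` and `find_tool_names` are hypothetical names.

```python
from html.parser import HTMLParser
import re


class ToolLinkParser(HTMLParser):
    """Collects the first path segment after /p/<project>/ from every <a href>."""

    def __init__(self, project):
        super().__init__()
        # Assumption: links are either site-relative or absolute on sourceforge.net.
        self._pattern = re.compile(
            r"^(?:https?://sourceforge\.net)?/p/" + re.escape(project) + r"/([^/?#]+)"
        )
        self.tools = set()

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href:
            match = self._pattern.match(href)
            if match:
                self.tools.add(match.group(1))


def find_tool_names(summary_html, project):
    """Return the sorted tool names linked from an already-downloaded summary page."""
    parser = ToolLinkParser(project)
    parser.feed(summary_html)
    return sorted(parser.tools)
```

Usage would be something like `find_tool_names(html_of_summary_page, "scummvm")`, with the page fetched separately.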
http://sourceforge.net/p/scummvm/mailman/
- Mailing lists *may* already be archived elsewhere (e.g. Gmane, The Mail Archive), so they may be lower priority
- http://sourceforge.net/p/doublecmd/mailman/doublecmd-devel/
- Contains all the messages (paged: http://sourceforge.net/p/doublecmd/mailman/doublecmd-devel/?style=flat&page=3 )
- http://sourceforge.net/p/doublecmd/mailman/message/34074982/
- A particular message (with thread)
- http://sourceforge.net/p/doublecmd/mailman/attachment/20150428145819.6a8925d5%40spritty/1/
- an attachment (which we may or may not care about)
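The mailing list URL patterns above could be generated like this. It is only a sketch built from the example URLs: the function names are made up, and the first page number and the way to detect the last page are unknown.

```python
from urllib.parse import urlencode

BASE = "http://sourceforge.net/p"


def list_page_url(project, mailing_list, page):
    """URL of one page of the flat message listing for a mailing list."""
    query = urlencode({"style": "flat", "page": page})
    return f"{BASE}/{project}/mailman/{mailing_list}/?{query}"


def message_url(project, message_id):
    """URL of a single message (shown with its thread)."""
    return f"{BASE}/{project}/mailman/message/{message_id}/"
```

`list_page_url("doublecmd", "doublecmd-devel", 3)` reproduces the paged URL quoted above.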
http://sourceforge.net/p/dvdstyler/discussion/
http://sourceforge.net/p/doublecmd/forum/
- This appears to be a standard instance of phpBB (which we hopefully know how to archive?)
http://sourceforge.net/p/doublecmd/news/
http://sourceforge.net/p/doublecmd/code
- this looks like browsable repos; we probably don't want to scrape these
http://sourceforge.net/p/scummvm/patches/
- we don't want feeds, so reject: patches/[0-9]+/feed\.(atom|rss)
- attachments are in scummvm/patches/_discuss/thread/
http://sourceforge.net/projects/dvdstyler/reviews
- simply some pages with a ? query string, not a directory
http://sourceforge.net/p/dvdstyler/wiki/
- we don't want feeds, so reject: wiki/[0-9]+/feed\.(atom|rss)
- http://sourceforge.net/p/doublecmd/wiki/browse_pages/
- looks like a way to get a list of wiki pages (TODO: figure out paging)
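The two feed reject patterns noted for patches and wiki can be combined into a single filter. A rough sketch, with `FEED_REJECT` and `should_fetch` as hypothetical names; the patterns are taken as written above and have not been tested against the live site.

```python
import re

# Combines patches/[0-9]+/feed\.(atom|rss) and wiki/[0-9]+/feed\.(atom|rss).
FEED_REJECT = re.compile(r"(?:patches|wiki)/[0-9]+/feed\.(?:atom|rss)")


def should_fetch(url):
    """Return False for feed URLs we want to skip, True for everything else."""
    return FEED_REJECT.search(url) is None
```

The same expression should also work as a reject regex in a crawler that accepts one, though that depends on the crawler's regex dialect.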
Donation links, which appear to just be redirects to a PayPal URL, seem to be of the form: http://sourceforge.net/p/scummvm/donate/
wiki might be hosted elsewhere
homepage might be hosted elsewhere
domains from which files are served
It's somewhat lower priority, but a download stats API seems to be documented here: http://sourceforge.net/p/forge/documentation/Download%20Stats%20API/
Thoughts, 2022
It's still desirable to archive SourceForge. --Random (talk) 11:50, 27 May 2022 (UTC)