Difference between revisions of "Talk:SourceForge"

From Archiveteam
Jump to navigation Jump to search
(note about forum)
m
(One intermediate revision by one other user not shown)
Line 1: Line 1:
BerliOS seems to still be online (?) [[User:Its notjack|its_notjack]] 06:50, 14 November 2015 (EST)
Script to download files: https://github.com/SpiritQuaddicted/sourceforge-file-download
Script to download files: https://github.com/SpiritQuaddicted/sourceforge-file-download


Line 4: Line 7:


And some discovery of the various parts and software packages projects can have, random project names are used here where the url patterns were found, imagine $projectname instead.
And some discovery of the various parts and software packages projects can have, random project names are used here where the url patterns were found, imagine $projectname instead.
Issue trackers can have multiple, arbitrary names:


http://sourceforge.net/p/scummvm/bugs/
http://sourceforge.net/p/scummvm/bugs/


http://sourceforge.net/p/scummvm/feature-requests/
http://sourceforge.net/p/scummvm/feature-requests/
http://sourceforge.net/p/dungeonsofdecay/tickets/
* The admin interface lets you pick make arbitrary numbers of issue trackers (called "tickets" in the interface) with arbitrary names. We'll need to find them by parsing the summary page ( http://sourceforge.net/p/scummvm )


http://sourceforge.net/p/scummvm/mailman/
http://sourceforge.net/p/scummvm/mailman/

Revision as of 11:50, 14 November 2015

BerliOS seems to still be online (?) its_notjack 06:50, 14 November 2015 (EST)


Script to download files: https://github.com/SpiritQuaddicted/sourceforge-file-download


And some discovery of the various parts and software packages projects can have, random project names are used here where the url patterns were found, imagine $projectname instead.

Issue trackers can have multiple, arbitrary names:

http://sourceforge.net/p/scummvm/bugs/

http://sourceforge.net/p/scummvm/feature-requests/

http://sourceforge.net/p/dungeonsofdecay/tickets/

  • The admin interface lets you pick make arbitrary numbers of issue trackers (called "tickets" in the interface) with arbitrary names. We'll need to find them by parsing the summary page ( http://sourceforge.net/p/scummvm )

http://sourceforge.net/p/scummvm/mailman/

http://sourceforge.net/p/dvdstyler/discussion/

http://sourceforge.net/p/doublecmd/forum/

  • This appears to be a standard instance of phpBB (which we hopefully know how to archive?)

http://sourceforge.net/p/doublecmd/news/

http://sourceforge.net/p/doublecmd/code

  • this looks like browsable repos; we probably don't want to scrape these

http://sourceforge.net/p/scummvm/patches/

  • we don't want feeds, so reject: patches/[0-9]+/feed\.(atom|rss)
  • attachments are in scummvm/patches/_discuss/thread/

http://sourceforge.net/projects/dvdstyler/reviews

  • simply some pages with ? , not a directory

http://sourceforge.net/p/dvdstyler/wiki/

Donation links (which appear to just be redirects to a PayPal URL, seem to be of the form): http://sourceforge.net/p/scummvm/donate/

wiki might be hosted elsewhere

homepage might be hosted elsewhere


domains from which files are served


It's somewhat lower priority, but a download stats API seems to be documented here: http://sourceforge.net/p/forge/documentation/Download%20Stats%20API/