Difference between revisions of "Patch.com"

From Archiveteam
Jump to navigation Jump to search
Line 15: Line 15:
=== Current status ===
=== Current status ===


A combo archive/spider script now exists, and is being tested.  If it all goes well, we'll put this out on the WarriorPop in the IRC channel if you'd like to be notified when we're ready to go.
In progress.  Warrior integration coming soon.
 
=== Patch.com will rate-limit you across all sites ===
 
Patch.com institutes a rate-limit (some unknown hundreds of requests/hour) across all sites.  If you exceed this, all of your requests will be met with HTTP 420s.
 
If the patch-grab script detects these, it hard-abortsA kinder solution would be to sleep for some period of time (an hour?) and try again; suggestions appreciated.

Revision as of 17:03, 19 August 2013

Patch.com
Your neighborhood. Your news.
Your neighborhood. Your news.
URL http://www.patch.com/
Status Closing
Archiving status In progress...
Archiving type Unknown
Project source https://github.com/ArchiveTeam/patch-grab
Project tracker here
IRC channel #cabbagepatch (on hackint)

Patch.com is a "hyperlocal" news community which is being downsized from its current ~900 sites to ~500.

Current status

In progress. Warrior integration coming soon.

Patch.com will rate-limit you across all sites

Patch.com institutes a rate-limit (some unknown hundreds of requests/hour) across all sites. If you exceed this, all of your requests will be met with HTTP 420s.

If the patch-grab script detects these, it hard-aborts. A kinder solution would be to sleep for some period of time (an hour?) and try again; suggestions appreciated.