Difference between revisions of "Patch.com"

From Archiveteam
Jump to navigation Jump to search
(Created page with "{{Infobox project | title = Patch.com | image = Patch screenshot.png | description = Your neighborhood. Your news. | URL = <nowiki>http://www.patch.com/</nowiki> | project_sta...")
 
(14 intermediate revisions by 5 users not shown)
Line 4: Line 4:
| description = Your neighborhood. Your news.
| description = Your neighborhood. Your news.
| URL = <nowiki>http://www.patch.com/</nowiki>
| URL = <nowiki>http://www.patch.com/</nowiki>
| project_status = {{closing}}
| project_status = {{specialcase}}
| source = https://github.com/ArchiveTeam/patch-grab
| source = https://github.com/ArchiveTeam/patch-grab
| archiving_status = {{inprogress}}
| archiving_status = {{saved}} - [https://archive.org/details/archiveteam_patch archives]
| irc = cabbagepatch
| irc = cabbagepatch
| tracker = [http://quilt.io.ninjawedding.org/patch here]
| tracker = [http://quilt.at.ninjawedding.org/patchy here]
}}
}}


'''Patch.com''' is a "hyperlocal" news community which is [http://www.sfgate.com/technology/businessinsider/article/AOL-CEO-Tim-Armstrong-Fired-Patch-s-Creative-4720914.php being downsized] from its current ~900 sites to ~600.
'''Patch.com''' is a "hyperlocal" news community which is [http://www.webcitation.org/6IrUArBiV being downsized] from its current ~900 sites to ~500.


=== Current status ===
== Current status ==


antomatic has prepared (what appears to be) a [[full list of sites|List_of_Patch.com_sites]]A prototype seesaw project (no Warrior integration yet) also exists.
In progress.  Warrior integration coming soon.


=== Next steps ===
== Patch.com will rate-limit you across all sites ==


Patch subdomains are (1) big and (2) appear to implement some sort of request cap per IP per unit time(You'll start getting HTTP 420s after a while.)  We need to investigate whether we need to implement a complicated mechanism to split up individual sites and then megawarc them together, or just take each site slowly (e.g. n requests every hour).
Patch.com institutes a rate-limit (some unknown hundreds of requests/hour) across all sitesIf you exceed this, all of your requests will be met with HTTP 420s.


Pop in the IRC channel if you want to help.
If the patch-grab script detects these, it hard-aborts.  A kinder solution would be to sleep for some period of time (an hour?) and try again; suggestions appreciated.
 
{{Navigation box}}

Revision as of 02:13, 30 December 2014

Patch.com
Your neighborhood. Your news.
Your neighborhood. Your news.
URL http://www.patch.com/
Status Special case
Archiving status Saved! - archives
Archiving type Unknown
Project source https://github.com/ArchiveTeam/patch-grab
Project tracker here
IRC channel #cabbagepatch (on hackint)

Patch.com is a "hyperlocal" news community which is being downsized from its current ~900 sites to ~500.

Current status

In progress. Warrior integration coming soon.

Patch.com will rate-limit you across all sites

Patch.com institutes a rate-limit (some unknown hundreds of requests/hour) across all sites. If you exceed this, all of your requests will be met with HTTP 420s.

If the patch-grab script detects these, it hard-aborts. A kinder solution would be to sleep for some period of time (an hour?) and try again; suggestions appreciated.