
I am the founder and CEO of Catchpoint. The Catchpoint vision is to provide the best IT analytics and intelligence to help companies improve their end-user performance and keep an eye on their infrastructure and vendors. Before Catchpoint, I spent 10+ years at DoubleClick and Google, where I was responsible for Quality of Services: buying, building, deploying, and using various internal and external monitoring solutions to keep an eye on the DART infrastructure, which delivered billions of transactions a day using thousands of servers, routers . . . HP OpenView, SiteScope, SMARTS, Adlex, Coradiant, Gomez, and Keynote are some of the products we used. Hence my interest in building something in this space! Mehdi is a DZone MVB, is not an employee of DZone, and has posted 46 posts at DZone. You can read more from them at their website.

How the NYT Was Up When It Was Down

08.16.2013


Yesterday, the big web news story of the day was the NYT outage. But the bigger performance story was not that the NYT was down for a few hours, but how the NYT team did all the right things to handle the outage.

[Chart: NYT availability]

On our end, Catchpoint synthetic agents started recording HTTP 503 server response errors from http://www.nytimes.com at around 11:14 AM EST, indicating that the web application was not functioning properly. During the downtime, there were timeout and reset errors from across the US, from Washington D.C. to Dallas to San Francisco. The homepage was back up and running by about 1:38 PM EST. A post hoc analysis deemed the root cause of the failure to be an issue during internal maintenance.
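For readers curious what a synthetic check of this kind looks like, here is a minimal sketch in Python. It is not how Catchpoint agents work; the function names (`classify`, `probe`, `availability`) and the labels they return are assumptions made up for this illustration. It sorts each probe into the error types mentioned above (503, timeout, reset) and computes an availability percentage from a series of probes.

```python
import socket
import urllib.error
import urllib.request


def classify(status=None, error=None):
    """Map one probe outcome to a coarse label (hypothetical labels)."""
    if error is not None:
        if isinstance(error, (socket.timeout, TimeoutError)):
            return "timeout"
        if isinstance(error, ConnectionResetError):
            return "reset"
        return "error"
    if status is not None and 200 <= status < 400:
        return "ok"
    if status == 503:
        return "503"
    return "error"


def probe(url, timeout=10.0):
    """Fetch the URL once and return the label for that attempt."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return classify(status=resp.status)
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status such as 503.
        return classify(status=exc.code)
    except urllib.error.URLError as exc:
        # Connection-level failure; the underlying cause is in exc.reason.
        reason = exc.reason
        return classify(error=reason if isinstance(reason, Exception) else exc)
    except OSError as exc:
        # Timeouts or resets raised while reading the response.
        return classify(error=exc)


def availability(labels):
    """Percentage of probes labeled "ok"; 0.0 for an empty series."""
    if not labels:
        return 0.0
    return 100.0 * sum(1 for label in labels if label == "ok") / len(labels)
```

Running `probe("http://www.nytimes.com")` on a schedule from several locations and passing the collected labels to `availability` would give a rough version of the availability view described here.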

You Can’t Stop the Presses

Downtime happens and it cannot be avoided. To make the best of a bad situation, I always recommend communicating with your customers throughout the outage and being as creative as possible to keep the business running, and that is exactly what the NYT did. They kept their readers informed, using social media not only as a temporary platform to keep delivering the news but also to stay in touch and let everyone know they were working hard to resolve the issue.

I commend the team for their swift action and grace during the downtime. From my time at DoubleClick, I know personally how stressful it can be: the office turns into a war room with everyone trying to triage and resolve the issue as fast as possible. The NYT set an example yesterday that everyone should follow. They deserve a pat on the back and a nice cold beer.

I bow to the Grey Lady.

Mehdi – Catchpoint

Published at DZone with permission of Mehdi Daoudi, author and DZone MVB. (source)
