Well drat. Our 100% uptime value on our Pingdom report is now a thing of the past. You can read all the gory details in this Status blog post.
Yesterday's outage was the first we've had in a long time, and hopefully the last for a long time too. It was also the first since we started publishing these blogs, and it highlighted several advantages of our new blogging approach:
1.) Have a separate set of servers in a different location that can continue to work even if the main site is down.
These blogs are hosted by TypePad.com on their servers, which were unaffected by our charting issues.
2.) Keep everyone as informed as possible.
Despite things being very hectic here during the outage, we took time to update our Status blog whenever our understanding of the problem changed.
3.) Explain the problem and explain the fix.
Let everyone know that this particular chink in our armor has been identified and fixed.
So, while all outages are bad - really, really bad - I hope that our Status blog kept people better informed than ever before.
Remember, if you ever think that we might be down, check our Status blog and/or our Pingdom report for confirmation.
And please use the comments section below to let me know if you have any suggestions for improving how we communicate with you during outages.
(Yeah, yeah - "Don't have any!" - That's always the goal... but, it, isn't, easy.)