ROOT CAUSE ANALYSIS; SERVICE INTERRUPTION; 01/13/2014 9AM through 10AM PST

Our engineers investigated a problem affecting a large subset of our users: the main page would appear, but feeds would not load. The incident began at 09:00:36 AM PST and ended at 10:00:36 AM PST.

Due to a unique set of environmental circumstances, the feed loading problem did not appear for our internal teams, which significantly delayed discovery of the issue and resolution of the service interruption. Once customer reports reached the engineering team, the interruption was quickly investigated and resolved.

The engineering team is reviewing the systems responsible for monitoring these conditions and adjusting our process so that our internal environments enable more immediate discovery of feed issues that affect our external customers.

About Dan

I work at Yammer to make sure that our customers solve their issues!