Incident Summary
From October 13 to October 16, some users were seeing issues when sending emails from Advantage. There was slowness, timeouts, and an error message that mentioned issues with a file path.
Leadup
There were no significant changes or other environmental factors that contributed to this incident.
Fault
Advantage was very slow and presenting a number of different errors including timeout errors. The exact cause is unknown, but it appears to have been related to Advantage’s inability to successfully access some PDF files.
Impact
From 10/13 to 10/16, users who were sending emails from Advantage were seeing slowness, timeouts, and error messages.
Detection
The issue was raised by affiliates via Support tickets. Detection could possibly have been improved by having increased monitoring on the VM, but this may have not caught these sporadic errors either.
Response
The Advantage, DBA, SysOps, and Command Center teams responded to the incident along with representatives from ACS. The process went smoothly and there were no delays that hampered the incident resolution.
Recovery
Functionality was restored on 10/16 by updating VM tools and also rebooting Maui. This lead to a gradual restoration of service.
Timeline
Root Cause
The root cause of this issue is unclear. Although the VM tools update seemed to resolve the issue, there is no other evidence that it was the root cause and not a symptom of the underlying issue.
Recurrence
This issue has not happened before.
Corrective actions
The SysOps team is looking into whether additional monitoring could help detect these types of issues in the future.