2 Issues: Evening of May 8
We've been working on a number of infrastructure changes behind the scenes at Help Scout to further increase redundancy, so that your account works as expected no matter what. Last night we had a couple of growing pains, and we want to share the details:
Issue #1 + How We're Fixing It
Last week we brought a new mailserver online to handle more volume and give us additional redundancy in the case of a failure somewhere. We now have two mailservers running with two different hosting providers so that there's never a single point of failure.
Last night, between 7:30pm and 12:30am, the new mailserver ran into some problems. The server thought it was out of space when it actually had plenty available. During this window, emails forwarded to Help Scout were bounced sporadically. Some emails were processed and others were bounced, and we don't yet know what made the difference.
Since nothing was actually "down" during this time, none of our monitoring alerted us to the problem. That's why it took longer to find and fix the issue. Our development team has already come up with a solution for storing messages safely should this ever happen again, so that they are not bounced.
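For the technically curious, the general idea of setting messages aside instead of bouncing them can be sketched as follows. This is only an illustration under stated assumptions, not the actual Help Scout implementation: the function name `store_message` and the two directory parameters are hypothetical, and a real mailserver would do this inside its delivery pipeline rather than in a standalone script.

```python
import os
import uuid


def store_message(raw: bytes, primary_dir: str, fallback_dir: str) -> str:
    """Write a raw message to primary_dir; on a storage error, spool it
    to fallback_dir instead of rejecting it.

    Returns the path the message was written to. Raises OSError only if
    both locations fail. All names here are hypothetical.
    """
    name = f"{uuid.uuid4().hex}.eml"
    for directory in (primary_dir, fallback_dir):
        try:
            os.makedirs(directory, exist_ok=True)
            path = os.path.join(directory, name)
            with open(path, "wb") as fh:
                fh.write(raw)
            return path
        except OSError:
            # For example "no space left on device"; try the next location.
            continue
    raise OSError("message could not be stored in any location")
```

Messages that land in the fallback location could then be reprocessed once the primary store is healthy again. Counting how often the fallback is used would also give monitoring something to alert on when nothing is technically "down".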
Issue #2 + How We're Fixing It
From 10:00 to 10:15pm last night, we had a scheduled maintenance window for outgoing email. Any emails sent from Help Scout were put on hold during the maintenance, while everything else operated as normal.
For a period of less than 10 minutes, incoming emails may have bounced. Our hosting provider had to clear a cache we were unaware of, which caused a slight delay. This particular maintenance most likely won't happen again. It's part of the learning process in dealing with different cloud providers and understanding how they operate. Should we ever need to perform this maintenance again, we'll know the cache on their end must be cleared.
Oddly enough, these two issues were not related, even though they happened on the same evening. We sincerely apologize for the inconvenience caused. Another ironic twist is that all of this maintenance is designed to bring more stability to Help Scout in the long term, and we still believe it will.
A great deal of work is being done on our end to continue scaling the Help Scout infrastructure and to make sure email is always stored and processed even if everything else goes down. Last night was a blip that we're confident will not happen again. Staying up is and always will be the most critical priority.
Have any specific questions? You can email us here.