So what will you be doing to ensure this doesn't happen in the future?
After reviewing what happened, we worked out a way to prevent it and implemented it immediately.
Not to downplay anything (we will always work to improve everything we do), but there is an inevitable exaggeration of even the smallest fault that comes hand-in-hand with scheduled league races. By that I mean: we can provide a flawless service for a month, then have 2 nights in a row where there is a brief 20 minute disturbance, let's say occurring at 8pm exactly. Each night it's a totally different source. Unlike any other game, anyone who races at 8pm will now believe there is a "constant" problem happening "every day" or "all the time". Yet on paper, that equates to 98.7% flawless service in a 24 hour period, and 99.91% in the period of a month. For any other service, those would be very respectable numbers. For a service where people need to operate on a schedule, it can be extremely disruptive.
I know none of that matters to anyone except us, but thinking aloud it's an interesting challenge to deal with as a developer, one which I don't think many games have. When we have a problem at the same time for a couple of days in a week, we know how that's going to look to those people, and it just amplifies the whole situation. I offer this up merely to give some perspective, and show that on paper we are competent (for those who clearly question that and suggest how we do our jobs). We also understand that we can't have issues like this occurring, period. We're always working towards 100% up-time and all of the money from the game gets re-invested into development of features and systems, much of it to ensure that.