Industry


Ads by TechWords

See your link here


Preston Gralla's picture
Preston Gralla

Seeing Through Windows

Gmail outage: Microsoft laughs all the way to the bank

Google's attempt to prove to the world that office apps inexpensively delivered from the cloud beats running PC-based clients got a big setback yesterday, when Gmail suffered a world-wide outage. The big winner: Microsoft, which is laughing all the way to the bank.

Computerworld reports that the outage was in essence caused by "a traffic jam on its servers." Meanwhile Google said in its official blog that the problem was caused when Google took "a small fraction of Gmail's servers offline to perform routine upgrades." That, in turn, led to a series of events that led to the worldwide outage, because Google underestimated the effects of previous changes they had made to servers --- changes that were ironically designed to improve reliability.

Google's blog goes into a lengthy explanation of the outage and how Google tried to solve it. I have to say, though, that the explanation makes me feel as if cloud computing is a more tenuous proposition than I realized. Consider this explanation of what happened, according to the Google blog:

The Gmail engineering team was alerted to the failures within seconds (we take monitoring very seriously). After establishing that the core problem was insufficient available capacity, the team brought a LOT of additional request routers online (flexible capacity is one of the advantages of Google's architecture), distributed the traffic across the request routers, and the Gmail web interface came back online.

The key point here is that the Gmail team knew about the failue within seconds, but it still took 100 minutes to bring Gmail back online. This isn't to criticize the Gmail team --- such outages, I believe, are inherent in the nature of cloud computing. There are simply too many points at which things can go wrong --- not just servers, but to the Internet itself.

And that's why Microsoft is laughing all the way to the bank. Enterprises simply can't have outages of their most vital applications. All Microsoft needs to do is arm its Office sales force with news reports of the outage. When they're up against the Google sales team trying to sell Google Apps, all the Microsoft people will have to do is pull out the news reports. Chalk up one more sale to Microsoft.

Microsoft will be moving Office to the cloud to a certain extent, but that will most likely be an adjunct to client-based Office. And until cloud computing is a rock-solid certainty -- a time which may never come -- enterprises will most likely opt for reliability over cost-savings.

What People Are Saying

No Preston...

-Google has more money than Microsoft ever will.

-If anything Microsoft's in the bank for loans, not cashing checks.

-Google is reliable in every aspect of its profession, I trust the Empire with my life.

-The power outage only affected Gmail, not applications like you said Preston.

-This wasn't a big deal, you're just a big Microsoft fan.

-I'm a fan of the best.

Pre-order some aion cd key

Pre-order some aion cd key is crucial if you wanna be a gamer of Aion. And there is another item, aion time card is as important as the cd key in the game.

Yes, Google messed this one

Yes, Google messed this one up, but consider two points-

1. One of the main themes of the comments here is that Enterprise Apps are managed in as many different ways as there are Enterprises. World-wide, failures are common, whether due to Microsoft or Local IT or the User. While not for everyone, it is clear that for a LARGE group of enterprises consistant off-site expert management will be a tremendous improvement in and of itself.

2. Forget Apps for a minute... A little village known as New York City has had occasional power blackouts shutting everything down. (Ever try walking down from the 50th floor?) But interestingly enough, I haven't heard of a lot of companies taking power generation out of "the cloud."

Clearly one can do better than 100 minutes of downtime a quarter, but many, many "other ones" can't!

Apps weren't down

That's an odd prediction regarding Microsoft's sales force, since Gmail was down, not Google Apps. I accessed both Docs and Spreadsheets during the Gmail outage. They are not dependent on Gmail.

I hear this argument

I hear this argument frequently, but as a system administrator, I know better. Those enterprise apps stored on-site tend to have even greater downtime than cloud apps.

Nothing Microsoft should be happy about

Well if I take into account how many times I have to restart machine because Windowses (yes all of them from 3x, 95... up to Vista) have crashed for known or unknown reasons, the problem that Google had shows more about a brilliant and quick action than about their failure. One computer on one Windows system that is supposed to be tested every way possible is incomparable more controllable environment than Internet. On Internet one good commercial, song or video that would catch an instant attention of many can block not only Google but the serious portion of Internet all together. Not that this was not Google's mistake of some kind, and that they should not be blamed at all, but there is nothing Microsoft should be happy about.

Helplessness

Hopefully google learns from this and this failure path is prevented in the future.

I think the real issue for most with hosted apps is the sense of helplessness of the IT folks at the customer with limited feedback from the Google IT staff.

Too Big to Fail

Maybe the issue isn't the outage, since all software has downtime. Maybe the issue is how many people we're not able to work due to the outage. If it 100k, okay that's bad. If it's 1 million, that worse and so on. Either way people will accept a certain amount of risk, but they won't put their job on the line for something that might get them fired because of a reliability issue. We have a small installation for exchange, but 100 minutes a quarter seems excessive for any mission critical system. Google is smart, they'll work through these hickups and do just fine in the future.

Where's My Stuff???

Preston, I see your point. GMail took out many customers for 100 minutes, which equated to those users not being able to pull up any documents that may have been attached to emails. Documents that were time-critical or mission-critical. And, nothing could be done about it. Therein lies the risk ...

Some people actually use IMAP

Google said that IMAP was still up and running. And it was. So how does the mission-critical data disappear now? So if Gmail's down, you're computer immediately bursts into flames? Your hard drive spontaneously fails? Seriously.