Industry


Ads by TechWords

See your link here


Seth Weintraub's picture
Seth Weintraub

Apple versus Google

Google Gmail down across the world, massive outage (updated)

We are in the midst of a huge Google outage it appears. I've got users in Europe and Asia that are also out. 

I've also tried to log onto Gmail from servers in  London, Spain and France without any luck.

The outage is affecting both Apps accounts and personal Gmail.

This one looks to be big...and global.

Google's App status dashboard says that it is just mail that has been affected.

More to come as information becomes available.

Update 5:00pm Eastern time:

The Gmail blog says the following:

We know many of you are having trouble accessing Gmail right now — we are too, and we definitely feel your pain. We don't usually post about minor issues here (the Apps status dashboard and the Gmail Help Center are usually where this kind of information goes). Because this is impacting so many of you, we wanted to let you know we're currently looking into the issue and hope to have more info to share here shortly. If you have IMAP or POP set up already, you should be able to access your mail that way in the meantime. We're terribly sorry for the inconvenience and will get Gmail back up and running as soon as possible. 

Update: 5:20 - I am able to get into my personal Google apps mail but it is very slow

Update: Gmail blog has explanation:

Gmail's web interface had a widespread outage earlier today, lasting about 100 minutes. We know how many people rely on Gmail for personal and professional communications, and we take it very seriously when there's a problem with the service. Thus, right up front, I'd like to apologize to all of you — today's outage was a Big Deal, and we're treating it as such. We've already thoroughly investigated what happened, and we're currently compiling a list of things we intend to fix or improve as a result of the investigation.

Here's what happened: This morning (Pacific Time) we took a small fraction of Gmail's servers offline to perform routine upgrades. This isn't in itself a problem — we do this all the time, and Gmail's web interface runs in many locations and just sends traffic to other locations when one is offline. 

However, as we now know, we had slightly underestimated the load which some recent changes (ironically, some designed to improve service availability) placed on the request routers — servers which direct web queries to the appropriate Gmail server for response. At about 12:30 pm Pacific a few of the request routers became overloaded and in effect told the rest of the system "stop sending us traffic, we're too slow!". This transferred the load onto the remaining request routers, causing a few more of them to also become overloaded, and within minutes nearly all of the request routers were overloaded. As a result, people couldn't access Gmail via the web interface because their requests couldn't be routed to a Gmail server. IMAP/POP access and mail processing continued to work normally because these requests don't use the same routers.

The Gmail engineering team was alerted to the failures within seconds (we take monitoring very seriously). After establishing that the core problem was insufficient available capacity, the team brought a LOT of additional request routers online (flexible capacity is one of the advantages of Google's architecture), distributed the traffic across the request routers, and the Gmail web interface came back online.

What's next: We've turned our full attention to helping ensure this kind of event doesn't happen again. Some of the actions are straightforward and are already done — for example, increasing request router capacity well beyond peak demand to provide headroom. Some of the actions are more subtle — for example, we have concluded that request routers don't have sufficient failure isolation (i.e. if there's a problem in one datacenter, it shouldn't affect servers in another datacenter) and do not degrade gracefully (e.g. if many request routers are overloaded simultaneously, they all should just get slower instead of refusing to accept traffic and shifting their load). We'll be hard at work over the next few weeks implementing these and other Gmail reliability improvements — Gmail remains more than 99.9% available to all users, and we're committed to keeping events like today's notable for their rarity. 

 I think the outages are most noteworthy for their severity and huge affect on our lives.

What People Are Saying

What was the cause of the outage?

It was a bug.

What happened here was a bug in Google's load-balancing code that prevented the load from being correctly shared amongst fewer servers.

When designing systems that load-balance and fail-over, it's dangerous to include logic that rejects connections, based on an absolute, arbitrary load factor. Better to have flexible, proportional triggers than to rely on a fixed load trigger.

There's more about this from around the Web in today's IT Blogwatch:
Gmail email FAIL: why Gmail went down #gonegoogle

I must say, I find all the

I must say, I find all the complaints about gmail being down pretty amazing. Most people who use gmail pay exactly ZERO for the service. I don't think it is at all reasonable to expect 100% up-time from a free service. If you are really dependent on email, take measures to minimize problems. After all, IMAP and POP access ARE up, so if you really need you email, you could get it that way.

imap and pop died too

imap and pop died too

Not just free users affected.

I administrate Google Apps for a medium sized business. I can assure you it affected the paying customers as well. We were out until the end, no access at all to our mail. It should be noted that everything else was working well despite the trouble with the Mail servers.

Not free for everyone

Not everyone uses the free Google Apps or Gmail, and the article doesn't say that only the free services were/are affected.

It's impacting video uploading on YouTube!

I've been seeing this problem all afternoon both with Gmail and with YouTube's video uploads. Fortunately, we launched our new series on YouTube BEFORE the outage!

Just a taste

Just a taste of what awaits in the Cloud.

Gmail down

While Gmail is down, I needed my contacts, I opened a Ymail account and transferred all contacts very slick very fast, something is still working at gmail. This took about 5 minutes.

Down in Trinidad and Tobago as well

Gmail is down in Trinidad and Tobago as well.

And it's back up now

weeee :)

It was down for about an hour.