Amazon Cloud outage bad for business
- 27 April, 2011 12:29
Outages to Amazon’s Elastic Compute Cloud (EC2) offerings over the weekend have received plenty of global coverage, but they left at least one Australian business frustrated.
Simon Ellis, chief technology officer of Australian-Canadian startup LabSlice, told Computerworld Australia that the outage to Amazon’s eastern US region on Friday came at a bad time for the company.
The company, which uses EC2 Web hosting, offers services to companies wanting to create sales/training demos, evaluations, and training environments in the Cloud.
Ellis was attempting to give a demonstration of one of the company's products to clients at the time.
"It's the second time there has been an outage," Ellis said. "It doesn't reflect too well on our business or Amazon considering we are a Cloud service provider and we are promoting Amazon as a high-level product."
The company used the Cloud service to host and download training images.
"As I was giving a demonstration, the images which are normally available in three minutes, I was left waiting for 15 minutes with nothing happening. It takes at least an hour until the fault status shows up on Amazon's website. We were in a ‘no man's land’ for a while."
Ellis said that while his company had mitigated the possible risks associated with services from third parties like Amazon, he would not be "too impressed" if an outage happened again.
"Everything is running now but this is the second time that I have been demoing the software and there was an outage. As long as it's not a multi-day outage, it is something we can brush off for the meantime."
Another Melbourne company, Cyclopic Energy, which also uses EC2, was luckier.
Technical director Rick Morgans said the engineering firm was an "atypical Amazon user", using the service for cluster computing rather than critical data storage.
"We'll fire up 16 cluster compute instances and have them running for 20 to 40 hours and then shut down," he said. "I'm not sure if the cluster compute instances were affected as we didn't have any running."
Popular social networking services including Reddit, Foursquare, Quora, Awe.sm and others were left without service for portions of the weekend as a result of the Amazon outage, partially fuelling increased commentary over the issue.
Since then, Amazon has slowly revealed information about the cause, which it attributed to a “networking event” that triggered a capacity shortage in some of the four availability zones that make up the eastern US region operated out of Virginia. The issue ultimately led to increased latency and some data outages.
Amazon operates two regions in the US, in Virginia and California, as well as regions in Singapore, Ireland and, most recently, Japan. Each region is split into ‘availability zones’ used to provide multiple, redundant instances of client data and computing clusters.
With client service level agreements promising 99.5 per cent uptime, the outage has called Amazon’s use of availability zones into question.
Though some have laid the blame for the outage on Amazon, others have argued it was a simple issue of risk mitigation, one that CIOs had no excuse not to address when using Cloud services.
Ellis warned companies should have appropriate backup strategies in place, regardless of the provider or claimed reliability.
Follow Hamish Barwick on Twitter: @HamishBarwick
Comments
Nilam Doctor
Any outage for a company in small or medium scale is bad for business. This implies that even a small business should look for their own dedicated server hosting. It is like buying your own car for going to office rather than using public transport.
Well we have to decide....
IT Realist
Did this guy consider the risk of using this technology?
Of course it reflects badly on him - because it is not up to the standards that most IT leaders require - and there are so many links in the chain that can break that you just can't manage.
If you are looking for cheap computing and don't need rock solid reliability - then go for it....but don't expect it to be perfect.
And expect this will happen again on a big scale...it's not the last outage....just ask Sony.