Amazon's AWS Outage Shows How Its Complex Cloud Makes Backup Plans Difficult

Amazon said that an “impairment” of network devices in its AWS Virginia data centre region caused the prolonged outage earlier this week.

Highlights
  • Amazon has 24.1 percent of the overall market
  • Amazon is the world's biggest cloud computing firm
  • AWS itself has critical "dependencies" within its own services

Major companies using Amazon's data services got a painful lesson this week about how the complexity and market dominance of the company's cloud unit make it difficult to back up their data with other providers, analysts and experts told Reuters.

Amazon said that "an impairment of several network devices" in its Amazon Web Services (AWS) Virginia data centre region caused the prolonged outage on Tuesday. The outage temporarily interrupted streaming platforms Netflix and Disney+, trading app Robinhood and even Amazon's own e-commerce site, which makes heavy use of AWS.

An Amazon spokesperson told Reuters on Wednesday that the issues had been resolved.

The huge trail of damage from a network problem at a single region that AWS calls "US-EAST-1" underscored how difficult it is for companies to spread their cloud computing around.

With 24.1 percent of the overall market, according to research firm IDC, Amazon is the world's biggest cloud computing firm. Rivals like Microsoft, Alphabet's Google, and Oracle are trying to lure AWS customers to use parts of their clouds, often as a backup.

But crafting a complex online service that can be easily shifted from one provider to another in case of emergency is far from simple, said Naveen Chhabra, a senior analyst with research firm Forrester. Rather than being a singular "cloud," AWS is actually composed of hundreds of different services, from basic building blocks like computing power and storage to advanced services like high-speed databases and artificial intelligence training.

Any given website, Chhabra said, might use several dozen of those individual services, each of which must work for the site to function. It is difficult to make a backup on another cloud provider because some services are proprietary to AWS and some work very differently at another provider.

"It's like saying, 'Can I put an SUV body on a sedan chassis?' Maybe, if everything is all the same and lines up. But there is no guarantee," Chhabra said.
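Chhabra's point that every one of a site's services must work at once can be put in rough numbers. The sketch below is illustrative only: the 99.9 percent figure and the service counts are hypothetical, not AWS data, and it assumes services fail independently of one another, which an outage confined to a single region does not respect.

```python
def combined_availability(per_service_availability: float, service_count: int) -> float:
    """Chance that every service is up at the same moment.

    Assumes each service fails independently (a simplifying assumption).
    """
    if not 0.0 <= per_service_availability <= 1.0:
        raise ValueError("availability must be between 0 and 1")
    if service_count < 0:
        raise ValueError("service_count must not be negative")
    return per_service_availability ** service_count


# Hypothetical inputs: each service is up 99.9 percent of the time.
for count in (1, 12, 36):
    share = combined_availability(0.999, count)
    print(f"{count} services: all up {share:.2%} of the time")
```

The more services a site chains together, the lower the odds that all of them are healthy at once, even when each one is individually reliable.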

Another issue that makes it hard for businesses to diversify is that AWS makes it relatively cheap to send data into its cloud, but then charges higher prices for "egress fees" to get data out of its cloud to take to a rival.

"That amplifies issues like this (outage) when they happen," said Matthew Prince, chief executive of internet security firm Cloudflare. "A more resilient cloud is one where egress fees are eliminated and customers can be multi-cloud. I think that would actually increase the faith customers have in the cloud."

Dependencies in one region

AWS itself has critical "dependencies" within its own services, which are linked together in ways that can cause one to fail when another fails, said Angelique Medina, head of product marketing at Cisco's ThousandEyes. That is because AWS's complex services are often built on top of its own more basic services. One problem that crops up with a basic function like networking can cascade through services that depend on it.

Early on in the incident on Tuesday, AWS said the outage was "affecting some of our monitoring and incident response tooling, which is delaying our ability to provide updates."
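The cascade Medina describes can be modelled as a walk over a dependency graph. The following is a minimal sketch, not a description of how AWS is actually wired: the service names and the links between them are hypothetical, chosen only to show how one low-level failure can reach everything built on top of it, including monitoring.

```python
from collections import deque

# Hypothetical map: each service lists the services it depends on.
DEPENDS_ON = {
    "networking": [],
    "dns": [],
    "storage": ["networking"],
    "compute": ["networking"],
    "database": ["storage", "compute"],
    "monitoring": ["compute"],
}


def impacted_services(depends_on, failed):
    """Return the failed service plus everything that directly or indirectly depends on it."""
    # Invert the map so each service points at the services built on top of it.
    dependents = {name: [] for name in depends_on}
    for service, deps in depends_on.items():
        for dep in deps:
            dependents.setdefault(dep, []).append(service)

    # Breadth-first walk outward from the failed service.
    impacted = {failed}
    queue = deque([failed])
    while queue:
        current = queue.popleft()
        for dependent in dependents.get(current, []):
            if dependent not in impacted:
                impacted.add(dependent)
                queue.append(dependent)
    return impacted


print(sorted(impacted_services(DEPENDS_ON, "networking")))
print(sorted(impacted_services(DEPENDS_ON, "dns")))
```

In this toy graph a networking fault takes down storage, compute, the database and monitoring, while a fault in a service nothing else relies on stays contained.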

Medina said AWS also seems to have critical services clustered in its US-EAST-1 region, where another outage last year also had a widely felt impact.

"That's where a lot of their critical dependencies have been located historically," Medina said. "Over time, they've diversified a bit."

Chhabra, the Forrester analyst, said Amazon has done a lot of "heavy lifting" to make its own services resilient. But what Amazon does not do for its customers is build applications in a way that can withstand an outage by tapping multiple locations or providers.

Doing so can often involve extra work that might not always be worth it when cloud outages remain relatively rare.
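As a rough illustration of what that extra work looks like, the sketch below tries a list of backends in order and falls through to the next when one fails. It is a simplified assumption of how an application might tap multiple locations or providers: the backend names and functions are hypothetical stand-ins, and a real deployment would also have to keep data in sync across providers, which is the harder part.

```python
from typing import Callable, List, Sequence, Tuple


def fetch_with_failover(backends: Sequence[Tuple[str, Callable[[], str]]]) -> Tuple[str, str]:
    """Try each backend in order and return (name, result) from the first that works."""
    errors: List[str] = []
    for name, fetch in backends:
        try:
            return name, fetch()
        except Exception as exc:  # any failure means moving on to the next backend
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all backends failed: " + "; ".join(errors))


# Hypothetical backends standing in for a primary region and a backup provider.
def primary_region() -> str:
    raise ConnectionError("network devices impaired")


def backup_provider() -> str:
    return "response from backup"


served_by, body = fetch_with_failover([
    ("primary-region", primary_region),
    ("backup-provider", backup_provider),
])
print(served_by, "->", body)
```

The retry loop itself is short. The cost the analysts point to lies in building and paying for a second environment that can actually answer when the first one cannot.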

"It's this tradeoff you always have between something that is decentralised, something that's secure and something that's useable," said Charly Fei, product lead for Inter Blockchain Communication at The Interchain Foundation, which is focused on technologies for decentralising computing. "It's not something where you'll ever get a perfect solution that gets all three."

© Thomson Reuters 2021


© Copyright Red Pixels Ventures Limited 2024. All rights reserved.