Journalism of Courage
Advertisement
Premium

What the AWS outage reveals about the internet’s backbone

AWS outage October 2025: The cloud computing market is said to have a brittle foundation because it is dominated by just a few hyperscalers such as AWS, Microsoft Azure, and Google Cloud.

AWS competes with the cloud services offered by Google and Microsoft.AWS competes with the cloud services offered by Google and Microsoft. (Reuters Photo)

Amazon Web Services (AWS) is a cloud storage and computing provider that hosts many of the world’s most-used online services including Netflix, WhatsApp, ChatGPT, Fortnite, Snapchat, Roblox, Signal, Reddit, Canva, and even multiple government websites. It has cornered 30 per cent of the global cloud computing market with much of that traffic running through several clusters of data centres primarily located in Virginia, United States, which is also home to US-East-1 (its most important data centre campus).

This means that when the US-East-1 data centre region experiences issues such as an outage, the ripple effects are felt globally and that is exactly what happened on Monday, October 20.

Three hours after the outage was confirmed, Amazon said that the cloud service had returned to normal operations. However, the company also said that some AWS services had a backlog of messages that would take a few hours to process.

What led to the AWS outage?

While the company is yet to share a detailed post-mortem report on the outage, AWS said that it traced the source of the problem to the “DynamoDB endpoint in the US-East-1 Region.”

DynamoDB is a centralised database service used by many internet-based services to track user information, store key data, and manage operations. AWS also attributed the outage to a domain name system (DNS) error. DNS works as a kind of phonebook for the internet, it translates internet addresses into machine-readable IP addresses that connect browsers and apps with websites and other underlying web services.

DNS errors are not that uncommon and can be addressed fairly quickly. But since so many sites and services rely on AWS, a DNS error can lead to widespread disruption. The system’s failure to correctly resolve which server to connect to, resulted in cascading failures that took down large swathes of the internet.

The outage on Monday was the third major failure tied to AWS’s US-East-1 data centre cluster in the past five years. It not only underscores how dependent businesses have become on the tech giants that deliver cloud computing services but also serves as a reminder of how fragile the internet’s backbone can be even if the outage is resolved quickly.

Story continues below this ad

How did AWS become the internet’s backbone?

The roots of AWS date back to the 2000s when Amazon was struggling with scale problems, which forced the e-commerce company to set up robust internal servers and web infrastructure. Soon, these core systems became the foundation of a new services business that is now AWS. The billion-dollar company’s success is often credited to its early-mover advantage. It was the first firm to launch a modern cloud infrastructure service back in 2006, which gave it a massive headstart as competitors took years to catch up.

Today, hyperscalers such as AWS, Microsoft Azure, and Google Cloud together control two-thirds of the global cloud computing market.

During the COVID-19 pandemic, several companies sought to digitise operations and move to the public cloud in order to tap into a wide range of services. This led to a further increase in revenues for cloud service providers like AWS, Azure, and Google Cloud that already dominate the global market. Earlier this year, Gartner forecast a 21.4 per cent jump in worldwide end-user spending on public cloud services to more than $723 billion.

Relying on cloud giants like AWS has strengthened cybersecurity and stability around the world by establishing common guardrails and best practices for all customers. Yet, even with their generally resilient infrastructure, AWS and its competitors remain vulnerable to large-scale disruptions because the internet is a complex web of overlapping services.

Story continues below this ad

Past outages on this scale have stemmed from various issues ranging from faulty updates and accidental injection of bad code to incompatible third-party software changes. For instance, a faulty software update by cybersecurity company CrowdStrike affected devices running Microsoft Windows and caused massive disruptions around the world in 2024.

Why is concentration of cloud services risky?

Experts have repeatedly pointed out that competition in the cloud computing market has been stifled by tactics such as bundling services, opaque pricing schemes, and high switching costs that keep customers locked in.

AWS, Google Cloud, and Microsoft Azure have been known to impose egress fees, dubbed as the ‘cloud tax’, on customers who want to migrate to another cloud provider, or download and move their data elsewhere. These practices have created steep entry barriers for potential challengers, allowing big tech companies to maintain their dominance over critical parts of the internet’s infrastructure.

The concentration of the internet’s core infrastructure in the hands of just a few major cloud providers has created critical single points of failure. A three-to-six-day outage at one of the top three cloud providers could cause $15 billion in damages in the US, according to a 2018 study by insurance giant Lloyd. The risk management processes of these big tech companies are also opaque.


Commenting how it was “pretty weird” that even secure messaging platforms like Signal are dependent on a single cloud service provider, tech billionaire Elon Musk said, “It means that AWS is in the loop and can take out Signal at any time.” He also highlighted that X Chat, a dedicated messaging service developed by Musk-owned X, does not rely on AWS and hence, was able to stay outline amid the outage.

Story continues below this ad

Can AI drive competition in the cloud services industry?

While large cloud computing providers still rule the global market, the artificial intelligence (AI) boom has opened the door to a new generation of players looking to challenge the status-quo.

For instance, Ubicloud is a startup that is looking to build an open-source alternative to AWS. It was part of Y Combinator’s W24 Batch and raised $16 million in seed funding. Companies like CoreWeave, Together AI, and Lambda Labs have also capitalised on the surge in demand for computing power driven by the AI boom, with some of them looking to offer distributed compute capacity.

Regulators could also play a role in levelling the playing field as big tech companies such as Meta, Google, Apple, and Amazon are already facing antitrust scrutiny in the US and European Union.

From the homepage
Tags:
  • Amazon Amazon Web Services cloud computing cloud services Cloud storage
Edition
Install the Express App for
a better experience
Featured
Trending Topics
News
Multimedia
Follow Us
Bihar Election ResultsNitish factor propels NDA towards Bihar sweep, Tejashwi fails to retain strongholds
X