The AWS outage explained: What happened, who was impacted, and what services are back online?

Amazon Web Services (AWS) sign pictured at the Tech & Innovation Expo during the South by Southwest (SXSW) Sydney festival in Sydney, Australia. — (Image credit: Getty Images)

Amazon Web Services (AWS) has confirmed a recent outage that impacted customers was caused by overheating at a North Virginia data center.

The disruption affected one of AWS' six Availability Zones, use1-az4 in the AWS US-EAST-1 region. This is one of the company's most heavily used regions globally.

Notably, the incident hit platforms including cryptocurrency exchange Coinbase, disrupting core exchange functions for more than five hours. Other reported victims include the CME Group trading platform and major gambling company FanDuel.

Coinbase last night warned that some users might experience delayed sends and receives on the Solana network and for ALEO, but said it was working on the issue.

Overheating behind AWS outage

In an update to customers, AWS attributed the cause of the outage to overheating. The hyperscaler is yet to confirm how the overheating occurred.

"We have experienced an increase in temperatures within a single data center, which in some cases has caused impairments for instances in the Availability Zone," AWS said in a status report.

"EC2 instances and EBS volumes hosted on impacted hardware are affected by the loss of power during the thermal event."

In its latest update, AWS said it had shifted traffic away from the impacted zone. The hyperscaler said it was still carrying out mitigation efforts.

These are taking longer than expected to bring additional cooling system capacity online and recover the remaining affected infrastructure safely and in a controlled manner.

AWS warned some customers will continue to see their affected EC2 instances and EBS volumes as impaired until it can achieve full recovery. It said it currently didn't have an ETA for this.

What services are back online?

A number of services are back online following the outage, according to the hyperscaler. This includes:

AWS IoT Core
AWS NAT Gateway
Amazon Elastic Kubernetes Service
Amazon Elastic Load Balancing
Amazon Redshift

Some services are still impacted at time of writing, including:

Amazon ElastiCache
Amazon Managed Streaming for Apache Kafka
Amazon OpenSearch Service
Amazon SageMaker

Yet another AWS outage

It's not the first time that a major AWS outage has caused chaos. Last year, hundreds of apps and websites including Slack, Zoom, Coinbase, Snapchat, and Signal were taken down in a global outage.

Banking applications including Lloyds and Halifax also saw customers unable to access services. On that occasion, AWS attributed the outage to a DNS issue.

The incident highlights the extent to which major websites and apps are dependent on just a few tech giants.

In 2024, for example, issues with CrowdStrike saw hospitals, banks, and airports in Australia, New Zealand, India, Japan, the US, Germany, and the UK seriously affected.

Follow ITPro on Google News and add us as a preferred source to keep tabs on all our latest news, analysis, views, and reviews.

You can also follow ITPro on LinkedIn, X, Facebook, and BlueSky.

TOPICS

Emma Woollacott is a freelance journalist writing for publications including the BBC, Private Eye, Forbes, Raconteur and specialist technology titles.

Overheating behind AWS outage

What services are back online?

Yet another AWS outage

FOLLOW US ON SOCIAL MEDIA