Rackspace datacenter infrastructure took 12-hour nap in London, Sydney, Hong Kong
Borked SANs, not a security SNAFU, identified as the cause. Services are back, but Linux VMs must reboot
Updated Rackspace is in a mess again.
The cloudy concern's status page reports outages in its SYD2, LON5, LON3, and HKG5 datacenter infrastructure across May 29 and 30.
Rackspace's first incident report is timestamped 29 May 22:24 CDT.
A subsequent update identified the issue as related to Dense Wavelength-Division Multiplexing (DWDM) in London, as that facility connects to a fiber transport network that Rackspace uses to carry traffic between its datacenters and internet service providers.
But an hour later Rackspace ruled out DWDM as a cause of the incident. The company has not updated its status page since.
The Register has obtained an email that a SaaS company hosted on Rackspace sent to its customers.
"Our hosting provider Rackspace have confirmed they are experiencing connectivity issues," the email opens. "All available engineers have been engaged and are working to resolve the issue with the highest priority."
It gets worse: Rackspace has warned customers of its London datacenters that whatever's causing the issue may disrupt their backups, and offered instructions on how to detect any failures.
At the time of writing – 02:45 CDT on May 30 – Rackspace had not updated its status page for over an hour. The Register has sought comment and will update this story if we receive useful information.
- Rackspace racks up job cuts amid market downturn and talk of offshoring
- Rackspace confirms ransomware attack behind days-long email meltdown
- On the 12th day of the Rackspace email disaster, it did not give to me …
- Microsoft ditches plans for 500,000 sq ft London office
This outage comes at a terrible time for Rackspace as its US and UK customers emerge from a holiday weekend.
The company is also far from out of the woods after the December 2022 ransomware attack on its Hosted Exchange environment, which caused weeks of disruption and saw the service abandoned.
That incident led to a protracted inability to access data, again with terrible timing as customers prepared for the festive season. Class actions are under way to give aggrieved customers a chance at compensation.
And now Rackspace customers on three continents have a new set of worries. ®
Updated at 23:00 UTC, May 30
Rackspace has identified the cause of the problem: "I/O limits in the multi-tenant Shared SAN environment had reset incorrectly."
Rackspace ran a script to reset the value and as of 12:10 CDT services were restored – with some exceptions.
"It has been identified that any impacted Linux VMs (virtual machines) will not automatically recover if storage has been adjusted and will need to be manually rebooted. Rackspace engineers can reboot impacted VMs from the portal where necessary," states Rackspace's status update.
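For admins wondering whether their VMs were among those hit, a common symptom when SAN-backed storage stalls is that the Linux kernel remounts filesystems read-only after I/O errors. The commands below are a generic sketch of how one might check for that; they are standard Linux diagnostics, not a Rackspace-documented procedure.

```shell
# List any mounted filesystems the kernel has flagged read-only
# (field 4 of /proc/mounts holds the mount options).
awk '$4 ~ /(^|,)ro(,|$)/ {print $1, $2}' /proc/mounts

# Scan the kernel ring buffer for storage I/O errors; dmesg may be
# restricted on some systems, so tolerate failure in this sketch.
dmesg 2>/dev/null | grep -iE 'i/o error|blk_update_request|remount.*read-only' || true
```

A VM showing its root or data volume among the read-only mounts would need the manual reboot Rackspace describes.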
A Rackspace spokesperson told us the incident is not considered a security matter.