Oracle Cloud, Netsuite, and Azure go down, hard, Down Under
Storm that passed through Sydney saw clouds sleep early and struggle to wake up
Updated Oracle, Netsuite, and Microsoft's clouds have gone down, hard, in the Sydney, Australia, region likely due to an issue at a datacenter provider in which both are tenants.
The Big Red Cloud first advised customers of an outage at 2129 Sydney time (1229 UTC) on Wednesday, and 29 minutes later wrote to inform customers that the outage had started earlier than its first emailed advisory – at 1015 UTC.
Oracle’s second email delivered the mixed message that: “We are still investigating an issue in the Australia East (Sydney) region that is impacting multiple OCI services. We have identified root cause of service failures and are working to mitigate the issue.”
The snafu took out a swathe of compute and SaaS services, with Kubernetes offerings disrupted, Exadata unavailable, and cloudy networks borked. Oracle has advised the impact of the incident means “some customers may be unable to use or connect to certain OCI services in the region.”
Oracle’s next emailed advice again changed the IT giant's story, this time referring to a “preliminary root cause” and naming that as related to “network connectivity.”
Microsoft, meanwhile, advised customers of its Azure cloud that as of approximately 0830 UTC on August 30 it was suffering an outage due to a cooling problem: "A utility power surge in the Australia East region tripped a subset of the cooling units offline in one datacenter, within one of the Availability Zones."
Temperatures rose inside the datacenter and Microsoft "proactively powered down a small subset of selected compute and storage scale units, to avoid damage to hardware."
Microsoft said it has since restored 99 percent of storage services and 99 percent of impacted Virtual Machines, and is "actively investigating individual downstream services to confirm their recovery status and mitigate remaining issues."
Downstream dependencies are preventing full recovery, we're told. Oracle is also having trouble restoring all services.
Oracle-owned cloudy ERP outfit Netsuite's status page states "A lightning storm impacted the chiller plant in the Sydney data center, and most systems were temporarily shut down to reduce temperatures. The temperatures have stabilized, and the systems are being systematically powered up."
Netsuite service is slowly being restored.
At the time of writing Oracle Cloud’s status page listed ten services as unavailable, six as experiencing service disruptions, and 12 as operational – including some services that were previously listed as unavailable.
Oracle Australia told The Register it has no comment at this time.
- Cisco's Duo Security suffers major authentication outage
- Microsoft DNS boo-boo breaks Hotmail for users around the globe
- UK flights disrupted by 'technical issue' with air traffic computer system
- Oracle shrinks its on-prem cloud into a single rack
The Register understands Oracle's networking team is most exercised right now as Big Red battles to end the downtime, consistent with the little information Oracle has provided mentioning customers may be unable to reach its cloud and ongoing connectivity issues.
Readers have suggested a lightning storm that struck Sydney last night could be the cause of the equipment breakdown. The storm sparked power outages in parts of the city that host several datacenters. Your correspondent's afternoon was darkened by the storms, which passed through at around 1600 Sydney time, a few hours before cloudy troubles began.
We've asked the datacenter companies known to have Microsoft and Oracle as tennnts for comment.
Whatever the root cause, hyperscale clouds and the datacenters that host them pride themselves on building sufficient resilience to cope with problems. Those efforts clearly have not worked in this case.
We’ll update this story if more info arrives. ®
Editor's note: This story was updated at 0000 UTC and again at 0030 UTC to reflect new information about the storm, Microsoft's outage, and knowledge of the situation we have learned from other sources. We updated the story again at 01:55 UTC to add information about Netsuite.