Skip to main content

Frequency and Severity of Data Center Outages Not Improving in 2024

The frequency and severity of data center outages remain mainly unchanged from 2023 or show small improvements, according to the Global Data Center Survey from Uptime Institute.

"The need for resiliency is well understood by all data center operators and across the supply chain. Although advances in IT, and software-based distributed resiliency, have offered the potential for operators to de-emphasize site-level resiliency, this has not happened. The need to avoid outages at a site level and maintain IT service, despite the high cost, remains a critical issue for operators in 2024," the report executive summary states.

"Uptime expects distributed resiliency strategies to play an increasingly important role in mitigating the effects of outages in the coming years. With further investments in cloud-style application architecture and software-based approaches, these approaches will improve over time."

Uptime also suggest that resiliency can benefit from improved training, processes and greater management attention on the importance of availability. The survey found that 80& of data center operators believe their most recent significant downtime incidents would have been preventable with better management, processes, or configuration.

"This data highlights the need for more testing and training, and a continued re-examination of existing systems and processes. There is also an opportunity to learn from the experience of previous outages, and from the industry’s progress in adapting to an expanding risk landscape," the report adds.

Other key findings from the 2024 report include:

■ Enterprises continue to meet their IT needs with hybrid architectures. More than one half of workloads (55%) are now off-premises, continuing the gradual trend of recent years, and survey respondents expect that number to increase even more through 2026. Meanwhile, many continue to maintain their own data centers.

■ Most operators recognize the benefits of AI and its potential. But despite many operators planning to host the technology, trust in AI for use in data center operations has declined for the third year in a row.

■ Average server rack densities are increasing but remain below 8 kilowatts (kW). Most facilities do not have racks above 30kW, and those that do have only a few. This is expected to change in coming years.

■ Average PUE levels remain mostly flat for the fifth consecutive year, but this conceals advances in newer, larger facilities.

■ Staffing challenges have neither improved nor worsened from 2023. More effort is needed to expand labor pools and skillsets to match the pace of capacity growth.

■ Fewer than one half of data center owners and operators are tracking the metrics needed to assess their sustainability and/or meet pending regulatory requirements.

"Our data shows operators poised for major changes ahead on multiple levels," said Andy Lawrence, Executive Director of Research, Uptime Intelligence. "In 2024, we see the challenges of increased demand impacting power and cooling capabilities of existing facilities and the need for further investment to keep up with the demand. At the same time the industry needs to focus on continued staffing challenges to match capacity growth. And regulatory requirements are here and cannot be dismissed."

Methodology: Uptime conducted this year’s annual Global Data Center Survey online and by email in the first half of 2024. The survey participants represent a wide range of industry verticals in multiple countries. Responses were collected from a total of 879 end users registered for the survey and answered at least one question. More than one half are located in North America and Europe. Approximately one third of respondents work for professional IT/data center service providers (staff with operational or executive responsibilities for a third-party data center), such as those offering colocation, wholesale, software or cloud computing services.

Hot Topics

The Latest

People want to be doing more engaging work, yet their day often gets overrun by addressing urgent IT tickets. But thanks to advances in AI "vibe coding," where a user describes what they want in plain English and the AI turns it into working code, IT teams can automate ticketing workflows and offload much of that work. Password resets that used to take 5 minutes per request now get resolved automatically ...

Governments and social platforms face an escalating challenge: hyperrealistic synthetic media now spreads faster than legacy moderation systems can react. From pandemic-related conspiracies to manipulated election content, disinformation has moved beyond "false text" into the realm of convincing audiovisual deception ...

Traditional monitoring often stops at uptime and server health without any integrated insights. Cross-platform observability covers not just infrastructure telemetry but also client-side behavior, distributed service interactions, and the contextual data that connects them. Emerging technologies like OpenTelemetry, eBPF, and AI-driven anomaly detection have made this vision more achievable, but only if organizations ground their observability strategy in well-defined pillars. Here are the five foundational pillars of cross-platform observability that modern engineering teams should focus on for seamless platform performance ...

For all the attention AI receives in corporate slide decks and strategic roadmaps, many businesses are struggling to translate that ambition into something that holds up at scale. At least, that's the picture that emerged from a recent Forrester study commissioned by Tines ...

From smart factories and autonomous vehicles to real-time analytics and intelligent building systems, the demand for instant, local data processing is exploding. To meet these needs, organizations are leaning into edge computing. The promise? Faster performance, reduced latency and less strain on centralized infrastructure. But there's a catch: Not every network is ready to support edge deployments ...

Every digital customer interaction, every cloud deployment, and every AI model depends on the same foundation: the ability to see, understand, and act on data in real time ... Recent data from Splunk confirms that 74% of the business leaders believe observability is essential to monitoring critical business processes, and 66% feel it's key to understanding user journeys. Because while the unknown is inevitable, observability makes it manageable. Let's explore why ...

Organizations that perform regular audits and assessments of AI system performance and compliance are over three times more likely to achieve high GenAI value than organizations that do not, according to a survey by Gartner ...

Kubernetes has become the backbone of cloud infrastructure, but it's also one of its biggest cost drivers. Recent research shows that 98% of senior IT leaders say Kubernetes now drives cloud spend, yet 91% still can't optimize it effectively. After years of adoption, most organizations have moved past discovery. They know container sprawl, idle resources and reactive scaling inflate costs. What they don't know is how to fix it ...

Artificial intelligence is no longer a future investment. It's already embedded in how we work — whether through copilots in productivity apps, real-time transcription tools in meetings, or machine learning models fueling analytics and personalization. But while enterprise adoption accelerates, there's one critical area many leaders have yet to examine: Can your network actually support AI at the speed your users expect? ...

The more technology businesses invest in, the more potential attack surfaces they have that can be exploited. Without the right continuity plans in place, the disruptions caused by these attacks can bring operations to a standstill and cause irreparable damage to an organization. It's essential to take the time now to ensure your business has the right tools, processes, and recovery initiatives in place to weather any type of IT disaster that comes up. Here are some effective strategies you can follow to achieve this ...

Frequency and Severity of Data Center Outages Not Improving in 2024

The frequency and severity of data center outages remain mainly unchanged from 2023 or show small improvements, according to the Global Data Center Survey from Uptime Institute.

"The need for resiliency is well understood by all data center operators and across the supply chain. Although advances in IT, and software-based distributed resiliency, have offered the potential for operators to de-emphasize site-level resiliency, this has not happened. The need to avoid outages at a site level and maintain IT service, despite the high cost, remains a critical issue for operators in 2024," the report executive summary states.

"Uptime expects distributed resiliency strategies to play an increasingly important role in mitigating the effects of outages in the coming years. With further investments in cloud-style application architecture and software-based approaches, these approaches will improve over time."

Uptime also suggest that resiliency can benefit from improved training, processes and greater management attention on the importance of availability. The survey found that 80& of data center operators believe their most recent significant downtime incidents would have been preventable with better management, processes, or configuration.

"This data highlights the need for more testing and training, and a continued re-examination of existing systems and processes. There is also an opportunity to learn from the experience of previous outages, and from the industry’s progress in adapting to an expanding risk landscape," the report adds.

Other key findings from the 2024 report include:

■ Enterprises continue to meet their IT needs with hybrid architectures. More than one half of workloads (55%) are now off-premises, continuing the gradual trend of recent years, and survey respondents expect that number to increase even more through 2026. Meanwhile, many continue to maintain their own data centers.

■ Most operators recognize the benefits of AI and its potential. But despite many operators planning to host the technology, trust in AI for use in data center operations has declined for the third year in a row.

■ Average server rack densities are increasing but remain below 8 kilowatts (kW). Most facilities do not have racks above 30kW, and those that do have only a few. This is expected to change in coming years.

■ Average PUE levels remain mostly flat for the fifth consecutive year, but this conceals advances in newer, larger facilities.

■ Staffing challenges have neither improved nor worsened from 2023. More effort is needed to expand labor pools and skillsets to match the pace of capacity growth.

■ Fewer than one half of data center owners and operators are tracking the metrics needed to assess their sustainability and/or meet pending regulatory requirements.

"Our data shows operators poised for major changes ahead on multiple levels," said Andy Lawrence, Executive Director of Research, Uptime Intelligence. "In 2024, we see the challenges of increased demand impacting power and cooling capabilities of existing facilities and the need for further investment to keep up with the demand. At the same time the industry needs to focus on continued staffing challenges to match capacity growth. And regulatory requirements are here and cannot be dismissed."

Methodology: Uptime conducted this year’s annual Global Data Center Survey online and by email in the first half of 2024. The survey participants represent a wide range of industry verticals in multiple countries. Responses were collected from a total of 879 end users registered for the survey and answered at least one question. More than one half are located in North America and Europe. Approximately one third of respondents work for professional IT/data center service providers (staff with operational or executive responsibilities for a third-party data center), such as those offering colocation, wholesale, software or cloud computing services.

Hot Topics

The Latest

People want to be doing more engaging work, yet their day often gets overrun by addressing urgent IT tickets. But thanks to advances in AI "vibe coding," where a user describes what they want in plain English and the AI turns it into working code, IT teams can automate ticketing workflows and offload much of that work. Password resets that used to take 5 minutes per request now get resolved automatically ...

Governments and social platforms face an escalating challenge: hyperrealistic synthetic media now spreads faster than legacy moderation systems can react. From pandemic-related conspiracies to manipulated election content, disinformation has moved beyond "false text" into the realm of convincing audiovisual deception ...

Traditional monitoring often stops at uptime and server health without any integrated insights. Cross-platform observability covers not just infrastructure telemetry but also client-side behavior, distributed service interactions, and the contextual data that connects them. Emerging technologies like OpenTelemetry, eBPF, and AI-driven anomaly detection have made this vision more achievable, but only if organizations ground their observability strategy in well-defined pillars. Here are the five foundational pillars of cross-platform observability that modern engineering teams should focus on for seamless platform performance ...

For all the attention AI receives in corporate slide decks and strategic roadmaps, many businesses are struggling to translate that ambition into something that holds up at scale. At least, that's the picture that emerged from a recent Forrester study commissioned by Tines ...

From smart factories and autonomous vehicles to real-time analytics and intelligent building systems, the demand for instant, local data processing is exploding. To meet these needs, organizations are leaning into edge computing. The promise? Faster performance, reduced latency and less strain on centralized infrastructure. But there's a catch: Not every network is ready to support edge deployments ...

Every digital customer interaction, every cloud deployment, and every AI model depends on the same foundation: the ability to see, understand, and act on data in real time ... Recent data from Splunk confirms that 74% of the business leaders believe observability is essential to monitoring critical business processes, and 66% feel it's key to understanding user journeys. Because while the unknown is inevitable, observability makes it manageable. Let's explore why ...

Organizations that perform regular audits and assessments of AI system performance and compliance are over three times more likely to achieve high GenAI value than organizations that do not, according to a survey by Gartner ...

Kubernetes has become the backbone of cloud infrastructure, but it's also one of its biggest cost drivers. Recent research shows that 98% of senior IT leaders say Kubernetes now drives cloud spend, yet 91% still can't optimize it effectively. After years of adoption, most organizations have moved past discovery. They know container sprawl, idle resources and reactive scaling inflate costs. What they don't know is how to fix it ...

Artificial intelligence is no longer a future investment. It's already embedded in how we work — whether through copilots in productivity apps, real-time transcription tools in meetings, or machine learning models fueling analytics and personalization. But while enterprise adoption accelerates, there's one critical area many leaders have yet to examine: Can your network actually support AI at the speed your users expect? ...

The more technology businesses invest in, the more potential attack surfaces they have that can be exploited. Without the right continuity plans in place, the disruptions caused by these attacks can bring operations to a standstill and cause irreparable damage to an organization. It's essential to take the time now to ensure your business has the right tools, processes, and recovery initiatives in place to weather any type of IT disaster that comes up. Here are some effective strategies you can follow to achieve this ...