Skip to main content

IT Leaders Trust AI More Than Ever - Here's What That Means for Operations

Eric Johnson
PagerDuty

Global businesses are riding a new wave of AI-powered innovation to transform IT operations, or ITOps.

Research reveals that more than half of organizations have already deployed AI agents and more than a third are planning to do so. This comes amidst growing executive trust in AI agents as they are increasingly relied on within IT and other business operations.

AI can be a critical part of the IT puzzle by helping to accelerate incident response, reduce downtime and keep customers happy. These gains can shift executive attitudes, as leaders come to see AI agents not just as experimental tools, but as reliable partners in mission-critical situations. It's no surprise, then, that 81% of IT and business executives now trust AI agents to take action during a crisis.

Adding Value for Ambitious Businesses

AI is already having a profound impact on digital operations, helping organizations to innovate faster and work more efficiently. In fact, a recent survey found that 51 % of organizations using AI reported that it had a transformational impact on their operations. AI agents can amplify that further by autonomously executing complex, multi-step tasks, helping to automate entire business processes.

It's part of the reason why we're seeing so many organizations adopt AI agents. In PagerDuty's latest survey, nearly three-quarters of executives (74%) reported that AI has become so essential to operations that they would struggle to function without it. A similar share has already deployed more than one agent. Interestingly, smaller companies seem to be even more reliant on the new technology than their larger peers.

Executives surveyed also revealed that AI is expected to handle as much as a third of their departmental workload in the next 12 months. Many are deploying AI agents for code writing or reviewing, but business processes as diverse as employee onboarding and supply chain optimization could be made more efficient and cost effective through agentic AI.

Challenges Mount

AI adoption is spiking in part because a significant majority of organizations now trust the technology more than they did a year ago, with IT leaders in particular feeling more confident. The improved sentiment stems from leaders observing better-quality outputs and more positive results in practical deployments and pilots.

However, legitimate concerns persist. Organizations are most anxious about outages that may impact their AI systems. 84% of survey respondents reported having experienced at least one outage impacting either internal or external models and tools. A similar share believes their organization needs to improve its ability to detect AI failings to respond to incidents more rapidly.

In many companies, those with AI skills are spread too thin across organizations, meaning fewer experts may be available or on-call to fix issues. Even among those qualified to deal with these incidents, they may find that black box AI systems defy traditional debugging approaches. It's important to remember that many are laboring with siloed incident response/detection systems, which may exclude input from customer support and other functions. Many of these tools aren't designed to monitor complex AI systems with cascading decision layers.

AI Lights a Way Forward

Fortunately, AI can be an important ally as organizations tackle these operational issues. AI embedded into tools can learn from previous incidents to accelerate triage, reduce alert fatigue by minimizing "noise" and review recent changes to proactively surface issues. Generative AI can also add value with automatically generated post-incident reviews for enhanced team learning and with AI-generated runbooks to streamline response. It could also produce timely status updates to keep stakeholders informed of incident developments without impacting first-responder workloads.

Agentic AI can help incident response move even faster, autonomously resolving routine issues so engineers can focus on innovation. This could involve pulling relevant triage data and running diagnostics to make first responders' jobs easier, or recording and sharing incident information for other responders, and even surfacing recommended areas for ongoing improvements.

Predicting and Preventing Failure

The most advanced organizations will use AI to attain a level of operational maturity where AI and automation are used for predictive maintenance and incident prevention. As a result, these organizations can decrease the likelihood of an AI-related system error occurring, helping to ensure customers are happy, services remain uninterrupted and engineers are focused on growth rather than incident response.

It's an aspiration rather than a reality for most enterprises today, but as AI finds its way into more business-critical systems, a preventative approach to ITOps should at least be on the roadmap for teams. A combination of embedded, generative and agentic AI can be key in helping an organization be prepared for the next major outage.

Eric Johnson is Chief Information Officer at PagerDuty

Hot Topics

The Latest

Every digital customer interaction, every cloud deployment, and every AI model depends on the same foundation: the ability to see, understand, and act on data in real time ... Recent data from Splunk confirms that 74% of the business leaders believe observability is essential to monitoring critical business processes, and 66% feel it's key to understanding user journeys. Because while the unknown is inevitable, observability makes it manageable. Let's explore why ...

Organizations that perform regular audits and assessments of AI system performance and compliance are over three times more likely to achieve high GenAI value than organizations that do not, according to a survey by Gartner ...

Kubernetes has become the backbone of cloud infrastructure, but it's also one of its biggest cost drivers. Recent research shows that 98% of senior IT leaders say Kubernetes now drives cloud spend, yet 91% still can't optimize it effectively. After years of adoption, most organizations have moved past discovery. They know container sprawl, idle resources and reactive scaling inflate costs. What they don't know is how to fix it ...

Artificial intelligence is no longer a future investment. It's already embedded in how we work — whether through copilots in productivity apps, real-time transcription tools in meetings, or machine learning models fueling analytics and personalization. But while enterprise adoption accelerates, there's one critical area many leaders have yet to examine: Can your network actually support AI at the speed your users expect? ...

The more technology businesses invest in, the more potential attack surfaces they have that can be exploited. Without the right continuity plans in place, the disruptions caused by these attacks can bring operations to a standstill and cause irreparable damage to an organization. It's essential to take the time now to ensure your business has the right tools, processes, and recovery initiatives in place to weather any type of IT disaster that comes up. Here are some effective strategies you can follow to achieve this ...

In today's fast-paced AI landscape, CIOs, IT leaders, and engineers are constantly challenged to manage increasingly complex and interconnected systems. The sheer scale and velocity of data generated by modern infrastructure can be overwhelming, making it difficult to maintain uptime, prevent outages, and create a seamless customer experience. This complexity is magnified by the industry's shift towards agentic AI ...

In MEAN TIME TO INSIGHT Episode 19, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA explains the cause of the AWS outage in October ... 

The explosion of generative AI and machine learning capabilities has fundamentally changed the conversation around cloud migration. It's no longer just about modernization or cost savings — it's about being able to compete in a market where AI is rapidly becoming table stakes. Companies that can't quickly spin up AI workloads, feed models with data at scale, or experiment with new capabilities are falling behind faster than ever before. But here's what I'm seeing: many organizations want to capitalize on AI, but they're stuck ...

On September 16, the world celebrated the 10th annual IT Pro Day, giving companies a chance to laud the professionals who serve as the backbone to almost every successful business across the globe. Despite the growing importance of their roles, many IT pros still work in the background and often go underappreciated ...

Artificial Intelligence (AI) is reshaping observability, and observability is becoming essential for AI. This is a two-way relationship that is increasingly relevant as enterprises scale generative AI ... This dual role makes AI and observability inseparable. In this blog, I cover more details of each side ...

IT Leaders Trust AI More Than Ever - Here's What That Means for Operations

Eric Johnson
PagerDuty

Global businesses are riding a new wave of AI-powered innovation to transform IT operations, or ITOps.

Research reveals that more than half of organizations have already deployed AI agents and more than a third are planning to do so. This comes amidst growing executive trust in AI agents as they are increasingly relied on within IT and other business operations.

AI can be a critical part of the IT puzzle by helping to accelerate incident response, reduce downtime and keep customers happy. These gains can shift executive attitudes, as leaders come to see AI agents not just as experimental tools, but as reliable partners in mission-critical situations. It's no surprise, then, that 81% of IT and business executives now trust AI agents to take action during a crisis.

Adding Value for Ambitious Businesses

AI is already having a profound impact on digital operations, helping organizations to innovate faster and work more efficiently. In fact, a recent survey found that 51 % of organizations using AI reported that it had a transformational impact on their operations. AI agents can amplify that further by autonomously executing complex, multi-step tasks, helping to automate entire business processes.

It's part of the reason why we're seeing so many organizations adopt AI agents. In PagerDuty's latest survey, nearly three-quarters of executives (74%) reported that AI has become so essential to operations that they would struggle to function without it. A similar share has already deployed more than one agent. Interestingly, smaller companies seem to be even more reliant on the new technology than their larger peers.

Executives surveyed also revealed that AI is expected to handle as much as a third of their departmental workload in the next 12 months. Many are deploying AI agents for code writing or reviewing, but business processes as diverse as employee onboarding and supply chain optimization could be made more efficient and cost effective through agentic AI.

Challenges Mount

AI adoption is spiking in part because a significant majority of organizations now trust the technology more than they did a year ago, with IT leaders in particular feeling more confident. The improved sentiment stems from leaders observing better-quality outputs and more positive results in practical deployments and pilots.

However, legitimate concerns persist. Organizations are most anxious about outages that may impact their AI systems. 84% of survey respondents reported having experienced at least one outage impacting either internal or external models and tools. A similar share believes their organization needs to improve its ability to detect AI failings to respond to incidents more rapidly.

In many companies, those with AI skills are spread too thin across organizations, meaning fewer experts may be available or on-call to fix issues. Even among those qualified to deal with these incidents, they may find that black box AI systems defy traditional debugging approaches. It's important to remember that many are laboring with siloed incident response/detection systems, which may exclude input from customer support and other functions. Many of these tools aren't designed to monitor complex AI systems with cascading decision layers.

AI Lights a Way Forward

Fortunately, AI can be an important ally as organizations tackle these operational issues. AI embedded into tools can learn from previous incidents to accelerate triage, reduce alert fatigue by minimizing "noise" and review recent changes to proactively surface issues. Generative AI can also add value with automatically generated post-incident reviews for enhanced team learning and with AI-generated runbooks to streamline response. It could also produce timely status updates to keep stakeholders informed of incident developments without impacting first-responder workloads.

Agentic AI can help incident response move even faster, autonomously resolving routine issues so engineers can focus on innovation. This could involve pulling relevant triage data and running diagnostics to make first responders' jobs easier, or recording and sharing incident information for other responders, and even surfacing recommended areas for ongoing improvements.

Predicting and Preventing Failure

The most advanced organizations will use AI to attain a level of operational maturity where AI and automation are used for predictive maintenance and incident prevention. As a result, these organizations can decrease the likelihood of an AI-related system error occurring, helping to ensure customers are happy, services remain uninterrupted and engineers are focused on growth rather than incident response.

It's an aspiration rather than a reality for most enterprises today, but as AI finds its way into more business-critical systems, a preventative approach to ITOps should at least be on the roadmap for teams. A combination of embedded, generative and agentic AI can be key in helping an organization be prepared for the next major outage.

Eric Johnson is Chief Information Officer at PagerDuty

Hot Topics

The Latest

Every digital customer interaction, every cloud deployment, and every AI model depends on the same foundation: the ability to see, understand, and act on data in real time ... Recent data from Splunk confirms that 74% of the business leaders believe observability is essential to monitoring critical business processes, and 66% feel it's key to understanding user journeys. Because while the unknown is inevitable, observability makes it manageable. Let's explore why ...

Organizations that perform regular audits and assessments of AI system performance and compliance are over three times more likely to achieve high GenAI value than organizations that do not, according to a survey by Gartner ...

Kubernetes has become the backbone of cloud infrastructure, but it's also one of its biggest cost drivers. Recent research shows that 98% of senior IT leaders say Kubernetes now drives cloud spend, yet 91% still can't optimize it effectively. After years of adoption, most organizations have moved past discovery. They know container sprawl, idle resources and reactive scaling inflate costs. What they don't know is how to fix it ...

Artificial intelligence is no longer a future investment. It's already embedded in how we work — whether through copilots in productivity apps, real-time transcription tools in meetings, or machine learning models fueling analytics and personalization. But while enterprise adoption accelerates, there's one critical area many leaders have yet to examine: Can your network actually support AI at the speed your users expect? ...

The more technology businesses invest in, the more potential attack surfaces they have that can be exploited. Without the right continuity plans in place, the disruptions caused by these attacks can bring operations to a standstill and cause irreparable damage to an organization. It's essential to take the time now to ensure your business has the right tools, processes, and recovery initiatives in place to weather any type of IT disaster that comes up. Here are some effective strategies you can follow to achieve this ...

In today's fast-paced AI landscape, CIOs, IT leaders, and engineers are constantly challenged to manage increasingly complex and interconnected systems. The sheer scale and velocity of data generated by modern infrastructure can be overwhelming, making it difficult to maintain uptime, prevent outages, and create a seamless customer experience. This complexity is magnified by the industry's shift towards agentic AI ...

In MEAN TIME TO INSIGHT Episode 19, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA explains the cause of the AWS outage in October ... 

The explosion of generative AI and machine learning capabilities has fundamentally changed the conversation around cloud migration. It's no longer just about modernization or cost savings — it's about being able to compete in a market where AI is rapidly becoming table stakes. Companies that can't quickly spin up AI workloads, feed models with data at scale, or experiment with new capabilities are falling behind faster than ever before. But here's what I'm seeing: many organizations want to capitalize on AI, but they're stuck ...

On September 16, the world celebrated the 10th annual IT Pro Day, giving companies a chance to laud the professionals who serve as the backbone to almost every successful business across the globe. Despite the growing importance of their roles, many IT pros still work in the background and often go underappreciated ...

Artificial Intelligence (AI) is reshaping observability, and observability is becoming essential for AI. This is a two-way relationship that is increasingly relevant as enterprises scale generative AI ... This dual role makes AI and observability inseparable. In this blog, I cover more details of each side ...