
PagerDuty announced new AIOps and automation capabilities for the PagerDuty Operations Cloud, enabling customers to accelerate and optimize their critical operations in service of business transformation.
The PagerDuty Operations Cloud Fall 2023 release includes event orchestration, runbook automation and updates around its recently announced generative AI (GenAI) use cases, helping organizations cut operating costs, accelerate innovation, and mitigate risk of operational failures.
By using the PagerDuty Operations Cloud, enterprises are able to ensure up to 75% less downtime3. This can equate to millions of dollars saved or revenue preserved in a single year. To optimize digital experiences and reduce costly downtime, enterprise CEOs and CIOs are seeking integrated platforms like PagerDuty that go beyond patchwork solutions and leverage AI and automation across their entire product portfolio. To help customers remain competitive, retain and expand customers, and accelerate innovation in a resource-constrained environment, the PagerDuty Operations Cloud offers next-generation AI and automation to quickly resolve issues, minimize interruptions, reduce tool sprawl, and build more automation faster.
Do more with less at scale with event-driven automation to trigger intelligent remediation:
- Event Orchestration Variables empower organizations to build intelligent automation that helps inform other tools and processes for faster, more targeted incident response that can be standardized across the organization for better cross-team collaboration.
- Runbook Automation Add-On allows customers looking to automate actions triggered by AIOps, incident responders, or customer service representatives to benefit from capabilities of both Automation Actions and Runbook Automation on a single SKU. This provides a more powerful solution for gathering deeper information from a customer’s environment for triage and diagnosis and running remediation changes to resolve incidents. Leveraging the synergies of Runbook Automation Add-On and AIOps helps resolve incidents up to 95% faster4 by allowing the delegation of repetitive tasks to incident responders while also freeing up specialists’ time. With the Runbook Automation Add-On, customers can also reduce planned downtime by 85%5 and support costs by 55%6.
Increase team productivity and time for innovation with AI and AIOps:
- Global Alert Grouping reduces interruptions by grouping event data across all your services for teams to gain a better understanding of the incident scope when you need to coordinate with other teams to quickly resolve issues. Whether you need to curb noise on one or multiple services, PagerDuty allows you to customize how event data is grouped. On average, customers using PagerDuty AIOps are able to reduce the overall number of incidents by 87%7, minimizing unnecessary interruptions and keeping teams in flow.
- AI-Generated Incident Postmortems enable teams to save hours–and in some cases, days–of effort, removing toil and improving accuracy associated with post-incident analysis. PagerDuty automatically creates a draft postmortem using GenAI to empower teams to focus on refining their learnings and action items for improving their operational processes. With the click of a button, PagerDuty triangulates and collates incident data to help generate comprehensive summaries of what happened, when, how it was resolved, and key action items. This ensures that incident analysis is timely and relevant, and enables a continuous loop of learning. The feature is now in beta–interested customers should sign up for the waitlist.
- AI-Generated Status Updates remove manual work associated with stakeholder management during incident response to keep teams focused on resolution. Early customer feedback of AI-Generated Status Updates has shown that some companies designate as many as three responders to handle stakeholder communication. With AI-Generated Status Updates, drafting the message only takes one click, making it easier for the responder to review, edit and send updates to keep internal stakeholders and executives in the loop. This can potentially cut down the number of responders needed from three to one, saving costs and helping scale your workforce. The feature is now in beta–interested customers should sign up for the waitlist.
Improve operational resilience with a platform that fits with the way you work:
- Google Cloud Personalized Service Health Integration sends proactive, customized and detailed alerts about Google service disruptions to get ahead of customer impacting issues. The PagerDuty and Google Cloud partnership offers a vital platform for efficient cloud operations, aiding customers in responding to disruptions and ensuring smooth digital experiences.
- Slack/Chat as a Contact Method allows chat app users to elect to use chat apps as a contact method for incident response, adding to an already comprehensive integrated experience for customers using PagerDuty with Slack.
The Latest
In live financial environments, capital markets software cannot pause for rebuilds. New capabilities are introduced as stacked technology layers to meet evolving demands while systems remain active, data keeps moving, and controls stay intact. AI is no exception, and its opportunities are significant: accelerated decision cycles, compressed manual workflows, and more effective operations across complex environments. The constraint isn't the models themselves, but the architectural environments they enter ...
Like most digital transformation shifts, organizations often prioritize productivity and leave security and observability to keep pace. This usually translates to both the mass implementation of new technology and fragmented monitoring and observability (M&O) tooling. In the era of AI and varied cloud architecture, a disparate observability function can be dangerous. IT teams will lack a complete picture of their IT environment, making it harder to diagnose issues while slowing down mean time to resolve (MTTR). In fact, according to recent data from the SolarWinds State of Monitoring & Observability Report, 77% of IT personnel said the lack of visibility across their on-prem and cloud architecture was an issue ...
In MEAN TIME TO INSIGHT Episode 23, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses the NetOps labor shortage ...
Technology management is evolving, and in turn, so is the scope of FinOps. The FinOps Foundation recently updated their mission statement from "advancing the people who manage the value of cloud" to "advancing the people who manage the value of technology." This seemingly small change solidifies a larger evolution: FinOps practitioners have organically expanded to be focused on more than just cloud cost optimization. Today, FinOps teams are largely — and quickly — expanding their job descriptions, evolving into a critical function for managing the full value of technology ...
Enterprises are under pressure to scale AI quickly. Yet despite considerable investment, adoption continues to stall. One of the most overlooked reasons is vendor sprawl ... In reality, no organization deliberately sets out to create sprawling vendor ecosystems. More often, complexity accumulates over time through well-intentioned initiatives, such as enterprise-wide digital transformation efforts, point solutions, or decentralized sourcing strategies ...
Nearly every conversation about AI eventually circles back to compute. GPUs dominate the headlines while cloud platforms compete for workloads and model benchmarks drive investment decisions. But underneath that noise, a quieter infrastructure challenge is taking shape. The real bottleneck in enterprise AI is not processing power, it is the ability to store, manage and retrieve the relentless volumes of data that AI systems generate, consume and multiply ...
The 2026 Observability Survey from Grafana Labs paints a vivid picture of an industry maturing fast, where AI is welcomed with careful conditions, SaaS economics are reshaping spending decisions, complexity remains a defining challenge, and open standards continue to underpin it all ...
The observability industry has an evolving relationship with AI. We're not skeptics, but it's clear that trust in AI must be earned ... In Grafana Labs' annual Observability Survey, 92% said they see real value in AI surfacing anomalies before they cause downtime. Another 91% endorsed AI for forecasting and root cause analysis. So while the demand is there, customers need it to be trustworthy, as the survey also found that the practitioners most enthusiastic about AI are also the most insistent on explainability ...
In the modern enterprise, the conversation around AI has moved past skepticism toward a stage of active adoption. According to our 2026 State of IT Trends Report: The Human Side of Autonomous AI, nearly 90% of IT professionals view AI as a net positive, and this optimism is well-founded. We are seeing agentic AI move beyond simple automation to actively streamlining complex data insights and eliminating the manual toil that has long hindered innovation. However, as we integrate these autonomous agents into our ecosystems, the fundamental DNA of the IT role is evolving ...
AI workloads require an enormous amount of computing power ... What's also becoming abundantly clear is just how quickly AI's computing needs are leading to enterprise systems failure. According to Cockroach Labs' State of AI Infrastructure 2026 report, enterprise systems are much closer to failure than their organizations realize. The report ... suggests AI scale could cause widespread failures in as little as one year — making it a clear risk for business performance and reliability.