PagerDuty Adds Automated Incident Response
November 16, 2021

PagerDuty announced new capabilities to further support the digital-first orientation of businesses as they seek to meet heightened expectations for customer experiences.

The new solutions inject control and logic at the event layer instantly to drive real-time behaviors and workflows.

Uniquely integrating this new event management into operations automation reduces manual processes and toil, automates work, and drives best practices across distributed teams.

Along with these new features, PagerDuty also announced the general availability of Change Events in Mobile, Rundeck Actions, Round Robin Scheduling, and Probable Incident Origin, bringing a strong suite of capabilities all aimed at enhancing automated incident response to drive customer engagement and efficient operations across the modern enterprise.

“Successful business operations in today’s world are fully digitized. Mission-critical work is urgent, unplanned, and involves distributed teams that need to assemble and collaborate effectively when minutes of delay can mean millions in lost revenue,” said Sean Scott, PagerDuty’s CPO. “PagerDuty’s Operations Cloud connects teams, departments, and dependencies, empowering companies to master key services and manage time-critical work that impacts customer experience.”

Digital services are in a constant state of change with a complex web of dependencies, a situation further complicated as 72% of tech leaders report that their organizations are actively accelerating their digital transformation strategies. Connecting and correlating data and signals for real-time information means ingesting signals from numerous sources, both structured and unstructured, and quickly turning the data into insights that guide actions.

New PagerDuty capabilities include:

- New Event Orchestration: Reduce manual processes and toil to gain operational efficiency

Event Intelligence is well known for its noise reduction capabilities, with features like intelligent alert grouping. PagerDuty now delivers the ability for teams to minimize transient noise with machine learning. Event Orchestration cuts down on businesses’ manual event processing with a powerful decision engine. Teams can now create custom logic to enrich, modify, and control routing based on event conditions at scale. Event Orchestration combines nested event rules for precise, targeted automation, including diagnostics and remediation, to reduce toil and gain operational efficiency.
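To make the idea of nested event rules concrete, here is a minimal Python sketch of a rule tree that enriches an event and picks a route. It is an illustration only and does not use PagerDuty's API: the `Rule` class, the `orchestrate` function, the first-match-wins behavior, and every field and service name are assumptions made for this example.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Callable, Optional

Event = dict  # hypothetical event shape: a flat dict of fields


@dataclass
class Rule:
    """One hypothetical rule: a condition, optional enrichment, optional route."""
    condition: Callable[[Event], bool]
    enrich: dict = field(default_factory=dict)
    route_to: Optional[str] = None
    children: list[Rule] = field(default_factory=list)  # nested rules


def orchestrate(event: Event, rules: list[Rule],
                default_route: str = "default-service") -> tuple[Event, str]:
    """Apply the first matching rule at each level, then descend into its children."""
    enriched = dict(event)  # copy so the caller's event is not mutated
    route = default_route
    for rule in rules:
        if rule.condition(enriched):
            enriched.update(rule.enrich)
            if rule.route_to:
                route = rule.route_to
            # Nested rules refine the result; the current route is their fallback.
            return orchestrate(enriched, rule.children, route)
    return enriched, route


# Example rule tree (all names are made up for illustration).
RULES = [
    Rule(
        condition=lambda e: e.get("source") == "database",
        enrich={"team": "data"},
        route_to="db-service",
        children=[
            Rule(
                condition=lambda e: e.get("severity") == "critical",
                enrich={"priority": "P1"},
                route_to="db-oncall-escalation",
            ),
        ],
    ),
    Rule(
        condition=lambda e: e.get("severity") == "info",
        enrich={"suppress": True},
    ),
]

if __name__ == "__main__":
    print(orchestrate({"source": "database", "severity": "critical"}, RULES))
```

In this sketch a critical database event is tagged with a team and a priority and routed to the escalation service, while a non-critical database event stops at the outer rule and goes to `db-service`.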

“Customers are increasingly looking for event orchestration capabilities that balance human-led and machine-led work in real time,” said Stephen Elliot, group vice president, I&O, cloud operations and DevOps. “As CIOs and CEOs are continuing to find ways to increase operational efficiency, now is the time for people to get ahead of unplanned downtime events and find ways to automate their incident response processes.”

- New Rundeck Cloud: Rundeck is now available as a fully managed cloud service

With Rundeck, users focus on building and running automated workflows. Rundeck Cloud manages the infrastructure for users by providing high availability, security, and elastic scalability. It also manages all patches and updates, so users always have the latest features available. In the near future, Rundeck Actions will pair with Rundeck Cloud to quickly create sophisticated automated diagnostics and remediation for your production systems.

- New Service Standards: Enable account owners to configure and enforce best-practice standards at scale for all their managed services

Clearly defined and well-configured services are central to achieving team autonomy and efficient incident response. Many organizations are pivoting towards a Service Ownership model where developers and site reliability engineers take responsibility for supporting the code they deliver at every stage of the service lifecycle: they build it, ship it, and own it in production. PagerDuty’s Service Standards empower organizations to easily define, share, and track the criteria for service configuration according to their unique needs. Individual teams receive clear guidelines for setting up and managing services within PagerDuty.
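As a rough illustration of defining and tracking criteria for service configuration, the following Python sketch checks a set of service records against a small list of standards. It is not PagerDuty's implementation: the `Standard` class, the three example criteria, and the service fields (`description`, `escalation_policy`, `team`) are all assumptions chosen for the example.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Standard:
    """A hypothetical standard: a name plus a check over a service record."""
    name: str
    check: Callable[[dict], bool]


# Example criteria; a real organization would define its own.
STANDARDS = [
    Standard("has description", lambda s: bool((s.get("description") or "").strip())),
    Standard("has escalation policy", lambda s: s.get("escalation_policy") is not None),
    Standard("has owner team", lambda s: bool(s.get("team"))),
]


def evaluate(service: dict, standards=STANDARDS) -> dict:
    """Return a pass/fail result per standard for one service."""
    return {std.name: std.check(service) for std in standards}


def compliance(services: list, standards=STANDARDS) -> float:
    """Fraction of services that pass every standard (1.0 for an empty list)."""
    if not services:
        return 1.0
    passing = sum(all(evaluate(s, standards).values()) for s in services)
    return passing / len(services)


if __name__ == "__main__":
    services = [
        {"name": "checkout", "description": "Handles payments",
         "escalation_policy": "EP1", "team": "payments"},
        {"name": "search", "description": "",
         "escalation_policy": None, "team": "discovery"},
    ]
    for svc in services:
        print(svc["name"], evaluate(svc))
    print("compliance:", compliance(services))
```

Keeping each standard as a named check makes the per-service results easy to share with the owning team, and the aggregate figure gives account owners one number to track over time.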

- New Change Events & Change Correlation for Mobile: Help responders solve incidents faster

Deliver machine-learning-powered change correlation directly to on-call responders, now on mobile devices. With the latest context available at a glance, responders can identify potentially related changes, triage incidents quickly, and reduce time-to-resolution while on the go.

Solutions Now Generally Available:

- Rundeck Actions + Automated Diagnostics Package: Empowers responders to immediately remove critical minutes from incident response

PagerDuty Rundeck Actions help users run automated diagnostics and remediate incidents directly within PagerDuty. They improve productivity by automating repeated diagnostic and remediation steps, replacing the toil of manual tasks.
