Datadog Launches Workflow Automation
June 06, 2023
Share this

Datadog announced the general availability of Workflow Automation.

The new product enables teams to automate end-to-end remediation processes—with out-of-the-box actions and pre-built templates—across all systems, apps and services to help identify, investigate and resolve service disruptions and security threats faster.

Workflow Automation combines alerts and remediation in a single, streamlined solution. Datadog provides developers and security engineers end-to-end visibility across the tech stack so that teams are alerted to disruptions as they arise. Now, with Workflow Automation, teams can use context-rich alerts to automate entire remediation processes across their tools and services in response to disruptions directly in Datadog's unified platform. By giving teams the ability to both observe the entire tech stack in a single platform and also remediate any issues directly in the same place, Workflow Automation helps organizations maintain the availability of their systems.

"Manual processes and context switching are often the root cause of long stretches of IT downtime. This impacts both a company's bottom line and its reputation with end users," said Michael Gerstenhaber, VP of Product at Datadog. "Workflow Automation provides the automation and context that DevOps, SRE and security teams need in order to remediate issues quickly. Automation that seamlessly connects observability information with remediation allows engineers to proactively respond to insights before they turn into issues that would otherwise affect their businesses."

Workflow Automation enables organizations to:

- Trigger Responses Instantly: Users can automatically trigger responses from observability alerts, security signals and dashboards. These response workflows can be enriched with real-time observability data, such as logs and metrics, that guide automated decision-making. This allows teams to identify, investigate and resolve service disruptions and security threats quickly while proactively maintaining the health of systems.

- Automate Complex Processes: Over 300 out-of-the-box actions and more than 40 pre-built templates enable users to easily automate routine response tasks and complex end-to-end processes to save engineers time, eliminate human error and reduce overhead.

- Safeguard Automation: Teams can create interactive, human-in-the-loop workflows, and safeguard these workflows with granular role-based access control (RBAC) while keeping track of executions with detailed workflow auditing.

Share this

The Latest

July 25, 2024

The 2024 State of the Data Center Report from CoreSite shows that although C-suite confidence in the economy remains high, a VUCA (volatile, uncertain, complex, ambiguous) environment has many business leaders proceeding with caution when it comes to their IT and data ecosystems, with an emphasis on cost control and predictability, flexibility and risk management ...

July 24, 2024

In June, New Relic published the State of Observability for Energy and Utilities Report to share insights, analysis, and data on the impact of full-stack observability software in energy and utilities organizations' service capabilities. Here are eight key takeaways from the report ...

July 23, 2024

The rapid rise of generative AI (GenAI) has caught everyone's attention, leaving many to wonder if the technology's impact will live up to the immense hype. A recent survey by Alteryx provides valuable insights into the current state of GenAI adoption, revealing a shift from inflated expectations to tangible value realization across enterprises ... Here are five key takeaways that underscore GenAI's progression from hype to real-world impact ...

July 22, 2024
A defective software update caused what some experts are calling the largest IT outage in history on Friday, July 19. The impact reverberated through multiple industries around the world ...
July 18, 2024

As software development grows more intricate, the challenge for observability engineers tasked with ensuring optimal system performance becomes more daunting. Current methodologies are struggling to keep pace, with the annual Observability Pulse surveys indicating a rise in Mean Time to Remediation (MTTR). According to this survey, only a small fraction of organizations, around 10%, achieve full observability today. Generative AI, however, promises to significantly move the needle ...

July 17, 2024

While nearly all data leaders surveyed are building generative AI applications, most don't believe their data estate is actually prepared to support them, according to the State of Reliable AI report from Monte Carlo Data ...

July 16, 2024

Enterprises are putting a lot of effort into improving the digital employee experience (DEX), which has become essential to both improving organizational performance and attracting and retaining talented workers. But to date, most efforts to deliver outstanding DEX have focused on people working with laptops, PCs, or thin clients. Employees on the frontlines, using mobile devices to handle logistics ... have been largely overlooked ...

July 15, 2024

The average customer-facing incident takes nearly three hours to resolve (175 minutes) while the estimated cost of downtime is $4,537 per minute, meaning each incident can cost nearly $794,000, according to new research from PagerDuty ...

July 12, 2024

In MEAN TIME TO INSIGHT Episode 8, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses AutoCon with the conference founders Scott Robohn and Chris Grundemann ...

July 11, 2024

Numerous vendors and service providers have recently embraced the NaaS concept, yet there is still no industry consensus on its definition or the types of networks it involves. Furthermore, providers have varied in how they define the NaaS service delivery model. I conducted research for a new report, Network as a Service: Understanding the Cloud Consumption Model in Networking, to refine the concept of NaaS and reduce buyer confusion over what it is and how it can offer value ...