Datadog On-Call Introduced
June 26, 2024
Share this

Datadog announced Datadog On-Call, an on-call experience with observability-enriched paging and seamless incident management workflows.

Datadog On-Call instantly coordinates teams with relevant context for faster issue resolution, better incident control and improved collaboration.

By unifying observability and paging into one seamless platform, Datadog On-Call solves these issues and eliminates the inefficiencies of multiple disjointed tools, allowing engineers to focus on resolving incidents quickly and effectively without the added stress of switching contexts or missing critical information.

“Being on-call is one of the most challenging aspects of an engineer’s job, where redundant service configurations between various tools can lead to brittle, error-prone setups. The general overhead of maintaining on-call schedules and the ambiguity around service and team ownership make it a grueling ordeal, especially during critical times,” said Michael Whetten, VP of Product at Datadog. “Datadog On-Call addresses these pain points with a team-centric design that clarifies ownership, reduces redundancy and minimizes errors. This approach ensures that every team member knows their role and responsibilities, leading to quicker and more effective incident response.”

Datadog On-Call helps DevOps, SRE, Security and IT Operations teams:

- Act Quickly and Stay Informed: Paging with integrated observability and seamless incident management ensures critical insights and data are readily available within a single platform, eliminating the need for context switching.

- Connect with the Tools They Use Every Day: On-Call integrates with a rich ecosystem of third-party monitoring, alerting and service management tools so teams don’t have to learn new workflows or spend resources on training.

- Ensure Clear Service and Team Ownership: Break down knowledge silos and avoid confusion by associating teams with their respective services to simplify configuration, address ownership gaps and ensure the right responders are paged during an alert. Instantly trace upstream and downstream services affected by an outage or issue.

- Implement Intuitive Scheduling and Notifications: Automate scheduling and escalation policies to ensure continuous coverage and timely responses, reducing the burden on individual team members and enhancing overall team coordination.

- Measure On-Call Performance: Rich and customizable analytics measure on-call performance to help ensure system reliability, improve mean-time-to-resolution and optimize the well-being of on-call teams.

Datadog On-Call is in beta now.

Share this

The Latest

October 24, 2024

High-business-impact outages are costly, and a fast MTTx (mean-time-to-detect (MTTD) and mean-time-to-resolve (MTTR)) is crucial, with 62% of businesses reporting a loss of at least $1 million per hour of downtime ...

October 23, 2024

Organizations recognize the benefits of generative AI (GenAI) yet need help to implement the infrastructure necessary to deploy it, according to The Future of AI in IT Operations: Benefits and Challenges, a new report commissioned by ScienceLogic ...

October 22, 2024

Splunk's latest research reveals that companies embracing observability aren't just keeping up, they're pulling ahead. Whether it's unlocking advantages across their digital infrastructure, achieving deeper understanding of their IT environments or uncovering faster insights, organizations are slashing through resolution times like never before ...

October 21, 2024

A majority of IT workers surveyed (79%) believe the current service desk model will be unrecognizable within three years, with nearly as many (77%) saying new technologies will render it "redundant" by 2027, according to The Death (and Rebirth) of the Service Desk from Nexthink ...

October 17, 2024

Monitoring your cloud infrastructure on Microsoft Azure is crucial for maintaining its optimal functioning ... In this blog, we will discuss the key aspects you need to consider when selecting the right Azure monitoring software for your business ...

October 16, 2024

All eyes are on the value AI can provide to enterprises. Whether it's simplifying the lives of developers, more accurately forecasting business decisions, or empowering teams to do more with less, AI has already become deeply integrated into businesses. However, it's still early to evaluate its impact using traditional methods. Here's how engineering and IT leaders can make educated decisions despite the ambiguity ...

October 15, 2024

2024 is the year of AI adoption on the mainframe, according to the State of Mainframe Modernization Survey from Kyndryl ...

October 10, 2024

When employees encounter tech friction or feel frustrated with the tools they are asked to use, they will find a workaround. In fact, one in two office workers admit to using personal devices to log into work networks, with 32% of them revealing their employers are unaware of this practice, according to Securing the Digital Employee Experience ...

October 10, 2024

In today's high-stakes race to deliver innovative products without disruptions, the importance of feature management and experimentation has never been more clear. But what strategies are driving success, and which tools are truly moving the needle? ...

October 09, 2024
A well-performing application is no longer a luxury; it has become a necessity for many business organizations worldwide. End users expect applications to be fast, reliable, and responsive — anything less can cause user frustration, app abandonment, and ultimately lost revenue. This is where application performance testing comes in ....