
Datadog announced new capabilities for monitoring DNS, allowing engineers to troubleshoot DNS issues that affect the performance and availability of web applications and backend microservices.
Engineers today rely on performant DNS resolution in two ways: to make their user-facing applications globally accessible on the Internet, and to facilitate communication between the backend services on which these applications are built. When DNS performance issues inevitably arise, whether internally or on the third-party provider side, they often lead to downstream failures that impact the end-user experience. Datadog’s DNS monitoring capabilities now allow customers to monitor key performance metrics for both internal and external DNS resolution to maintain efficient service networking and availability.
“DNS is the backbone of the Internet, so its performance has a direct effect on the bottom line of every web application,” said Ilan Rabinovitch, VP, Product and Community, Datadog. “We have built end-to-end DNS monitoring to provide comprehensive visibility into the health and availability of business-critical applications both for service discovery and user experience.”
These new DNS monitoring capabilities extend Datadog’s Network Performance Monitoring and Synthetic Monitoring by helping customers detect, in real time, when poor internal DNS resolution leads to network and application issues, and when incorrect or poorly performing external DNS resolution is affecting end-user experience.
Functionalities include:
- Assessing the performance of all internal DNS queries, transactions, and managed services in one view
- Quickly isolating servers with the highest response time or error rate to incoming requests
- Tracking the resolution time and accessibility of managed domains
- Identifying regional DNS outages and mismatched records
- Correlating DNS queries with application performance and cross-microservice communication
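
To make two of the items above concrete (tracking resolution time and spotting mismatched records), here is a minimal sketch in Python. It is not Datadog's implementation or API; it only uses the standard library's system resolver. The function names `resolve_ipv4` and `check_domain`, the hostname, and the expected address are all hypothetical and chosen for illustration.

```python
import socket
import time
from typing import Callable, Iterable, Set


def resolve_ipv4(hostname: str) -> Set[str]:
    """Resolve a hostname to its IPv4 addresses using the system resolver."""
    infos = socket.getaddrinfo(hostname, None, family=socket.AF_INET)
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return {info[4][0] for info in infos}


def check_domain(
    hostname: str,
    expected: Iterable[str],
    resolver: Callable[[str], Iterable[str]] = resolve_ipv4,
) -> dict:
    """Time one resolution and compare the answer with the expected records.

    `resolver` is injectable so the check can be exercised without network access.
    """
    start = time.perf_counter()
    try:
        addresses = set(resolver(hostname))
        error = None
    except OSError as exc:  # socket.gaierror is a subclass of OSError
        addresses = set()
        error = str(exc)
    elapsed_ms = (time.perf_counter() - start) * 1000.0
    return {
        "hostname": hostname,
        "resolved": error is None,
        "elapsed_ms": elapsed_ms,
        "addresses": addresses,
        # A mismatch is only reported when resolution succeeded but the
        # returned set differs from what the operator expects.
        "mismatch": error is None and addresses != set(expected),
        "error": error,
    }


if __name__ == "__main__":
    # Hypothetical domain and expected record; replace with values you manage.
    print(check_domain("example.com", {"192.0.2.1"}))
```

Running a check like this from several regions and comparing the results is one simple way to notice a regional outage or an inconsistent record; a monitoring product would add scheduling, aggregation, and alerting on top.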
The DNS monitoring capabilities are now available for all Datadog customers.