RapDev announced the latest extension of Arlo, its suite of AI Agents, for Datadog environments - built to streamline observability workflows, reduce operational toil, and accelerate incident resolution.
Launching late Q2 on the Datadog Marketplace, Arlo leverages RapDev's deep engineering expertise to deliver proactive AI-driven solutions that automate investigation and troubleshooting of incidents in real-time, using several LLM techniques. Arlo empowers SREs and engineering teams to stay focused on innovation, avoiding manual troubleshooting and infrastructure noise.
By embedding prompt-chaining techniques directly into Datadog environments, Arlo delivers real-time diagnostics, actionable remediation, and autonomous response capabilities, making incident resolution both faster and smarter.
Each Arlo Agent targets a specific area of infrastructure or application health, offering measurable outcomes and fast time-to-value:
- Arlo for Linux: Flags disk space issues, identifies large or runaway log files, and initiates cleanup actions before business services are impacted.
- Arlo for Kubernetes: Surfaces saturation and deployment anomalies at the node level, with built-in recommendations to reduce drift and prevent future failure.
- Arlo for Windows: Identifies memory pressure and system constraints on Windows VMs hosting .NET applications, pinpointing exactly which processes to address.
- Arlo for Networking: Diagnoses spanning tree and switch-level issues by logging into network devices and identifying misconfigurations, cutting network troubleshooting time dramatically.
"Arlo takes the burden off engineers by running real troubleshooting workflows across Linux, Windows, Kubernetes, and network devices," said Jay Barker, Director of Datadog Engineering at RapDev. "Whether it's root cause analysis or live remediation, Arlo accelerates incident response and turns SRE hours into minutes - all within your existing Datadog environment."
Whether investigating root causes, recommending fixes, or running commands directly, Arlo acts as a digital teammate that never sleeps - automating repetitive diagnostics so teams can resolve issues with confidence and speed.
"Arlo is built for the next wave of ProdOps," said Tameem Hourani, Principal and Founder at RapDev. "With agents that act directly within your observability workflows, it's not just surfacing data - it's taking action. That's where our industry is headed: AI agents that troubleshoot, resolve, and never sleep, so your teams can."
Arlo for Datadog exemplifies RapDev's commitment to building AI-native solutions that enhance core platform capabilities, improve engineer productivity, and deliver operational efficiencies at scale.
Arlo's launch marks a significant milestone in RapDev's AI strategy, with additional agent capabilities and customer-driven enhancements already in development.
The Latest
Cyber threats are growing more sophisticated every day, and at their forefront are zero-day vulnerabilities. These elusive security gaps are exploited before a fix becomes available, making them among the most dangerous threats in today's digital landscape ... This guide will explore what these vulnerabilities are, how they work, why they pose such a significant threat, and how modern organizations can stay protected ...
The prevention of data center outages continues to be a strategic priority for data center owners and operators. Infrastructure equipment has improved, but the complexity of modern architectures and evolving external threats presents new risks that operators must actively manage, according to the Data Center Outage Analysis 2025 from Uptime Institute ...
As observability engineers, we navigate a sea of telemetry daily. We instrument our applications, configure collectors, and build dashboards, all in pursuit of understanding our complex distributed systems. Yet, amidst this flood of data, a critical question often remains unspoken, or at best, answered by gut feeling: "Is our telemetry actually good?" ... We're inviting you to participate in shaping a foundational element for better observability: the Instrumentation Score ...
We're inching ever closer toward a long-held goal: technology infrastructure that is so automated that it can protect itself. But as IT leaders aggressively employ automation across our enterprises, we need to continuously reassess what AI is ready to manage autonomously and what can not yet be trusted to algorithms ...
Much like a traditional factory turns raw materials into finished products, the AI factory turns vast datasets into actionable business outcomes through advanced models, inferences, and automation. From the earliest data inputs to the final token output, this process must be reliable, repeatable, and scalable. That requires industrializing the way AI is developed, deployed, and managed ...
Almost half (48%) of employees admit they resent their jobs but stay anyway, according to research from Ivanti ... This has obvious consequences across the business, but we're overlooking the massive impact of resenteeism and presenteeism on IT. For IT professionals tasked with managing the backbone of modern business operations, these numbers spell big trouble ...
For many B2B and B2C enterprise brands, technology isn't a core strength. Relying on overly complex architectures (like those that follow a pure MACH doctrine) has been flagged by industry leaders as a source of operational slowdown, creating bottlenecks that limit agility in volatile market conditions ...
FinOps champions crucial cross-departmental collaboration, uniting business, finance, technology and engineering leaders to demystify cloud expenses. Yet, too often, critical cost issues are softened into mere "recommendations" or "insights" — easy to ignore. But what if we adopted security's battle-tested strategy and reframed these as the urgent risks they truly are, demanding immediate action? ...
Two in three IT professionals now cite growing complexity as their top challenge — an urgent signal that the modernization curve may be getting too steep, according to the Rising to the Challenge survey from Checkmk ...
While IT leaders are becoming more comfortable and adept at balancing workloads across on-premises, colocation data centers and the public cloud, there's a key component missing: connectivity, according to the 2025 State of the Data Center Report from CoreSite ...