
LightStep announced the release of its Service Health for Deployments solution to help developers quickly and easily identify and remediate service health issues during and after a deployment.
With LightStep’s solution, understanding service health has never been easier. Developers can monitor performance regressions to their services during and after a deployment, gaining visibility into the latency, error ratio, and throughput changes to their operations. Additionally, developers are able to understand why a regression has occurred, with rich aggregate trace analysis functionality such as latency histogram comparisons, operations diagrams, and an automated correlation engine for performing rapid root cause analysis. Users can also ensure they are getting the most value from their services by demystifying and iterating on proper instrumentation of their service with LightStep’s Instrumentation Quality Score.
"As developers ourselves, we know that deploys often result in regressions, and investigating the cause can be a time consuming process," said Kay Ousterhout, Software Engineer, LightStep. "Our solution takes the uncertainty and guesswork out of service deployments so our customers can focus on shipping quality applications faster."
Following a regression, Service Health for Deployments enables users to quickly perform rich root cause analysis to identify what went wrong. In addition to reactive investigation, users are able to proactively monitor deployments. Using LightStep, they’re able to:
- Compare performance before, during, and after a deployment
- Compare latency distributions to estimate the size and scope of a regression
- Correlate tags that have the biggest impact on latency
- Provide visibility into the complete operation and service diagrams with critical path latency mapped to each operation or service
- Perform aggregate trace analysis to identify what’s driving a regression
Microservices have become ubiquitous among enterprise development teams. According to research LightStep conducted, about 9 in 10 enterprise development teams are currently using or plan to use microservices. Unfortunately, when microservices scale, systems grow exponentially complex - making it extremely difficult for developers to understand why services fail. LightStep Service Health for Deployments empowers developers with the ability to seamlessly navigate evolving end-to-end application stacks so they can quickly identify and resolve service health issues before they impact the customer experience.
LightStep Service Health for Deployments is generally available.
The Latest
For years, infrastructure teams have treated compute as a relatively stable input. Capacity was provisioned, costs were forecasted, and performance expectations were set based on the assumption that identical resources behaved identically. That mental model is starting to break down. AI infrastructure is no longer behaving like static cloud capacity. It is increasingly behaving like a market ...
Resilience can no longer be defined by how quickly an organization recovers from an incident or disruption. The effectiveness of any resilience strategy is dependent on its ability to anticipate change, operate under continuous stress, and adapt confidently amid uncertainty ...
Mobile users are less tolerant of app instability than ever before. According to a new report from Luciq, No Margin for Error: What Mobile Users Expect and What Mobile Leaders Must Deliver in 2026, even minor performance issues now result in immediate abandonment, lost purchases, and long-term brand impact ...
Artificial intelligence (AI) has become the dominant force shaping enterprise data strategies. Boards expect progress. Executives expect returns. And data leaders are under pressure to prove that their organizations are "AI-ready" ...
Agentic AI is a major buzzword for 2026. Many tech companies are making bold promises about this technology, but many aren't grounded in reality, at least not yet. This coming year will likely be shaped by reality checks for IT teams, and progress will only come from a focus on strong foundations and disciplined execution ...
AI systems are still prone to hallucinations and misjudgments ... To build the trust needed for adoption, AI must be paired with human-in-the-loop (HITL) oversight, or checkpoints where humans verify, guide, and decide what actions are taken. The balance between autonomy and accountability is what will allow AI to deliver on its promise without sacrificing human trust ...
More data center leaders are reducing their reliance on utility grids by investing in onsite power for rapidly scaling data centers, according to the Data Center Power Report from Bloom Energy ...
In MEAN TIME TO INSIGHT Episode 21, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses AI-driven NetOps ...
Enterprise IT has become increasingly complex and fragmented. Organizations are juggling dozens — sometimes hundreds — of different tools for endpoint management, security, app delivery, and employee experience. Each one needs its own license, its own maintenance, and its own integration. The result is a patchwork of overlapping tools, data stuck in silos, security vulnerabilities, and IT teams are spending more time managing software than actually getting work done ...
2025 was the year everybody finally saw the cracks in the foundation. If you were running production workloads, you probably lived through at least one outage you could not explain to your executives without pulling up a diagram and a whiteboard ...