Skip to main content

ServiceNow Releases Lightstep Incident Response

ServiceNow announced that Lightstep is extending beyond observability and creating a differentiated portfolio for app development with the general availability of Lightstep Incident Response, helping make organizations’ digital products and services more reliable and resilient.

Lightstep Incident Response will enable developers and site reliability engineers (SRE) to reduce downtime by arming them with the service context and automation they need to effectively respond to incidents, such as a software bug, power outage, or down network.

“What we’re hearing from developers and SREs is that eliminating ‘context switch’ – flipping between observability, on‑call, collaboration and incident management tools – would reduce human errors and speed up response times,” said Rohit Jainendra, VP and GM of emerging businesses at ServiceNow. "With Lightstep Incident Response, we are providing teams with a single platform that orchestrates on‑call escalation, alert grouping, incident analysis, and remediation, while seamlessly integrating with collaboration and incident management tools to eliminate ‘context switch’ and resolve incidents with speed."

ServiceNow acquired Lightstep in 2021 to extend the benefits of observability across business functions and enable enterprises to increase their cloud‑native capabilities. The company plans to extend Lightstep’s capabilities beyond observability, with the mission of becoming an end‑to‑end platform for app development organizations. The general availability of Lightstep Incident Response marks the first major step on that mission.

Lightstep Incident Response manages an organization’s on‑call rotations by synchronizing everyone’s schedule onto a shared calendar, with specific tags that indicate who needs to be looped in based on the nature of the incident and the service that is impacted. From there, collaborators are invited to a dedicated channel based on prebuilt collaboration integrations for quick remediation. Additionally, they can create automations that self‑triage and self‑remediate problems should they reoccur.

Lightstep Incident Response seamlessly integrates with leading monitoring, observability and collaboration tools, including LogicMonitor, Postman, Sumo Logic, Zoom, and more, streamlining the incident response process. For ServiceNow customers, Lightstep Incident Response natively integrates with the Now Platform, allowing users to quickly respond to or escalate incidents to the right team all on one platform and connecting incident response to core operations – putting the entire power of their organization behind the end‑customer experience.

“Combining real‑time observability and incident response gives on‑call engineers powerful insight into the changes that matter and the ability to act quickly,” said Ben Sigelman, GM of Lightstep and co‑creator of OpenTelemetry. “With the introduction of Lightstep Incident Response, we are delivering the all‑in‑one solution for developers and SREs to act with the speed and efficiency necessary to maintain exceptional experiences for customers using their applications and services. In combination with OpenTelemetry, a Cloud Native Computing Foundation sandbox project founded in part by Lightstep, organizations will now have the data platform, workflows, and an open standards approach necessary to successfully operate highly distributed cloud native services.”

Lightstep Incident Response is offered as free and paid versions and introduces an innovative usage based pricing model based on the number of active services being managed.

The Latest

For years, infrastructure teams have treated compute as a relatively stable input. Capacity was provisioned, costs were forecasted, and performance expectations were set based on the assumption that identical resources behaved identically. That mental model is starting to break down. AI infrastructure is no longer behaving like static cloud capacity. It is increasingly behaving like a market ...

Resilience can no longer be defined by how quickly an organization recovers from an incident or disruption. The effectiveness of any resilience strategy is dependent on its ability to anticipate change, operate under continuous stress, and adapt confidently amid uncertainty ...

Mobile users are less tolerant of app instability than ever before. According to a new report from Luciq, No Margin for Error: What Mobile Users Expect and What Mobile Leaders Must Deliver in 2026, even minor performance issues now result in immediate abandonment, lost purchases, and long-term brand impact ...

Artificial intelligence (AI) has become the dominant force shaping enterprise data strategies. Boards expect progress. Executives expect returns. And data leaders are under pressure to prove that their organizations are "AI-ready" ...

Agentic AI is a major buzzword for 2026. Many tech companies are making bold promises about this technology, but many aren't grounded in reality, at least not yet. This coming year will likely be shaped by reality checks for IT teams, and progress will only come from a focus on strong foundations and disciplined execution ...

AI systems are still prone to hallucinations and misjudgments ... To build the trust needed for adoption, AI must be paired with human-in-the-loop (HITL) oversight, or checkpoints where humans verify, guide, and decide what actions are taken. The balance between autonomy and accountability is what will allow AI to deliver on its promise without sacrificing human trust ...

More data center leaders are reducing their reliance on utility grids by investing in onsite power for rapidly scaling data centers, according to the Data Center Power Report from Bloom Energy ...

In MEAN TIME TO INSIGHT Episode 21, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses AI-driven NetOps ... 

Enterprise IT has become increasingly complex and fragmented. Organizations are juggling dozens — sometimes hundreds — of different tools for endpoint management, security, app delivery, and employee experience. Each one needs its own license, its own maintenance, and its own integration. The result is a patchwork of overlapping tools, data stuck in silos, security vulnerabilities, and IT teams are spending more time managing software than actually getting work done ...

2025 was the year everybody finally saw the cracks in the foundation. If you were running production workloads, you probably lived through at least one outage you could not explain to your executives without pulling up a diagram and a whiteboard ...

ServiceNow Releases Lightstep Incident Response

ServiceNow announced that Lightstep is extending beyond observability and creating a differentiated portfolio for app development with the general availability of Lightstep Incident Response, helping make organizations’ digital products and services more reliable and resilient.

Lightstep Incident Response will enable developers and site reliability engineers (SRE) to reduce downtime by arming them with the service context and automation they need to effectively respond to incidents, such as a software bug, power outage, or down network.

“What we’re hearing from developers and SREs is that eliminating ‘context switch’ – flipping between observability, on‑call, collaboration and incident management tools – would reduce human errors and speed up response times,” said Rohit Jainendra, VP and GM of emerging businesses at ServiceNow. "With Lightstep Incident Response, we are providing teams with a single platform that orchestrates on‑call escalation, alert grouping, incident analysis, and remediation, while seamlessly integrating with collaboration and incident management tools to eliminate ‘context switch’ and resolve incidents with speed."

ServiceNow acquired Lightstep in 2021 to extend the benefits of observability across business functions and enable enterprises to increase their cloud‑native capabilities. The company plans to extend Lightstep’s capabilities beyond observability, with the mission of becoming an end‑to‑end platform for app development organizations. The general availability of Lightstep Incident Response marks the first major step on that mission.

Lightstep Incident Response manages an organization’s on‑call rotations by synchronizing everyone’s schedule onto a shared calendar, with specific tags that indicate who needs to be looped in based on the nature of the incident and the service that is impacted. From there, collaborators are invited to a dedicated channel based on prebuilt collaboration integrations for quick remediation. Additionally, they can create automations that self‑triage and self‑remediate problems should they reoccur.

Lightstep Incident Response seamlessly integrates with leading monitoring, observability and collaboration tools, including LogicMonitor, Postman, Sumo Logic, Zoom, and more, streamlining the incident response process. For ServiceNow customers, Lightstep Incident Response natively integrates with the Now Platform, allowing users to quickly respond to or escalate incidents to the right team all on one platform and connecting incident response to core operations – putting the entire power of their organization behind the end‑customer experience.

“Combining real‑time observability and incident response gives on‑call engineers powerful insight into the changes that matter and the ability to act quickly,” said Ben Sigelman, GM of Lightstep and co‑creator of OpenTelemetry. “With the introduction of Lightstep Incident Response, we are delivering the all‑in‑one solution for developers and SREs to act with the speed and efficiency necessary to maintain exceptional experiences for customers using their applications and services. In combination with OpenTelemetry, a Cloud Native Computing Foundation sandbox project founded in part by Lightstep, organizations will now have the data platform, workflows, and an open standards approach necessary to successfully operate highly distributed cloud native services.”

Lightstep Incident Response is offered as free and paid versions and introduces an innovative usage based pricing model based on the number of active services being managed.

The Latest

For years, infrastructure teams have treated compute as a relatively stable input. Capacity was provisioned, costs were forecasted, and performance expectations were set based on the assumption that identical resources behaved identically. That mental model is starting to break down. AI infrastructure is no longer behaving like static cloud capacity. It is increasingly behaving like a market ...

Resilience can no longer be defined by how quickly an organization recovers from an incident or disruption. The effectiveness of any resilience strategy is dependent on its ability to anticipate change, operate under continuous stress, and adapt confidently amid uncertainty ...

Mobile users are less tolerant of app instability than ever before. According to a new report from Luciq, No Margin for Error: What Mobile Users Expect and What Mobile Leaders Must Deliver in 2026, even minor performance issues now result in immediate abandonment, lost purchases, and long-term brand impact ...

Artificial intelligence (AI) has become the dominant force shaping enterprise data strategies. Boards expect progress. Executives expect returns. And data leaders are under pressure to prove that their organizations are "AI-ready" ...

Agentic AI is a major buzzword for 2026. Many tech companies are making bold promises about this technology, but many aren't grounded in reality, at least not yet. This coming year will likely be shaped by reality checks for IT teams, and progress will only come from a focus on strong foundations and disciplined execution ...

AI systems are still prone to hallucinations and misjudgments ... To build the trust needed for adoption, AI must be paired with human-in-the-loop (HITL) oversight, or checkpoints where humans verify, guide, and decide what actions are taken. The balance between autonomy and accountability is what will allow AI to deliver on its promise without sacrificing human trust ...

More data center leaders are reducing their reliance on utility grids by investing in onsite power for rapidly scaling data centers, according to the Data Center Power Report from Bloom Energy ...

In MEAN TIME TO INSIGHT Episode 21, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses AI-driven NetOps ... 

Enterprise IT has become increasingly complex and fragmented. Organizations are juggling dozens — sometimes hundreds — of different tools for endpoint management, security, app delivery, and employee experience. Each one needs its own license, its own maintenance, and its own integration. The result is a patchwork of overlapping tools, data stuck in silos, security vulnerabilities, and IT teams are spending more time managing software than actually getting work done ...

2025 was the year everybody finally saw the cracks in the foundation. If you were running production workloads, you probably lived through at least one outage you could not explain to your executives without pulling up a diagram and a whiteboard ...