xMatters announced platform advancements that are powering a new method for incident response and management––adaptive incident management.
New adaptive incident management features available today in the xMatters platform include:
- an Incident Console to better triage, visualize and guide the entire incident lifecycle
- automation for incident response and improved collaboration
- the ability to quickly and easily scale the response to the incident as it evolves
- comprehensive incident analytics to drive continuous improvement.
Whether it’s a small technical issue affecting one software service or an enterprise-wide outage cascading across business services, these new capabilities empower technology teams to work within their preferred tools and deliver digital service resilience––the ability to recover quickly, adapt and learn from incidents such as performance issues and outages.
“Technology professionals today are expected to not only create and innovate digital services, but are also under extreme pressure to maintain service levels and uptime with well-tuned incident response and management. During a time when there is a critical reliance on digital services, traditional incident management processes and applications are not sufficient,” said Troy McAlpin, CEO at xMatters. “By adding adaptive incident management capabilities to our platform, we are automating repetitive, mundane work, and transforming how services are kept available for customers to enjoy.”
Technology organizations can minimize––and even avoid––the business impact from incidents by automating incident response and management with xMatters. New xMatters adaptive incident management features, which are accessed through the new Incident Console, and powered by xMatters’ visual workflow builder Flow Designer, solve the challenges of responding to service interruptions across different teams, cultures and systems.
New capabilities available from within the xMatters platform empower technology teams to:
■ Bridge incident management processes, data insights and disparate tools for cohesive and collaborative enterprise response automation:
- Incidents Views help teams find, evaluate and collaborate on incidents as they evolve, keeping track of incident statuses and allowing for efficient use of resources.
- The Incident Console shows a live view of an incident as it evolves through its lifecycle, including incident status and severity, notified and engaged responders, associated collaboration channels and roles.
- Incident Status Reports include timelines, key metrics and associated collaboration channels to support stakeholder communications and postmortem activity.
■ Combine the power of human problem-solving and process automation for faster, more targeted resolution:
- Critical incident metrics are automatically collected from integrated systems then delivered with the initial notification, reducing manual look-ups and improving resolution time.
- Flow Designer automations keep related systems up-to-date, including issue tracking systems, service desks, status pages and other supporting systems.
■ Accelerate continuous improvement and service resilience through comprehensive incident data and advanced analytics:
- The Incident Timeline provides a detailed record of what happened and when.
- Resolved incident metrics give teams a way to compare information from one incident to the next so teams can build metrics based on consistent data and improve digital service resilience over time.
- Performance Analytics helps evaluate the cost and impact of incidents in an organization’s environment and identify areas of improvement.
These xMatters platform advances are powered by the technical underpinnings of the xMatters platform. This includes Flow Designer for orchestration and automation, integrations that can be used to build powerful toolchains, sophisticated on-call management, event flood control, and configurable dashboards for visual incident and group performance tracking. With xMatters Flow Designer, teams with roles ranging from software developers to IT generalists can create incident management workflows without a single line of code. This drag and drop experience has revolutionized the way IT, DevOps, Operations and SRE teams integrate, synchronize and automate incident management toolchains.
“Investing in avoiding known types of incidents is an important part of digital service delivery. However, innovative companies know that eliminating incidents completely is impossible in practice. Our mission, then, is to help customers resolve incidents as quickly as possible and minimize their impact,” said Tobias Dunn-Krahn, CTO at xMatters. “We’ve developed the technology needed to make this happen. Now, customers can access powerful yet flexible applications that build upon our renowned approach to workflow automation, range of integration capabilities and sophisticated enterprise grade features to deliver digital service resilience and continuous improvement. By applying an adaptive incident management methodology, teams can automate incident resolution tasks and maximize the time spent making customers happy.”
Michael Olson on the AI+ITOPS Podcast: "I really see AIOps as being a core requirement for observability because it ... applies intelligence to your telemetry data and your incident data ... to potentially predict problems before they happen."
Enterprise ITOM and ITSM teams have been welcoming of AIOps, believing that it has the potential to deliver great value to them as their IT environments become more distributed, hybrid and complex. Not so with DevOps teams. It's safe to say they've kept AIOps at arm's length, because they don't think it's relevant nor useful for what they do. Instead, to manage the software code they develop and deploy, they've focused on observability ...
The post-pandemic environment has resulted in a major shift on where SREs will be located, with nearly 50% of SREs believing they will be working remotely post COVID-19, as compared to only 19% prior to the pandemic, according to the 2020 SRE Survey Report from Catchpoint and the DevOps Institute ...
All application traffic travels across the network. While application performance management tools can offer insight into how critical applications are functioning, they do not provide visibility into the broader network environment. In order to optimize application performance, you need a few key capabilities. Let's explore three steps that can help NetOps teams better support the critical applications upon which your business depends ...
In Episode 8, Michael Olson, Director of Product Marketing at New Relic, joins the AI+ITOPS Podcast to discuss how AIOps provides real benefits to IT teams ...
Will Cappelli on the AI+ITOPS Podcast: "I'll predict that in 5 years time, APM as we know it will have been completely mutated into an observability plus dynamic analytics capability."
When you consider that the average end-user interacts with at least 8 applications, then think about how important those applications are in the overall success of the business and how often the interface between the application and the hardware needs to be updated, it's a potential minefield for business operations. Any single update could explode in your face at any time ...
Despite the efforts in modernizing and building a robust infrastructure, IT teams routinely deal with the application, database, hardware, or software outages that can last from a few minutes to several days. These types of incidents can cause financial losses to businesses and damage its reputation ...
In Episode 7, Will Cappelli, Field CTO of Moogsoft and Former Gartner Research VP, joins the AI+ITOPS Podcast to discuss the future of APM, AIOps and Observability ...