BigPanda Expands Platform
October 29, 2019
Share this

BigPanda announced a major expansion of its platform capabilities to enable IT Ops, network operations center (NOC), and DevOps teams to rapidly investigate and resolve incidents and outages in cloud-native and hybrid-cloud environments.

Leveraging its Open Box Machine Learning and its Open Integration Hub technologies, BigPanda ingests changes from disparate change feeds and tools, and correlates and analyzes these changes against alerts collected from enterprise monitoring tools to rapidly isolate the root cause change that resulted in an incident or outage.

“Today’s IT environments are very fast-moving and constantly changing. Changes in software and infrastructure occur several times a day at most enterprises, which dramatically increases the potential for unexpected incidents and outages. Unfortunately, legacy IT operations tools weren’t designed for environments of rapid change and are slowing down operations teams from discovering and resolving outages in a timely manner,” said Assaf Resnick, CEO and co-founder, BigPanda. “BigPanda’s new offering puts, for the first time, the root-cause change behind an outage at the IT Ops teams’ fingertips, slashing mean-time-to-resolution and improving the performance of critical systems and applications. This is a win for IT operations teams, their enterprises, and most importantly, their customers.”

As enterprises migrate to the cloud, their IT stacks are accelerating. These fast-moving IT stacks are subject to hundreds or thousands of changes on a constant basis and experience ever-shifting application and service topologies. Legacy IT operations tools and root cause analysis techniques are ineffective inside these fast-moving IT stacks. That’s because legacy tools and techniques were designed for slower-moving monolithic applications and IT stacks, where the root causes of problems were mostly related to infrastructure and hardware failures.

When IT Ops, NOC, and DevOps teams try to use legacy tools and techniques to support cloud-native and hybrid-cloud architectures and applications, incidents and outages become more frequent, last longer and have a wider impact footprint. This creates serious consequences for businesses in the form of higher operating costs, degraded performance and availability, SLA violations and penalties, and ultimately, unhappy customers and end-users.

The BigPanda platform expansion includes the following features designed to speed up incident and outage resolution:

- Root Cause Changes: BigPanda’s platform expansion equips IT Ops, NOC, and DevOps teams, for the first time, with the tools to contend with the thousands of regular application and infrastructure changes that cause incidents and outages.Leveraging out-of-the-box integrations with all major change feeds and tools, BigPanda’s Root Cause Changes feature ingests changes from any source of change data, including change management, change log, configuration management, and others. Subsequently, BigPanda’s Root Cause Changes feature uses machine learning (ML) to correlate and analyze this dataset alongside the dataset of alerts collected from monitoring tools.The ML-driven cross-correlation and analysis surfaces the root cause change that resulted in an incident or outage, enabling IT Ops, NOC and DevOps teams to rapidly handle the change and resolve the incident or outage.

- Real-time Topology Mesh. Another aspect of the BigPanda platform expansion is the launch of the Real-time Topology Mesh. This new capability makes BigPanda’s platform the first AIOps solution to provide a real-time topology model across the entire IT stack, including the dynamic infrastructures inside fast-moving IT stacks, by piecing together the third critical dataset for IT operations: topology data.Leveraging out-of-the-box integrations, BigPanda’s Real-time Topology Mesh ingests topology data from configuration management, cloud & virtualization management, service discovery, APM and CMDB tools to create a full-stack, always up-to-date topology model.For IT Ops, NOC and DevOps teams struggling to detect, investigate and resolve incidents and outages in fast-moving IT environments, BigPanda’s Real-time Topology Mesh significantly improves their ability to detect those incidents and outages, visualize them, identify their probable root cause, understand their impact on users and customers, and route them to the right teams for rapid resolution, all in real-time.

“The world of hybrid IT — with a mix of cloud-native and legacy, on-prem workloads — is here for the foreseeable future. Old approaches to problem solving in these complex, dynamic environments don’t work, in part because they typically don’t deliver insight into the relationship between changes and incidents,” said Nancy Gohring, senior analyst with 451 Research. “Correlating alerts, change events and topology can help teams narrow in on the cause of performance problems in modern application and infrastructure environments.”

With the launch of Root Cause Changes and Real-time Topology Mesh, BigPanda is now able to ingest the three critical datasets in IT operations: alerts, changes and topology, across all layers of fast-moving IT stacks, and use ML to correlate and analyze this data in real-time. This helps IT Ops, NOC and DevOps teams rapidly detect, investigate and resolve incidents and outages, minimizing the impact on users and customers.

Both new additions to the BigPanda platform, Root Cause Changes, and Real-time Topology Mesh, are currently available to select customers as part of a beta program, and will be generally available at the end of the year.

Share this

The Latest

November 14, 2019

A brief introduction to Applications Performance Monitoring (APM), breaking it down to a few key points, followed by a few important lessons which I have learned over the years ...

November 13, 2019

Research conducted by ServiceNow shows that Gen Zs, now entering the workforce, recognize the promise of technology to improve work experiences, are eager to learn from other generations, and believe they can help older generations be more open‑minded ...

November 12, 2019

We're in the middle of a technology and connectivity revolution, giving us access to infinite digital tools and technologies. Is this multitude of technology solutions empowering us to do our best work, or getting in our way? ...

November 07, 2019

Microservices have become the go-to architectural standard in modern distributed systems. While there are plenty of tools and techniques to architect, manage, and automate the deployment of such distributed systems, issues during troubleshooting still happen at the individual service level, thereby prolonging the time taken to resolve an outage ...

November 06, 2019

A recent APMdigest blog by Jean Tunis provided an excellent background on Application Performance Monitoring (APM) and what it does. A further topic that I wanted to touch on though is the need for good quality data. If you are to get the most out of your APM solution possible, you will need to feed it with the best quality data ...

November 05, 2019

Humans and manual processes can no longer keep pace with network innovation, evolution, complexity, and change. That's why we're hearing more about self-driving networks, self-healing networks, intent-based networking, and other concepts. These approaches collectively belong to a growing focus area called AIOps, which aims to apply automation, AI and ML to support modern network operations ...

November 04, 2019

IT outages happen to companies across the globe, regardless of location, annual revenue or size. Even the most mammoth companies are at risk of downtime. Increasingly over the past few years, high-profile IT outages — defined as when the services or systems a business provides suddenly become unavailable — have ended up splashed across national news headlines ...

October 31, 2019

APM tools are ideal for an application owner or a line of business owner to track the performance of their key applications. But these tools have broader applicability to different stakeholders in an organization. In this blog, we will review the teams and functional departments that can make use of an APM tool and how they could put it to work ...

October 30, 2019

Enterprises depending exclusively on legacy monitoring tools are falling behind in business agility and operational efficiency, according to a new study, Prevalence of Legacy Tools Paralyzes Enterprises' Ability to Innovate conducted by Forrester Consulting ...

October 29, 2019

Hyperconverged infrastructure is sometimes referred to as a "data center in a box" because, after the initial cabling and minimal networking configuration, it has all of the features and functionality of the traditional 3-2-1 virtualization architecture (except that single point of failure) ...