Skip to main content

An Interview with Zyrion CEO

Pete Goldin
APMdigest

In BSMdigest’s exclusive interview, Vikas Aggarwal, founder and CEO of Zyrion, discusses Business Service Management in the cloud, and new BSM technologies and approaches for the modern datacenter.

BSM: What do you see as the main monitoring challenges of private cloud?

VA: Within a private cloud environment, the monitoring approach for applications and services has to account for inter-dependencies and impacts of the shared and virtual infrastructure. A non-cloud infrastructure has applications housed within discrete servers that are connected to the network, and the contention of resources outside of the physical server is limited primarily to the network. Within a private cloud though, applications share the same underlying physical resources and one application can impact performance of a totally unrelated application or service just because of the virtual infrastructure. There are a lot more dimensions that can impact a service, all of which need to be accounted for by the monitoring solution.

BSM: How can technology provide a business-oriented view of the cloud computing infrastructure?

VA: Performance management and monitoring technology has to enable mapping the different components of the cloud to the supported business services. The monitoring approach starts by first looking at the performance and availability of the business services, and then the underlying components within the cloud computing infrastructure. Creating the mapping between cloud components and business services is easier to do within private clouds, but will require well-documented and rich APIs for public clouds.

BSM: Does cloud make BSM a requirement, and if so, why?

VA: Traditional approaches to performance monitoring focus on individual nodes and components in the IT infrastructure, while cloud infrastructure is a shared resource and individual performance indicators in isolation are meaningless. Furthermore, one might not even have access to individual metrics in public clouds. Focusing on the performance of services instead and correlating all the underlying components of the service is the only way IT can support the business.

BSM: How does IT get aligned with the company's business goals? Is it an issue of corporate culture?

VA: Senior managers tend to understand the value of service-oriented IT monitoring. In our customer survey, over 80% of our customers use the BSM features in our product, and in almost all cases, senior managers were using the BSM technology and dashboards on a regular basis. So, it has to be driven from the top and has to be come a part of the corporate culture.

BSM: How do you define "real-time visibility"?

VA: Real-time visibility means having real-time or near-real time metrics and data on the availability and performance of business services and the underlying IT infrastructure. Real-time means having dashboards where you are immediately alerted if a business service is performing poorly because of the underlying IT infrastructure. Real-time means being able to instantly drill-down from a BSM dashboard to the packet flow and isolating the root cause impacting a business service. Real-time means no swivel chair management, no waiting for another group to respond and provide answers, no waiting for the database to get the data in the night to churn out a report – all the answers and data is available right away for everyone when they need it.

BSM: What are Business Service Containers?

VA: Zyrion’s Business Service Containers are flexible, automated objects which represent Business Services in an organization. They allow an organization to create logical, business-oriented views of the overall physical and virtualized computing network. You can define different SLAs for different containers, create fault-tolerant redundant models within a container, have nested containers with cascading alarms or create containers that include tests and containers owned by other departments. Our Business Service Containers allow different departments and users to create views of the IT infrastructure that align with their roles with full flexibility and access control that is essential for adoption within the enterprise. Most importantly, our Business Container model is overlaid on top of our topology discovery model to reduce alarm floods and very rapid root cause discovery of Business Service downtime.

BSM: Why is configuration management important to BSM?

VA: Configuration management enables backup, restore and tracking of changes in network device configurations across the enterprise network. Proper tracking and notification of configuration changes in the network prevents unexpected outages, as well as helps to correlate undesired changes in network behavior with recent configuration changes. Having configuration management integrated with BSM is important because IT administrators can correlate network outages to configuration changes and understand the corresponding impact on dependent business services.

BSM: What is the advantage of having a distributed data collection and database instead of a centralized data?

VA: Zyrion’s Traverse has a unique, patented architecture where all the data is collected and stored in distributed databases – there is no centralized data warehouse, unlike other products. Our business correlation engine presents a unified view across this distributed database in real-time.

Having a distributed database and collection architecture allows the solution to scale to very large environments not possible with earlier generation products. In order to provide a unified BSM solution, the platform has to be able to collect data from the network, server and applications and correlate, analyze and present it in real time. A large segment of our customer base switched from the other products because a centralized data warehouse model did not scale. We have customers monitoring their IT infrastructure with over 10,000 servers and routers in multiple datacenters and close to a million metrics every 5 minutes, and not requiring a single dedicated engineer to maintain the solution. As customers demand full visibility into their Business and IT services, having a real-time scalable system is a must, and having a distributed database and data collection approach is key to handling the demands of the new IT datacenter.

BSM: Do different stakeholders need different dashboards that speak to their needs?

VA: Yes, that is essential. Within an integrated BSM environment, with technicians, managers and business owners as users, information needs to be presented in a way that is relevant to the user roles within the organization. We have customers with over 200 active users of our product ranging from CxOs, product managers, database administrators, IT architects and NOC staff. While the CIO is only interested in the status of the key business operation IT containers, the product managers have dashboards to view response time, number of users, transactions and key applications relevant to their products. The database, server and network architects use the performance data for future planning, while the NOC needs the event driven dashboard to see what problems exist within the network.

Even the reports generated by each group are different – the product managers need reports on online user growth and response time, while the database manager needs trend reports on transactions per second, and the IT operations manager needs uptime SLA reports. An alert on the server administrator dashboard might not show on the product manager dashboard because of a fault-tolerant architecture or redundant network paths. Providing role-specific views becomes even more relevant when using private or publiccloud environments.

BSM: How was Zyrion created?

VA: Zyrion is a spin out of a public company, focused on correlating the impact of IT infrastructure on Business Services. We were the first company to integrate packet and flow analysis with BSM, and hence reduce the downtime of Business Services by enabling quicker resolution.

About Vikas Aggarwal

Vikas Aggarwal is founder and CEO of Zyrion Inc., a provider of BSM & IT infrastructure monitoring software solutions. Vikas Aggarwal has been an entrepreneur and senior executive at multiple technology startups over the past 20 years. He was the founder and CEO of Fidelia, a venture-backed IT infrastructure management software company, where he led the company's growth to about 100 customers before their acquisition by Network General. At Network General, he was the VP of Product Management where he oversaw product strategy through their acquisition by Netscout in late 2007.

Hot Topic
The Latest
The Latest 10

The Latest

I've spent a lot of time in the channel, and one thing I keep coming back to is this: a partner program is only as good as what it looks like in the field. Many programs look great on paper, but when a partner is in front of a customer navigating a complex hybrid environment or trying to make the case for AI-powered observability, the gap between what a vendor promises and what it actually delivers becomes very clear, very fast ...

Enterprises today operate in a real-time environment where uninterrupted access to trusted data has become a baseline expectation for users, applications and automated systems. Traditional DataOps models, built on manual effort and human triage, cannot keep pace with this always active demand. AI agents are emerging as the operational backbone, ensuring consistent data availability, reinforcing trustworthiness and enabling a level of scale that manual processes cannot achieve ...

For decades, trust in the digital workplace rested on familiar signals. We trusted faces on video calls, voices on the phone, and emails that appeared to come from people we knew. These cues felt human and intuitive. They anchored how decisions were made, approvals were granted, and access was authorized. AI-powered deepfakes have quietly broken that model ...

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...

An Interview with Zyrion CEO

Pete Goldin
APMdigest

In BSMdigest’s exclusive interview, Vikas Aggarwal, founder and CEO of Zyrion, discusses Business Service Management in the cloud, and new BSM technologies and approaches for the modern datacenter.

BSM: What do you see as the main monitoring challenges of private cloud?

VA: Within a private cloud environment, the monitoring approach for applications and services has to account for inter-dependencies and impacts of the shared and virtual infrastructure. A non-cloud infrastructure has applications housed within discrete servers that are connected to the network, and the contention of resources outside of the physical server is limited primarily to the network. Within a private cloud though, applications share the same underlying physical resources and one application can impact performance of a totally unrelated application or service just because of the virtual infrastructure. There are a lot more dimensions that can impact a service, all of which need to be accounted for by the monitoring solution.

BSM: How can technology provide a business-oriented view of the cloud computing infrastructure?

VA: Performance management and monitoring technology has to enable mapping the different components of the cloud to the supported business services. The monitoring approach starts by first looking at the performance and availability of the business services, and then the underlying components within the cloud computing infrastructure. Creating the mapping between cloud components and business services is easier to do within private clouds, but will require well-documented and rich APIs for public clouds.

BSM: Does cloud make BSM a requirement, and if so, why?

VA: Traditional approaches to performance monitoring focus on individual nodes and components in the IT infrastructure, while cloud infrastructure is a shared resource and individual performance indicators in isolation are meaningless. Furthermore, one might not even have access to individual metrics in public clouds. Focusing on the performance of services instead and correlating all the underlying components of the service is the only way IT can support the business.

BSM: How does IT get aligned with the company's business goals? Is it an issue of corporate culture?

VA: Senior managers tend to understand the value of service-oriented IT monitoring. In our customer survey, over 80% of our customers use the BSM features in our product, and in almost all cases, senior managers were using the BSM technology and dashboards on a regular basis. So, it has to be driven from the top and has to be come a part of the corporate culture.

BSM: How do you define "real-time visibility"?

VA: Real-time visibility means having real-time or near-real time metrics and data on the availability and performance of business services and the underlying IT infrastructure. Real-time means having dashboards where you are immediately alerted if a business service is performing poorly because of the underlying IT infrastructure. Real-time means being able to instantly drill-down from a BSM dashboard to the packet flow and isolating the root cause impacting a business service. Real-time means no swivel chair management, no waiting for another group to respond and provide answers, no waiting for the database to get the data in the night to churn out a report – all the answers and data is available right away for everyone when they need it.

BSM: What are Business Service Containers?

VA: Zyrion’s Business Service Containers are flexible, automated objects which represent Business Services in an organization. They allow an organization to create logical, business-oriented views of the overall physical and virtualized computing network. You can define different SLAs for different containers, create fault-tolerant redundant models within a container, have nested containers with cascading alarms or create containers that include tests and containers owned by other departments. Our Business Service Containers allow different departments and users to create views of the IT infrastructure that align with their roles with full flexibility and access control that is essential for adoption within the enterprise. Most importantly, our Business Container model is overlaid on top of our topology discovery model to reduce alarm floods and very rapid root cause discovery of Business Service downtime.

BSM: Why is configuration management important to BSM?

VA: Configuration management enables backup, restore and tracking of changes in network device configurations across the enterprise network. Proper tracking and notification of configuration changes in the network prevents unexpected outages, as well as helps to correlate undesired changes in network behavior with recent configuration changes. Having configuration management integrated with BSM is important because IT administrators can correlate network outages to configuration changes and understand the corresponding impact on dependent business services.

BSM: What is the advantage of having a distributed data collection and database instead of a centralized data?

VA: Zyrion’s Traverse has a unique, patented architecture where all the data is collected and stored in distributed databases – there is no centralized data warehouse, unlike other products. Our business correlation engine presents a unified view across this distributed database in real-time.

Having a distributed database and collection architecture allows the solution to scale to very large environments not possible with earlier generation products. In order to provide a unified BSM solution, the platform has to be able to collect data from the network, server and applications and correlate, analyze and present it in real time. A large segment of our customer base switched from the other products because a centralized data warehouse model did not scale. We have customers monitoring their IT infrastructure with over 10,000 servers and routers in multiple datacenters and close to a million metrics every 5 minutes, and not requiring a single dedicated engineer to maintain the solution. As customers demand full visibility into their Business and IT services, having a real-time scalable system is a must, and having a distributed database and data collection approach is key to handling the demands of the new IT datacenter.

BSM: Do different stakeholders need different dashboards that speak to their needs?

VA: Yes, that is essential. Within an integrated BSM environment, with technicians, managers and business owners as users, information needs to be presented in a way that is relevant to the user roles within the organization. We have customers with over 200 active users of our product ranging from CxOs, product managers, database administrators, IT architects and NOC staff. While the CIO is only interested in the status of the key business operation IT containers, the product managers have dashboards to view response time, number of users, transactions and key applications relevant to their products. The database, server and network architects use the performance data for future planning, while the NOC needs the event driven dashboard to see what problems exist within the network.

Even the reports generated by each group are different – the product managers need reports on online user growth and response time, while the database manager needs trend reports on transactions per second, and the IT operations manager needs uptime SLA reports. An alert on the server administrator dashboard might not show on the product manager dashboard because of a fault-tolerant architecture or redundant network paths. Providing role-specific views becomes even more relevant when using private or publiccloud environments.

BSM: How was Zyrion created?

VA: Zyrion is a spin out of a public company, focused on correlating the impact of IT infrastructure on Business Services. We were the first company to integrate packet and flow analysis with BSM, and hence reduce the downtime of Business Services by enabling quicker resolution.

About Vikas Aggarwal

Vikas Aggarwal is founder and CEO of Zyrion Inc., a provider of BSM & IT infrastructure monitoring software solutions. Vikas Aggarwal has been an entrepreneur and senior executive at multiple technology startups over the past 20 years. He was the founder and CEO of Fidelia, a venture-backed IT infrastructure management software company, where he led the company's growth to about 100 customers before their acquisition by Network General. At Network General, he was the VP of Product Management where he oversaw product strategy through their acquisition by Netscout in late 2007.

Hot Topic
The Latest
The Latest 10

The Latest

I've spent a lot of time in the channel, and one thing I keep coming back to is this: a partner program is only as good as what it looks like in the field. Many programs look great on paper, but when a partner is in front of a customer navigating a complex hybrid environment or trying to make the case for AI-powered observability, the gap between what a vendor promises and what it actually delivers becomes very clear, very fast ...

Enterprises today operate in a real-time environment where uninterrupted access to trusted data has become a baseline expectation for users, applications and automated systems. Traditional DataOps models, built on manual effort and human triage, cannot keep pace with this always active demand. AI agents are emerging as the operational backbone, ensuring consistent data availability, reinforcing trustworthiness and enabling a level of scale that manual processes cannot achieve ...

For decades, trust in the digital workplace rested on familiar signals. We trusted faces on video calls, voices on the phone, and emails that appeared to come from people we knew. These cues felt human and intuitive. They anchored how decisions were made, approvals were granted, and access was authorized. AI-powered deepfakes have quietly broken that model ...

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...