Skip to main content

Why CIOs Need to Care More About Automating Network Operations

Song Pang
NetBrain Technologies

It is a challenging time to be the CIO of a large enterprise. In early January 2023, Salesforce laid off 10% of its staff — the most recent in a string of sizable layoffs among technology companies (many of those who aggressively hired during the height of the Covid-19 pandemic) as they cut operating costs in anticipation of an economic downturn. But as staff and budgets become more tightly scrutinized, and investment strategies become more conservative, IT departments will be asked to be even more proactive and responsive to the IT needs of the business to prevent service disruptions and security breaches, without the traditional linear relationship between infrastructure size and operational overhead.

At the same time, reported network outages globally continue to grow in frequency, duration and fiscal impact. And as migration to the cloud continues at a pace of nearly 5% per year, the amount of control over those cloud-based services typically decreases, which further increases operational risk and the potential for increased costs.

Overall outages were still alarmingly high in 2022 and each outage was getting more expensive

The Uptime Institute's Global Data Center Survey 2022 found that overall outages were still alarmingly high in 2022 and each outage was getting more expensive for those involved. Uptime suggests that more than two-thirds of all disruptions cost them over $100,000 in losses and a quarter of respondents said their most recent outage had cost them over $1 million- a significant increase over 2021. Alarmingly, the migration of IT service to public clouds is only exacerbating the situation since these behemoths are not immune from the outage issue, regularly reporting widespread and long in duration outages that affect hundreds or thousands of their clients which in turn affects millions of users across the globe.

Why are network service disruptions still such an issue?

There are several reasons including rapidly increasing network complexity including cloud-centric virtualized components, limited operational staff with the needed experience and knowledge, no good processes for leveraging knowledge from previous troubleshooting when the same problems reoccur, and most importantly, manual and bespoke network operational processes and approaches that have stagnated for decades. These processes are almost always inconsistent, non-repeatable and focus on individual device health rather than IT service delivery outcomes.

Modern Networks Demand Modern NetOps

Managing growing networks doesn't need to be complex even though the networks themselves are highly advanced. And the cost to manage these growing networks should not increase in a linear fashion either, since SME experience can easily be leveraged through no-code tools. And while the word "automation" may be associated with many IT projects that have come and gone over time with varying levels of success, network automation can now more easily be applied to any modern network to capture and replicate subject matter expertise to produce a high-quality and consistent network service delivery. Applying no-code network automation to the NetOps function allows the same approach to be used to proactively verify operational needs and prevent issues from occurring in the first place.

The first big shift in thinking for CIOs is to renew their focus on the business reasons that the network exists. This will yield a long list of desired outcomes (delivering essential applications running at required levels of performance, maintaining important security policies, etc.) that can be used to create an operational automation strategy. And since the list will be focused on business-aligned outcomes rather than device health, business leaders can incorporate their lists of needs as well.

For instance, the security operations teams may choose to automate the continuous verification of security boundaries to assure the hardware and software in place is always providing the protection it was designed to provide. The unified communication ops teams may also choose to automate the management of the network in the context of Voice-over-IP to assure that the quality of phone calls remains high, regardless of other applications and their use of the same common infrastructure.

4 Benefits of Network Automation

Applying no-code network automation to modern network operations enables:

■ Prevention of service failures long before they impact production. Automation can verify and validate long lists of operational parameters continuously, comparing the real-time observations to known expected behaviors, detecting problems in the making before they affect the bottom line.

■ Automation of preliminary problem diagnostics to make the network engineer's time more productive and speed up root cause determination. This can reduce service disruption duration and remedial costs by 75% or more.

■ Capturing and execution of SME remedial best practices allowing their knowledge to scale across geographies and timezones. Creating a repository of each subject matter expert's knowledge and making it available to be executed by the entire team enables engineers to solve issues that they may not specialize in and would otherwise result in escalations.

■ Accurate network visualization, performance and mapping in real-time. Every aspect of IT service delivery management relies on accurate understanding of the real-time digital infrastructure. Outdated documentation increases risk and costs time and money.

No-code network automation can make modern NetOps dramatically more effective when compared to the widely used and decades-old manual device-health-centric approaches. Automation revolutionizes NetOps with a strategic and forward-looking approach, far beyond the capabilities and scale of existing processes in place today. With the strategic goal to directly support business initiatives, while at the same time reducing risk and the overhead costs, no-code powered network automation becomes transformational. One of our customers recently estimated that no-code network automation reduced their operational servicing costs of their network by more than half, and even more astounding, reduced the duration of service degradations by more than 75%.

I'd urge any CIO or IT Director looking for the means to get in front of the chaos that defines their daily routine, to immediately consider how no-code network automation can be adopted as their strategic plan for the long haul.

Song Pang is CTO at NetBrain Technologies

The Latest

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...

When most people think about cybersecurity, they picture firewalls, encryption, and access controls — technical tools designed to protect systems and data. But beneath the technology lies a deeper set of principles about trust, decision-making, and resilience ... The best leaders don't eliminate risk. They manage it intelligently. And in many ways, cybersecurity offers a surprisingly useful playbook for doing exactly that ...

Many organizations assumed their infrastructure strategy was settled. It had been implemented, optimized and built into long-term plans. Recent changes in technology and vendor consolidation are forcing a second look. Cloud outages and licensing changes have exposed how much dependency exists on a small number of platforms. As a result, organizations are reevaluating whether those decisions still hold up under current conditions ...

Edge AI is strategically embedded in core IT and infrastructure spending across industries, according to the 2026 Edge AI Survey from ZEDEDA. The research shows that 83% of C-suite and IT executive respondents say edge AI is important to their core business strategy ...

As AI adoption accelerates, operational complexity — not model intelligence — is becoming the primary barrier to reliable AI at scale, according to the State of AI Engineering 2026 from Datadog ... The report highlights a compounding complexity challenge as AI systems scale ... Around 5% of AI model requests fail in production, with nearly 60% of those failures caused by capacity limits ...

Why CIOs Need to Care More About Automating Network Operations

Song Pang
NetBrain Technologies

It is a challenging time to be the CIO of a large enterprise. In early January 2023, Salesforce laid off 10% of its staff — the most recent in a string of sizable layoffs among technology companies (many of those who aggressively hired during the height of the Covid-19 pandemic) as they cut operating costs in anticipation of an economic downturn. But as staff and budgets become more tightly scrutinized, and investment strategies become more conservative, IT departments will be asked to be even more proactive and responsive to the IT needs of the business to prevent service disruptions and security breaches, without the traditional linear relationship between infrastructure size and operational overhead.

At the same time, reported network outages globally continue to grow in frequency, duration and fiscal impact. And as migration to the cloud continues at a pace of nearly 5% per year, the amount of control over those cloud-based services typically decreases, which further increases operational risk and the potential for increased costs.

Overall outages were still alarmingly high in 2022 and each outage was getting more expensive

The Uptime Institute's Global Data Center Survey 2022 found that overall outages were still alarmingly high in 2022 and each outage was getting more expensive for those involved. Uptime suggests that more than two-thirds of all disruptions cost them over $100,000 in losses and a quarter of respondents said their most recent outage had cost them over $1 million- a significant increase over 2021. Alarmingly, the migration of IT service to public clouds is only exacerbating the situation since these behemoths are not immune from the outage issue, regularly reporting widespread and long in duration outages that affect hundreds or thousands of their clients which in turn affects millions of users across the globe.

Why are network service disruptions still such an issue?

There are several reasons including rapidly increasing network complexity including cloud-centric virtualized components, limited operational staff with the needed experience and knowledge, no good processes for leveraging knowledge from previous troubleshooting when the same problems reoccur, and most importantly, manual and bespoke network operational processes and approaches that have stagnated for decades. These processes are almost always inconsistent, non-repeatable and focus on individual device health rather than IT service delivery outcomes.

Modern Networks Demand Modern NetOps

Managing growing networks doesn't need to be complex even though the networks themselves are highly advanced. And the cost to manage these growing networks should not increase in a linear fashion either, since SME experience can easily be leveraged through no-code tools. And while the word "automation" may be associated with many IT projects that have come and gone over time with varying levels of success, network automation can now more easily be applied to any modern network to capture and replicate subject matter expertise to produce a high-quality and consistent network service delivery. Applying no-code network automation to the NetOps function allows the same approach to be used to proactively verify operational needs and prevent issues from occurring in the first place.

The first big shift in thinking for CIOs is to renew their focus on the business reasons that the network exists. This will yield a long list of desired outcomes (delivering essential applications running at required levels of performance, maintaining important security policies, etc.) that can be used to create an operational automation strategy. And since the list will be focused on business-aligned outcomes rather than device health, business leaders can incorporate their lists of needs as well.

For instance, the security operations teams may choose to automate the continuous verification of security boundaries to assure the hardware and software in place is always providing the protection it was designed to provide. The unified communication ops teams may also choose to automate the management of the network in the context of Voice-over-IP to assure that the quality of phone calls remains high, regardless of other applications and their use of the same common infrastructure.

4 Benefits of Network Automation

Applying no-code network automation to modern network operations enables:

■ Prevention of service failures long before they impact production. Automation can verify and validate long lists of operational parameters continuously, comparing the real-time observations to known expected behaviors, detecting problems in the making before they affect the bottom line.

■ Automation of preliminary problem diagnostics to make the network engineer's time more productive and speed up root cause determination. This can reduce service disruption duration and remedial costs by 75% or more.

■ Capturing and execution of SME remedial best practices allowing their knowledge to scale across geographies and timezones. Creating a repository of each subject matter expert's knowledge and making it available to be executed by the entire team enables engineers to solve issues that they may not specialize in and would otherwise result in escalations.

■ Accurate network visualization, performance and mapping in real-time. Every aspect of IT service delivery management relies on accurate understanding of the real-time digital infrastructure. Outdated documentation increases risk and costs time and money.

No-code network automation can make modern NetOps dramatically more effective when compared to the widely used and decades-old manual device-health-centric approaches. Automation revolutionizes NetOps with a strategic and forward-looking approach, far beyond the capabilities and scale of existing processes in place today. With the strategic goal to directly support business initiatives, while at the same time reducing risk and the overhead costs, no-code powered network automation becomes transformational. One of our customers recently estimated that no-code network automation reduced their operational servicing costs of their network by more than half, and even more astounding, reduced the duration of service degradations by more than 75%.

I'd urge any CIO or IT Director looking for the means to get in front of the chaos that defines their daily routine, to immediately consider how no-code network automation can be adopted as their strategic plan for the long haul.

Song Pang is CTO at NetBrain Technologies

The Latest

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...

When most people think about cybersecurity, they picture firewalls, encryption, and access controls — technical tools designed to protect systems and data. But beneath the technology lies a deeper set of principles about trust, decision-making, and resilience ... The best leaders don't eliminate risk. They manage it intelligently. And in many ways, cybersecurity offers a surprisingly useful playbook for doing exactly that ...

Many organizations assumed their infrastructure strategy was settled. It had been implemented, optimized and built into long-term plans. Recent changes in technology and vendor consolidation are forcing a second look. Cloud outages and licensing changes have exposed how much dependency exists on a small number of platforms. As a result, organizations are reevaluating whether those decisions still hold up under current conditions ...

Edge AI is strategically embedded in core IT and infrastructure spending across industries, according to the 2026 Edge AI Survey from ZEDEDA. The research shows that 83% of C-suite and IT executive respondents say edge AI is important to their core business strategy ...

As AI adoption accelerates, operational complexity — not model intelligence — is becoming the primary barrier to reliable AI at scale, according to the State of AI Engineering 2026 from Datadog ... The report highlights a compounding complexity challenge as AI systems scale ... Around 5% of AI model requests fail in production, with nearly 60% of those failures caused by capacity limits ...