Skip to main content

The Role of Automation in Network Operations Isn't What You Think

Song Pang
NetBrain Technologies

Nearly all CIOs have seen IT automation projects get derailed, often because they try to do too much. But the IT skills gaps, layoffs or flat budgets and the increasing complexity of networks, automation is often the only way to scale up network management processes. Changes in technology have made network automation much more accessible to the point that there are low-code and no-code options available.

Any repetitive network management tasks can, and arguably should, be automated. Here are four examples of tasks that can be automated successfully with today's technology.

Task 1: Making network documentation more accurate and comprehensive

Network documentation is essential for a variety of reasons, but large enterprise networks change so quickly that documentation goes out of date almost as soon as it's completed. And more troubling, documentation that simply identifies device connectivity is simply not enough in an era of multi-vendor and multi-cloud digital infrastructures.

Documentation must have the ability to guide network engineers more intelligently as they solve service problems, which requires a clear understanding of all the device operating conditions (physical and virtual), the connection topology of devices, how traffic flows bi-directionally and the desired behaviors that results. And this all must be available at the touch of a button. Network automation is the perfect platform to enable this, since it can work in the background constantly maintaining this multi-facet model of any network in near real-time.

Task 2: Proactively looking for anomalous conditions such as outdated configurations or insecure passwords

Every network has a set of architectures that have been defined to support the business and its mission critical applications. By using no-code approaches to allow any engineer to translate the parameters of these architectures into validation logic, automation can be leveraged to execute verifications at scale.

By doing so, most network problems can be detected and corrected before they materially affect production services. These problem types range from available capacity and service delivery performance to security management and resilience. Device passwords must always be verified to be secure; failover links must be tested to assure they are available during times of stress; and device configurations must be tested to make sure required operating parameters are in effect. These and a hundred other scenarios can be crafted through automation to establish and maintain confidence in the operating baseline.

Task 3: The workflows associated with network troubleshooting

In every enterprise and MSP, there is a constant stream of operational service tasks, or tickets, that must be handled. The resolution of each of these tickets typically requires a set of repetitive steps that must be executed each time manually. And to make matters worse, the same service task being handled by different network engineers may be using completely different sets of steps based on their level of expertise and experience. The result is vastly inconsistent remedies.

Automation can capture the best practices for the majority of problem types and then make those available to engineers across the planet. And since those steps are repetitive, what may have taken hours to execute by hand may take minutes to execute by machine. Network engineers can leverage automation to run this golden set of diagnostics quickly, allowing them to focus on the harder networking issues which may be infrequent or deeply complex. This use of automation for troubleshooting results in lower MTTR, fewer tickets, faster MTTI, and improved resource management.

Task 4: Creating a more secure Change Management environment

Change is one of the most critical aspects of keeping every digital infrastructure up and running and in direct support of the business. And while there have been countless change management solutions over the years from more than a hundred vendors, they all lack the ability to understand the service delivery aspects of change, and they lack the ability to automatically verify that the change was not only completed successfully, but the results were as expected. Simply put, traditional change management solutions may successfully enable device changes to be made, but without an automated way to verify business service impacts, the business itself may suffer unforeseen dependencies.

A strong understanding of the network using a comprehensive digital twin coupled with an automation engine that can provide the means to verify the business services that traverse each and every device, both before and after change is made, is an entirely new way of thinking about change management- in the context of business service delivery, rather than device health.

Ensuring critical applications and IT services perform well is key to the operation and success of any business. No-code network automation enables fundamental change to long-established yet manual workflows, and in doing so, provides a level of consistency and operational performance never previously imagined. Network automation eliminates the tedium found in current processes and reduces the reliance on labor intensive tasks which are repetitive in nature. Network engineers can now rely on automation to handle the first two-thirds of the remedial work that they would otherwise manually do, allowing them to focus on more strategic and forward-looking work.

In broad strokes, automating NetOps enables outage prevention (which preserves the company's livelihood), troubleshooting scale (which saves time and money), application services to be delivered as needed (which increases revenue), network security to be continuously verified (which protects the business), and protected change (which eliminates the unintended consequences typically associated with change). Network automation can elevate network operations from tactical to strategic and bring simplicity and efficiency to NetOps teams.

Song Pang is CTO at NetBrain Technologies

Hot Topics

The Latest

As AI moves from generating responses to performing actions, the need for trust increases exponentially. And as organizations enlist AI agents for increasingly sophisticated business processes, trust is going to be the single most important theme for spurring adoption. What can organizations do to build trustworthy AI agents? ...

I've spent a lot of time in the channel, and one thing I keep coming back to is this: a partner program is only as good as what it looks like in the field. Many programs look great on paper, but when a partner is in front of a customer navigating a complex hybrid environment or trying to make the case for AI-powered observability, the gap between what a vendor promises and what it actually delivers becomes very clear, very fast ...

Enterprises today operate in a real-time environment where uninterrupted access to trusted data has become a baseline expectation for users, applications and automated systems. Traditional DataOps models, built on manual effort and human triage, cannot keep pace with this always active demand. AI agents are emerging as the operational backbone, ensuring consistent data availability, reinforcing trustworthiness and enabling a level of scale that manual processes cannot achieve ...

For decades, trust in the digital workplace rested on familiar signals. We trusted faces on video calls, voices on the phone, and emails that appeared to come from people we knew. These cues felt human and intuitive. They anchored how decisions were made, approvals were granted, and access was authorized. AI-powered deepfakes have quietly broken that model ...

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

The Role of Automation in Network Operations Isn't What You Think

Song Pang
NetBrain Technologies

Nearly all CIOs have seen IT automation projects get derailed, often because they try to do too much. But the IT skills gaps, layoffs or flat budgets and the increasing complexity of networks, automation is often the only way to scale up network management processes. Changes in technology have made network automation much more accessible to the point that there are low-code and no-code options available.

Any repetitive network management tasks can, and arguably should, be automated. Here are four examples of tasks that can be automated successfully with today's technology.

Task 1: Making network documentation more accurate and comprehensive

Network documentation is essential for a variety of reasons, but large enterprise networks change so quickly that documentation goes out of date almost as soon as it's completed. And more troubling, documentation that simply identifies device connectivity is simply not enough in an era of multi-vendor and multi-cloud digital infrastructures.

Documentation must have the ability to guide network engineers more intelligently as they solve service problems, which requires a clear understanding of all the device operating conditions (physical and virtual), the connection topology of devices, how traffic flows bi-directionally and the desired behaviors that results. And this all must be available at the touch of a button. Network automation is the perfect platform to enable this, since it can work in the background constantly maintaining this multi-facet model of any network in near real-time.

Task 2: Proactively looking for anomalous conditions such as outdated configurations or insecure passwords

Every network has a set of architectures that have been defined to support the business and its mission critical applications. By using no-code approaches to allow any engineer to translate the parameters of these architectures into validation logic, automation can be leveraged to execute verifications at scale.

By doing so, most network problems can be detected and corrected before they materially affect production services. These problem types range from available capacity and service delivery performance to security management and resilience. Device passwords must always be verified to be secure; failover links must be tested to assure they are available during times of stress; and device configurations must be tested to make sure required operating parameters are in effect. These and a hundred other scenarios can be crafted through automation to establish and maintain confidence in the operating baseline.

Task 3: The workflows associated with network troubleshooting

In every enterprise and MSP, there is a constant stream of operational service tasks, or tickets, that must be handled. The resolution of each of these tickets typically requires a set of repetitive steps that must be executed each time manually. And to make matters worse, the same service task being handled by different network engineers may be using completely different sets of steps based on their level of expertise and experience. The result is vastly inconsistent remedies.

Automation can capture the best practices for the majority of problem types and then make those available to engineers across the planet. And since those steps are repetitive, what may have taken hours to execute by hand may take minutes to execute by machine. Network engineers can leverage automation to run this golden set of diagnostics quickly, allowing them to focus on the harder networking issues which may be infrequent or deeply complex. This use of automation for troubleshooting results in lower MTTR, fewer tickets, faster MTTI, and improved resource management.

Task 4: Creating a more secure Change Management environment

Change is one of the most critical aspects of keeping every digital infrastructure up and running and in direct support of the business. And while there have been countless change management solutions over the years from more than a hundred vendors, they all lack the ability to understand the service delivery aspects of change, and they lack the ability to automatically verify that the change was not only completed successfully, but the results were as expected. Simply put, traditional change management solutions may successfully enable device changes to be made, but without an automated way to verify business service impacts, the business itself may suffer unforeseen dependencies.

A strong understanding of the network using a comprehensive digital twin coupled with an automation engine that can provide the means to verify the business services that traverse each and every device, both before and after change is made, is an entirely new way of thinking about change management- in the context of business service delivery, rather than device health.

Ensuring critical applications and IT services perform well is key to the operation and success of any business. No-code network automation enables fundamental change to long-established yet manual workflows, and in doing so, provides a level of consistency and operational performance never previously imagined. Network automation eliminates the tedium found in current processes and reduces the reliance on labor intensive tasks which are repetitive in nature. Network engineers can now rely on automation to handle the first two-thirds of the remedial work that they would otherwise manually do, allowing them to focus on more strategic and forward-looking work.

In broad strokes, automating NetOps enables outage prevention (which preserves the company's livelihood), troubleshooting scale (which saves time and money), application services to be delivered as needed (which increases revenue), network security to be continuously verified (which protects the business), and protected change (which eliminates the unintended consequences typically associated with change). Network automation can elevate network operations from tactical to strategic and bring simplicity and efficiency to NetOps teams.

Song Pang is CTO at NetBrain Technologies

Hot Topics

The Latest

As AI moves from generating responses to performing actions, the need for trust increases exponentially. And as organizations enlist AI agents for increasingly sophisticated business processes, trust is going to be the single most important theme for spurring adoption. What can organizations do to build trustworthy AI agents? ...

I've spent a lot of time in the channel, and one thing I keep coming back to is this: a partner program is only as good as what it looks like in the field. Many programs look great on paper, but when a partner is in front of a customer navigating a complex hybrid environment or trying to make the case for AI-powered observability, the gap between what a vendor promises and what it actually delivers becomes very clear, very fast ...

Enterprises today operate in a real-time environment where uninterrupted access to trusted data has become a baseline expectation for users, applications and automated systems. Traditional DataOps models, built on manual effort and human triage, cannot keep pace with this always active demand. AI agents are emerging as the operational backbone, ensuring consistent data availability, reinforcing trustworthiness and enabling a level of scale that manual processes cannot achieve ...

For decades, trust in the digital workplace rested on familiar signals. We trusted faces on video calls, voices on the phone, and emails that appeared to come from people we knew. These cues felt human and intuitive. They anchored how decisions were made, approvals were granted, and access was authorized. AI-powered deepfakes have quietly broken that model ...

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ...