The Changing Face of Network Downtime
October 02, 2014

Vess Bakalov
SevOne

Our connected world continues to transform into a mobile one. The network is a constant and fascinating companion, which grants us 24/7 access where communication is instant and takes place across an array of devices, unconstrained by physical barriers. As a result, the IT infrastructure is more critical than ever for business operations. Companies and organizations are calling upon a variety of technologies that are changing the face of today’s network — from mobile devices, to cloud services, to web-based applications.

And the strain on the network is not expected to decrease. In fact, Cisco reports that in two years, the number of devices connected to IP networks will be nearly three times the global population. At the same time, network management and performance challenges are also on the rise. The explosion of mobile, cloud and web-based apps makes it difficult to determine where, in today’s evolving world, the network begins and where it ends. As a result, service issues and outages are becoming more commonplace, prompting losses in revenue, customer satisfaction and employee productivity. A recent survey from Avaya puts the cost of network downtime at $140K to $540K per hour, with the wide variance depending on the characteristics of the business and its environment (e.g., vertical, risk tolerance).
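To put the hourly range in perspective, here is a minimal sketch of the arithmetic. The only figures taken from the survey are the $140K and $540K per-hour bounds; the function name and the three-hour duration are hypothetical, chosen purely for illustration.

```python
# Per-hour downtime cost bounds quoted from the Avaya survey above (USD).
LOW_PER_HOUR = 140_000
HIGH_PER_HOUR = 540_000


def downtime_cost_range(hours: float) -> tuple[float, float]:
    """Return the (low, high) cost estimate in USD for an outage of `hours`."""
    if hours < 0:
        raise ValueError("outage duration cannot be negative")
    return hours * LOW_PER_HOUR, hours * HIGH_PER_HOUR


# Hypothetical three-hour outage.
low, high = downtime_cost_range(3)
print(f"${low:,.0f} to ${high:,.0f}")  # $420,000 to $1,620,000
```

Where a given business falls inside that range depends on the factors the survey names, so the output is a bracket rather than a prediction.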

Over the past couple of months, we’ve seen high-profile network outages capturing headlines across the US. A large number of service providers were affected by the 512K Day issue, when the Internet routing table grew beyond what many legacy routers were designed to handle. Then, in August, more than 11 million Time Warner Cable (TWC) subscribers across 29 states lost service for about three hours, and just a week later, Facebook suffered its fourth outage in five months. In two of these three cases, the unavailability was blamed on configuration glitches and, as a result, was quickly resolved.

The Most Important Word for Every Network: Availability

But why do network outages seem to be popping up more frequently, affecting more people? It’s really a question of perception – more people are consuming more services and everyone expects to be connected around the clock, around the world, using any device.

In a blog post earlier this summer, Andrew Lerner, a Research Director for Gartner, zeroed in on the most important word associated with every network: availability. As he notes, “Performance, scalability, management, agility, etc. all require the network to actually be online.”

Unfortunately, he argues, most companies assume availability to be table stakes. I am not sure I agree with him entirely. Availability is table stakes. However, modern infrastructure, especially at service providers, is massively redundant. Pure availability is rarely the problem. More often, service outages are due to poor capacity planning, spurious events or changes that bring unanticipated consequences (like Pakistan inadvertently re-routing all YouTube traffic).

For smaller businesses in particular, unavailability of core services not only represents a loss of control and a loss of earnings, but also potentially a lesson in reputational damage. Without network performance management solutions, businesses are unnecessarily exposing themselves to risk. Technology should be detecting and even preventing outages automatically, without the need for manual intervention. Technical staff cannot be expected to continually gather and analyze data that might indicate an impending outage, nor can they be expected to act quickly enough to stave off an incident. While the likes of TWC and Facebook can rapidly recover from disruptive infrastructure issues, smaller organizations can’t, and that is why they must take steps to protect themselves.

Reacting to performance thresholds is not enough. To ensure a company’s network is available 24/7, it’s critical to predict problems before they become service impacting. Deploying solutions that log data and provide real-time analytics on large volumes of unstructured data is crucial to every IT department. These solutions give IT organizations better insight into the behavior of users, customers, applications and networks, allowing businesses to spot issues before they affect service, significantly reducing or, in some cases, eliminating downtime altogether.
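The difference between reacting to a threshold and predicting a breach can be sketched in a few lines. The example below is an illustration only, not a description of any vendor's product: it fits a simple least-squares line to recent samples and estimates how long until the trend reaches a limit. The function name, the sample data, and the 90 percent limit are all hypothetical, and real traffic is rarely this linear.

```python
from typing import Optional, Sequence


def hours_until_threshold(
    samples: Sequence[float],
    threshold: float,
    interval_hours: float = 1.0,
) -> Optional[float]:
    """Estimate hours until a metric reaches `threshold` from a linear trend.

    Returns None with fewer than two samples, 0.0 if the latest sample is
    already at or above the threshold, and None if the trend is flat or falling.
    """
    n = len(samples)
    if n < 2:
        return None
    if samples[-1] >= threshold:
        return 0.0

    # Ordinary least-squares fit of value against sample index.
    mean_x = (n - 1) / 2
    mean_y = sum(samples) / n
    sxx = sum((i - mean_x) ** 2 for i in range(n))
    sxy = sum((i - mean_x) * (y - mean_y) for i, y in enumerate(samples))
    slope = sxy / sxx
    if slope <= 0:
        return None
    intercept = mean_y - slope * mean_x

    # Index where the fitted line meets the threshold, relative to the last sample.
    crossing_index = (threshold - intercept) / slope
    return max(0.0, (crossing_index - (n - 1)) * interval_hours)


# Hypothetical hourly link utilization (percent), rising two points per hour.
utilization = [60.0, 62.0, 64.0, 66.0, 68.0, 70.0]
print(hours_until_threshold(utilization, threshold=90.0))  # 10.0
```

A static alert at 90 percent would stay silent on this data, while the trend estimate gives roughly ten hours of warning in which to add capacity or reroute traffic.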

Vess Bakalov is SVP, CTO and Co-Founder of SevOne.
