The State of Digital Operations: Downtime Costs $1 Million Per Hour
June 26, 2017

Eric Sigler
PagerDuty

Many organizations are struggling to resolve customer-impacting incidents quickly enough to preserve brand loyalty and revenue, according to PagerDuty's recent State of Digital Operations Report. Based on a two-part survey of more than 300 IT personnel in development and operations and more than 300 consumers, the report also revealed a perception gap among IT personnel who feel their organizations are properly equipped to support digital services, and pointed to DevOps practices as the strongest route to the best IT outcomes.

DevOps is King

While digital services are adding extra pressure on developers and IT operations teams, many practitioners identified a number of processes and tools that help them prevent disruptions.

According to the report, DevOps reigns supreme among IT organizations that feel they are equipped to handle the rise in digital services.

Other best practices employed by these organizations include incident management and modern development methods like agile or continuous delivery.

Nearly one third of these organizations are also using ChatOps, or conversation-driven development, to help support digital services.

The Digital Services Expectation Gap

The survey of IT personnel in development and operations illustrated a gap between IT teams' ability to fix disruptions in digital services and the performance consumers expect from these services. It found that IT teams take approximately twice as long to resolve consumer-impacting incidents as consumers are willing to wait for a service that isn't working properly. According to the survey results, 69.2 percent of consumers will stop trying or even leave a digital app or service if it takes more than 15 minutes to resolve a service disruption (i.e., the service stops working or slows down).

Meanwhile, 38.4 percent of organizations take at least 30 minutes to resolve IT incidents that impact consumer-facing digital services, increasing the chances that customers will leave during the time it takes to get things back up and running.


As 59.8 percent of consumers surveyed use digital services to complete tasks such as banking, making dinner reservations or finding transportation at least once a day, and 85.3 percent use these services at least once a week, it's no surprise that consumer brand loyalty is heavily influenced by digital experiences.

But the impact of IT disruptions doesn't stop at the developer and IT operations teams responsible for managing infrastructure. Nearly one third of respondents (32.7 percent) reported that one hour of IT downtime costs their companies $1 million or more, meaning stakeholders in the lines of business, ranging from finance to marketing to customer service, are also facing difficulties due to the disruptions.

IT Readiness Perception vs. Reality

The survey also showed a misalignment between IT professionals' perception of their organizations' readiness to deploy, manage and maintain digital services, and the high frequency of consumer-facing IT incidents they currently face. Although 83.9 percent of IT personnel who took the survey felt confident that their IT organization is prepared to support digital services, 59.1 percent of those who identified as prepared are still experiencing customer-impacting incidents (slowness or downtime) at least once a week.

They cited increased complexity resulting in more cognitive load, an increase in the number of tools and increased difficulty in capacity planning (e.g., increase in volume of data) as the top operations challenges, illustrating how the rise in digital service offerings has created operations challenges for IT organizations.

Consumers' demands for digital services will continue to grow alongside their high expectations for flawless experiences. To prepare for this, organizations must gain a better understanding of the digital customer journey and employ the right combination of digital operations practices. By turning to DevOps, event management and modern incident management, organizations will be able to not only close the customer expectation gap, but exceed customer expectations.

Eric Sigler is Head of DevOps at PagerDuty