Widespread Downtime Found in 99 Percent of Cloud Environments
December 12, 2017
Share this

Downtime and security risks were present in each cloud environment tested, according to 2016 Private Cloud Resiliency Benchmarks, a report from Continuity Software.

The study also found that security and performance risks were found in 99 percent and 97 percent of the environments respectively, with 82 percent of the companies facing data loss risks.

Some of the top risks identified across the private cloud environments include:

■ Configuration drifts between cluster nodes that prevent failover. Examples for such discrepancies range from the most trivial – e.g., a file that is not accessible by all hosts in the cluster – to more complex ones – such as incorrect settings of affinity rules.

■ Virtual networking configuration errors leading to virtual machine isolation and downtime. Examples include incorrect Virtual Machine Port Group configurations and resources misalignment between ESXi cluster hosts leading to a single point of failure.

■ Incorrect storage settings leading to corrupt backups and data store loss. Such risks range from invalid CBT configuration to inconsistent LUN numbering and incorrect UUID settings.

What do these private cloud environments look like?

■ 48 percent of the organizations included in the study run their virtual machines on Windows compared to 7 percent of the organizations that run on Linux. 46 percent of the organizations use a mix of operating systems.

■ Close to three quarters (73 percent) of the organizations use EMC data storage systems. Other storage systems used include NetApp (38 percent), IBM (26 percent), HP (24 percent) and Hitachi (18 percent).

■ 27 percent of the organizations use replication for automated offsite data protection.

■ 12 percent of the organizations utilize active-active failover for continuous availability.

■ Almost all of the organizations (96 percent) use more than one physical path to transfer data between the host and the external storage device.

With a growing level of the complexity, increasing interdependence among infrastructure components, and an escalating pace of change, keeping cloud infrastructure free of risky misconfiguration is becoming a challenge that most organizations fail to meet.

"Sooner or later, every system fails," said Gil Hecht, CEO of Continuity Software. "And when a popular service goes down, it doesn't take long for customers to notice."

Each year enterprises continue to encounter downtime, which currently costs an estimated $740,000 per outage according to Ponemon's most recent report.

"The good news is that most risks lurking in the cloud infrastructure can be identified and corrected before they turn into a service disruption," explained Hecht. "This requires a specialized set of processes and tools, but above all a mindset and strategy focused on early detection and the remediation of risks."

Share this

The Latest

October 10, 2019

The requirements of an APM tool are now much more complex than they've ever been. Not only do they need to trace a user transaction across numerous microservices on the same system, but they also need to happen pretty fast ...

October 09, 2019

Performance monitoring is an old problem. As technology has advanced, we've had to evolve how we monitor applications. Initially, performance monitoring largely involved sending ICMP messages to start troubleshooting a down or slow application. Applications have gotten much more complex, so this is no longer enough. Now we need to know not just whether an application is broken, but why it broke. So APM has had to evolve over the years for us to get there. But how did this evolution take place, and what happens next? Let's find out ...

October 08, 2019

There are some IT organizations that are using DevOps methodology but are wary of getting bogged down in ITSM procedures. But without at least some ITSM controls in place, organizations lose their focus on systematic customer engagement, making it harder for them to scale ...

October 07, 2019
OK, I admit it. "Service modeling" is an awkward term, especially when you're trying to frame three rather controversial acronyms in the same overall place: CMDB, CMS and DDM. Nevertheless, that's exactly what we did in EMA's most recent research: <span style="font-style: italic;">Service Modeling in the Age of Cloud and Containers</span>. The goal was to establish a more holistic context for looking at the synergies and differences across all these areas ...
October 03, 2019

If you have deployed a Java application in production, you've probably encountered a situation where the application suddenly starts to take up a large amount of CPU. When this happens, application response becomes sluggish and users begin to complain about slow response. Often the solution to this problem is to restart the application and, lo and behold, the problem goes away — only to reappear a few days later. A key question then is: how to troubleshoot high CPU usage of a Java application? ...

October 02, 2019

Operations are no longer tethered tightly to a main office, as the headquarters-centric model has been retired in favor of a more decentralized enterprise structure. Rather than focus the business around a single location, enterprises are now comprised of a web of remote offices and individuals, where network connectivity has broken down the geographic barriers that in the past limited the availability of talent and resources. Key to the success of the decentralized enterprise model is a new generation of collaboration and communication tools ...

October 01, 2019

To better understand the AI maturity of businesses, Dotscience conducted a survey of 500 industry professionals. Research findings indicate that although enterprises are dedicating significant time and resources towards their AI deployments, many data science and ML teams don't have the adequate tools needed to properly collaborate on, build and deploy AI models efficiently ...

September 30, 2019

Digital transformation, migration to the enterprise cloud and increasing customer demands are creating a surge in IT complexity and the associated costs of managing it. Technical leaders around the world are concerned about the effect this has on IT performance and ultimately, their business according to a new report from Dynatrace, based on an independent global survey of 800 CIOs, Top Challenges for CIOs in a Software-Driven, Hybrid, Multi-Cloud World ...

September 26, 2019

APM tools are your window into your application's performance — its capacity and levels of service. However, traditional APM tools are now struggling due to the mismatch between their specifications and expectations. Modern application architectures are multi-faceted; they contain hybrid components across a variety of on-premise and cloud applications. Modern enterprises often generate data in silos with each outflow having its own data structure. This data comes from several tools over different periods of time. Such diversity in sources, structure, and formats present unique challenges for traditional enterprise tools ...

September 25, 2019

Today's organizations clearly understand the value of digital transformation and its ability to spark innovation. It's surprising that fewer than half of organizations have undertaken a digital transformation project. Workfront has identified five of the top challenges that IT teams face in digital transformation — and how to overcome them ...