Sadly, natural disasters often cause major devastation and wreckage. They can make a business prone to widespread power outages, transportation stoppages, and massive flooding, interrupting day-to-day physical operations and revenue streams. But recent advances in computing – specifically, the advent of Cloud computing – have made today’s data centers and the businesses they support much more resilient.
For example, if the recent Hurricane Sandy had any silver lining, it was this: even as data centers in the northeast took a beating, Cloud service providers and the overall Internet infrastructure remained solid. Compuware’s own Outage Analyzer indicated only a few scattered outages, and major service disruptions were avoided. As a result, many area businesses saw minimal disruption to critical business processes conducted online, including CRM, SCM, content management and accounting, with the worst effects limited to infrastructure and applications located in the worst hit areas of Manhattan.
The distributed nature of the Cloud made this possible by addressing the holy grail of business continuity — eliminating single points of failure. The ability to host data center assets off-premise in remote, distributed data centers can protect data and applications from a disaster, even if it’s a storm system spanning several hundred miles. When it comes to maintaining application performance (speed) and continuity in the face of a major natural disaster — or the constant day-to-day volatility of the Internet for that matter — here are three key takeaways:
1. Use the Cloud for Business Continuity
One of the most understated use cases for the Cloud is business continuity. People often think of the Cloud as a way to save money and gain agility, but the Cloud is also built for back-up and recovery, with geographically dispersed networks.
We expect that many businesses are going to start thinking more seriously about disaster recovery in the Cloud. Many businesses can't afford to put in the redundancy they have in a Cloud solution with an on-premise solution and make it accessible to so many people regardless of their location. If you have two feet of water in your data center, your servers and backup are likely gone; but if you are on one or more Cloud platforms, you can just drive to your local fast-food restaurant or library and be up-and-running.
2. Make Sure Your Chosen Cloud Service Provider Can Perform at the Level You Expect
When you select a Cloud service provider, you should make sure they can support the level of application performance your business requires on a day-to-day basis. Many Cloud service providers offer availability guarantees, but all this means is that their servers are up and running — not necessarily that your application end users are having a fast, high-quality experience.
You should also expect your Cloud service provider to be able to seamlessly move your applications – even without your awareness — in the event of an impending localized disaster. Many Cloud service providers offer standard back-up and disaster recovery services that make continuous access to data and applications for their clients a non-issue.
The extent to which a Cloud service provider is responsible for your back-up and disaster recovery depends on how you are using the Cloud services. If you’re using Cloud services in a Software-as-a-Service (SaaS) business model — a mode of software delivery in which software and associated data are centrally hosted on the Cloud — the Cloud service provider bears responsibility for ensuring your apps are redundant.
On the other hand, if you’re using Cloud services in an Infrastructure-as-a-Service (IaaS) provision model — meaning you’re “renting” from the Cloud the equipment used to support operations, including storage, hardware, servers and networking components — responsibility for software management (including redundancy) remains with you.
3. Monitor Your Apps, 24x7
Even if you have the most reliable Cloud service provider in the world, there are still network and website components like CDNs, regional and local ISPs and third-party services that can degrade performance at the edge of the Internet. In fact, Compuware recently found that ad servers were the number one culprit when it comes to slowing or bringing down websites, choking the very sites from which they’re trying to generate revenue.
It doesn't take a natural disaster to create the first tear that rips apart other connections. Sometimes just one service getting hammered is all it takes to start a chain reaction that knocks your site off the web. Outages and slow-downs for network and website components can be completely random, and the truth is that the Internet has “little storms” like this all the time, caused by things as mundane as server failures, unplugged cables, backhoe-on-fiber collisions, and dragging fish boat anchors.
This means you need to take responsibility for understanding your own end-user experiences. You must monitor all your applications 24x7, storm or no storm, whether you’re using the Cloud or not. You must understand where your single points of failure are and eliminate them. You never want to get into a spot where your application is failing you, and it’s your customers letting you know.
In summary, regional presence should never determine one’s vulnerability to lost applications and data. Today’s data centers are more virtual than ever, and that’s a major plus in the face of all types of network events — natural disasters and otherwise. To cost-effectively protect your business operations, consider using the Cloud for business continuity; make sure your Cloud service provider meets your day-to-day application performance requirements as well as your back-up and disaster recovery requirements; and realize you are ultimately responsible for managing the performance of all your own applications, around the clock.
Stephen Pierzchala, Technology Strategist, Compuware APM's Center of Excellence.
Related Links:
The Latest
Developers need a tool that can be portable and vendor agnostic, given the advent of microservices. It may be clear an issue is occurring; what may not be clear is if it's part of a distributed system or the app itself. Enter OpenTelemetry, commonly referred to as OTel, an open-source framework that provides a standardized way of collecting and exporting telemetry data (logs, metrics, and traces) from cloud-native software ...
As SLOs grow in popularity their usage is becoming more mature. For example, 82% of respondents intend to increase their use of SLOs, and 96% have mapped SLOs directly to their business operations or already have a plan to, according to The State of Service Level Objectives 2023 from Nobl9 ...
Observability has matured beyond its early adopter position and is now foundational for modern enterprises to achieve full visibility into today's complex technology environments, according to The State of Observability 2023, a report released by Splunk in collaboration with Enterprise Strategy Group ...
Before network engineers even begin the automation process, they tend to start with preconceived notions that oftentimes, if acted upon, can hinder the process. To prevent that from happening, it's important to identify and dispel a few common misconceptions currently out there and how networking teams can overcome them. So, let's address the three most common network automation myths ...
Many IT organizations apply AI/ML and AIOps technology across domains, correlating insights from the various layers of IT infrastructure and operations. However, Enterprise Management Associates (EMA) has observed significant interest in applying these AI technologies narrowly to network management, according to a new research report, titled AI-Driven Networks: Leveling Up Network Management with AI/ML and AIOps ...
When it comes to system outages, AIOps solutions with the right foundation can help reduce the blame game so the right teams can spend valuable time restoring the impacted services rather than improving their MTTI score (mean time to innocence). In fact, much of today's innovation around ChatGPT-style algorithms can be used to significantly improve the triage process and user experience ...
Gartner identified the top 10 data and analytics (D&A) trends for 2023 that can guide D&A leaders to create new sources of value by anticipating change and transforming extreme uncertainty into new business opportunities ...
The only way for companies to stay competitive is to modernize applications, yet there's no denying that bringing apps into the modern era can be challenging ... Let's look at a few ways to modernize applications and consider what new obstacles and opportunities 2023 presents ...
As online penetration grows, retailers' profits are shrinking — with the cost of serving customers anytime, anywhere, at any speed not bringing in enough topline growth to best monetize even existing investments in technology, systems, infrastructure, and people, let alone new investments, according to Digital-First Retail: Turning Profit Destruction into Customer and Shareholder Value, a new report from AlixPartners and World Retail Congress ...