Every day, compelling new applications, built to support the needs of enterprises, are turning up in the cloud. As the significant benefits of these SaaS and hybrid cloud services become more evident, it's no surprise that cloud is playing an increasing role in enterprise application portfolios.
Over the last couple of years a new class of mission-critical SaaS applications providing core communication services (e.g., email, VoIP, online meetings, document storage/collaboration, etc.) have come to the fore, enabling organizations of any size to cost-effectively provide highly sophisticated services to their users.
However, while the reward is great, because these apps are mission-critical and deployed to your entire workforce, so is the risk. If your cloud-based CRM system is unavailable, the sales team is certainly impacted, but if email, IP and/or VoIP communications are unavailable, the entire organization takes a productivity hit.
To address this risk, IT must take a fresh look at how they monitor and manage these services. Moving your mission-critical apps to the cloud doesn't absolve IT of responsibility for the quality of service. If users can't access email, they are not going to call Microsoft or Google or Amazon. They are going to call the IT help desk and the IT team will be expected to fix the issue regardless of where it exists.
Therein lies the problem. With SaaS applications, IT does not have direct access to most of the server and network infrastructure running the services. They may have access to a service provider status dashboard, but those often do not provide anything close to real time information. Nor do they provide any information on the health and availability of the various networks (yours, the ISP's, the regional backbone, etc.) connecting the users to the service.
To effectively monitor and manage mission-critical SaaS applications, IT needs to be able to identify and isolate problems that may exist outside the infrastructure they own and operate. But how?
Bring on the Crowd
SaaS applications are by definition shared by a global community of customers. So it stands to reason that monitoring of these services could and should be done in a shared manner as well.
There are certainly examples of the crowd monitoring the cloud already happening in informal ways through Twitter. It's not uncommon for users to check Twitter when they are having problems with a cloud service. Twitter in effect becomes an impromptu global network of monitors, watching the service from hundreds of thousands of access points.
The problem with Twitter though is that it is primarily anecdotal and qualitative information and generally does not give organizations using mission-critical SaaS applications the fidelity needed to fix issues impacting users.
Despite Twitter's limitations as an IT tool, there is a lot to be said for the "power of the crowd" that is so fundamental to Twitter. What if IT could take that same model and use it to proactively monitor SaaS applications?
First, you need to go from ad hoc qualitative observations (e.g. "My email seems slow today") to consistent collection of performance data from a broad user community. This requires some type of active monitoring at the locations where users access their SaaS applications. Monitoring from the organization's points of access is critical. A solution that monitors from arbitrary points on the Internet will still be blind to local or ISP issues affecting a specific office.
Monitoring from a single location gives you real-time data for that location, which is certainly an improvement over the service provider dashboards, but that isn't enough. From a single point of access, an outage will look much the same regardless of whether it's local, in the network, or as the provider. This is where the crowd model comes in. By aggregating data from multiple locations, you can start to see trends and spot anomalies between them.
But why stop there? Why not aggregate data across all users of the SaaS service? The greater the number of monitoring points, the more accurately you can detect and isolate specific problem spots. Think of it like GPS for the cloud, pinpointing the issues that degrade service levels and user experience.
Armed with this level of visibility, IT could do a better job of optimizing their environment and minimizing the time to resolution of any service impacting issues. In doing so they regain the ability to ensure their users get consistent service and a high quality user experience.
A Call to Action
Obviously, no single consumer of a SaaS application can expect to gather all this data themselves. Cobbling together measurements from multiple office locations would be challenging enough and collecting data from other organizations would be downright impractical. This is where the industry needs to innovate and bring new SaaS solutions to market that enable IT organizations to realize the benefits of the cloud without losing the visibility and control they've had with their traditional systems.
The power of the crowd is a pervasive and growing force enabled by cloud-based technologies. Virtual crowds come together every day to do everything from building software to funding start-ups, from collecting funny cat pictures to overturning oppressive governments. Maybe it's time IT was able to leverage the power of the crowd to help manage the ever more complex array of cloud applications and services they depend on.
Patrick Carey is VP Product Management & Marketing at Exoprise.
The Latest
APMdigest and leading IT research firm Enterprise Management Associates (EMA) are partnering to bring you the EMA-APMdigest Podcast, a new podcast focused on the latest technologies impacting IT Operations. In Episode 2 - Part 2 Pete Goldin, Editor and Publisher of APMdigest, discusses Network Observability with Shamus McGillicuddy, Vice President of Research, Network Infrastructure and Operations, at EMA ...
Most organizations suffer from some form of alert noise. Alert noise is only going to increase as organizations support cloud-native applications spanning multiple public and private clouds, including ephemeral deployments and more. It's not going to get easier for organizations to understand the signal from all those alerts being sent. So what can be done about it? ...
This blog presents the case for a radical new approach to basic information technology (IT) education. This conclusion is based on a study of courses and other forms of IT education which purport to cover IT "fundamentals" ...
To achieve maximum availability, IT leaders must employ domain-agnostic solutions that identify and escalate issues across all telemetry points. These technologies, which we refer to as Artificial Intelligence for IT Operations, create convergence — in other words, they provide IT and DevOps teams with the full picture of event management and downtime ...
APMdigest and leading IT research firm Enterprise Management Associates (EMA) are partnering to bring you the EMA-APMdigest Podcast, a new podcast focused on the latest technologies impacting IT Operations. In Episode 2 - Part 1 Pete Goldin, Editor and Publisher of APMdigest, discusses Network Observability with Shamus McGillicuddy, Vice President of Research, Network Infrastructure and Operations, at EMA ...
CIOs have stepped into the role of digital leader and strategic advisor, according to the 2023 Global CIO Survey from Logicalis ...
Synthetic monitoring is crucial to deploy code with confidence as catching bugs with E2E tests on staging is becoming increasingly difficult. It isn't trivial to provide realistic staging systems, especially because today's apps are intertwined with many third-party APIs ...
Recent EMA field research found that ServiceOps is either an active effort or a formal initiative in 78% of the organizations represented by a global panel of 400+ IT leaders. It is relatively early but gaining momentum across industries and organizations of all sizes globally ...
Managing availability and performance within SAP environments has long been a challenge for IT teams. But as IT environments grow more complex and dynamic, and the speed of innovation in almost every industry continues to accelerate, this situation is becoming a whole lot worse ...
Harnessing the power of network-derived intelligence and insights is critical in detecting today's increasingly sophisticated security threats across hybrid and multi-cloud infrastructure, according to a new research study from IDC ...