In BSMdigest’s exclusive interview, Al Sargent, Sr. Product Marketing Manager at VMware, discusses the Business Service Management news coming out of VMworld, and the monitoring and management challenges of the cloud.
BSM: Many companies in the Business Service Management space are exhibiting at, making announcements at and attending VMworld. Why is VMworld such an important event for monitoring and management software companies?
AS: The reason is that VMworld has become the de facto conference for modern datacenter management. It is more than just one vendor’s conference.
Two of the most important trends in datacenter management are virtualization and cloud computing. VMware is at the center of virtualization with vSphere, and in addition, VMware’s new vFabric cloud application platform provides customers with a pragmatic and evolutionary path to cloud computing.
BSM: What announcements did VMware make at VMworld to address the monitoring and management needs of the market?
AS: The main announcement was around the introduction of vFabric, and how Hyperic supports the monitoring of cloud applications. One key goal for Hyperic going forward is to provide best-in-class monitoring of cloud applications, whether running on our vFabric cloud application platform or a platform from another vendor.
Fulfilling this goal entails the following three capabilities: support for dynamic architectures and elastic capacity; extreme scalability to collect all the performance data from all the VMs in the data center; and monitoring a large number of application infrastructure components.
BSM: You mention the massive amount of performance data. Why is there so much more performance data coming out of the cloud?
AS: Think about a datacenter with a thousand virtual machines, which is actually a pretty small datacenter. If you are collecting 1,000 metrics on each one of those 1,000 virtual machines every minute, that is one million metrics per minute that need to be processed. Even a midsized firm can hit this level of metrics data.
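The arithmetic in this answer can be sketched in a few lines of Python. This is only an illustration of the numbers quoted above; the function name `metrics_per_minute` and the per-day extrapolation are assumptions added here, not figures from the interview.

```python
def metrics_per_minute(vm_count: int, metrics_per_vm: int,
                       interval_minutes: float = 1.0) -> float:
    """Average number of metric samples arriving per minute.

    Hypothetical helper: every VM reports `metrics_per_vm` metrics
    once per `interval_minutes`.
    """
    return vm_count * metrics_per_vm / interval_minutes


# Figures from the interview: 1,000 VMs, 1,000 metrics each, every minute.
per_minute = metrics_per_minute(1_000, 1_000)

# Extrapolation (an assumption: the same rate holds around the clock).
per_day = per_minute * 60 * 24

print(f"{per_minute:,.0f} metrics per minute")
print(f"{per_day:,.0f} metrics per day")
```

At a constant rate, one million metrics per minute works out to 1.44 billion data points per day, which gives a sense of why scalability is listed as a core requirement.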
BSM: So one factor is the number of VMs. Is another factor that there are more metrics coming out of each application because of all the changes that are going on?
AS: Exactly. That is a really good point. Another thing is this inexorable march towards web applications that streamline business processes and therefore need to accommodate surges in the business cycle. Every industry has these cycles, and applications need to be architected to accommodate the surges in demand that accompany them. As the infrastructure scales up and scales down, that is going to mean more changes in your datacenter and that is going to mean that at the peak of those surges you are going to have a lot of VMs throwing off a lot of performance metrics, and your monitoring tool has to be able to accommodate that.
BSM: So is it just a matter of scalability? The ability to handle many more metrics?
AS: There are three elements. The first is the peak number of VMs. The second is the fact that the number of those VMs varies over time. And the third is the fact that the stakes are so much higher. During these surge periods, if your application is slow or unavailable entirely, every minute of performance problems means significant lost revenue.
BSM: What are current monitoring technologies missing that prevents them from handling this new environment?
AS: We are going to see more metrics collected more and more frequently. Believe it or not, there are many monitoring tools that think it’s perfectly acceptable to capture metrics every 10 minutes. That might have worked fine in the late 90s when those tools first came out, but today that will not cut it.
We are seeing a need for monitoring tools that collect metrics very frequently, as often as once a minute or more. There are two big drivers for this. One is the fact that consumer software is driving enterprise software innovation. Think of Twitter: users expect that when they post something to Twitter, it is immediately available to the world. Users today expect software to work in real time, and that expectation weaves its way into the requirements for monitoring tools and reporting metrics.
The second point goes back to surges in web application workloads due to business cycles. To accommodate those surges, you need to figure out how much you need to dynamically scale up your virtualized environment. Doing that confidently requires that you collect performance metrics very frequently.
Here’s an example: Let’s say you only spin up a new app server VM if you have four datapoints indicating that the app is running slowly, because you don’t want to spin up a new VM based on a single, possibly spurious data point. If you collect metrics once every 15 minutes – a common setting among legacy tools – it will be a whole hour before you spawn a new VM. No business can afford an entire hour of sluggishness in its critical apps. You can imagine the conversation that the business would have with IT.
But let’s say you collect metrics once a minute. In four minutes, you’ll have four datapoints, and can confidently spin up that new VM. IT responds quickly, and the business and customers are happy.
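The example above can be sketched as a small Python model. This is a hedged illustration, not how Hyperic or vFabric implements scaling: the names `detection_delay_minutes` and `ScaleUpTrigger`, the 500 ms threshold, and the choice to require four consecutive slow datapoints are all assumptions made for the sketch. The delay calculation follows the interview's arithmetic of one full collection interval per datapoint.

```python
from collections import deque


def detection_delay_minutes(interval_minutes: float,
                            required_datapoints: int = 4) -> float:
    """Time needed to gather enough datapoints before acting."""
    return interval_minutes * required_datapoints


class ScaleUpTrigger:
    """Signals a scale-up only after N consecutive slow datapoints.

    Hypothetical class: a single slow sample is ignored so that one
    spurious reading does not spawn a new VM.
    """

    def __init__(self, threshold_ms: float, required_datapoints: int = 4):
        self.threshold_ms = threshold_ms
        # Keep only the most recent N slow/not-slow flags.
        self.recent = deque(maxlen=required_datapoints)

    def observe(self, response_time_ms: float) -> bool:
        """Record one datapoint; return True if a new VM should be started."""
        self.recent.append(response_time_ms > self.threshold_ms)
        return len(self.recent) == self.recent.maxlen and all(self.recent)


print(detection_delay_minutes(15))  # legacy 15-minute collection interval
print(detection_delay_minutes(1))   # one-minute collection interval

trigger = ScaleUpTrigger(threshold_ms=500)  # threshold is an assumed value
decisions = [trigger.observe(ms) for ms in (620, 710, 680, 900)]
print(decisions)
```

With a 15-minute interval the trigger cannot fire for 60 minutes, while a one-minute interval brings that down to 4 minutes with the same four-datapoint safeguard.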
BSM: At VMworld, VMware announced that the introduction of the vFabric cloud application platform will drive IT as a service. Do you see vFabric helping users of the cloud move to a more business-centric view of IT service?
AS: Yes, vFabric lets IT move in that direction. IT can start to scale infrastructure more quickly in response to the needs of the business, and that frees IT up to understand more about the cycles of the business. For instance, if IT serves a retail business, it can plan for the major shopping days during the holidays and for when infrastructure will need to ramp up on those days.
About Al Sargent
Al Sargent, Sr. Product Marketing Manager at VMware, handles product marketing for Hyperic, VMware's application monitoring product. He has 15+ years of experience in product management and marketing, business development, sales, and engineering at VMware, Oracle, Mercury and startups such as Wily Technology.