An Interview with VMware’s Al Sargent

August 31, 2010

Pete Goldin

Editor and Publisher

APMdigest

In BSMdigest’s exclusive interview, Al Sargent, Sr. Product Marketing Manager at VMware, discusses the Business Service Management news coming out of VMworld, and the monitoring and management challenges of the cloud.

BSM: Many companies in the Business Service Management space are exhibiting at, making announcements at and attending VMworld. Why is VMworld such an important event for monitoring and management software companies?

AS: The reason is that VMworld has become the de-facto conference for modern datacenter management. It is more than just one vendor’s conference.

Two of the most important trends in datacenter management are virtualization and cloud computing. VMware is at the center of virtualization with Vsphere, but in addition to that, VMware’s new vFabric cloud application platform provides customers with a pragmatic and evolutionary path to cloud computing.

BSM: What announcements did VMware make at VMworld to address the monitoring and management needs of the market?

AS: The main announcement was around the introduction of vFabric, and how Hyperic supports the monitoring of cloud applications. One key goal for Hyperic going forward is to provide best-in-class monitoring of cloud applications, whether running on our vFabric cloud application platform or a platform from another vendor.

Fulfilling this goal entails the following three capabilities: support for dynamic architectures and elastic capacity; extreme scalability to collect all the performance data from all the VMs in the data center; and monitoring a large number of application infrastructure components.

BSM: You mention the massive amount of performance data. Why is there so much more performance data coming out of the cloud?

AS: Think about a data center with a thousand servers, which is actually a pretty small datacenter. If you are collecting 1,000 metrics on each one of those 1,000 virtual machines every minute, that is one million metrics per minute that needs to be processed. Even a midsized firm can hit this level of metrics data.

BSM: So one factor is the number of VMs. Is another factor that there are more metrics coming out of each application because of all the changes that are going on?

AS: Exactly. That is a really good point. Another thing is this inexorable march towards web applications that streamline business processes and therefore need to accommodate surges in the business cycle. Every industry has these cycles, and applications need to be architected to accommodate the surges in demand that accompany them. As the infrastructure scales up and scales down, that is going to mean more changes in your datacenter and that is going to mean that at the peak of those surges you are going to have a lot of VMs throwing off a lot of performance metrics, and your monitoring tool has to be able to accommodate that.

BSM: So is it just a matter of scalability? The ability to handle many more metrics?

There are three elements. One is the peak number of VMs. Second, it is the fact that the number of those VMs varies over time. And third is the fact that the stakes are so much higher. During these surge periods, if your application is slow or unavailable entirely, for every minute of performance problems you are going to have significant lost revenue.

BSM: What are the current monitoring technologies missing that do not allow them be able to handle this new environment?

AS: We are going to see more metrics collected more and more frequently. Believe it or not, there are many monitoring tools that think it’s perfectly acceptable to capture metrics every 10 minutes. That might have worked fine in the late 90s when those tools first came out, but today that will not cut it.

We are seeing a need for monitoring tools that monitor very frequently - as frequently as one minute or less. There are two big drivers for this. One is the fact that consumer software is driving the enterprise software innovation. Think of Twitter: users expect that when you post something to Twitter, it is immediately available to the world. Users today expect software to work in real time, and that expectation weaves its way into the requirements for monitoring tools and reporting metrics.

The second point goes back to surges in web application workloads due to business cycles. To accommodate those surges, you need to figure out how much you need to dynamically scale up your virtualized environment. Doing that confidently requires that you collect performance metrics very frequently.

Here’s an example: Let’s say you only spin up a new app server VM if you have four datapoints indicating that the app is running slowly, because you don’t want to spin up a new VM based on a single, possibly spurious data point. If you collect metrics once every 15 minutes – a common setting among legacy tools – it will be a whole hour before you spawn a new VM. No business can afford an entire hour of sluggishness in its critical apps. You can imagine the conversation that the business would have with IT.

But let’s say you collect metrics once a minute. In four minutes, you’ll have four datapoints, and can confidently spin up that new VM. IT responds quickly, and the business and customers are happy.

BSM: At VMworld, VMware announced that the introduction of the vFabric cloud application platform will drive IT as a service. Do you see vFabric helping users of the cloud move to a more business-centric view of IT service?

Yes, vFabric lets IT move in that direction. IT can start to scale infrastructure more quickly in response to the needs of the business, and that frees them up to understand more about the cycles of the business. For instance, if IT serves a retail business, they can think about the major shopping days during the holidays, and when they’ll need to ramp up their infrastructure during those shopping days.

About Al Sargent

Al Sargent, Sr. Product Marketing Manager at VMware, handles product marketing for Hyperic, VMware's application monitoring product. He has 15+ years of experience in product management and marketing, business development, sales, and engineering at VMware, Oracle, Mercury and startups such as Wily Technology.

The Latest

The Great SaaS Hangover (and the Cure Nobody Is Talking About)

May 13, 2025

2020 was the equivalent of a wedding with a top-shelf open bar. As businesses scrambled to adjust to remote work, digital transformation accelerated at breakneck speed. New software categories emerged overnight. Tech stacks ballooned with all sorts of SaaS apps solving ALL the problems — often with little oversight or long-term integration planning, and yes frequently a lot of duplicated functionality ... But now the music's faded. The lights are on. Everyone from the CIO to the CFO is checking the bill. Welcome to the Great SaaS Hangover ...

OpenShift Monitoring: 5 Things You Need to Keep an Eye on

May 12, 2025

Regardless of OpenShift being a scalable and flexible software, it can be a pain to monitor since complete visibility into the underlying operations is not guaranteed ... To effectively monitor an OpenShift environment, IT administrators should focus on these five key elements and their associated metrics ...

AI Drives New Wave of Digital Transformation

May 09, 2025

An overwhelming majority of IT leaders (95%) believe the upcoming wave of AI-powered digital transformation is set to be the most impactful and intensive seen thus far, according to The Science of Productivity: AI, Adoption, And Employee Experience, a new report from Nexthink ...

Data Center Outage Frequency Decreasing

May 08, 2025

Overall outage frequency and the general level of reported severity continue to decline, according to the Outage Analysis 2025 from Uptime Institute. However, cyber security incidents are on the rise and often have severe, lasting impacts ...

6 Takeaways from the State of Observability for Media and Entertainment

May 07, 2025

In March, New Relic published the State of Observability for Media and Entertainment Report to share insights, data, and analysis into the adoption and business value of observability across the media and entertainment industry. Here are six key takeaways from the report ...

Simplifying Actionable Problem-Solving with Decision Optimization

May 06, 2025

Regardless of their scale, business decisions often take time, effort, and a lot of back-and-forth discussion to reach any sort of actionable conclusion ... Any means of streamlining this process and getting from complex problems to optimal solutions more efficiently and reliably is key. How can organizations optimize their decision-making to save time and reduce excess effort from those involved? ...

Cloud Cost Crisis: As Semiconductor Tariffs Loom, CIOs Face Budget Overruns

May 05, 2025

As enterprises accelerate their cloud adoption strategies, CIOs are routinely exceeding their cloud budgets — a concern that's about to face additional pressure from an unexpected direction: uncertainty over semiconductor tariffs. The CIO Cloud Trends Survey & Report from Azul reveals the extent continued cloud investment despite cost overruns, and how organizations are attempting to bring spending under control ...

The Results Are In: IT Professionals Want More AI and Automation Support

May 01, 2025

According to Auvik's 2025 IT Trends Report, 60% of IT professionals feel at least moderately burned out on the job, with 43% stating that their workload is contributing to work stress. At the same time, many IT professionals are naming AI and machine learning as key areas they'd most like to upskill ...

Immutable by Design: Reinventing Business Continuity and Disaster Recovery

April 30, 2025

Businesses that face downtime or outages risk financial and reputational damage, as well as reducing partner, shareholder, and customer trust. One of the major challenges that enterprises face is implementing a robust business continuity plan. What's the solution? The answer may lie in disaster recovery tactics such as truly immutable storage and regular disaster recovery testing ...

Embracing Cost-Effective Observability Through an OpenTelemetry Approach

April 29, 2025

IT spending is expected to jump nearly 10% in 2025, and organizations are now facing pressure to manage costs without slowing down critical functions like observability. To meet the challenge, leaders are turning to smarter, more cost effective business strategies. Enter stage right: OpenTelemetry, the missing piece of the puzzle that is no longer just an option but rather a strategic advantage ...