BSM and the Art of Discovery
Monitoring, Management and BSM: Discovery and Automation
September 22, 2011
Richard Ptak
Share this

Successfully implementing BSM in the daily operations of a data center brings into play a multitude of skills, tools, processes and personalities. Boundaries, responsibilities, authorities and accountabilities must be identified, negotiated and exercised in a complex, dynamic series of interactions. Properly done, the result will be a smooth, consistent and near problem-free operations environment where the goals of both IT and business partners are achieved with a minimum of effort.

OOPS! Someone has lost touch with the real world while chasing unicorns. The reality is that a successful implementation is merely the first step. Today’s data center and business service operations and delivery environments are, and will continue to be, highly dynamic. They require active, intelligent and increasingly automated oversight and control of available assets (including infrastructure, processes, etc.) for reliable service delivery.

Doing this requires active monitoring and management of the infrastructure, assets and processes involved in and impacting the delivery of a service. To assure reliable delivery at the level that meets business needs, services must be monitored and data made available to various business and operations staff.

The needed data includes:

- Performance and availability of the service

- The assets used by a service and their impact on service delivery

- Other assets in the environment – in-use/idle/available configurations and capabilities (bandwidth, size, speeds, etc.)

- Who/which service is using these assets

- What services assets can support

- Interdependencies between services

- The impact of re-assignment or change (priority to business, value received, cost of change, etc.)

Information based on this data must be available in a quickly comprehensible format to a wide list of consumers. This list will include operations staff, administration staff, service managers, line-of-business managers, as well as executives who want assurance that service SLAs are being met.

Getting accurate and timely data about the infrastructure and processes involved in service delivery doesn’t happen automatically. You need to assure that your monitoring solution will discover and report on process and infrastructure changes that will impact specific services. Typically, this involves building a model of the assets, infrastructure and processes used in service delivery and definition of dependencies.

Business requires change and adaptation to attract, service, and maintain customer (or user) satisfaction. New services must be developed and delivered. Infrastructure (both virtual and physical) will be altered, reassigned, and reconfigured as it adapts to evolving environmental, operational and business needs. Today’s highly dynamic data center operations will rapidly make a static model outdated and useless.

What is required is a monitoring solution with automated capabilities to build on and update the basic modeling engine. It must automatically discover and be able to integrate changes in these processes and infrastructure. It must be able to inform when such changes will affect service delivery.

Typically, infrastructure elements are part of the delivery of multiple, different services. Such elements can be manipulated by staff that are unaware of or don’t care about the impact on other services when changes are made. For example, one can imagine an infrastructure change that optimizes a delivery path for Service A under stress conditions. Service B shares infrastructure elements with Service A. The changes don’t normally affect B’s delivery, but will disrupt Service B when invoked. It is also possible to make changes that will benefit Service B (i.e. provide a redundant service path), but mask a failure in Service B primary resources. Only intelligent monitoring with an ability to create correlated service-oriented infrastructure views will provide the holistic visibility into infrastructure utilization needed to avoid such potential issues.

BSM requires dynamic, intelligent asset management able to provide a range of IT and business views that accurately represent service delivery and the operational environment (virtual and physical) that enables delivery. A critical element supporting that management is a monitoring solution that can automatically discover and dynamically adjust to changes in assets and infrastructure. And, that can detect and report on changes in shared assets that may affect service delivery.

About Rich Ptak

Rich Ptak, Managing Partner at Ptak, Noel & Associates LLC. has over 30 years experience in systems product management, working closely with Fortune 50 companies in developing product direction and strategies at a global level. Previously Ptak held positions as Senior Vice President at Hurwitz Group and D.H. Brown Associates. Earlier in his career he held engineering and marketing management positions with Western Electric’s Electronic Switch Manufacturing Division and Digital Equipment Corporation. He is frequently quoted in major business and trade press. Ptak holds a master’s in business administration from the University of Chicago and a master of science in engineering from Kansas State University.

Related Links:

www.ptaknoel.com

Share this

The Latest

July 25, 2024

The 2024 State of the Data Center Report from CoreSite shows that although C-suite confidence in the economy remains high, a VUCA (volatile, uncertain, complex, ambiguous) environment has many business leaders proceeding with caution when it comes to their IT and data ecosystems, with an emphasis on cost control and predictability, flexibility and risk management ...

July 24, 2024

In June, New Relic published the State of Observability for Energy and Utilities Report to share insights, analysis, and data on the impact of full-stack observability software in energy and utilities organizations' service capabilities. Here are eight key takeaways from the report ...

July 23, 2024

The rapid rise of generative AI (GenAI) has caught everyone's attention, leaving many to wonder if the technology's impact will live up to the immense hype. A recent survey by Alteryx provides valuable insights into the current state of GenAI adoption, revealing a shift from inflated expectations to tangible value realization across enterprises ... Here are five key takeaways that underscore GenAI's progression from hype to real-world impact ...

July 22, 2024
A defective software update caused what some experts are calling the largest IT outage in history on Friday, July 19. The impact reverberated through multiple industries around the world ...
July 18, 2024

As software development grows more intricate, the challenge for observability engineers tasked with ensuring optimal system performance becomes more daunting. Current methodologies are struggling to keep pace, with the annual Observability Pulse surveys indicating a rise in Mean Time to Remediation (MTTR). According to this survey, only a small fraction of organizations, around 10%, achieve full observability today. Generative AI, however, promises to significantly move the needle ...

July 17, 2024

While nearly all data leaders surveyed are building generative AI applications, most don't believe their data estate is actually prepared to support them, according to the State of Reliable AI report from Monte Carlo Data ...

July 16, 2024

Enterprises are putting a lot of effort into improving the digital employee experience (DEX), which has become essential to both improving organizational performance and attracting and retaining talented workers. But to date, most efforts to deliver outstanding DEX have focused on people working with laptops, PCs, or thin clients. Employees on the frontlines, using mobile devices to handle logistics ... have been largely overlooked ...

July 15, 2024

The average customer-facing incident takes nearly three hours to resolve (175 minutes) while the estimated cost of downtime is $4,537 per minute, meaning each incident can cost nearly $794,000, according to new research from PagerDuty ...

July 12, 2024

In MEAN TIME TO INSIGHT Episode 8, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses AutoCon with the conference founders Scott Robohn and Chris Grundemann ...

July 11, 2024

Numerous vendors and service providers have recently embraced the NaaS concept, yet there is still no industry consensus on its definition or the types of networks it involves. Furthermore, providers have varied in how they define the NaaS service delivery model. I conducted research for a new report, Network as a Service: Understanding the Cloud Consumption Model in Networking, to refine the concept of NaaS and reduce buyer confusion over what it is and how it can offer value ...