Skip to main content

As CIOs Address App Sprawl, Observability Can't Be an Afterthought

Bill Lobig
Bill Lobig is VP of Apptio / IBM IT Automation
IBM

App sprawl has been a concern for technologists for some time, but it has never presented such a challenge as now. As organizations move to implement generative AI into their applications, it's only going to become more complex. In fact, a recent Canva report found that 72% of CIOs see application sprawl as a challenge — and with 71% of CIOs expecting to adopt 30-60 new apps this year, this complexity is poised to keep growing.

Potential solutions include consolidating applications, optimizing workflows, and automating IT processes to reduce strain on technologists so they can tackle the issue of app sprawl head-on. While these are all valid and necessary approaches, observability is a necessary component for understanding the vast amounts of complex data within AI-infused applications, and it must be the centerpiece of an app- and data-centric strategy to truly manage app sprawl.

Cracking the Code for AI App Sprawl Challenges

In a year of elevated global IT spend, ensuring investments aren't wasted is a necessity for overwhelmed technology leaders, who not only must make decisions around which technologies to implement, but also make sense of application performance amid growing tides of vast and complex data.

When AI enters the mix, it's even more important to have complete visibility, which many organizations still lack. Observability tools and practices can help technologists address AI app sprawl by providing visibility into the performance, behavior, and dependencies of AI applications. Unfortunately, many teams are still attempting to work with incomplete visibility, meaning they simply don't know what they don't know.

Simply put, traditional application performance monitoring (APM) tools can provide visibility to a certain degree, but they weren't built to necessarily account for the influx of generative AI applications that modern enterprises are dealing with.

End-users for generative AI-infused applications demand continuous availability and frictionless experience. However, with a lack of real-time visibility, they will feel when an outage or delay compromises their experience, particularly as applications span numerous platforms. Not to mention, the implementation and understanding of complex generative AI models' behaviors are still something many organizations are working to figure out. These potential blind spots could create significant performance, compliance, and security issues.

Observe the Full Stack So You Can Add to It

Organizations can better understand the internal state of their AI applications by analyzing their external outputs with observability. When this is connected to business outcomes and effectively addressed, technology teams can lay the groundwork for a well-functioning application monitoring and management process. So, when generative AI is introduced to the environment, the foundation is already in place to effectively see and optimize application processes.

As generative AI applications join the enterprise equation, observability tools are a must-have for facilitating the delivery of higher-quality software at a faster pace — these are best enabled through:

Finding and fixing the "unknown unknowns": You can't fix what you can't see. Unfortunately, many monitoring practices and tools can only address flaws that are previously known. Observability uncovers conditions that would be impossible to find manually or with traditional platforms. It then monitors the correlation to different performance flaws and gives context for discovering root causes, resulting in quick and easy remediation.

Detecting and remediating issues early on: With observability, monitoring is integrated into the initial stages of software development. It's then easy to pinpoint and rectify new code issues before they affect the service level agreements (SLAs) and customer experience.

Self-healing application infrastructure and automated resolution: Observability can be coupled with automation capabilities to anticipate issues from system outputs and resolve them autonomously without requiring manual intervention.

Scaling and load balancing: We need to be able to observe and control the current load on systems but also help forecast future demands. The data can be used to optimize applications in real-time without having the end-users feel any impact.

Cost management: From optimizing workloads to computational resources, SREs can and should find ways to save on VM, GPU, cloud and inferencing costs through observability.

Observability is crucial for organizations to address AI app sprawl and overcome myriad challenges that come along with it, especially as generative AI becomes a mainstay across enterprises. By following observability best practices and deploying the right automated tools, organizations can proactively identify and resolve issues and ensure all AI applications are always available and friction-free.

Bill Lobig is VP, Product Management, IBM Automation

Hot Topics

The Latest

Industry experts offer predictions on how AI will evolve and impact technology and business in 2025. Part 5 covers the infrastructure and hardware supporting AI ...

Industry experts offer predictions on how AI will evolve and impact technology and business in 2025. Part 4 covers advancements in AI technology ...

Industry experts offer predictions on how AI will evolve and impact technology and business in 2025. Part 3 covers AI's impact on employees and their roles ...

Industry experts offer predictions on how AI will evolve and impact technology and business in 2025. Part 2 covers the challenges presented by AI, as well as solutions to those problems ...

In the final part of APMdigest's 2025 Predictions Series, industry experts offer predictions on how AI will evolve and impact technology and business in 2025 ...

E-commerce is set to skyrocket with a 9% rise over the next few years ... To thrive in this competitive environment, retailers must identify digital resilience as their top priority. In a world where savvy shoppers expect 24/7 access to online deals and experiences, any unexpected downtime to digital services can lead to significant financial losses, damage to brand reputation, abandoned carts with designer shoes, and additional issues ...

Efficiency is a highly-desirable objective in business ... We're seeing this scenario play out in enterprises around the world as they continue to struggle with infrastructures and remote work models with an eye toward operational efficiencies. In contrast to that goal, a recent Broadcom survey of global IT and network professionals found widespread adoption of these strategies is making the network more complex and hampering observability, leading to uptime, performance and security issues. Let's look more closely at these challenges ...

Image
Broadcom

The 2025 Catchpoint SRE Report dives into the forces transforming the SRE landscape, exploring both the challenges and opportunities ahead. Let's break down the key findings and what they mean for SRE professionals and the businesses relying on them ...

Image
Catchpoint

The pressure on IT teams has never been greater. As data environments grow increasingly complex, resource shortages are emerging as a major obstacle for IT leaders striving to meet the demands of modern infrastructure management ... According to DataStrike's newly released 2025 Data Infrastructure Survey Report, more than half (54%) of IT leaders cite resource limitations as a top challenge, highlighting a growing trend toward outsourcing as a solution ...

Image
Datastrike

Gartner revealed its top strategic predictions for 2025 and beyond. Gartner's top predictions explore how generative AI (GenAI) is affecting areas where most would assume only humans can have lasting impact ...

As CIOs Address App Sprawl, Observability Can't Be an Afterthought

Bill Lobig
Bill Lobig is VP of Apptio / IBM IT Automation
IBM

App sprawl has been a concern for technologists for some time, but it has never presented such a challenge as now. As organizations move to implement generative AI into their applications, it's only going to become more complex. In fact, a recent Canva report found that 72% of CIOs see application sprawl as a challenge — and with 71% of CIOs expecting to adopt 30-60 new apps this year, this complexity is poised to keep growing.

Potential solutions include consolidating applications, optimizing workflows, and automating IT processes to reduce strain on technologists so they can tackle the issue of app sprawl head-on. While these are all valid and necessary approaches, observability is a necessary component for understanding the vast amounts of complex data within AI-infused applications, and it must be the centerpiece of an app- and data-centric strategy to truly manage app sprawl.

Cracking the Code for AI App Sprawl Challenges

In a year of elevated global IT spend, ensuring investments aren't wasted is a necessity for overwhelmed technology leaders, who not only must make decisions around which technologies to implement, but also make sense of application performance amid growing tides of vast and complex data.

When AI enters the mix, it's even more important to have complete visibility, which many organizations still lack. Observability tools and practices can help technologists address AI app sprawl by providing visibility into the performance, behavior, and dependencies of AI applications. Unfortunately, many teams are still attempting to work with incomplete visibility, meaning they simply don't know what they don't know.

Simply put, traditional application performance monitoring (APM) tools can provide visibility to a certain degree, but they weren't built to necessarily account for the influx of generative AI applications that modern enterprises are dealing with.

End-users for generative AI-infused applications demand continuous availability and frictionless experience. However, with a lack of real-time visibility, they will feel when an outage or delay compromises their experience, particularly as applications span numerous platforms. Not to mention, the implementation and understanding of complex generative AI models' behaviors are still something many organizations are working to figure out. These potential blind spots could create significant performance, compliance, and security issues.

Observe the Full Stack So You Can Add to It

Organizations can better understand the internal state of their AI applications by analyzing their external outputs with observability. When this is connected to business outcomes and effectively addressed, technology teams can lay the groundwork for a well-functioning application monitoring and management process. So, when generative AI is introduced to the environment, the foundation is already in place to effectively see and optimize application processes.

As generative AI applications join the enterprise equation, observability tools are a must-have for facilitating the delivery of higher-quality software at a faster pace — these are best enabled through:

Finding and fixing the "unknown unknowns": You can't fix what you can't see. Unfortunately, many monitoring practices and tools can only address flaws that are previously known. Observability uncovers conditions that would be impossible to find manually or with traditional platforms. It then monitors the correlation to different performance flaws and gives context for discovering root causes, resulting in quick and easy remediation.

Detecting and remediating issues early on: With observability, monitoring is integrated into the initial stages of software development. It's then easy to pinpoint and rectify new code issues before they affect the service level agreements (SLAs) and customer experience.

Self-healing application infrastructure and automated resolution: Observability can be coupled with automation capabilities to anticipate issues from system outputs and resolve them autonomously without requiring manual intervention.

Scaling and load balancing: We need to be able to observe and control the current load on systems but also help forecast future demands. The data can be used to optimize applications in real-time without having the end-users feel any impact.

Cost management: From optimizing workloads to computational resources, SREs can and should find ways to save on VM, GPU, cloud and inferencing costs through observability.

Observability is crucial for organizations to address AI app sprawl and overcome myriad challenges that come along with it, especially as generative AI becomes a mainstay across enterprises. By following observability best practices and deploying the right automated tools, organizations can proactively identify and resolve issues and ensure all AI applications are always available and friction-free.

Bill Lobig is VP, Product Management, IBM Automation

Hot Topics

The Latest

Industry experts offer predictions on how AI will evolve and impact technology and business in 2025. Part 5 covers the infrastructure and hardware supporting AI ...

Industry experts offer predictions on how AI will evolve and impact technology and business in 2025. Part 4 covers advancements in AI technology ...

Industry experts offer predictions on how AI will evolve and impact technology and business in 2025. Part 3 covers AI's impact on employees and their roles ...

Industry experts offer predictions on how AI will evolve and impact technology and business in 2025. Part 2 covers the challenges presented by AI, as well as solutions to those problems ...

In the final part of APMdigest's 2025 Predictions Series, industry experts offer predictions on how AI will evolve and impact technology and business in 2025 ...

E-commerce is set to skyrocket with a 9% rise over the next few years ... To thrive in this competitive environment, retailers must identify digital resilience as their top priority. In a world where savvy shoppers expect 24/7 access to online deals and experiences, any unexpected downtime to digital services can lead to significant financial losses, damage to brand reputation, abandoned carts with designer shoes, and additional issues ...

Efficiency is a highly-desirable objective in business ... We're seeing this scenario play out in enterprises around the world as they continue to struggle with infrastructures and remote work models with an eye toward operational efficiencies. In contrast to that goal, a recent Broadcom survey of global IT and network professionals found widespread adoption of these strategies is making the network more complex and hampering observability, leading to uptime, performance and security issues. Let's look more closely at these challenges ...

Image
Broadcom

The 2025 Catchpoint SRE Report dives into the forces transforming the SRE landscape, exploring both the challenges and opportunities ahead. Let's break down the key findings and what they mean for SRE professionals and the businesses relying on them ...

Image
Catchpoint

The pressure on IT teams has never been greater. As data environments grow increasingly complex, resource shortages are emerging as a major obstacle for IT leaders striving to meet the demands of modern infrastructure management ... According to DataStrike's newly released 2025 Data Infrastructure Survey Report, more than half (54%) of IT leaders cite resource limitations as a top challenge, highlighting a growing trend toward outsourcing as a solution ...

Image
Datastrike

Gartner revealed its top strategic predictions for 2025 and beyond. Gartner's top predictions explore how generative AI (GenAI) is affecting areas where most would assume only humans can have lasting impact ...