Skip to main content

The Future of Observability: Why Open Source Will Lead the Way

Chad Meley
StarTree

Observability has become a critical component of managing modern, complex systems, helping organizations ensure uptime, optimize performance, and quickly diagnose issues. However, the way most vendors address observability today has left companies frustrated. Skyrocketing costs, rigid vendor lock-in, and needless complexity have turned what should be an empowering capability into a significant pain point. But the tide is shifting. With open-source projects stepping in to fill key parts of the observability stack, the market is on the brink of a major disruption. Much like other industries that have embraced modular, open architectures, observability is entering a new era of flexibility, cost efficiency, and innovation.

The First Wave of Observability

Microservices and distributed architectures improve scalability but add complexity, making issue detection harder. Enter Observability by analyzing metrics, logs, and traces to provide real-time insights, detect issues, and ensure uptime.

While analyzing data was not new, the first wave of observability vendors had a breakthrough innovation: the ability to instrument applications and collect health data from complex systems. By making metrics, logs, and traces accessible for analysis, these vendors enabled companies to monitor their infrastructure like never before. This was a game-changer for IT operations, setting the stage for modern observability as we know it.

The Challenges and Pain Companies Face Today

While the original observability platforms were innovative, they've since created significant pain points for businesses. Over time, these monolithic solutions have locked customers in and inflated costs.

  • Unsustainable Costs: Observability costs are often tied to the volume of data collected. As businesses grow, so do their observability bills — increasingly to unsustainable levels. Industry luminaries Jeremy Burton and Charity Majors have both tagged Observability costs to often reach 30% of IT infrastructure spend.
  • Vendor Lock-In: Vendors force customers into all-or-nothing bundles — agents, visualization tool, query engine, storage, data movement tool, etc. Over time, this dependency grows, making it increasingly difficult and costly to switch to better solutions.
  • Fragmented Visibility: These platforms store metrics, logs, and traces in separate, proprietary systems, creating inefficiencies when performing root cause analysis.

The result is a market rife with frustration, as companies struggle to manage the costs and complexity of monolithic observability stacks. In cases where existing solutions are expensive and fail to meet expectations, innovators step in to create better, more cost-effective alternatives. As Jeff Bezos famously said, "Your margin is my opportunity," highlighting how expensive products that leave needs unmet invite disruption by new entrants.

From Monolithic to Disaggregated: A New Era of Observability

The observability market is undergoing a seismic shift, moving from monolithic platforms to a disaggregated stack. To understand what that means by way of analogy, consider what happened to data warehouses.

The original data warehouse vendors — Teradata, IBM, and Oracle — sold tightly integrated "appliances" bundling database software, servers, storage, networking, and operating systems, locking customers into inflexible, proprietary ecosystems that were costly to maintain. Today, organizations have shifted to modular, cloud-based data warehouses. Companies now choose what database software they want and configure cloud infrastructure, which offers flexibility to optimize cost, performance, and scalability while avoiding vendor lock-in.

The catalyst for the change in the data warehouse market was the cloud. The catalyst for this transformation in observability is OpenTelemetry (OTel), an open-source standard that has removed the proprietary barriers to collecting and transmitting observability data. By standardizing how metrics, logs, and traces are instrumented and exported, OTel has eliminated one of the key enablers of vendor lock-in. But OTel just addresses one layer of a full solution.

In the disaggregated observability stack, companies now select best-of-breed solutions for each layer. Companies can use whatever software they want to move observability data, store it, query it, or visualize it.

In many cases there are clear open source winners that have emerged:

  • Agents: OTel is becoming the de facto standard for instrumentation.
  • Collection: Kafka handles data ingestion and transport.
  • Visualization: Grafana leads the way in open-source dashboards for visualizing observability data.
Image
Startree

The query and storage layer in the disaggregated observability stack is a critical battleground for open-source projects. Open-source query engines like Prometheus and ClickHouse have become key players in the observability ecosystem, but they still face limitations in creating a unified, cost effective alternative. Prometheus excels at querying real-time metrics but lacks support for logs or traces, and its pull-based architecture struggles with scale, requiring complex federation and sharding.

Similarly, ClickHouse has gained traction for log-based queries but falls short for metrics and time-series data, leaving observability gaps. Additionally, its compute inefficiencies for real-time workloads can drive up costs as data volumes increase, highlighting the need for more affordable, unified observability solutions.

More and more companies are turning to Apache Pinot to fill this role. Originally developed at LinkedIn, Pinot was designed as a real-time, columnar analytics database capable of handling massive scale with subsecond query speeds. Recently, Pinot has evolved to meet the demands of modern observability use cases:

  • Time-Series Analytics: New contributions to the open-source project by Uber have optimized Pinot for time-series workloads, making it ideal for metrics, logs, and traces.
  • PromQL Support: Pinot now integrates with Prometheus Query Language (PromQL), enabling seamless visualization in tools like Grafana.
  • Unified Observability: Unlike legacy vendors that store data in separate silos, Pinot allows companies to unify metrics, logs, and traces in a single platform.

This shift isn't just about performance — it's about cost. Apache Pinot's efficient architecture and tiered storage help organizations scale observability affordably.

Open source is disrupting observability by offering:

  • Freedom: Open source tools like Pinot, Grafana, Kafka, and OTel prevent vendor lock-in.
  • Cost Efficiency: Open-source economics and cloud storage cut costs.
  • Innovation: Community-driven development drives rapid evolution.

As observability costs grow, open source offers a smarter, more affordable path forward.

Chad Meley is SVP, Marketing and Developer Relations, at StarTree

The Latest

Artificial intelligence (AI) is core to observability practices, with some 41% of respondents reporting AI adoption as a core driver of observability, according to the State of Observability for Financial Services and Insurance report from New Relic ...

Application performance monitoring (APM) is a game of catching up — building dashboards, setting thresholds, tuning alerts, and manually correlating metrics to root causes. In the early days, this straightforward model worked as applications were simpler, stacks more predictable, and telemetry was manageable. Today, the landscape has shifted, and more assertive tools are needed ...

Cloud adoption has accelerated, but backup strategies haven't always kept pace. Many organizations continue to rely on backup strategies that were either lifted directly from on-prem environments or use cloud-native tools in limited, DR-focused ways ... Eon uncovered a handful of critical gaps regarding how organizations approach cloud backup. To capture these prevailing winds, we gathered insights from 150+ IT and cloud leaders at the recent Google Cloud Next conference, which we've compiled into the 2025 State of Cloud Data Backup ...

Private clouds are no longer playing catch-up, and public clouds are no longer the default as organizations recalibrate their cloud strategies, according to the Private Cloud Outlook 2025 report from Broadcom. More than half (53%) of survey respondents say private cloud is their top priority for deploying new workloads over the next three years, while 69% are considering workload repatriation from public to private cloud, with one-third having already done so ...

As organizations chase productivity gains from generative AI, teams are overwhelmingly focused on improving delivery speed (45%) over enhancing software quality (13%), according to the Quality Transformation Report from Tricentis ...

Back in March of this year ... MongoDB's stock price took a serious tumble ... In my opinion, it reflects a deeper structural issue in enterprise software economics altogether — vendor lock-in ...

In MEAN TIME TO INSIGHT Episode 15, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses Do-It-Yourself Network Automation ... 

Zero-day vulnerabilities — security flaws that are exploited before developers even know they exist — pose one of the greatest risks to modern organizations. Recently, such vulnerabilities have been discovered in well-known VPN systems like Ivanti and Fortinet, highlighting just how outdated these legacy technologies have become in defending against fast-evolving cyber threats ... To protect digital assets and remote workers in today's environment, companies need more than patchwork solutions. They need architecture that is secure by design ...

Traditional observability requires users to leap across different platforms or tools for metrics, logs, or traces and related issues manually, which is very time-consuming, so as to reasonably ascertain the root cause. Observability 2.0 fixes this by unifying all telemetry data, logs, metrics, and traces into a single, context-rich pipeline that flows into one smart platform. But this is far from just having a bunch of additional data; this data is actionable, predictive, and tied to revenue realization ...

64% of enterprise networking teams use internally developed software or scripts for network automation, but 61% of those teams spend six or more hours per week debugging and maintaining them, according to From Scripts to Platforms: Why Homegrown Tools Dominate Network Automation and How Vendors Can Help, my latest EMA report ...

The Future of Observability: Why Open Source Will Lead the Way

Chad Meley
StarTree

Observability has become a critical component of managing modern, complex systems, helping organizations ensure uptime, optimize performance, and quickly diagnose issues. However, the way most vendors address observability today has left companies frustrated. Skyrocketing costs, rigid vendor lock-in, and needless complexity have turned what should be an empowering capability into a significant pain point. But the tide is shifting. With open-source projects stepping in to fill key parts of the observability stack, the market is on the brink of a major disruption. Much like other industries that have embraced modular, open architectures, observability is entering a new era of flexibility, cost efficiency, and innovation.

The First Wave of Observability

Microservices and distributed architectures improve scalability but add complexity, making issue detection harder. Enter Observability by analyzing metrics, logs, and traces to provide real-time insights, detect issues, and ensure uptime.

While analyzing data was not new, the first wave of observability vendors had a breakthrough innovation: the ability to instrument applications and collect health data from complex systems. By making metrics, logs, and traces accessible for analysis, these vendors enabled companies to monitor their infrastructure like never before. This was a game-changer for IT operations, setting the stage for modern observability as we know it.

The Challenges and Pain Companies Face Today

While the original observability platforms were innovative, they've since created significant pain points for businesses. Over time, these monolithic solutions have locked customers in and inflated costs.

  • Unsustainable Costs: Observability costs are often tied to the volume of data collected. As businesses grow, so do their observability bills — increasingly to unsustainable levels. Industry luminaries Jeremy Burton and Charity Majors have both tagged Observability costs to often reach 30% of IT infrastructure spend.
  • Vendor Lock-In: Vendors force customers into all-or-nothing bundles — agents, visualization tool, query engine, storage, data movement tool, etc. Over time, this dependency grows, making it increasingly difficult and costly to switch to better solutions.
  • Fragmented Visibility: These platforms store metrics, logs, and traces in separate, proprietary systems, creating inefficiencies when performing root cause analysis.

The result is a market rife with frustration, as companies struggle to manage the costs and complexity of monolithic observability stacks. In cases where existing solutions are expensive and fail to meet expectations, innovators step in to create better, more cost-effective alternatives. As Jeff Bezos famously said, "Your margin is my opportunity," highlighting how expensive products that leave needs unmet invite disruption by new entrants.

From Monolithic to Disaggregated: A New Era of Observability

The observability market is undergoing a seismic shift, moving from monolithic platforms to a disaggregated stack. To understand what that means by way of analogy, consider what happened to data warehouses.

The original data warehouse vendors — Teradata, IBM, and Oracle — sold tightly integrated "appliances" bundling database software, servers, storage, networking, and operating systems, locking customers into inflexible, proprietary ecosystems that were costly to maintain. Today, organizations have shifted to modular, cloud-based data warehouses. Companies now choose what database software they want and configure cloud infrastructure, which offers flexibility to optimize cost, performance, and scalability while avoiding vendor lock-in.

The catalyst for the change in the data warehouse market was the cloud. The catalyst for this transformation in observability is OpenTelemetry (OTel), an open-source standard that has removed the proprietary barriers to collecting and transmitting observability data. By standardizing how metrics, logs, and traces are instrumented and exported, OTel has eliminated one of the key enablers of vendor lock-in. But OTel just addresses one layer of a full solution.

In the disaggregated observability stack, companies now select best-of-breed solutions for each layer. Companies can use whatever software they want to move observability data, store it, query it, or visualize it.

In many cases there are clear open source winners that have emerged:

  • Agents: OTel is becoming the de facto standard for instrumentation.
  • Collection: Kafka handles data ingestion and transport.
  • Visualization: Grafana leads the way in open-source dashboards for visualizing observability data.
Image
Startree

The query and storage layer in the disaggregated observability stack is a critical battleground for open-source projects. Open-source query engines like Prometheus and ClickHouse have become key players in the observability ecosystem, but they still face limitations in creating a unified, cost effective alternative. Prometheus excels at querying real-time metrics but lacks support for logs or traces, and its pull-based architecture struggles with scale, requiring complex federation and sharding.

Similarly, ClickHouse has gained traction for log-based queries but falls short for metrics and time-series data, leaving observability gaps. Additionally, its compute inefficiencies for real-time workloads can drive up costs as data volumes increase, highlighting the need for more affordable, unified observability solutions.

More and more companies are turning to Apache Pinot to fill this role. Originally developed at LinkedIn, Pinot was designed as a real-time, columnar analytics database capable of handling massive scale with subsecond query speeds. Recently, Pinot has evolved to meet the demands of modern observability use cases:

  • Time-Series Analytics: New contributions to the open-source project by Uber have optimized Pinot for time-series workloads, making it ideal for metrics, logs, and traces.
  • PromQL Support: Pinot now integrates with Prometheus Query Language (PromQL), enabling seamless visualization in tools like Grafana.
  • Unified Observability: Unlike legacy vendors that store data in separate silos, Pinot allows companies to unify metrics, logs, and traces in a single platform.

This shift isn't just about performance — it's about cost. Apache Pinot's efficient architecture and tiered storage help organizations scale observability affordably.

Open source is disrupting observability by offering:

  • Freedom: Open source tools like Pinot, Grafana, Kafka, and OTel prevent vendor lock-in.
  • Cost Efficiency: Open-source economics and cloud storage cut costs.
  • Innovation: Community-driven development drives rapid evolution.

As observability costs grow, open source offers a smarter, more affordable path forward.

Chad Meley is SVP, Marketing and Developer Relations, at StarTree

The Latest

Artificial intelligence (AI) is core to observability practices, with some 41% of respondents reporting AI adoption as a core driver of observability, according to the State of Observability for Financial Services and Insurance report from New Relic ...

Application performance monitoring (APM) is a game of catching up — building dashboards, setting thresholds, tuning alerts, and manually correlating metrics to root causes. In the early days, this straightforward model worked as applications were simpler, stacks more predictable, and telemetry was manageable. Today, the landscape has shifted, and more assertive tools are needed ...

Cloud adoption has accelerated, but backup strategies haven't always kept pace. Many organizations continue to rely on backup strategies that were either lifted directly from on-prem environments or use cloud-native tools in limited, DR-focused ways ... Eon uncovered a handful of critical gaps regarding how organizations approach cloud backup. To capture these prevailing winds, we gathered insights from 150+ IT and cloud leaders at the recent Google Cloud Next conference, which we've compiled into the 2025 State of Cloud Data Backup ...

Private clouds are no longer playing catch-up, and public clouds are no longer the default as organizations recalibrate their cloud strategies, according to the Private Cloud Outlook 2025 report from Broadcom. More than half (53%) of survey respondents say private cloud is their top priority for deploying new workloads over the next three years, while 69% are considering workload repatriation from public to private cloud, with one-third having already done so ...

As organizations chase productivity gains from generative AI, teams are overwhelmingly focused on improving delivery speed (45%) over enhancing software quality (13%), according to the Quality Transformation Report from Tricentis ...

Back in March of this year ... MongoDB's stock price took a serious tumble ... In my opinion, it reflects a deeper structural issue in enterprise software economics altogether — vendor lock-in ...

In MEAN TIME TO INSIGHT Episode 15, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses Do-It-Yourself Network Automation ... 

Zero-day vulnerabilities — security flaws that are exploited before developers even know they exist — pose one of the greatest risks to modern organizations. Recently, such vulnerabilities have been discovered in well-known VPN systems like Ivanti and Fortinet, highlighting just how outdated these legacy technologies have become in defending against fast-evolving cyber threats ... To protect digital assets and remote workers in today's environment, companies need more than patchwork solutions. They need architecture that is secure by design ...

Traditional observability requires users to leap across different platforms or tools for metrics, logs, or traces and related issues manually, which is very time-consuming, so as to reasonably ascertain the root cause. Observability 2.0 fixes this by unifying all telemetry data, logs, metrics, and traces into a single, context-rich pipeline that flows into one smart platform. But this is far from just having a bunch of additional data; this data is actionable, predictive, and tied to revenue realization ...

64% of enterprise networking teams use internally developed software or scripts for network automation, but 61% of those teams spend six or more hours per week debugging and maintaining them, according to From Scripts to Platforms: Why Homegrown Tools Dominate Network Automation and How Vendors Can Help, my latest EMA report ...