Skip to main content

Datadog Expands BigQuery Monitoring

Datadog announced multiple product launches, including expanded monitoring capabilities for BigQuery. 

Datadog’s expanded BigQuery monitoring capabilities, now in preview, help teams view BigQuery usage by user and project to identify those incurring the most spend, pinpoint the long-running queries in those segments to optimize, and detect data quality issues.

“BigQuery is an integral part of Google Cloud users’ tech stacks, enabling them to unlock insights from their proprietary datasets. With Datadog’s new monitoring capabilities, Google Cloud customers can more granularly track usage, attribute costs to users and teams, and ensure their BigQuery data is up to date for accurate insights,” said Yasmeen Ahmad, Managing Director of Strategy & Outbound Product Management for Data, Analytics & AI at Google Cloud.

“Today, it takes significant time to pinpoint where the largest BigQuery usage is coming from across projects and drill into the problematic queries to optimize. With our new BigQuery monitoring capabilities, which complement our existing 35+ Google Cloud integrations, Datadog customers can identify cross-project BigQuery cost centers, high-impact optimization opportunities and the stakeholders that need to be involved,” said Yrieix Garnier, VP of Product at Datadog. “Customers can also improve data quality by detecting data freshness and volume anomalies so they can fix issues quickly and ensure their business has accurate and up-to-date insights.”

Datadog’s expanded BigQuery monitoring capabilities build on the company’s existing capabilities for Google Cloud. Other recent product launches and integrations with Google Cloud include:

  • LLM Observability: With Datadog LLM Observability, users can monitor, troubleshoot, improve and secure their Gemini and Vertex AI LLM applications, and get started quickly with auto-instrumentation.
  • Cloud TPU Integration: With Datadog’s new Cloud TPU integration, teams can detect resource bottlenecks in—and underutilization of—their TPU infrastructure across workers and GKE clusters.
  • Private Service Connect: Datadog users can now send their observability telemetry to Datadog’s Google Cloud-hosted sites with Google’s Private Service Connect for better data security and reduced data transfer costs.
  • GKE Autoscaling (in Preview): Datadog Kubernetes Autoscaling gives users multi-dimensional workload scaling recommendations for their GKE environment and the ability to automate them within the Datadog platform, enabling teams to deliver cost savings while maintaining performance and stability.
  • Storage Monitoring (in Preview): With Storage Monitoring for Google Cloud Storage, users get visibility into their Google Cloud Storage at the object and prefix levels, enabling teams to identify bottlenecks, track performance and quickly detect unusual growth in their storage consumption.
  • Google Cloud Cost Recommendations (in Preview): Datadog Cloud Cost Management now automatically identifies cost inefficiencies in Google Cloud environments and provides optimization recommendations for Google Cloud services like Cloud Run and Cloud SQL.

These capabilities further enhance Datadog’s ability to provide world-class observability and security at scale for joint customers.

The Latest

Outages aren't new. What's new is how quickly they spread across systems, vendors, regions and customer workflows. The moment that performance degrades, expectations escalate fast. In today's always-on environment, an outage isn't just a technical event. It's a trust event ...

Most organizations approach OpenTelemetry as a collection of individual tools they need to assemble from scratch. This view misses the bigger picture. OpenTelemetry is a complete telemetry framework with composable components that address specific problems at different stages of organizational maturity. You start with what you need today and adopt additional pieces as your observability practices evolve ...

One of the earliest lessons I learned from architecting throughput-heavy services is that simplicity wins repeatedly: fewer moving parts, loosely coupled execution (fewer synchronous calls), and precise timing metering. You want data and decisions to travel the shortest possible path. The goal is to build a system where every strategy and each line of code (contention is the key metric) complements the decision trees ...

As discussions around AI "autonomous coworkers" accelerate, many industry projections assume that agents will soon operate alongside human staff in making decisions, taking actions, and managing tasks with minimal oversight. But a growing number of critics (including some of the developers building these systems) argue that the industry still has a long way to go to be able to treat AI agents like fully trusted teammates ...

Enterprise AI has entered a transformational phase where, according to Digitate's recently released survey, Agentic AI and the Future of Enterprise IT, companies are moving beyond traditional automation toward Agentic AI systems designed to reason, adapt, and collaborate alongside human teams ...

The numbers back this urgency up. A recent Zapier survey shows that 92% of enterprises now treat AI as a top priority. Leaders want it, and teams are clamoring for it. But if you look closer at the operations of these companies, you see a different picture. The rollout is slow. The results are often delayed. There's a disconnect between what leaders want and what their technical infrastructure can handle ...

Kyndryl's 2025 Readiness Report revealed that 61% of global business and technology leaders report increasing pressure from boards and regulators to prove AI's ROI. As the technology evolves and expectations continue to rise, leaders are compelled to generate and prove impact before scaling further. This will lead to a decisive turning point in 2026 ...

Cloudflare's disruption illustrates how quickly a single provider's issue cascades into widespread exposure. Many organizations don't fully realize how tightly their systems are coupled to thirdparty services, or how quickly availability and security concerns align when those services falter ... You can't avoid these dependencies, but you can understand them ...

If you work with AI, you know this story. A model performs during testing, looks great in early reviews, works perfectly in production and then slowly loses relevance after operating for a while. Everything on the surface looks perfect — pipelines are running, predictions or recommendations are error-free, data quality checks show green; yet outcomes don't meet the ground reality. This pattern often repeats across enterprise AI programs. Take for example, a mid-sized retail banking and wealth-management firm with heavy investments in AI-powered risk analytics, fraud detection and personalized credit-decisioning systems. The model worked well for a while, but transactions increased, so did false positives by 18% ...

Basic uptime is no longer the gold standard. By 2026, network monitoring must do more than report status, it must explain performance in a hybrid-first world. Networks are no longer just static support systems; they are agile, distributed architectures that sit at the very heart of the customer experience and the business outcomes ... The following five trends represent the new standard for network health, providing a blueprint for teams to move from reactive troubleshooting to a proactive, integrated future ...

Datadog Expands BigQuery Monitoring

Datadog announced multiple product launches, including expanded monitoring capabilities for BigQuery. 

Datadog’s expanded BigQuery monitoring capabilities, now in preview, help teams view BigQuery usage by user and project to identify those incurring the most spend, pinpoint the long-running queries in those segments to optimize, and detect data quality issues.

“BigQuery is an integral part of Google Cloud users’ tech stacks, enabling them to unlock insights from their proprietary datasets. With Datadog’s new monitoring capabilities, Google Cloud customers can more granularly track usage, attribute costs to users and teams, and ensure their BigQuery data is up to date for accurate insights,” said Yasmeen Ahmad, Managing Director of Strategy & Outbound Product Management for Data, Analytics & AI at Google Cloud.

“Today, it takes significant time to pinpoint where the largest BigQuery usage is coming from across projects and drill into the problematic queries to optimize. With our new BigQuery monitoring capabilities, which complement our existing 35+ Google Cloud integrations, Datadog customers can identify cross-project BigQuery cost centers, high-impact optimization opportunities and the stakeholders that need to be involved,” said Yrieix Garnier, VP of Product at Datadog. “Customers can also improve data quality by detecting data freshness and volume anomalies so they can fix issues quickly and ensure their business has accurate and up-to-date insights.”

Datadog’s expanded BigQuery monitoring capabilities build on the company’s existing capabilities for Google Cloud. Other recent product launches and integrations with Google Cloud include:

  • LLM Observability: With Datadog LLM Observability, users can monitor, troubleshoot, improve and secure their Gemini and Vertex AI LLM applications, and get started quickly with auto-instrumentation.
  • Cloud TPU Integration: With Datadog’s new Cloud TPU integration, teams can detect resource bottlenecks in—and underutilization of—their TPU infrastructure across workers and GKE clusters.
  • Private Service Connect: Datadog users can now send their observability telemetry to Datadog’s Google Cloud-hosted sites with Google’s Private Service Connect for better data security and reduced data transfer costs.
  • GKE Autoscaling (in Preview): Datadog Kubernetes Autoscaling gives users multi-dimensional workload scaling recommendations for their GKE environment and the ability to automate them within the Datadog platform, enabling teams to deliver cost savings while maintaining performance and stability.
  • Storage Monitoring (in Preview): With Storage Monitoring for Google Cloud Storage, users get visibility into their Google Cloud Storage at the object and prefix levels, enabling teams to identify bottlenecks, track performance and quickly detect unusual growth in their storage consumption.
  • Google Cloud Cost Recommendations (in Preview): Datadog Cloud Cost Management now automatically identifies cost inefficiencies in Google Cloud environments and provides optimization recommendations for Google Cloud services like Cloud Run and Cloud SQL.

These capabilities further enhance Datadog’s ability to provide world-class observability and security at scale for joint customers.

The Latest

Outages aren't new. What's new is how quickly they spread across systems, vendors, regions and customer workflows. The moment that performance degrades, expectations escalate fast. In today's always-on environment, an outage isn't just a technical event. It's a trust event ...

Most organizations approach OpenTelemetry as a collection of individual tools they need to assemble from scratch. This view misses the bigger picture. OpenTelemetry is a complete telemetry framework with composable components that address specific problems at different stages of organizational maturity. You start with what you need today and adopt additional pieces as your observability practices evolve ...

One of the earliest lessons I learned from architecting throughput-heavy services is that simplicity wins repeatedly: fewer moving parts, loosely coupled execution (fewer synchronous calls), and precise timing metering. You want data and decisions to travel the shortest possible path. The goal is to build a system where every strategy and each line of code (contention is the key metric) complements the decision trees ...

As discussions around AI "autonomous coworkers" accelerate, many industry projections assume that agents will soon operate alongside human staff in making decisions, taking actions, and managing tasks with minimal oversight. But a growing number of critics (including some of the developers building these systems) argue that the industry still has a long way to go to be able to treat AI agents like fully trusted teammates ...

Enterprise AI has entered a transformational phase where, according to Digitate's recently released survey, Agentic AI and the Future of Enterprise IT, companies are moving beyond traditional automation toward Agentic AI systems designed to reason, adapt, and collaborate alongside human teams ...

The numbers back this urgency up. A recent Zapier survey shows that 92% of enterprises now treat AI as a top priority. Leaders want it, and teams are clamoring for it. But if you look closer at the operations of these companies, you see a different picture. The rollout is slow. The results are often delayed. There's a disconnect between what leaders want and what their technical infrastructure can handle ...

Kyndryl's 2025 Readiness Report revealed that 61% of global business and technology leaders report increasing pressure from boards and regulators to prove AI's ROI. As the technology evolves and expectations continue to rise, leaders are compelled to generate and prove impact before scaling further. This will lead to a decisive turning point in 2026 ...

Cloudflare's disruption illustrates how quickly a single provider's issue cascades into widespread exposure. Many organizations don't fully realize how tightly their systems are coupled to thirdparty services, or how quickly availability and security concerns align when those services falter ... You can't avoid these dependencies, but you can understand them ...

If you work with AI, you know this story. A model performs during testing, looks great in early reviews, works perfectly in production and then slowly loses relevance after operating for a while. Everything on the surface looks perfect — pipelines are running, predictions or recommendations are error-free, data quality checks show green; yet outcomes don't meet the ground reality. This pattern often repeats across enterprise AI programs. Take for example, a mid-sized retail banking and wealth-management firm with heavy investments in AI-powered risk analytics, fraud detection and personalized credit-decisioning systems. The model worked well for a while, but transactions increased, so did false positives by 18% ...

Basic uptime is no longer the gold standard. By 2026, network monitoring must do more than report status, it must explain performance in a hybrid-first world. Networks are no longer just static support systems; they are agile, distributed architectures that sit at the very heart of the customer experience and the business outcomes ... The following five trends represent the new standard for network health, providing a blueprint for teams to move from reactive troubleshooting to a proactive, integrated future ...