DataOps Data Quality TestGen Expands: Now Supporting BigQuery and Apache Iceberg

DataOps TestGen Enterprise is now compatible with Google BigQuery and can be used to profile and test file-based data accessible through Redshift Spectrum and Snowflake external tables using Apache Iceberg and other file formats.

We’re excited to announce two major expansions to DataOps Data Quality TestGen Enterprise that bring intelligent data quality testing to even more of your data ecosystem. Whether you’re working with Google BigQuery or managing file-based data through external tables, TestGen now has you covered.

Google BigQuery Support: Enterprise-Grade Testing Meets Cloud Data Warehousing

DataOps TestGen Enterprise is now fully compatible with Google BigQuery, one of the most popular cloud data warehouse solutions. This integration means you can now apply the same rigorous data quality standards to your BigQuery datasets that you’ve come to expect from TestGen.

What this means for your team:

With BigQuery support, TestGen offers advanced profiling features that automatically analyze the structure, patterns, and anomalies in your BigQuery tables. The platform generates detailed test suites automatically, eliminating the manual effort typically required to establish data quality benchmarks. Setting up TestGen with BigQuery is straightforward, allowing you to quickly gain insights into database health, identify potential data issues before they impact downstream processes, and maintain confidence in your analytics and machine learning workflows.

File-Based Data and Apache Iceberg: A New Level of Architectural Independence

Perhaps even more exciting is TestGen’s expanded support for file-based data formats. DataOps TestGen Enterprise can now profile and test structured data stored in file formats and accessed through Redshift Spectrum and Snowflake Amazon Redshift Spectrum and Snowflake external tables

external tables. Supported formats include: Apache Iceberg tables, Parquet, Avro, ORC, CSV, JSON

Why this matters

This capability marks a paradigm shift for data architects. You’re no longer limited to testing only data within traditional database systems. Whether you’re building a data lakehouse architecture, working with data lake storage, or managing data in modern table formats like Apache Iceberg, DataOps TestGen can now verify quality at the source

This architectural flexibility allows you to perform data quality checks earlier in your pipeline, detect issues before data reaches your warehouse, and ensure consistency across hybrid storage strategies. For organizations adopting data lake and lakehouse architectures, this offers enterprise-grade testing for your most flexible storage layers.

Getting Started

Ready to expand your data quality coverage? Refer to the Introduction to DataOps TestGen for a comprehensive list of supported databases and setup instructions.

These new integrations reflect our commitment to meeting data teams where they are—regardless of which databases, warehouses, or storage formats power your data infrastructure. Whether you’re fully invested in BigQuery, building a modern lakehouse, or operating a hybrid environment, TestGen helps ensure your data is reliable, trustworthy, and ready to inform critical business decisions.  Today, v4.32.5 of Enterprise TestGen supports these features. Open source will follow soon!

Want to learn more about how TestGen can improve data quality in your organization? Visit our website for an online demo.

Sign-Up for our Newsletter

Get the latest straight into your inbox

DataOps Data Quality TestGen:

Simple, Fast, Generative Data Quality Testing, Execution, and Scoring.

[Open Source, Enterprise]

DataOps Observability:

Monitor every data pipeline, from source to customer value, & find problems fast

[Open Source, Enterprise]

DataOps Automation:

Orchestrate and automate your data toolchain with few errors and a high rate of change.

[Enterprise]

recipes for dataops success

DataKitchen Consulting Services


DataOps Assessments

Identify obstacles to remove and opportunities to grow

DataOps Consulting, Coaching, and Transformation

Deliver faster and eliminate errors

DataOps Training

Educate, align, and mobilize

Commercial Data & Analytics Platform for Pharma

Get trusted data and fast changes to create a single source of truth

 

dataops-cookbook-download

DataOps Learning and Background Resources


DataOps Journey FAQ
DataOps Observability basics
Data Journey Manifesto
Why it matters!
DataOps FAQ
All the basics of DataOps
DataOps 101 Training
Get certified in DataOps
Maturity Model Assessment
Assess your DataOps Readiness
DataOps Manifesto
Thirty thousand signatures can't be wrong!

 

DataKitchen Basics


About DataKitchen

All the basics on DataKitchen

DataKitchen Team

Who we are; Why we are the DataOps experts

Careers

Come join us!

Contact

How to connect with DataKitchen

 

DataKitchen News


Newsroom

Hear the latest from DataKitchen

Events

See DataKitchen live!

Partners

See how partners are using our Products

 

Monitor every Data Journey in an enterprise, from source to customer value, in development and production.

Simple, Fast Data Quality Test Generation and Execution. Your Data Journey starts with verifying that you can trust your data.

Orchestrate and automate your data toolchain to deliver insight with few errors and a high rate of change.