GCP Dataplex Data Scan

Google Cloud Dataplex Data Scan is a managed resource that defines and schedules automated data profiling and/or data-quality checks over a Dataplex asset. A scan can target BigQuery tables or Cloud Storage filesets and produces metrics, quality-rule evaluations and run history, so that an organisation can catch issues before releasing new datasets or pipelines. The full API reference is available in the Google documentation: https://cloud.google.com/dataplex/docs/reference/rest/v1/projects.locations.dataScans#DataScan

Terraform Mappings:

  • google_dataplex_datascan.id

Supported Methods

  • GET: Get a gcp-dataplex-data-scan by its "locations|dataScans"
  • LIST
  • SEARCH: Search for Dataplex data scans in a location. Use the location name (e.g., 'us-central1') or the full resource name in the format "projects/[project_id]/locations/[location]/dataScans/[data_scan_id]", which is supported for Terraform mappings.
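
The two query shapes above can be related with a small helper. The following is a minimal sketch, not part of Overmind or any Google library: the function names and the `DataScanName` type are hypothetical, and it assumes that the GET key "locations|dataScans" means the location and the data scan ID joined by a `|` character.

```python
import re
from typing import NamedTuple

# Pattern for the full resource name format quoted in the SEARCH description.
_DATA_SCAN_NAME = re.compile(
    r"^projects/(?P<project>[^/]+)"
    r"/locations/(?P<location>[^/]+)"
    r"/dataScans/(?P<scan_id>[^/]+)$"
)


class DataScanName(NamedTuple):
    """Components of a data scan resource name (hypothetical helper type)."""
    project: str
    location: str
    scan_id: str


def parse_data_scan_name(name: str) -> DataScanName:
    """Split a full data scan resource name into its components."""
    match = _DATA_SCAN_NAME.match(name)
    if match is None:
        raise ValueError(f"not a data scan resource name: {name!r}")
    return DataScanName(match["project"], match["location"], match["scan_id"])


def to_get_query(name: str) -> str:
    """Build a GET-style key, assuming it is '<location>|<data_scan_id>'."""
    parsed = parse_data_scan_name(name)
    return f"{parsed.location}|{parsed.scan_id}"
```

For example, under the assumption above, `to_get_query("projects/my-project/locations/us-central1/dataScans/orders-quality")` would yield a key built from `us-central1` and `orders-quality`; the project and scan names here are made up for illustration.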

gcp-storage-bucket

A Dataplex Data Scan whose target asset is a Cloud Storage fileset ultimately reads objects that live inside a storage bucket; therefore, Overmind links the scan to the gcp-storage-bucket resource that contains the data being analysed.
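
As a rough illustration of how such a link could be derived, the sketch below extracts a bucket ID from a scan's data source string. It assumes the scan's `data.resource` field uses the form `//storage.googleapis.com/projects/PROJECT_ID/buckets/BUCKET_ID` for Cloud Storage targets; the function name is hypothetical, and this is not necessarily how Overmind resolves the link internally.

```python
from typing import Optional

# Assumed prefix of a Cloud Storage data source in a data scan definition.
_GCS_PREFIX = "//storage.googleapis.com/"


def bucket_from_scan_resource(resource: str) -> Optional[str]:
    """Return the bucket ID if the resource points at a bucket, else None."""
    if not resource.startswith(_GCS_PREFIX):
        # For example a BigQuery table; no bucket to link.
        return None
    parts = resource[len(_GCS_PREFIX):].split("/")
    # Expected remainder: projects/PROJECT_ID/buckets/BUCKET_ID
    if (
        len(parts) == 4
        and parts[0] == "projects"
        and parts[2] == "buckets"
        and parts[3]
    ):
        return parts[3]
    return None
```

A scan that targets a BigQuery table would return `None` here, so no gcp-storage-bucket link would be produced for it.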