Skip to main content

Repository: ga4-dataform

Repository to hold GA4 dataform processing

GitHub
ga4-dataform
Ownership
#govuk-insights-and-analytics-team owns the repo. #insights-and-analytics-alerts receives automated alerts for this repo.
Category
Data engineering

README

This repo holds the code used by GOV.UK to partition and flatten GA4 data using DataForm in the GCP project gds-bq-reporting

Key files

  • ga4_process_shard.sqlx This defines which shard should be processed by the pipeline

  • partitioned_events.sqlx This partitions the sharded nested GA4 data

  • ga4_process_partition.sqlx This defines the parition ID to flatten

  • partitioned_flattened_events.sqlx This flattens the nested partition