Table of contents

Architecture of

The (DGU) platform is used to publish and view datasets. A dataset is a document about a collection of links to documentation or data hosted somewhere on the Internet.

Architectural overview of

Dgu architecture

The original for this diagram is available on the Platform Health Google Drive and can be edited with Services

Services owned by

  • CKAN is the legacy publishing and finder app for datasets (‘packages’). It also runs Nginx to support Find.
  • Find is the public frontend for searching datasets using Elasticsearch. It replaces CKAN for certain routes.
  • Publish is the prototype publishing app for datasets. It currently syncs with CKAN to populate Elasticsearch.
  • Reference is a legacy service that attempts to provide a nomenclature of time intervals, hosted on Heroku.

Services with sub-domains, but owned by other departments

Several datasets link to and require user login to access. Although branded as, this is a totally separate service. If a user is having difficulty accessing this system, they should contact the maintainers of this resource, who are currently Airbus Defence & Space.

Publish and Find

Publish and Find are provisioned on GOV.UK Paas. The deployment and monitoring pages explain this in more detail, but you can use the following commands to get an overview.

cf apps
cf services
cf routes
cf env publish-data-beta-production

We use GOV.UK Signon for user authentication in Publish Data, with the app in each environment linked to the corresponding instance of GOV.UK Signon. See the Publish ADR for more info.


Legacy CKAN is hosted on Bytemark servers. New CKAN (still under development) will be hosted on GOV.UK infrastructure.

This page was last reviewed . It needs to be reviewed again by the page owner #govuk-platform-health.