Architecture of data.gov.uk

The data.gov.uk (DGU) platform is used to publish and view datasets. A dataset contains the metadata for a collection of links to data hosted elsewhere on the internet.

Architectural overview of data.gov.uk

[Figure: DGU architecture diagram]

The original for this diagram is available on the Platform Health Google Drive and can be edited with draw.io.

data.gov.uk Services

Services owned by data.gov.uk

  • CKAN is the publishing app for datasets (‘packages’).
  • Find is the public frontend for searching datasets using Elasticsearch.
  • Publish is a prototype publishing app for datasets. Although it is not public-facing, it currently syncs data from CKAN into Elasticsearch for use in Find.
  • Reference is a legacy service that attempts to provide a nomenclature of time intervals, hosted on Heroku.

Services with data.gov.uk sub-domains, but owned by other departments

Several datasets link to environment.data.gov.uk and require a user login to access. Although branded as data.gov.uk, this is a completely separate service. Users who have difficulty accessing it should contact its maintainers, currently Airbus Defence & Space.

Publish and Find

Publish and Find are provisioned on GOV.UK PaaS. The deployment and monitoring pages explain this in more detail, but you can use the following commands to get an overview.

cf apps
cf services
cf routes
cf env publish-data-beta-production
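
The commands above can be wrapped in a small helper so that each section of output is labelled. This is only a sketch: the function names `paas_overview` and `run_cf` are hypothetical and not part of any data.gov.uk tooling, and it assumes you are already logged in to `cf` and targeting the right organisation and space.

```shell
#!/bin/sh
# Hypothetical helper: run one cf subcommand under a labelled heading.
run_cf() {
  echo "== cf $* =="
  # Report a failure on stderr but keep going, so one broken
  # command does not hide the output of the others.
  cf "$@" || echo "cf $* failed" >&2
}

# Hypothetical helper: print the same overview as the four commands above.
# The app name defaults to the one used in the example above.
paas_overview() {
  app="${1:-publish-data-beta-production}"
  run_cf apps
  run_cf services
  run_cf routes
  run_cf env "$app"
}
```

After sourcing the file, call `paas_overview` with no arguments for the default app, or pass a different app name as the first argument.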

We use GOV.UK Signon for user authentication in Publish Data, with the app in each environment linked to the corresponding instance of GOV.UK Signon. See the Publish ADR for more info.

CKAN

CKAN is hosted on AWS and is maintained/deployed in the same way as most other GOV.UK applications.

This page was last reviewed on 27 September 2019. It needs to be reviewed again on 27 March 2020 by the page owner #govuk-platform-health.