Skip to main content
Warning This document has not been updated for a while now. It may be out of date.
Last updated: 1 Aug 2018

content-publisher: 2: Use local datastore for publisher workflow

Date: 2018-07-20

Context

We're building a new publishing architecture and MVP workflow in private beta. We have taken the decision to build this in a new application and therefore we need to decide how we use the Publishing API.

One of the goals of the migration effort from 2016 was to for all publishing applications to be clients of the Publishing API and this API would means that these applications would not need local datastores. This is written up in the GOV.UK Platform Architecture Goals 2017-18+

Since writing these goals we have built the Specialist Publisher application using this pattern. The difficulties of building Rails application this way have led us to reconsider this approach.

We are about to build another Publishing API client and we should now use the learnings we have to ensure that not using a local datastore is the best decision.

Option 1: No local datastore

In this option we continue with the platform architecture goals.

Pros

  • Single source of truth - all data is in the Publishing API
  • Multiple apps can edit the same content
  • Single implementation of scheduling
  • Single abstraction layer

Cons

  • Effectively building an Active Resource layer (object-relational mapper for REST)
  • Harder to resolve performance problems
  • Current workflow is opinionated and may not meet needs of content
  • Testing against an API is hard, it requires a lot of stubbing or running a version of the API as part of the test suite

Option 2: Use a local datastore

Pros

  • It’s standard Rails so quick to build and easy to understand
  • Easy to test, testing with a local datastore is very straightforward
  • Easy to iterate workflow, changing the API workflow is much harder
  • Only final state of content can be put into Publishing API - some formats will need interim data which isn’t needed for publishing
  • No new features needed for meta data - notes, and other publisher info

Cons

  • Syncing between local and Publishing API
  • Local data is hard to query by other applications
  • Combination of local and Publishing API workflows
  • Hard to feedback to use on asynchronous API updates
  • Content-Schema may not match Rails validations causing unhelpful error messages

In addition to the above there are a few pragmatic problems which have influenced our decision:

  • We want to store content prior to it being available as a draft on Publishing API. Doing this with no local datastore would have the complexity of a holding place for this data.
  • We want to model the full edit history of documents, which Publishing API doesn't support.
  • We want to support migrating of Whitehall documents with their histories - Moving this data to a new publishing application is arguably simpler than having to resume the Publishing API migration work and completing full history migration.
  • We want to support features the Publishing API doesn't have such as reviews, scheduling and storing additional user level metadata.

All of these can be done with a commitment to developing the Publishing API but it would much slower, would mean resuming the migration programme and have potential impacts on every client of the Publishing API. It also isn't clear that the Publishing API should have all of this functionality.

To achieve the aims of rapidly building a new publishing application to be ready for beta we're able to do this faster and more effectively by modelling these concepts in a local datastore.

Decision

We will use a local datastore which syncs to the Publishing API.

Status

Accepted

Consequences

The problems we found when building and maintaining Specialist Publisher have shown that the practicalities of not having a local datastore outweigh the architectural purity of a single datastore.

There will be problems in keeping the local datastore and the publishing API in sync in addition to the user experience problems of updating users on the results of asynchronous updates. However the speed and ease of development improvements should outweigh this.

In addition we will also need to review and update the GOV.UK Platform Architecture Goals document to be consistent with this ADR.