Skip to main content
Warning This document has not been updated for a while now. It may be out of date.
Last updated: 9 Sep 2021

govuk-data-science-workshop: Contributing

We love contributions! We've compiled this documentation to help you understand our contributing guidelines. If you still have questions, please contact us and we'd be happy to help!

Code of Conduct

Please read CODE_OF_CONDUCT.md before contributing.

Getting started

To start contributing, open your terminal, and install the required Python packages, and pre-commit hooks using:

pip install -r requirements.txt
pre-commit install

or the make command:

make requirements

The pre-commit hooks are a security feature to ensure, for example, no secrets[^1], large data files, and Jupyter notebook outputs are accidentally committed into the repository. For more information on pre-commit hooks see our documentation.

[^1]: Only secrets of specific patterns are detected by the pre-commit hooks.

Code conventions

We mainly follow the GDS Way in our code conventions.

Git and GitHub

We use Git to version control the source code. Please read the GDS Way for details on Git best practice. This includes how to write good commit messages, use git rebase for local branches and git merge --no-ff for merges, as well as using git push --force-with-lease instead of git push -f.

If you want to modify the .gitignore files, see the template documentation for further details.

Our source code is stored on GitHub. Pull requests into main require at least one approved review.

Python

For Python code, we follow the GDS Way Python style guide with a line length of 88; the flake8 pre-commit hook should help with this!

Markdown

Local links can be written as normal, but external links should be referenced at the bottom of the Markdown file for clarity. For example:

Use a [local link to reference the `README.md`](../../README.md) file, but [an external
link for GOV.UK][gov-uk].

[gov-uk]: https://www.gov.uk/

We also try to wrap Markdown to a line length of 88 characters, but this is not strictly enforced in all cases, for example with long hyperlinks.

Testing

Tests are written using the pytest framework, with its configuration in the pyproject.toml file. Note, only tests in the tests folder are run. To run the tests, enter the following command in your terminal:

pytest

Code coverage

Code coverage of Python scripts is measured using the coverage Python package; its configuration can be found in pyproject.toml. Note coverage only extends to Python scripts in the src folder.

To run code coverage, and view it as an HTML report, enter the following command in your terminal:

coverage run -m pytest
coverage html

or use the make command:

make coverage_html

The HTML report can be accessed at htmlcov/index.html.

Documentation

We write our documentation in MyST Markdown for use in Sphinx. This is mainly stored in the docs folder, unless it's more appropriate to store it elsewhere, like this file.

Please read our guidance on how to write accessible documentation, as well as our guidance on writing Sphinx documentation. This allows you to build the documentation into an accessible, searchable website.