Tutorial testing with Spread

Automated tests that run the documentation tutorial end-to-end inside a Multipass VM. Shell commands are extracted from the Markdown tutorial pages and executed sequentially by the Spread test framework.

Overview

The tutorial Markdown files under docs/tutorial/ are the single source of truth. Test metadata, wait points, assertions, and hidden commands are all expressed as HTML comments inside those files — invisible to readers but consumed by extract_commands.py.

The generation pipeline:

Each docs/tutorial/<page>.md that contains a `<!-- test:spread -->` metadata block maps to a python/tests/tutorial/<page>.sh script and a python/tests/tutorial/<page>/task.yaml Spread task. Execution order is determined by the priority field in the spread metadata, not by filenames.
extract_commands.py extracts ```shell fenced blocks, processes annotations, and writes both the .sh script and task.yaml. It supports a directory mode (extract_commands.py <input_dir> <output_dir>) that auto-discovers all .md files with spread metadata, as well as explicit <input.md> <output.sh> pairs.
Generation is driven either by tox (tox -e tutorial-extract) or by the Makefile (make -f python/tests/tutorial/Makefile extract). Both use directory discovery mode by default.
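
The discovery and ordering rules above can be sketched in a few lines of Python. This is only an illustration, not the code in extract_commands.py: the names `parse_spread_metadata` and `plan_outputs` are hypothetical, the regular expression assumes the annotation is laid out like the `test:spread` example in the annotation reference, and the sort direction (higher priority first) is an assumption about how Spread schedules tasks.

```python
import re
from pathlib import PurePosixPath
from typing import Optional

# Assumed annotation shape: "<!-- test:spread", then "key: value" lines, then "-->".
SPREAD_RE = re.compile(r"<!--\s*test:spread\s*\n(.*?)-->", re.DOTALL)


def parse_spread_metadata(markdown: str) -> Optional[dict]:
    """Return the key/value pairs of the first test:spread block, or None."""
    match = SPREAD_RE.search(markdown)
    if match is None:
        return None
    metadata = {}
    for line in match.group(1).splitlines():
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    return metadata


def plan_outputs(pages: dict, output_dir: str) -> list:
    """Return (page, script path, task.yaml path) tuples for discoverable pages."""
    out = PurePosixPath(output_dir)
    planned = []
    for filename, text in pages.items():
        metadata = parse_spread_metadata(text)
        if metadata is None:
            continue  # no spread metadata: the page is not discovered
        page = PurePosixPath(filename).stem
        priority = int(metadata.get("priority", "0"))
        planned.append(
            (priority, page, str(out / f"{page}.sh"), str(out / page / "task.yaml"))
        )
    # Assumption: higher priority runs first; filenames play no role in ordering.
    planned.sort(key=lambda item: item[0], reverse=True)
    return [item[1:] for item in planned]
```

Calling `plan_outputs` with a mapping of filename to Markdown text and `"python/tests/tutorial"` as the output directory yields one tuple per page that carries spread metadata; pages without it are silently skipped.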

Generated files (.sh and task.yaml) are not stored in git. They must be generated locally before running Spread.

Prerequisites

Ubuntu host machine (tested on 24.04)
Multipass
Go
Spread installed via Go (not as a snap)
Python 3 and make (usually pre-installed on Ubuntu)

Quick start

From a fresh clone:

git clone <repo-url> && cd spark-k8s-bundle

# 1. Generate the .sh scripts and task.yaml files from Markdown sources.
tox -e tutorial-extract

# 2. Run the full tutorial test suite (extract + spread).
#    Runs all stages even if earlier ones fail.
tox -e tutorial

Alternatively, you can use the Makefile directly:

make -f python/tests/tutorial/Makefile extract   # step 1
make -f python/tests/tutorial/Makefile test      # steps 1+2 (abort on first failure)

The tox -e tutorial env runs in continue mode (no -abend), executing all stages even if earlier ones fail. This is the mode used by CI.

The Makefile test target uses -abend to stop immediately on the first failure — more useful during local development.

Run modes

| tox / Make | Spread flags | Behaviour |
| --- | --- | --- |
| `tox -e tutorial` / `make … test-continue` | `-vv` | Run all stages even if earlier ones fail (CI default) |
| `tox -e tutorial-extract` / `make … extract` | (none) | Generate scripts only (no Spread run) |
| `make … test` | `-abend -vv` | Abort on first failure, tear down VM |
| `make … test-debug` | `-abend -vv -debug` | Abort on first failure, drop into an interactive VM shell |

test-debug is the most useful mode during development. When a step fails, Spread pauses and prints SSH credentials for the VM. You can SSH in, inspect juju status, read logs, re-run commands by hand, then type exit (or Ctrl+D) to let Spread clean up. Example:

make -f python/tests/tutorial/Makefile test-debug

On failure you'll see output like:

2026-04-11 20:13:23 Debug shell on multipass:ubuntu-24.04-64 for multipass:ubuntu-24.04-64:python/tests/tutorial/1-environment-setup
2026-04-11 20:13:23   Address: 10.189.154.39:22
2026-04-11 20:13:23   User:    root
2026-04-11 20:13:23   Password: 6d11d3739e023950

Use those credentials to SSH in:

ssh root@10.189.154.39     # password from the output above
cd /spark-k8s-bundle
juju status                 # inspect the model
bash python/tests/tutorial/1-environment-setup.sh   # re-run the failing script

When done, exit the shell and Spread will tear down the VM.

Running directly with `spread`

You can also call spread directly for finer control:

# Run all stages regardless of failures to get a full report (default):
spread -vv multipass:ubuntu-24.04-64:python/tests/tutorial/

# Abort on first failure:
spread -abend -vv multipass:ubuntu-24.04-64:python/tests/tutorial/

# Debug mode — interactive shell on failure:
spread -abend -vv -debug multipass:ubuntu-24.04-64:python/tests/tutorial/

# Run a single stage:
spread -abend -vv -debug multipass:ubuntu-24.04-64:python/tests/tutorial/1-environment-setup
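
For scripting around these invocations, the flag combinations from the run modes table can be captured in a small helper. This is a sketch only: the mode labels (`continue`, `abort`, `debug`) and the function name `spread_command` are made up for illustration, while the flags and the suite path are the ones shown above.

```python
# Suite path as used in the commands above.
SUITE = "multipass:ubuntu-24.04-64:python/tests/tutorial/"

# Hypothetical mode labels mapped to the Spread flags from the run modes table.
MODE_FLAGS = {
    "continue": ["-vv"],
    "abort": ["-abend", "-vv"],
    "debug": ["-abend", "-vv", "-debug"],
}


def spread_command(mode: str, stage: str = "") -> list:
    """Build the argv for a spread run; an empty stage selects the whole suite."""
    return ["spread", *MODE_FLAGS[mode], SUITE + stage]
```

The returned list can be passed to `subprocess.run` unchanged, which avoids shell quoting issues.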

Resource defaults (override with env vars):

| Variable | Default | Purpose |
| --- | --- | --- |
| `SPREAD_VM_CPUS` | `8` | Multipass VM CPU count |
| `SPREAD_VM_MEM` | `16G` | Multipass VM RAM |
| `SPREAD_VM_DISK` | `50G` | Multipass VM disk |
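
The override behaviour amounts to "use the environment variable if set, otherwise the default". A minimal Python sketch of that lookup follows; the helper name `vm_resources` is hypothetical, and how the real Spread configuration consumes these variables is not shown here.

```python
import os

# Defaults as listed in the table above.
DEFAULTS = {
    "SPREAD_VM_CPUS": "8",
    "SPREAD_VM_MEM": "16G",
    "SPREAD_VM_DISK": "50G",
}


def vm_resources(env=None) -> dict:
    """Resolve VM sizing, letting environment variables override the defaults."""
    env = os.environ if env is None else env
    return {name: env.get(name, default) for name, default in DEFAULTS.items()}
```

For example, `vm_resources({"SPREAD_VM_MEM": "8G"})` keeps the default CPU count and disk size and only changes the RAM.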

CI

Tutorial tests run automatically in GitHub Actions via .github/workflows/tutorial-tests.yaml. The workflow:

Triggers: manual dispatch, workflow_call (from other workflows), and monthly schedule (1st of every month at 03:00 UTC).
Runner: self-hosted xlarge with KVM support (required by Multipass).
Mode: continue (-vv, no -abend) — runs all stages and reports all failures.

To trigger manually:

gh workflow run tutorial-tests.yaml --ref <branch>
gh run watch

Adding a new tutorial page

Add a `<!-- test:spread -->` block to the Markdown file with priority and kill-timeout (see below). This is what makes the file discoverable by extract_commands.py.
Register the page in the SCRIPTS variable in python/tests/tutorial/Makefile so that make all can track it for incremental (timestamp-based) rebuilds. tox uses directory discovery and does not need updating.
Run tox -e tutorial-extract (or make -f python/tests/tutorial/Makefile extract) to generate both the .sh script and task.yaml.

Annotation reference

Annotations are HTML comments in the Markdown source. Only ```shell fences are extracted; other tags (```bash, ```text) are ignored.
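
The fence rule can be illustrated with a small line-based scanner. This is a sketch of the rule as stated, not the parser used by extract_commands.py; the function name `extract_shell_blocks` is hypothetical, and annotation handling is left out entirely.

```python
FENCE = "`" * 3  # three backticks, built this way to keep the example fence-safe


def extract_shell_blocks(markdown: str) -> list:
    """Return the bodies of fenced blocks tagged 'shell'; other tags are ignored."""
    blocks = []
    current = None  # lines of the block being read, or None when outside a fence
    lang = ""
    for line in markdown.splitlines():
        stripped = line.strip()
        if current is None:
            if stripped.startswith(FENCE):
                lang = stripped[3:].strip()  # info string after the opening fence
                current = []
        elif stripped == FENCE:
            if lang == "shell":
                blocks.append("\n".join(current))
            current = None
        else:
            current.append(line)
    return blocks
```

A page with one shell block, one bash block, and one text block therefore contributes exactly one entry to the result.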

Available annotations:

Skip the next shell block
Emit a sleep
Wait for all units to be active/idle
Run the next block with a timeout
Capture command output into variables (`test:set-variables`)
Hidden commands, not rendered (`test:run`)
Hidden assertions (`test:assert`)
Spread task metadata (`test:spread`)

``

Skip the next ```shell block.

``

Emit sleep N at that point in the script.

``

Emit a wait_idle call (from helpers.sh) that polls juju status until all units are active/idle. --timeout is in seconds (default: 1200). --allow-blocked lists apps permitted to be in blocked state (comma-separated).
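
The real wait_idle is a shell function in helpers.sh; the Python below only sketches the polling logic it is described as performing. Several things are assumptions: the JSON layout (`applications`, `units`, `workload-status`, `juju-status`) is taken to match `juju status --format json`, units of allow-listed apps are accepted as soon as they report blocked, and the 10-second poll interval is invented for the example.

```python
import time


def units_settled(status: dict, allow_blocked=frozenset()) -> bool:
    """True when every unit is active/idle, or blocked in an allow-listed app."""
    for app_name, app in status.get("applications", {}).items():
        for unit in app.get("units", {}).values():
            workload = unit.get("workload-status", {}).get("current")
            agent = unit.get("juju-status", {}).get("current")
            if workload == "blocked" and app_name in allow_blocked:
                continue  # assumption: allowed-blocked units count as settled
            if workload != "active" or agent != "idle":
                return False
    return True


def wait_idle(fetch_status, timeout=1200, interval=10, allow_blocked=frozenset(),
              sleep=time.sleep, clock=time.monotonic) -> bool:
    """Poll fetch_status() until the model settles or the timeout (seconds) expires."""
    deadline = clock() + timeout
    while True:
        if units_settled(fetch_status(), allow_blocked):
            return True
        if clock() >= deadline:
            return False
        sleep(interval)
```

Here `fetch_status` is any callable returning the parsed status, for instance a wrapper that runs `juju status --format json` and decodes the output with `json.loads`.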

``

Run the next ```shell block inside timeout N; ignore exit code.

`test:set-variables`

Run a command and extract named fields into shell variables. Subsequent <field> placeholders in shell blocks are auto-replaced with ${VAR}.

<!-- test:set-variables
command: juju run data-integrator/leader get-credentials
KAFKA_USERNAME: username
KAFKA_PASSWORD: password
-->
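
To make the placeholder substitution concrete, here is a rough Python sketch of the two steps involved: reading the annotation body into a command plus a variable-to-field map, then rewriting `<field>` placeholders. The function names are hypothetical, the body format is inferred from the example above, and how field values are pulled out of the command output is deliberately not modelled.

```python
def parse_set_variables(body: str):
    """Split an annotation body into (command, {VAR: field}) using 'key: value' lines."""
    command = None
    fields = {}
    for line in body.strip().splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # not a "key: value" line
        if key.strip() == "command":
            command = value.strip()
        else:
            fields[key.strip()] = value.strip()
    return command, fields


def substitute_placeholders(script: str, fields: dict) -> str:
    """Replace each <field> placeholder with a ${VAR} shell reference."""
    for var, field in fields.items():
        script = script.replace("<" + field + ">", "${" + var + "}")
    return script
```

With the annotation above, a later shell line such as `echo "<username>:<password>"` would become `echo "${KAFKA_USERNAME}:${KAFKA_PASSWORD}"`.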

`test:run`

Emit hidden shell commands (not visible in rendered docs).

`test:assert`

Like test:run but marked as an assertion. Relies on set -e to abort on failure. Use jq -e, grep -q, or test for checks.

<!-- test:assert
juju status --format json | jq -e '.applications.kafka.units | length == 3'
-->

`test:spread`

Spread task metadata. Used to generate task.yaml. Not emitted into scripts.

<!-- test:spread
priority: 200
kill-timeout: 30m
-->
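
As an illustration of how this metadata could end up in a task.yaml, the sketch below renders a minimal Spread task from a page name and its parsed metadata. The exact file that extract_commands.py writes is not documented here, so the layout, the summary text, and the `execute` line are all assumptions; only the `priority` and `kill-timeout` keys come from the annotation itself.

```python
def render_task_yaml(page: str, metadata: dict) -> str:
    """Render a minimal, hypothetical task.yaml for one tutorial page."""
    lines = ["summary: Tutorial page " + page]  # assumed summary wording
    for key in ("priority", "kill-timeout"):
        if key in metadata:
            lines.append(key + ": " + metadata[key])
    # Assumed execute step: run the generated script for this page.
    lines.append("execute: |")
    lines.append("    bash python/tests/tutorial/" + page + ".sh")
    return "\n".join(lines) + "\n"
```

Keys absent from the metadata are simply omitted, so a page that only sets `priority` produces a task without a `kill-timeout` line.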
