Parallelize PR system tests via GitHub Actions matrix. by PranjalManhgaye · Pull Request #829 · precice/tutorials

PranjalManhgaye · 2026-06-05T14:20:29Z

Summary

This PR splits the PR system tests into two GitHub Actions matrix jobs (release_test_shard_1 and release_test_shard_2). Together they cover the same 48 cases as release_test. I left release_test itself unchanged so that manual runs and other workflows still work as before. I also set fail-fast: false so that if one shard fails, the other keeps running, which makes failures easier to read and cheaper to retry.

Why

Right now, when one test fails you often have to re-run the whole suite and dig through one huge log. With two shards, you get smaller logs per job and can re-run only the failed matrix job.

If we have two precice-tests-vm runners, the shards can run in parallel. On a single runner they may still queue, but we still get clearer CI output, which matches what we discussed for this issue.

Test plan

I have already run python3 validate_release_test_shards.py: 48 cases = 24 + 24.
Local: system-tests-dev with --rundir on the precice-data partition (pass).
Please check this PR from your side with trigger-system-tests so we can verify on precice-tests-vm (I don't think I have access to trigger that from my side).
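
The shard check mentioned in the test plan could look roughly like the following. This is a minimal sketch, not the actual validate_release_test_shards.py: the function name validate_shards and the case identifiers are hypothetical, and it assumes each suite has already been reduced to a flat list of case names.

```python
from collections import Counter


def validate_shards(full_suite, shards):
    """Check that the shards partition full_suite.

    Returns a list of human-readable problems; an empty list means every
    case of full_suite appears in exactly one shard and nothing else does.
    """
    # Count how often each case appears across all shards.
    seen = Counter(case for shard in shards for case in shard)
    expected = set(full_suite)

    missing = sorted(expected - set(seen))
    extra = sorted(set(seen) - expected)
    duplicated = sorted(case for case, count in seen.items() if count > 1)

    problems = []
    if missing:
        problems.append(f"missing from shards: {missing}")
    if extra:
        problems.append(f"not in full suite: {extra}")
    if duplicated:
        problems.append(f"duplicated across shards: {duplicated}")
    return problems


if __name__ == "__main__":
    # Hypothetical case names standing in for the 48 release_test cases.
    release_test = [f"case-{i:02d}" for i in range(48)]
    shard_1, shard_2 = release_test[:24], release_test[24:]

    problems = validate_shards(release_test, [shard_1, shard_2])
    print(f"{len(release_test)} cases = {len(shard_1)} + {len(shard_2)}")
    print("OK" if not problems else "\n".join(problems))
```

Reporting missing, extra, and duplicated cases separately keeps the message useful when a shard list drifts from release_test.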

Notes

Manual and latest-components workflows still use release_test; we can extend the matrix there in a follow-up if you want.
If two shards run at once on different runners, Docker may build the same images twice. I kept v1 simple; we can tighten that later if CI shows problems.

close #789

PranjalManhgaye · 2026-06-05T14:25:54Z

@MakisH Follow-ups (I think for later, not this PR):

Same matrix for the manual / latest-components workflows
More shards if we get more runners
Docker build sharing only if CI shows we need it
Option 2 (systemtests.py parallel) is skipped for now

MakisH

Nice to see a first prototype towards parallelization!

While this is a valid and easy-to-implement approach for parallelization, I think it mainly adds a layer to the current approach.

Ideally, we should end up with the individual test suites (the ones per tutorial) as shards, so that the jobs also get meaningful names. Issues there will be:

race conditions in building the same Docker layers, if a runner picks up more than one shard at the same time,
some tests are currently excluded from release_test (see extra) because they take too long. We could then just define these directly in extra, instead of referring to the ones defined at the tutorial level with anchors.

Nevertheless, I could create another runner and use this PR to test if the parallelism with multiple custom runners makes sense.

MakisH · 2026-06-07T18:06:59Z

    uses: precice/tutorials/.github/workflows/run_testsuite_workflow.yml@develop
    with:
-      suites: ${{ inputs.suites || 'release_test' }}
+      suites: ${{ matrix.suites }}


The || 'release_test' part is for workflow runs that don't start from a workflow_dispatch (manual trigger) event, which is what would provide the input arguments. We will need defaults in this case as well.
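
For reference, the fallback described here can be sketched in Python, whose `or` behaves like `||` in GitHub Actions expressions for this purpose: a missing or empty value is falsy, so the right-hand side is used. The function name resolve_suites and the dictionary standing in for the inputs context are hypothetical.

```python
def resolve_suites(inputs, default="release_test"):
    """Return the suites to run, falling back to a default.

    Mirrors the expression: inputs.suites || 'release_test'
    """
    # A missing key or an empty string both fall through to the default.
    return inputs.get("suites") or default


if __name__ == "__main__":
    # Manual trigger with an explicit input.
    print(resolve_suites({"suites": "release_test_shard_1"}))
    # Event without inputs (for example, a pull request event).
    print(resolve_suites({}))
    # Manual trigger where the input was left empty.
    print(resolve_suites({"suites": ""}))
```

Replacing the expression with only a matrix value drops this fallback, which is why a default is still needed.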

MakisH · 2026-06-07T18:11:01Z

+    @staticmethod
+    def _iter_tutorial_cases(tutorials_section):
+        """Yield tutorial case dicts, flattening YAML list aliases (e.g. shard lists)."""
+        for item in tutorials_section:
+            if isinstance(item, list):
+                yield from TestSuites._iter_tutorial_cases(item)
+            else:
+                yield item


This needs some comments/motivation. Why is it needed? Why is it (and only it) declared as a static method?
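
As background for this question, the sketch below illustrates why such flattening may be needed. When a YAML sequence is defined once with an anchor and then referenced through an alias as an item of another sequence, the loader yields a nested list, not a merged one. The data is hand-written to mimic that parsed result, the case names and keys are hypothetical, and the generator is written as a module-level function here because it uses no instance or class state.

```python
def iter_tutorial_cases(tutorials_section):
    """Yield case dicts, recursively flattening nested lists."""
    for item in tutorials_section:
        if isinstance(item, list):
            # An aliased shard list shows up as a list inside the list.
            yield from iter_tutorial_cases(item)
        else:
            yield item


# Mimics what a YAML loader would return for a layout like:
#   shard_1: &shard_1 [ {...}, {...} ]
#   shard_2: &shard_2 [ {...} ]
#   release_test:
#     tutorials: [ *shard_1, *shard_2 ]
shard_1 = [{"case": "a"}, {"case": "b"}]
shard_2 = [{"case": "c"}]
release_test_tutorials = [shard_1, shard_2]  # nested, not flat

if __name__ == "__main__":
    flat = list(iter_tutorial_cases(release_test_tutorials))
    print(len(release_test_tutorials), "top-level items")
    print(len(flat), "cases after flattening")
```

Without the flattening step, code that expects each item to be a case dict would receive whole shard lists.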

PranjalManhgaye added a commit to PranjalManhgaye/tutorials that referenced this pull request Jun 5, 2026

Add changelog entry for PR precice#829

7e76ab0

PranjalManhgaye requested a review from MakisH June 5, 2026 14:27

MakisH reviewed Jun 6, 2026

Comment thread heat-exchanger/download-meshes.sh Outdated

Comment thread .github/workflows/system-tests-pr.yml

Comment thread tools/tests/README.md Outdated

Comment thread tools/tests/validate_release_test_shards.py Outdated

MakisH mentioned this pull request Jun 6, 2026

Add incremental system test progress to job summary #821

Closed

3 tasks

PranjalManhgaye added 3 commits June 6, 2026 18:10

Parallelize PR system tests via GitHub Actions matrix.

21f71aa

Add release_test_shard_1/2 covering the same cases as release_test, and run them as separate matrix jobs for clearer logs and cheaper reruns.

Add changelog entry for PR precice#829

e29c818

Remove unrelated heat-exchanger/download-meshes.sh from PR

9d75c1f

PranjalManhgaye force-pushed the issue-789-parallel-system-tests-matrix branch from 7e76ab0 to 9d75c1f Compare June 6, 2026 12:46

PranjalManhgaye added 2 commits June 6, 2026 19:57

Address review: run matrix on latest-components workflow.

1362723

Move the release_test shard matrix to system-tests-latest-components, restore system-tests-pr to a single release_test job, and clarify README wording on concurrent Docker builds.

Compose release_test from shard YAML anchors.

38b4813

Define release_test_shard_1/2 tutorial lists once with YAML anchors and build release_test as their union. Flatten nested list aliases in TestSuite parsing and remove validate_release_test_shards.py.

PranjalManhgaye requested a review from MakisH June 6, 2026 15:06

MakisH reviewed Jun 7, 2026

MakisH added the systemtests label Jun 7, 2026

MakisH added this to GSoC 2026: System tests improvements Jun 8, 2026

github-project-automation Bot moved this to Planned next in GSoC 2026: System tests improvements Jun 8, 2026

MakisH moved this from Planned next to Needs review in GSoC 2026: System tests improvements Jun 8, 2026

PranjalManhgaye wants to merge 5 commits into precice:develop from PranjalManhgaye:issue-789-parallel-system-tests-matrix
