
Set up GitHub Actions Workflow for Testing Parsl with Flux #3159

Merged: 73 commits into Parsl:master on Jun 10, 2024

Conversation

@mercybassey (Contributor) commented Mar 6, 2024

Description

This pull request introduces a new GitHub Actions workflow aimed at testing Parsl's integration with Flux.

Changed Behaviour

The primary focus of this PR is to enhance Parsl's testing infrastructure by integrating a new GitHub Actions workflow aimed at validating Parsl's compatibility and functionality with Flux.

Fixes

Fixes #2713

Type of change

Choose which options apply, and delete the ones which do not apply.

  • Enhancement

@mercybassey (Contributor, Author) commented Mar 6, 2024

Hi @benclifford, I've created the CI workflow for testing Flux with Parsl. Initially, the CI failed due to issues with bootstrap.sh and the Reframe tests. I simplified the workflow by removing those elements and focused on setting up Flux. After these adjustments, the CI ran successfully with just Flux.

I then added Parsl installation with pip3 install parsl, which ran successfully. I believe this will lay the groundwork for future Parsl and Flux integration tests.

Kindly review. I am open to any feedback or suggestions for further improvements.

@mercybassey marked this pull request as ready for review on March 6, 2024 09:23
@benclifford (Collaborator) commented Mar 6, 2024

ok, great.

here are some things that I think should happen next in this PR:

i) Install the "right" parsl version

These GitHub Actions are run to test specific new versions of parsl that only exist in pull requests. This step in your newly added Actions workflow checks out the correct version of Parsl (with whatever changes the submitter has made):

      - name: Checkout
        uses: actions/checkout@v3

but later on you install the latest official release of Parsl (as published on PyPI.org), not the version that was checked out from the pull request:

      - name: Install Parsl
        run: |
          pip3 install parsl

To install the version that is checked out by the checkout step (instead of the version from PyPI), I think it will be enough to call pip3 install . instead; in that command, . means "install whatever is in the current directory", which we hope is parsl. If you look in .github/workflows/ci.yaml you should see both a checkout action and then, later on, some pip install . commands, and basically I think you should do something like that (a rough sketch follows).
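For concreteness, here is a minimal sketch of what that install step could look like (the step name is illustrative, and the exact placement in the workflow may differ):

      # Hedged sketch: install the Parsl tree checked out by actions/checkout,
      # rather than the latest release published on PyPI.
      - name: Install Parsl from the checked-out source
        run: |
          pip3 install .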

ii) Run some tests to check you installed parsl right.

You can check that the basic parsl install worked, before running anything with parsl+flux, with make local_thread_test (which actually just runs: pytest parsl/tests/ -k "not cleannet" --config parsl/tests/configs/local_threads.py --random-order --durations 10) - that should run for about 10 seconds, and end like this:

=============================================== 148 passed, 222 skipped, 4 warnings in 5.94s ================================================

You might encounter some problems here with stuff not being installed right (missing dependencies, for example).
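As a rough sketch (assuming make and the test dependencies are available inside the job), that check could become its own workflow step:

      # Hedged sketch: run the quick local-threads test suite to confirm the
      # Parsl install works before touching Flux. Step name is illustrative.
      - name: Verify Parsl installation
        run: |
          make local_thread_test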

iii) test parsl+flux

There's one basic flux test file here parsl/tests/test_flux.py so I think you should be able to run:

pytest parsl/tests/test_flux.py
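In workflow terms, that might look something like the sketch below (assuming the job is already running inside the Flux container set up earlier in the workflow; the step name is illustrative):

      # Hedged sketch: run the basic Flux integration test file. As noted in
      # point iv below, this currently skips rather than fails if Flux is
      # detected as unavailable.
      - name: Test Parsl with Flux
        run: |
          pytest parsl/tests/test_flux.py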

Extras to think about after the above is working, but don't attempt until the basics above work:

iv) These tests have a problem that I dislike: if they detect flux is not working, they will not fail (and so CI will not fail, and so developers will not know that flux is no longer being tested). Probably in a separate PR after this PR is done, we can work on changing that behaviour.

v) For our other executors, we try to run a big range of general Parsl tests to check that lots of different parsl usage patterns and options work with that executor - that would be a useful thing to add in here too. If you look in the Makefile, you will see various executors tested by running pytest with --config <some config> for different kinds of executors, and it would be good to make that work for a flux configuration too (a sketch of what such a config module might look like follows).
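As a hedged sketch of such a config module (modeled on the existing files under parsl/tests/configs/, with a hypothetical filename; the real FluxExecutor setup may need options that are not shown here):

# parsl/tests/configs/flux_local.py (hypothetical filename)
from parsl.config import Config
from parsl.executors import FluxExecutor


def fresh_config():
    # Minimal Flux-backed config for running the general test suite with
    # pytest --config; all FluxExecutor options are left at their defaults.
    return Config(executors=[FluxExecutor()])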

@mercybassey (Contributor, Author) commented Mar 6, 2024

Hi @benclifford, I've implemented the pip3 install parsl . change and integrated the installation of dependencies from test-requirements.txt into the CI pipeline, so all the libraries needed for testing are installed; that worked well.

Following that, I attempted to verify the Parsl installation by running a subset of the test suite as suggested (pytest parsl/tests/ -k "not cleannet" --config parsl/tests/configs/local_threads.py --random-order --durations 10). However, I've encountered a persistent issue where the tests fail due to an unrecognized argument, --random-order. Adjusting the test command to remove potentially problematic flags did not resolve the issue.

I'll be needing your suggestions on potential adjustments or alternative approaches to verify the Parsl installation. Your guidance will be greatly appreciated.

@benclifford (Collaborator) commented:

The implementation for --random-order is installed from an extra pytest plugin module: https://pypi.org/project/pytest-random-order/

that's usually installed as part of the testing requirements... in the main Parsl dev instructions, that happens as part of "make deps", or you could change your test install to pip install . -r test-requirements.txt, which will install both . (the parsl code under test) and all the dependencies in test-requirements.txt (a sketch follows).
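A hedged sketch of what that adjusted install step could look like (the step name is illustrative):

      # Hedged sketch: install the checked-out Parsl plus the test plugins
      # (such as pytest-random-order) listed in test-requirements.txt.
      - name: Install Parsl and test dependencies
        run: |
          pip3 install . -r test-requirements.txt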

@mercybassey (Contributor, Author) commented Mar 6, 2024

@benclifford During the CI run, I encountered a test failure in parsl/tests/test_bash_apps/test_stdout.py, which asserts a failure condition related to handling stdout and stderr paths for bash apps. This is happening in the Verify Parsl Installation step.
=================================== FAILURES ===================================
_____________________________ test_bad_stderr_file _____________________________

    @pytest.mark.issue363
    def test_bad_stderr_file():
        """Testing bad stderr file"""

        err = "/bad/dir/t2.err"

        fn = echo_to_streams("Hello world", stderr=err)

        try:
            fn.result()
        except perror.BadStdStreamFile:
            pass
        else:
>           assert False, "Did not raise expected exception BadStdStreamFile"
E           AssertionError: Did not raise expected exception BadStdStreamFile
E           assert False

parsl/tests/test_bash_apps/test_stdout.py:70: AssertionError
----------------------------- Captured stdout call -----------------------------
Hello world

Upon investigating, I identified that the failure was due to the test expecting an exception when attempting to write to a non-existent directory, but the exception was not raised as expected.

Could you please provide some guidance on how you would like me to address this test failure? Should I attempt to modify the test or the underlying functionality to resolve the issue, or is there a different approach you'd prefer?

@benclifford (Collaborator) commented:

that test looks like the problem some people have had on WSL2: #3160

in that test, the assumption is that / will not be writable, which is true on a traditional multi-user unix system, but apparently not true for those WSL users in #3160, and it looks like it's not true in the flux container that you're using either.

I think seeing this test break in two different places is good motivation to fix the test - maybe you're interested in a side-quest to fix #3160 (as a separate PR), which would help both the WSL users and this flux testing.
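To illustrate the assumption the test relies on (a shell sketch, not part of the test itself):

# On a traditional multi-user unix system this fails with "Permission denied",
# so the test's deliberately bad stderr path really is unusable; as root (for
# example inside the flux container) it succeeds, and the path is no longer "bad".
mkdir -p /bad/dir && touch /bad/dir/t2.err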

@mercybassey (Contributor, Author) commented Mar 7, 2024

Okay, I'll take a look.

@benclifford (Collaborator) commented:

looks like recent tests are hitting that hang-at-exit problem again. I'll have a look at that right now.

@mercybassey (Contributor, Author) commented:

@benclifford the failing test passed this time.

@benclifford (Collaborator) commented:

yes, annoying. but we can merge this PR and just accept that these tests will sometimes fail this way until someone diagnoses and fixes whatever is broken. so I think if you tidy up the messy files in this PR, it would then be ready to merge.

@mercybassey (Contributor, Author) commented:

@benclifford I have removed all the messy files.

@benclifford (Collaborator) commented:

While trying to understand what's happening with hung runs, I realised that this PR does not record test artefacts in the same way that the main github actions workflow does. I put this commit onto my testing branch to capture a test artefact when a run finishes, with the hope that when I later see a hanging run it will give some more clues.

Artefacts like this have been really useful for debugging tests in general.

You can look in ci.yaml to see what I copied this from.

a41b732
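For reference, a hedged sketch of what an artefact-upload step of that kind typically looks like (modeled on the general pattern in ci.yaml; the step name, action version, artefact name and paths in the actual commit a41b732 may differ):

      # Hedged sketch: upload Parsl's runinfo logs even when the job fails or
      # is cancelled, so there is something to inspect after a hung run.
      - name: Archive test artefacts
        if: ${{ always() }}
        uses: actions/upload-artifact@v4
        with:
          name: runinfo-${{ github.sha }}
          path: runinfo/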

@mercybassey (Contributor, Author) commented:

Hi @benclifford. Are there any updates on this issue? I'd like to begin working on this one.

@benclifford (Collaborator) commented:

@mercybassey you can work on the os x model before this PR #3159 is merged, on another branch - I need to write up an issue for what I discovered with flux.

Comment on lines 6 to 8
  build:
    runs-on: ubuntu-20.04
    permissions:
Collaborator commented:

This is now 4 years out of date. Are we interested in using a more recent distro for the test and CI framework?

Collaborator commented:

The flux container image is tagged for jammy, which is the subsequent ubuntu LTS release (22.04). The most recent ubuntu LTS is 24.04, which was released just over a month ago, and it looks like there isn't a flux container for that (on https://hub.docker.com/r/fluxrm/flux-sched/tags).

So I'll upgrade this to 22.04 and see what happens.
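Roughly, the intended change to the lines quoted above (a sketch; only the runner label changes):

  build:
    # bumped from ubuntu-20.04; jammy is the 22.04 LTS, matching the flux container tag
    runs-on: ubuntu-22.04
    permissions: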

@benclifford (Collaborator) commented:

I pulled some of the test changes I made here into PR #3483 for merge first separately.

@benclifford merged commit 5973f39 into Parsl:master on Jun 10, 2024
7 checks passed
@benclifford mentioned this pull request on Jun 24, 2024