github: Use Canonical runners for system tests #469

Open · wants to merge 7 commits into main from self_hosted_runners
Conversation

@roosterfish (Contributor) commented Nov 8, 2024

This PR switches the system tests from the runner group GitHubMicrocloud to our own self-hosted runners.

@roosterfish force-pushed the self_hosted_runners branch 2 times, most recently from 8af0edc to 4f38ed2 on November 8, 2024 at 11:07
@roosterfish marked this pull request as ready for review on November 8, 2024 at 13:58
@roosterfish (Contributor, Author) commented:
@masnax I did some tests regarding this error.

Unfortunately, the timeout in the MicroCeph GetConfig client function is only 5s.
Based on your suggestion in the meeting earlier, can it be that right after forming the MicroCloud, the proxy is waiting for something in the cluster to settle before it can forward the request to MicroCeph's local unix socket?

I have bootstrapped a single-node MicroCloud and fired requests to /1.0/services/microceph in parallel.
Right around when the MicroCloud is bootstrapped I saw a delay in the response, which could prove that something is going on there.
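The parallel probe described here could be sketched roughly as follows. This is a hypothetical reconstruction, not the actual commands used: the socket path, endpoint, and the use of `curl` are assumptions on my part.

```shell
# Assumed MicroCloud control socket path; the real location may differ.
SOCKET="/var/snap/microcloud/common/state/control.socket"

probe_once() {
  # Run the given command and print how long it took, in milliseconds.
  local start end
  start="$(date +%s%3N)"
  "$@" > /dev/null 2>&1
  end="$(date +%s%3N)"
  echo "$(( end - start ))"
}

# Run in parallel with `microcloud init`, e.g.:
# while sleep 0.2; do
#   probe_once curl -s --unix-socket "$SOCKET" http://localhost/1.0/services/microceph
# done
```

A spike in the printed latencies around the moment the cluster is bootstrapped would match the delay described above.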

@masnax (Contributor) commented Nov 8, 2024

> I have bootstrapped a single-node MicroCloud and fired requests to /1.0/services/microceph in parallel. Right around when the MicroCloud is bootstrapped I saw a delay in the response, which could prove that something is going on there.

Well, this current failure is happening long before MicroCloud is bootstrapped, as it happens right after system discovery and before asking any setup questions. In the bootstrap case, the only delay would be related to refreshing the truststore and waiting for the lock, but even that wouldn't happen on a single-node request, as it all goes through the unix socket, which skips truststore verification.

When bootstrapping, the listeners also restart, so that could be the delay you're seeing locally. But again that wouldn't affect the test failure since it's not during bootstrap.

> can it be that right after forming the MicroCloud, the proxy is waiting for something in the cluster to settle before it can forward the request to MicroCeph's local unix socket?

This is the whole local proxy block in MicroCloud, so it's definitely not waiting for anything here.

Since it's a network request, there is the additional overhead of authHandlerMTLS pulling the truststore.

Ensure MicroCeph is fully started after bootstrapping to prevent running into timeouts
if the test suite is too fast.

Signed-off-by: Julian Pelizäus <[email protected]>
@roosterfish (Contributor, Author) commented:
> Well this current failure is happening long before MicroCloud is bootstrapped, as it happens right after system discovery and before asking any setup questions

Hm, it looks like we can fix it by waiting for microceph cluster bootstrap to settle and only continuing once it's done.
I have added another commit that adds a wrapper function we can use throughout the test suite to wait until microceph status reports that the single-node cluster services are present. Waiting for this condition appears to be enough.
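A wrapper along these lines could do the job. This is a hypothetical sketch, not the commit's actual helper: the function name, the default timeout, and the `Services:` grep pattern for `microceph status` output are all assumptions.

```shell
# Hypothetical wait helper; name, timeout, and grep pattern are assumptions.
wait_for_microceph() {
  local timeout="${1:-60}"
  local start
  start="$(date +%s)"
  # Poll until `microceph status` reports services for the local node,
  # or give up after the timeout (in seconds).
  until microceph status 2>/dev/null | grep -qF "Services:"; do
    if [ "$(( $(date +%s) - start ))" -ge "${timeout}" ]; then
      echo "Timed out waiting for MicroCeph services" >&2
      return 1
    fi
    sleep 1
  done
}
```

Calling `wait_for_microceph` right after `microceph cluster bootstrap` in the test suite would then block until the services show up, instead of racing the 5s client timeout.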

@masnax (Contributor) commented Nov 12, 2024

> > Well this current failure is happening long before MicroCloud is bootstrapped, as it happens right after system discovery and before asking any setup questions
>
> Hm, it looks like we can fix it by waiting for microceph cluster bootstrap to settle and only continuing once it's done. I have added another commit that adds a wrapper function we can use throughout the test suite to wait until microceph status reports that the single-node cluster services are present. Waiting for this condition appears to be enough.

Is this something that can be checked over the API? Perhaps microceph cluster bootstrap shouldn't return until all of its services are finished, or there could be a ready API that we can check against before sending requests.
