-
Notifications
You must be signed in to change notification settings - Fork 30.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
test_runner: add bail out #56490
base: main
Are you sure you want to change the base?
test_runner: add bail out #56490
Conversation
Review requested:
|
89f64db
to
55874e8
Compare
lib/internal/test_runner/runner.js
Outdated
this.root.harness.testsProcesses.forEach((child) => { | ||
child.kill(); | ||
}); | ||
return; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
replace with an abort signal?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hey @atlowChemi, sure!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We already have an existing signal passed to the spawned process, but that is a signal at the Test
level IIRC. perhaps we can add a new controller on root.harness
, and spawn with a AbortSignal.any
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done!
I moved the AbortSignal.any
directly into createTestTree
for consistency
Can you explain more about what was flaky? I'm guessing you mean the tests were at different points of execution when they received the bail out signal. I think the best way to work around this is to use test fixtures that never finish. |
Hey @cjihrig, you're guessing right! Your proposed solution sounds good. I think we should add a test as follows:
WDYT? |
doc/api/test.md
Outdated
By enabling this flag, the test runner will exit the test suite early | ||
when it encounters the first failing test, preventing | ||
the execution of subsequent tests. | ||
Already running tests will be canceled, and no further tests will be started. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure if we need to get into this small of a detail, but technically there will be a window of time where new tests may be started before being cancelled. Maybe we can say something like "The test runner will cancel all remaining tests."
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
doc/api/test.md
Outdated
@@ -1483,6 +1512,11 @@ changes: | |||
does not have a name. | |||
* `options` {Object} Configuration options for the test. The following | |||
properties are supported: | |||
* `bail` {boolean} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think this should be an option to test()
. Should it be part of run()
instead?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, my bad, the bail
option it's part of run
. I'll update the doc ASAP !
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed
test/parallel/test-runner-run.mjs
Outdated
it('should only allow a boolean in options.bail', async () => { | ||
[Symbol(), {}, [], () => { }, 0, 1, 0n, 1n, '', '1', Promise.resolve([])] | ||
.forEach((bail) => assert.throws(() => run({ bail }), { | ||
code: 'ERR_INVALID_ARG_TYPE' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just out of curiosity, are we sure that these errors actually correspond to the use of bail
? For example, it's possible that ERR_INVALID_ARG_TYPE
or ERR_INVALID_ARG_VALUE
are thrown by run()
completely unrelated to the bail
option.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've written the test following a red-green cycle.
I'll take another look and I'll add a comment here 😁
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Another option is to update the assertion to check the code and the message.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree, we should probably do the same for the other "validation" tests as well
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
test/parallel/test-runner-run.mjs
Outdated
|
||
// TODO(pmarchini): Bailout is not supported in watch mode yet but it should be. | ||
// We should enable this test once it is supported. | ||
it.todo('should handle the bail option with watch mode'); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this be documented?
There are all sorts of annoying edge cases to account for here, and it might be worth doing a survey of how the tap, mocha, and vitest runners handle bailing out when things are running in parallel. It's much more straightforward when only one thing is running. But, for example, if the very first test in the first process fails, do we bother spawning the other child processes at all? Or, in a bail out situation, how important is it to have an accurate summary at the end of the test run with correct counts for total tests, cancelled tests, etc. |
55874e8
to
f1c269c
Compare
@cjihrig I'm still checking different runners and it seems that mostly the behaviour is "non-standard". Mocha stops the execution returning a partial result without reporting the full list of cancelled tests. const result = await runMochaAsync('options/parallel/test-*', [
'--parallel',
'--bail'
]);
// we don't know _exactly_ how many tests will be skipped here
// due to the --bail, but the number of tests completed should be
// less than the total, which is 5.
return expect(
result.passing + result.pending + result.failing,
'to be less than',
5
); Checking tests like this one I have the impression that the testing of the feature itself follows a "best effort" approach in more than one tool. While I'm still checking other examples I think that the most common use case for the bailout is to stop as soon as possible the execution in a CI/automation env. Regarding the tests I was thinking about:
Do you have any suggestions / other behaviours you think we should ensure? |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #56490 +/- ##
==========================================
+ Coverage 89.12% 89.19% +0.07%
==========================================
Files 662 662
Lines 191555 191854 +299
Branches 36859 36935 +76
==========================================
+ Hits 170722 171133 +411
+ Misses 13694 13565 -129
- Partials 7139 7156 +17
|
Catching up with the last attempt (#48919), this is another try at introducing the bailout feature.
I'm opening this PR as a draft to discuss the implementation and because refactoring may be needed if this approach is well-received by the community.
Note: In some tests, I had to enforce a
concurrency=1
setting because testing the bailout feature across multiple files concurrently proved to be extremely flaky.