Refactors workflows engine loop #14375

vyzaldysanchez · 2024-09-09T18:30:10Z

Requires Dependencies

Resolves Dependencies

core/services/workflows/engine.go

patrickhuie19 · 2024-10-07T17:39:13Z

core/services/workflows/engine.go

 			// Executed synchronously to ensure we correctly schedule subsequent tasks.
-			err := e.handleStepUpdate(ctx, stepUpdate)
+			e.logger.Debugw(fmt.Sprintf("received step update for execution %s", stepUpdate.ExecutionID),
+				eIDKey, stepUpdate.ExecutionID, sRKey, stepUpdate.Ref)


should this be eIDKey instead of a key for step update execution IDs?

patrickhuie19 · 2024-10-07T17:41:45Z

core/services/workflows/engine.go

+	})
+	if !added {
+		// skip this execution since there's already a stepUpdateLoop running for the execution ID
+		e.logger.With(eIDKey, executionID).Debugf("won't start execution for execution %s, execution was already started", executionID)


/nit call With once at line 510?

lggr := e.logger.With("event", event, eIDKey, executionID)

patrickhuie19 · 2024-10-07T17:45:10Z

core/services/workflows/engine.go

+		return err
+	}
+	for _, sd := range stepDependents {
+		e.queueIfReady(state, sd)


is it cleaner wrt to separation of concerns if handleStepUpdate isn't responsible for queue'ing dependents?

The call to queue is really part of handling the update here.

Aren't they separate concerns? If there was a parent orchestrator that called handleStepUpdate and then processed the dependents, that wouldn't be the case.

patrickhuie19 · 2024-10-07T17:52:45Z

core/services/workflows/engine.go

+func (e *Engine) isWorkflowFullyProcessed(state store.WorkflowExecution) (bool, string, error) {
+	statuses := map[string]string{}
+	// we need to first propagate the status of the errored status if it exists...
+	err := e.workflow.walkDo(workflows.KeywordTrigger, func(s *step) error {


do these funcs passed into walkDo need to be anonymous? unit testing isWorkflowFullyProcessed seems like it could be cleaner if you could call into these

they can be named, I see no issues there.

patrickhuie19 · 2024-10-07T17:59:17Z

core/services/workflows/engine.go

+
+	var hasErrored, hasTimedOut, hasCompletedEarlyExit bool
+	// Let's determine the status of the workflow.
+	for _, status := range statuses {


I haven't read into walkDo suggest, but its name suggests that its walking the DAG of the workflow steps for a provided func. In this case, is the error status we present to the workflow executor the first errored status the walker finds (b/c that status is what is propagated to all the dependends of an error'd step)?

We consider 3 states other than completed, to consider the workflow processed: error, completed_early_exit and timeout.

The way we return the error to the executor is basically as follows: if there's a single error, then it is considered error, if there's a single timeout, then it is timedout, and if none of the others, if there's a completed_early_exit, then it is considered as such.

The precedence of preference is error -> timeout -> completed_early_exit.

Not sure if this answers your question clearly, please let me know @patrickhuie19.

To restate, these states are mutually exclusive and propagated to their dependents. If we traverse the tree in the order of executions we could return the first non-complete status we see, right?

cl-sonarqube-production · 2024-10-07T19:07:36Z

Quality Gate passed

Issues
2 New issues
1 Fixed issue
0 Accepted issues

Measures
0 Security Hotspots
88.8% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube

vyzaldysanchez added 4 commits September 9, 2024 14:29

Refactors workflows engine loop

5898945

Adds changeset + fixes lint

4f1c898

Adds mutex

2ce6164

Merge branch 'develop' into task/KS-387/workflows-engine-loop

ed92879

vyzaldysanchez requested review from cedric-cordenier and bolekk September 10, 2024 15:33

vyzaldysanchez added 4 commits September 10, 2024 11:47

Merge branch 'develop' into task/KS-387/workflows-engine-loop

67c9e4f

Fixes tests - WIP

84650e6

Merge branch 'develop' into task/KS-387/workflows-engine-loop

eed0679

Merge branch 'develop' into task/KS-387/workflows-engine-loop

5a20db5

cedric-cordenier reviewed Sep 11, 2024

View reviewed changes

core/services/workflows/engine.go Outdated Show resolved Hide resolved

cedric-cordenier reviewed Sep 11, 2024

View reviewed changes

core/services/workflows/engine.go Outdated Show resolved Hide resolved

cedric-cordenier reviewed Sep 11, 2024

View reviewed changes

core/services/workflows/engine.go Outdated Show resolved Hide resolved

cedric-cordenier reviewed Sep 11, 2024

View reviewed changes

core/services/workflows/engine.go Outdated Show resolved Hide resolved

vyzaldysanchez added 3 commits September 11, 2024 09:33

Improves locking handling for goroutines

0499915

Merge branch 'develop' into task/KS-387/workflows-engine-loop

a117f5f

Fixes lint

74d42fb

vyzaldysanchez requested a review from cedric-cordenier September 11, 2024 13:52

vyzaldysanchez added 3 commits September 11, 2024 10:31

Merge branch 'develop' into task/KS-387/workflows-engine-loop

ef161e3

Fixes tests

3820cc2

Merge branch 'develop' into task/KS-387/workflows-engine-loop

81c8580

vyzaldysanchez marked this pull request as ready for review September 11, 2024 19:22

vyzaldysanchez requested a review from a team as a code owner September 11, 2024 19:22

jmank88 previously approved these changes Sep 11, 2024

View reviewed changes

vyzaldysanchez added 6 commits September 12, 2024 10:56

Merge branch 'develop' into task/KS-387/workflows-engine-loop

d48109a

Merge branch 'develop' into task/KS-387/workflows-engine-loop

28c4337

Merge branch 'develop' into task/KS-387/workflows-engine-loop

14fc5f1

Merge branch 'develop' into task/KS-387/workflows-engine-loop

3df251c

Merge branch 'develop' into task/KS-387/workflows-engine-loop

351798a

Merge branch 'develop' into task/KS-387/workflows-engine-loop

5fa4a68

vyzaldysanchez dismissed stale reviews from cedric-cordenier, krehermann, and jmank88 via 0e116fc October 7, 2024 15:29

vyzaldysanchez added 2 commits October 7, 2024 12:31

Fixes status propagation to step dependents

a5067d6

Merge branch 'develop' into task/KS-387/workflows-engine-loop

0a22bd4

vyzaldysanchez requested review from jmank88, cedric-cordenier and krehermann October 7, 2024 16:32

jmank88 reviewed Oct 7, 2024

View reviewed changes

core/services/workflows/engine.go Outdated Show resolved Hide resolved

Adds defensive lock-check

92bf83d

vyzaldysanchez requested a review from jmank88 October 7, 2024 16:38

jmank88 approved these changes Oct 7, 2024

View reviewed changes

patrickhuie19 reviewed Oct 7, 2024

View reviewed changes

Merge branch 'develop' into task/KS-387/workflows-engine-loop

cfbc6f5

krehermann approved these changes Oct 7, 2024

View reviewed changes

vyzaldysanchez added this pull request to the merge queue Oct 7, 2024

Merged via the queue into develop with commit 816b25c Oct 7, 2024
130 checks passed

vyzaldysanchez deleted the task/KS-387/workflows-engine-loop branch October 7, 2024 19:35

This was referenced Oct 7, 2024

[DO NOT MERGE] Changeset Release Preview - v2.19.0 #13148

Draft

[DO NOT MERGE] Changeset Release Preview - v2.19.0 trading2024/chainlink#1

Draft

[DO NOT MERGE] Changeset Release Preview - v2.18.0 luisriverag/chainlink#572

Draft

This was referenced Oct 21, 2024

[DO NOT MERGE] Changeset Release Preview - v2.19.0 philipjonsen/chainlink#2

Draft

[DO NOT MERGE] Changeset Release Preview - v2.18.0 picoinnetwork/chainlink#1

Draft

[DO NOT MERGE] Changeset Release Preview - v2.18.0 fanligroup/chainlink#1

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactors workflows engine loop #14375

Refactors workflows engine loop #14375

vyzaldysanchez commented Sep 9, 2024

patrickhuie19 Oct 7, 2024

patrickhuie19 Oct 7, 2024

patrickhuie19 Oct 7, 2024

vyzaldysanchez Oct 7, 2024

patrickhuie19 Oct 7, 2024

patrickhuie19 Oct 7, 2024 •

edited

Loading

vyzaldysanchez Oct 7, 2024

patrickhuie19 Oct 7, 2024

vyzaldysanchez Oct 7, 2024

patrickhuie19 Oct 7, 2024

cl-sonarqube-production bot commented Oct 7, 2024

Refactors workflows engine loop #14375

Refactors workflows engine loop #14375

Conversation

vyzaldysanchez commented Sep 9, 2024

Requires Dependencies

Resolves Dependencies

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

patrickhuie19 Oct 7, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cl-sonarqube-production bot commented Oct 7, 2024

Quality Gate passed

patrickhuie19 Oct 7, 2024 •

edited

Loading