Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] 1.11.2 workflow histories failing to replay on 1.11.5 #1582

Open
selbyk opened this issue Dec 10, 2024 · 0 comments
Open

[Bug] 1.11.2 workflow histories failing to replay on 1.11.5 #1582

selbyk opened this issue Dec 10, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@selbyk
Copy link

selbyk commented Dec 10, 2024

What are you really trying to do?

We want to update to the latest and have been running @temporalio/*@1.11.2 since 9/25/24. I tried to update to @temporalio/*@1.11.5 a few days ago and was faced with:

2024-12-10T21:41:34.484512Z  WARN temporal_sdk_core::worker::workflow: Failing workflow task run_id={{runId}} failure=Failure { failure: Some(Failure { message: "[TMPRL1100] Nondeterminism error: Child workflow id of scheduled event 'task/{{uuid}}' does not match child workflow id of command 'task/{{different_uuid}}'", source: "", stack_trace: "", encoded_attributes: None, cause: None, failure_info: Some(ApplicationFailureInfo(ApplicationFailureInfo { r#type: "", non_retryable: false, details: None, next_retry_delay: None })) }), force_cause: NonDeterministicError }
Replay failed https://cloud.temporal.io/namespaces/{{namespace}}/workflows/{{wfid}}/{{runid}}
DeterminismViolationError: Replay failed with a nondeterminism error. This means that the workflow code as written is not compatible with the history that was fed in. Details: Workflow activation completion failed: Failure { failure: Some(Failure { message: "[TMPRL1100] Nondeterminism error: Child workflow id of scheduled event 'task/{{uuid}}' does not match child workflow id of command 'task/{{different_uuid}}'", source: "", stack_trace: "", encoded_attributes: None, cause: None, failure_info: Some(ApplicationFailureInfo(ApplicationFailureInfo { r#type: "", non_retryable: false, details: None, next_retry_delay: None })) }), force_cause: NonDeterministicError }
    at evictionReasonToReplayError (/Users/selby/projects/monorepo/node_modules/.pnpm/@[email protected]_@[email protected][email protected]/node_modules/@temporalio/worker/lib/replay.js:34:20)
    at Worker.runReplayHistories (/Users/selby/projects/monorepo/node_modules/.pnpm/@[email protected]_@[email protected][email protected]/node_modules/@temporalio/worker/lib/worker.js:228:76)
    at async replayWorkflows (/Users/selby/projects/monorepo/apps/temporal-workers/workflow-tests/replay.ts:2:3826)
    at async replayWorkflowsInEnv (/Users/selby/projects/monorepo/apps/temporal-workers/workflow-tests/replay.ts:2:4907)
    at async replay (/Users/selby/projects/monorepo/apps/temporal-workers/workflow-tests/replay.ts:2:6297) 

on almost all of some of our workflow history types, including our latest histories. They seem to replay fine on @temporalio/*@1.11.3.

Describe the bug

We use uuid4() (from @temporalio/workflow) to generate a workflowId => task/uuid4(), and we use that workflowId in startChild({ workflowId }). We replay all running and up to the last 500 completed histories on each of our workflow types on every PR/deploy, and this part of our code and the replays have been stable for months.

Minimal Reproduction

Maybe we can find a minimal reproduction together. I just want to get this on your radar.

Environment/Versions

  • OS and processor: M2 Mac and Linux (GH Actions)
  • Temporal Version: 1.11.5

Additional context

@selbyk selbyk added the bug Something isn't working label Dec 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant