Configurable timeout for argocd-update step #3515

razvan-agape · 2025-02-18T07:29:14Z

Checklist

I've searched the issue queue to verify this is not a duplicate feature request.
I've pasted the output of kargo version, if applicable.
I've pasted logs, if applicable.

Proposed Feature

The default timeout for the argocd-update operation is 5 minutes.
Kargo version: 1.2.0.

Motivation

In some scenarios, an application sync may take longer than that, which marks the step as failed even though the sync might eventually succeed.

Suggested Implementation

It would be useful to be able to configure the timeout, or have a configurable number of retries.


krancour · 2025-02-18T10:59:34Z

https://docs.kargo.io/user-guide/reference-docs/promotion-templates#step-retries

lknite · 2025-03-15T21:45:24Z

@razvan-agape is this working for you? I tried the following and it doesn't seem to have any effect:

      - uses: argocd-update
        retry:
          errorThreshold: 1
          timeout: 2m0s

kargo v1.3.1

krancour · 2025-03-17T21:20:35Z

@lknite, the timeouts are not exact because steps do not continuously retry internally. If the timeout hasn't elapsed, a step that's still running (waiting on something external) is retried on the next reconciliation attempt.

In general, those attempts are every five minutes, but the next attempt can be sooner if a related resource has a state change that forces the Promotion back onto the queue. It can also be later depending on the depth of the queue.

In your case, setting the timeout to 2m may have no practical effect in the average case, where the next reconciliation attempt is made (roughly) five minutes later. It worked well for @razvan-agape because he was raising the limit.
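
To illustrate the arithmetic, here is a rough sketch, not Kargo's actual code. It assumes reconciliation attempts land at exact multiples of a fixed five-minute interval and that a step counts as timed out once the elapsed time is at least the configured timeout. The function name and constant are hypothetical.

```python
import math

# Assumption: attempts happen at a fixed five-minute interval. In practice
# they can come sooner (related resource changes) or later (queue depth).
RECONCILE_INTERVAL_S = 5 * 60


def observed_timeout_s(timeout_s: int, interval_s: int = RECONCILE_INTERVAL_S) -> int:
    """Return the elapsed time at which a timeout would first be noticed.

    The timeout is only checked on a reconciliation attempt, so the
    configured value is effectively rounded up to the next attempt.
    """
    attempts = max(1, math.ceil(timeout_s / interval_s))
    return attempts * interval_s


print(observed_timeout_s(2 * 60))   # 2m timeout, noticed at the 5m attempt
print(observed_timeout_s(10 * 60))  # 10m timeout, noticed at the 10m attempt
```

Under these assumptions, a timeout below the interval behaves like the interval itself, while raising the timeout to a larger value does take effect, only rounded up to the next attempt.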

We're a little bit at the mercy of the controller runtime here since we don't have precise control over the interval before the next reconciliation...

That said, we can probably get closer to the specified timeout by shortening the requeue interval when the timeout would elapse sooner than the next reconciliation attempt would typically occur.
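
A minimal sketch of that idea, assuming a default requeue interval of five minutes and a known step start time. This is illustrative only: the function and parameter names are hypothetical and do not come from Kargo or controller-runtime.

```python
def next_requeue_delay_s(elapsed_s: int, timeout_s: int, default_interval_s: int = 300) -> int:
    """Pick the delay before the next reconciliation attempt.

    Use the default interval unless the step's timeout would elapse
    sooner, in which case requeue right when the timeout is due.
    """
    remaining_s = max(0, timeout_s - elapsed_s)
    return min(default_interval_s, remaining_s)


print(next_requeue_delay_s(elapsed_s=0, timeout_s=120))    # requeue at the 2m mark
print(next_requeue_delay_s(elapsed_s=400, timeout_s=600))  # requeue when the 10m timeout is due
```

Even with this adjustment the result would stay approximate, since the actual time of the next attempt still depends on the depth of the queue.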

I will write up a separate issue for this as soon as I've got a chance.

krancour · 2025-03-17T22:48:27Z

@lknite, I opened #3663

razvan-agape added kind/enhancement kind/proposal labels Feb 18, 2025

github-actions bot added needs/priority needs/area labels Feb 18, 2025

razvan-agape closed this as completed Feb 19, 2025

krancour mentioned this issue Mar 17, 2025

controller: try to make step timeout somewhat more accurate #3663

Open
