fix #3271: refining the wait logic #3274

shawkins · 2021-06-25T13:40:35Z

Description

To address #3271 the only time to retry is on http gone - the watch framework will already handle all of the other retries.
delete does not need special handling in the watcher

Type of change

Bug fix (non-breaking change which fixes an issue)
Feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change
Chore (non-breaking change which doesn't affect codebase;
test, version modification, documentation, etc.)

Checklist

Code contributed by me aligns with current project license: Apache 2.0
I Added CHANGELOG entry regarding this change
I have implemented unit tests to cover my changes
I have added/updated the javadocs and other documentation accordingly
No new bugs, code smells, etc. in SonarCloud report
I tested my code in Kubernetes
I tested my code in OpenShift

centos-ci · 2021-06-25T13:40:36Z

Can one of the admins verify this patch?

shawkins · 2021-06-26T14:22:27Z

Like most things this turned out to be bigger than I thought at first glance... It makes sense to me to switch the implementation to be based upon an informer instead - as that already has the necessary retry logic and it's simple to express.

With this change it is unnecessary to specifically configure an exponential backoff for this operation - it should use the defaults for watches / informers. I did not remove those fields from all of the relevant contexts as that greatly expands the scope of these changes.

This change also reuses the existing KubernetesClientTimeoutException instead of using an IllegalArgumentException for timed out wait calls - I'll note that as a breaking change.

This also simplifies the logic in NamespaceVisitFromServerGetWatchDeleteRecreateWaitApplicableListImpl - it was somewhat idiosyncratic and should be using the fork join pool for the potentially long running waitUntilCondition tasks. At some point the resource handlers and underlying logic should exposes the underlying futures as that would prevent this needless usage of threads.

This addresses having duplicate metadata.name fields in the watch url, but the real problem is #3275

manusa

LGTM, thx!

shawkins · 2021-06-28T23:37:07Z

As mentioned on the issue, the subsequent commit removes the declared interruptedexception to simplify internal code and for consistency with the rest of the api. @manusa if that doesn't seem reasonable I can pull that out.

manusa · 2021-06-29T05:38:45Z

...lient/dsl/internal/NamespaceVisitFromServerGetWatchDeleteRecreateWaitApplicableListImpl.java

-      executor.shutdown();
-    }
+  public List<HasMetadata> waitUntilReady(final long amount, final TimeUnit timeUnit) {
+    return waitUntilCondition(resource -> Objects.nonNull(resource) && getReadiness().isReady(resource), amount, timeUnit);


The statement

getReadiness().isReady(resource)

is not equivalent to

handlerOf(resource).waitUntilReady(...)

However the former doesn't make much sense since the effective call of waitUntilRady should be that of the applicable *OperationsImpl (e.g. ServiceHandler->ServiceOperationsImpl), especially considering the ServiceTest. ~~Are we sure the Endpoints are being queried after these changes?~~

fabric8io/mockwebserver#62

the only time to retry is on http gone delete does not need special handling in the watcher

also fully removing the additional operation/context fields

sonarcloud · 2021-06-29T14:44:26Z

SonarCloud Quality Gate failed.

0 Bugs
0 Vulnerabilities
0 Security Hotspots
9 Code Smells

1.2% Coverage
0.0% Duplication

InterruptedException is no longer thrown on websocket timeout, rather a KubernetesClientException. Unfortunately there is no way to differentiate between a timeout and an error. * fabric8io/kubernetes-client#3274 * fabric8io/kubernetes-client#3197

shawkins requested review from manusa and rohanKanojia June 25, 2021 13:40

oscerd approved these changes Jun 25, 2021

View reviewed changes

shawkins force-pushed the wait branch from c9a0044 to a8271e1 Compare June 26, 2021 14:22

shawkins force-pushed the wait branch 2 times, most recently from 5716562 to 1b1a26f Compare June 26, 2021 16:49

shawkins mentioned this pull request Jun 28, 2021

WatchListDeletable does not properly implement Waitable #3278

Closed

manusa approved these changes Jun 28, 2021

View reviewed changes

manusa added this to the 5.5.0 milestone Jun 28, 2021

shawkins force-pushed the wait branch from 967bf19 to 85382a1 Compare June 29, 2021 02:37

manusa force-pushed the wait branch from 85382a1 to 7c46106 Compare June 29, 2021 05:13

manusa reviewed Jun 29, 2021

View reviewed changes

rohanKanojia approved these changes Jun 29, 2021

View reviewed changes

shawkins added 3 commits June 29, 2021 15:42

fix fabric8io#3271: refining the wait logic

2cebbcb

the only time to retry is on http gone delete does not need special handling in the watcher

switching to an informer based implementation

9a3331e

also fully removing the additional operation/context fields

addressing code smells and adding more changelog

df8a550

manusa force-pushed the wait branch from 7c46106 to daca241 Compare June 29, 2021 13:46

removing declared interruptedexception

5751f31

manusa force-pushed the wait branch from daca241 to 5751f31 Compare June 29, 2021 14:01

manusa merged commit 64dce1f into fabric8io:master Jun 29, 2021

manusa mentioned this pull request Jun 30, 2021

SSLException on waitUntilReady() #2956

Closed

Vlatombe mentioned this pull request Mar 31, 2022

[JENKINS-67664] Adapt to kubernetes-client changes jenkinsci/kubernetes-plugin#1159

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix #3271: refining the wait logic #3274

fix #3271: refining the wait logic #3274

shawkins commented Jun 25, 2021 •

edited

Loading

centos-ci commented Jun 25, 2021

shawkins commented Jun 26, 2021 •

edited

Loading

manusa left a comment

shawkins commented Jun 28, 2021

manusa Jun 29, 2021 •

edited

Loading

manusa Jun 29, 2021

sonarcloud bot commented Jun 29, 2021

fix #3271: refining the wait logic #3274

fix #3271: refining the wait logic #3274

Conversation

shawkins commented Jun 25, 2021 • edited Loading

Description

Type of change

Checklist

centos-ci commented Jun 25, 2021

shawkins commented Jun 26, 2021 • edited Loading

manusa left a comment

Choose a reason for hiding this comment

shawkins commented Jun 28, 2021

manusa Jun 29, 2021 • edited Loading

Choose a reason for hiding this comment

manusa Jun 29, 2021

Choose a reason for hiding this comment

sonarcloud bot commented Jun 29, 2021

shawkins commented Jun 25, 2021 •

edited

Loading

shawkins commented Jun 26, 2021 •

edited

Loading

manusa Jun 29, 2021 •

edited

Loading