Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Readiness Probe for Kubernetes > 1.20.0 #1742

Closed
critchtionary opened this issue Mar 23, 2023 · 3 comments
Closed

Readiness Probe for Kubernetes > 1.20.0 #1742

critchtionary opened this issue Mar 23, 2023 · 3 comments
Labels
enhancement New feature or request

Comments

@critchtionary
Copy link

Is this a request for help?: No


Is this a BUG REPORT or FEATURE REQUEST? (choose one): Feature Request

Version of Helm and Kubernetes: 3.10.1/1.24

Which chart: artifactory

Which product license (Enterprise/Pro/oss): Enterprise

I was wondering why readiness probes are only added to the Artifactory stateful set when the Kubernetes version is <1.20.0? Based on the changelog this was an intentional change.

I'm deploying onto K8s 1.24 and have had to patch in a readiness probe so that I can use minReadySeconds to prevent the pod being sent requests a couple of seconds before it's actually ready to handle them. This is not ideal as I'm having to duplicate the config for the check, which could break in a future version.

If there is a valid reason to have this disabled for >1.20.0, could it be implemented instead by setting the default value of enabled for the readiness probe in values.yaml based on the K8s version, so that people could more easily override it?

@chukka chukka added the enhancement New feature or request label Mar 24, 2023
@chukka
Copy link
Collaborator

chukka commented Mar 24, 2023

@critchtionary Thanks for reporting ! startupProbe is applicable for k8s versions >= 1.20.0 unless you enable featureflags in K8s for lower versions , Hence, readinessProbe.initialDelaySeconds is set to zero for >=1.20.0.

we will get back to you on your use case

@bramaq
Copy link
Contributor

bramaq commented Apr 24, 2023

This should be considered a bug.

We had indeed an outage as follows:

  • the nginx deployment uses the artifactory svc as upstream
  • the nginx pods use http://localhost:8082/router/api/v1/system/health as readiness check, ie the nginx readiness resolves to the upstream artifactory.svc router health ...
  • one of the artifactory replicas' router health checks started to fail due to an issue with its jfconnect service
  • but since the readiness check this artifactory pod was missing as described in this ticket, the unhealthy pod was not taken out of the svc endpoints

This made the HA setup fail with 503s despite the remaining pod being in a perfectly healthy state.

@nbgbankdata
Copy link

We have had big issues with this.
Readiness probes are basically not present and since artifactory is sometimes stalling and not answering on some pods the service keep sending requests to stalling pods.
The only solution is to remove the version requirement.
Same experience as @bramaq
Would also recommend going for an official fix.

@gitta-jfrog gitta-jfrog closed this as not planned Won't fix, can't repro, duplicate, stale Aug 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

5 participants