-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Chore: Figure out appropriate requests and limits for Claudie services #935
Comments
Have these recommendations taken spikes into account ? We've had several rounds of requests/limit tuning already, and there's a reason why they are roughly as you've found them |
@MarioUhrik I don't trust GKE observability tooling to provide the accurate stats including spikes, I think they average them out quite aggressively. At the bottom I mentioned what should be the correct steps. The tables show what I would consider with GKE tooling, but without further investigation using kube metrics and prometheus we should not proceed. |
Sounds good, thanks |
|
There are some gathered data from monitoring stack, when the pipeline on e2e cluster ran. Memory biggest spikes for last 24h
CPU biggest spikes for last 24h
Based on the spikes I have proposed some requests and limits changes, but I am not sure, whether they are relevant. CPU
Memory
see tables in excel |
We have discussed new requests and limits with @katapultcloud and @cloudziu on a call. You can see them in table below. BTW we have decided to remove limits on CPU and keep only requests. Based on the spikes I have proposed some requests and limits changes, but I am not sure, whether they are relevant. CPU
Memory
|
Description
Requests and limits should be adjusted as they seems to take way more than they actually need causing overprovisioning of hardware. I've viewed consumption of requests of each service (some not included) in GKE observability console and this is what I came up with.
Memory
CPU
However, the statistics in GKE console are not great and I'd like to monitor services for some time 1-2 weeks before setting these in stone.
Exit criteria
The text was updated successfully, but these errors were encountered: