Include default requests/limits in all loki + promtail + grafana-agent deployments #358

ubergesundheit · 2021-06-22T08:06:43Z

The loki-app + promtail-app should include reasonable default limits/requests

hervenicol · 2022-09-29T09:48:19Z

When we review requests/limits on Loki, could be good to have this issue in mind also: https://github.com/giantswarm/giantswarm/issues/21562

TheoBrigitte · 2023-11-27T17:45:03Z

As we recently did a lot of tuning in Loki, do we still need this @hervenicol ?

hervenicol · 2023-11-27T17:51:10Z

For Loki we should be good, buts I didn't do anything on Promtail.

Rotfuks · 2024-07-09T12:12:44Z

Let's check if this is done yet, or if we still have some todos here to set request/limits

QuentinBisson · 2024-07-16T09:23:03Z

Loki is fine:

k get sts -n loki loki-backend -oyaml | yq '.spec.template.spec.containers.[].resources'
limits:
  cpu: 100m
  memory: 100Mi
requests:
  cpu: 50m
  memory: 50Mi
limits:
  memory: 3Gi
requests:
  cpu: 200m
  memory: 1Gi
> k get sts -n loki loki-write -oyaml | yq '.spec.template.spec.containers.[].resources'
limits:
  memory: 8Gi
requests:
  cpu: "1"
  memory: 4Gi
> k get deploy -n loki loki-read -oyaml | yq '.spec.template.spec.containers.[].resources'
limits:
  memory: 3Gi
requests:
  cpu: 200m
  memory: 1Gi

Promtail is okay:

k get ds -n kube-system promtail -oyaml | yq '.spec.template.spec.containers.[].resources'
limits:
  cpu: "1"
  memory: 256Mi
requests:
  cpu: 25m
  memory: 128Mi

Grafana-agent/Alloy are not:

k get deploy -n kube-system grafana-agent -oyaml | yq '.spec.template.spec.containers.[].resources'
{}
requests:
  cpu: 1m
  memory: 5Mi
> 
> k get deploy -n monitoring alloy-rules -oyaml | yq '.spec.template.spec.containers.[].resources'
{}
requests:
  cpu: 1m
  memory: 5Mi

QuentinBisson · 2024-07-16T11:42:40Z

@giantswarm/team-atlas I would really like some thoughts on how to proceed here. Should we set some random resource usage, use vpa, use hpa?

The issue is when we want to play with clustering (which we don't know) but it supports hpas

hervenicol · 2024-07-16T12:09:24Z

Why is it important to take the right decision regarding VPA or HPA right now?

I guess it's because it requires a new olly-bundle release, whereas we can change the deployment type for grafana-agent (or alloy-logs or logging agent) directly from additional values on the MC via logging-operator.
Right?

Otherwise, if both configs (xPA and deployment type) can be setup from the same place, we should start with VPA and we will move to HPA later when we have the need.

QuentinBisson · 2024-07-16T12:21:18Z

I'm not trying to take the future right decisions but knowing where to go changes what I have to do (adding vpa support upstream is different than setting resources :) )

hervenicol · 2024-07-16T12:28:15Z

oh, there's no VPA upstream!
Well, we could only add VPA to our own chart.
Contributing VPA in upstream chart could be nice as well, but given the delay for PRs I think we should not wait for this before we do something on our side.

QuentinBisson · 2024-07-16T12:57:31Z

Upstream PR grafana/alloy#1305.
I'll let this wait a bit and see how it goes by the end of the week :)

QuentinBisson · 2024-09-02T19:08:05Z

Initial VPA PR: giantswarm/alloy-app#44
Followed with a Fix: giantswarm/alloy-app#46
Configured in prometheus-rules for now: giantswarm/prometheus-rules#1339

QuentinBisson · 2024-09-03T07:38:21Z

Alloy now has proper limits and we can enable VPA on memory if needed

ubergesundheit added team/halo feature-request labels Jun 22, 2021

JosephSalisbury added team/atlas Team Atlas and removed team/halo labels Oct 7, 2021

TheoBrigitte mentioned this issue Aug 22, 2023

Implement logging infrastructure #311

Closed

37 tasks

QuentinBisson changed the title ~~Include default requests/limits in all loki + promtail deployments~~ Include default requests/limits in all loki + promtail + grafana-agent deployments Mar 5, 2024

github-project-automation bot added this to Roadmap Mar 5, 2024

github-project-automation bot moved this to Inbox 📥 in Roadmap Mar 5, 2024

QuentinBisson self-assigned this Jul 16, 2024

QuentinBisson added the blocked-upstream label Jul 17, 2024

This was referenced Sep 2, 2024

add vertical pod autoscaler giantswarm/alloy-app#44

Merged

fix-alloy-vpa-value-usage giantswarm/alloy-app#46

Merged

QuentinBisson closed this as completed Sep 3, 2024

github-project-automation bot moved this from Inbox 📥 to Done ✅ in Roadmap Sep 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Include default requests/limits in all loki + promtail + grafana-agent deployments #358

Include default requests/limits in all loki + promtail + grafana-agent deployments #358

ubergesundheit commented Jun 22, 2021

hervenicol commented Sep 29, 2022 •

edited

Loading

TheoBrigitte commented Nov 27, 2023

hervenicol commented Nov 27, 2023

Rotfuks commented Jul 9, 2024

QuentinBisson commented Jul 16, 2024

QuentinBisson commented Jul 16, 2024

hervenicol commented Jul 16, 2024

QuentinBisson commented Jul 16, 2024

hervenicol commented Jul 16, 2024

QuentinBisson commented Jul 16, 2024

QuentinBisson commented Sep 2, 2024

QuentinBisson commented Sep 3, 2024

Include default requests/limits in all loki + promtail + grafana-agent deployments #358

Include default requests/limits in all loki + promtail + grafana-agent deployments #358

Comments

ubergesundheit commented Jun 22, 2021

hervenicol commented Sep 29, 2022 • edited Loading

TheoBrigitte commented Nov 27, 2023

hervenicol commented Nov 27, 2023

Rotfuks commented Jul 9, 2024

QuentinBisson commented Jul 16, 2024

QuentinBisson commented Jul 16, 2024

hervenicol commented Jul 16, 2024

QuentinBisson commented Jul 16, 2024

hervenicol commented Jul 16, 2024

QuentinBisson commented Jul 16, 2024

QuentinBisson commented Sep 2, 2024

QuentinBisson commented Sep 3, 2024

hervenicol commented Sep 29, 2022 •

edited

Loading