Remove Prometheus Metrics When Kubernetes Pods Are Deleted #1032

borg-z · 2024-10-24T08:54:43Z

Motivation

Currently, Falcosidekick integrates with Prometheus to expose metrics related to Falco events. However, there is an issue where metrics associated with Kubernetes pods are not being cleaned up when those pods are deleted from the cluster. For example, when a pod named dns-cz89z is removed, the corresponding metric:

falco_events{k8s_ns_name="default", k8s_pod_name="dns-cz89z", rule="Exec from shell in container", tier="cluster"}

remains in Prometheus. Over time, this leads to an accumulation of metrics for pods that no longer exist, resulting in a cluttered metrics database and potential confusion when monitoring the cluster's state. This issue affects the reliability and accuracy of monitoring, making it difficult to distinguish between active and inactive resources.

Feature

Implement one of the following solutions:

Cleanup Mechanism:
Automatically remove Prometheus metrics associated with Kubernetes pods when those pods are deleted from the cluster.
Change Metric Type:
Modify the falco_events metric from a CounterVec to a GaugeVec

Alternatives

Additional context

Example of the issue in action:

After deleting the pod dns-cz89z, the following metric remains:

falco_events{k8s_ns_name="default", k8s_pod_name="dns-cz89z", rule="Exec from shell in container", tier="cluster"}

Over time, these stale metrics accumulate, leading to performance degradation and cluttered monitoring dashboards.

The text was updated successfully, but these errors were encountered:

Issif · 2024-10-24T10:42:47Z

Hi,

I understand your point, but I don't see any way to tell falcosidekick that the source pod doesn't exist anymore. And I'll not change the metric type, to avoid any breaking change for the users. I don't know how other applications handle this situation.

borg-z · 2024-10-24T11:58:28Z

Thanks for the quick reply! I will think of ways to solve the problem

Issif · 2024-10-24T12:55:03Z

Maybe a periodic restart might help.

borg-z added the kind/feature New feature or request label Oct 24, 2024

Issif self-assigned this Oct 24, 2024

Issif added this to Falcosidekick 2.x Oct 24, 2024

github-project-automation bot moved this to To do in Falcosidekick 2.x Oct 24, 2024

Issif added this to the 2.30 milestone Oct 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove Prometheus Metrics When Kubernetes Pods Are Deleted #1032

Remove Prometheus Metrics When Kubernetes Pods Are Deleted #1032

borg-z commented Oct 24, 2024

Issif commented Oct 24, 2024

borg-z commented Oct 24, 2024

Issif commented Oct 24, 2024

Remove Prometheus Metrics When Kubernetes Pods Are Deleted #1032

Remove Prometheus Metrics When Kubernetes Pods Are Deleted #1032

Comments

borg-z commented Oct 24, 2024

Issif commented Oct 24, 2024

borg-z commented Oct 24, 2024

Issif commented Oct 24, 2024