-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
False success after running the alba proxy-test a second time #351
Comments
I thought that the idea was that checkMK needed some form of input when calling the commands. As both SKIPPED and INFO are ignored, success was the the most valid one. |
Indeed, Checkmk ignore SKIPPED. Info levels are not shown in json output. Or you can use WARNING if you want to see it in checkmk but not get awake of this. |
@matthiasdeblock |
As long as it isn't INFO... We do not want to receive a SUCCESS on something that is already been tested on another node. This will give us the false positives. |
Please change to [SKIPPED] |
@wimpers SKIPPED is filtered by operations |
It is not feasible to make it a warning as it will have the same result as it has now (going from WARNING -> critical would trigger an alarm) Option 1: Integrate the healthcheck in the framework and work with caching and scheduling of checks
PROS:
CONS:
My estimation for this approach is 1,5 months (with the loss of Kvan, some knowledge will be lost) Option 2: Healthcheck keeping its own results
PROS
Without the own scheduling, I estimate it to be 2 weeks of work Option 3: change it to skipped
PROS
CONS
|
If we pick this up and store results in Arakoon we might want to do something with max_hours_zero_disk_safety which was introduced in 6deb314#diff-a00aa6255fb5cf4f75fc130a527a174a but never used |
False success after running the alba proxy-test a second time. This gave us a Recovery on a broken albaproxy. This needs to be 'INFO' instead of 'SUCCESS' as we do not want to get a RECOVERY on these.
@jeroenmaelbrancke and @jtorreke : Do you guys agree on this?
The text was updated successfully, but these errors were encountered: