KEP for flagz page for Kubernetes Components #4831

richabanker · 2024-09-06T23:49:33Z

One-line PR description: Add a KEP for flagz page in Kubernetes components.

Issue link: Flagz for Kubernetes Components #4828

dgrisonnet · 2024-09-19T16:40:51Z

/assign

dgrisonnet · 2024-09-25T19:32:12Z

keps/sig-instrumentation/4828-component-flagz/README.md

+
+### Data Format and versioning
+
+Initially, the flagz page will exclusively support a plain text format for responses. We will implement these endpoints using versioned URLs, like /v1/flagz, to ensure future compatibility. This versioning strategy allows us to seamlessly introduce structured data formats (e.g., JSON) in the future, through a distinct endpoint such as /v2/flagz, without disrupting existing implementations.


Wouldn't it be better to have the endpoints in the form of /flagz/v1? Flagz is the subdomain that we want to version, if we were to put the version first, that would be confusing with the component version as it would then be at the top-level.

Yes you're right, I got that the other way round. Fixed it now, thanks!

One thing I was wondering since then is whether going with a query parameter might not be better. Something like:

/flagz?version=1 /flags?version=2

wdyt?

This was also raised in the statusz KEP PR. I am ok with either, whichever seems the best to indicate the version for the endpoint.

Updated the KEP to use query params for specifying the version.

dgrisonnet · 2024-09-25T19:37:23Z

keps/sig-instrumentation/4828-component-flagz/README.md

+that might indicate a serious problem?
+-->
+
+- apiserver_request_duration_seconds metric that will tell us if there's a spike in request latency of kube-apiserver that might indicate that the flagz endpoint is interfering with the component's core functionality or causing delays in processing other requests


IIRC you won't get this metric for free, you'll have to make sure that the handler for the flagz endpoint is instrumented.

Umm unsure about how flagz handler relates with this metric which I thought is enabled by default? The idea was to use this metric to monitor general load of requests that apiserver handles and use that as a signal to check if flagz is consuming all system resources causing delays in apiserver's capability to serve other resource requests..

though yeah I guess that wouldnt be a clean signal to detect issues with the flagz endpoint in a deterministic way.. Do you suggest introducing a new metric that keeps track of request latency for just /flagz requests ?

I forgot to submit my comment on your PR, but you already dealt with that concern of mine in your POC: kubernetes/kubernetes#127581 (comment)

richabanker · 2024-09-26T20:48:43Z

cc @johnbelamaric for PRR. Hi John, could you please review this KEP for prod-readiness whenever you get a chance? Thanks a lot!

richabanker · 2024-09-26T20:52:07Z

/assign @johnbelamaric

whoops meant to add as assignee

johnbelamaric · 2024-09-26T21:00:38Z

/assign

thockin · 2024-10-07T19:41:30Z

keps/sig-instrumentation/4828-component-flagz/README.md

+
+1. **No sensitive data exposed**
+
+    We will ensure that no sensitive data is exposed through flagz and that access to this endpoint is gated by using system-monitoring group.


how? Do flag definitions allow us to express the idea that the flag is "sensitive" ?

I dont think there currently is support for marking flags like that. So we would need to introduce a way to express this.
cc @cjcullen who I just reached out to for his thoughts on the same. Also @liggitt

Do we need to push that up to pflag? One could argue that falgs should NEVER have sensitive information, since they are visible via ps.

Do we need to push that up to pflag?

Or.. maybe we could create a separate set of "sensitive flags" and while printing out the values in the flagz handler, redact the values for those flags?

One could argue that falgs should NEVER have sensitive information, since they are visible via ps

That's indeed true. But if we still feel that exposing this info in an endpoint will be easier for attackers to get hold of, we can impose some restrictions on what data to expose in the endpoint.

Do we have any examples of these sorts of flags?

Perhaps the TLS related flags.. (also waiting for someone from sig-auth to help answer that..)

I'm not aware any flags in long-lived components (kube-apiserver, kubelet, scheduler, controller-manager, etc) that contain sensitive information directly. For things like TLS or credentials, they always point at files which contain the data.

There are shorter-lived invocations that take flags containing sensitive information (like kubeadm init with --certificate-key or --token), and components built that bind client-go user flags (like kubectl) can specify a token credential on the command-line using --token. Both of those have the option to consume those values from a file as well.

pflag does have the ability to annotate flags, which we apparently use like this to mark some flags as "classified":

// AddSecretAnnotation add secret flag to Annotation. func (f FlagInfo) AddSecretAnnotation(flags *pflag.FlagSet) FlagInfo { flags.SetAnnotation(f.LongName, "classified", []string{"true"}) return f }

and then filter out here when building an annotation to include in API objects when the client chooses to record the change cause in an annotation (--record)

parseFunc := func(flag *pflag.Flag, value string) error { flags = flags + " --" + flag.Name if set, ok := flag.Annotations["classified"]; !ok || len(set) == 0 { flags = flags + "=" + value } else { flags = flags + "=CLASSIFIED" } return nil }

Great! Thanks for pointing out the pflag annotation feature. That's perfect for displaying the non-CLASSIFIED flags.

dgrisonnet · 2024-10-07T20:58:04Z

keps/sig-instrumentation/4828-component-flagz/README.md

+
+#### Request
+* Method: **GET** 
+* Endpoint: **v1/flagz**


this should be updated once we've agreed on the versioning format we want to follow

keps/sig-instrumentation/4828-component-flagz/README.md

mattbailey

shadow prod readiness review

Just some nits/clerical (checkboxes), otherwise PRR looks good.

keps/sig-instrumentation/4828-component-flagz/README.md

johnbelamaric

Some minor points on PRR, but looks OK to me.

keps/sig-instrumentation/4828-component-flagz/README.md

johnbelamaric · 2024-10-08T22:19:23Z

PRR looks good, I will wait for sig approval to use the magic word

dgrisonnet · 2024-10-09T16:48:19Z

/lgtm
/approve

johnbelamaric · 2024-10-09T17:30:48Z

/approve

k8s-ci-robot · 2024-10-09T17:30:57Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dgrisonnet, johnbelamaric, richabanker

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~keps/prod-readiness/OWNERS~~ [johnbelamaric]
~~keps/sig-instrumentation/OWNERS~~ [dgrisonnet,johnbelamaric]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Sep 6, 2024

k8s-ci-robot requested review from logicalhan and mrbobbytables September 6, 2024 23:49

k8s-ci-robot added kind/kep Categorizes KEP tracking issues and PRs modifying the KEP directory sig/instrumentation Categorizes an issue or PR as relevant to SIG Instrumentation. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Sep 6, 2024

richabanker force-pushed the component-flagz branch from 87ddedc to 6b51cc5 Compare September 6, 2024 23:51

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Sep 6, 2024

richabanker mentioned this pull request Sep 6, 2024

Flagz for Kubernetes Components #4828

Open

6 tasks

richabanker force-pushed the component-flagz branch 3 times, most recently from 75f707c to 6a00d59 Compare September 13, 2024 21:28

k8s-ci-robot assigned dgrisonnet Sep 19, 2024

richabanker mentioned this pull request Sep 24, 2024

Add flagz endpoint for apiserver kubernetes/kubernetes#127581

Merged

dgrisonnet reviewed Sep 25, 2024

View reviewed changes

mrbobbytables removed their request for review September 25, 2024 19:40

richabanker force-pushed the component-flagz branch from 6a00d59 to 391583b Compare September 26, 2024 20:30

k8s-ci-robot assigned johnbelamaric Sep 26, 2024

thockin reviewed Oct 7, 2024

View reviewed changes

dgrisonnet reviewed Oct 7, 2024

View reviewed changes

keps/sig-instrumentation/4828-component-flagz/README.md Outdated Show resolved Hide resolved

mattbailey reviewed Oct 8, 2024

View reviewed changes

keps/sig-instrumentation/4828-component-flagz/README.md Outdated Show resolved Hide resolved

keps/sig-instrumentation/4828-component-flagz/README.md Show resolved Hide resolved

keps/sig-instrumentation/4828-component-flagz/README.md Show resolved Hide resolved

richabanker force-pushed the component-flagz branch from c382842 to 946dfcb Compare October 8, 2024 17:57

johnbelamaric reviewed Oct 8, 2024

View reviewed changes

richabanker force-pushed the component-flagz branch 5 times, most recently from 98f4f05 to c6c6e53 Compare October 9, 2024 15:38

KEP for flagz page for Kubernetes Components

47dbde4

richabanker force-pushed the component-flagz branch from c6c6e53 to 47dbde4 Compare October 9, 2024 15:46

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 9, 2024

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 9, 2024

k8s-ci-robot merged commit 6bdbc9a into kubernetes:master Oct 9, 2024
4 checks passed

k8s-ci-robot added this to the v1.32 milestone Oct 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KEP for flagz page for Kubernetes Components #4831

KEP for flagz page for Kubernetes Components #4831

richabanker commented Sep 6, 2024 •

edited

Loading

dgrisonnet commented Sep 19, 2024

dgrisonnet Sep 25, 2024

richabanker Sep 26, 2024

dgrisonnet Oct 7, 2024

richabanker Oct 8, 2024

richabanker Oct 8, 2024

dgrisonnet Sep 25, 2024

richabanker Sep 26, 2024

dgrisonnet Oct 7, 2024

richabanker commented Sep 26, 2024

richabanker commented Sep 26, 2024

johnbelamaric commented Sep 26, 2024

thockin Oct 7, 2024

richabanker Oct 7, 2024

thockin Oct 7, 2024

richabanker Oct 8, 2024

thockin Oct 8, 2024

richabanker Oct 8, 2024 •

edited

Loading

liggitt Oct 8, 2024 •

edited

Loading

richabanker Oct 8, 2024

dgrisonnet Oct 7, 2024

mattbailey left a comment

johnbelamaric left a comment

johnbelamaric commented Oct 8, 2024

dgrisonnet commented Oct 9, 2024

johnbelamaric commented Oct 9, 2024

k8s-ci-robot commented Oct 9, 2024


		### Data Format and versioning

		Initially, the flagz page will exclusively support a plain text format for responses. We will implement these endpoints using versioned URLs, like /v1/flagz, to ensure future compatibility. This versioning strategy allows us to seamlessly introduce structured data formats (e.g., JSON) in the future, through a distinct endpoint such as /v2/flagz, without disrupting existing implementations.


		1. No sensitive data exposed

		We will ensure that no sensitive data is exposed through flagz and that access to this endpoint is gated by using system-monitoring group.

KEP for flagz page for Kubernetes Components #4831

KEP for flagz page for Kubernetes Components #4831

Conversation

richabanker commented Sep 6, 2024 • edited Loading

dgrisonnet commented Sep 19, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

richabanker commented Sep 26, 2024

richabanker commented Sep 26, 2024

johnbelamaric commented Sep 26, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

richabanker Oct 8, 2024 • edited Loading

Choose a reason for hiding this comment

liggitt Oct 8, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattbailey left a comment

Choose a reason for hiding this comment

johnbelamaric left a comment

Choose a reason for hiding this comment

johnbelamaric commented Oct 8, 2024

dgrisonnet commented Oct 9, 2024

johnbelamaric commented Oct 9, 2024

k8s-ci-robot commented Oct 9, 2024

richabanker commented Sep 6, 2024 •

edited

Loading

richabanker Oct 8, 2024 •

edited

Loading

liggitt Oct 8, 2024 •

edited

Loading