-
Notifications
You must be signed in to change notification settings - Fork 84
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Increase default QPS and Burst value to improve slow startup duration. #1066
Conversation
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: nak3 The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Codecov Report
@@ Coverage Diff @@
## main #1066 +/- ##
=======================================
Coverage 80.95% 80.95%
=======================================
Files 18 18
Lines 1355 1355
=======================================
Hits 1097 1097
Misses 205 205
Partials 53 53 |
/assign @skonto |
Do you have a linked issue describing the slow startup? |
There was originally #879 , which was eventually closed as stale. |
Out of curiosity which outbound calls to the K8s API server are being throttled? |
I believe it was services and endpoints gets, done by the IngressTranslator |
Is there a reason those calls aren't using informers? |
I was not involved the discussion in #276 but the client correctly list all resources at startup. I researched the context before as #968 (comment) but it was a little bit complicated and still open #968 (it didn't intend to solve this informer vs client but I think related) for that. |
Finding good default values is an open question to me (depends on the scale we support). If we can't avoid the direct K8s api server calls then I don't see many options here. In general this is a problem faced elsewhere too see for example kedacore/keda#3730 & the fix kedacore/keda#3731. |
We will actually turn off them in the future knative/pkg#2756 so using bigger value like |
I guess to confirm my understanding - the listing to fill the informer cache is the slow piece right now? And the increasing the QPS/Burst settings will help? |
Yes, that's correct. The one "the listing to fill the informer cache is the slow piece right now", I need to test and confirm it by myself but I believe there is no change since #276. |
@dprotaso Afaik @nak3 could you confirm the last bits so we can merge. |
Please see the log in #840 or (SRVKS-1078 our internal issue). It prints |
Yeah correct now I remember ;) Ok so should we merge? Are there any objections? |
I'm confused why doesn't |
Hmm... I tested #1075. It works and the start speed is fast on 200 ksvcs. I'm not sure what issue observed with the informer when #276 was created (the comment line clearly mentions Should we merge #1075 instead of here? |
Yeah - if it's fast then overall it's fetching less data from the API server |
🐛 This patch increases default QPS and Burst value to improve slow startup duration.
Release Note
/cc @skonto @maschmid