Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[k8s] handle network error from pulling docker image #2551
[k8s] handle network error from pulling docker image #2551
Changes from 21 commits
3d5e97b
f6f93be
7e84a1b
2bf6968
88fe005
3f8b923
565765f
5a7e843
980446f
e808b86
30d1220
fc08d50
4039a51
0d7fce5
079caff
a98a6b4
704b7b8
554fd37
324c6e1
7cd299d
67e02eb
5e92f8a
5ec4a64
f7bcb97
a364581
f590ace
87d41cf
c131990
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Shouldn't this check and raise be moved to after L265, where we already have similar checks in place? It seems to make more sense to have all waiting related errors handled at one place, and this method should be relegated to simply be a wait loop.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for catching this. I'm wondering now if the following check from
_wait_for_pods_to_schedule
should actually be done after the pods are scheduled and doesn't need to be checked from_wait_for_pods_to_schedule
.waiting.reason
can be set to'ContainerCreating'
only after the pods are scheduled, so checking if the pods reachedContainerCreating
state should be placed in_wait_for_pods_to_run
. And this can update the original waiting check at_wait_for_pods_to_run
with the waiting check from_wait_for_pods_to_schedule
.I updated
from
_wait_for_pods_to_run
withso that the post-schedule errors can be hanlded from
_wait_for_pods_to_run
. Please correct me if I'm missing anything! Tested for network error(post-schedule error) and excessive resource request error(pre-schedule error), and both failed over correctly.