pure1-unplugged - kubelet node status | ERROR | kubelet node is not ready #41
Comments
Hello @Sephzer , sorry to hear you're running into this issue. A few questions for you:
I haven't seen this issue before so I'm hoping it's an easy fix/environment factor. |
Hi @Pure-AdamuKaapan , thanks for getting back to me. Unfortunately, I won't be able to upload any logs as this is in a dark site environment and is completely locked down. I noticed that after the initial install none of the services were running; it was showing 1/15 successful checks. So naturally I ran the init command and everything came up apart from that one service. I know enough Linux to get around comfortably, read logs, etc. Can you let me know what to check and what to look out for? I appreciate that this is not going to be easy, but we don't really have any other way to troubleshoot. |
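1/15 before running anything sounds reasonable (don't remember what the 1 check is but probably some pre-condition). Could you go ahead and check in the kubelet journalctl logs and just see if there's any obvious errors as to why it may not be starting? Placing my bets on networking setup for now but I guess we'll find out. Thanks! |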
Hi Adam,
Sorry, had a million and one things to do before finally getting to this.
So I managed to run the journalctl -u kubelet command and there are
definitely some errors present, however as I am running this on such a
small screen all of the output to the right-hand side gets cut off, even if
I log to desktop. Is there any way to fix this? If I can capture the info
then we should hopefully be able to get to the bottom of this once and for
all.
…On Fri, 26 Feb 2021 at 18:09, Adam Krpan ***@***.***> wrote:
1/15 before running anything sounds reasonable (don't remember what the 1
check is but probably some pre-condition).
Could you go ahead and check in the kubelet journalctl logs and just see
if there's any obvious errors as to why it may not be starting? Placing my
bets on networking setup for now but I guess we'll find out. Thanks!
|
@Sephzer I believe in journalctl you can scroll to the right using the arrow keys (at least, I'm able to on my instance of Unplugged). If not, you can save the output to a file and then use an editor (whether on the VM or on another server in the same network) to view the full thing. |
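For a console too narrow to show full lines, another option is to re-wrap the saved log instead of scrolling. The following is only a minimal sketch: it assumes the output of `journalctl -u kubelet` has already been saved to a file (for example by redirecting it with `--no-pager`), and the file name `kubelet.log` and the 80-column width are illustrative choices, not something from this thread.

```python
import textwrap


def wrap_log_lines(lines, width=80):
    """Wrap each log line to `width` columns, indenting continuation lines."""
    wrapped = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            # Preserve blank lines as-is.
            wrapped.append("")
            continue
        wrapped.append(
            textwrap.fill(
                line,
                width=width,
                subsequent_indent="    ",  # marks continuation of one entry
                break_long_words=True,     # split long paths/URLs if needed
                break_on_hyphens=False,
            )
        )
    return wrapped


def print_wrapped(path="kubelet.log", width=80):
    # `path` is a hypothetical file name; the log must be saved there first.
    with open(path, encoding="utf-8", errors="replace") as handle:
        for block in wrap_log_lines(handle, width):
            print(block)
```

Calling `print_wrapped("kubelet.log", width=60)` would print every entry in full, with continuation lines indented so that individual entries stay distinguishable.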
Can’t believe I didn’t think of that... let me get back to you =]. Cheers.
…On Wed, 3 Mar 2021 at 17:58, Adam Krpan ***@***.***> wrote:
@Sephzer <https://github.com/Sephzer> I believe in journalctl you can
scroll to the right using the arrow keys (at least, I'm able to on my
instance of Unplugged). If not, you can save the output to a file and then
use an editor (whether on the VM or on another server in the same network)
to view the full thing.
|
@Pure-AdamuKaapan I've managed to capture the logs properly so they're now legible. Thanks for the assist there. Now on to what's in the logs. Lots of failures connecting to the API on port 6443; the connection is always refused, which looks like a FW issue, though that's a bit strange as it's connecting to itself. Then we have a lot of 'node not found' errors. Also, lots of 'no networks found in /etc/cni/net.d', 'container runtime network not ready' and 'network plugin not ready'. Not sure if that helps. Let me know if you need any specifics. |
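One way to narrow down the port 6443 symptom is to tell a refused connection apart from a timed-out one. A refusal generally means the host answered but nothing accepted the connection (no listener, or an active reject), whereas a firewall that silently drops packets usually shows up as a timeout. The sketch below is a generic TCP probe, not a Pure1 Unplugged tool; the function name `probe_tcp` and the choice of `127.0.0.1` in the usage note are assumptions made for illustration.

```python
import socket


def probe_tcp(host, port, timeout=3.0):
    """Attempt a TCP connection and classify the result.

    Returns 'open', 'refused', 'timeout', or 'error: <reason>'.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # The host replied, but nothing accepted the connection.
        return "refused"
    except socket.timeout:
        # No reply at all within `timeout` seconds (e.g. packets dropped).
        return "timeout"
    except OSError as exc:
        # Anything else: unreachable network, bad host name, and so on.
        return f"error: {exc}"
```

Run on the VM itself, `probe_tcp("127.0.0.1", 6443)` returning `"refused"` would suggest that the API server is not listening on that address at all, which points away from a firewall; the address the API server actually binds to may differ, so the VM's own IP is worth probing as well.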
Okay, so what I'm gathering from that is that it's a Kubernetes networking issue. Can you do Also, what do the two |
Morning @Pure-AdamuKaapan, I've just run the grep command and there are no errors in there. I have found matching entries and it all looks good, existing endpoints are found and mac, interfaces are added etc. Can also confirm that the two calico checks are green, no issues there. The only error is the |
@Sephzer that is bizarre... so far most of the errors still point towards network plugin issues/Calico not coming up right. Can you do Can you also try the following and send the output?
|
@Pure-AdamuKaapan no output on the export command. The second command gives the following (all entries are running and are ready 1/1). This has been typed out manually and abbreviated.
Let me know if you need anything else. |
@Sephzer very strange... I need some time to do some research, this is an issue I've never seen before and everything seems to be fine as far as I can tell. I'll try to have some more ideas/information in the next couple days. |
@Sephzer just to check since I realized this could be the case: for the various errors you say it's spitting out, is it still doing so? Or were they just emitted at one point and now the services are sitting there with no output/different output? Also, you say that you've encountered this issue with both the OVA and ISO so it sounds like just a reinstall won't fix things. I would be curious to know though: what's the IP address (or at least CIDR) of your VM, and what are your CIDRs set to in |
@Pure-AdamuKaapan Sorry it took so long to get back to you, things have been manic at the bank. Let me see if I can dig up the details for you. Don't think I changed the internal subnets but I could be wrong. |
@Pure-AdamuKaapan apologies once again, finally got round to this. I've just run the status command and now we are getting 15 errors... no idea why. Might reboot the VM and try again. Looks like the IP address has now disappeared as well... not really sure what is going on now. POD CIDR = 192.168.0.0/16, serviceCIDR = 10.96.0.0/12. Will attempt to get the box back online with an IP. |
Oooh that sounds fun, I wish you the best in resolving that. Once you get the VM back online, can you make sure that neither the pod CIDR nor the service CIDR contains the IP address of the VM or of any of the arrays/blades you're connecting to (and then please let me know what you updated it to). For example, if your VM and the arrays are on the Thanks! |
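The overlap check described above can be done with Python's standard `ipaddress` module. This is a minimal sketch: the two CIDRs are the values reported earlier in the thread, while the function name `find_cidr_conflicts` and the example addresses in the usage note are hypothetical placeholders, since the real VM and array IPs were not posted.

```python
import ipaddress


def find_cidr_conflicts(cidrs, addresses):
    """Return (name, cidr, address) for every address inside a cluster CIDR."""
    conflicts = []
    for name, cidr in cidrs.items():
        network = ipaddress.ip_network(cidr, strict=False)
        for addr in addresses:
            if ipaddress.ip_address(addr) in network:
                conflicts.append((name, cidr, addr))
    return conflicts


# CIDR values as reported in this thread.
CLUSTER_CIDRS = {
    "podCIDR": "192.168.0.0/16",
    "serviceCIDR": "10.96.0.0/12",
}
```

For instance, with made-up addresses, `find_cidr_conflicts(CLUSTER_CIDRS, ["192.168.10.20", "10.20.30.40"])` reports a conflict for `192.168.10.20` (it lies inside the pod CIDR) and none for `10.20.30.40`. An empty result means none of the supplied addresses fall inside either range.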
I'm experiencing an issue with a brand new .ISO install.
After completing the guide and browsing to the pure1-unplugged IP, I am getting err_connection_refused. After checking the CLI and running puctl infra status, the kubelet node status is showing an error. Everything else has passed and is green.
I have never used Kubernetes before so I am not sure where to look for help on this, especially as this looks to be a non-standard implementation. I have tried to check the logs specified in the infra status command but there is nothing that draws any attention to itself.
Can you let me know how best to troubleshoot this? I have also experienced the same error when installing from the .OVA.
Just to note, Zubair from Pure is also engaged from professional services and ideally we need to get Pure1 up and running ASAP. Cheers.