-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Rocky Linux 9 and AlmaLinux 9 hosts #301
Conversation
Do you want me to ping some rocky devs? I think they might be able to provide insight. |
Probably not yet, until VXLAN works for me on Rocky |
Okay I won't! But if they could be of help here (getting it working) let me know and I can. |
WIP: this seems to somehow enable VXLAN functional (sysctl values are from https://qiita.com/tom7/items/1bc7f4e568b20c306845)
|
Woot! So just to clarify - if I run this on the host nodes (not in containers) right before make up, this should work? I can try this tonight (after you confirm the above!) It would be so great to get this working on rocky because our networking is good there, but we haven't figured out ubuntu yet. |
It turns out that This has to be 0 (disabled) or 2 (loose) in the rootless dockerd's network namespace. This value may still remain 1 on the host. |
Signed-off-by: Akihiro Suda <[email protected]>
Now this is ready for testing. |
Excellent! So should I test this branch as it is now, no changes to my rocky base images, or do we need further changes? |
No further change is expected to be needed |
Awesome! My rocky image is building now and I should be able to bring up a testing cluster after dinner. Will send you an update when I do! 🎉 |
Confirmed that this works on AlmaLinux 9.2 too, of course |
hey @AkihiroSuda! Congrats on your award today, you and your contributions are amazing and we so appreciate you! I was running into some issues (related to this one, but on ubuntu) and wanted to post what I learned for some future person. cat: /sys/fs/cgroup/user.slice/user-501043911.slice/[email protected]/cgroup.controllers: No such file or directory
Failed to connect to bus: No such file or directory
[INFO] systemd not detected, dockerd-rootless.sh needs to be started manually: And the fix was to rebuild my base image, and I added $ cat /etc/sysctl.d/99-usernetes.conf
net.ipv4.conf.default.rp_filter = 2 I has already run this
(sidenote) no matter how many times I run this, I always see this warning and I haven't figured out why that's the case yet:
But I determined that I think it's still set to 1 on my host?
So I did: $ sudo vim /etc/sysctl.conf
vsochat_gmail_com@usernetes-compute-001:/opt/usernetes$ sudo sysctl -p
net.ipv4.conf.default.rp_filter = 2 (changing it to 2) and restarted docker:
And then the
For the worker node, my power went out and I didn't get to test it fully, but when I ran the script to bring up the worker it seemed to hang:
I think the above was running |
Thank you
Because the sysctl value of the dockerd process is propagated to the container.
Any error from kubeadm-join?
I'd suggest to use local VMs for an exercise e.g., with https://lima-vm.io/ :
|
oh neat - I am not familiar with this tool. I'll try this out after a meeting / later this evening and give you an update! |
okay so I created two rocky VMs - but I don't really know how to get them networked or even the basics. I do see there are templates: |
Okay I installed lima and QEMU and created two rocky VMs - and I don't know enough basics to even get a ping working from one VM to the other. I do see there are templates: And namely some for k8s and k3s - is there any reason there isn't a template for usernetes? is it that a template == one vm? It seems like if one person has stepped through this process of using lima (and knows how to do it) it would be logical to provide a template for a control plan and then N workers for someone else to easily deploy. |
Will bring up a cluster now and look into this! I've been working for months on these terraform (now OpenTofu) templates and it feels daunting to start from scratch with a VM tool I've never used before. I'm hoping I'm close with the tofu configs on GCP to have something working more quickly. |
okay here is the error from kubeadm-join:
|
If I shell in (or just run again from the outside) it hangs here:
|
For the control plane (that appears to work) what I see in
And the worker node (hanging) I see:
But I don't see the node is registered:
This did work once for me, when it was in the middle of development! I wish I knew what changed :/ I could try going back to rocky since that works now, but I had thought ubuntu was a more sound option. |
The hanging terminal finally timed out:
|
No description provided.