Latest next VMWare OVA Fails To Boot #1802
Comments
so there's no messages at all on the console of the VMWare machines? Does the VM even attempt to boot at all or is it something happening at the VMware level that is causing it to not work at all? What happens if you boot a |
No console/boot messages at all so it seems like there is something wrong with the OVA. To all intents and purposes the VM in vSphere looks the same as a Rebasing a Also, I have re-confirmed today that the OVA deployment issue exists with the very latest |
Can you also confirm it DOES NOT exist with the latest |
Confirmed. Overnight |
We use the exact same build container to build I'm really not sure. I would expect something to come across the console that we could use to investigate, but you say there is nothing there either :( |
ahh. rebasing from Can you run |
This replicates the issue: the node fails to reboot, and still fails when a manual power-on signal is given via the vSphere console. For the record, this was the output I got when applying
|
Thanks @fifofonix. I've got a few more questions (sorry!). I've had at least one person report that installing Fedora Server 41 beta seems to work OK so maybe it's not GRUB and it is the way we've created the disk image itself (in the OVA). Is there a way you could try the "bare metal install" workflow using our ISO image (or PXE)? This would isolate the specific package set as the problem (i.e. where we previously suspected GRUB 2.12 as the problem) versus the built disk image as the problem. |
At least on my side bare metal install using ISO image works.
bootupd also seems to work and the system boots after the following commands.
|
Are you on VMWare? |
Booting the aarch64 live ISO on VMWare Fusion shows the Grub prompt and goes through to the live bash prompt. Is this sufficient to prove that Grub is not the issue or do I need to install to disk to complete this test? Note this is slightly different to the original issue which is reported for x86. Do I need to find an old Mac to test the x86 live ISO too? |
Yeah - not switching out the architecture would be nice. Sorry, I just thought you had a VMWare infra (other than your laptop) where you could run a test. It would be nice if we could try the test on the same architecture and same infra where you hit the original failures. I think that would be on x86_64, and yes, preferably a full install to disk + reboot. |
Had a colleague run the x86 ISO and install to disk and reboot on VMWare Workstation and everything goes well. This is an environment that fails when you try to install the OVA. |
Yes, I installed Windows 10 on a bare metal machine, installed VMWare Workstation 17 Player and then installed Fedora CoreOS |
Awesome. Thanks!
If you have time that would be great! More datapoints certainly help! |
Thanks! This should help us narrow down the root cause. I'm now guessing there is some issue in how the OVA is built for one versus the other. Though it is the same code that is currently building |
I can confirm that when I try to boot the |
From the community meeting: |
I also got the same issue using FCOS next (41) in VMWare Fusion on Mac (x86); FCOS 40 stable works just fine. |
Just adding thoughts here, the only thing that changed recently at our side was the osbuild part, maybe something before the OVA creation could be causing it? Here is some diff between the 2 images, the sizes are slightly different, other than that, seems only the grub is different.
I will try to create an image with fcos#41-next downgrading the grub to see what it gives us. |
According to the reported test results above it seems like it could be either. There's definitely something nuanced here.
That will be a good test. |
It is indeed a grub issue:
|
Can you try https://bodhi.fedoraproject.org/updates/FEDORA-2024-a067416d33 ? That should narrow it to the 2.12 rebase. |
Fedora BZ opened: https://bugzilla.redhat.com/show_bug.cgi?id=2317048 |
I added the options |
Two more data points that might be helpful:
|
@dustymabe f41 ova boots with BIOS; it is UEFI that is always failing. |
We found the issue: the new serial config for Grub 2.12 needs to have the port or the unit added to work. Changing it to : Seems the fix for us is to work with However, as @dustymabe mentioned in https://bugzilla.redhat.com/show_bug.cgi?id=2317048#c10 it may be an issue for users trying to upgrade. Thanks @marta-lewandowska for all your support and time spent on it! |
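For anyone wanting to check their own config for the pattern described above, here is a minimal sketch in Python. It scans GRUB config text for `serial` commands that specify neither `--unit` nor `--port` (both are documented options of GRUB's `serial` command). The function name and the sample config lines are hypothetical and made up for illustration; they are not the actual FCOS config or the exact change made in this issue.

```python
import shlex


def serial_lines_missing_target(cfg_text):
    """Return (line_number, line) pairs for `serial` commands that
    name neither a unit nor a port."""
    flagged = []
    for lineno, raw in enumerate(cfg_text.splitlines(), start=1):
        line = raw.strip()
        # Skip blank lines and whole-line comments.
        if not line or line.startswith("#"):
            continue
        try:
            tokens = shlex.split(line, comments=True)
        except ValueError:
            # Unbalanced quotes etc.; not a line this sketch can judge.
            continue
        # Only the `serial` command itself is of interest, not
        # `terminal_input serial` or `terminal_output serial`.
        if not tokens or tokens[0] != "serial":
            continue
        has_target = any(t.startswith(("--unit", "--port")) for t in tokens[1:])
        if not has_target:
            flagged.append((lineno, line))
    return flagged


# Hypothetical sample config, for illustration only.
sample = (
    "serial --speed=115200\n"
    "terminal_input serial console\n"
    "serial --unit=0 --speed=115200\n"
    "# serial\n"
)
result = serial_lines_missing_target(sample)
```

With the sample above, only the first line is flagged, since it sets a speed but no unit or port. This only detects the pattern; whether adding a unit or port is the right change for a given system is a separate question (see the upgrade concern in the Bugzilla comment linked above).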
Ok, "serial" is rarely used with UEFI (usually I test with UEFI console), I'll look into it |
I found the bug in grub ACPI code, fix attached to the above RH bugzilla and will be on its way upstream soon |
Thanks so much @ozbenh! @marta-lewandowska - could we get https://lists.gnu.org/archive/html/grub-devel/2024-10/msg00216.html backported to rawhide and Fedora 41? |
we're working on it. looks like upstream reviewed the patch, so we should be able to take it as is. |
and it's landed in https://bodhi.fedoraproject.org/updates/FEDORA-2024-7d58433dd5 Thanks all! |
- VMWare OVA Fails To Boot due grub serial bug; - Fast track packages with the fix. See: coreos/fedora-coreos-tracker#1802 Signed-off-by: Renata Ravanelli <[email protected]>
new package fast-track in coreos/fedora-coreos-config#3190 |
fixup in coreos/fedora-coreos-config#3209 |
|
Also did testing |
Tested on Mac x86 with Secure Boot; it also worked fine! |
This morning our daily scheduled pipelines for the deployment of |
The fix for this went into |
This issue never affected |
Describe the bug
When launching a sans-Ignition Fedora 41 next OVA in VMWare Workstation on Windows, the VM fails to boot with the message "The firmware encountered an unexpected exception. The virtual machine cannot boot." When using the testing Fedora 40 OVA, the VM boots to a login prompt without issue.
Separately, CICD scripts that deploy the same OVAs using OpenTofu to a server VMWare vSphere infrastructure also fail, although without such a message. In the server deployment case the VMs are listed in vSphere but in an 'off' status, and any power-on attempt leaves them in an 'off' status. No console messages or error messages are produced. Again, the same projects using testing deploy just fine.
Reproduction steps
Expected behavior
VM should boot to login as it does for prior FCOS versions
Actual behavior
As described above.
System details
Butane or Ignition config
None
Additional information