Add QEMU on Windows to CI #3475

arixmkii · 2025-04-26T17:14:43Z

For now it will use additional templates, because of incompatible mounts.

This is probably not for 1.1.0.

It is possible to use default.yaml for Windows with changes from #3318 (this would need rebase first, but I checked it using a rebased patch in a forked repo - example run https://github.com/arixmkii/qcw/actions/runs/14681726480/job/41205771124).

arixmkii · 2025-04-26T17:23:34Z

time="2025-04-26T17:20:45Z" level=fatal msg="failed to validate YAML file "C:\\a\\lima\\lima\\templates\\experimental\\default-windows.yaml": can't parse builtin Lima version "cfbffd8": cfbffd8 is not in dotted-tri format"

~~make/git on Windows incorrectly resolve version. I will check it (no such issues, when checkout and build are done with msys2 tools).~~ fixed

Another topic to check - use chocolatey to install QEMU, because msys2 QEMU installation feels slow.

arixmkii · 2025-04-26T17:38:20Z

Probably would need to move mounts-windows under _default to not fail validation script.

arixmkii · 2025-04-28T18:12:22Z

Chocolatey QEMU package is not well maintained, so, I chose winget instead, which is a great alternative. There is a known limitation that it is not available out of the box in Windows Server 2022, so, there is a hacky action to add it, which is now archived and will not be needed at all after migration to Windows Server 2025, this is highlighted by the comment.

arixmkii · 2025-04-28T18:30:03Z

@jandubois @AkihiroSuda I would like to know your opinions on how reasonable is it to extend CI to support this (to not overload CI and not increase costs significantly). From my side there is no rush and I can see reasons to postpone this until #3316 is addressed (via #3318 refresh or other means). Also it might be reasonable to wait for migration to WinServer 2025 to not use now archived https://github.com/Cyberboss/install-winget action.

I authored it now to have proof of concept confirmed and potentially creating reference starting point for its introduction.

templates/experimental/default-windows.yaml

.github/workflows/test.yml

AkihiroSuda · 2025-05-02T06:28:56Z

.github/workflows/test.yml

@@ -175,6 +175,44 @@ jobs:
        $env:_LIMA_WINDOWS_EXTRA_PATH = 'C:\Program Files\Git\usr\bin'
        bash.exe -c "./hack/test-templates.sh templates/experimental/wsl2.yaml"

+  windows-qemu:


Can we now drop these lines?

lima/pkg/limayaml/defaults.go

Lines 68 to 75 in df13f30

if runtime.GOOS == "windows" && runtime.GOARCH == "amd64" {

// https://github.com/lima-vm/lima/pull/3487#issuecomment-2846253560

// > #931 intentionally prevented the code from setting it to max when running on Windows,

// > and kept it at qemu64.

//

// TODO: remove this if "max" works with the latest qemu

defaultX8664 = "qemu64"

}

From my experience "max" just didn't work well with WHPX acceleration. I tested it on 3 different machines in the past. I only was able to make it work by disabling specific CPU features, which were different on every machine. It was not user friendly default. I can do some canary testing to compare how it works now with newer QEMU/Windows versions and if the failures are as common as they were before.

arixmkii · 2025-05-02T19:18:01Z

I tried to limit both Windows builds to windows-2025 standard. I see QEMU one failed with errors mounting SSHFS (I observed this instability before with standard runners, they are definitely recurring and could very persistent restarting job). WSL2 faced some error and now test in a locked state (it will be terminated after 30 minutes time out, because I can't cancel it manually). I can say that WSL2 was less stable (comparing to Lima 8-cores runner), when I used standard runners, but I mostly faced errors from sysmtemd, this one is new.

I will restart build setting both to windows-2025-8-cores to compare.

arixmkii · 2025-05-02T19:36:54Z

It didn't help for QEMU

time="2025-05-02T19:26:57Z" level=info msg="[hostagent] :/c/Users/runneradmin: No such file or directory"
time="2025-05-02T19:26:57Z" level=warning msg="[hostagent] failed to confirm whether /c/Users/runneradmin [remote] is successfully mounted" error="failed to execute script \"wait-for-remote-ready\": stdout=\"\", stderr=\"mux_client_request_session: read from master failed: Connection reset by peer\\r\\nControlSocket /c/Users/runneradmin/.lima/default/ssh.sock already exists, disabling multiplexing\\r\\nsshfs does not seem to be mounted on /c/Users/runneradmin\\n\": exit status 1"

SSHFS is weird on Windows in general and inside runners specifically. Giving some insights on my experience testing this in GH runners for a month or so. It always (or almost always) failed to mount $TEMP, but most of the time managed to mount $HOME, the situation with $TEMP - if temp was tried, but was not mounted the integration tests will still pass.

Troubleshooting the $TEMP issue locally I first managed to replicate it on my dev machine, but the fix was to clean the content of $TEMP folder. It looked like sftp-server might be sensitive to the folder contents, but I didn't try to test this in details.

I'm thinking I will test the standard runners and disable mount tests on Windows platform with a comment of them being flaky - which they indeed are.

Will experiment in my repo on isolated examples and then will update this PR once again.

jandubois · 2025-05-03T04:35:06Z

It always (or almost always) failed to mount $TEMP, but most of the time managed to mount $HOME

Is this just another instance of #302? Because $TEMP will be located at $HOME\AppData\Temp?

I always thought the issue was the overlap in the guest, but maybe the overlap on the host is the real problem?

At the time I filed #302 we did not yet have support for specifying a different mountPoint, so it was impossible to tell which side was causing the issue.

AkihiroSuda · 2025-06-15T17:47:25Z

What's the current status of this?

arixmkii · 2025-06-17T14:30:03Z

I believe I managed (up to my current understanding of the flaky part) to address this running it locally. I will experiment in GH CI in a forked repo and will post the status here this week.

Signed-off-by: Arthur Sengileyev <[email protected]>

AkihiroSuda · 2025-06-18T14:13:13Z

.github/workflows/test.yml

-    name: "Windows tests"
-    runs-on: windows-2022-8-cores
+    name: "Windows tests (WSL2)"
+    runs-on: windows-2025-8-cores


Maybe

Suggested change

runs-on: windows-2025-8-cores

runs-on: windows-2025

We can try this. From my observations it was less stable in a forked repo, where I have only default runners. I will apply this change after I will check the comment with qemucpu to max one.

If this will result in unstable builds it can be reverted later.

AkihiroSuda

Thanks

arixmkii · 2025-06-18T14:46:20Z

I spent all day doing experiments with CI to make SSHFS behave on Windows. What I tried

setting HOME and TEMP to different locations to have reduced FS tree for sftp-server
setting HOME and TEMP to not nested setup (default on windows is TEMP nested deep inside HOME)
trying different drives C: and D: as locations - may be the disks have different I/O settings in the runner

What I failed to test

running test as a different user to fully isolate from build using psexec of paexec. I gave up on this idea because it is difficult to pass all needed environment configurations to another user session and there is no good way to redirect logs to stdout for traceability of the test runs

For now it seems that disabling sshfs tests on Windows was a way to go.

I have success to run them in github CI in a forked repo, where I chain 2 VMs - one creating build in a zip form and another one unzipping artifact and running tests on it. As Lima build is I/O heavy it might be that it hits sort of I/O throttling after the build, but I have no way to confirm it. And this chaining strategy giving better results still sometimes result in failures. I will continue to experiment with different setups for sshfs tests in a forked repo.

arixmkii marked this pull request as draft April 26, 2025 17:23

arixmkii force-pushed the qemu-win-ci branch from 835b9ac to 1d05008 Compare April 26, 2025 17:28

arixmkii mentioned this pull request Apr 26, 2025

[Windows Please] #909

Open

arixmkii force-pushed the qemu-win-ci branch from 1d05008 to 51c0388 Compare April 28, 2025 18:08

arixmkii marked this pull request as ready for review April 28, 2025 18:23

AkihiroSuda added this to the v1.1.0 milestone Apr 30, 2025

AkihiroSuda added component/qemu QEMU platform/Windows labels Apr 30, 2025

AkihiroSuda reviewed Apr 30, 2025

View reviewed changes

templates/experimental/default-windows.yaml Outdated Show resolved Hide resolved

AkihiroSuda reviewed Apr 30, 2025

View reviewed changes

.github/workflows/test.yml Outdated Show resolved Hide resolved

AkihiroSuda reviewed May 2, 2025

View reviewed changes

arixmkii force-pushed the qemu-win-ci branch from 51c0388 to ff4c2d5 Compare May 2, 2025 18:50

arixmkii force-pushed the qemu-win-ci branch from ff4c2d5 to 85aaec3 Compare May 2, 2025 19:21

AkihiroSuda modified the milestones: v1.1.0, v1.1.x (?) May 12, 2025

Add QEMU on Windows to CI

9f7b9a8

Signed-off-by: Arthur Sengileyev <[email protected]>

arixmkii force-pushed the qemu-win-ci branch from 85aaec3 to 9f7b9a8 Compare June 18, 2025 14:11

AkihiroSuda reviewed Jun 18, 2025

View reviewed changes

AkihiroSuda modified the milestones: v1.1.x (?), v1.1.2 Jun 18, 2025

AkihiroSuda approved these changes Jun 18, 2025

View reviewed changes

AkihiroSuda merged commit db2c41a into lima-vm:master Jun 18, 2025
38 checks passed

	if runtime.GOOS == "windows" && runtime.GOARCH == "amd64" {
	// https://github.com/lima-vm/lima/pull/3487#issuecomment-2846253560
	// > #931 intentionally prevented the code from setting it to max when running on Windows,
	// > and kept it at qemu64.
	//
	// TODO: remove this if "max" works with the latest qemu
	defaultX8664 = "qemu64"
	}

Add QEMU on Windows to CI #3475

Add QEMU on Windows to CI #3475

Uh oh!

Conversation

arixmkii commented Apr 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arixmkii commented Apr 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arixmkii commented Apr 26, 2025

Uh oh!

arixmkii commented Apr 28, 2025

Uh oh!

arixmkii commented Apr 28, 2025

Uh oh!

Uh oh!

Uh oh!

AkihiroSuda May 2, 2025

Choose a reason for hiding this comment

Uh oh!

arixmkii May 2, 2025

Choose a reason for hiding this comment

Uh oh!

arixmkii commented May 2, 2025

Uh oh!

arixmkii commented May 2, 2025

Uh oh!

jandubois commented May 3, 2025

Uh oh!

AkihiroSuda commented Jun 15, 2025

Uh oh!

arixmkii commented Jun 17, 2025

Uh oh!

AkihiroSuda Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

arixmkii Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

AkihiroSuda left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

arixmkii commented Jun 18, 2025

Uh oh!

Uh oh!

arixmkii commented Apr 26, 2025 •

edited

Loading

arixmkii commented Apr 26, 2025 •

edited

Loading