Do not close FDs 0, 1, or 2 #186

DemiMarie · 2025-01-09T21:51:30Z

If they are closed, another file descriptor could be created with these numbers, and so standard library functions that use them might write to an unwanted place. dup2() a file descriptor to /dev/null over them instead.
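To illustrate the hazard: POSIX requires open() to return the lowest-numbered unused descriptor, so a closed FD 0, 1, or 2 is handed out again by the next open(). The following is a minimal sketch, not the code from this PR; the helper names `reopen_after_close` and `redirect_to_devnull` are hypothetical.

```c
#include <fcntl.h>
#include <unistd.h>

/* Hazard: close `fd`, then open `path`. If `fd` was the lowest descriptor
 * in use, open() reuses that number, so anything that later writes to `fd`
 * ends up in `path`. Returns the descriptor number open() handed out. */
int reopen_after_close(int fd, const char *path)
{
    close(fd);
    return open(path, O_WRONLY);
}

/* Alternative: keep `fd` allocated, but point it at /dev/null.
 * Returns 0 on success, -1 on failure. */
int redirect_to_devnull(int fd)
{
    int null_fd = open("/dev/null", O_RDWR);
    if (null_fd < 0)
        return -1;
    if (null_fd == fd)      /* fd was already closed and open() filled the slot */
        return 0;
    int rc = (dup2(null_fd, fd) == fd) ? 0 : -1;
    close(null_fd);         /* the temporary descriptor is no longer needed */
    return rc;
}
```

Calling `redirect_to_devnull()` for each of 0, 1, and 2 keeps those numbers occupied, so a later open() or socket() cannot land on them.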

codecov · 2025-01-09T22:17:30Z

Codecov Report

Attention: Patch coverage is 51.57895% with 46 lines in your changes missing coverage. Please review.

Project coverage is 79.18%. Comparing base (6077a10) to head (c7a7826).
Report is 2 commits behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| agent/qrexec-agent.c | 54.66% | 34 Missing ⚠️ |
| libqrexec/exec.c | 40.00% | 12 Missing ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #186      +/-   ##
==========================================
+ Coverage   79.17%   79.18%   +0.01%     
==========================================
  Files          54       54              
  Lines        9953     9999      +46     
==========================================
+ Hits         7880     7918      +38     
- Misses       2073     2081       +8

DemiMarie · 2025-01-10T03:16:22Z

Codecov appears to not be testing what happens in the child process after fork() and the error path of “cannot open /dev/null”.

marmarek · 2025-01-10T03:51:46Z

AFAIR, unit tests do not cover the PAM handling part: they do not run as a system service, test runners don't have the necessary PAM configuration, etc.

qubesos-bot · 2025-01-11T14:27:55Z

OpenQA test summary

Complete test suite and dependencies: https://openqa.qubes-os.org/tests/overview?distri=qubesos&version=4.3&build=2025011103-4.3&flavor=pull-requests

Test run included the following:

New failures, excluding unstable

Compared to: https://openqa.qubes-os.org/tests/overview?distri=qubesos&version=4.3&build=2024111705-4.3&flavor=update

system_tests_qrexec
- TC_00_Qrexec_debian-12-xfce: test_053_qrexec_vm_service_eof_reverse (failure)
  AssertionError: Timeout, probably EOF wasn't transferred
- TC_00_Qrexec_debian-12-xfce: test_092_qrexec_service_socket_dom0_eof_reverse (failure)
  AssertionError: service timeout, probably EOF wasn't transferred fr...
- TC_00_Qrexec_debian-12-xfce: test_098_qrexec_service_socket_vm_eof (failure)
  AssertionError: service timeout, probably EOF wasn't transferred to...
- TC_00_Qrexec_fedora-41-xfce: test_053_qrexec_vm_service_eof_reverse (failure)
  AssertionError: Timeout, probably EOF wasn't transferred
- TC_00_Qrexec_fedora-41-xfce: test_092_qrexec_service_socket_dom0_eof_reverse (failure)
  AssertionError: service timeout, probably EOF wasn't transferred fr...
- TC_00_Qrexec_fedora-41-xfce: test_098_qrexec_service_socket_vm_eof (failure)
  AssertionError: service timeout, probably EOF wasn't transferred to...
- TC_00_Qrexec_whonix-gateway-17: test_053_qrexec_vm_service_eof_reverse (failure)
  AssertionError: Timeout, probably EOF wasn't transferred
- TC_00_Qrexec_whonix-gateway-17: test_083_qrexec_service_argument_specific_implementation (error)
  subprocess.CalledProcessError: Command '/usr/lib/qubes/qrexec-clien...
- TC_00_Qrexec_whonix-gateway-17: test_092_qrexec_service_socket_dom0_eof_reverse (failure)
  AssertionError: service timeout, probably EOF wasn't transferred fr...
- TC_00_Qrexec_whonix-gateway-17: test_098_qrexec_service_socket_vm_eof (failure)
  AssertionError: service timeout, probably EOF wasn't transferred to...
- TC_00_Qrexec_whonix-workstation-17: test_053_qrexec_vm_service_eof_reverse (failure)
  AssertionError: Timeout, probably EOF wasn't transferred
- TC_00_Qrexec_whonix-workstation-17: test_092_qrexec_service_socket_dom0_eof_reverse (failure)
  AssertionError: service timeout, probably EOF wasn't transferred fr...
- TC_00_Qrexec_whonix-workstation-17: test_098_qrexec_service_socket_vm_eof (failure)
  AssertionError: service timeout, probably EOF wasn't transferred to...
system_tests_dispvm
- TC_20_DispVM_fedora-41-xfce: test_100_open_in_dispvm (failure)
  AssertionError: './open-file test.txt' failed with ./open-file test...
system_tests_kde_gui_interactive
- gui_keyboard_layout: wait_serial (wait serial expected)
  # wait_serial expected: "echo -e '[Layout]\nLayoutList=us,de' | sud...

Failed tests

16 failures

system_tests_qrexec
- TC_00_Qrexec_debian-12-xfce: test_053_qrexec_vm_service_eof_reverse (failure)
  AssertionError: Timeout, probably EOF wasn't transferred
- TC_00_Qrexec_debian-12-xfce: test_092_qrexec_service_socket_dom0_eof_reverse (failure)
  AssertionError: service timeout, probably EOF wasn't transferred fr...
- TC_00_Qrexec_debian-12-xfce: test_098_qrexec_service_socket_vm_eof (failure)
  AssertionError: service timeout, probably EOF wasn't transferred to...
- TC_00_Qrexec_fedora-41-xfce: test_053_qrexec_vm_service_eof_reverse (failure)
  AssertionError: Timeout, probably EOF wasn't transferred
- TC_00_Qrexec_fedora-41-xfce: test_092_qrexec_service_socket_dom0_eof_reverse (failure)
  AssertionError: service timeout, probably EOF wasn't transferred fr...
- TC_00_Qrexec_fedora-41-xfce: test_098_qrexec_service_socket_vm_eof (failure)
  AssertionError: service timeout, probably EOF wasn't transferred to...
- TC_00_Qrexec_whonix-gateway-17: test_053_qrexec_vm_service_eof_reverse (failure)
  AssertionError: Timeout, probably EOF wasn't transferred
- TC_00_Qrexec_whonix-gateway-17: test_083_qrexec_service_argument_specific_implementation (error)
  subprocess.CalledProcessError: Command '/usr/lib/qubes/qrexec-clien...
- TC_00_Qrexec_whonix-gateway-17: test_092_qrexec_service_socket_dom0_eof_reverse (failure)
  AssertionError: service timeout, probably EOF wasn't transferred fr...
- TC_00_Qrexec_whonix-gateway-17: test_098_qrexec_service_socket_vm_eof (failure)
  AssertionError: service timeout, probably EOF wasn't transferred to...
- TC_00_Qrexec_whonix-workstation-17: test_053_qrexec_vm_service_eof_reverse (failure)
  AssertionError: Timeout, probably EOF wasn't transferred
- TC_00_Qrexec_whonix-workstation-17: test_092_qrexec_service_socket_dom0_eof_reverse (failure)
  AssertionError: service timeout, probably EOF wasn't transferred fr...
- TC_00_Qrexec_whonix-workstation-17: test_098_qrexec_service_socket_vm_eof (failure)
  AssertionError: service timeout, probably EOF wasn't transferred to...
system_tests_dispvm
- TC_20_DispVM_fedora-41-xfce: test_100_open_in_dispvm (failure)
  AssertionError: './open-file test.txt' failed with ./open-file test...
system_tests_kde_gui_interactive
- gui_keyboard_layout: wait_serial (wait serial expected)
  # wait_serial expected: "echo -e '[Layout]\nLayoutList=us,de' | sud...
- gui_keyboard_layout: Failed (test died)
  # Test died: command 'test "$(cd ~user;ls e1*)" = "$(qvm-run -p wor...

Fixed failures

Compared to: https://openqa.qubes-os.org/tests/119126#dependencies

3 fixed

system_tests_extra
- TC_00_QVCTest_whonix-gateway-17: test_010_screenshare (failure)
  ~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^... AssertionError: 0 == 0
system_tests_basic_vm_qrexec_gui_zfs
- switch_pool: Failed (test died)
  # Test died: command 'dnf install -y ./zfs-release.rpm' failed at /...
system_tests_audio@hw1
- TC_20_AudioVM_PipeWire_whonix-workstation-17: test_260_audio_mic_enabled_switch_audiovm (failure)
  AssertionError: too short audio, expected 10s, got 0.00013605442176...

Unstable tests

system_tests_update
update2/Failed (1/5 times with errors)
- job 121711 # Test died: command '(set -o pipefail; qubesctl --show-output stat...
system_tests_update@hw1
update2/Failed (1/5 times with errors)
- job 121711 # Test died: command '(set -o pipefail; qubesctl --show-output stat...
system_tests_update@hw7
update2/Failed (1/5 times with errors)
- job 121711 # Test died: command '(set -o pipefail; qubesctl --show-output stat...

marmarek · 2025-01-11T15:35:18Z

system_tests_qrexec

* TC_00_Qrexec_debian-12-xfce: [test_053_qrexec_vm_service_eof_reverse](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_debian-12-xfce/4) (failure)
  `AssertionError: Timeout, probably EOF wasn't transferred`

* TC_00_Qrexec_debian-12-xfce: [test_092_qrexec_service_socket_dom0_eof_reverse](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_debian-12-xfce/18) (failure)
  `AssertionError: service timeout, probably EOF wasn't transferred fr...`

* TC_00_Qrexec_debian-12-xfce: [test_098_qrexec_service_socket_vm_eof](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_debian-12-xfce/23) (failure)
  `AssertionError: service timeout, probably EOF wasn't transferred to...`

* TC_00_Qrexec_fedora-41-xfce: [test_053_qrexec_vm_service_eof_reverse](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_fedora-41-xfce/4) (failure)
  `AssertionError: Timeout, probably EOF wasn't transferred`

* TC_00_Qrexec_fedora-41-xfce: [test_092_qrexec_service_socket_dom0_eof_reverse](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_fedora-41-xfce/18) (failure)
  `AssertionError: service timeout, probably EOF wasn't transferred fr...`

* TC_00_Qrexec_fedora-41-xfce: [test_098_qrexec_service_socket_vm_eof](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_fedora-41-xfce/23) (failure)
  `AssertionError: service timeout, probably EOF wasn't transferred to...`

* TC_00_Qrexec_whonix-gateway-17: [test_053_qrexec_vm_service_eof_reverse](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_whonix-gateway-17/4) (failure)
  `AssertionError: Timeout, probably EOF wasn't transferred`

* TC_00_Qrexec_whonix-gateway-17: [test_083_qrexec_service_argument_specific_implementation](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_whonix-gateway-17/14) (error)
  `subprocess.CalledProcessError: Command '/usr/lib/qubes/qrexec-clien...`

* TC_00_Qrexec_whonix-gateway-17: [test_092_qrexec_service_socket_dom0_eof_reverse](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_whonix-gateway-17/18) (failure)
  `AssertionError: service timeout, probably EOF wasn't transferred fr...`

* TC_00_Qrexec_whonix-gateway-17: [test_098_qrexec_service_socket_vm_eof](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_whonix-gateway-17/23) (failure)
  `AssertionError: service timeout, probably EOF wasn't transferred to...`

* TC_00_Qrexec_whonix-workstation-17: [test_053_qrexec_vm_service_eof_reverse](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_whonix-workstation-17/4) (failure)
  `AssertionError: Timeout, probably EOF wasn't transferred`

* TC_00_Qrexec_whonix-workstation-17: [test_092_qrexec_service_socket_dom0_eof_reverse](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_whonix-workstation-17/18) (failure)
  `AssertionError: service timeout, probably EOF wasn't transferred fr...`

* TC_00_Qrexec_whonix-workstation-17: [test_098_qrexec_service_socket_vm_eof](https://openqa.qubes-os.org/tests/125357#step/TC_00_Qrexec_whonix-workstation-17/23) (failure)
  `AssertionError: service timeout, probably EOF wasn't transferred to...`

This is the only qrexec PR in this test run, so the above failures seem to be a regression caused by this change.

If they are closed, another file descriptor could be created with these numbers, and so standard library functions that use them might write to an unwanted place. dup2() a file descriptor to /dev/null over them instead. Also statically initialize trigger_fd to -1, which is the conventional value for an invalid file descriptor. This requires care to avoid closing the file descriptor to /dev/null in fix_fds(), which took over an hour to debug.

marmarek · 2025-01-12T00:03:02Z

libqrexec/libqrexec-utils.h

@@ -162,6 +162,11 @@ void buffer_append(struct buffer *b, const char *data, int len);
 void buffer_remove(struct buffer *b, int len);
 int buffer_len(struct buffer *b);
 void *buffer_data(struct buffer *b);
+/* Open /dev/null and keep it from being closed before the exec func is called.


Isn't it simpler (and safer) to simply open /dev/null just before dup-ing it over 0,1,2 (in the child process already)?
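A minimal sketch of the alternative suggested here, under the assumption that the descriptor is opened in the forked process immediately before the dup2() calls, so no other code has a chance to close it in between. The function name `run_child_with_null_stdio` and the exit codes are hypothetical; the child only verifies its descriptors and exits, where a real agent would continue with its own work.

```c
#include <fcntl.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Fork; in the child, open /dev/null and immediately dup2() it over FDs 0-2.
 * Returns the child's exit status (0 if all three FDs are valid afterwards),
 * or -1 if fork()/waitpid() failed or the child died abnormally. */
int run_child_with_null_stdio(void)
{
    pid_t pid = fork();
    if (pid < 0)
        return -1;
    if (pid == 0) {
        int null_fd = open("/dev/null", O_RDWR);
        if (null_fd < 0)
            _exit(2);                   /* cannot open /dev/null */
        for (int fd = 0; fd <= 2; fd++)
            if (dup2(null_fd, fd) != fd)
                _exit(2);
        if (null_fd > 2)
            close(null_fd);             /* keep only 0, 1, and 2 */
        for (int fd = 0; fd <= 2; fd++)
            if (fcntl(fd, F_GETFD) == -1)
                _exit(1);               /* a standard FD is unexpectedly closed */
        _exit(0);
    }
    int status;
    if (waitpid(pid, &status, 0) != pid)
        return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```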

marmarek · 2025-01-12T00:10:24Z

But also, I question the usefulness of this PR as a whole: the closing of standard FDs happens in a process that has a single purpose, which is to wait for the child process and then exit, in the very same function where the closing happens. There are a few PAM cleanup calls, but it's very unlikely for them to be a problem (especially as it isn't a problem now, and hasn't been for the last 10 or so years).

DemiMarie · 2025-01-12T00:55:41Z

Whether or not the last commit in the PR is merged, I definitely think the other commits should be merged. In particular, it turned out that the “close the FD” functionality had no unit tests because the unit tests took a codepath that was too different from the production code. This PR makes the production and test code follow the same path, with the result that the actual bug (/dev/null FD being closed by fix_fds()) is now caught. I think that this test improvement (and the other bug fixes) is itself useful.

> There are a few PAM cleanup calls, but it's very unlikely for them to be a problem (especially, it isn't a problem now, or for the last 10 or so years).

PAM cleanup calls into PAM modules, so it can do anything. I suspect Qubes OS only gets away with it because we have a fairly simple PAM stack by default. PAM cleanup is used for e.g. unmounting filesystems and closing encrypted volumes.

The best approach would be for PAM to run with stdin pointed at /dev/null and stdout and stderr pointed at the system log. The FDs would be fixed directly before executing the child process. That’s a bigger refactor, though.

marmarek · 2025-01-12T01:21:10Z

> Whether or not the last commit in the PR is merged,

Indeed I was talking about the last commit (which until the last force-push was the only commit in this PR).

> PAM cleanup calls into PAM modules, so it can do anything. I suspect Qubes OS only gets away with it because we have a fairly simple PAM stack by default.

Aren't PAM modules expected to handle proper logging themselves? I don't think they are supposed to touch the calling process's stdin/out/err in any case. And if they did, that would likely also interfere in cases where those FDs aren't closed (and then replaced with an unrelated thing): for example, it could interfere with an application log file on stderr that is expected to be in a specific format (different from PAM messages).

marmarek · 2025-01-12T01:25:30Z

As for the other commits, won't that have some non-trivial conflicts with #141 (which I hope is quite close to merge-able state)?

DemiMarie · 2025-01-12T03:06:59Z

> As for the other commits, won't that have some non-trivial conflicts with #141 (which I hope is quite close to merge-able state)?

I can include them in #141 or rebase this PR on top of it. I can also close this PR if you prefer, but I’d prefer that at least the bug fixes and testability changes go in.

DemiMarie force-pushed the no-close-low-fd branch from 29a0392 to e123ed7 on January 9, 2025 21:51

marmarek added the openqa-pending label Jan 11, 2025

This was referenced Jan 11, 2025

Add filter for qubes stored on inaccessible storage pools QubesOS/qubes-manager#399

Draft

Make the qubes-pciback module omittable QubesOS/qubes-core-admin-linux#178

Open

DemiMarie added 10 commits January 11, 2025 11:44

Use QREXEC_EXIT_PROBLEM for errors spawning child process

81ef557

This is the convention used by the rest of qrexec. This commit should be backported to stable branches.

Fix checking for memory allocation errors

f76920a

These should never happen, but call exit() if they do.

Set PAM error if snprintf() fails

a18818e

Get effective UID at startup

8b8b49b

Saves an (admittedly cheap) system call.

Move waiting on the child to a helper function

6e6739a

No functional change intended.

agent: Move exec to helper function

0cf051d

This will be used by tests later. No functional change intended.

Move closing fds to helper function

79bc074

This will be used by tests later. No functional change intended.

Move basename handling to common function

b83c05c

This also fixes a bug: basename can mutate its argument, so a copy must be passed to it.

Use fork()/exec() in unit test code

fc921e1

This makes the unit test code more like the actual code used by end-users, and therefore makes the tests more accurate.

DemiMarie force-pushed the no-close-low-fd branch from e123ed7 to c7a7826 on January 11, 2025 21:48

marmarek reviewed Jan 12, 2025
