c18n: Fix rtld_bind reentry bug #2134

dpgao · 2024-06-29T13:48:16Z

~~Just for CI. Do not merge yet.~~

This should fix #2130.

jrtc27 · 2024-07-01T17:18:04Z

libexec/rtld-elf/rtld_c18n.c

+	 * stack be used because the signal handler uses the recorded size of
+	 * the stack to determine whether it has been allocated.
+	 */
+	static char dummy_stk;


This is ugly, surely there's a better way. Where does this end up being used to determine that the compartment is RTLD, anyway? Isn't the trusted stack tracking the compartment ID?

Isn't the trusted stack tracking the compartment ID?

This is generally true except at certain points in the trampoline where the trusted stack becomes temporarily out-of-sync with the currently installed untrusted stack.

The signal handler needs to handle such situations and makes a few guesses about the actual compartment that the current untrusted stack belongs to. It does so by checking whether the current untrusted stack is a subset of one of the candidate compartments' stack. Hence the dummy stack must be tagged and have positive length to prevent compartments from impersonating as RTLD.

"makes a few guesses about the actual compartment that the current untrusted stack belongs to" doesn't fill me with confidence as to the security of this mechanism

The security argument definitely has lots of room for improvement. But at least signal handling should be functionally correct now.

jrtc27 · 2024-07-01T17:20:41Z

libexec/rtld-elf/rtld_c18n.h

+static inline struct trusted_frame *
+pop_dummy_rtld_trusted_frame(struct trusted_frame *tf)
+{
+	assert(get_trusted_stk() == tf);


So why not just make it implicit?...

I want to retain the assertion because it would be very hard to debug if something goes wrong here.

What does asserting that the input is the one valid value achieve? The function can just use get_trusted_stk() itself, it doesn't need an argument.

My main worry is that somebody pushed to the trusted stack and forgot to pop. The assertion helps to catch that.

And how does this code help catch that?

For example, if somebody pushed but forgot to pop, then we will have get_trusted_stk() < tf.

As in, this assertion checks that nobody's made unbalanced push/pops to the trusted stack between the pair of push and pop that I am doing right now.

jrtc27 · 2024-07-01T17:21:55Z

libexec/rtld-elf/rtld_c18n.h

+ * current compartment is RTLD must be pushed.
+ */
+static inline struct trusted_frame *
+push_dummy_rtld_trusted_frame(struct trusted_frame *tf)


The only case when this isn't get_trusted_stk() is when you first create the stack. Implicitly use get_trusted_stk(), and then upon creation you just need to set_trusted_stk() before calling this.

jrtc27 · 2024-07-01T17:52:28Z

Please also add a regression test to cheribsdtest_ifunc.c

dpgao · 2024-07-04T10:03:45Z

@jrtc27 I've added a regression test. Regarding the use of implicit arguments, I'm still in favour of explicit arguments because then we are able to track canonical state of the trusted stack instead of relying on a special register, which might be corrupted between a push and a pop.

The check for benchmark ABI was erroneously added and conflicts with benchmark ABI-only code below.

This is a common pattern and warrants shared code.

The dummy frame indicates that the current compartment is RTLD. Because _rtld_bind may cause domain switches (e.g., via a ifunc resolver which calls a lazily-bound symbol), a frame that correctly identifies the current compartment as RTLD is necessary. The signal handler has also been updated to handle the temporary inconsistency between the untrusted stack and the callee field of the topmost truted frame that result from the changes above.

The current method of popping the topmost trusted frame and restoring it at the end of the function does not correctly identify the current compartment as RTLD. Instead, push a dummy frame that does so.

This prevents a compartment from being able to set its stack to NULL and then trigger a signal that would cause it to be mis-identified as RTLD.

The dummy frame indicates that the current compartment is RTLD. Because _rtld_tlsdesc_dynamic may cause domain transitions (e.g., locking), a frame that correctly identifies the current compartment as RTLD is necessary.

dpgao mentioned this pull request Jun 30, 2024

Invalid permissions for mapped object in pkg64cb with globally enabled c18n #2130

Closed

dpgao force-pushed the dpgao-patch-1 branch from a9cc90c to b63954f Compare June 30, 2024 21:43

dpgao changed the title ~~[WIP] c18n: Fix rtld_bind reentry bug~~ c18n: Fix rtld_bind reentry bug Jun 30, 2024

dpgao force-pushed the dpgao-patch-1 branch from b63954f to 593037f Compare July 1, 2024 09:28

dpgao requested a review from jrtc27 July 1, 2024 13:50

dpgao force-pushed the dpgao-patch-1 branch from 593037f to 328b2d5 Compare July 1, 2024 16:58

jrtc27 reviewed Jul 1, 2024

View reviewed changes

dpgao mentioned this pull request Jul 1, 2024

cheribsdtest: Add test for cross-object IFUNC invocation #2138

Closed

dpgao force-pushed the dpgao-patch-1 branch 2 times, most recently from 8dab7ed to 84897f7 Compare July 2, 2024 09:43

dstolfa mentioned this pull request Jul 3, 2024

[RFC] c18n: New {get,set}context APIs for libunwind #2122

Closed

dpgao requested a review from jrtc27 July 4, 2024 09:57

dpgao and others added 8 commits July 4, 2024 15:34

cheribsdtest: Add test for cross-object IFUNC invocation

5deb157

c18n: Fix typo in ifdef expression

337b055

The check for benchmark ABI was erroneously added and conflicts with benchmark ABI-only code below.

c18n: Factor out code that push/pop dummy trusted frames

4b13173

This is a common pattern and warrants shared code.

c18n: Reduce platform-specific code in signal handling

510f645

c18n: Improve stack consistency during stack resolution

8cc83d7

The current method of popping the topmost trusted frame and restoring it at the end of the function does not correctly identify the current compartment as RTLD. Instead, push a dummy frame that does so.

c18n: Use a tagged dummy stack for Morello purecap ABI

b3b8d81

This prevents a compartment from being able to set its stack to NULL and then trigger a signal that would cause it to be mis-identified as RTLD.

c18n: Push dummy frame during _rtld_tlsdesc_dynamic

905deea

The dummy frame indicates that the current compartment is RTLD. Because _rtld_tlsdesc_dynamic may cause domain transitions (e.g., locking), a frame that correctly identifies the current compartment as RTLD is necessary.

dpgao force-pushed the dpgao-patch-1 branch from 84897f7 to 905deea Compare July 5, 2024 10:18

dpgao merged commit 905deea into dev Jul 5, 2024
29 checks passed

dpgao deleted the dpgao-patch-1 branch July 5, 2024 14:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

c18n: Fix rtld_bind reentry bug #2134

c18n: Fix rtld_bind reentry bug #2134

dpgao commented Jun 29, 2024 •

edited

Loading

jrtc27 Jul 1, 2024

dpgao Jul 1, 2024

jrtc27 Jul 1, 2024

dpgao Jul 1, 2024

jrtc27 Jul 1, 2024

dpgao Jul 1, 2024

jrtc27 Jul 1, 2024

dpgao Jul 1, 2024

jrtc27 Jul 1, 2024

dpgao Jul 1, 2024

dpgao Jul 1, 2024 •

edited

Loading

jrtc27 Jul 1, 2024

jrtc27 commented Jul 1, 2024

dpgao commented Jul 4, 2024

c18n: Fix rtld_bind reentry bug #2134

c18n: Fix rtld_bind reentry bug #2134

Conversation

dpgao commented Jun 29, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dpgao Jul 1, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jrtc27 commented Jul 1, 2024

dpgao commented Jul 4, 2024

dpgao commented Jun 29, 2024 •

edited

Loading

dpgao Jul 1, 2024 •

edited

Loading