Add a path for BPF-accelerated async signal emulation. #3731

khuey · 2024-04-22T01:39:20Z

Starting in kernel 6.10 BPF filters can choose whether or not to trigger the SIGIO behavior for a perf event that becomes readable. We combine that with a hardware breakpoint and a BPF filter that matches the GPRs to produce an accelerated internal breakpoint type that can fast forward through loop iterations to deliver async signals. On one trace this reduced rr's replay overhead by 94%.

This adds a runtime dependency on libbpf and a compile time dependency on clang --target bpf. rr also needs CAP_BPF and CAP_PERFMON to use this feature. Because of all of that, this isn't really suitable for wide use at this point and is instead a CMake feature usebpf. Set -Dusebpf=ON to test it.

(I think we should wait until the kernel side hits Linus's tree to merge this.)

CMakeLists.txt

rocallahan · 2024-05-16T10:06:05Z

src/PerfCounters.cc

+  static struct user_regs_struct* bpf_regs;
+
+  if (!fd_async_signal_accelerator.is_open()) {
+    if (!initialized) {


How about moving this BPF initialization code into its own function?

It feel a bit ugly to be mashing the BPF program's global state in this function. And it's ugly to be mmapping that buffer and then leaking it to the global variable.

How hard would it be to put the BPF program and its state into its own class with proper ownership, and have each ReplaySession hold a shared pointer to an object of that class?

Alright I reorganized this along those lines. The bpf singleton stuff lives in a BpfAccelerator class that's shared between the different PerfCounters instances.

src/PerfCounters.h

src/ReplaySession.cc

src/bpf/async_event_filter.c

src/PerfCounters.cc

rocallahan · 2024-05-27T07:08:08Z

src/PerfCounters.cc

+
+class BpfAccelerator {
+public:
+  static std::shared_ptr<BpfAccelerator> get_or_create();


I was thinking we could just create one BpfAccelerator in ReplaySession and copy the reference when we clone ReplaySessions so we don't need a static variable here.

I'm not convinced this is a great idea. It means moving BpfAccelerator into the header so ReplaySession can get at it. Is that really better than a static singleton?

src/bpf/async_event_filter.c

Starting in kernel 6.10 BPF filters can choose whether or not to trigger the SIGIO behavior for a perf event that becomes readable. We combine that with a hardware breakpoint and a BPF filter that matches the GPRs to produce an accelerated internal breakpoint type that can fast forward through loop iterations to deliver async signals. On one trace this reduced rr's replay overhead by 94%. This adds a runtime dependency on libbpf and a compile time dependency on clang --target bpf. rr also needs CAP_BPF and CAP_PERFMON to use this feature. Because of all of that, this isn't really suitable for wide use at this point and is instead a CMake feature usebpf. Set -Dusebpf=ON to test it.

khuey requested a review from rocallahan April 22, 2024 01:39

khuey force-pushed the bpf_async_signal branch from 7b620cc to 743aafe Compare May 15, 2024 15:46

rocallahan reviewed May 16, 2024

View reviewed changes

khuey force-pushed the bpf_async_signal branch from 61a216b to 9237959 Compare May 26, 2024 19:02

khuey requested a review from rocallahan May 27, 2024 01:39

rocallahan requested changes May 27, 2024

View reviewed changes

khuey force-pushed the bpf_async_signal branch from e272850 to fccb968 Compare May 30, 2024 17:38

khuey requested a review from rocallahan May 30, 2024 17:40

rocallahan approved these changes Jun 3, 2024

View reviewed changes

rocallahan mentioned this pull request Jun 12, 2024

replaying backwards into an executable mmaped region causes rr to crash #3762

Open

khuey force-pushed the bpf_async_signal branch from 013b30d to 1ac134c Compare June 26, 2024 16:38

khuey merged commit e7d9e8f into rr-debugger:master Jun 26, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a path for BPF-accelerated async signal emulation. #3731

Add a path for BPF-accelerated async signal emulation. #3731

khuey commented Apr 22, 2024

rocallahan May 16, 2024

rocallahan May 16, 2024

khuey May 27, 2024

rocallahan May 27, 2024

khuey May 30, 2024

Add a path for BPF-accelerated async signal emulation. #3731

Add a path for BPF-accelerated async signal emulation. #3731

Conversation

khuey commented Apr 22, 2024

rocallahan May 16, 2024

Choose a reason for hiding this comment

rocallahan May 16, 2024

Choose a reason for hiding this comment

khuey May 27, 2024

Choose a reason for hiding this comment

rocallahan May 27, 2024

Choose a reason for hiding this comment

khuey May 30, 2024

Choose a reason for hiding this comment