
Add packet batch trigger for better packet handling #60

Draft: wants to merge 2 commits into base: dev from packet-batch-callback

Conversation

@jagerman (Member) commented Nov 2, 2023

This adds an optional packet batch callback to Endpoint that allows Endpoint to signal the application when it is done processing a set of incoming packets -- either because it has processed them all, or because it hit the max-recv-per-loop limit (64).

This is designed to allow an application to deal with packets, especially datagrams, more efficiently in batches: the application can use the datagram handler to collect incoming datagrams, then use the batch callback to process the accumulated datagrams in one go. Because the batch callback always fires before libquic goes back to (potentially blocking) waiting to read new packets, the application can do batch processing without resorting to timers or polling for packet handling.

This is particularly important for Lokinet, where we take the packet in the callback and transfer it to a job in the Lokinet main loop thread to process it there: doing this one packet at a time is not likely to scale well because of the sheer number of jobs that would have to be put on the logic thread event queue; by batching them into groups of up to 64 packets at a time we ought to be able to do considerably better.

@jagerman jagerman force-pushed the packet-batch-callback branch from b7af646 to 8a5b63b Compare November 2, 2023 21:19
@jagerman jagerman requested review from dr7ana and tewinget November 2, 2023 21:19
Comment on lines 40 to 41
```cpp
using receive_callback_t = std::function<void(const Packet& pkt)>;
using receive_batch_callback_t = std::function<void()>;
```
Collaborator

I have a knee-jerk reaction to suffixing with {}_t now and you only have yourself to blame

@@ -103,6 +113,8 @@ namespace oxen::quic

```cpp
event_ptr rev_ = nullptr;
receive_callback_t receive_callback_;
receive_batch_callback_t receive_callback_batch_;
```
Collaborator

Maybe batch_processing_cb ?

src/udp.cpp Outdated
Comment on lines 135 to 137
```cpp
if (self.pending_receive_batch_)
{
    self.pending_receive_batch_ = false;
    if (self.receive_callback_batch_)
        self.receive_callback_batch_();
}
```
@dr7ana (Collaborator) commented Nov 3, 2023

I'm wondering about this boolean (self.pending_receive_batch_). Few questions/thoughts we can chat about later:

  1. I feel like there shouldn't be a case where self.pending_receive_batch_ would be true and self.receive_callback_batch_ would not be set? If the user doesn't provide a batch processing cb, I would guess that the batch processing logic should be avoided entirely? Though that boolean is flipped to true at line 206 in ::process_packet, implying it's flipped and checked regardless of whether batch processing is "active" for this endpoint or not.
  2. What if we used the set-ness of the callback as an indicator of whether batch processing is "active" or "inactive" for this endpoint? Then the boolean could be a "we have a full batch" signal of sorts
  3. We could wrap the callback in a lightweight struct that has two members, a callback and an int/size_t specifying the size of the batch? The size parameter could even default to something sensible at or below the loop limit of 64 packets

Follow-on... it would be cool to speed-test this with different batch sizes to see what works best.

Follow-on follow-on... maybe even approximate grid search cross-validation across all our configurable parameters...

```cpp
auto batch_counter_before_final = f.get();
REQUIRE(data_counter == 31);
REQUIRE(batch_counter_before_final > batches_before_flood);
REQUIRE(batch_counter == batch_counter_before_final + 1);
```
@dr7ana (Collaborator) commented Nov 3, 2023

Interestingly, the last REQUIRE failed on a few debug CI builds.

@jagerman jagerman force-pushed the packet-batch-callback branch from 8a5b63b to 025fe0e Compare November 3, 2023 18:04
@jagerman (Member, Author) commented Nov 3, 2023

I think the accept-const-string change here is actually buggy: libquic ends up storing a string_view to be used when it can actually send the packet, but we have no guarantee that the string will still be around at that point.

This is dropped now.

This adds an optional packet post-receive callback to Endpoint that
allows Endpoint to signal the application when it is done processing a
set of incoming packets -- either because it has processed them all, or
because it hit the max-recv-per-loop limit (64).

This is designed to allow an application to more efficiently deal with
packets, especially datagrams, in batches: by using this an application
can use the datagram handler to collect incoming datagrams, then use the
post-receive callback to process accumulated datagrams in one go.
Because the post-receive callback always fires immediately *before*
libquic goes back to potentially blocking waiting to read new packets,
this means the application can do batch processing without needing to
resort to timers or polling for packet handling.

This is particularly important for Lokinet, where we take the packet in
the callback and transfer it to a job in the Lokinet main loop thread to
process it there: doing this one packet at a time is not likely to scale
well because of the sheer number of jobs that would have to be put on
the logic thread event queue; by batching them into groups of up to 64
packets at a time we ought to be able to do considerably better, and by
triggering the processing based on the post-receive trigger we ensure we
don't introduce unnecessary delays in terms of when packets get
processed.
@jagerman jagerman force-pushed the packet-batch-callback branch from 025fe0e to 8318b81 Compare November 3, 2023 21:21
@jagerman jagerman marked this pull request as draft November 3, 2023 21:29
@jagerman (Member, Author) commented Nov 3, 2023

Keeping as draft until I've tried making this fit into Lokinet.
