HubEvent subscription responses are out of order #2143

BlinkyStitt · 2024-07-09T23:34:57Z

What is the bug?

There is something wrong with how hubble is sending HubEvents to a subscription. They are supposed to be ordered by id, but sometimes they are being sent out of order.

I mad

How can it be reproduced? (optional)

I made these two simple apps that only subscribe and then log when messages are out of order: https://gist.github.com/BlinkyStitt/619706df5aac39e601ff0b5e6a85e88b. I can move it into a proper repo with the protobuf files if you need me to.

The node one runs fine, but the rust one throws errors very quickly. At first we thought this meant there was a bug in the rust library. But when I captured some packets and printed them out with wireshark, I can see the events out of order on the wire.

packet 30 has a message with id 452947924307972
packet 31 has a message with id 452947924307971

capture.pcap.zip

Even though this node gist doesn't see the bug, our production node code does. In fact, one time it saw an event id out of order by more than 1 million.

The text was updated successfully, but these errors were encountered:

sds · 2024-07-13T03:47:01Z

Thanks for the report!

For anyone else running into this: the underlying cause is due to a quirk of how async events are handled in the combination of Rust + TypeScript that Hubble uses. It is more likely to be hit in hubs under high load.

The fix is likely not quick/easy—it would require porting the entire Hubble gRPC server to Rust as well—a large undertaking.

For that reason, we recommend anyone who processes events should:

Not reject an event just because it has an earlier event ID than the latest event ID you've seen so far
Use a "recently seen" cache to avoid double processing events, instead of relying on the event ID

FWIW, Warpcast uses the cache technique to avoid double processing.

github-actions bot added the s-triage Needs to be reviewed, designed and prioritized label Jul 9, 2024

sds added s-ready Ready to be picked up and removed s-triage Needs to be reviewed, designed and prioritized labels Jul 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HubEvent subscription responses are out of order #2143

HubEvent subscription responses are out of order #2143

BlinkyStitt commented Jul 9, 2024 •

edited

Loading

sds commented Jul 13, 2024

HubEvent subscription responses are out of order #2143

HubEvent subscription responses are out of order #2143

Comments

BlinkyStitt commented Jul 9, 2024 • edited Loading

sds commented Jul 13, 2024

BlinkyStitt commented Jul 9, 2024 •

edited

Loading