fix: (log_poller backfill) exit on non-rpc err #14114

bukata-sa · 2024-08-14T12:02:09Z

What

Detect the error type properly and exit the backfill on any non-RPC error.

Why

On graceful shutdown, the backfill method treats a context canceled error as an RPC batch-size error, which leads to a critical log and a failed test.
Example log: cl-node-54af8721.log

Resolves Dependencies

#13647

reductionista · 2024-08-14T18:29:28Z

core/chains/evm/logpoller/log_poller.go

+			var rpcErr *client.JsonError
+			if !pkgerrors.As(err, &rpcErr) || rpcErr.Code != jsonRpcLimitExceeded {
+				lp.lggr.Errorw("Unable to query for logs", "err", err, "from", from, "to", to)
+				return err


I initially tried to make this change in Dec 2023, but during review it became clear that this is a somewhat risky change and that there was more to investigate about the right way to fix it. I re-opened it a few weeks ago, and got a more complex fix working that actually detects the error correctly (at least on some RPC servers). This change by itself will not, because it compares against a concrete type that will only match during testing, not in production.

I thought that was ready to merge last week, but unfortunately during review it again became clear that things are more complicated. There are so many different error codes and string formats returned from different RPC servers for the same condition that we'll have to rely mostly on string parsing to differentiate them.

Leaving it as-is at least errs on the side of caution because, even though the logic is flawed, it's just taking preventative measures when it doesn't actually need to in most cases... and reporting a critical error in some cases when it shouldn't. Not taking those preventative measures, or not reporting a critical condition is more risky in production.

Is there a way to fix the flaky test without making this change?

Here is the current PR as it stands:
#11654

reductionista

See comment

cl-sonarqube-production · 2024-08-15T11:35:35Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube

bukata-sa requested a review from a team as a code owner August 14, 2024 12:02

bukata-sa temporarily deployed to sdlc August 14, 2024 12:02 — with GitHub Actions Inactive

bukata-sa requested review from reductionista, andrevmatos, jmank88 and a team and removed request for a team and jmank88 August 14, 2024 12:03

fix: (log_poller backfill) exit on non-rpc err

389875c

bukata-sa force-pushed the fix/backfill_err_exit branch from 14597fb to 389875c Compare August 14, 2024 12:30

bukata-sa temporarily deployed to sdlc August 14, 2024 12:30 — with GitHub Actions Inactive

Update gorgeous-carpets-grab.md

3333386

bukata-sa temporarily deployed to sdlc August 14, 2024 18:19 — with GitHub Actions Inactive

reductionista reviewed Aug 14, 2024

reductionista requested changes Aug 15, 2024

bukata-sa added 2 commits August 15, 2024 10:49

fix

468b2b7

lint

c06aaa8

bukata-sa closed this Aug 23, 2024
