Add leader election to the backend #847
Conversation
operationsScanner.Join()
go func() {
	sig := <-signalChannel
Please just use signal.NotifyContext on this thing? Didn't we already talk about this?
I want to log the signal received along with a timestamp so I can observe how much longer it takes the process to shut down. I don't see any big advantage to signal.NotifyContext; it's a convenience wrapper that does the same thing I'm doing.
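For context, the pattern being defended here looks roughly like this (a sketch; the logger setup and log message are illustrative, not necessarily the exact code in this PR):

package main

import (
	"context"
	"log/slog"
	"os"
	"os/signal"
	"syscall"
)

func main() {
	logger := slog.Default()

	// Buffered channel so the runtime never blocks delivering the signal.
	signalChannel := make(chan os.Signal, 1)
	signal.Notify(signalChannel, syscall.SIGINT, syscall.SIGTERM)

	ctx, cancel := context.WithCancel(context.Background())
	defer cancel()

	go func() {
		// Unlike NotifyContext, this keeps the concrete os.Signal value,
		// so the log line can record which signal arrived; the timestamp
		// comes from the logger itself.
		sig := <-signalChannel
		logger.Info("Caught signal, shutting down", "signal", sig.String())
		cancel()
	}()

	// ... run the backend until ctx is cancelled ...
	<-ctx.Done()
}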
Is signalChannel just for your logger? Then the following is simpler:
ctx, cancel := signal.NotifyContext(context.Background(), syscall.SIGINT, syscall.SIGTERM)
defer cancel()
go func() {
	<-ctx.Done()
	logger.Info("Caught interrupt signal")
}()
And the only difference is that you can't tell whether you got SIGINT or SIGTERM (why do you care? In production, the kubelet only ever sends SIGTERM or SIGKILL (ref), so this seems like a distinction without much meaning). The code is much less Rube Goldberg. I respect that you consider logging the specific signal important, but in a decade of writing k8s-native applications it has never been useful or required.
As for the big advantage: using a more complex cascade of handlers and goroutines when it isn't necessary is a real downside. Making the simple choice every time you can helps keep this maintainable ten years down the line.
Allows the backend deployment to be scaled up but still have only one instance polling at a time.
Because we can, now that aro-hcp-backend uses leader election.
What this PR does
Adds leader election to the backend deployment using the k8s.io/client-go/tools/leaderelection module so it can be horizontally scaled but still restrict active polling to one pod at a time.
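For reviewers unfamiliar with the module, the wiring looks roughly like this (a sketch under assumptions; the lease name, identity, timings, and function name are illustrative rather than the exact code in this PR):

package backend

import (
	"context"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/leaderelection"
	"k8s.io/client-go/tools/leaderelection/resourcelock"
)

func runWithLeaderElection(ctx context.Context, clientset kubernetes.Interface, namespace, podName string) {
	lock := &resourcelock.LeaseLock{
		LeaseMeta: metav1.ObjectMeta{
			Name:      "aro-hcp-backend-leader", // illustrative lease name
			Namespace: namespace,
		},
		Client: clientset.CoordinationV1(),
		LockConfig: resourcelock.ResourceLockConfig{
			Identity: podName, // typically the pod name, so each replica is distinct
		},
	}

	leaderelection.RunOrDie(ctx, leaderelection.LeaderElectionConfig{
		Lock:            lock,
		ReleaseOnCancel: true,
		LeaseDuration:   15 * time.Second,
		RenewDeadline:   10 * time.Second,
		RetryPeriod:     2 * time.Second,
		Callbacks: leaderelection.LeaderCallbacks{
			OnStartedLeading: func(ctx context.Context) {
				// Only the current leader runs the polling loop; start it here.
			},
			OnStoppedLeading: func() {
				// Lost the lease: stop polling and decide whether to exit
				// or wait to be re-elected.
			},
		},
	})
}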
Jira: No Jira, added by request
Link to demo recording:
Special notes for your reviewer
The leader election module emits unstructured log messages through klog, so I had to catch these messages and try to adapt them to the standard library structured logger, which required some futzing. Please enlighten me if there's a better way to do this.
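One commonly used bridge for this (a sketch, assuming klog/v2 and go-logr/logr v1.3.0 or newer are available; not necessarily what this PR ended up doing) is to hand klog a logr.Logger backed by the slog handler:

package backend

import (
	"log/slog"
	"os"

	"github.com/go-logr/logr"
	"k8s.io/klog/v2"
)

func routeKlogThroughSlog() {
	slogger := slog.New(slog.NewJSONHandler(os.Stdout, nil))

	// logr.FromSlogHandler wraps an slog.Handler as a logr.Logger, and
	// klog.SetLogger redirects klog's output (including the leader election
	// module's messages) through that logger instead of klog's own format.
	klog.SetLogger(logr.FromSlogHandler(slogger.Handler()))
}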
I went ahead and bumped the backend replica count to 2 just to prove it works in dev environments.
We should add a /healthz endpoint to the backend at some point. The leader election module offers healthz integration for when it fails to renew the leader lease, but that's a pull request for another day.
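For reference, that integration looks roughly like this (a sketch; the endpoint path, grace period, and mux are illustrative): the module's HealthzAdaptor is created, exposed through an HTTP handler, and wired into the LeaderElectionConfig via its WatchDog field.

package backend

import (
	"net/http"
	"time"

	"k8s.io/client-go/tools/leaderelection"
)

func registerLeaderElectionHealthz(mux *http.ServeMux) *leaderelection.HealthzAdaptor {
	// The adaptor reports an error once the process holds the lease but has
	// failed to renew it for longer than the given grace period.
	watchdog := leaderelection.NewLeaderHealthzAdaptor(20 * time.Second)

	mux.HandleFunc("/healthz", func(w http.ResponseWriter, r *http.Request) {
		if err := watchdog.Check(r); err != nil {
			http.Error(w, err.Error(), http.StatusInternalServerError)
			return
		}
		w.WriteHeader(http.StatusOK)
	})

	// The returned adaptor would be set as the WatchDog field of the
	// LeaderElectionConfig so the elector can feed it renewal status.
	return watchdog
}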