
gateway: ensure llb digests are deterministic when sent by frontends #5517

Open

jsternberg wants to merge 1 commit into master
Conversation

jsternberg
Collaborator

This ensures that different valid protobuf serializations sent by frontends are rewritten into digests that are normalized for the buildkit solver.

The most recent example: older frontends generate their protobuf with gogo, while newer buildkit uses the google protobuf library. The two produce different serializations, which causes the solver to treat identical operations as different.

This is done by rewriting the incoming definition sent by the llb bridge forwarder when a gateway calls solve with a protobuf definition.
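For orientation, here is a minimal sketch of the shape of such a rewrite. The helper name and details are hypothetical, not the PR's code, and it assumes the post-migration Definition field types (Metadata keyed by digest string); a complete version must also rewrite the input digests inside each op, which the review below digs into.

```go
package gateway

import (
	"github.com/moby/buildkit/solver/pb"
	"github.com/opencontainers/go-digest"
	"google.golang.org/protobuf/proto"
)

// normalizeDefinition re-encodes every op with the local protobuf runtime so
// digests are computed from one canonical serialization, and re-keys the
// metadata map to the recomputed digests. Input digests inside each op would
// also need rewriting; see the digestMap sketch later in the thread.
func normalizeDefinition(def *pb.Definition) (*pb.Definition, error) {
	out := &pb.Definition{Metadata: map[string]*pb.OpMetadata{}}
	for _, dt := range def.Def {
		var op pb.Op
		if err := proto.Unmarshal(dt, &op); err != nil {
			return nil, err
		}
		ndt, err := proto.MarshalOptions{Deterministic: true}.Marshal(&op)
		if err != nil {
			return nil, err
		}
		out.Def = append(out.Def, ndt)
		if md, ok := def.Metadata[string(digest.FromBytes(dt))]; ok {
			out.Metadata[string(digest.FromBytes(ndt))] = md
		}
	}
	return out, nil
}
```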

@jsternberg
Collaborator Author

We did some more digging and the byte difference between the two comes down to field order. Google's protobuf library marshals fields according to this code snippet: https://github.com/protocolbuffers/protobuf-go/blob/b98563540c0a4edb38526bcd6e6c97f9fac1f453/internal/order/order.go#L21-L41

It puts every oneof field after all non-oneof fields, while gogo's ordering appears to be purely numerical. The Op message has a oneof at field 3 and a normal field at field 11, so the two libraries order them differently and the serialized bytes diverge.
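For reference, that ordering rule restated as a standalone comparator (a paraphrase of the behavior the linked snippet describes, not the library's actual code; the real implementation also special-cases details like synthetic oneofs):

```go
package fieldorder

import "google.golang.org/protobuf/reflect/protoreflect"

// fieldLess paraphrases the ordering described above: fields outside any
// oneof sort before oneof members, and ties within each group break by
// field number. Under this rule a oneof member at field 3 serializes after
// a plain field at field 11, while gogo's purely numeric ordering emits
// field 3 first, which is why the two libraries' bytes differ.
func fieldLess(x, y protoreflect.FieldDescriptor) bool {
	xOneof := x.ContainingOneof() != nil
	yOneof := y.ContainingOneof() != nil
	if xOneof != yOneof {
		return !xOneof // non-oneof fields come first
	}
	return x.Number() < y.Number()
}
```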

@@ -760,6 +760,17 @@ func (lbf *llbBridgeForwarder) Solve(ctx context.Context, req *pb.SolveRequest)
}
}

+	if req.Definition != nil {
+		// Rewrite digests in the definition. This ensures the digests are validated
Member

Why is this in the gateway and not in the solver loader, where we already have op remarshal logic for source policies?

Collaborator Author

I'll update the logic there. I think it also needs to be slightly different: the current code does a conditional rewrite, while here we'll need to always remarshal.
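For contrast, a minimal sketch of the unconditional version (the name is hypothetical; the existing source-policy path only remarshals when a policy actually mutated the op):

```go
package loader

import (
	"github.com/moby/buildkit/solver/pb"
	"github.com/opencontainers/go-digest"
	"google.golang.org/protobuf/proto"
)

// recomputeDigest always re-encodes the op with the local runtime before
// hashing, with no "did a policy mutate it?" precondition, so the digest
// never depends on whichever library serialized the incoming bytes.
func recomputeDigest(op *pb.Op) (digest.Digest, error) {
	dt, err := proto.MarshalOptions{Deterministic: true}.Marshal(op)
	if err != nil {
		return "", err
	}
	return digest.FromBytes(dt), nil
}
```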

@@ -67,7 +67,7 @@ func (m *DiffOp) Marshal(ctx context.Context, constraints *Constraints) (digest.

proto.Op = &pb.Op_Diff{Diff: op}

-	dt, err := deterministicMarshal(proto)
+	dt, err := proto.Marshal()
Member

Why these changes in marshal? Does it still call the deterministic marshal internally?

Collaborator Author

It's still deterministic, but the helper was being defined and called in multiple packages. I also figured that keeping any marshal function that calls MarshalVT, when we know that's an improper way to marshal this data, would not be a good idea.

To be fair, I made this change while adding a third location that did the same thing, so I had a bigger interest in removing the duplication. It's now back to two locations.

I also debated renaming the function to MarshalDeterministic(), but that meant changing a few call sites and I didn't think it was worth it.
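A sketch of that design choice (the wrapper type here is hypothetical; in the PR the method lives on pb.Op itself):

```go
package llbmarshal

import (
	"github.com/moby/buildkit/solver/pb"
	"google.golang.org/protobuf/proto"
)

// Op exposes a single Marshal that is deterministic internally, so call
// sites cannot drift toward a non-canonical encoder such as MarshalVT by
// accident.
type Op struct{ *pb.Op }

func (op Op) Marshal() ([]byte, error) {
	return proto.MarshalOptions{Deterministic: true}.Marshal(op.Op)
}
```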


dm.mapping[dgst] = newDgst
if dgst != newDgst {
// Ensure the indices also map to the new digest.
Member

This looks a bit weird. Why do these containers mix old and new digests?

There should be one container to check whether a digest has already been converted, and a lookup from old digest to op for those that have not. I think this is also needed to avoid loops, which would otherwise be possible by mixing old and new digests in the definition.
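A sketch of the suggested structure (type and field names are hypothetical, and it assumes the post-migration pb.Input.Digest is a plain string):

```go
package loader

import (
	"fmt"

	"github.com/moby/buildkit/solver/pb"
	"github.com/opencontainers/go-digest"
	"google.golang.org/protobuf/proto"
)

// digestMap keeps the two containers separate: converted maps old digests to
// new ones and doubles as the "already done" check, while ops is keyed by
// old digests only. Checking converted before recursing breaks the loops
// that mixing old and new digests in one definition could otherwise create.
type digestMap struct {
	converted map[digest.Digest]digest.Digest
	ops       map[digest.Digest]*pb.Op
}

func (dm *digestMap) convert(dgst digest.Digest) (digest.Digest, error) {
	if newDgst, ok := dm.converted[dgst]; ok {
		return newDgst, nil
	}
	op, ok := dm.ops[dgst]
	if !ok {
		return "", fmt.Errorf("unknown op digest %s", dgst)
	}
	// Rewrite inputs first so this op's new digest covers its inputs' new
	// digests. A full implementation would also detect genuine cycles among
	// unconverted ops, which are invalid in a definition anyway.
	for _, inp := range op.Inputs {
		newInput, err := dm.convert(digest.Digest(inp.Digest))
		if err != nil {
			return "", err
		}
		inp.Digest = string(newInput)
	}
	dt, err := proto.MarshalOptions{Deterministic: true}.Marshal(op)
	if err != nil {
		return "", err
	}
	newDgst := digest.FromBytes(dt)
	dm.converted[dgst] = newDgst
	return newDgst, nil
}
```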

}

index := dm.indexByDigest[dgst]
dm.out.Def[index] = data
Member

Do we need this? We have already parsed the data once, and if we write back the encoded data we need to parse it again. We should only need to marshal to get the new digest; we don't actually need the new bytes.

Collaborator Author

Hm, I'll go back and see if this is an easier way of doing it. I think you're likely right; I'll give it a try.

This ensures different valid protobuf serializations that are sent by
frontends will be rewritten into digests that are normalized for the
buildkit solver.

The most recent example of this is that older frontends would generate
protobuf with gogo and the newer buildkit is using the google protobuf
library. These produce different serializations and cause the solver to
think that identical operations are actually different.

Signed-off-by: Jonathan A. Sternberg <[email protected]>
@jsternberg
Collaborator Author

Modified this PR to be a bit more faithful to the original code. The code that detected whether a mutation happened has been removed in favor of unconditional mutation, but the rest remains faithful to the original so it won't repeat deserialization. I've also added a testdata file with the gogo protobuf serialization to test that digests get recomputed correctly.
