Type Conversions in Layer Norm Fusion #17281
Conversation
// shapes of scale and bias must match. If a conversion to the type of the
// input is the only user of the output, set the output to the conversion.
// Similarly, if one of the users of the scale/bias is a conversion to the
// type of the bias/scale, set the scale/bias to the conversion.
if (instr->user_count() == 1 && |
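For readers skimming the diff, here is a minimal sketch of the rule the comment above describes. It is illustrative only, not the PR's code: EffectiveOutput is a hypothetical helper, and the include paths assume the current XLA source layout.

#include "xla/hlo/ir/hlo_instruction.h"
#include "xla/hlo/ir/hlo_opcode.h"
#include "xla/shape_util.h"

namespace xla {

// Sketch: if the instruction's sole user is a convert back to the input's
// element type, treat that convert as the effective output of the fused
// layer norm; otherwise keep the instruction itself.
HloInstruction* EffectiveOutput(HloInstruction* instr,
                                const HloInstruction* input) {
  if (instr->user_count() == 1) {
    HloInstruction* user = instr->users()[0];
    if (user->opcode() == HloOpcode::kConvert &&
        ShapeUtil::SameElementType(user->shape(), input->shape())) {
      return user;
    }
  }
  return instr;
}

}  // namespace xla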
I am nervous about doing an incorrect rewrite based on these conversions you allow. What if the user, e.g., converts the input of the layer norm from f32 to s2 and does all the layer norm logic in s2 before casting back to f32? It would be incorrect, I think, to rewrite this to a full-precision cuDNN layer norm. Can we at least check that all the types are floats with at least bf16 precision?
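One way to express the check being asked for, as a hedged sketch: HasAtLeastBf16FloatType is a hypothetical helper name, not code from the PR, and the include paths are assumptions about the XLA source layout.

#include "xla/primitive_util.h"
#include "xla/shape.h"

namespace xla {

// Sketch of the suggested guard: accept only floating-point element types at
// least as wide as bf16, so the rewrite cannot silently promote a
// deliberately low-precision layer norm to a full-precision cuDNN call.
bool HasAtLeastBf16FloatType(const Shape& shape) {
  const PrimitiveType type = shape.element_type();
  return primitive_util::IsFloatingPointType(type) &&
         primitive_util::BitWidth(type) >= primitive_util::BitWidth(BF16);
}

}  // namespace xla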
for (HloInstruction* scale_user : scale->users()) {
  if (scale_user->opcode() == HloOpcode::kConvert &&
      ShapeUtil::SameElementType(scale_user->shape(), bias->shape())) {
    scale = scale_user;
    break;
  }
}
Why check if there is a convert user? This user might not even be part of the layer norm.
This looks good to me. As we discussed offline, you want to do some more testing on models first, so I'll wait until you do that before approving; otherwise this may get prematurely merged.
Approving, as you confirmed the testing did not find issues.
Imported from GitHub PR openxla/xla#17281

Enables the fusion of layer norm graphs with type conversions of input, scale and bias.

Copybara import of the project:

-- 2880fde5f71ad1aba23651ff866fd00becc706e5 by Philipp Hack <[email protected]>:

Layer norm fusion with type conversions of input, scale and bias.

-- 898f002f419909f367e6f9eedc72c61f4c73d201 by Philipp Hack <[email protected]>:

Layer norm fusion with type conversions of input, scale and bias.

Merging this change closes #17281

FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#17281 from philipphack:u_layer_convert_xla 898f002f419909f367e6f9eedc72c61f4c73d201
PiperOrigin-RevId: 683664734