Improve aggressive factor propagation strategy in Shardy. There are two main differences from `BasicFactorPropagation`. #28

copybara-service · 2024-07-25T03:22:09Z

Improve aggressive factor propagation strategy in Shardy. There are two main differences from BasicFactorPropagation.

Difference 1

BasicFactorPropagation propagates the same sharding axes to all the tensors
along a factor. This strategy can propagate different sharding axes to
different tensors. For example, Tensors T0, T1, T2 contains Factor F0. T0/F0
is already sharded along ["a", "b"], and "b" is already used by T2 ("b" can be
explicitly replicated, or it is used to shard another factor).
BasicFactorPropagation propagates ["a"] to both T1/F0 and T2/F0, while this
strategy propagates ["a", "b"] to T1/F0 and ["a"] to T2/F0, respectively.

Difference 2

BasicFactorPropagation is conservative in terms of conflicts across
factors. The overlapped axis between factors cannot be propagated. This
strategy is more aggressive by allowing the overlapped axis being propagated
along different factors if there is no overlapped axis in the result
shardings.

…wo main differences from `BasicFactorPropagation`. ### Difference 1 `BasicFactorPropagation` propagates the same sharding axes to all the tensors along a factor. This strategy can propagate different sharding axes to different tensors. For example, Tensors T0, T1, T2 contains Factor F0. T0/F0 is already sharded along ["a", "b"], and "b" is already used by T2 ("b" can be explicitly replicated, or it is used to shard another factor). `BasicFactorPropagation` propagates ["a"] to both T1/F0 and T2/F0, while this strategy propagates ["a", "b"] to T1/F0 and ["a"] to T2/F0, respectively. ### Difference 2 `BasicFactorPropagation` is conservative in terms of conflicts across factors. The overlapped axis between factors cannot be propagated. This strategy is more aggressive by allowing the overlapped axis being propagated along different factors if there is no overlapped axis in the result shardings. PiperOrigin-RevId: 657641564

copybara-service bot assigned ZixuanJiang Jul 25, 2024

copybara-service bot force-pushed the test_655675663 branch from c7c5d33 to 7a3f6c1 Compare July 30, 2024 17:21

copybara-service bot force-pushed the test_655675663 branch from 7a3f6c1 to 2eeee20 Compare July 30, 2024 17:24

copybara-service bot merged commit 2eeee20 into main Jul 30, 2024

copybara-service bot deleted the test_655675663 branch July 30, 2024 17:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve aggressive factor propagation strategy in Shardy. There are two main differences from `BasicFactorPropagation`. #28

Improve aggressive factor propagation strategy in Shardy. There are two main differences from `BasicFactorPropagation`. #28

copybara-service bot commented Jul 25, 2024 •

edited

Loading

Improve aggressive factor propagation strategy in Shardy. There are two main differences from BasicFactorPropagation. #28

Improve aggressive factor propagation strategy in Shardy. There are two main differences from BasicFactorPropagation. #28

Conversation

copybara-service bot commented Jul 25, 2024 • edited Loading

Difference 1

Difference 2

Improve aggressive factor propagation strategy in Shardy. There are two main differences from `BasicFactorPropagation`. #28

Improve aggressive factor propagation strategy in Shardy. There are two main differences from `BasicFactorPropagation`. #28

copybara-service bot commented Jul 25, 2024 •

edited

Loading