Skip to content

Add Compilation Flag for enabling/disabling clipping, and tune number of blocks for mscclpp allreduce8 #1607

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 7 commits into
base: develop
Choose a base branch
from

Conversation

isaki001
Copy link
Contributor

Details

Do not mention proprietary info or link to internal work items in this PR.

Work item: "Internal", or link to GitHub issue (if applicable).

What were the changes?
Added performance optmization in mscclpp by tuning the number of blocks and disabling clipping by default.

Why were the changes made?
observed significant regression between current mscclpp commit in rccl, and prior mscclpp commit.

How was the outcome achieved?
Compilation flag that is set by default to false, prevents clipping routine from wrapping each addition. Additionally, the number of blocks in allreduce8 were increased from 512 to 1024.

Additional Documentation:
Disabling clipping has impact on correctness but may be necessary for particular workloads.

Approval Checklist

Do not approve until these items are satisfied.

  • Verify the CHANGELOG has been updated, if
    • there are any NCCL API version changes,
    • any changes impact library users, and/or
    • any changes impact any other ROCm library.

Copy link
Contributor

@nusislam nusislam left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The PR looks good. Can be merged when the CI passes.

@isaki001
Copy link
Contributor Author

@nusislam I haven't merged yet since I have added a few commits since your approval. Please let me know if there are any more reviews

@nusislam
Copy link
Contributor

@nusislam I haven't merged yet since I have added a few commits since your approval. Please let me know if there are any more reviews

I would suggest merging this one after #1615

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants