-
Notifications
You must be signed in to change notification settings - Fork 90
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] torchtune usecase #260
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please include a README file
Of course, I marked it as "work in progress" |
## For G4dn and other G5, comment out all | ||
# export FI_EFA_USE_DEVICE_RDMA=1 # use for p4d | ||
# export FI_EFA_FORK_SAFE=1 | ||
export FI_LOG_LEVEL=1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
export FI_LOG_LEVEL=1 | |
export FI_LOG_LEVEL=warn |
###### User Variables ##### | ||
########################### | ||
|
||
GPUS_PER_NODE=4 # 4 for G5.12x, 8 for P4/P5 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
GPUS_PER_NODE=4 # 4 for G5.12x, 8 for P4/P5 | |
GPUS_PER_NODE=8 # 4 for G5.12x, 8 for P4/P5 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please consider moving to container.
Left comments
db51efe
to
6002abb
Compare
44e448e
to
1209815
Compare
Issue #, if available:
Description of changes:
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.