DeepSpeed initializes the entire model on each GPU at the beginning #3154

Answered by zarzen
floatingbigcat asked this question in Q&A
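You need to construct your model inside the deepspeed.zero.Init() context, so that the parameters are partitioned across GPUs as they are created instead of the full model being materialized on every GPU: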

with deepspeed.zero.Init():
    model = ....
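
To make that more concrete, here is a minimal sketch of one way to wrap model construction. The helper names `zero_init_context` and `build_model`, and the `model_factory` argument, are hypothetical and not part of DeepSpeed; only `deepspeed.zero.Init()` comes from the answer above. It assumes you are training with ZeRO stage 3 and launching through DeepSpeed in the usual way, and it is untested against any particular DeepSpeed version.

```python
import contextlib


def zero_init_context(enabled=True):
    """Return deepspeed.zero.Init() when enabled, otherwise a no-op context.

    Hypothetical helper: DeepSpeed is imported lazily so the same code path
    can also run on a machine without DeepSpeed (e.g. for a quick CPU check).
    """
    if not enabled:
        return contextlib.nullcontext()
    import deepspeed  # only needed when ZeRO initialization is requested
    return deepspeed.zero.Init()


def build_model(model_factory, use_zero_init=True):
    """Call model_factory() inside the chosen context and return the model.

    model_factory is any zero-argument callable that constructs the model,
    so the construction itself happens inside the `with` block.
    """
    with zero_init_context(use_zero_init):
        return model_factory()
```

A call would then look something like `model = build_model(lambda: MyModel(config))`, where `MyModel` and `config` stand in for your own model class and settings. The important part is that the constructor runs inside the `with` block; a model built beforehand and passed in would already be fully allocated on each process.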

Answer selected by tjruwase