The size of num_samples greatly affects the inference speed of the model #114
-
Thank you for this job! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
You're right. More samples will typically improve performance but will require more GPU memory/inference time. As this is autoregressive generation, I am afraid I am not aware of a way to draw more samples without incurring this cost. |
Beta Was this translation helpful? Give feedback.
You're right. More samples will typically improve performance but will require more GPU memory/inference time. As this is autoregressive generation, I am afraid I am not aware of a way to draw more samples without incurring this cost.