Hello all 👋🏽, I'm super new to W&B in general. I just wanted to make sure I understand how exactly Launch works after reading the docs, and how it can help my team's use case, which is the following:
Use Case: "As an ML engineer, I don't want to always run the training process for our model on my local CPU. I want to make changes to the ML code, send it to Azure Kubernetes Service (AKS) to run the training on fast GPUs, and then see the training metrics whenever I log back into my W&B dashboard (for now, let's say I'm on the SaaS offering)."
Using W&B terminology, I think what I'd need to do to implement this is the following:
Second question: If I go through the UI to create a queue, I notice that under the "Resource" dropdown, there's no option called "Azure". Would that pose an issue for us, or is it safe for us to just go with the "Docker" option?
As I understand it, once a job is dequeued from the queue, then it's the launch agent's responsibility to move it into the appropriate execution environment.
Third question: If that is the case, I'm still scratching my head as to how the machine running my K8s job will be able to authenticate so that it can send training metrics to W&B. In this situation, what's the most secure way for us to set the WANDB_API_KEY environment variable in the Docker container that runs inside AKS? One idea I had is to include it under the environment property when creating the agent configuration, but that's just a guess...
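To make the question concrete, here's a rough sketch of what I'd expect the container side to look like. It assumes the key reaches the container as the WANDB_API_KEY environment variable (for example, injected from a Kubernetes secret, which is my assumption and not something I've confirmed Launch does). The helper name `resolve_wandb_api_key` is hypothetical. As far as I know the wandb client reads WANDB_API_KEY from the environment by itself, so this only adds a fail-fast check before training starts:

```python
import os


def resolve_wandb_api_key(env=None):
    """Return the W&B API key from the environment, or fail fast.

    `env` defaults to os.environ; passing a plain dict makes this easy to test.
    This is a hypothetical helper, not part of the wandb library.
    """
    env = os.environ if env is None else env
    key = env.get("WANDB_API_KEY", "").strip()
    if not key:
        # Failing here is cheaper than discovering the problem mid-training.
        raise RuntimeError(
            "WANDB_API_KEY is not set in the container environment; "
            "metrics could not be sent to W&B."
        )
    return key


if __name__ == "__main__":
    try:
        resolve_wandb_api_key()
        print("WANDB_API_KEY found; the training script can log to W&B.")
    except RuntimeError as err:
        print(f"Startup check failed: {err}")
```

If this is roughly right, my question boils down to which component (queue config, agent config, or the cluster itself) should be responsible for putting that variable into the container.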
Thanks in advance for your insights on this.