Move logic from TorchX CLI -> API, so MVAI can call it #955

Sanjay-Ganeshan · 2024-09-11T16:01:32Z

Summary:
MVAI's "light" is synchronous - you can immediately see the logs for jobs you start. Only "fire" is asynchronous.

TorchX's API, since it's generic, always creates jobs that are asynchronous. Therefore, there isn't a built-in interface for "tailing" the stderr of every started process - just for tailing individual replicas of a given role.

The TorchX CLI's torchx run command has implemented this, but its implementation is coupled with the CLI implementations of torchx run and torchx log.

This diff extracts the useful logic into a helper function of the TorchX API

Reviewed By: andywag

Differential Revision: D62463211

facebook-github-bot · 2024-09-11T16:01:54Z

This pull request was exported from Phabricator. Differential Revision: D62463211

andywag

Review automatically exported from Phabricator review in Meta.

Summary: Pull Request resolved: pytorch#955 MVAI's "light" is synchronous - you can immediately see the logs for jobs you start. Only "fire" is asynchronous. TorchX's API, since it's generic, *always* creates jobs that are asynchronous. Therefore, there isn't a built-in interface for "tailing" the stderr of every started process - just for tailing individual replicas of a given role. The TorchX CLI's `torchx run` command **has** implemented this, but its implementation is coupled with the CLI implementations of `torchx run` and `torchx log`. This diff extracts the useful logic into a helper function of the TorchX API Reviewed By: andywag Differential Revision: D62463211

facebook-github-bot · 2024-09-11T16:27:08Z

This pull request was exported from Phabricator. Differential Revision: D62463211

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 11, 2024

facebook-github-bot added the fb-exported label Sep 11, 2024

andywag approved these changes Sep 11, 2024

View reviewed changes

Sanjay-Ganeshan force-pushed the export-D62463211 branch from ba44d32 to 0bfee30 Compare September 11, 2024 16:27

facebook-github-bot merged commit b7fd00b into pytorch:main Sep 11, 2024
24 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move logic from TorchX CLI -> API, so MVAI can call it #955

Move logic from TorchX CLI -> API, so MVAI can call it #955

Sanjay-Ganeshan commented Sep 11, 2024

facebook-github-bot commented Sep 11, 2024

andywag left a comment

facebook-github-bot commented Sep 11, 2024

Move logic from TorchX CLI -> API, so MVAI can call it #955

Move logic from TorchX CLI -> API, so MVAI can call it #955

Conversation

Sanjay-Ganeshan commented Sep 11, 2024

facebook-github-bot commented Sep 11, 2024

andywag left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Sep 11, 2024