Skip to content

CORE: enhance error log in collective init #1104

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

Sergei-Lebedev
Copy link
Contributor

What

  • Improved error logging in ucc_collective_init to include detailed collective arguments on failure.
  • Updated ucc_tl_ucp_allreduce_knomial_init to ensure proper task cleanup on initialization failure.

@artemry-nv
Copy link
Collaborator

bot:retest

@Sergei-Lebedev Sergei-Lebedev force-pushed the topic/improve_error_handling_on_init_error branch from 64e76c1 to 989cd06 Compare April 8, 2025 12:29
@Sergei-Lebedev Sergei-Lebedev requested a review from ikryukov April 8, 2025 12:29
@janjust janjust force-pushed the topic/improve_error_handling_on_init_error branch from 989cd06 to de16909 Compare April 10, 2025 15:26
- Improved error logging in `ucc_collective_init` to include detailed collective arguments on failure.
- Updated `ucc_tl_ucp_allreduce_knomial_init` to ensure proper task cleanup on initialization failure.
@Sergei-Lebedev Sergei-Lebedev force-pushed the topic/improve_error_handling_on_init_error branch from de16909 to bee0585 Compare April 11, 2025 05:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants