Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: add cache flushing control flags to stream memory operations #129

Open
Epliz opened this issue Jan 17, 2025 · 0 comments
Open
Labels

Comments

@Epliz
Copy link

Epliz commented Jan 17, 2025

Suggestion Description

Hi,

As mentioned briefly at #104 , I believe that adding support for cache flushing control flags to the stream memory read/write operations could be great, similarly to how it is possible with events.

Thanks to stream memory operations I have been able to implement an all-reduce single machine implementation that has ~18us latency, which is really good already (better than rccl from what I see). I believe that being able to avoid cache flushes might help shave more on that.

Best,
Epliz

Operating System

No response

GPU

No response

ROCm Component

No response

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants