Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Testing merge #17

Merged
merged 411 commits into from
Dec 9, 2024
Merged

Testing merge #17

merged 411 commits into from
Dec 9, 2024

Conversation

MarcelKoch
Copy link
Owner

No description provided.

yhmtsai and others added 30 commits August 20, 2024 10:12
This PR adds workaround for tbb on github action because the environment does not install oneTBB.

Related PR: ginkgo-project#1666
…el precision.

Thus, we can not cast e to current level precision unless it is the last level
This pr fixes the casting issue in mixed multigrid

Related PR: ginkgo-project#1663
Co-authored-by: Pratik Nayak <[email protected]>
Unify and simplify batch functionality: Multivector

Related PR: ginkgo-project#1651
+ also fix kernel names: remove _kernel suffix
Unify and simplify batch functionality: Matrix formats (csr, dense, ell)

Related PR: ginkgo-project#1669
Co-authored-by: Marcel Koch <[email protected]>
Co-authored-by: Pratik Nayak <[email protected]>
This pr adds the file config for schwarz preconditioner.
Moreover, it add the global_index into type_descriptor now

Related PR: ginkgo-project#1658
This causes some kernels on ROCm debug builds to fail
There is some weird interaction between inlining of shfl_xor and the
(otherwise unused) members of thread_block_tile.
The easiest way of working around it is to inline them explicitly as
__shfl_xor(_sync).
This fixes some issues with assertions and the bitonic sorting kernels on ROCm 6.x

Related PR: ginkgo-project#1670
Fritz Goebel and others added 29 commits November 28, 2024 13:51
Co-authored-by: Yu-Hsiang M. Tsai <[email protected]>
Co-authored-by: Marcel Koch <[email protected]>
…ead_distributed

Additive read distributed

This PR adds an option to communicate overlap in the distributed matrix' `read_distributed`. If the option is used, nonzero entries present in multiple ranks are added up on the owning rank rather than thrown away in `read_distributed`. This functionality can also be used via a free function `assemble_rows_from_neighbors` just using `device_matrix_data`.

This can be useful e.g. if in a domain decomposed finite element setting, each rank assembles their local contribution to a global system matrix and when assembling the global system matrix information on the subdomain boundaries has to be exchanged.

Related PR: ginkgo-project#1650
- bitmap: lead wrong answer or segfault
- hashmap: infinite loop
…pposite behavior).

Co-authored-by: Tobias Ribizel <[email protected]>
Co-authored-by: Natalie Beams <[email protected]>
This PR adds ginkgo own impelementation for ILU and IC. It is controlled by `incomplete_algorithm::sync_free`.

Related PR: ginkgo-project#1684
@MarcelKoch MarcelKoch merged commit 97c825c into main Dec 9, 2024
5 of 21 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants