Refactor replace and sort #102

RAMitchell · 2025-08-15T13:19:44Z

No description provided.

Signed-off-by: Rory Mitchell <[email protected]>

Copilot

Pull Request Overview

This PR refactors the replace and sort operations by splitting CUDA-specific GPU code into separate files and modernizing the test implementation. The changes migrate the tests from cuDF to Arrow while maintaining backward compatibility for CUDA operations.

Key changes:

- Separates GPU variants into dedicated .cu files for better code organization
- Modernizes test infrastructure from cuDF to Arrow
- Exposes task classes and helper functions in public headers

Reviewed Changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| `cpp/tests/test_replace.cpp` | Migrates from cuDF to Arrow-based test framework |
| `cpp/src/sort.cu` | New file containing GPU-specific sort implementation |
| `cpp/src/sort.cpp` | Refactored to remove GPU code and expose shared functions |
| `cpp/src/replace.cu` | New file containing GPU-specific replace implementation |
| `cpp/src/replace.cpp` | Refactored to remove GPU code and update type checking |
| `cpp/include/legate_dataframe/sort.hpp` | Exposes task class and helper functions |
| `cpp/include/legate_dataframe/replace.hpp` | Exposes task class definition |
| `cpp/include/legate_dataframe/reduction.hpp` | Removes unused variant options |
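The file split summarized above (a public header, a host-only `.cpp`, and a GPU-only `.cu`) can be pictured with a small single-file sketch. Everything here is hypothetical: `SortTaskSketch`, `is_sorted_ascending`, and the `SKETCH_USE_CUDA` guard are illustration names, not the project's actual task class, helper, or build macro, and the real code may organize the variants differently.

```cpp
#include <algorithm>
#include <vector>

// --- sort_sketch.hpp (hypothetical public header) ---------------------------
// The task class is declared once so that every translation unit can see it.
struct SortTaskSketch {
  // Defined in the host-only .cpp file.
  static std::vector<int> cpu_variant(std::vector<int> values);
#ifdef SKETCH_USE_CUDA
  // Declared here, but defined only in the .cu file (assumed build guard).
  static std::vector<int> gpu_variant(std::vector<int> values);
#endif
};

// Helper exposed through the header so both variants could share it.
inline bool is_sorted_ascending(const std::vector<int>& values)
{
  return std::is_sorted(values.begin(), values.end());
}

// --- sort_sketch.cpp (hypothetical host-only source) ------------------------
std::vector<int> SortTaskSketch::cpu_variant(std::vector<int> values)
{
  std::sort(values.begin(), values.end());
  return values;
}

// --- sort_sketch.cu (hypothetical, CUDA builds only) ------------------------
// The definition of SortTaskSketch::gpu_variant would live in this file, so
// the host-only source never needs to include any CUDA headers.
```

With this layout, a CPU-only build compiles just the header and the `.cpp` portion, which is one plausible motivation for moving GPU variants into their own files.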

Copilot · 2025-08-15T13:52:51Z

cpp/src/sort.cu

+  auto split_indices = cudf::lower_bound(sorted_table_with_rank.select(keys_idxx),
+                                         global_split_values.select(value_keysx),
+                                         column_order_with_rank,
+                                         null_precendence_with_rank,


Variable name 'null_precendence_with_rank' contains a spelling error. It should be 'null_precedence_with_rank' (precedence, not precendence).

Suggested change

-                                         null_precendence_with_rank,
+  auto null_precedence_with_rank = null_precedence;
+  null_precedence_with_rank.push_back(cudf::null_order::AFTER);
+  auto split_indices = cudf::lower_bound(sorted_table_with_rank.select(keys_idxx),
+                                         global_split_values.select(value_keysx),
+                                         column_order_with_rank,
+                                         null_precedence_with_rank,
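The hunk under review calls `cudf::lower_bound` on a sorted table that carries an extra rank column and stores the result in `split_indices`. As a rough illustration of what such a lower-bound lookup computes, here is a sketch in standard C++ rather than cuDF. The row type `KeyWithRank`, the function `find_split_indices`, and the idea that the rank acts as a tie-breaker between equal keys are assumptions made for this example; the actual logic in `sort.cu` may differ.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical row: a sort key plus the rank that owns the row.
// std::pair compares lexicographically, so the rank breaks ties between
// rows whose keys are equal.
using KeyWithRank = std::pair<int, int>;

// For each split value, return the index of the first row in `sorted_rows`
// that does not compare less than that split value.
std::vector<std::size_t> find_split_indices(const std::vector<KeyWithRank>& sorted_rows,
                                            const std::vector<KeyWithRank>& split_values)
{
  // Binary search is only meaningful on sorted input.
  assert(std::is_sorted(sorted_rows.begin(), sorted_rows.end()));

  std::vector<std::size_t> indices;
  indices.reserve(split_values.size());
  for (const auto& split : split_values) {
    auto it = std::lower_bound(sorted_rows.begin(), sorted_rows.end(), split);
    indices.push_back(static_cast<std::size_t>(it - sorted_rows.begin()));
  }
  return indices;
}
```

For example, with rows `{1,0}, {3,0}, {3,0}, {5,0}, {8,0}` and split values `{3,0}, {3,1}, {6,0}`, the function returns the indices 1, 3, and 4. The second split shows the effect of the extra column: both rows with key 3 and rank 0 fall before the split value with key 3 and rank 1.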

seberg

A lot of code needed to move here, but it looks straightforward. Nice, thanks!

RAMitchell added 3 commits August 15, 2025 06:18

Refactor replace

cd44adc

Signed-off-by: Rory Mitchell <[email protected]>

Do sort

9823e9c

Signed-off-by: Rory Mitchell <[email protected]>

Fix type check in replace

e171fa5

Signed-off-by: Rory Mitchell <[email protected]>

RAMitchell requested review from Copilot and seberg August 15, 2025 13:52

Copilot AI reviewed Aug 15, 2025

Fix overflow

2a129a1

Signed-off-by: Rory Mitchell <[email protected]>

seberg approved these changes Aug 18, 2025

RAMitchell merged commit 632ec4c into rapidsai:main Aug 18, 2025
11 checks passed
