Preparation for the coming Kudo support #11667

firestarman · 2024-10-28T05:40:11Z

contributes to #11590

This PR refactors the code so that Kudo support can be added easily to the shuffle coalesce and join execs. The main idea is to abstract the batch operations (SerializedTableOperator) out of the coalesce read process.
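To illustrate the idea of separating batch operations from the read loop, here is a minimal sketch. All names in it (`TableOperatorSketch`, `ToyTable`, `ToyOperator`, `CoalesceSketch`) are hypothetical and are not the PR's actual API. It only shows how a coalescing iterator can stay agnostic of the serialization format when the format-specific operations live behind an operator.

```scala
import scala.collection.mutable.ArrayBuffer

// Hypothetical operator: the batch-level operations the coalesce reader needs.
// T is the serialized table type, C is the concatenated result type.
trait TableOperatorSketch[T, C] {
  def dataLen(table: T): Long
  def numRows(table: T): Int
  def concat(tables: Seq[T]): C
}

// Toy serialized table used only for this sketch: a row count plus raw bytes.
final case class ToyTable(rows: Int, bytes: Array[Byte])

object ToyOperator extends TableOperatorSketch[ToyTable, ToyTable] {
  def dataLen(table: ToyTable): Long = table.bytes.length.toLong
  def numRows(table: ToyTable): Int = table.rows
  def concat(tables: Seq[ToyTable]): ToyTable =
    ToyTable(tables.map(_.rows).sum, tables.iterator.flatMap(_.bytes.iterator).toArray)
}

object CoalesceSketch {
  // Groups input tables until adding one more would exceed targetBytes,
  // then concatenates each group through the operator.
  def coalesce[T, C](
      in: Iterator[T],
      targetBytes: Long,
      op: TableOperatorSketch[T, C]): Iterator[C] = {
    val buffered = in.buffered
    new Iterator[C] {
      def hasNext: Boolean = buffered.hasNext
      def next(): C = {
        val group = ArrayBuffer(buffered.next())
        var size = op.dataLen(group.head)
        while (buffered.hasNext && size + op.dataLen(buffered.head) <= targetBytes) {
          val t = buffered.next()
          size += op.dataLen(t)
          group += t
        }
        op.concat(group.toSeq)
      }
    }
  }
}
```

With this shape, supporting another format would mean supplying another operator implementation, while the read loop stays unchanged.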

firestarman · 2024-10-28T07:36:33Z

build

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCoalesceExec.scala

revans2

Mostly nits, but I would like a few more sets of eyes on this.

revans2 · 2024-10-28T16:18:54Z

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCoalesceExec.scala

+
+object CoalesceReadOption {
+  def apply(conf: RapidsConf): CoalesceReadOption = {
+    // TODO get the value from conf


Do we need to do this?

It is better to have it.
It makes it easy to add new fields that are read from the conf later, without updating every place where CoalesceReadOption(conf) is called.
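A minimal sketch of the point made in this reply, assuming a stand-in conf type. `FakeConf` and the key name are hypothetical; the real `RapidsConf` is not reproduced here. Call sites construct the option from the conf, so a new conf-backed field only touches the factory.

```scala
// Hypothetical stand-in for RapidsConf, holding raw string settings.
final case class FakeConf(settings: Map[String, String])

// Options for the coalesce read path. Only kudoEnabled appears in the PR.
final case class CoalesceReadOption(kudoEnabled: Boolean)

object CoalesceReadOption {
  // Hypothetical key name, used only in this sketch.
  val KudoEnabledKey = "example.shuffle.kudo.enabled"

  // Single place that maps conf entries to option fields.
  // Adding a field later changes this method, not the callers.
  def apply(conf: FakeConf): CoalesceReadOption = {
    val kudo = conf.settings.get(KudoEnabledKey).exists(_.toBoolean)
    CoalesceReadOption(kudo)
  }
}
```

Callers keep writing `CoalesceReadOption(conf)` regardless of how many fields the option grows.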

revans2 · 2024-10-28T16:20:26Z

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCoalesceExec.scala

+      metricsMap: Map[String, GpuMetric],
+      prefetchFirstBatch: Boolean = false): Iterator[ColumnarBatch] = {
+    val hostIter = if (readOption.kudoEnabled) {
+      // TODO replace with the actual Kudo host iterator


Can we please throw an exception instead? If this is true we have problems and making the data disappear feels wrong to me.

Good suggestion, done
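The change requested here can be sketched roughly as follows. This is an illustration under simplifying assumptions, not the merged code: the element type is reduced to `Array[Byte]`, and `HostIterSketch` is a hypothetical name.

```scala
final case class CoalesceReadOption(kudoEnabled: Boolean)

object HostIterSketch {
  // Builds the host-side iterator for the coalesce read.
  def hostIterator(
      iter: Iterator[Array[Byte]],
      readOption: CoalesceReadOption): Iterator[Array[Byte]] = {
    if (readOption.kudoEnabled) {
      // Fail loudly instead of returning an empty iterator,
      // which would silently drop the shuffle data.
      throw new UnsupportedOperationException("Kudo is not supported yet")
    } else {
      iter
    }
  }
}
```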

revans2 · 2024-10-28T16:30:59Z

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCoalesceExec.scala

+  def getCoalescedBufferSize(concated: AnyRef): Long = concated match {
+    // TODO add the Kudo case
+    case c: HostConcatResult => c.getTableHeader.getDataLen
+    case g => GpuColumnVector.getTotalDeviceMemoryUsed(g.asInstanceOf[ColumnarBatch])


nit: Can we please have better error messages here? I don't like that we have g.asInstanceOf when the match should do that for us. It also means that if something goes wrong, the error will just be a ClassCastException instead of a cleaner message.

This function is removed.
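Although the function was removed, the reviewer's suggestion can be sketched as below. `FakeHostResult` and `FakeDeviceBatch` are hypothetical stand-ins for `HostConcatResult` and `ColumnarBatch`; the point is only that each case is matched by type and the fallback produces a descriptive error.

```scala
// Hypothetical stand-ins for the host and device result types.
final case class FakeHostResult(dataLen: Long)
final case class FakeDeviceBatch(deviceBytes: Long)

object BufferSizeSketch {
  def getCoalescedBufferSize(concated: AnyRef): Long = concated match {
    case null =>
      throw new IllegalArgumentException("Coalesced buffer is null")
    // Typed patterns do the cast, so no asInstanceOf is needed.
    case h: FakeHostResult => h.dataLen
    case d: FakeDeviceBatch => d.deviceBytes
    // Anything else gets a clear message instead of a ClassCastException.
    case other =>
      throw new IllegalArgumentException(
        s"Unsupported coalesced buffer type: ${other.getClass.getName}")
  }
}
```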

firestarman · 2024-10-29T01:59:41Z

Thanks for the review, but I will drop this PR since it is not necessary to have a separate PR for only the table operator part.

@liurenjie1024 Seems it would be better to have a single PR for the Kudo support.

firestarman · 2024-10-31T12:07:30Z

build

jlowe

lgtm, but it would be good to have @revans2 have a second look.

firestarman · 2024-11-01T01:57:01Z

build

liurenjie1024

Generally LGTM, just a nit

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCoalesceExec.scala

firestarman · 2024-11-04T01:41:37Z

@revans2 Could you help take a look? Thanks.

firestarman added 2 commits October 28, 2024 12:12

Preparation for the coming Kudo support

5e0def5

Some code refactor for easily adding in the Kudo support in Shuffle coalesce and join execs. Signed-off-by: Firestarman <[email protected]>

More comments for TODO

e80a16e

Signed-off-by: Firestarman <[email protected]>

liurenjie1024 reviewed Oct 28, 2024

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCoalesceExec.scala (outdated)

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCoalesceExec.scala (outdated)

firestarman and others added 3 commits October 28, 2024 14:29

Update sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCo…

1114d71

…alesceExec.scala Co-authored-by: Renjie Liu <[email protected]>

Update sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCo…

5ec4586

…alesceExec.scala Co-authored-by: Renjie Liu <[email protected]>

address comments

13058a6

Signed-off-by: Firestarman <[email protected]>

jlowe reviewed Oct 28, 2024

revans2 reviewed Oct 28, 2024

firestarman closed this Oct 29, 2024

firestarman reopened this Oct 31, 2024

firestarman added 2 commits October 31, 2024 08:09

address more comments

666f2a8

Signed-off-by: Firestarman <[email protected]>

remove unused import

d4777cc

Signed-off-by: Firestarman <[email protected]>

firestarman requested review from jlowe, liurenjie1024 and revans2 October 31, 2024 08:38

jlowe previously approved these changes Oct 31, 2024

correct some doc

b5a2734

Signed-off-by: Firestarman <[email protected]>

firestarman dismissed jlowe’s stale review via b5a2734 November 1, 2024 01:35

liurenjie1024 approved these changes Nov 1, 2024

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuShuffleCoalesceExec.scala

revans2 approved these changes Nov 4, 2024

revans2 merged commit 35980d6 into NVIDIA:branch-24.12 Nov 4, 2024
46 of 47 checks passed

firestarman deleted the refactor-coalesce-shuffle branch November 5, 2024 01:40

sameerz added the performance label (A performance related task/issue) Nov 5, 2024
