rcca: test default kv_cache_reuse option for pytorch multimodal #5544

StanleySun639 · 2025-06-27T09:07:29Z

PR title

Please write the PR title using the following template:

[JIRA ticket link/nvbug link/github issue link][fix/feat/doc/infra/...] <summary of this PR>

For example, for a PR that adds a new cache manager feature tracked by Jira ticket TRTLLM-1000, the title would be:

[TRTLLM-1000][feat] Support a new feature about cache manager
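As an illustration of the template above, here is a minimal sketch of a title check. The helper name `parse_pr_title` and the `PrTitle` type are hypothetical and not part of the repository tooling; the sketch assumes only the `[ticket][type] summary` shape shown in the template and does not restrict the type to a fixed list, since the template leaves that list open.

```python
import re
from typing import NamedTuple, Optional


class PrTitle(NamedTuple):
    """Hypothetical container for the three parts of a templated PR title."""
    ticket: str
    kind: str
    summary: str


# Two bracketed fields followed by whitespace and a non-empty summary.
_TITLE_RE = re.compile(
    r"^\[(?P<ticket>[^\[\]]+)\]\[(?P<kind>[^\[\]]+)\]\s+(?P<summary>\S.*)$"
)


def parse_pr_title(title: str) -> Optional[PrTitle]:
    """Return the parsed parts, or None if the title does not follow the template."""
    match = _TITLE_RE.match(title.strip())
    if match is None:
        return None
    return PrTitle(match["ticket"], match["kind"], match["summary"])


example = parse_pr_title("[TRTLLM-1000][feat] Support a new feature about cache manager")
```

A title without the two bracketed fields, such as the one used for this PR, would yield `None` under this sketch.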

Description

RCCA bug#5252057

Test Coverage

GitHub Bot Help

/bot [-h] ['run', 'kill', 'skip', 'reuse-pipeline'] ...

Provides a user-friendly way for developers to interact with a Jenkins server.

Run /bot [-h|--help] to print this help message.

See details below for each supported subcommand.

run [--disable-fail-fast --skip-test --stage-list "A10-1, xxx" --gpu-type "A30, H100_PCIe" --add-multi-gpu-test --only-multi-gpu-test --disable-multi-gpu-test --post-merge --extra-stage "H100_PCIe-[Post-Merge]-1, xxx"]

Launch build/test pipelines. All previously running jobs will be killed.

--disable-fail-fast (OPTIONAL) : Disable fail fast on build/tests/infra failures.

--skip-test (OPTIONAL) : Skip all test stages, but still run build stages, package stages and sanity check stages. Note: Does NOT update GitHub check status.

--stage-list "A10-1, xxx" (OPTIONAL) : Only run the specified test stages. Examples: "A10-1, xxx". Note: Does NOT update GitHub check status.

--gpu-type "A30, H100_PCIe" (OPTIONAL) : Only run the test stages on the specified GPU types. Examples: "A30, H100_PCIe". Note: Does NOT update GitHub check status.

--only-multi-gpu-test (OPTIONAL) : Only run the multi-GPU tests. Note: Does NOT update GitHub check status.

--disable-multi-gpu-test (OPTIONAL) : Disable the multi-GPU tests. Note: Does NOT update GitHub check status.

--add-multi-gpu-test (OPTIONAL) : Force run the multi-GPU tests. Will also run L0 pre-merge pipeline.

--post-merge (OPTIONAL) : Run the L0 post-merge pipeline instead of the ordinary L0 pre-merge pipeline.

--extra-stage "H100_PCIe-[Post-Merge]-1, xxx" (OPTIONAL) : Run the ordinary L0 pre-merge pipeline and specified test stages. Examples: --extra-stage "H100_PCIe-[Post-Merge]-1, xxx".

For guidance on mapping tests to stage names, see docs/source/reference/ci-overview.md.
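The `run` options above can be combined into a single comment. The following is a rough sketch of a helper that composes such a comment string; `build_bot_run_command` is a hypothetical name used for illustration and is not part of the bot. It covers only a few of the listed flags and assumes list values are joined with a comma and a space inside double quotes, as in the help text examples.

```python
from typing import Optional, Sequence


def build_bot_run_command(
    stage_list: Optional[Sequence[str]] = None,
    gpu_type: Optional[Sequence[str]] = None,
    disable_fail_fast: bool = False,
    post_merge: bool = False,
) -> str:
    """Compose a '/bot run' comment from a subset of the documented options."""
    parts = ["/bot run"]
    if disable_fail_fast:
        parts.append("--disable-fail-fast")
    if stage_list:
        # Stage names are passed as one quoted, comma-separated value.
        parts.append('--stage-list "{}"'.format(", ".join(stage_list)))
    if gpu_type:
        parts.append('--gpu-type "{}"'.format(", ".join(gpu_type)))
    if post_merge:
        parts.append("--post-merge")
    return " ".join(parts)


comment = build_bot_run_command(stage_list=["H100_PCIe-TensorRT-1"])
```

With a single stage name, the sketch produces the same form of comment as the `--stage-list` example in the help text.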

kill

kill

Kill all running builds associated with the pull request.

skip

skip --comment COMMENT

Skip testing for the latest commit on the pull request. --comment "Reason for skipping build/test" is required. IMPORTANT NOTE: This is dangerous, since insufficient care and validation can break the top of tree.

reuse-pipeline

reuse-pipeline

Reuse a previous pipeline to validate the current commit. This action will also kill all currently running builds associated with the pull request. IMPORTANT NOTE: This is dangerous, since insufficient care and validation can break the top of tree.

StanleySun639 · 2025-06-27T09:09:17Z

/bot run

tensorrt-cicd · 2025-06-27T09:14:30Z

PR_Github #10131 [ run ] triggered by Bot

yechank-nvidia

We are not supporting kv_cache_reuse for multimodal now.

StanleySun639 · 2025-06-27T10:25:09Z

> We are not supporting kv_cache_reuse for multimodal now.

@yechank-nvidia Yes. We need to remove the explicit option so that the test exercises the default value. This is to test the change in this PR: https://github.com/NVIDIA/TensorRT-LLM/pull/4025/files
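
The reasoning about explicit versus default values can be illustrated with a small sketch. Everything here is hypothetical: `effective_option` and its parameters are illustrative names, not TensorRT-LLM APIs, and the sketch makes no claim about what the actual default for kv_cache_reuse is.

```python
from typing import Optional


def effective_option(explicit: Optional[bool], library_default: bool) -> bool:
    """Return the value in effect: the explicit setting if given, else the default."""
    return library_default if explicit is None else explicit


# A test that passes an explicit value sees the same result whatever the
# default is, so a change to the default goes unnoticed.
pinned_results = {effective_option(False, d) for d in (True, False)}

# A test that omits the option follows the default, so it covers the default.
default_results = {effective_option(None, d) for d in (True, False)}
```

In this sketch `pinned_results` contains a single value while `default_results` contains both, which is why dropping the explicit option lets the test cover the default.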

tensorrt-cicd · 2025-06-27T12:01:53Z

PR_Github #10131 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #7478 completed with status: 'FAILURE'

StanleySun639 · 2025-06-28T08:35:55Z

/bot run

tensorrt-cicd · 2025-06-28T08:41:22Z

PR_Github #10196 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-28T10:04:42Z

PR_Github #10196 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #7528 completed with status: 'FAILURE'

StanleySun639 · 2025-06-30T01:48:33Z

/bot run

tensorrt-cicd · 2025-06-30T01:53:35Z

PR_Github #10254 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-30T04:42:41Z

PR_Github #10254 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #7579 completed with status: 'FAILURE'

StanleySun639 · 2025-06-30T06:57:11Z

/bot run --stage-list "H100_PCIe-TensorRT-1"

tensorrt-cicd · 2025-06-30T07:04:08Z

PR_Github #10292 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-30T19:12:42Z

PR_Github #10292 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #7609 (Partly Tested) completed with status: 'FAILURE'

StanleySun639 · 2025-07-01T01:25:58Z

/bot run

tensorrt-cicd · 2025-07-01T01:30:57Z

PR_Github #10408 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-01T04:02:05Z

PR_Github #10408 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #7696 completed with status: 'SUCCESS'
Pipeline passed with automatic retried tests. Check the rerun report for details.

StanleySun639 requested review from crazydemo, yechank-nvidia, LarryXFly and xinhe-nv June 27, 2025 09:07

StanleySun639 force-pushed the user/stsun/rcca-5252057 branch from 81f4866 to ea02c60 Compare June 27, 2025 09:07

StanleySun639 changed the title ~~rcca: test default kv_cache_reuse option~~ rcca: test default kv_cache_reuse option for pytorch multimodal Jun 27, 2025

crazydemo approved these changes Jun 27, 2025

yechank-nvidia reviewed Jun 27, 2025

yechank-nvidia approved these changes Jun 27, 2025

StanleySun639 force-pushed the user/stsun/rcca-5252057 branch from ea02c60 to cc1c52f Compare June 28, 2025 08:34

StanleySun639 force-pushed the user/stsun/rcca-5252057 branch from cc1c52f to 29f6049 Compare June 30, 2025 01:46

StanleySun639 force-pushed the user/stsun/rcca-5252057 branch from 29f6049 to bd6bb50 Compare June 30, 2025 06:54

rcca: test default kv_cache_reuse option

f084037

Signed-off-by: Stanley Sun <[email protected]>

StanleySun639 force-pushed the user/stsun/rcca-5252057 branch from bd6bb50 to f084037 Compare July 1, 2025 01:25

StanleySun639 merged commit 7135b27 into NVIDIA:main Jul 1, 2025
3 checks passed

StanleySun639 deleted the user/stsun/rcca-5252057 branch July 1, 2025 04:12

Shunkangz pushed a commit to Shunkangz/TensorRT-LLM that referenced this pull request Jul 2, 2025

rcca: test default kv_cache_reuse option for pytorch multimodal (NVID…

e0974b3

…IA#5544) Signed-off-by: Stanley Sun <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 9, 2025

rcca: test default kv_cache_reuse option for pytorch multimodal (NVID…

0830c88

…IA#5544) Signed-off-by: Stanley Sun <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025

rcca: test default kv_cache_reuse option for pytorch multimodal (NVID…

9b6b15a

…IA#5544) Signed-off-by: Stanley Sun <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025

rcca: test default kv_cache_reuse option for pytorch multimodal (NVID…

65219d4

…IA#5544) Signed-off-by: Stanley Sun <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025

rcca: test default kv_cache_reuse option for pytorch multimodal (NVID…

91e0362

…IA#5544) Signed-off-by: Stanley Sun <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025

rcca: test default kv_cache_reuse option for pytorch multimodal (NVID…

2777a57

…IA#5544) Signed-off-by: Stanley Sun <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025

rcca: test default kv_cache_reuse option for pytorch multimodal (NVID…

2a70ddc

…IA#5544) Signed-off-by: Stanley Sun <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025

rcca: test default kv_cache_reuse option for pytorch multimodal (NVID…

f935f40

…IA#5544) Signed-off-by: Stanley Sun <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025

rcca: test default kv_cache_reuse option for pytorch multimodal (NVID…

3ea827c

…IA#5544) Signed-off-by: Stanley Sun <[email protected]>
