Skip to content

Commit ec6ff88

Browse files
reasonsolodominicshanshan
authored andcommitted
[https://nvbugs/5448767][fix] disable kv cache reuse for disagg pp>1 tests (NVIDIA#7354)
Signed-off-by: Lizhi Zhou <[email protected]> Signed-off-by: Wangshanshan <[email protected]>
1 parent 2e7a195 commit ec6ff88

File tree

4 files changed

+7
-0
lines changed

4 files changed

+7
-0
lines changed

tests/integration/defs/disaggregated/test_configs/disagg_config_ctxpp2_genpp2.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ context_servers:
1414
kv_cache_config:
1515
free_gpu_memory_fraction: 0.2
1616
enable_partial_reuse: False
17+
enable_block_reuse: False
1718
disable_overlap_scheduler: True
1819
cache_transceiver_config:
1920
backend: DEFAULT
@@ -29,6 +30,7 @@ generation_servers:
2930
kv_cache_config:
3031
free_gpu_memory_fraction: 0.2
3132
enable_partial_reuse: False
33+
enable_block_reuse: False
3234
disable_overlap_scheduler: True
3335
cache_transceiver_config:
3436
backend: DEFAULT

tests/integration/defs/disaggregated/test_configs/disagg_config_ctxpp2_gentp2.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ context_servers:
1414
kv_cache_config:
1515
free_gpu_memory_fraction: 0.2
1616
enable_partial_reuse: False
17+
enable_block_reuse: False
1718
disable_overlap_scheduler: True
1819
cache_transceiver_config:
1920
backend: DEFAULT

tests/integration/defs/disaggregated/test_configs/disagg_config_ctxpp4_genpp4.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ context_servers:
1414
kv_cache_config:
1515
free_gpu_memory_fraction: 0.2
1616
enable_partial_reuse: False
17+
enable_block_reuse: False
1718
disable_overlap_scheduler: True
1819
cache_transceiver_config:
1920
backend: DEFAULT
@@ -29,6 +30,7 @@ generation_servers:
2930
kv_cache_config:
3031
free_gpu_memory_fraction: 0.2
3132
enable_partial_reuse: False
33+
enable_block_reuse: False
3234
disable_overlap_scheduler: True
3335
cache_transceiver_config:
3436
backend: DEFAULT

tests/integration/defs/disaggregated/test_configs/disagg_config_ctxtp2pp2_gentp2pp2.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ context_servers:
1414
kv_cache_config:
1515
free_gpu_memory_fraction: 0.2
1616
enable_partial_reuse: False
17+
enable_block_reuse: False
1718
disable_overlap_scheduler: True
1819
cache_transceiver_config:
1920
backend: DEFAULT
@@ -29,6 +30,7 @@ generation_servers:
2930
kv_cache_config:
3031
free_gpu_memory_fraction: 0.2
3132
enable_partial_reuse: False
33+
enable_block_reuse: False
3234
disable_overlap_scheduler: True
3335
cache_transceiver_config:
3436
backend: DEFAULT

0 commit comments

Comments
 (0)