Skip to content

Commit caf73f5

Browse files
authored
[https://nvbugs/5383702][fix] test_llm_api_pytorch.py::TestLlama3_1_8BInstruct::test_fp8_4gpus (NVIDIA#6889)
Signed-off-by: Superjomn <[email protected]>
1 parent e77ec06 commit caf73f5

File tree

1 file changed

+0
-6
lines changed

1 file changed

+0
-6
lines changed

tensorrt_llm/_torch/compilation/piecewise_optimizer.py

Lines changed: 0 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -208,15 +208,9 @@ def __call__(self, *args):
208208
runtime_input_addresses = [
209209
i.data_ptr() for i in args if isinstance(i, torch.Tensor)
210210
]
211-
runtime_output_addresses = [
212-
i.data_ptr() for i in output if isinstance(i, torch.Tensor)
213-
]
214211

215212
assert (entry.input_addresses == runtime_input_addresses
216213
), f"{entry.input_addresses} vs\n {runtime_input_addresses}"
217-
assert (
218-
entry.output_addresses == runtime_output_addresses
219-
), f"{entry.output_addresses} vs\n {runtime_output_addresses}"
220214

221215
entry.cuda_graph.replay()
222216

0 commit comments

Comments
 (0)