Skip to content

Commit 3469736

Browse files
committed
correcting misprint for TRT LLM load
1 parent 77f2145 commit 3469736

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

py/torch_tensorrt/dynamo/_compiler.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -502,7 +502,7 @@ def compile(
502502
enable_weight_streaming (bool): Enable weight streaming.
503503
tiling_optimization_level (str): The optimization level of tiling strategies. A higher level allows TensorRT to spend more time searching for better tiling strategy. We currently support ["none", "fast", "moderate", "full"].
504504
l2_limit_for_tiling (int): The target L2 cache usage limit (in bytes) for tiling optimization (default is -1 which means no limit).
505-
ç
505+
use_distributed_mode_trace (bool): Using aot_autograd to trace the graph. This is enabled when DTensors or distributed tensors are present in distributed model
506506
**kwargs: Any,
507507
Returns:
508508
torch.fx.GraphModule: Compiled FX Module, when run it will execute via TensorRT

0 commit comments

Comments
 (0)