Skip to content

Commit

Permalink
Tuning script adaptation for the new triton
Browse files Browse the repository at this point in the history
  • Loading branch information
gshtras committed Dec 19, 2024
1 parent c1ccc5e commit dfb1df5
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion benchmarks/kernels/benchmark_moe.py
Original file line number Diff line number Diff line change
Expand Up @@ -155,7 +155,7 @@ def get_rocm_tuning_space(use_fp16):
# For now we see better perf with num_stages=0 for all gemm configs we care
# But keep this explicit so that we do not forget we may need to set it to
# other values in the future
num_stage_range = [0]
num_stage_range = [2]
waves_per_eu_range = [0]
matrix_instr_nonkdim_range = [16, 32] if use_fp16 else []
kpack_range = [1, 2] if use_fp16 else []
Expand Down

0 comments on commit dfb1df5

Please sign in to comment.