Skip to content

Commit

Permalink
remove tl.constexpr to avoid repeatly compile triton kernel
Browse files Browse the repository at this point in the history
  • Loading branch information
AllentDan committed Dec 5, 2024
1 parent cc5d110 commit bbc108c
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions lmdeploy/pytorch/kernels/cuda/flashattention.py
Original file line number Diff line number Diff line change
Expand Up @@ -444,9 +444,9 @@ def _flash_prefill_fwd_kernel_with_mask(
stride_os: tl.constexpr,
stride_oh: tl.constexpr,
stride_od: tl.constexpr,
stride_amb: tl.constexpr,
stride_amqs: tl.constexpr,
stride_amkvs: tl.constexpr,
stride_amb,
stride_amqs,
stride_amkvs,
kv_group_num,
head_dim_k,
head_dim_v,
Expand Down

0 comments on commit bbc108c

Please sign in to comment.