ooooo-create
Contributor

add unit tests for cutlass_fp8_fp8_half_block_gemm_fused


paddle-bot bot commented Sep 14, 2025

Thanks for your contribution!

@paddle-bot paddle-bot bot added the contributor External developers label Sep 14, 2025
Collaborator

ckl117 commented Sep 19, 2025

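FP8 has a narrower numeric range and lower precision than BF16, so comparing results by dequantizing FP8 back to BF16 is very likely to introduce errors. I suggest initializing both x and y in bf16 precision, then quantizing x and y to FP8, running the computation, and comparing the results.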
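
A rough sketch of the flow suggested above, in plain Python so it runs without a GPU. Everything here is an assumption for illustration: `to_bf16`, `to_fp8_e4m3`, `quantize`, and `matmul` are hypothetical helpers, the FP8 format is assumed to be E4M3, a single per-tensor scale stands in for the kernel's block-wise scales, and the simulated FP8 matmul stands in for the actual call to `cutlass_fp8_fp8_half_block_gemm_fused`.

```python
import math
import random
import struct

FP8_E4M3_MAX = 448.0  # largest finite E4M3 value (assumed FP8 format)


def to_bf16(v):
    """Round to the nearest bfloat16 value (via float32, ties to even)."""
    bits = struct.unpack("<I", struct.pack("<f", v))[0]
    bits = (bits + 0x7FFF + ((bits >> 16) & 1)) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def to_fp8_e4m3(v):
    """Simulate E4M3 rounding: saturate to +-448, keep 4 significant bits."""
    if v == 0.0:
        return 0.0
    v = max(-FP8_E4M3_MAX, min(FP8_E4M3_MAX, v))
    _, e = math.frexp(v)       # v = m * 2**e with 0.5 <= |m| < 1
    e = max(e, -5)             # below 2**-6 the format is subnormal
    step = 2.0 ** (e - 4)      # spacing of representable values near v
    return round(v / step) * step


def quantize(mat):
    """Per-tensor quantization (a simplification of block-wise scaling)."""
    amax = max(abs(v) for row in mat for v in row) or 1.0
    scale = amax / FP8_E4M3_MAX
    return [[to_fp8_e4m3(v / scale) for v in row] for row in mat], scale


def matmul(a, b):
    """Plain float matmul: a is M x K, b is K x N."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


random.seed(0)
M, K, N = 4, 16, 4

# 1. Initialize x and y in bf16 precision.
x = [[to_bf16(random.uniform(-1, 1)) for _ in range(K)] for _ in range(M)]
y = [[to_bf16(random.uniform(-1, 1)) for _ in range(N)] for _ in range(K)]

# 2. Quantize both inputs to FP8.
x_q, x_scale = quantize(x)
y_q, y_scale = quantize(y)

# 3. Compute: the bf16-input reference, and a stand-in for the kernel
#    (multiply the FP8 values, then apply both scales).
reference = matmul(x, y)
fp8_out = [[v * x_scale * y_scale for v in row] for row in matmul(x_q, y_q)]

# 4. Compare with a tolerance that allows for FP8 quantization error.
max_abs_err = max(abs(r - o)
                  for ref_row, out_row in zip(reference, fp8_out)
                  for r, o in zip(ref_row, out_row))
print(f"max abs error vs bf16 reference: {max_abs_err:.4f}")
```

In a real unit test, step 3 would call the kernel with the quantized tensors and their scales instead of the simulated matmul, and the tolerance in step 4 would need to be chosen for the shapes and value ranges under test.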

3 participants