Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add README section on performance counters #191

Merged
merged 2 commits into from
Mar 20, 2024
Merged

Conversation

nazavode
Copy link
Collaborator

@nazavode nazavode commented Mar 4, 2024

This PR adds a full description of all performance counters reported by the post-processing script. All descriptions have been discussed with hw designers to avoid discrepancies with the micro-architecture.

Copy link

github-actions bot commented Mar 4, 2024

kernels baseline linalg scf_xdsl snitch_stream snrt min_llvm_mlir
conv2d_d1_s1_3x3 1x1x8x8xf64 1559 1489 1503 1486 1489
ddot 128xf64 929 937 961 570 550 929
dense 8x8xf64 3239 3238 2730 2709 3238
dsum 8x16xf32 1202 794 3543 209 187 794
fill 16x16xf64 370 347 2665 294 299 347
matmul 8x8xf64 2562 2560 13602 2333 2304 2560
pooling_nchw_max_d1_s2_3x3 1x1x16x16xf64 1435 1201 13548 1115 1094 1201
pooling_nchw_sum_d1_s2_3x3 1x1x16x16xf64 1948 1948 13548 1997 1976 1948
relu 16x16xf64 1081 1074 4974 322 334 1074
softmax 16xf64 32613 32613

1 similar comment
Copy link

github-actions bot commented Mar 4, 2024

kernels baseline linalg scf_xdsl snitch_stream snrt min_llvm_mlir
conv2d_d1_s1_3x3 1x1x8x8xf64 1559 1489 1503 1486 1489
ddot 128xf64 929 937 961 570 550 929
dense 8x8xf64 3239 3238 2730 2709 3238
dsum 8x16xf32 1202 794 3543 209 187 794
fill 16x16xf64 370 347 2665 294 299 347
matmul 8x8xf64 2562 2560 13602 2333 2304 2560
pooling_nchw_max_d1_s2_3x3 1x1x16x16xf64 1435 1201 13548 1115 1094 1201
pooling_nchw_sum_d1_s2_3x3 1x1x16x16xf64 1948 1948 13548 1997 1976 1948
relu 16x16xf64 1081 1074 4974 322 334 1074
softmax 16xf64 32613 32613

@nazavode nazavode force-pushed the nazavode/docs-perf branch from 3badbed to 9159d38 Compare March 20, 2024 11:00
Copy link

kernels baseline linalg scf_xdsl snitch_stream snrt min_llvm_mlir
conv2d_d1_s1_3x3 1x1x8x8xf64 1559 1489 1503 1486 1489
ddot 128xf64 929 937 961 570 550 929
dense 8x8xf64 3239 3238 2730 2709 3238
dsum 8x16xf32 1202 794 2019 209 187 794
fill 16x16xf64 370 347 2665 294 299 347
matmul 8x8xf64 2515 2823 2336 2323 2515
pooling_nchw_max_d1_s2_3x3 1x1x16x16xf64 1435 1201 1115 1094 1201
pooling_nchw_sum_d1_s2_3x3 1x1x16x16xf64 1948 1948 1997 1976 1948
relu 16x16xf64 1081 1074 3460 322 334 1074

Copy link

kernels baseline linalg scf_xdsl snitch_stream snrt
conv2d_d1_s1_3x3 1x1x8x8xf64 0.23 0.21 0.18 0.18
ddot 128xf64 0.14 0.14 0.13 0.00 0.00
dense 8x8xf64 0.20 0.20 0.22 0.23
dsum 8x16xf32 0.11 0.16 0.06 0.00 0.69
fill 16x16xf64 0.00 0.00 0.00 0.02 0.86
matmul 8x8xf64 0.20 0.18 0.25 0.25
pooling_nchw_max_d1_s2_3x3 1x1x16x16xf64 0.31 0.37 0.35 0.36
pooling_nchw_sum_d1_s2_3x3 1x1x16x16xf64 0.22 0.22 0.19 0.20
relu 16x16xf64 0.24 0.24 0.07 0.01 0.01

@nazavode nazavode merged commit 3b20f02 into main Mar 20, 2024
2 checks passed
Copy link

kernels baseline linalg scf_xdsl snitch_stream snrt min_llvm_mlir
conv2d_d1_s1_3x3 1x1x8x8xf64 1559 1489 1503 1486 1489
ddot 128xf64 929 937 961 570 550 929
dense 8x8xf64 3239 3238 2730 2709 3238
dsum 8x16xf32 1202 794 2019 209 187 794
fill 16x16xf64 370 347 2665 294 299 347
matmul 8x8xf64 2515 2823 2336 2323 2515
pooling_nchw_max_d1_s2_3x3 1x1x16x16xf64 1435 1201 1115 1094 1201
pooling_nchw_sum_d1_s2_3x3 1x1x16x16xf64 1948 1948 1997 1976 1948
relu 16x16xf64 1081 1074 3460 322 334 1074

Copy link

kernels baseline linalg scf_xdsl snitch_stream snrt
conv2d_d1_s1_3x3 1x1x8x8xf64 0.23 0.21 0.18 0.18
ddot 128xf64 0.14 0.14 0.13 0.00 0.00
dense 8x8xf64 0.20 0.20 0.22 0.23
dsum 8x16xf32 0.11 0.16 0.06 0.00 0.69
fill 16x16xf64 0.00 0.00 0.00 0.02 0.86
matmul 8x8xf64 0.20 0.18 0.25 0.25
pooling_nchw_max_d1_s2_3x3 1x1x16x16xf64 0.31 0.37 0.35 0.36
pooling_nchw_sum_d1_s2_3x3 1x1x16x16xf64 0.22 0.22 0.19 0.20
relu 16x16xf64 0.24 0.24 0.07 0.01 0.01

@nazavode nazavode deleted the nazavode/docs-perf branch June 4, 2024 09:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants