Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

use stride of 1 in pooling kernels #313

Merged
merged 3 commits into from
Oct 3, 2024
Merged

use stride of 1 in pooling kernels #313

merged 3 commits into from
Oct 3, 2024

Conversation

superlopuh
Copy link
Collaborator

No description provided.

Copy link

github-actions bot commented Sep 2, 2024

kernels,baseline,linalg,linalg_xdsl,snitch_stream,snrt
conv2d_d1_s1_3x3 4x4xf64,667,,308,,
ddot 128xf64,956,965,,213,577
dense 8x8xf64,3206,3530,,2741,2723
fill 4x4xf64,50,,63,,
matmul 4x16x8xf64,2495,,708,,
matmul_transb 4x16x16xf32,3386,,,871,849
pooling_nchw_max_d1_s2_3x3 4x4xf64,442,,275,,
pooling_nchw_sum_d1_s2_3x3 4x4xf64,582,,271,,
relu 4x4xf64,142,,72,,
relu 4x8xf32,297,210,,67,85
saxpy 64xf32,634,634,,,140
sum 4x4xf64,129,,87,,
sum 4x8xf32,238,247,87,,72
sum 8x8xf16,821,,65,,

Copy link

github-actions bot commented Sep 2, 2024

kernels,baseline,linalg,linalg_xdsl,snitch_stream,snrt
conv2d_d1_s1_3x3 4x4xf64,0.22,,0.58,,
ddot 128xf64,0.13,0.13,,0.64,0.22
dense 8x8xf64,0.20,0.18,,0.26,0.26
fill 4x4xf64,0.02,,0.29,,
matmul 4x16x8xf64,0.21,,0.82,,
matmul_transb 4x16x16xf32,0.21,,,0.77,0.79
pooling_nchw_max_d1_s2_3x3 4x4xf64,0.33,,0.65,,
pooling_nchw_sum_d1_s2_3x3 4x4xf64,0.22,,0.66,,
relu 4x4xf64,0.13,,0.25,,
relu 4x8xf32,0.33,0.16,,0.28,0.22
saxpy 64xf32,0.10,0.10,,,0.46
sum 4x4xf64,0.13,,0.20,,
sum 4x8xf32,0.14,0.13,0.20,,0.24
sum 8x8xf16,0.31,,0.26,,

Base automatically changed from sasha/regalloc/exclude-zero-register to main October 3, 2024 12:18
@superlopuh superlopuh self-assigned this Oct 3, 2024
Copy link

github-actions bot commented Oct 3, 2024

kernels,baseline,linalg,linalg_xdsl,snitch_stream,snrt
conv2d_d1_s1_3x3 4x4xf64,667,,308,,
ddot 128xf64,956,965,,213,577
dense 8x8xf64,3206,3530,,2741,2723
fill 4x4xf64,50,,63,,
matmul 4x16x8xf64,2495,,708,,
matmul_transb 4x16x16xf32,3386,,,871,849
pooling_nchw_max_d1_s2_3x3 4x4xf64,584,,275,,
pooling_nchw_sum_d1_s2_3x3 4x4xf64,902,,271,,
relu 4x4xf64,142,,72,,
relu 4x8xf32,297,210,,67,85
saxpy 64xf32,634,634,,,140
sum 4x4xf64,129,,87,,
sum 4x8xf32,238,247,87,,72
sum 8x8xf16,821,,65,,

Copy link

github-actions bot commented Oct 3, 2024

kernels,baseline,linalg,linalg_xdsl,snitch_stream,snrt
conv2d_d1_s1_3x3 4x4xf64,0.22,,0.58,,
ddot 128xf64,0.13,0.13,,0.64,0.22
dense 8x8xf64,0.20,0.18,,0.26,0.26
fill 4x4xf64,0.02,,0.29,,
matmul 4x16x8xf64,0.21,,0.82,,
matmul_transb 4x16x16xf32,0.21,,,0.77,0.79
pooling_nchw_max_d1_s2_3x3 4x4xf64,0.39,,0.65,,
pooling_nchw_sum_d1_s2_3x3 4x4xf64,0.22,,0.66,,
relu 4x4xf64,0.13,,0.25,,
relu 4x8xf32,0.33,0.16,,0.28,0.22
saxpy 64xf32,0.10,0.10,,,0.46
sum 4x4xf64,0.13,,0.20,,
sum 4x8xf32,0.14,0.13,0.20,,0.24
sum 8x8xf16,0.31,,0.26,,

@superlopuh superlopuh merged commit 0b0fd3a into main Oct 3, 2024
1 check passed
@superlopuh superlopuh deleted the sasha/stride-one branch October 3, 2024 17:05
Copy link

github-actions bot commented Oct 3, 2024

kernels,baseline,linalg,linalg_xdsl,snitch_stream,snrt
conv2d_d1_s1_3x3 4x4xf64,667,,308,,
ddot 128xf64,956,965,,213,577
dense 8x8xf64,3206,3530,,2741,2723
fill 4x4xf64,50,,63,,
matmul 4x16x8xf64,2495,,708,,
matmul_transb 4x16x16xf32,3386,,,871,849
pooling_nchw_max_d1_s2_3x3 4x4xf64,584,,275,,
pooling_nchw_sum_d1_s2_3x3 4x4xf64,902,,271,,
relu 4x4xf64,142,,72,,
relu 4x8xf32,297,210,,67,85
saxpy 64xf32,634,634,,,140
sum 4x4xf64,129,,87,,
sum 4x8xf32,238,247,87,,72
sum 8x8xf16,821,,65,,

Copy link

github-actions bot commented Oct 3, 2024

kernels,baseline,linalg,linalg_xdsl,snitch_stream,snrt
conv2d_d1_s1_3x3 4x4xf64,0.22,,0.58,,
ddot 128xf64,0.13,0.13,,0.64,0.22
dense 8x8xf64,0.20,0.18,,0.26,0.26
fill 4x4xf64,0.02,,0.29,,
matmul 4x16x8xf64,0.21,,0.82,,
matmul_transb 4x16x16xf32,0.21,,,0.77,0.79
pooling_nchw_max_d1_s2_3x3 4x4xf64,0.39,,0.65,,
pooling_nchw_sum_d1_s2_3x3 4x4xf64,0.22,,0.66,,
relu 4x4xf64,0.13,,0.25,,
relu 4x8xf32,0.33,0.16,,0.28,0.22
saxpy 64xf32,0.10,0.10,,,0.46
sum 4x4xf64,0.13,,0.20,,
sum 4x8xf32,0.14,0.13,0.20,,0.24
sum 8x8xf16,0.31,,0.26,,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants