Update llama tests for block size 32 #696

aviator19941 · 2024-12-14T00:08:42Z

The block_seq_stride default is changing to 32 instead of 16, so this PR updates the tests to use the block_seq_stride flag and the new numpy inputs for block size 32 to benchmark correctly. This PR also removes the decomposed fp16 tests that are not needed anymore.

Signed-off-by: aviator19941 <[email protected]>

aviator19941 added 4 commits December 13, 2024 13:50

Inital update of tests

cb2b7b3

Signed-off-by: aviator19941 <[email protected]>

Fix compile command and input file name

9cb3afa

Signed-off-by: aviator19941 <[email protected]>

Fix 8b tests

a21fed5

Signed-off-by: aviator19941 <[email protected]>

Update tests

3a2d9f7

Signed-off-by: aviator19941 <[email protected]>

aviator19941 requested review from archana-ramalingam and saienduri December 14, 2024 00:08

aviator19941 added 4 commits December 13, 2024 20:13

Fix 70b f16 benchmark test

5f3008e

Signed-off-by: aviator19941 <[email protected]>

Make block_seq_stride in ExportArtifacts optional

2eb7e19

Signed-off-by: aviator19941 <[email protected]>

Add on pull request to check large llama tests

e2172e5

Signed-off-by: aviator19941 <[email protected]>

Remove on pull request test

2188177

Signed-off-by: aviator19941 <[email protected]>

archana-ramalingam approved these changes Dec 14, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update llama tests for block size 32 #696

Update llama tests for block size 32 #696

aviator19941 commented Dec 14, 2024

Update llama tests for block size 32 #696

Are you sure you want to change the base?

Update llama tests for block size 32 #696

Conversation

aviator19941 commented Dec 14, 2024