Add IREE numerics test for Llama 3.1 8B FP16 TP8 #394

sogartar · 2024-10-31T11:30:09Z

Introduce a Llama 3.1 8B FP16 TP8 test that appears to not have good numerical accuracy. It is compared to an fp64 unsharded torch variant to ensure that the reference is of high accuracy.

Refactor the sharded Llama tests. Increase code reuse and use the TorchGenerator in the toy-sized tests. Use the shard_llm_dataset and export_paged_llm_v1 scripts in the test flow to increase their test coverage.

Introduce a Llama 3.1 8B FP16 TP8 test that appears to not have good numerical accuracy. It is compared to an fp64 unsharded torch variant to ensure that the reference is of high accuracy. Refactor the sharded Llama tests. Increase code reuse and use the TorchGenerator in the toy-sized tests. Use the shard_llm_dataset and export_paged_llm_v1 scripts in the test flow to increase their test coverage.

sogartar · 2024-10-31T11:32:19Z

This PR depends on #383, #384, #386, #390, #391, #392, #393.

sogartar requested review from rsuderman and IanNod October 31, 2024 11:30

sogartar mentioned this pull request Oct 31, 2024

Introduce CausalLMModel intefrace and add IREE numerics test for Llama 3.1 8B FP16 TP8 #375

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add IREE numerics test for Llama 3.1 8B FP16 TP8 #394

Add IREE numerics test for Llama 3.1 8B FP16 TP8 #394

sogartar commented Oct 31, 2024

sogartar commented Oct 31, 2024

Add IREE numerics test for Llama 3.1 8B FP16 TP8 #394

Are you sure you want to change the base?

Add IREE numerics test for Llama 3.1 8B FP16 TP8 #394

Conversation

sogartar commented Oct 31, 2024

sogartar commented Oct 31, 2024