Support the case where query_len>context_len in the multi-queries paged attention. #10308
build_and_test.yml
on: pull_request
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
cpp-test-bin
|
706 MB |
|
github-pages
|
5.67 MB |
|
torch-xla-wheels
|
220 MB |
|