Skip to content

Refactor turbomind attention by precomputing rotary embed #3155

Refactor turbomind attention by precomputing rotary embed

Refactor turbomind attention by precomputing rotary embed #3155

Triggered via pull request December 9, 2024 13:47
@irexycirexyc
synchronize #2801
irexyc:rope
Status Failure
Total duration 3m 48s
Artifacts

unit-test.yml

on: pull_request
Fit to window
Zoom out
Zoom in

Annotations

1 error
unit_test
Process completed with exit code 2.