Skip to content

Refactor turbomind attention by precomputing rotary embed #3129

Refactor turbomind attention by precomputing rotary embed

Refactor turbomind attention by precomputing rotary embed #3129

Triggered via pull request December 4, 2024 08:37
@irexycirexyc
synchronize #2801
irexyc:rope
Status Success
Total duration 14m 1s
Artifacts

unit-test.yml

on: pull_request
Fit to window
Zoom out
Zoom in