feat: expose max_num_tokens as configurable #96
e2e-nvidia-l4-x1.yml
on: pull_request_target
start-medium-ec2-runner: 1m 43s
stop-medium-ec2-runner: 5s
e2e-medium-workflow-complete: 0s
Annotations
2 errors
e2e-medium-test
Canceling since a higher priority waiting request for 'E2E (NVIDIA L4 x1)-340' exists
e2e-medium-test
The operation was canceled.