CUDA: fix broken oob check for FA vec f32 kernel (#7904) #43
Job | Run time |
---|---|
10m 47s | |
55m 5s | |
13m 48s | |
50m 27s | |
1h 7m 51s | |
51m 9s | |
33m 50s | |
36m 45s | |
38m 8s | |
13m 44s | |
14m 20s | |
6h 25m 54s |
Job | Run time |
---|---|
10m 47s | |
55m 5s | |
13m 48s | |
50m 27s | |
1h 7m 51s | |
51m 9s | |
33m 50s | |
36m 45s | |
38m 8s | |
13m 44s | |
14m 20s | |
6h 25m 54s |