Feat (gptq): optimizing CPU to GPU memory transfer #1342
| Job | Run time |
|---|---|
| | 2m 14s |
| | 1m 49s |
| | 26m 49s |
| | 25m 47s |
| | 12m 31s |
| | 29m 3s |
| | 25m 21s |
| | 17m 26s |
| | 39m 9s |
| | 32m 52s |
| | 17m 22s |
| | 33m 38s |
| | 27m 39s |
| | 15m 42s |
| | 35m 31s |
| | 28m 25s |
| | 17m 3s |
| | 34m 14s |
| | 27m 47s |
| | 13m 56s |
| | 2m 18s |
| | 1m 50s |
| | 26m 31s |
| | 26m 11s |
| | 15m 25s |
| | 28m 51s |
| | 24m 43s |
| | 15m 38s |
| | 39m 3s |
| | 33m 10s |
| | 20m 1s |
| | 33m 52s |
| | 27m 30s |
| | 15m 9s |
| | 34m 48s |
| | 29m 16s |
| | 15m 44s |
| | 34m 7s |
| | 29m 3s |
| | 19m 44s |
| Total | 15h 37m 12s |