Fix llamacpp caching by making LlamaCppTokenizer
an outlines Tokenizer
#2195
Job | Run time |
---|---|
16s | |
10m 39s | |
10m 35s | |
8s | |
21m 38s |
LlamaCppTokenizer
an outlines Tokenizer
#2195
Job | Run time |
---|---|
16s | |
10m 39s | |
10m 35s | |
8s | |
21m 38s |