Fix llamacpp caching by making LlamaCppTokenizer an outlines Tokenizer #2196
| Job | Run time |
|---|---|
| | 15s |
| | 11m 27s |
| | 11m 13s |
| | 14s |
| Total | 23m 9s |