Fix llamacpp caching by making LlamaCppTokenizer
an outlines Tokenizer
#2198
Job | Run time |
---|---|
11m 4s | |
20s | |
10m 53s | |
17s | |
22m 34s |
LlamaCppTokenizer
an outlines Tokenizer
#2198
Job | Run time |
---|---|
11m 4s | |
20s | |
10m 53s | |
17s | |
22m 34s |