Support for GPT-3.5 Tokenizer #14

iRambax · 2023-04-01T20:16:42Z

Hello, thank you for creating the ChatGPTSwift library. I noticed that the tokenizer currently used is the BPE tokenizer for ChatGPT-3, which is different from the Unigram language model tokenizer used by GPT-3.5.

Since we need to manually count the used tokens in stream mode, I was wondering if there is a plan to implement the GPT-3.5 tokenizer in the ChatGPTSwift library.

Thank you for your consideration.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for GPT-3.5 Tokenizer #14

Support for GPT-3.5 Tokenizer #14

iRambax commented Apr 1, 2023

Support for GPT-3.5 Tokenizer #14

Support for GPT-3.5 Tokenizer #14

Comments

iRambax commented Apr 1, 2023