How to handle very long prompts? #26

Selbi182 · 2023-06-28T11:07:51Z

Selbi182
Jun 28, 2023

I'm writing a Discord bot meant to summarize the last n messages. The idea is that large group chats often accumulate hundreds of unread messages while someone else is offline and it's a pain to read it all back by hand.

Now, so far I got it set up to summarize the last like 200 messages just fine. However, after that I get an error message because my prompt used too many tokens (more than 4096).

What's the approach to passing larger prompts (like 1000 messages)?

I tried splitting the messages into partitions of 200 each, which sorta works, but GPT treats each partition individually rather than part of a bigger picture and it doesn't keep context.

CJCrafter · 2023-07-03T16:05:27Z

CJCrafter
Jul 3, 2023
Maintainer

So you are limited by token counts. Here are 2 ideas to consider:

Increasing the Token Count

Using a 16k or 32k context model (If you have beta access to it... you need to apply for the 32k context I believe). gpt-3.5-turbo-16k

Note that using these models is much more expensive.

Segmenting the work

In 1 ChatUser.USER message, you can provide, say 100 messages, ask it to summarize, next 100, summarize, etc etc.

Then you can either append these summaries together or use another ChatRequest to smoothly combine these summaries.

Choosing another AI

I hate to recommend this at all, but OpenAI isn't really built for long context windows. They're larger context models are much more expensive and are still too small for certain tasks. If you are willing to build your own API, some models exist online that can handle large amounts of text and are good at summaries (I believe there are models that are used summarize entire books).

5 replies

Selbi182 Jul 3, 2023
Author

Thanks for the extensive feedback! Yeah, the first option isn't financially viable for me, but I also don't want to switch to a different AI altogether. I like the second approach the most.

However, there's one problem with that as well. It should happen rarely, but sometimes people dump half a novel's worth of content into a single message. This has already resulted in exceeding the token limit multiple times. If I knew how many tokens a given prompt would need beforehand, I'd limit it programmatically, but so far I couldn't find a reliable solution. It's not just the word count.

CJCrafter Jul 3, 2023
Maintainer

That's an API I need to add actually, it's called the Tokenizer. I don't know if ChatGPT has a tokenizer api yet though...

You could also try-catch the error and check to see if the issue is tokens.

An even hackier approach is to get the number of words, multiply by 0.75, and that's roughly how many tokens you have.

Selbi182 Jul 3, 2023
Author

I did try the Tokenizer library before, but it doesn't seem to be really accurate either.
int tokenCount = SimpleTokenizer.INSTANCE.tokenize(prompt).length;
It's only off by a few tokens, but that doesn't stop GPT from complaining. try-catching is my current approach, but obviously it's bugfixing with a bandaid.

The last part is weird to me, because so far it seems like the token count is always greater than the word count. Do you mean divide by 0.75 instead perhaps?

CJCrafter Jul 3, 2023
Maintainer

oh yeah words per token not tokens per word.

Selbi182 Jul 3, 2023
Author

Alright, thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to handle very long prompts? #26

{{title}}

Replies: 1 comment 5 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

How to handle very long prompts? #26

Selbi182 Jun 28, 2023

Replies: 1 comment · 5 replies

CJCrafter Jul 3, 2023 Maintainer

Increasing the Token Count

Segmenting the work

Choosing another AI

Selbi182 Jul 3, 2023 Author

CJCrafter Jul 3, 2023 Maintainer

Selbi182 Jul 3, 2023 Author

CJCrafter Jul 3, 2023 Maintainer

Selbi182 Jul 3, 2023 Author

Selbi182
Jun 28, 2023

Replies: 1 comment 5 replies

CJCrafter
Jul 3, 2023
Maintainer

Selbi182 Jul 3, 2023
Author

CJCrafter Jul 3, 2023
Maintainer

Selbi182 Jul 3, 2023
Author

CJCrafter Jul 3, 2023
Maintainer

Selbi182 Jul 3, 2023
Author