Make @RegisterAiService beans request scoped by default #96

geoand · 2023-12-05T13:47:31Z

This is done because otherwise the chat memory
does not get cleared properly.

Fixes: #95

Do we need a docs update to go along with this?

geoand · 2023-12-05T14:09:43Z

@jmartisk pinecone test still failing after merging your PR :)

jmartisk · 2023-12-05T14:28:32Z

That must be because https://github.com/quarkiverse/quarkus-langchain4j/actions/runs/7101672400 and https://github.com/quarkiverse/quarkus-langchain4j/actions/runs/7101669612 were triggered at about the same time.
It guards against running the matrix together within a single build, but it doesn't guard about somebody triggering two independent builds at once by PRs coming from the main repo...

jmartisk · 2023-12-05T14:30:47Z

We'll need to submit PRs from forks unless absolutely necessary :)

geoand · 2023-12-05T14:33:03Z

Ah gotcha, thanks!

geoand · 2023-12-05T14:52:03Z

I think I'm gonna take out the pinecone test altogether and only enable it in the nightly build.
It's too big of a toll on test parallelism for the sake of just one test

cescoffier · 2023-12-05T14:56:10Z

I’m not sure about that. Typically, rest clients are application-scoped. We should change how we handle memory instead.

geoand · 2023-12-05T15:00:06Z

Unfortunately it's not easy (or possible at all).

The basic problem is that memory is also used when using AiService.create(), and if you try to change the context or the memory to fit the @RegisterAiService model, you end up breaking the create model.
I tried a few variations of this, and none worked well - the only thing that worked was the request scoped service - whose only real problem really is when you are not on a request (which is not really like, but even if it is, one can use @ActivateRequestScope).

jmartisk · 2023-12-05T15:02:22Z

I think I'm gonna take out the pinecone test altogether and only enable it in the nightly build. It's too big of a toll on test parallelism for the sake of just one test

Yeah I kinda agree. You can disable it by removing the propagation of secrets into workflows. The test runs when it sees a non-empty value of PINECONE_API_KEY.

cescoffier · 2023-12-05T15:40:54Z

Are you sure this will continue to work with the web socket? It will create many instances of the AI service, which is what we want to avoid.

I do not get the issue with the creation part. I would need to have a look. Maybe we need to prioritize the scope thingy.

geoand · 2023-12-05T15:43:38Z

Are you sure this will continue to work with the web socket?

See the changes to the examples :). Essentially it works if you use context propagation

geoand · 2023-12-05T15:44:53Z

I do not get the issue with the creation part. I would need to have a look. Maybe we need to prioritize the scope thingy.

What happens is that although the ChatMemoryProvider closes and removes all content, the ChatMemory of AiServicesContext still uses them.
If you try to remove them from there as well, you'll get a bunch of breaking tests that use AiServices.create

cescoffier · 2023-12-05T16:02:53Z

What if we use a proxy/facade in-between that uses the ARC API to instantiate the memory in the proper context?

cescoffier · 2023-12-05T16:05:49Z

Also, what about having AiServicesContext in a special scope and not the AIService?

geoand · 2023-12-05T16:07:02Z

I thought of that. The problem is producing it, but I'll look into it

geoand · 2023-12-06T06:55:40Z

@jmartisk the nightly build with the pinecone test is failing very weirdly: https://github.com/quarkiverse/quarkus-langchain4j/actions/runs/7108550795/job/19352035493

jmartisk · 2023-12-06T08:40:11Z

Holy * that test is obnoxious. Looking into it...

jmartisk · 2023-12-06T09:07:30Z

Nice, Pinecone has changed the response format and they now don't include an empty metadata object in the response if a vector has no metadata, causing the NPE. I will send a PR to adjust it on our side....

geoand · 2023-12-06T09:12:45Z

Thanks!

geoand · 2023-12-06T10:23:51Z

Also, what about having AiServicesContext in a special scope and not the AIService?

This is what I am trying now, but so far it has subtle issues, not to mention that the code is a lot more complicated than what I have pushed here.
I do believe that what I pushed is a fine solution, there are no real downsides of having the client be request scoped.

geoand · 2023-12-06T12:24:47Z

Also, the more I think about it, the more it makes sense to me that the context (which is an entity unknown to the user) should have the same scope as the bean itself.

geoand · 2023-12-06T14:10:19Z

I have updated the PR with something slightly better.

As for the websockets, the scope is opened, but it is never closed unfortunately... So for those samples I made the services singleton.

If we agree on this approach, then I'll need to update the docs.

cescoffier · 2023-12-07T07:22:27Z

Ah Ah! I knew web sockets would be problematic.

I won’t have time to look before next week (traveling). So let’s go with this approach and extend the doc, for now.

We need to go back to the whiteboard to find a proper way to deal with this.

geoand · 2023-12-07T07:23:42Z

We need to go back to the whiteboard to find a proper way to deal with this.

I totally agree

cescoffier

Just a typo.

core/runtime/src/main/java/io/quarkiverse/langchain4j/RegisterAiService.java

This is done because otherwise the chat memory does not get cleared properly. Furthermore, add a way to remove memory entries when the service goes out of scope Fixes: #95

geoand requested a review from a team as a code owner December 5, 2023 13:47

geoand mentioned this pull request Dec 5, 2023

ChatMemory is persisted over request boundaries #95

Closed

geoand force-pushed the #95 branch from 34386f7 to e5e20b8 Compare December 5, 2023 13:48

geoand force-pushed the #95 branch 2 times, most recently from a736eed to cf17b8c Compare December 6, 2023 14:08

geoand force-pushed the #95 branch 2 times, most recently from 749fb9a to ee75189 Compare December 7, 2023 07:14

geoand force-pushed the #95 branch 2 times, most recently from ae6d811 to 645cc73 Compare December 7, 2023 08:07

geoand requested a review from cescoffier December 7, 2023 09:19

cescoffier approved these changes Dec 7, 2023

View reviewed changes

core/runtime/src/main/java/io/quarkiverse/langchain4j/RegisterAiService.java Outdated Show resolved Hide resolved

Make @RegisterAiService beans request scoped by default

e5d30ee

This is done because otherwise the chat memory does not get cleared properly. Furthermore, add a way to remove memory entries when the service goes out of scope Fixes: #95

geoand force-pushed the #95 branch from 1fe1446 to e5d30ee Compare December 7, 2023 10:06

geoand merged commit b72b2db into main Dec 7, 2023
2 checks passed

geoand deleted the #95 branch December 7, 2023 12:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make @RegisterAiService beans request scoped by default #96

Make @RegisterAiService beans request scoped by default #96

geoand commented Dec 5, 2023 •

edited

Loading

geoand commented Dec 5, 2023

jmartisk commented Dec 5, 2023

jmartisk commented Dec 5, 2023

geoand commented Dec 5, 2023

geoand commented Dec 5, 2023

cescoffier commented Dec 5, 2023

geoand commented Dec 5, 2023 •

edited

Loading

jmartisk commented Dec 5, 2023

cescoffier commented Dec 5, 2023

geoand commented Dec 5, 2023

geoand commented Dec 5, 2023

cescoffier commented Dec 5, 2023

cescoffier commented Dec 5, 2023

geoand commented Dec 5, 2023

geoand commented Dec 6, 2023

jmartisk commented Dec 6, 2023

jmartisk commented Dec 6, 2023

geoand commented Dec 6, 2023

geoand commented Dec 6, 2023

geoand commented Dec 6, 2023

geoand commented Dec 6, 2023

cescoffier commented Dec 7, 2023

geoand commented Dec 7, 2023 •

edited

Loading

cescoffier left a comment

Make @RegisterAiService beans request scoped by default #96

Make @RegisterAiService beans request scoped by default #96

Conversation

geoand commented Dec 5, 2023 • edited Loading

geoand commented Dec 5, 2023

jmartisk commented Dec 5, 2023

jmartisk commented Dec 5, 2023

geoand commented Dec 5, 2023

geoand commented Dec 5, 2023

cescoffier commented Dec 5, 2023

geoand commented Dec 5, 2023 • edited Loading

jmartisk commented Dec 5, 2023

cescoffier commented Dec 5, 2023

geoand commented Dec 5, 2023

geoand commented Dec 5, 2023

cescoffier commented Dec 5, 2023

cescoffier commented Dec 5, 2023

geoand commented Dec 5, 2023

geoand commented Dec 6, 2023

jmartisk commented Dec 6, 2023

jmartisk commented Dec 6, 2023

geoand commented Dec 6, 2023

geoand commented Dec 6, 2023

geoand commented Dec 6, 2023

geoand commented Dec 6, 2023

cescoffier commented Dec 7, 2023

geoand commented Dec 7, 2023 • edited Loading

cescoffier left a comment

Choose a reason for hiding this comment

geoand commented Dec 5, 2023 •

edited

Loading

geoand commented Dec 5, 2023 •

edited

Loading

geoand commented Dec 7, 2023 •

edited

Loading