EdgeDB AI provider #1121
Conversation
Instead of using the async generator and converting the text back into a byte stream, just pipe the response from the RAG request directly to the response in `streamRag`.
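A minimal sketch of that approach, assuming a plain `fetch` call against an illustrative `/ai/rag` endpoint (the path, request shape, and function signature here are assumptions, not the extension's actual API):

```ts
// Hedged sketch: pipe the upstream SSE body straight through instead of
// decoding it into an async generator and re-encoding it into bytes.
// The endpoint path and request shape below are illustrative assumptions.
export async function streamRag(
  baseUrl: string,
  body: { query: string; model: string },
): Promise<Response> {
  const upstream = await fetch(new URL("/ai/rag", baseUrl), {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Accept: "text/event-stream",
    },
    body: JSON.stringify({ ...body, stream: true }),
  });

  if (!upstream.ok || upstream.body == null) {
    throw new Error(`RAG request failed with status ${upstream.status}`);
  }

  // No decode/re-encode round trip: hand the byte stream to the caller as-is.
  return new Response(upstream.body, {
    headers: { "Content-Type": "text/event-stream" },
  });
}
```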
Ensures that the chunks are properly parsed and emitted as they come in, splitting multiple events that arrive together in one chunk, and combining partial events from multiple chunks into a single emitted value.
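Not the PR's actual implementation, but the described behavior roughly corresponds to an SSE splitter like the following, which buffers partial events across chunks and splits chunks that contain several events:

```ts
// Illustrative sketch of the chunk handling described above: SSE events are
// separated by a blank line, so buffer incoming text, emit every complete
// event, and keep the trailing partial event for the next chunk.
function sseEventSplitter(): TransformStream<Uint8Array, string> {
  const decoder = new TextDecoder();
  let buffer = "";

  return new TransformStream<Uint8Array, string>({
    transform(chunk, controller) {
      buffer += decoder.decode(chunk, { stream: true });
      const events = buffer.split("\n\n");
      buffer = events.pop() ?? ""; // partial event: wait for the next chunk
      for (const event of events) {
        if (event.trim() !== "") controller.enqueue(event);
      }
    },
    flush(controller) {
      if (buffer.trim() !== "") controller.enqueue(buffer);
    },
  });
}
```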
I'd like @deepbuzin to also look at this
Here are the examples of how we can use this with the Vercel SDK:
Yeah it looks like there is some expectation that we return some warnings.
We can skip this stuff for now.
Looks like the Mistral provider uses
I think for now we support just the API that our AI extension exposes, and we treat it as if we're a really restricted LLM that has built-in RAG. Developers will be configuring the AI extension itself to take advantage of features in the underlying model, but we'll need to expose those features directly from the extension in order for this provider to take advantage of them.
```ts
debug(
  ` - CLI found in PATH at: ${location} (resolved to: ${actualLocation})`,
);
if (locations) {
```
Small nit: maybe just treat this as an early return instead of wrapping this whole section in an `if`?
```ts
if (locations == null) {
  debug(" - No CLI found in PATH.");
  return null;
}
```
I'm annoyed that we aren't catching this in our tests 🫤
The RAG provider should be fine, but I think we need to extend the capabilities of the AI binding. It should support other parameters besides the query, and also function calling.
I created a new cleaner PR for this: #1129.
I think we need a call to discuss what we want to support with the provider and extension (or maybe I'm just not aligned with the goals, and the provider should just be built to support the current capabilities of the extension).
Examples of how we can use this with the Vercel SDK:
- the completion route,
- the chat route
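For reference, a chat route wired up through the Vercel AI SDK could look roughly like the sketch below; the `edgedb` provider factory, the package name, and the `"rag"` model id are placeholders rather than the provider's actual exports:

```ts
// Hypothetical Next.js route handler using the Vercel AI SDK's streamText.
// The provider import and model id are assumptions for illustration only.
import { streamText } from "ai";
import { edgedb } from "@edgedb/vercel-ai-provider";

export async function POST(req: Request) {
  const { messages } = await req.json();

  const result = await streamText({
    model: edgedb("rag"),
    messages,
  });

  // Stream the generated text back to the client.
  return result.toDataStreamResponse();
}
```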
This PR has basic support for completion and chat, matching what we also support in the edgedb-js AI lib. I made it work with the AI extension we have and its current capabilities. It can be used as is, but it definitely needs improvements in order to be fully compatible with the Vercel `LanguageModelV1` provider interface.

TODO: I need to add a README and update some identifiers and interface names throughout the provider, since we are not really a language model.
QUESTIONS:
We don't support any of the LLM settings (`maxTokens`, `temperature`, `topP`, `topK`, `frequencyPenalty`, `presencePenalty`, `stopSequences`, `responseFormat`, `seed`, etc.), so I don't know whether we want to add support for these, or whether I should return an error from the provider if the user tries to set any of them.
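One possible middle ground, sketched here as an assumption rather than a settled decision, is to surface these as the kind of warnings mentioned earlier instead of hard errors. The warning shape below mirrors my understanding of the Vercel SDK's `unsupported-setting` warnings and is not copied from this PR:

```ts
// Hedged sketch: collect warnings for settings the EdgeDB AI extension does
// not expose, instead of rejecting the call outright. The warning shape is
// an assumption modeled on the Vercel AI SDK, not the provider's real types.
type CallWarning = {
  type: "unsupported-setting";
  setting: string;
  details?: string;
};

function collectUnsupportedSettingWarnings(
  options: Record<string, unknown>,
): CallWarning[] {
  const unsupported = [
    "maxTokens",
    "temperature",
    "topP",
    "topK",
    "frequencyPenalty",
    "presencePenalty",
    "stopSequences",
    "responseFormat",
    "seed",
  ];

  return unsupported
    .filter((setting) => options[setting] != null)
    .map((setting) => ({
      type: "unsupported-setting" as const,
      setting,
      details: "The EdgeDB AI extension does not expose this parameter.",
    }));
}
```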
Do we want to support a setting for `safe_prompt`? Do we support structured outputs? Do we have support for tools?
The output our RAG returns doesn't include:

and some other things that are used/returned by the Vercel `LanguageModelV1` provider. So I return 0 for those 2 values from the provider for now.
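Assuming (since the original list did not survive) that the two values are the usage counters a Vercel `LanguageModelV1` result carries, the placeholder would amount to something like:

```ts
// Assumption: the two missing values are usage token counts. The RAG response
// does not report them, so the provider returns zeros as placeholders for now.
function stubUsage(): { promptTokens: number; completionTokens: number } {
  return { promptTokens: 0, completionTokens: 0 };
}
```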