feat(ai): llm provider #539

Open
wants to merge 14 commits into base: develop
Conversation

@kallebysantos (Contributor) commented May 10, 2025

What kind of change does this PR introduce?

feature, refactor

What is the current behaviour?

Currently, the Session only supports self-hosted Ollama or some OpenAI-like provider, with no way to specify an API key.

What is the new behaviour?

This PR applies some refactors to the ai module to support a unified LLM provider API, so it can easily be extended to new providers while exporting a more standardised output format.

Improved TypeScript support

The ai module was heavily refactored to provide better TS hints that dynamically change based on the selected type:

examples

using type gte-small:
image

using type ollama:
image

using type openaicompatible:
image
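
For readers without the screenshots, here is a rough sketch of what these constructor calls might look like. The option names (model, baseURL, apiKey) are assumptions for illustration, not necessarily the exact names used by the PR:

// embedding model: no provider options needed
const embedder = new Supabase.ai.Session('gte-small');

// self-hosted Ollama provider: the model now lives inside the options argument
const llama = new Supabase.ai.Session('ollama', {
  model: 'llama3', // assumed option name
  baseURL: 'http://localhost:11434', // assumed option name
});

// OpenAI-compatible provider, including an API key
const gpt = new Supabase.ai.Session('openaicompatible', {
  model: 'gpt-4o-mini', // assumed option name
  baseURL: 'https://api.openai.com/v1', // assumed option name
  apiKey: Deno.env.get('OPENAI_API_KEY'), // assumed option name
});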

Automatically infer AsyncGenerator type when stream: true

This applies to all providers, Ollama & OpenAI-compatible:

image
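
As a sketch (using the llama session from the example above and the Result tuple described in the next section), the success part of the tuple is inferred as an AsyncGenerator as soon as stream: true is passed:

// with `stream: true` the editor infers the success part as an AsyncGenerator
const [stream, streamError] = await llama.run('Tell me a short story', { stream: true });

// without it, the success part is a single output object
const [single, singleError] = await llama.run('Tell me a short story', { stream: false });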

Improved error handling support

To enforce error checking, the ai module has been refactored to follow a Go-like Result pattern. This means that Session.run() now returns a tuple array of [success, error]. This result is compatible with TS pattern matching, so it provides complete LSP feedback.

examples

Non stream

Result type def

image

Checking the error automatically validates the success part

image
image
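
In code, the non-streaming flow sketched from the type definitions further down might look like this (prompt and option shapes are assumptions):

const [output, error] = await llama.run('Summarise the release notes', { stream: false });

if (error) {
  // `error` is the SessionError part of the tuple
  console.error(error.message, error.inner);
} else {
  // once the error is ruled out, the success part narrows and is safe to use
  console.log(output.value);
  console.log(output.usage.totalTokens);
}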

Stream

When stream: true, the first result handles errors that may occur before the AsyncGenerator is created.
Each incoming message is then a result as well, so users can apply error handling while streaming.

Result type def
image

Streaming type def
image
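
Putting both together, a streaming consumer might look roughly like this, assuming each yielded message uses the same [success, error] shape:

const [stream, error] = await llama.run('Write a haiku about Postgres', { stream: true });

if (error) {
  // failures before the AsyncGenerator is created surface here
  console.error(error.message);
} else {
  for await (const [chunk, chunkError] of stream) {
    if (chunkError) {
      // failures in the middle of the stream surface as results too
      console.error(chunkError.message);
      break;
    }
    console.log(chunk.value);
  }
}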

Common response and Usage metrics

Since all LLM providers must implement a common interface, they now also share a unified response object.

response definitions

Success part

// This object represents a successful LLM output, where inner is the raw value returned by the provider
export interface ILLMProviderOutput<T = object> {
  value?: string; // A unified way to get the generated value
  usage: {
    inputTokens: number;
    outputTokens: number;
    totalTokens: number;
  };
  inner: T;
}

Error part

// This object represents an error output, where inner is the original error value
export type SessionError<T = object | string> = {
  message: string;
  inner: T;
};
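
For illustration only, a caller can stay provider-agnostic by sticking to the common fields and only touching inner when provider-specific data is needed:

function logOutput(output: ILLMProviderOutput) {
  // the common fields look the same for every provider
  console.log(`value: ${output.value ?? '<empty>'}`);
  const { inputTokens, outputTokens, totalTokens } = output.usage;
  console.log(`tokens: ${inputTokens} in / ${outputTokens} out / ${totalTokens} total`);
}

const [result, error] = await gpt.run('Hello there', { stream: false });
if (!error) {
  logOutput(result);
  // `inner` still exposes the raw provider response when needed
  console.log(result.inner);
}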

Tested OpenAI-compatible providers

missing

  • Anthropic: I don't have a free account for it
  • Grok: I don't have a free account for it
  • Amazon Bedrock: I don't have a free account for it

ideas

  • Gemini: maybe implement a custom provider for this one

- LLM Session is a wrapper to handle LLM inference based on the selected
provider
- Extracting JSON parsers to a separate file
- Moving LLM stream related code to a separate folder
- Applying LLM provider interfaces to implement the Ollama provider
- Applying LLM provider interfaces to implement the 'openaicompatible'
mode
- Improving Typescript support for dynamic suggestion based on the
selected Session type.
- Breaking change: LLM models must now be defined inside the `options`
argument; this allows better TypeScript checking and makes the API
easier to extend.

- There's no need to check if `inferenceHost` env var is defined, since
we can now switch between different LLM providers. Instead, we can
enable LLM support if the given type is an allowed provider.
- Improving typescript with conditional output types based on the
selected provider
- Defining common properties for LLM providers like `usage` metrics and
simplified `value`
- OpenAI uses a different streaming alternative that ends with `[DONE]`
- Applying 'pattern matching' and 'Result pattern' to improve error
handling. It enforces that users must first check for errors before
consuming the message
- It ensures that only valid strings with content can be embedded
- Fix wrong input variable name.
- Accepting the 'opts' param as optional, applying null safety.
- Improving tests by checking the result types: success or errors
- Testing invalid `gte-small` type name
@kallebysantos kallebysantos marked this pull request as ready for review May 15, 2025 17:59