Introduce the `InferenceChatModel` for langchain #206429

pgayvallet · 2025-01-13T13:27:02Z

Summary

Part of #206710

This PR introduces the InferenceChatModel class, which is a langchain chatModel utilizing the inference APIs (chatComplete) under the hood.

Creating instances of InferenceChatModel can either be done by manually importing the class from the new @kbn/inference-langchain package, or by using the new createChatModel API exposes from the inference plugin's start contract.

The main upside of using this chatModel is that the unification and normalization layers are already being taken care of by the inference plugin, making sure that the underlying models are being used with the exact same capabilities. More details on the upsides and reasoning in the associated issue.

Usage

Usage is very straightforward

const chatModel = await inferenceStart.getChatModel({
  request,
  connectorId: myInferenceConnectorId,
  chatModelOptions: {
    temperature: 0.2,
  },
});

// just use it as another langchain chatModel, e.g.
const response = await chatModel.stream('What is Kibana?');
for await (const chunk of response) {
     // do something with the chunk
}

Important

This PR is only adding the implementation, and not wiring it anywhere or using it in any existing code. This is meant to be done in a later stage. Merging that implementation first will allow to have distinct PRs for the integration with search (playground) and security (assistant + other workflows), with proper testing

pgayvallet · 2025-01-13T13:27:17Z

/ci

pgayvallet · 2025-01-13T13:35:07Z

/ci

pgayvallet · 2025-01-13T13:59:04Z

/ci

pgayvallet · 2025-01-13T14:10:40Z

/ci

…angchain

pgayvallet · 2025-01-15T08:11:39Z

/ci

pgayvallet · 2025-01-15T08:22:20Z

/ci

pgayvallet · 2025-01-15T10:36:34Z

/ci

…angchain

pgayvallet · 2025-01-27T14:12:41Z

/ci

pgayvallet · 2025-01-29T09:05:42Z

/ci

pgayvallet · 2025-01-29T09:36:21Z

/ci

…angchain

pgayvallet · 2025-01-29T13:07:50Z

/ci

…angchain

pgayvallet · 2025-01-29T16:54:33Z

/ci

…angchain

pgayvallet · 2025-01-30T07:09:43Z

/ci

elasticmachine · 2025-01-30T08:56:45Z

💚 Build Succeeded

Buildkite Build
Commit: 9acb9f7

Metrics [docs]

Module Count

Fewer modules leads to a faster build time

id	before	after	diff
`inference`	28	34	+6
`observabilityAIAssistantApp`	508	514	+6
`observabilityAiAssistantManagement`	385	391	+6
`searchAssistant`	265	271	+6
`searchPlayground`	285	291	+6
total			+30

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

id	before	after	diff
`@kbn/inference-common`	54	59	+5
`@kbn/inference-langchain`	-	47	+47
`inference`	28	29	+1
total			+53

Public APIs missing exports

Total count of every type that is part of your API that should be exported but is not. This will cause broken links in the API documentation system. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats exports for more detailed information.

id	before	after	diff
`inference`	5	6	+1

Page load bundle

Size of the bundles that are downloaded on every page load. Target size is below 100kb

id	before	after	diff
`inference`	7.5KB	7.6KB	+85.0B

Unknown metric groups

API count

id	before	after	diff
`@kbn/inference-common`	166	174	+8
`@kbn/inference-langchain`	-	48	+48
`inference`	40	42	+2
total			+58

ESLint disabled line counts

id	before	after	diff
`@kbn/router-to-openapispec`	3	2	-1

Total ESLint disabled count

id	before	after	diff
`@kbn/router-to-openapispec`	3	2	-1

History

elasticmachine · 2025-01-30T08:58:43Z

Pinging @elastic/appex-ai-infra (Team:AI Infra)

gsoldevila

Core changes LGTM (review only): Removing an eslint exception is usually good news!

KDKHD

Looks good from GenAi side. Just one comment but it is a nit pick.

KDKHD · 2025-01-30T09:40:51Z

.../platform/plugins/shared/inference/server/chat_complete/utils/convert_upstream_error.test.ts

+// https://github.com/elastic/kibana/blob/49df29609e10429809f3710d6b8b1d5efbc833a7/x-pack/platform/plugins/shared/actions/server/sub_action_framework/sub_action_connector.ts#L200-L202
+// Status code: 400. Message: API Error: model_error - The response was filtered due to the prompt triggering Azure OpenAI's content management policy. Please modify your prompt and retry.
+// status_exception - Received an authentication error status code for request from inference entity id [openai-chat_completion-uuid] status [401]. Error message: [Incorrect API key provided]


Are these extra comments? Or can you clarify why the same text is repeated below.

nice catch thanks, yeah those were copy pasted to create the consts in the test file, will remove the comments

pgayvallet added 8 commits January 13, 2025 14:07

create @kbn/inference-langchain package

999943a

initial implementation

77514a2

add fake id tool to inference-common

8711d3d

export stuff

bc887ca

fix comment->tsdoc

2e57060

move zod-to-json-schema from dev to prod dependencies

b3df181

add some basic readme

1b93869

bootstrap

331bbf7

pgayvallet added release_note:skip Skip the PR/issue when compiling release notes v9.0.0 backport:version Backport to applied version labels v8.18.0 labels Jan 13, 2025

Add reference in the inference's readme

bf7633c

pgayvallet added 4 commits January 15, 2025 08:05

Merge remote-tracking branch 'upstream/main' into kbn-xxx-inference-l…

a610078

…angchain

add support for temperature param

f144a28

extract response conversion

683c5f6

add ref

f2a2df5

pgayvallet force-pushed the kbn-xxx-inference-langchain branch from 04575f1 to f2a2df5 Compare January 15, 2025 07:15

set tool_calls as undefined instead of empty list when not used

05967e3

pgayvallet mentioned this pull request Jan 15, 2025

Adding a compatibility layer between langchain and the inference APIs. #206710

Open

pgayvallet added 3 commits January 17, 2025 09:41

Merge remote-tracking branch 'upstream/main' into kbn-xxx-inference-l…

17b7915

…angchain

add support for model parameter

1f7492a

replay CI commits

27d8f00

pass down logger

0a33655

kibanamachine and others added 3 commits January 27, 2025 14:22

[CI] Auto-commit changed files from 'node scripts/notice'

a8943f8

add tests, starting self review

bc50918

first batch of unit tests for the chat model

66cfefc

[CI] Auto-commit changed files from 'node scripts/notice'

f9442c1

pgayvallet added 3 commits January 29, 2025 10:56

more unit tests

e41b59b

done with unit tests

267a043

Merge remote-tracking branch 'upstream/main' into kbn-xxx-inference-l…

3e5cd91

…angchain

pgayvallet added 3 commits January 29, 2025 15:57

Merge remote-tracking branch 'upstream/main' into kbn-xxx-inference-l…

4293011

…angchain

include status code in inference responses

10ce51a

add mocks + test for unrecoverable errors

2a7ef42

pgayvallet added the Team:AI Infra AppEx AI Infrastructure Team label Jan 29, 2025

pgayvallet added 2 commits January 30, 2025 08:00

Merge remote-tracking branch 'upstream/main' into kbn-xxx-inference-l…

d9f8c57

…angchain

fix unit test

9acb9f7

pgayvallet marked this pull request as ready for review January 30, 2025 08:58

pgayvallet requested review from a team as code owners January 30, 2025 08:58

gsoldevila approved these changes Jan 30, 2025

View reviewed changes

dgieselaar approved these changes Jan 30, 2025

View reviewed changes

KDKHD approved these changes Jan 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce the `InferenceChatModel` for langchain #206429

Introduce the `InferenceChatModel` for langchain #206429

pgayvallet commented Jan 13, 2025 •

edited

Loading

pgayvallet commented Jan 13, 2025

pgayvallet commented Jan 13, 2025

pgayvallet commented Jan 13, 2025

pgayvallet commented Jan 13, 2025

pgayvallet commented Jan 15, 2025

pgayvallet commented Jan 15, 2025

pgayvallet commented Jan 15, 2025

pgayvallet commented Jan 27, 2025

pgayvallet commented Jan 29, 2025

pgayvallet commented Jan 29, 2025

pgayvallet commented Jan 29, 2025

pgayvallet commented Jan 29, 2025

pgayvallet commented Jan 30, 2025

elasticmachine commented Jan 30, 2025

API count

ESLint disabled line counts

Total ESLint disabled count

elasticmachine commented Jan 30, 2025

gsoldevila left a comment

KDKHD left a comment

KDKHD Jan 30, 2025

pgayvallet Jan 30, 2025

Introduce the InferenceChatModel for langchain #206429

Are you sure you want to change the base?

Introduce the InferenceChatModel for langchain #206429

Conversation

pgayvallet commented Jan 13, 2025 • edited Loading

Summary

Usage

Important

pgayvallet commented Jan 13, 2025

pgayvallet commented Jan 13, 2025

pgayvallet commented Jan 13, 2025

pgayvallet commented Jan 13, 2025

pgayvallet commented Jan 15, 2025

pgayvallet commented Jan 15, 2025

pgayvallet commented Jan 15, 2025

pgayvallet commented Jan 27, 2025

pgayvallet commented Jan 29, 2025

pgayvallet commented Jan 29, 2025

pgayvallet commented Jan 29, 2025

pgayvallet commented Jan 29, 2025

pgayvallet commented Jan 30, 2025

elasticmachine commented Jan 30, 2025

💚 Build Succeeded

Metrics [docs]

Module Count

Public APIs missing comments

Public APIs missing exports

Page load bundle

API count

ESLint disabled line counts

Total ESLint disabled count

History

elasticmachine commented Jan 30, 2025

gsoldevila left a comment

Choose a reason for hiding this comment

KDKHD left a comment

Choose a reason for hiding this comment

KDKHD Jan 30, 2025

Choose a reason for hiding this comment

pgayvallet Jan 30, 2025

Choose a reason for hiding this comment

Introduce the `InferenceChatModel` for langchain #206429

Introduce the `InferenceChatModel` for langchain #206429

pgayvallet commented Jan 13, 2025 •

edited

Loading