
OpenAI: instrument embeddings and chat completions #2398

Merged: 77 commits into openai_instrumentation on Feb 14, 2024

Conversation

@kaylareopelle (Contributor) commented on Jan 18, 2024

Initial PR for OpenAI instrumentation. We're merging this into the openai_instrumentation branch so that our initial instrumentation rollout is preserved as a single historical PR, which should make it easier for others to reference in the future.

Co-authored-by: @hannahramadan
Closes #2403
Closes #2404

This PR includes:

  • Basic instrumentation setup for the ruby-openai gem
    • Support for versions 3.4.0+, based on the versions currently in use by our customers (internal usage dashboard)
    • Create an ai multiverse runner group
    • Add ai/ruby_openai testing to the appropriate CI workflows
  • Instrumentation for OpenAI::Client#embeddings (a rough sketch of the general hook pattern follows this list):
    • Record a segment on every method invocation
    • Record a metric on every method invocation
    • Create LlmEmbedding events
  • Instrumentation for OpenAI::Client#chat, known in the spec as chat completions:
    • Record a segment on every method invocation
    • Record a metric on every method invocation
    • Create LlmChatCompletionSummary events
    • Create LlmChatCompletionMessage events
  • Add attributes to LlmEmbedding and LlmChatCompletionSummary events from the Net::HTTP response headers
  • Set llm: true as an attribute on all transaction events with LLM-related segments
  • Test using mocked responses from real OpenAI requests
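
As referenced above, the hook pattern behind these bullets looks roughly like the following. This is a hedged sketch rather than the agent's actual implementation; the module and helper names are invented for illustration.

module OpenAIChatHookSketch
  def chat(parameters: {})
    response = super                           # call ruby-openai's real #chat
    record_llm_telemetry(parameters, response) # hypothetical: segment, metric, Llm* events
    response                                   # hand the caller back the untouched response
  end

  private

  def record_llm_telemetry(parameters, response)
    # Placeholder: in the real instrumentation, this is where the segment is
    # finished, the metric recorded, and LlmChatCompletionSummary/Message
    # events built from `parameters` and `response`.
  end
end

OpenAI::Client.prepend(OpenAIChatHookSketch) if defined?(OpenAI::Client)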

It also makes the following updates to the Llm-namespaced classes/modules:

  • Update the LlmEvent and other Llm classes to use the attribute key strings expected by the UI
  • Add an llm_event attribute to AbstractSegment
  • Organize attribute names to reflect the order of the spec

This PR does not contain the following GA-required tasks:

  • Changelog entry
  • Tests for the Net::HTTP response header assignment to the LLM events
  • Tests that validate the OpenAI instrumentation correctly assigns LLM event attribute values
  • Error tracing attributes
  • Feedback API
  • The overall ai_monitoring.enabled configuration option

These items will be opened in separate PRs that we'll merge into the openai_instrumentation feature branch before GA.

@kaylareopelle kaylareopelle self-assigned this Jan 22, 2024
@kaylareopelle kaylareopelle changed the title OpenAI instrumentation scaffold OpenAI instrumentation Jan 24, 2024
kaylareopelle and others added 20 commits January 29, 2024 09:30
Some attribute names are expected upstream to include periods, but we can't use periods in setter/getter method names. To resolve this, a new ATTRIBUTE_NAME_EXCEPTIONS constant, accompanied by an attribute_name_exceptions method, stores the string value of each such attribute name.

Then, replace_attr_with_string returns the correct string name when we're creating the final event_attributes hash.
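
For illustration, a minimal sketch of that pattern follows; the mapping entries are assumed examples rather than the constant's actual contents.

# Sketch only; these mappings are examples, not the real ATTRIBUTE_NAME_EXCEPTIONS.
ATTRIBUTE_NAME_EXCEPTIONS = {
  response_model: 'response.model',         # stored via attr_accessor :response_model
  request_max_tokens: 'request.max_tokens'  # stored via attr_accessor :request_max_tokens
}.freeze

# Used while building the final event_attributes hash: methods keep
# period-free names, the emitted keys use the UI's dotted strings.
def replace_attr_with_string(attr)
  ATTRIBUTE_NAME_EXCEPTIONS.fetch(attr, attr.to_s)
end

replace_attr_with_string(:response_model) # => "response.model"
replace_attr_with_string(:vendor)         # => "vendor"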
Embedding and ChatCompletionSummary both need guids for their ids. ChatCompletionMessage does not. Let's set the guid as the default, and allow ChatCompletionMessage to override the guid.
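
A minimal sketch of that default-and-override arrangement, with simplified class shapes and SecureRandom standing in for however the agent actually generates guids:

require 'securerandom'

class LlmEventSketch
  attr_reader :id

  def initialize(id: nil)
    @id = id || SecureRandom.uuid # Embedding and ChatCompletionSummary keep this generated guid
  end
end

class ChatCompletionMessageSketch < LlmEventSketch
  def initialize(id: nil)
    super
    @id = id # override: a message uses the id it was given, no generated guid
  end
end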
Our instrumentation was changing the response when called in the begin block.
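
In other words, the hook should hand the caller back the original response untouched; a sketch of the general shape inside the prepended module (helper name hypothetical):

  def embeddings(parameters: {})
    response = super
    process_response_for_telemetry(response.dup) # hypothetical helper; work on a copy
    response                                     # return the original, unmodified response
  end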

executes do
if use_prepend?
if OPENAI_VERSION >= Gem::Version.new('5.0.0')
Contributor:
Cool to see the support for older versions.
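
A hedged sketch of what a gate like that typically selects between; the module names here are placeholders rather than the agent's actual prepend targets.

module OpenAIHookCurrent; end # stand-in for the hook used with ruby-openai 5.0.0+
module OpenAIHookLegacy; end  # stand-in for the hook used with 3.4.0 through 4.x

spec = Gem.loaded_specs['ruby-openai']
openai_version = spec ? spec.version : Gem::Version.new('0')

hook =
  if openai_version >= Gem::Version.new('5.0.0')
    OpenAIHookCurrent
  else
    OpenAIHookLegacy
  end

OpenAI::Client.prepend(hook) if defined?(OpenAI::Client)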

vendor: VENDOR,
conversation_id: conversation_id,
api_key_last_four_digits: parse_api_key,
request_max_tokens: parameters[:max_tokens] || parameters['max_tokens'],
Contributor:
What leads the hash to have symbols or strings for keys, and are there ever both types of key present in the same hash? It might be handy to have a helper method like this:

request_max_tokens: parameter_value(parameters, :max_tokens)

...

def parameter_value(parameters, value)
  parameters[value] || parameters[value.to_s]
end

and if we can rely on the hash keys being all symbols or all strings, we could further enhance the helper to memoize which key type is involved.
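
A sketch of that memoized variant, assuming (incorrectly, per the reply below) that every key in a given parameters hash shares one type:

def parameter_value(parameters, key)
  @parameters_use_symbol_keys = parameters.each_key.first.is_a?(Symbol) if @parameters_use_symbol_keys.nil?
  @parameters_use_symbol_keys ? parameters[key] : parameters[key.to_s]
end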

Contributor:
Both Strings and Symbols are accepted as keys, so it's up to the user which they use. And yes, both types can be mixed and matched in the same request.

Side note: We did performance testing of || vs kind_of? vs is_a? and found || to be the fastest by about 1.2x, which is why we went with an "or" check rather than creating a helper method that would decide which key type to use.
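
Not the benchmark the team ran, but roughly how the comparison could be reproduced with benchmark-ips:

require 'benchmark/ips'

params = { 'max_tokens' => 100 } # string keys force the fallback path

Benchmark.ips do |x|
  x.report('|| fallback')  { params[:max_tokens] || params['max_tokens'] }
  x.report('is_a? branch') { params.keys.first.is_a?(Symbol) ? params[:max_tokens] : params['max_tokens'] }
  x.compare!
end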

end

def parse_api_key
'sk-' + headers['Authorization'][-4..-1]
Contributor:
Hmm... it looks like the ability to use [-4..] instead of [-4..-1] wasn't introduced until Ruby v2.6.
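
For reference, the two spellings side by side; the endless-range form needs Ruby 2.6+:

token = 'Bearer sk-abc123wxyz' # made-up example value
'sk-' + token[-4..-1] # => "sk-wxyz", works on all supported Rubies
'sk-' + token[-4..]   # => "sk-wxyz", endless range, Ruby 2.6+ only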

def attribute_name_exceptions
# TODO: OLD RUBIES < 2.6
# Hash#merge accepts multiple arguments in 2.6, so we can reduce this
# to a single Hash#merge call with two arguments at that point
Contributor:
This is where better granular perf tests would help. If there's a significant performance gain to be had, it'd be worth writing the code both ways:

if RUBY_VERSION >= '2.6'
  new_way
else
  old_way
end

Contributor:
Great callout! Ran a perf test a few times and item.merge(item1).merge(item2) is consistently ~1.25x slower than a single multi-argument merge. We will break this out into a conditional.

Contributor:
Done! I wanted to use a ternary operator, but it got super long: 36d6cab
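
A sketch of the shape that conditional might take; the hash names and values below are placeholders rather than the agent's actual attribute groups:

base_attributes     = { 'vendor' => 'openai' }
request_attributes  = { 'request.max_tokens' => 100 }
response_attributes = { 'response.model' => 'gpt-4' }

merged =
  if Gem::Version.new(RUBY_VERSION) >= Gem::Version.new('2.6')
    base_attributes.merge(request_attributes, response_attributes)       # multi-argument merge, 2.6+
  else
    base_attributes.merge(request_attributes).merge(response_attributes) # chained merges for older Rubies
  end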

sequence: index,
completion_id: summary_id,
vendor: VENDOR,
is_response: false
Contributor:
According to the spec, this shouldn't be set to false; the attribute should be omitted entirely when it would be false:

set to True if a message is the result of a chat completion and not an input message - omitted in False cases
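
One way to honor that, sketched with invented method and argument names:

def message_attributes(index:, summary_id:, from_response:)
  attrs = { sequence: index, completion_id: summary_id, vendor: 'placeholder-vendor' }
  attrs[:is_response] = true if from_response # key omitted entirely in the false case
  attrs
end

message_attributes(index: 0, summary_id: 'abc', from_response: false) # no :is_response key at all
message_attributes(index: 1, summary_id: 'abc', from_response: true)  # includes is_response: true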

Contributor Author:
Great callout! Thank you!

Contributor:
This has been updated! cdd42f6

@hannahramadan hannahramadan changed the base branch from dev to openai_instrumentation February 13, 2024 22:34
CHANGELOG.md: outdated review thread, resolved
@kaylareopelle kaylareopelle changed the title OpenAI instrumentation OpenAI: instrument embeddings and chat completions Feb 14, 2024
@kaylareopelle kaylareopelle merged commit faf9ab9 into openai_instrumentation Feb 14, 2024
27 of 28 checks passed
@hannahramadan hannahramadan deleted the openai branch April 12, 2024 19:28

Successfully merging this pull request may close these issues:

  • OpenAI: Instrument chat completions
  • OpenAI: Instrument embeddings
4 participants