feat: set up gen ai inference attributes for foundational text models in java v1 sdk #19

yiyuan-he · 2024-11-05T23:33:23Z

Description of changes:
Adding auto-instrumentation support for GenAI inference parameters for Java V1 AWS SDK.

The following foundational text models are supported:

AI21 Jamba
Amazon Titan
Anthropic Claude
Cohere Command R
Meta Llama
Mistral AI

Full list can be found here.

Note: Removed support for old Cohere Command models since they already throw a 404 response for Java V1.

Note: There is a backwards compatibility issue with the existing tests as they rely on older versions of the AWS SDK. Since Bedrock is newer resource we are forced to introduce a newer SDK version which implicitly adds in behavior that is not backwards compatible with the existing tests. It seems these dependency issues in the tests are a known issue in upstream which is why they rely on such old AWS resource versions in the first place. Synced with @mxiamxia and we decided to temporarily disable these tests for now as we figure out a long-term solution. We rely on the contract tests to verify the SDK quality for the time being.

New inference parameter attributes added according to OpenTelemetry Semantic Conventions for GenAI attributes:

gen_ai.request.max_tokens
gen_ai.request.temperature
gen_ai.request.top_p
gen_ai.response.finish_reasons
gen_ai.usage.input_tokens
gen_ai.usage.output_tokens

Test Plan:
Set up sample app to make Bedrock Runtime InvokeModel API calls to the supported foundational models and verified the auto-instrumentation attributes.

AI21 Jamba

Amazon Titan

Anthropic Claude

Cohere Command

Meta Llama

Mistral AI

… for v1 sdk

yiyuan-he · 2024-11-13T16:52:58Z

Deprecating in favor of new PR

yiyuan-he force-pushed the gen-ai-support-v1 branch 2 times, most recently from a49de66 to 660191e Compare November 6, 2024 22:41

feat: set up gen ai inference attributes for foundational text models…

073210b

… for v1 sdk

yiyuan-he force-pushed the gen-ai-support-v1 branch 4 times, most recently from e8eaec9 to b428c01 Compare November 8, 2024 03:27

temporarily some tests due to backwards compatability issues

8802b9c

yiyuan-he force-pushed the gen-ai-support-v1 branch from b428c01 to 8802b9c Compare November 8, 2024 03:31

yiyuan-he changed the title ~~feat: set up gen ai inference attributes for foundational text models [not ready]~~ feat: set up gen ai inference attributes for foundational text models [not ready for merge] Nov 8, 2024

yiyuan-he changed the title ~~feat: set up gen ai inference attributes for foundational text models [not ready for merge]~~ feat: set up gen ai inference attributes for foundational text models in java v1 sdk Nov 9, 2024

yiyuan-he closed this Nov 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: set up gen ai inference attributes for foundational text models in java v1 sdk #19

feat: set up gen ai inference attributes for foundational text models in java v1 sdk #19

yiyuan-he commented Nov 5, 2024 •

edited

Loading

yiyuan-he commented Nov 13, 2024

feat: set up gen ai inference attributes for foundational text models in java v1 sdk #19

feat: set up gen ai inference attributes for foundational text models in java v1 sdk #19

Conversation

yiyuan-he commented Nov 5, 2024 • edited Loading

yiyuan-he commented Nov 13, 2024

yiyuan-he commented Nov 5, 2024 •

edited

Loading