[AQUA][GPT-OSS] Add Shape-Specific Env Config for GPT-OSS Models in AQUA Deployment Config Reader #1244

mrDzurb · 2025-08-10T00:43:05Z

Description:

Summary
This PR updates the AQUA deployment config reader to support shape-specific environment variables for GPT-OSS model deployments. The change introduces an env section in the deployment configuration for A-series GPU shapes, allowing AQUA handlers to return custom environment variables alongside existing parameter sets.

Example
A request to:

/aqua/deployments/{model_ocid}/params?instance_shape=BM.GPU4.8

will now return:

{
  "data": [
    "--trust-remote-code",
    "--gpu-memory-utilization 0.98",
    "--enforce-eager",
    "--max-num-seqs 32",
    "--max_model_len 130000",
    "--dtype bfloat16"
  ],
  "env": {
    "VLLM_ATTENTION_BACKEND": "TRITON_ATTN_VLLM_V1"
  }
}

This change is required to ensure GPT-OSS model deployments on A-series shapes use the correct VLLM attention backend (TRITON_ATTN_VLLM_V1), which improves compatibility and performance for these hardware configurations.

…onfig Reader

github-actions · 2025-08-10T01:14:27Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2025-08-10T05:44:35Z

📌 Cov diff with main:

📌 Overall coverage:

mayoor · 2025-08-10T16:35:29Z

ads/aqua/common/utils.py

@@ -997,6 +997,45 @@ def get_container_params_type(container_type_name: str) -> str:
        return UNKNOWN


+@lru_cache(maxsize=None)


we need expiration, otherwise if there is an update to the config, it might no reflect

darenr

I never knew about casefold()!

github-actions · 2025-08-10T17:16:26Z

📌 Cov diff with main:

📌 Overall coverage:

Add Shape-Specific Env Config for GPT-OSS Models in AQUA Deployment C…

d5653e5

…onfig Reader

mrDzurb requested review from darenr, mayoor, VipulMascarenhas, qiuosier and ahosler as code owners August 10, 2025 00:43

oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Aug 10, 2025

darenr previously approved these changes Aug 10, 2025

View reviewed changes

Merge branch 'main' into aqua_md_env_config

097a0ec

mayoor reviewed Aug 10, 2025

View reviewed changes

Removes LRU cache for getting config method

1aac73d

mrDzurb dismissed darenr’s stale review via 1aac73d August 10, 2025 16:38

mrDzurb requested review from darenr and mayoor August 10, 2025 16:39

mayoor approved these changes Aug 10, 2025

View reviewed changes

darenr approved these changes Aug 10, 2025

View reviewed changes

Merge branch 'main' into aqua_md_env_config

2ffe0d2

mrDzurb merged commit ca17053 into main Aug 10, 2025
22 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AQUA][GPT-OSS] Add Shape-Specific Env Config for GPT-OSS Models in AQUA Deployment Config Reader #1244

[AQUA][GPT-OSS] Add Shape-Specific Env Config for GPT-OSS Models in AQUA Deployment Config Reader #1244

Uh oh!

mrDzurb commented Aug 10, 2025

Uh oh!

github-actions bot commented Aug 10, 2025

Uh oh!

github-actions bot commented Aug 10, 2025

Uh oh!

mayoor Aug 10, 2025

Uh oh!

mrDzurb Aug 10, 2025

Uh oh!

darenr left a comment

Uh oh!

Uh oh!

github-actions bot commented Aug 10, 2025

Uh oh!

Uh oh!

		@@ -997,6 +997,45 @@ def get_container_params_type(container_type_name: str) -> str:
		return UNKNOWN


		@lru_cache(maxsize=None)

[AQUA][GPT-OSS] Add Shape-Specific Env Config for GPT-OSS Models in AQUA Deployment Config Reader #1244

[AQUA][GPT-OSS] Add Shape-Specific Env Config for GPT-OSS Models in AQUA Deployment Config Reader #1244

Uh oh!

Conversation

mrDzurb commented Aug 10, 2025

Description:

Uh oh!

github-actions bot commented Aug 10, 2025

Uh oh!

github-actions bot commented Aug 10, 2025

Uh oh!

mayoor Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

mrDzurb Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

darenr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Aug 10, 2025

Uh oh!

Uh oh!