Skip to content

Commit

Permalink
Merge pull request #395 from cassiekays/azureOAI-pattern
Browse files Browse the repository at this point in the history
changed two AOAI alerts to visible with the proper thresholds
  • Loading branch information
JoeyBarnes authored Oct 31, 2024
2 parents 8429a63 + c77f342 commit 33a4e54
Showing 1 changed file with 13 additions and 13 deletions.
26 changes: 13 additions & 13 deletions services/CognitiveServices/accounts/alerts.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -308,19 +308,19 @@
- name: AzureOpenAIContextTokensCacheMatchRate
description: Percentage of the prompt tokens hit the cache, avaiable for PTU-managed.
type: Metric
verified: false
visible: false
verified: true
visible: true
tags: manual-ck
properties:
metricName: AzureOpenAIContextTokensCacheMatchRate
metricNamespace: Microsoft.CognitiveServices/accounts
severity: 0
severity: 2
windowSize: PT5M
evaluationFrequency: PT1M
timeAggregation: Total
operator: GreaterThan
criterionType: StaticThresholdCriterion
threshold: 0.0
threshold: 75
guid: 81f8369c-65bf-4194-bfd2-ffdfa2470577
- name: AzureOpenAIProvisionedManagedUtilization
description: Utilization % for a provisoned-managed deployment, calculated as (PTUs consumed / PTUs deployed) x 100.
Expand All @@ -342,19 +342,19 @@
- name: AzureOpenAIProvisionedManagedUtilizationV2
description: Utilization % for a provisoned-managed deployment, calculated as (PTUs consumed / PTUs deployed) x 100.
type: Metric
verified: false
visible: false
verified: true
visible: true
tags: manual-ck
properties:
metricName: AzureOpenAIProvisionedManagedUtilizationV2
metricNamespace: Microsoft.CognitiveServices/accounts
severity: 0
severity: 2
windowSize: PT5M
evaluationFrequency: PT1M
timeAggregation: Total
operator: GreaterThan
criterionType: StaticThresholdCriterion
threshold: 0.0
threshold: 80
guid: 693a3b37-1e2a-42d1-aaed-b1f374276d1c
- name: AzureOpenAIRequests
description: Number of calls made to the Azure OpenAI API over a period of time.
Expand All @@ -374,21 +374,21 @@
threshold: 0.0
guid: a1528d17-f288-46b1-b084-8b8fe3af90fa
- name: AzureOpenAITimeToResponse
description: Recommended latency (responsiveness) measure for streaming requests.
description: Recommended latency (responsiveness) measure for streaming requests. Time in milliseconds.
type: Metric
verified: false
visible: false
verified: true
visible: true
tags: manual-ck
properties:
metricName: AzureOpenAITimeToResponse
metricNamespace: Microsoft.CognitiveServices/accounts
severity: 0
severity: 2
windowSize: PT5M
evaluationFrequency: PT1M
timeAggregation: Total
operator: GreaterThan
criterionType: StaticThresholdCriterion
threshold: 0.0
threshold: 200
guid: 995cc12a-1887-4669-92c5-70a6ca8bfe70
- name: BaselineEstimatorOverallReward
description: Baseline Estimator Overall Reward.
Expand Down

0 comments on commit 33a4e54

Please sign in to comment.