forked from GoogleCloudPlatform/ai-on-gke
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Jetstream Autoscaling Guide (GoogleCloudPlatform#703)
* first commit * missing files * various improvements * some autoscaling changes for testing * add targetlabels to podmonitoring * Revert repo pinning * more reversions * more reversions * cleanup * more cleanup * Added to README * revert topology change * tweaks to deployment * HPA terraform fixes * remove stray comment * Add more to README * parameterize metrics scrape port * Cleaned up readme * readme tweak * typo * remove indentation * newline * More updates to readme * change wording * Update metrics scrape example * remove annotation * terraform format * missing comma * maxengine-server in terraform * wording * terraform fmt * parameterize container images * wording * remove ksa var * move deployment to kubectl directory * App -> app * pipe from maxengine module to main * Update tutorials-and-examples/inference-servers/jetstream/maxtext/single-host-inference/README.md Co-authored-by: RupengLiu <[email protected]> * remove TODO * HPA can now scale with HBM --------- Co-authored-by: RupengLiu <[email protected]>
- Loading branch information
Showing
16 changed files
with
848 additions
and
10 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -35,3 +35,4 @@ default.tfstate.backup | |
terraform.tfstate* | ||
terraform.tfvars | ||
tfplan | ||
.vscode/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
26 changes: 26 additions & 0 deletions
26
...xt/single-host-inference/terraform/custom-metrics-stackdriver-adapter/README.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,26 @@ | ||
# Custom Metrics Stackdriver Adapter | ||
|
||
Adapted from https://raw.githubusercontent.com/GoogleCloudPlatform/k8s-stackdriver/master/custom-metrics-stackdriver-adapter/deploy/production/adapter_new_resource_model.yaml | ||
|
||
## Usage | ||
|
||
To use this module, include it from your main terraform config, i.e.: | ||
|
||
``` | ||
module "custom_metrics_stackdriver_adapter" { | ||
source = "./path/to/custom-metrics-stackdriver-adapter" | ||
} | ||
``` | ||
|
||
For a workload identity enabled cluster, some additional configuration is | ||
needed: | ||
|
||
``` | ||
module "custom_metrics_stackdriver_adapter" { | ||
source = "./path/to/custom-metrics-stackdriver-adapter" | ||
workload_identity = { | ||
enabled = true | ||
project_id = "<PROJECT_ID>" | ||
} | ||
} | ||
``` |
Oops, something went wrong.