Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Re-enables auto benchmarking / acceptance testing #78

Merged
merged 31 commits into from
Dec 19, 2024
Merged
Show file tree
Hide file tree
Changes from 25 commits
Commits
Show all changes
31 commits
Select commit Hold shift + click to select a range
32c1e29
Adding basic acceptance testing script
merschformann Dec 11, 2024
6312791
Preparing test entrypoint for benchmarking (app.yaml, input embedded …
merschformann Dec 15, 2024
937be91
Completing auto benchmark
merschformann Dec 15, 2024
5c6ebfa
Fixing version tag
merschformann Dec 16, 2024
ca1ce8e
Fixing version test
merschformann Dec 16, 2024
df58314
Cleaning code
merschformann Dec 16, 2024
1adc5f9
Fixing version test
merschformann Dec 16, 2024
f2fab6a
Reorganizing prints
merschformann Dec 16, 2024
dca29b0
Adding required name parameter
merschformann Dec 16, 2024
46f0463
Adding missing param
merschformann Dec 16, 2024
6e366f1
Removing obsolete print
merschformann Dec 16, 2024
8937c46
Merge remote-tracking branch 'origin/develop' into merschformann/reen…
merschformann Dec 16, 2024
183e41a
Adding auto benchmark workflow
merschformann Dec 16, 2024
47091dd
Adding branch name for pacing slack notifications
merschformann Dec 16, 2024
738a6c2
Pass secrets
merschformann Dec 16, 2024
71d1a2b
Using output summary, improved slack message
merschformann Dec 17, 2024
c68feb2
More specific slack message
merschformann Dec 17, 2024
dd6b523
Bump nextmv version
merschformann Dec 17, 2024
a5fe09b
Fixing api key
merschformann Dec 17, 2024
5bc5cad
Forward account id
merschformann Dec 17, 2024
92bc042
Moving account id to env vars
merschformann Dec 17, 2024
bd8cf40
Renaming job
merschformann Dec 17, 2024
037545b
Fixing use of variable
merschformann Dec 17, 2024
3c4a84a
Use official nextmv-py release
merschformann Dec 18, 2024
6990c5a
Upgrade baseline when running on main branch
merschformann Dec 18, 2024
87984e5
Move env var to step that needs it
merschformann Dec 18, 2024
2256778
Moving some env vars to GH env section
merschformann Dec 19, 2024
d54638f
Using shifted geometric mean instead
merschformann Dec 19, 2024
38a70a5
Fixing id and tag handling
merschformann Dec 19, 2024
b673876
Pre-output url
merschformann Dec 19, 2024
632b196
Flushing url
merschformann Dec 19, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
37 changes: 37 additions & 0 deletions .github/workflows/auto-benchmark.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
name: auto benchmark
on: [push]

env:
GO_VERSION: 1.23
PYTHON_VERSION: 3.12

jobs:
auto-benchmark:
runs-on: ubuntu-latest
env:
BENCHMARK_ACCOUNT_ID: ${{ vars.BENCHMARK_ACCOUNT_ID }}
dirkschumacher marked this conversation as resolved.
Show resolved Hide resolved
steps:
- name: git clone
uses: actions/checkout@v4

- name: set up Go
uses: actions/setup-go@v5
with:
go-version: ${{ env.GO_VERSION }}

- name: set up Python ${{ env.PYTHON_VERSION }}
uses: actions/setup-python@v5
with:
python-version: ${{ env.PYTHON_VERSION }}

- name: install dependencies
run: |
python -m pip install --upgrade pip
pip install -r .nextmv/benchmark.requirements.txt

- name: run acceptance test
run: |
export BRANCH_NAME=$(echo $GITHUB_REF | awk -F'/' '{print $3}')
export BENCHMARK_API_KEY_PROD=${{ secrets.BENCHMARK_API_KEY_PROD }}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you use the env feature of the GH workflow for BENCHMARK_API_KEY_PROD and SLACK_URL_DEV_SCIENCE?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What do you mean? 🤔
This is necessary to actually surface the secret as an environment variable in the bash sense. env of Github is used to pass environment variables between steps and/or jobs.

I don't think you need to worry about the secrets leaking this way. As far as I can tell this is the intended way of things to work. Github removes the secrets from the log output (see image).

image

I can rewrite the script to accept these as arguments via argparse (bypassing environment variables), but the call to it would still be logged like this, I think:

python .nextmv/benchmark.py --api-key "***" --slack-url "***"

cc @dirkschumacher

Copy link
Contributor

@dirkschumacher dirkschumacher Dec 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No my comment just referred to using the yaml env key within a step instead of export. I thought that would be the same. But no strong opinion, export obviously also works

e.g.

        env:
          BENCHMARK_ACCOUNT_ID: ${{ vars.BENCHMARK_ACCOUNT_ID }}
          BENCHMARK_API_KEY_PROD: ${{ secrets.BENCHMARK_API_KEY_PROD }}

Copy link
Member Author

@merschformann merschformann Dec 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, works for me. Does indeed look a little nicer. 😊

export SLACK_URL_DEV_SCIENCE=${{ secrets.SLACK_URL_DEV_SCIENCE }}
python .nextmv/benchmark.py
217 changes: 217 additions & 0 deletions .nextmv/benchmark.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,217 @@
# Description:
# This script does the following:
# - Make sure the working directory is clean.
# - Pushes a new version of the app (if it does not already exist; uses git sha as version).
# - Updates the candidate instance to use the new version.
# - Runs an acceptance test between the candidate and baseline instances.
# - Waits for the test to complete.
# - Posts the result to Slack (if requested).

import os
import subprocess
from datetime import datetime, timezone

import requests
from nextmv import cloud

APP_ID = "nextroute-bench"
API_KEY = os.environ["BENCHMARK_API_KEY_PROD"]
SLACK_WEBHOOK = os.getenv("SLACK_URL_DEV_SCIENCE", None)
ACCOUNT_ID = os.getenv("BENCHMARK_ACCOUNT_ID", None)
BRANCH_NAME = os.getenv("BRANCH_NAME", None)
Comment on lines +17 to +21
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could have used nextmv.Options for this 👻

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Haha, fair. I suppose I should get used to it more. 😊



METRICS = [
cloud.Metric(
field="result.value",
metric_type=cloud.MetricType.direct_comparison,
params=cloud.MetricParams(
tolerance=cloud.MetricTolerance(
value=0.05,
type=cloud.ToleranceType.relative,
),
operator=cloud.Comparison.less_than_or_equal_to,
),
statistic=cloud.StatisticType.mean,
)
]


def ensure_clean_working_directory():
"""
Ensure the working directory is clean by throwing an exception if it is not.
"""
if os.system("git diff --quiet") != 0 or os.system("git diff --cached --quiet") != 0:
raise Exception("Working directory is not clean")


def get_tag(app: cloud.Application) -> str:
"""
Get the tag for the new version.
If the version already exists, we append a timestamp to the tag.
"""
# If we don't have a tag, we generate one based on the git sha and a timestamp
git_sha = subprocess.check_output(["git", "rev-parse", "HEAD"]).decode().strip()[0:8]
version_id = f"auto-{git_sha}"
# If the version already exists, we append a timestamp to the tag
exists = False
try:
app.version(version_id)
exists = True
except Exception:
pass
if exists:
ts = (
datetime.now(timezone.utc)
.replace(microsecond=0)
.isoformat()
.replace("+00:00", "Z")
.replace(":", "")
.replace("-", "")
)
version_id = f"{version_id}-{ts}"
# Otherwise, we just use the git sha
return version_id


def push_new_version(app: cloud.Application, tag: str) -> None:
"""
Push a new version of the app and update the candidate instance to use it.
"""
app.push(app_dir=".")
app.new_version(
id=tag,
name=f"Auto version {tag}",
description=f"Automatically generated version {tag}",
)
instance = app.instance("candidate")
app.update_instance(
id="candidate",
version_id=tag,
name=instance.name, # Name is required, but we don't want to change it
)


def upgrade_baseline(app: cloud.Application, version_id: str) -> None:
"""
Upgrade the baseline instance to use the new version.
"""
instance = app.instance("baseline")
app.update_instance(
id="baseline",
version_id=version_id,
name=instance.name, # Name is required, but we don't want to change it
)


def run_acceptance_test(
app: cloud.Application,
id: str,
tag: str,
) -> cloud.AcceptanceTest:
"""
Run an acceptance test between the candidate and baseline instances.
"""
result = app.new_acceptance_test_with_result(
candidate_instance_id="candidate",
baseline_instance_id="baseline",
id=id,
metrics=METRICS,
name=f"Auto-test {tag}",
description=f"Automated test for {tag}",
input_set_id="nextroute-bench-v20",
polling_options=cloud.PollingOptions(
max_duration=600, # 10 minutes
max_tries=1000, # basically forever - we'll stop by duration
),
)
return result


def create_test_url(result_id: str) -> str:
"""
Create a URL to the acceptance test result.
"""
if ACCOUNT_ID:
return f"https://cloud.nextmv.io/acc/{ACCOUNT_ID}/app/nextroute-bench/experiments/acceptance/{result_id}"
return "unavailable"


def write_to_summary(content):
"""Appends content to the GitHub Actions step summary (if available)."""
summary_file = os.getenv("GITHUB_STEP_SUMMARY")
if not summary_file:
return

# Write content to the summary file
with open(summary_file, "a") as f:
f.write(content + "\n")


def main():
"""
Main function that runs the benchmark.
"""
# Change to the directory of the app (sibling directory of this script)
os.chdir(os.path.join(os.path.dirname(__file__), "..", "cmd"))

print("Making sure the working directory is clean...")
ensure_clean_working_directory()

client = cloud.Client(api_key=API_KEY)
app = cloud.Application(client=client, id=APP_ID)

tag = get_tag(app)

print(f"Pushing new version with tag: {tag}")
push_new_version(app, tag)

id = f"auto-{tag}"
write_to_summary("# Acceptance Test Report")
write_to_summary("")
write_to_summary(f"ID: {id}")
write_to_summary(f"Link: [link]({create_test_url(id)})")

print(f"Running acceptance test with ID: {id}")
print("Waiting for it to complete...")
result = run_acceptance_test(app, id, tag)
passed = "unknown"
if result and result.results:
passed = "passed" if result.results.passed else "failed"
print(f"Acceptance test completed with status: {passed}")

if SLACK_WEBHOOK and BRANCH_NAME == "develop":
print("Posting to Slack...")
response = requests.post(
SLACK_WEBHOOK,
json={
"text": f"nextroute acceptance test {result and result.id} completed with status: {passed}"
+ f" (<{create_test_url(result and result.id)}|View results>)",
},
)

if response.status_code != 200:
print(f"Failed to send notification to Slack: {response.text}")
else:
print("Notification sent to Slack")

write_to_summary("")
write_to_summary(f"Result: {passed}")
if result and result.results:
if result.results.error:
write_to_summary(f"Error: {result.results.error}")
else:
write_to_summary("Metrics:")
write_to_summary("")
for metric in result.results.metric_results:
write_to_summary(f"- {metric.metric.field}: {metric.passed}")

if BRANCH_NAME == "develop":
print("Upgrading baseline instance to use the new version...")
upgrade_baseline(app, tag)

print("Done")


if __name__ == "__main__":
main()
2 changes: 2 additions & 0 deletions .nextmv/benchmark.requirements.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
nextmv>=v0.14.2
requests>=2.32.3
10 changes: 10 additions & 0 deletions cmd/app.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
# This manifest holds the information the app needs to run on the Nextmv Cloud.
type: go
runtime: ghcr.io/nextmv-io/runtime/default:latest
build:
command: go build -o main .
environment:
GOOS: linux
GOARCH: arm64
files:
- main
41 changes: 41 additions & 0 deletions cmd/main.go
Original file line number Diff line number Diff line change
Expand Up @@ -5,10 +5,12 @@ package main

import (
"context"
"encoding/json"
"fmt"
"log"
"os"
"strings"
"time"

"github.com/nextmv-io/nextroute"
"github.com/nextmv-io/nextroute/check"
Expand Down Expand Up @@ -39,31 +41,70 @@ type options struct {
Check check.Options `json:"check,omitempty"`
}

type customOptions struct {
MaxDuration *float64 `json:"max_duration,omitempty"`
}

// applyCustomOptions applies the extended custom options from the input to the
// actual options.
func applyCustomOptions(opts options, customOpts any) (options, error) {
jOpts, err := json.Marshal(customOpts)
if err != nil {
return opts, err
}
var custom customOptions
err = json.Unmarshal(jOpts, &custom)
if err != nil {
return opts, err
}
if custom.MaxDuration != nil {
opts.Solve.Duration = time.Duration(*custom.MaxDuration * float64(time.Second))
}
return opts, nil
}

func solver(
ctx context.Context,
input schema.Input,
options options,
) (runSchema.Output, error) {
// Apply input embedded options, if any. This is used internally for
// benchmarking and testing.
if input.Options != nil {
opts, err := applyCustomOptions(options, input.Options)
if err != nil {
return runSchema.Output{}, err
}
options = opts
}

// Create the model from the input and options.
model, err := factory.NewModel(input, options.Model)
if err != nil {
return runSchema.Output{}, err
}

// Create the solver from the model.
solver, err := nextroute.NewParallelSolver(model)
if err != nil {
return runSchema.Output{}, err
}

// Solve the model.
solutions, err := solver.Solve(ctx, options.Solve)
if err != nil {
return runSchema.Output{}, err
}

// Get the last solution.
// This call is blocking until the solver terminates. Alternatively,
// solutions can be ranged over (see All() method).
last, err := solutions.Last()
if err != nil {
return runSchema.Output{}, err
}

// Process the solution for output.
output, err := check.Format(
ctx,
options,
Expand Down
Loading