Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cert Renewal script #14667

Closed
wants to merge 773 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
773 commits
Select commit Hold shift + click to select a range
33b8a7b
[copy] fix the TimeoutError and ServerDisconnected issues in copy
May 6, 2022
ab66960
remove pylints
May 11, 2022
af2a9d5
more pylints
May 11, 2022
ad1fc0e
Merge pull request #195 from populationgenomics/merge-fix-gcs-copy
vladsavelyev May 13, 2022
c45d0d0
Merge commit '513139587f578575d5251ac0a16efa95b0f71321' into upstream…
vladsavelyev May 17, 2022
8305632
Merge pull request #196 from populationgenomics/upstream-main-0.2.95
vladsavelyev May 17, 2022
068f803
Allow specifying the pool name explicitly
lgruen May 26, 2022
d328bcf
Add SQL for adding seqr-specific pools
lgruen May 26, 2022
ecfc93e
Allow pool name prefix
lgruen May 26, 2022
b3564a2
Use pool labels
lgruen May 26, 2022
98bcfd1
Fix label config
lgruen May 26, 2022
1fd69a0
Non-null pool labels
lgruen May 26, 2022
84da5bc
Fix post label
lgruen May 26, 2022
0dd5d57
Add label as allowed resource key
lgruen May 26, 2022
ac1e946
label --> pool_label
lgruen May 26, 2022
6b70db7
Merge pull request #197 from populationgenomics/pool-name
lgruen May 31, 2022
13c377b
Use spot machines on GCP
lgruen Jun 1, 2022
c08b295
Sort
lgruen Jun 1, 2022
552189c
Merge pull request #198 from populationgenomics/spot
vladsavelyev Jun 1, 2022
1f3d93c
Revert "Use Spot VMs on GCP" (#199)
lgruen Jun 1, 2022
d4bd6a2
Second attempt at enabling Spot VMs on GCP (#200)
lgruen Jun 2, 2022
98e11c7
Merge branch 'main' of https://github.com/hail-is/hail into upstream
illusional Jun 29, 2022
1a26fb4
Fix order of sql migrations
illusional Jun 29, 2022
51b96ae
Merge pull request #201 from populationgenomics/upstream
illusional Jun 29, 2022
6b3a9bd
Put deployment-specific files into $ORGANIZATION_DOMAIN subdirectory …
lgruen Jul 4, 2022
cdb1cbf
Merge commit 'c9e540b7988a7f575a5daf59b0e24b4fe506f146' into upstream
illusional Jul 11, 2022
da3f128
fix type in _zip_func (#12020) (#206)
lgruen Jul 12, 2022
9c8c650
Merge pull request #205 from populationgenomics/upstream
illusional Jul 12, 2022
da04d6b
Fix autoscaling policy (#207)
lgruen Jul 13, 2022
f19bf65
Use GitHub organization instead of organization domain for credential…
lgruen Jul 13, 2022
d2c0f68
Define "scope" in auth template (#208)
lgruen Jul 13, 2022
583e321
Add resource_limits to enable new autoscaling_policy (#209)
lgruen Jul 13, 2022
a85b403
Remove subdomains for services that we don't deploy (#210)
lgruen Jul 20, 2022
c4eb2b1
Add pool_recycle parameter to database creation to avoid stale connec…
lgruen Jul 21, 2022
5a00a00
Merge commit '282fecf' into db-migrations
illusional Aug 2, 2022
f735118
Merge pull request #213 from populationgenomics/db-migrations
illusional Aug 2, 2022
a37753e
Change default batch search to all relevant (#212)
illusional Aug 2, 2022
811f52c
Merge commit 'db0a5f5' into pool-ids
illusional Aug 2, 2022
b28ed8c
Merge pull request #214 from populationgenomics/pool-ids
illusional Aug 2, 2022
b2745a7
FIX: proper default return
violetbrina Aug 12, 2022
10d4386
FIX: formatter changes
violetbrina Aug 12, 2022
cac7ac4
Merge pull request #216 from populationgenomics/query_batch_jobs_for_…
violetbrina Aug 12, 2022
7f6c7a7
Allow overriding the temp bucket for Dataproc (#217)
lgruen Aug 22, 2022
aac31f4
Cherry-pick GKE autoprovisioning change (#218)
lgruen Aug 24, 2022
694134b
Merge commit 'a885cf9' into v0.2.98-upstream1
illusional Aug 31, 2022
4375206
Merge pull request #220 from populationgenomics/v0.2.98-upstream1
illusional Aug 31, 2022
e496355
Merge remote-tracking branch 'upstream/main' into v0.2.98-upstream1
illusional Aug 31, 2022
48ff6e3
Merge pull request #221 from populationgenomics/v0.2.98-upstream2
illusional Sep 1, 2022
b759201
Merge remote-tracking branch 'upstream/main' into v0.2.98-upstream3
vladsavelyev Sep 13, 2022
49c4899
Merge pull request #222 from populationgenomics/v0.2.98-upstream3
vladsavelyev Sep 14, 2022
10add1e
Merge remote-tracking branch 'upstream/main' into upstream_0.2.99
vladsavelyev Sep 21, 2022
064456e
Merge pull request #223 from populationgenomics/upstream_0.2.99
vladsavelyev Sep 21, 2022
21b5aac
Merge commit '3f31b55' into upstream-1
illusional Sep 28, 2022
3c77829
Merge commit 'bd7951e' into upstream-2
illusional Sep 28, 2022
e458072
Merge commit '2ea2615a797a5aff72d20d9d12a2609342846a07' into upstream-3
illusional Sep 28, 2022
c77d293
Merge branch 'main' into upstream-1
illusional Sep 28, 2022
a08c103
Merge pull request #224 from populationgenomics/upstream-1
illusional Sep 28, 2022
7e7c784
Remove rest_authenticated_developers_only (#227)
illusional Sep 28, 2022
e865bfb
Merge pull request #225 from populationgenomics/upstream-2
illusional Sep 28, 2022
ac1d122
Merge pull request #226 from populationgenomics/upstream-3
illusional Sep 28, 2022
97d07ea
Allow finer granularity for JVM memory (#228)
lgruen Sep 29, 2022
04a1e5d
Prepare JVMs with various core numbers (#229)
lgruen Sep 29, 2022
b8adfff
Fix JVM index (#230)
lgruen Sep 29, 2022
d4c40c5
Merge commit 'bfb02c3' into upstream1
illusional Oct 5, 2022
16cf271
Merge commit 'd6838fcc9f596ab59aac0527252dc959c23c9866' into upstream…
illusional Oct 5, 2022
f7a5b20
Merge pull request #231 from populationgenomics/upstream1
illusional Oct 5, 2022
389a9a1
Merge pull request #232 from populationgenomics/upstream1-0.2.101-rel…
illusional Oct 5, 2022
83e72cc
Merge branch 'main' of github.com:hail-is/hail into upstream-102
illusional Oct 6, 2022
e84f62b
Merge pull request #233 from populationgenomics/upstream-102
illusional Oct 6, 2022
f279eb4
Merge remote-tracking branch 'upstream/main' into fix12281
lgruen Oct 7, 2022
71b268d
Merge pull request #234 from populationgenomics/fix12281
illusional Oct 7, 2022
7921901
Add debugging compiler stack trace for Tim Poterba (#235)
lgruen Oct 9, 2022
a2d4e62
Fix Terraform
lgruen Oct 10, 2022
e2c36e0
Merge pull request #236 from populationgenomics/terraform-project
vladsavelyev Oct 10, 2022
5a05029
Sync Terraform with upstream (#237)
lgruen Oct 10, 2022
727a75b
Merge remote-tracking branch 'upstream/main' into fix12305
lgruen Oct 12, 2022
2dbba62
Merge pull request #238 from populationgenomics/fix12305
lgruen Oct 12, 2022
eb2cbba
More debugging instrumentation for Tim Poterba (#239)
lgruen Oct 13, 2022
0c057d6
Take 2 for instrumentation patch for Tim Poterba (#240)
lgruen Oct 13, 2022
a35c7c8
HOOK: add ci hook to deploy azure as well as gcp hail batch
violetbrina Oct 14, 2022
a327e70
FIX: workflow hook fixes
violetbrina Oct 14, 2022
fed92d4
FIX: update slack notification for prod_deploy
violetbrina Oct 14, 2022
800ed43
FIX: add stages to Azure prod_deploy
violetbrina Oct 14, 2022
3c66763
Better QoB error propagation (patch from Tim Poterba) (#242)
lgruen Oct 16, 2022
66ccd6f
Update .github/workflows/prod_deploy.yaml
violetbrina Oct 17, 2022
68eb66b
Update .github/workflows/prod_deploy.yaml
violetbrina Oct 17, 2022
510bd85
Update .github/workflows/prod_deploy.yaml
violetbrina Oct 17, 2022
5a2758a
[fs] Fix incorrect seeking semantics of GoogleStorageFS (#244)
lgruen Nov 20, 2022
df89a3d
Revert "[fs] Fix incorrect seeking semantics of GoogleStorageFS (#244…
lgruen Nov 20, 2022
1878c47
[fs] Fix incorrect seeking semantics of GoogleStorageFS (#246)
lgruen Nov 20, 2022
1fb241e
Revert "Better QoB error propagation (patch from Tim Poterba) (#242)"
lgruen Nov 22, 2022
7cbfd74
Revert "Take 2 for instrumentation patch for Tim Poterba (#240)"
lgruen Nov 22, 2022
06d31eb
Revert "More debugging instrumentation for Tim Poterba (#239)"
lgruen Nov 22, 2022
ef39fd2
Revert "Add debugging compiler stack trace for Tim Poterba (#235)"
lgruen Nov 22, 2022
6e659a3
Merge pull request #247 from populationgenomics/revert-instrumentation
illusional Nov 22, 2022
d6e54b7
Merge branch 'main' of github.com:hail-is/hail into upstream-105-mid
illusional Nov 22, 2022
e9c22e9
Fix compilation error
lgruen Nov 23, 2022
a6b90be
[hailctl] dataproc --pyfiles: include non-py files
vladsavelyev Nov 23, 2022
65adf03
Merge pull request #249 from populationgenomics/dataproc-add-nonpy-files
vladsavelyev Nov 23, 2022
77f1c58
Merge pull request #248 from populationgenomics/upstream-105-mid
illusional Nov 23, 2022
57f1bb3
Merge commit '7070635'
illusional Nov 27, 2022
61400c2
Merge pull request #251 from populationgenomics/nested-batches-mid-merge
illusional Nov 27, 2022
330b52a
Remove test_batch_0 from CI deploy steps (#252)
lgruen Nov 27, 2022
0a44289
Add db to start_deploy args for prod deploy (#253)
illusional Nov 28, 2022
3f9f441
Proper fix
vladsavelyev Nov 28, 2022
5ed7ce7
Merge pull request #250 from populationgenomics/fix-cluster-start
vladsavelyev Nov 28, 2022
be337d4
Fix (#254)
vladsavelyev Nov 29, 2022
61993f3
Deploy the hailtop image (#255)
lgruen Nov 30, 2022
f6126c4
[dataproc] fix pyfiles (#256)
vladsavelyev Dec 1, 2022
9c20c9b
Merge commit 'a6c75d6' into upstream-106
illusional Dec 14, 2022
06959b2
Merge remote-tracking branch 'upstream/main' into upstream-106
illusional Dec 14, 2022
9ad93a0
Merge commit '2387bb0' into upstream-106
illusional Dec 14, 2022
fe50208
Colors (#258)
lgruen Dec 16, 2022
e48bcf2
Merge pull request #257 from populationgenomics/upstream-106
lgruen Dec 16, 2022
dfa48ba
Revert "Colors (#258)"
lgruen Jan 16, 2023
a354807
Revert "[batch][UI] revert table row background colors to gray/white …
lgruen Jan 16, 2023
d891f31
Merge pull request #259 from populationgenomics/rainbow
lgruen Jan 16, 2023
8b14d78
Fix broken merge from https://github.com/hail-is/hail/pull/12521 (dup…
lgruen Jan 17, 2023
35dc394
Merge tag '0.2.108' into 0.2.108
lgruen Jan 17, 2023
d4685c5
Merge pull request #261 from populationgenomics/upstream-0.2.108
lgruen Jan 17, 2023
fd35e68
Merge pull request #241 from populationgenomics/azure-ci-hook
violetbrina Jan 19, 2023
72f3064
Restore printing of child batch URLs (#262)
lgruen Jan 20, 2023
4a4f6ca
Conditional on not wait (#263)
lgruen Jan 22, 2023
e2df775
Merge commit 'b71b065' into upstream
illusional Feb 10, 2023
0133915
Merge pull request #265 from populationgenomics/upstream-109
illusional Feb 10, 2023
cdc3f50
>= is not compatible with wildcards (#266)
lgruen Feb 10, 2023
842da3e
[query] fix NPE introduced in #12555 (#12711) (#267)
lgruen Feb 18, 2023
d312247
Add cost to job resource and upgrade to aggregated_resources_v2 (#268)
illusional Feb 21, 2023
168ae9a
add resize to cmd pre (#270)
cassimons Mar 1, 2023
601daf5
Merge commit '9e0081c' into upstream-2023-03-01_1
illusional Mar 1, 2023
b8b6004
Merge pull request #271 from populationgenomics/upstream-2023-03-01_1
illusional Mar 1, 2023
2380312
Azure 02-03-2023 (#273)
violetbrina Mar 2, 2023
87680e9
Remove unused secrets that prevent a dev deploy (#274)
lgruen Mar 2, 2023
abd4476
Add endpoint that fetches completed batches in order (#275)
illusional Mar 5, 2023
cc0748c
[batch] downgrade gcsfuse to avoid gcsfuse bug (#12749) (#276)
illusional Mar 7, 2023
ed74164
Merge commit '58b52' into resource-migration-58b52
illusional Mar 9, 2023
638b681
Merge commit 'c7f903c' into upstream-110
illusional Mar 9, 2023
2d4b679
Merge pull request #277 from populationgenomics/resource-migration-58b52
illusional Mar 9, 2023
02ecbc4
Merge pull request #278 from populationgenomics/upstream-110
illusional Mar 9, 2023
c19a539
Revert "[query] MatrixBlockMatrixWriter can be lowered (#12695)" (#279)
lgruen Mar 13, 2023
d634f34
FIX: missing comma, SQL syntax compromised causing 500 internal serve…
violetbrina Mar 16, 2023
39116b8
Don't write composite object for wheels (#281)
lgruen Apr 4, 2023
49c179c
Merge commit 'cf32652c50771e7f81b1bfa618f6e3f399cd238d' into upstream…
illusional Apr 12, 2023
2068ccc
Linting
illusional Apr 12, 2023
844f6ee
Rename missed hail_pip_install_image -> hailgenetics_hail_image
illusional Apr 12, 2023
4c355ce
Add quick links to job types in batch (#272)
cassimons Apr 18, 2023
f5875e2
Merge pull request #282 from populationgenomics/upstream-113
illusional Apr 18, 2023
186b510
Use image with gcloud for deploy_wheel step (#283)
lgruen Apr 18, 2023
0b5bc2e
Fix deploy_wheel dependencies (#284)
lgruen Apr 18, 2023
f727962
Catch deploy errors (#269)
illusional Apr 18, 2023
7a1302f
Fix fail handling (#285)
lgruen Apr 19, 2023
725c21f
Merge tag '0.2.114' into upstream-0.2.114
lgruen Apr 20, 2023
e7facb7
Merge pull request #286 from populationgenomics/upstream-0.2.114
lgruen Apr 21, 2023
18a9b5d
Merge tag '0.2.115' into upstream-0.2.115
lgruen Apr 26, 2023
71fc978
Merge pull request #287 from populationgenomics/upstream-0.2.115
lgruen May 2, 2023
5859489
[services] indicate how many errors we have seen
May 4, 2023
fe001cd
Merge pull request #288 from populationgenomics/retry-info
lgruen May 4, 2023
cfc5628
Merge remote-tracking branch 'upstream/main' into upstream-0.2.115-in…
lgruen May 5, 2023
fc212f5
Merge pull request #289 from populationgenomics/upstream-0.2.115-inte…
lgruen May 5, 2023
66dcd7d
Revert "[services] indicate how many errors we have seen"
lgruen May 7, 2023
1a34aeb
Merge commit 'cd64e08' into hail-117-surgery
illusional Jun 13, 2023
e758b32
Allow optional byte suffix for memory and storage requests (#293)
lgruen May 9, 2023
2d619ed
FIX: storage accounts weren't unique. Added prefix. Fixed zone bug in…
violetbrina May 16, 2023
0525d32
Merge commit '113b989' into hail-117-surgery
illusional Jun 13, 2023
c5093fc
Merge commit '2a7ef11' into hail-117-surgery-ii
illusional Jun 13, 2023
71d6bd4
Merge commit 'a4ca239' into hail-118
illusional Jun 13, 2023
10eb0d5
Merge pull request #298 from populationgenomics/hail-117-surgery-ii
illusional Jun 14, 2023
5ac6288
Merge pull request #299 from populationgenomics/hail-118
illusional Jun 14, 2023
d675979
Fix bad "just-wheel" merge (#300)
illusional Jun 14, 2023
abaf45c
Fix renamed hail_jar build dependency (#301)
illusional Jun 14, 2023
31ca3e8
Merge commit 'c9db002' into sept-23-upstream-1-c9db002
illusional Sep 7, 2023
8101915
Merge commit '31a0889' into sept-23-upstream-2-31a0889
illusional Sep 7, 2023
e43761b
Merge commit '1c28203' into sept-23-upstream-3-1c28203
illusional Sep 7, 2023
b015fef
Merge commit 'dc1f086' into sept-23-upstream-4-dc1f086
illusional Sep 8, 2023
f11ef53
Merge commit 'be9d88a' into sept-23-upstream-5-0.2.122
illusional Sep 8, 2023
b4de354
Restore two blank lines between top-level functions
jmarshall Sep 8, 2023
f76da5a
Fix dropped change
illusional Sep 12, 2023
a56649f
TF: Replace google_storage_bucket. with module.
illusional Sep 12, 2023
ba7792d
Address review feedback for bad merge
illusional Sep 18, 2023
a1f10dd
Merge pull request #303 from populationgenomics/sept-23-upstream-2-31…
illusional Sep 19, 2023
eb6b29c
Merge pull request #302 from populationgenomics/sept-23-upstream-1-c9…
illusional Sep 19, 2023
9ddd340
[k8s] update to avoid unsupported apiVersions (#13277)
danking Jul 21, 2023
2b74e60
Apply review suggestions
illusional Sep 19, 2023
ded82fe
Merge pull request #307 from populationgenomics/sept-23-upstream-2-ho…
illusional Sep 19, 2023
9fc7f9f
Merge pull request #304 from populationgenomics/sept-23-upstream-3-1c…
illusional Sep 19, 2023
daecd7d
Merge pull request #305 from populationgenomics/sept-23-upstream-4-dc…
illusional Sep 19, 2023
e53807d
Merge branch 'main' of github.com:populationgenomics/hail into sept-2…
illusional Sep 19, 2023
1647e93
Remove deploy_memory from deploy steps (#308)
illusional Sep 19, 2023
73e49d6
Merge pull request #306 from populationgenomics/sept-23-upstream-5-0.…
illusional Sep 19, 2023
d4c773d
Pass empty dev list (#309)
illusional Sep 19, 2023
3869022
Fix URLs to filter job states in a batch (#311)
EddieLF Sep 25, 2023
740894b
[batch] Fix job cpu and memory recording (#13626)
daniel-goldstein Sep 15, 2023
59b2283
Merge pull request #312 from populationgenomics/cherry-pick-resource-…
illusional Oct 5, 2023
93cce37
Merge upstream tag 0.2.124 (13536b5) into upstream-0.2.124
jmarshall Oct 5, 2023
607408b
Work around broken gcsfuse apt repository
jmarshall Oct 10, 2023
f1a76d7
Apply upstream authentication decorator renaming to our additions
jmarshall Oct 10, 2023
f67fbbb
Merge upstream HEAD as of 2023-10-11 (d40ac24ec) into upstream-0.2.12…
jmarshall Oct 10, 2023
7fa6e3c
Merge pull request #313 from populationgenomics/upstream-0.2.124
jmarshall Oct 10, 2023
ea31139
Make cost_breakdown optional (#315)
illusional Oct 16, 2023
a8322e3
Allow prod_deploy to be hit by non-developer CI (#316)
illusional Oct 17, 2023
7e15f72
Merge upstream HEAD (3b38d0b, 2023-11-10) for 0.2.126 and OOM fix
jmarshall Nov 9, 2023
f36c781
Merge pull request #317 from populationgenomics/upstream-126+oom
jmarshall Nov 10, 2023
9a7c610
Update CPG-local step after upstream wheel-container.tar refactor (#318)
jmarshall Nov 10, 2023
7b2ecb7
Log whether database migrations have already been applied (#319)
jmarshall Nov 10, 2023
1a26f7a
Update fix-job-groups-state-enum.sql migration for CPG database state…
jmarshall Nov 13, 2023
a00f48f
Reorder to reduce the difference from upstream backend.py (#321)
jmarshall Nov 14, 2023
5c73038
Disable trial_bp to alleviate error during user creation (#322)
illusional Nov 15, 2023
5373bdb
Merge pull request from GHSA-487p-qx68-5vjw
Dec 29, 2023
1f30670
[auth] hd is not present for iam.gserviceaccount.com (#14114)
danking Dec 29, 2023
e5b3147
Merge pull request #323 from populationgenomics/fix-org-claim
illusional Dec 29, 2023
f3d6d7f
Merge upstream HEAD (da6668b, 2024-01-02) for auth fix
jmarshall Jan 3, 2024
ca2046c
Fix cherry-picking accident in recent auth bug fix
jmarshall Jan 3, 2024
0937153
Merge pull request #325 from populationgenomics/fix-auth
illusional Jan 3, 2024
c27a6e7
Merge pull request #324 from populationgenomics/upstream-newyear2024
jmarshall Jan 5, 2024
074bedb
Add hailctl client (#326)
illusional Jan 5, 2024
01714cc
Fix merge accident in recent auth bug fixes (#327)
jmarshall Jan 8, 2024
be23273
Update completed endpoint with new job group format (#328)
illusional Feb 1, 2024
8f6797b
Expose job resource in batch API (#329)
illusional Feb 22, 2024
82e5406
Merge upstream HEAD(b7bde56d5, 2024-05-14) Stop writing to V2 tables
milo-hyben May 14, 2024
7094105
Fixing merged conflicts.
milo-hyben May 14, 2024
6146638
Merge upstream HEAD(e68103e03, 2024-05-14) Remove V2 tables
milo-hyben May 14, 2024
a19a9f1
Fixing merged conflicts.
milo-hyben May 14, 2024
4fe048f
Merge upstream HEAD(dc7fce0e3, 2024-05-14) Use CI's credentials for i…
milo-hyben May 14, 2024
f73747b
Merge upstream HEAD(13de4e6ff, 2024-05-14) Add job groups [migration …
milo-hyben May 14, 2024
a3cbc74
Fixing merged conflicts.
milo-hyben May 14, 2024
f50d5f5
Removing the module ukbb clause
milo-hyben May 20, 2024
b37df1e
Updating sql for func get_completed_batches_ordered_by_completed_time.
milo-hyben May 20, 2024
82b13e5
Merge upstream HEAD(6a6c38d5a, 2024-05-21) Expose HAIL_CI_STORAGE_URI…
milo-hyben May 21, 2024
8dd30c3
Merge upstream HEAD(bea04d9, 2024-05-21) [release] 0.2.130 (#14454)
milo-hyben May 21, 2024
91f99b5
Merge pull request #330 from populationgenomics/upstream-13892
milo-hyben May 23, 2024
1837104
Merge pull request #331 from populationgenomics/upstream-13893
milo-hyben May 23, 2024
4a7910c
Convert our locally added prod_deploy() endpoint to use AppKeys (#336)
jmarshall May 23, 2024
f8a4e39
Allow os.PathLike paths for Batch.read_input()/read_input_group() (#1…
jmarshall May 14, 2024
e864ba6
Merge pull request #332 from populationgenomics/upstream-14301
milo-hyben May 24, 2024
4b8ee7f
Merge pull request #333 from populationgenomics/upstream-14282
milo-hyben May 24, 2024
637f17f
Merge pull request #334 from populationgenomics/upstream-14419
milo-hyben May 24, 2024
6f76e36
Merge pull request #335 from populationgenomics/upstream-14454
milo-hyben May 24, 2024
a9a79d2
Try adding extra route to metadata server (#337)
illusional May 26, 2024
d7191ed
Implement the .../service-accounts/.../scopes endpoint in metadata se…
jmarshall May 27, 2024
4fb0e47
FIX: SQL command update for get_completed_batches_ordered_by_complete…
milo-hyben May 29, 2024
1ced860
One more fix for the SQL, job_groups table have time_completed as wel…
milo-hyben May 29, 2024
eea6227
Update SQL reference to aggregated_job_resources_v2 with aggregated_j…
milo-hyben May 30, 2024
534aa68
Add pool label to "resource requests ... are unsatisfiable" message (…
jmarshall Jun 11, 2024
0e87fe7
Update to this month's base Ubuntu image
jmarshall Jul 4, 2024
4a7881a
Use CI's credentials instead of removed upstream registry-push-creden…
jmarshall Jul 7, 2024
1fe8637
Merge upstream release 0.2.132
jmarshall Jul 10, 2024
d65f84e
Merge pull request #344 from populationgenomics/upstream-0.2.132
jmarshall Jul 11, 2024
b3e6ad8
Fix for /api/v1alpha/batches/completed, picking only ROOT_JOB_GROUP w…
milo-hyben Jul 25, 2024
7f79844
Add support for public-ip-address in dataproc (#346)
illusional Aug 5, 2024
99a0b4e
Added a cert renewal bash script
nevoodoo Aug 22, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 0 additions & 1 deletion .dockerignore
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,6 @@ hail/.bloop/
hail/.gradle/
hail/.idea/
hail/.pytest_cache/
.git/
hail/.ensime.cache.d/
hail/.ensime_cache.d/
hail/.ensime_cache/
Expand Down
43 changes: 43 additions & 0 deletions .github/workflows/prod_deploy.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
name: prod-deploy
on:
push:
branches:
- main
jobs:
invoke-prod-deploy:
runs-on: ubuntu-latest
steps:
- name: deploy to GCP
continue-on-error: true
run: |
DEPLOY_BATCH_URL_GCP=$(curl --fail --silent --show-error -X POST \
-H "Authorization: Bearer ${{ secrets.CI_TOKEN }}" \
-H "Content-Type:application/json" \
-d '{"steps": ["deploy_auth", "deploy_batch", "deploy_ci", "deploy_hailgenetics_image", "deploy_wheel", "upload_query_jar"], "sha": "${{ github.sha }}"}' \
https://ci.hail.populationgenomics.org.au/api/v1alpha/prod_deploy || echo "failed")
echo DEPLOY_BATCH_URL_GCP="$DEPLOY_BATCH_URL_GCP" >> $GITHUB_ENV

- name: deploy to Azure
continue-on-error: true
run: |
DEPLOY_BATCH_URL_AZURE=$(curl --fail --silent --show-error -X POST \
-H "Authorization: Bearer ${{ secrets.CI_TOKEN_AZURE }}" \
-H "Content-Type:application/json" \
-d '{"steps": ["deploy_auth", "deploy_batch", "deploy_ci", "upload_query_jar"], "sha": "${{ github.sha }}"}' \
https://ci.azhail.populationgenomics.org.au/api/v1alpha/prod_deploy || echo "failed")
echo DEPLOY_BATCH_URL_AZURE="$DEPLOY_BATCH_URL_AZURE" >> $GITHUB_ENV

- name: post to Slack
run: |
SLACK_MSG="Deploying Hail Batch:\n*GCP:* $DEPLOY_BATCH_URL_GCP\n*Azure:* $DEPLOY_BATCH_URL_AZURE"
curl --fail --silent --show-error -X POST \
-H "Authorization: Bearer ${{ secrets.SLACK_BOT_TOKEN }}" \
-H "Content-type: application/json" \
-d "{\"channel\": \"production-announcements\", \"text\": \"$SLACK_MSG\"}" \
https://slack.com/api/chat.postMessage

- name: check if any deploy failed
run: |
if [ "$DEPLOY_BATCH_URL_GCP" == "failed" ] || [ "$DEPLOY_BATCH_URL_AZURE" == "failed" ]; then
exit 1
fi
4 changes: 4 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,9 @@ node_modules
GPATH
GRTAGS
GTAGS
*.dylib
*/hail.jar
infra/.terraform.lock.hcl
hail/python/hail/docs/experimental/hail.experimental.DB.rst
hail/python/hailtop/batch/docs/api/
hail/upload-qob-jar
Expand All @@ -46,4 +49,5 @@ wheel-container.tar
hail/python/hail/backend/extra_classpath
hail/python/hail/backend/hail.jar
hail/install-editable
_/
.helix
12 changes: 12 additions & 0 deletions amundsen/Makefile
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
include ../config.mk

PYTHON := PYTHONPATH=$${PYTHONPATH:+$${PYTHONPATH}:}$(EXTRA_PYTHONPATH) python3

JINJA_ENVIRONMENT = '{"code":{"sha":"$(shell git rev-parse --short=12 HEAD)"},"deploy":$(DEPLOY),"default_ns":{"name":"$(NAMESPACE)"},"global":{"docker_prefix":"$(DOCKER_PREFIX)","domain":"$(DOMAIN)","k8s_server_url":"$(KUBERNETES_SERVER_URL)"},"scope":"$(SCOPE)"}'

.PHONY: deploy
deploy:
! [ -z $(NAMESPACE) ] # call this like: make deploy NAMESPACE=default
E=$(JINJA_ENVIRONMENT) && \
python3 ../ci/jinja2_render.py $$E deployment.yaml deployment.yaml.out
kubectl -n $(NAMESPACE) apply -f deployment.yaml.out
81 changes: 81 additions & 0 deletions amundsen/deployment.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,81 @@
---
apiVersion: apps/v1
kind: Deployment
metadata:
name: amundsen-frontend
labels:
app: amundsen-frontend
hail.is/sha: "{{ code.sha }}"
spec:
selector:
matchLabels:
app: amundsen-frontend
replicas: 1
template:
metadata:
labels:
app: amundsen-frontend
hail.is/sha: "{{ code.sha }}"
spec:
{% if deploy %}
priorityClassName: production
nodeSelector:
preemptible: "false"
{% else %}
nodeSelector:
preemptible: "true"
tolerations:
- key: preemptible
value: "true"
{% endif %}
containers:
- name: amundsen-frontend
image: {{ global.docker_prefix }}/amundsendev/amundsen-frontend:2.3.0
imagePullPolicy: Always
resources:
requests:
cpu: "20m"
memory: "20M"
limits:
cpu: "1"
memory: "1G"
ports:
- containerPort: 5000
env:
{% if deploy %}
- name: FRONTEND_BASE
value: https://amundsen-frontend.hail.is
- name: SEARCHSERVICE_BASE
value: https://amundsen-search.hail.is
- name: METADATASERVICE_BASE
value: https://amundsen-metadata.hail.is
{% else %}
- name: FRONTEND_BASE
value: https://internal.hail.is/{{ default_ns.name }}/amundsen-frontend
- name: SEARCHSERVICE_BASE
value: https://internal.hail.is/{{ default_ns.name }}/amundsen-search
- name: METADATASERVICE_BASE
value: https://internal.hail.is/{{ default_ns.name }}/amundsen-metadata
{% endif %}
- name: LONG_RANDOM_STRING
value: 4532y7y2389faehuwfteyw8704y329
command: ["gunicorn"]
args: ['-w', '4', '--bind', ':5000', 'amundsen_application.wsgi']
readinessProbe:
httpGet:
path: "/healthcheck"
port: 5000
initialDelaySeconds: 10
periodSeconds: 60
timeoutSeconds: 1
successThreshold: 1
failureThreshold: 5
livenessProbe:
httpGet:
path: "/healthcheck"
port: 5000
initialDelaySeconds: 10
periodSeconds: 60
timeoutSeconds: 1
successThreshold: 1
failureThreshold: 5
4 changes: 3 additions & 1 deletion auth/auth/driver/driver.py
Original file line number Diff line number Diff line change
Expand Up @@ -442,7 +442,9 @@ async def _create_user(app, user, skip_trial_bp, cleanup):
raise DatabaseConflictError


async def create_user(app, user, skip_trial_bp=False):
# 2023-11-16 mfranklin: disable trial bp because there's an auth problem
# https://hail.zulipchat.com/#narrow/stream/300487-Hail-Batch-Dev/topic/Issue.20creating.20users/near/401890787
async def create_user(app, user, skip_trial_bp=True):
cleanup: List[Callable[[], Awaitable[None]]] = []
try:
await _create_user(app, user, skip_trial_bp, cleanup)
Expand Down
18 changes: 10 additions & 8 deletions batch/batch/batch.py
Original file line number Diff line number Diff line change
Expand Up @@ -48,8 +48,9 @@ def batch_record_to_dict(record: Dict[str, Any]) -> Dict[str, Any]:
duration_ms = None
duration = None

if record['cost_breakdown'] is not None:
record['cost_breakdown'] = cost_breakdown_to_dict(json.loads(record['cost_breakdown']))
cost_breakdown = record.get('cost_breakdown')
if cost_breakdown is not None:
cost_breakdown = cost_breakdown_to_dict(json.loads(cost_breakdown))

batch_response = {
'id': record['id'],
Expand All @@ -70,8 +71,8 @@ def batch_record_to_dict(record: Dict[str, Any]) -> Dict[str, Any]:
'duration_ms': duration_ms,
'duration': duration,
'msec_mcpu': record['msec_mcpu'],
'cost': coalesce(record['cost'], 0),
'cost_breakdown': record['cost_breakdown'],
'cost': coalesce(record.get('cost'), 0),
'cost_breakdown': cost_breakdown,
}

attributes = json.loads(record['attributes'])
Expand Down Expand Up @@ -138,8 +139,9 @@ def job_record_to_dict(record: Dict[str, Any], name: Optional[str]) -> JobListEn
exit_code = None
duration = None

if record['cost_breakdown'] is not None:
record['cost_breakdown'] = cost_breakdown_to_dict(json.loads(record['cost_breakdown']))
cost_breakdown = record.get('cost_breakdown')
if cost_breakdown is not None:
cost_breakdown = cost_breakdown_to_dict(json.loads(cost_breakdown))

return cast(
JobListEntryV1Alpha,
Expand All @@ -152,9 +154,9 @@ def job_record_to_dict(record: Dict[str, Any], name: Optional[str]) -> JobListEn
'state': record['state'],
'exit_code': exit_code,
'duration': duration,
'cost': coalesce(record['cost'], 0),
'cost': coalesce(record.get('cost'), 0),
'msec_mcpu': record['msec_mcpu'],
'cost_breakdown': record['cost_breakdown'],
'cost_breakdown': cost_breakdown,
'always_run': bool(record['always_run']),
'display_state': None,
},
Expand Down
1 change: 1 addition & 0 deletions batch/batch/cloud/gcp/driver/create_instance.py
Original file line number Diff line number Diff line change
Expand Up @@ -311,6 +311,7 @@ def scheduling() -> dict:
-e DOCKER_PREFIX=$DOCKER_PREFIX \
-e DOCKER_ROOT_IMAGE=$DOCKER_ROOT_IMAGE \
-e INSTANCE_CONFIG=$INSTANCE_CONFIG \
-e DOCKER_PREFIX=$DOCKER_PREFIX \
-e MAX_IDLE_TIME_MSECS=$MAX_IDLE_TIME_MSECS \
-e BATCH_WORKER_IMAGE=$BATCH_WORKER_IMAGE \
-e BATCH_WORKER_IMAGE_ID=$BATCH_WORKER_IMAGE_ID \
Expand Down
3 changes: 3 additions & 0 deletions batch/batch/driver/instance_collection/pool.py
Original file line number Diff line number Diff line change
Expand Up @@ -139,6 +139,7 @@ def __init__(
self.worker_max_idle_time_secs = config.worker_max_idle_time_secs
self.job_queue_scheduling_window_secs = config.job_queue_scheduling_window_secs
self.min_instances = config.min_instances
self.label = config.label

self.all_supported_regions = self.inst_coll_manager.regions

Expand Down Expand Up @@ -167,6 +168,7 @@ def config(self):
'standing_worker_max_idle_time_secs': self.standing_worker_max_idle_time_secs,
'worker_max_idle_time_secs': self.worker_max_idle_time_secs,
'job_queue_scheduling_window_secs': self.job_queue_scheduling_window_secs,
'label': self.label,
}

def configure(self, pool_config: PoolConfig):
Expand All @@ -190,6 +192,7 @@ def configure(self, pool_config: PoolConfig):
self.standing_worker_max_idle_time_secs = pool_config.standing_worker_max_idle_time_secs
self.worker_max_idle_time_secs = pool_config.worker_max_idle_time_secs
self.job_queue_scheduling_window_secs = pool_config.job_queue_scheduling_window_secs
self.label = pool_config.label

def adjust_for_remove_instance(self, instance):
super().adjust_for_remove_instance(instance)
Expand Down
3 changes: 3 additions & 0 deletions batch/batch/driver/main.py
Original file line number Diff line number Diff line change
Expand Up @@ -663,6 +663,8 @@ async def pool_config_update(request: web.Request, _) -> NoReturn:
f'a non-negative integer less than or equal to max_live_instances {max_live_instances}',
)

label = post['label']

possible_worker_cores = []
for cores in possible_cores_from_worker_type(pool.cloud, worker_type):
if not worker_local_ssd_data_disk:
Expand Down Expand Up @@ -758,6 +760,7 @@ async def pool_config_update(request: web.Request, _) -> NoReturn:
worker_max_idle_time_secs=worker_max_idle_time_secs,
standing_worker_max_idle_time_secs=standing_worker_max_idle_time_secs,
job_queue_scheduling_window_secs=job_queue_scheduling_window_secs,
label=label,
)

current_client_pool_config = json.loads(str(post['_pool_config_json']))
Expand Down
1 change: 1 addition & 0 deletions batch/batch/driver/templates/pool.html
Original file line number Diff line number Diff line change
Expand Up @@ -29,6 +29,7 @@ <h2>Configuration</h2>
<div>Worker max idle time in seconds: <input name="worker_max_idle_time_secs" value="{{ pool.worker_max_idle_time_secs }}" /></div>
<div>Standing worker max idle time in seconds: <input name="standing_worker_max_idle_time_secs" value="{{ pool.standing_worker_max_idle_time_secs }}" /></div>
<div>Job queue scheduling window in seconds: <input name="job_queue_scheduling_window_secs" value="{{ pool.job_queue_scheduling_window_secs }}" /></div>
<div>Label: <input name="label" value="{{ pool.label }}" /></div>
<input type="hidden" name="_csrf" value="{{ csrf_token }}"/>
<input type="hidden" name="_pool_config_json" value="{{ pool_config_json }}"/>
<button>
Expand Down
Loading
Loading