1040 Adjust health checks for ECS #67
Ref. metriport/metriport-internal#1040 Signed-off-by: Rafael Leite <[email protected]>
cpu: 1 * vCPU,
memoryLimitMiB: 1024,
memoryLimitMiB: 2048,
1 vCPU requires at least 2 GB of memory - https://docs.aws.amazon.com/AmazonECS/latest/developerguide/task_definition_parameters.html#task_size
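To make the sizing constraint concrete, here is a minimal sketch of a check for CPU/memory pairs. It assumes the service runs on Fargate (where the task-size rules in the linked page apply) and that `vCPU` in the diff is a constant equal to 1024 CPU units; the helper names `allowedMemoryMiB` and `isValidTaskSize` are hypothetical, and only sizes up to 4 vCPU are listed.

```typescript
// Assumption: `vCPU` mirrors the constant used in the diff and equals 1024 CPU units.
const vCPU = 1024;
const GiB = 1024; // MiB per GiB

// Builds the list of memory values (in MiB) from minGiB to maxGiB in 1 GiB steps.
function gibRange(minGiB: number, maxGiB: number): number[] {
  const values: number[] = [];
  for (let g = minGiB; g <= maxGiB; g++) {
    values.push(g * GiB);
  }
  return values;
}

// Memory values (MiB) accepted for each task CPU size, up to 4 vCPU.
// Larger task sizes exist but are left out of this sketch.
const allowedMemoryMiB: Record<number, number[]> = {
  [0.25 * vCPU]: [512, 1024, 2048],
  [0.5 * vCPU]: gibRange(1, 4),
  [1 * vCPU]: gibRange(2, 8),
  [2 * vCPU]: gibRange(4, 16),
  [4 * vCPU]: gibRange(8, 30),
};

// Hypothetical helper: true when the cpu/memory pair is in the table above.
function isValidTaskSize(cpu: number, memoryLimitMiB: number): boolean {
  const allowed = allowedMemoryMiB[cpu] || [];
  return allowed.indexOf(memoryLimitMiB) !== -1;
}
```

Under these assumptions, `isValidTaskSize(1 * vCPU, 1024)` is rejected while `isValidTaskSize(1 * vCPU, 2048)` is accepted, which matches the change from 1024 to 2048 MiB.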
cpu: 1 * vCPU,
memoryLimitMiB: 2048,
0.5 vCPU was not enough to start the service: health checks kept failing even after adjusting those thresholds - context
unhealthyThresholdCount: 4,
interval: Duration.seconds(20),
timeout: Duration.seconds(15),
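As a rough illustration of what these values imply, the sketch below estimates how long a failing target stays in rotation before it is marked unhealthy, and checks that the timeout fits inside the interval. The interface and function names are hypothetical, and the estimate (threshold count times interval) is an approximation, not the load balancer's exact timing.

```typescript
// Hypothetical plain-number view of the health check settings in the diff.
interface HealthCheckSettings {
  unhealthyThresholdCount: number;
  intervalSeconds: number;
  timeoutSeconds: number;
}

// Approximate time, in seconds, for consecutive failed checks to reach
// the unhealthy threshold: one check per interval, `unhealthyThresholdCount` failures.
function approxSecondsUntilUnhealthy(hc: HealthCheckSettings): number {
  return hc.unhealthyThresholdCount * hc.intervalSeconds;
}

// A check should time out before the next one is due to start.
function timeoutFitsInterval(hc: HealthCheckSettings): boolean {
  return hc.timeoutSeconds < hc.intervalSeconds;
}

// Values taken from the diff above.
const settings: HealthCheckSettings = {
  unhealthyThresholdCount: 4,
  intervalSeconds: 20,
  timeoutSeconds: 15,
};
```

With the values from the diff, `approxSecondsUntilUnhealthy(settings)` gives 80 seconds, and `timeoutFitsInterval(settings)` holds because 15 is less than 20.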
cpu: 4 * vCPU,
memoryLimitMiB: 8192,
Actually, considering the original comment and the weekly max CPU load, a closer look at some of the spikes shows they run at ~100% for minutes, so it's clear the CPU is also a bottleneck - see below:
Jul 27
Jul 26
Jul 29 (single spike)
Looks good
Ref. metriport/metriport-internal#1040
Dependencies
Description
Adjust health checks for ECS - context on each change.
Release Plan