Increase API size #135

slifty · 2023-10-12T15:29:34Z

This PR increase the API machine to double the amount of processor capacity.

cecilia-donnelly

Will you also do staging? Not a true blocker since I think we'll rebuild the images on main after this is merged anyway, but I don't want to forget.

slifty · 2023-10-12T18:14:05Z

I'm not entirely sure how to re-run terraform tasks from here, but one did fail for staging.

I think maybe that failed due to something else happening in Terraform at the same time; @cecilia-donnelly do you see anything about the change to staging that you would expect to cause an error?

cecilia-donnelly · 2023-10-13T17:52:09Z

You know, @slifty, this is a real problem that only shows up on staging! @fenn-cs 's automated test script creates a subnet in Permanent's AWS that conflicts with the subnet search for staging, which is why this doesn't work. We ran into this a few months ago, too.

Last time I deleted the conflicting subnet since Fon wasn't running his tests then and had deleted the relevant instances. We could (1) do that again and bring it back up with the script afterward, (2) do that and also change the script so it uses a non-conflicting subnet, or (3) make the search in infrastructure a bit smarter so the subnets don't conflict anymore. Any of those is fine, though (3) requires a bit more careful testing.

slifty · 2023-10-13T18:00:10Z

@cecilia-donnelly I vote for 2 -- it sounds like a bug in the test scripts that should be resolved.

@fenn-cs is this something you are able to knock out so we can get this merged?

nfebe · 2023-10-13T20:07:29Z

@slifty Yeah early next week, but if it's urgent I can check it out this weekend.

nfebe · 2023-10-18T17:59:05Z

@slifty So I started by doing 1) above which is deleting the subnets, using terraform destroy from the same environment that created the subnet.

I have looked into how to make it "smart" and found the cidrsubnet which I have played with locally and realized the addressed calculated are still pretty predictable and is not all that "smart".

So,

Either I have not figured out things correctly
It's as it seems, which means I don't really need cidrsubnet and can just change the conflicting IP to another one different from the existing one.

I want us to test this, by you repeating the action, so I can see if the cidrsubnet works as it should when the first address is unavailable before concluding what goes into the PR for the rclone-iac

For now, there is not conflict and the deployment can continue.

Further context

Here is the problematic section:

https://github.com/OpenTechStrategies/permanent-rclone-iac/blob/057d71a8a515d0fb541d75155336a602725d1761/main.tf#L40-L47

resource "aws_vpc" "mainvpc" {
  cidr_block = "10.0.0.0/16"
  enable_dns_support   = true
  enable_dns_hostnames = true
}

resource "aws_subnet" "public" {
  vpc_id = aws_vpc.mainvpc.id
  cidr_block = "10.0.0.0/24"
  availability_zone = "us-west-2a"
  tags = {
    Name = "Default subnet for us-west-2a"
  }
}

Here is what it should look like with cidrsubnet

resource "aws_vpc" "mainvpc" {
  cidr_block = "10.0.0.0/16"
  enable_dns_support   = true
  enable_dns_hostnames = true
}

resource "aws_subnet" "public" {
  vpc_id = aws_vpc.mainvpc.id
  cidr_block = cidrsubnet(aws_vpc.mainvpc.cidr_block, 8, 1) // The last paramameter as "0" leads to the exact same conflicting address
  availability_zone = "us-west-2a"
  tags = {
    Name = "Default subnet for us-west-2a"
  }
}

As part of our work exploring the impact of the sftp service, we identified the need to increase the perfomance capability of the API server in order to handle the volume of API calls made by SFTP users [1]. Amazon provides a range of instance sizes. This choice increases the CPU capacity of the machine without expanding the memory capacity. This is because our initial benchmarks seemed to be pinning CPU usage but not putting a particularly large dent in memory. [1] PermanentOrg/sftp-service#268

Fix a typo

a779d7b

slifty requested a review from cecilia-donnelly October 12, 2023 15:29

cecilia-donnelly requested changes Oct 12, 2023

View reviewed changes

slifty force-pushed the noissue-api-size branch from ff7e023 to cb60e60 Compare October 12, 2023 15:51

slifty requested a review from cecilia-donnelly October 12, 2023 18:14

slifty force-pushed the noissue-api-size branch from cb60e60 to fd7d058 Compare October 19, 2023 15:11

cecilia-donnelly approved these changes Oct 19, 2023

View reviewed changes

slifty merged commit 7ce5670 into main Oct 19, 2023
3 checks passed

slifty deleted the noissue-api-size branch October 19, 2023 15:13

cecilia-donnelly mentioned this pull request Oct 23, 2023

Revert "Increase API size" #136

Merged

kfogel mentioned this pull request Oct 26, 2023

Double capacity of api instance #137

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Increase API size #135

Increase API size #135

slifty commented Oct 12, 2023

cecilia-donnelly left a comment

slifty commented Oct 12, 2023 •

edited

Loading

cecilia-donnelly commented Oct 13, 2023

slifty commented Oct 13, 2023

nfebe commented Oct 13, 2023

nfebe commented Oct 18, 2023 •

edited

Loading

Increase API size #135

Increase API size #135

Conversation

slifty commented Oct 12, 2023

cecilia-donnelly left a comment

Choose a reason for hiding this comment

slifty commented Oct 12, 2023 • edited Loading

cecilia-donnelly commented Oct 13, 2023

slifty commented Oct 13, 2023

nfebe commented Oct 13, 2023

nfebe commented Oct 18, 2023 • edited Loading

Further context

slifty commented Oct 12, 2023 •

edited

Loading

nfebe commented Oct 18, 2023 •

edited

Loading