UNMAINTAINED: this project is not being actively maintained anymore. It works as it is, but we won't be working on new features or bug fixes anymore. If you are using this module and are willing to maintain it, open an issue and we'll be more than happy to transfer the repo.
Terraform module(s) to setup Vault on AWS
This terraform module sets up a HA vault with a DynamoDB backend. The module sets up TLS using Letsencrypt with dns-01 challenge.
Both vault servers are optionally configured with Teleport for SSH management.
Two route53 records are provided to access the individual instances.
Because this module can be configured to setup a DynamoDB table in a separate region from the main table, it expects two Terraform providers: aws
and aws.replica
. If you want to enable the replica table, you'll need to create a second aws
provider targeting the region you want to put the replica table in, and then pass it on to the module as aws.replica
:
provider "aws" {
region = "eu-west-1"
}
provider "aws" {
alias = "replica"
region = "eu-west-2"
}
module "vault" {
...
enable_dynamodb_replica_table = true
providers = {
aws = aws
aws.replica = aws.replica
}
}
If you don't want to enable the replica table, you still need to set the providers
block in the vault module, but you can set the default aws
provider in both "slots", like so:
provider "aws" {
region = "eu-west-1"
}
module "vault" {
...
enable_dynamodb_replica_table = false
providers = {
aws = aws
aws.replica = aws
}
}
Name | Description | Type | Default | Required |
---|---|---|---|---|
acm_arn | The ACM ARN to use on the alb | string | n/a | yes |
acme_server | ACME server where to point certbot on the Teleport server to fetch an SSL certificate. Useful if you want to point to the letsencrypt staging server. |
string | "https://acme-v02.api.letsencrypt.org/directory" |
no |
ami | The AMI ID to use for the vault instances | string | n/a | yes |
dns_root | The root domain to configure for vault | string | "production.skyscrape.rs" |
no |
dynamodb_table_name_override | Override Vault's DynamoDB table name with this variable. This module will generate a name if this is left empty (default behavior) | string | null |
no |
ec2_instances_cpu_credits | The type of cpu credits to use | string | "standard" |
no |
enable_dynamodb_replica_table | Setting this to true will create a DynamoDB table on another region and enable global tables for replication. The replica table is going to be managed by the 'replica' Terraform provider | bool | false |
no |
enable_point_in_time_recovery | Whether to enable point-in-time recovery - note that it can take up to 10 minutes to enable for new tables. Note that additional charges will apply by enabling this setting | bool | true |
no |
enable_ui | Enables the Vault UI | bool | true |
no |
environment | Name of the environment where to deploy Vault (just for naming reasons) | string | n/a | yes |
instance_type | The instance type to use for the vault servers | string | "t2.micro" |
no |
key_name | Name of the sshkey to deploy on the vault instances | string | null |
yes |
lb_internal | Should the ALB be created as an internal Loadbalancer | bool | false |
no |
lb_subnets | The subnets to use for the alb | list(string) | n/a | yes |
le_email | The email address that's going to be used to register to LetsEncrypt | string | n/a | yes |
project | Name of the project | string | n/a | yes |
teleport_auth_server | The hostname or ip of the Teleport auth server. If empty, Teleport integration will be disabled (default). | string | null |
no |
teleport_node_sg | The security-group ID of the teleport server | string | null |
no |
teleport_token_1 | The Teleport token for the first instance. This can be a dynamic short-lived token | string | null |
no |
teleport_token_2 | The Teleport token for the second instance. This can be a dynamic short-lived token | string | null |
no |
teleport_version | The Teleport version to deploy | string | "3.1.8" |
no |
vault1_subnet | The subnet ID for the first vault instance | string | n/a | yes |
vault2_subnet | The subnet ID for the second vault instance | string | n/a | yes |
vault_nproc | The amount of nproc to configure vault with. Set this to the amount of CPU cores | number | 1 |
no |
vault_version | The Vault version to deploy | string | n/a | yes |
vpc_id | The VPC id to launch the instances in | string | n/a | yes |
Name | Description |
---|---|
alb_arn | The alb ARN |
alb_id | The alb id |
alb_main_target_group | The main ALB target group ARN |
alb_sg_id | The alb security group ID |
alb_vault1_target_group | The vault1 target group ARN |
alb_vault2_target_group | The vault2 target group ARN |
dynamodb_table_name | The Vault dynamodb table name |
iam_policy | The iam policy ARN used for vault |
main_dynamodb_table_region | Region where the main DynamoDB table will be created |
replica_dynamodb_table_region | Region where the replica DynamoDB table will be created, if enabled |
sg_id | The vault security-group id |
vault1_instance_id | The vault1 instance ID |
vault1_role_id | The vault1 instance-role ID |
vault1_role_name | The vault1 instance-role name |
vault1_route53_record | The vault1 route53 record id |
vault2_instance_id | The vault2 instance ID |
vault2_role_id | The vault2 instance-role ID |
vault2_role_name | The vault2 instance-role name |
vault2_route53_record | The vault2 route53 record id |
vault_route53_record | The main vault route53 record id |
In version v5.0.0 of this module, we switched to the new Terraform 0.12 syntax, and introduced some breaking changes along the way.
We've moved around some resources inside the module, like the ALB and related resources, so you'll need to tweak the state to avoid having to replace everything. We've also normalized some resource names, for example the ALB target groups will need to be replaced.
Here's the Terraform commands to run to prepare the state:
# Import ALB into the new resource address. You'll need to find the current ALB ARN
terraform import -var-file production.tfvars module.ha_vault.aws_lb.alb ALB_ARN
# Import ALB https listener into the new resource address. You'll need to finde the current listener ARN
terraform import -var-file production.tfvars module.ha_vault.aws_lb_listener.https HTTPS_ALB_LISTENER_ARN
# Remove the ALB and https listener from the old addresses
terraform state rm module.ha_vault.module.alb.aws_alb_listener.https module.ha_vault.module.alb.aws_alb.alb
# Move ALB security group and its rules
terraform state mv module.ha_vault.module.alb.aws_security_group.sg_alb module.ha_vault.aws_security_group.alb
terraform state mv module.ha_vault.module.alb.aws_security_group_rule.sg_alb_https_ingress module.ha_vault.aws_security_group_rule.sg_alb_https_ingress
terraform state mv module.ha_vault.module.alb.aws_security_group_rule.sg_alb_target_egress module.ha_vault.aws_security_group_rule.sg_alb_target_egress
Now you can apply terraform normally. There might be a slight downtime, as the target groups will need to be replaced, but shouldn't be more than a few seconds.
In v3.x of this module, the DynamoDB table creation has been moved inside a nested Terraform module. To avoid having to re-create such table, you'll need to move/rename it inside the state. The following Terraform commands should suffice:
terraform state mv module.ha_vault.aws_appautoscaling_policy.dynamodb_table_read_policy module.ha_vault.module.main_dynamodb_table.aws_appautoscaling_policy.dynamodb_table_read_policy
terraform state mv module.ha_vault.aws_appautoscaling_policy.dynamodb_table_write_policy module.ha_vault.module.main_dynamodb_table.aws_appautoscaling_policy.dynamodb_table_write_policy
terraform state mv module.ha_vault.aws_appautoscaling_target.dynamodb_table_read_target module.ha_vault.module.main_dynamodb_table.aws_appautoscaling_target.dynamodb_table_read_target
terraform state mv module.ha_vault.aws_appautoscaling_target.dynamodb_table_write_target module.ha_vault.module.main_dynamodb_table.aws_appautoscaling_target.dynamodb_table_write_target
terraform state mv module.ha_vault.aws_dynamodb_table.vault_dynamodb_table module.ha_vault.module.main_dynamodb_table.aws_dynamodb_table.vault_dynamodb_table
Additionally, if you want to enable DynamoDB global tables on an existing Vault setup, you'll first need to export all the current data, empty the table, enable the global tables feature and finally import your data back in the main table. You could do this with the DynamoDB import/export tool. The reason for this is that enabling the global tables feature in DynamoDB requires all the tables involved to be empty.
Starting from v2.0.0 of this module, the name of Vault's DynamoDB table will be dynamically generated from the values of "${var.project}"
and "${var.environment}"
. In previous versions it was hardcoded to vault-dynamodb-backend
, so to avoid breaking current deployments, we've introduced a new variable dynamodb_table_name_override
to force a specific name for the DynamoDB table. So if you're upgrading from a previous version of the module, you'll probably want to set dynamodb_table_name_override=vault-dynamodb-backend
so Terraform doesn't recreate the table.
Check out the examples folder.
Since version 2.1.0
of this module, Vault's DynamoDB table has point-in-time recovery enabled by default, which means that Vault data is being continuously backed up, and backups are retained for 35 days.
A restore operation will force you to create a brand new DynamoDB table, and if you do that you'll have to take some additional steps to fix the Terraform drift caused by that.
Follow this process only if you don't have cross-region replication enabled. If you have that enabled, see the section below.
-
Stop the Vault EC2 instances
-
Restore your data to a new DynamoDB table.
-
Set
dynamodb_table_name_override
variable to the name of your new DynamoDB table -
Remove the old DynamoDB table from the Terraform state (make sure to replace the name of the vault module you've used -
module.vault
)terraform state rm module.vault.module.main_dynamodb_table.aws_dynamodb_table.vault_dynamodb_table
-
Import the new DynamoDB table to the Terraform state (make sure to replace the name of the vault module and the name of your new DynamoDB table)
terraform import module.vault.module.main_dynamodb_table.aws_dynamodb_table.vault_dynamodb_table yournewdynamotable
-
Taint the Vault instances so they are replaced with the new DynamoDB table name. You have to do this, as Terraform ignores any changes on the instances user-data, which is where the DynamoDB table name is set. Note that doing this will force a replace of those instances in your next terraform apply
terraform taint -module ha_vault.vault1 aws_instance.instance terraform taint -module ha_vault.vault2 aws_instance.instance
-
Apply terraform. This should make the following changes:
- Modify the IAM policy to grant access to the new DynamoDB table
- Replace both Vault instances
- Configure autoscaling to the new DynamoDB table (if capacity autoscaling is enabled)
-
Wait until the new EC2 instances are up and marked as healthy in the individual ALB target groups, then unseal vault
-
Decide what to do with the old DynamoDB table. You can keep it in case you want to do another restore, or you can just delete it
In case you also have cross-region replication (Global tables) enabled, the process is a bit more complex:
-
Stop the Vault EC2 instances
-
Restore your data to a new DynamoDB table.
-
Once the restore is finished, export the data from your new DynamoDB table to S3. You can do this with Data Pipeline following this guide from AWS.
-
In next steps we're going to create a new DynamoDB table, so choose a name for it and set it in the
dynamodb_table_name_override
variable. Remember that DynamoDB table names must be unique. An alternative would be to remove the existing DynamoDB table and reuse the same name for the new table, but that way you would loose all the incremental backups from your old table. If you choose to do that, just remove the table from the AWS console and skip steps 4 and 5. -
Remove the old DynamoDB tables from the Terraform state (make sure to replace the name of the vault module you've used -
module.vault
)terraform state rm module.vault.module.main_dynamodb_table.aws_dynamodb_table.vault_dynamodb_table terraform state rm module.vault.module.replica_dynamodb_table.aws_dynamodb_table.vault_dynamodb_table terraform state rm module.vault.aws_dynamodb_global_table.vault_global_table
-
Apply terraform targeting just the global table resource. This will create your new DynamoDB tables
terraform apply -target module.vault.aws_dynamodb_global_table.vault_global_table -var-file myvars.tfvars
-
Import the Vault data from S3 to the newly created table. Same as with the export, you can do this with Data Pipeline following this guide from AWS. Note that as of this writing, the import data pipeline doesn't compute correctly the write capacity of a global table, as it adds the write capacity of all the tables belonging to the global table. So if there are two tables in a global table, both with a provisioned write capacity of 40, the data pipeline will assume the table has a provisioned write capacity of 80, and in consequence there will be a lot of throttled write requests. A workaround is to set the DynamoDB write throughput ratio of the pipeline to 0.5
-
After the import is complete, taint the Vault instances so they are replaced with the new DynamoDB table name. You have to do this, as Terraform ignores any changes on the instances user-data, which is where the DynamoDB table name is set. Note that doing this will force a replace of those instances in your next terraform apply
terraform taint -module ha_vault.vault1 aws_instance.instance terraform taint -module ha_vault.vault2 aws_instance.instance
-
Apply terraform. This should make the following changes:
- Modify the IAM policy to grant access to the new DynamoDB table
- Replace both Vault instances
- Configure autoscaling to the new DynamoDB table (if capacity autoscaling is enabled)
-
Wait until the new EC2 instances are up and marked as healthy in the individual ALB target groups, then unseal vault
-
Decide what to do with the old DynamoDB table. You can keep it in case you want to do another restore, or you can just delete it