[DRAFT] Terraform refactor / Go buildpack deploy #75

jadudm · 2025-01-05T13:11:25Z

The previous Terraform had no modularity at all. It was one file, with no abstraction.

This introduces a new structure, and it should allow for (relatively) easy deployment to multiple spaces (e.g. dev, staging, and production). These notes will also be moved into a README in the TF directory.

launching the stack

make dev

at the top of the tree will deploy the dev stack. More work needs to be done in order to store the TF state in S3, so that we can run this from Github Actions. For now, this is not complete; if different devs deploy, they will have to completely destroy (tear down) the state of the other devs. This will become... annoying... once we start storing data in buckets. (Buckets must be empty in order to be torn down.)

So, the deploy to Cloud.gov is still a work-in-progress. But, it is possible, while testing/developing, to do a deploy from a local machine. Once we have GH Actions in place, we will never do a deploy from a local machine. We will always do our deploys from an action.

layout

At the top of the terraform directory are two files that matter:

Makefile
developers.tf

developers.tf will become part of our onboarding. This file is where devs add themselves as an initial commit so that they gain access to the Cloud.gov environment. We will control access to Cgov through this file. (This wiring is not in place yet, but the file is there. The access controls have to be implemented as scripts executed in a Github Action that call the CF API on Cloud.gov.)

Cgov deployments are organized into organizations and spaces. An organization might be gsa-tts-search, and a space might be dev, staging, or production.

There are two directories (currently) that contain the Terraform deploy scripts:

dev
shared

dev contains the variables and drivers for deploying to our (eventual) dev space. Every service that we deploy will get a section in this file:

module "fetch" {
  source = "../shared/services/fetch"
  # disk_quota = 256
  # memory = 128
  # instances = 1
  space_name = data.cloudfoundry_space.app_space.name
  app_space_id = data.cloudfoundry_space.app_space.id
  domain_id = data.cloudfoundry_domain.public.id
  databases = module.databases.ids
  buckets = module.buckets.ids
}

I have not yet determined if this can be made reusable between spaces (meaning, avoiding the boilerplate-ness of this). Each service has to be wired up to the correct databases and S3 buckets in its space in order to execute. Further, we might want to allocate different amounts of RAM, disk, and instances to services in the different spaces. That is, we might one 1 instance of fetch in the dev environment, but 3 instances of fetch in production. Because we only have one pool of RAM for all of the spaces combined, we will probably run light in lower environments, and run a fuller stack in production.

The service itself is defined in shared/services/<service-name>. We apparently have to include the provider (?), define the variables for the module, the outputs, and the module itself. Put another way:

providers.tf is boilerplate. It will need to change when we switch to the official cloudfoundry/cloudfoundry provider.
variables.tf defines the variables that the service needs to have defined in order to execute. For example, when instantiating the module, we need to provide the amount of RAM, disk, and the number of instances the service will be created with.
service.tf defines the service itself.

We can see the fetch service:

resource "cloudfoundry_app" "fetch" {
  name                 = "fetch"
  space                = var.app_space_id # data.cloudfoundry_space.app_space.id
  buildpacks            = ["https://github.com/cloudfoundry/apt-buildpack", "https://github.com/cloudfoundry/binary-buildpack.git"]
  path                 = "${path.module}/../app.tar.gz"
  source_code_hash     = filesha256("${path.module}/../app.tar.gz")
  disk_quota           = var.disk_quota
  memory               = var.memory
  instances            = var.instances
  strategy             = "rolling"
  timeout              = 200
  health_check_type    = "port"
  health_check_timeout = 180
  health_check_http_endpoint = "/api/heartbeat"

  service_binding {
    service_instance = var.databases.queues
  }

  service_binding {
    service_instance = var.databases.work
  }

  service_binding {
    service_instance = var.buckets.fetch
  }
}

All of the services get the entire codebase; this is because we then launch, on a per-instance basis, different code from cmd.

Variables include the ID of the space we are deploying to (e.g. we do not deploy to dev, but to a UUID4 value representing dev), the disk, memory, and instances, and more importantly, bindings to the databases and S3 buckets.

buckets and databases

In shared/cloudgov are module definitions for our databases and S3 buckets.

In dev/main.tf, we instantiate these as follows:

module "databases" {
  source              = "../shared/cloudgov/databases"
  cf_org              = local.cf_org
  cf_space            = local.cf_space
  queue_db_plan_name  = "micro-psql"
  search_db_plan_name = "micro-psql"
  work_db_plan_name   = "micro-psql"
}

For dev, we might only use micro instances. For production, however, we might instantiate xl instances. This lets us configure the databases on a per-space basis. (S3 buckets are all the same, so there is no configuration.)

This module has outputs. Once instantiated, we can refer to module.databases as a map(string) and reference the id of each of the databases (or buckets). In this way, we can pass the entire map of IDs to the services, and they can then bind to the correct databases/S3 buckets. Most (all?) services will want to bind to the queues databases; only some need to bind to work, and some need to bind to serve.

This brings the TF back to functional. Interim checkin.

This is setting up for multiple envs.

Those now need to be turned into go buildpacks.

Going to see if I can do this in a branch.

This will fail in multiple ways, because I have no secrects configured, etc. But, I'd like to see what the runner does just the same.

I can't see the action...

Apparently.

Lets avoid bash scripts.

jadudm added 5 commits January 4, 2025 08:22

Terraform updates for CGov

51eb447

This brings the TF back to functional. Interim checkin.

Continued refactoring/restructuring

3e4783a

This is setting up for multiple envs.

Continued refactoring

b72a8d4

Adding in all the services

2ac4ecf

Deploys infra, not apps

0d80973

Those now need to be turned into go buildpacks.

jadudm changed the title ~~Terraform refactor / Go buildpack deploy~~ [DRAFT] Terraform refactor / Go buildpack deploy Jan 5, 2025

jadudm added 12 commits January 5, 2025 10:03

Adding a demo workflow

db2fcd7

Going to see if I can do this in a branch.

Trying a workflow dispatch.

6471f5c

A test deploy script

f443a24

This will fail in multiple ways, because I have no secrects configured, etc. But, I'd like to see what the runner does just the same.

Will this work?

ae71729

I can't see the action...

Dispatch only?

e030c7e

Push might be needed?

31c7fca

Forgot permissions

6ef5a50

Root is needed in the runner

cd39054

Apparently.

Do more work in the workflow

faffed1

Lets avoid bash scripts.

Iterating GH Action

cd2d03c

Iterating GH Action

d817626

Iterating GH Action

2d8e780

jadudm had a problem deploying to dev January 5, 2025 16:15 — with GitHub Actions Failure

Iterating GH Action Sun Jan 5 11:29:05 AM EST 2025

4b372c4

jadudm had a problem deploying to dev January 5, 2025 16:29 — with GitHub Actions Failure

jadudm added 2 commits January 5, 2025 11:33

Iterating GH Action Sun Jan 5 11:33:31 AM EST 2025

76c4057

Iterating GH Action Sun Jan 5 11:34:39 AM EST 2025

e810c0d

jadudm had a problem deploying to dev January 5, 2025 16:34 — with GitHub Actions Failure

Iterating GH Action Sun Jan 5 11:35:56 AM EST 2025

c1e679e

jadudm had a problem deploying to dev January 5, 2025 16:36 — with GitHub Actions Failure

Iterating GH Action Sun Jan 5 11:38:56 AM EST 2025

b2dd3de

jadudm had a problem deploying to dev January 5, 2025 16:39 — with GitHub Actions Failure

Moving towards S3 backed deployment

f22e2a7

jadudm had a problem deploying to dev January 5, 2025 20:27 — with GitHub Actions Failure

jadudm had a problem deploying to dev January 6, 2025 02:23 — with GitHub Actions Failure

Iterating GH Action Sun Jan 5 09:25:40 PM EST 2025

27241ef

jadudm had a problem deploying to dev January 6, 2025 02:25 — with GitHub Actions Failure

Iterating GH Action Sun Jan 5 09:27:19 PM EST 2025

a78aaa3

jadudm had a problem deploying to dev January 6, 2025 02:27 — with GitHub Actions Failure

Iterating GH Action Sun Jan 5 09:32:24 PM EST 2025

c9e7c96

jadudm had a problem deploying to dev January 6, 2025 02:32 — with GitHub Actions Failure

Iterating GH Action Sun Jan 5 09:35:38 PM EST 2025

e434613

jadudm had a problem deploying to dev January 6, 2025 02:35 — with GitHub Actions Failure

Iterating GH Action Sun Jan 5 09:37:34 PM EST 2025

d77d25f

jadudm had a problem deploying to dev January 6, 2025 02:37 — with GitHub Actions Failure

Iterating GH Action Sun Jan 5 09:41:44 PM EST 2025

660c6b7

jadudm had a problem deploying to dev January 6, 2025 02:41 — with GitHub Actions Failure

Iterating GH Action Mon Jan 6 08:45:53 AM EST 2025

e958858

jadudm had a problem deploying to dev January 6, 2025 13:46 — with GitHub Actions Error

Iterating GH Action Mon Jan 6 03:17:23 PM EST 2025

64ebc53

jadudm had a problem deploying to dev January 6, 2025 20:17 — with GitHub Actions Failure

Iterating GH Action Mon Jan 6 03:21:36 PM EST 2025

3a4ce4b

jadudm had a problem deploying to dev January 6, 2025 20:21 — with GitHub Actions Failure

Iterating GH Action Mon Jan 6 03:24:24 PM EST 2025

551a3f4

jadudm had a problem deploying to dev January 6, 2025 20:24 — with GitHub Actions Failure

Iterating GH Action Mon Jan 6 03:25:20 PM EST 2025

f13a88d

jadudm had a problem deploying to dev January 6, 2025 20:25 — with GitHub Actions Failure

Iterating GH Action Mon Jan 6 03:27:47 PM EST 2025

688eac7

jadudm had a problem deploying to dev January 6, 2025 20:27 — with GitHub Actions Failure

Iterating GH Action Mon Jan 6 03:28:33 PM EST 2025

68116aa

jadudm had a problem deploying to dev January 6, 2025 20:28 — with GitHub Actions Failure

Iterating GH Action Mon Jan 6 03:29:37 PM EST 2025

0a79b92

jadudm had a problem deploying to dev January 6, 2025 20:29 — with GitHub Actions Failure

Interim, stashing

dc1ddbe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DRAFT] Terraform refactor / Go buildpack deploy #75

[DRAFT] Terraform refactor / Go buildpack deploy #75

jadudm commented Jan 5, 2025

[DRAFT] Terraform refactor / Go buildpack deploy #75

Are you sure you want to change the base?

[DRAFT] Terraform refactor / Go buildpack deploy #75

Conversation

jadudm commented Jan 5, 2025

launching the stack

layout

buckets and databases