Bigint Migration for 'events' Table (Step 1) #4281

Merged
4 changes: 2 additions & 2 deletions .rubocop_cc.yml
@@ -138,8 +138,8 @@ Rails/DangerousColumnNames: # Disabled, in comparison to active_record we need t
Rails/SkipsModelValidations: # We don`t want any model at all in migrations and migration specs
  Enabled: true
  Exclude:
    - db/migrations/**/*
    - spec/migrations/**/*

#### ENABLED SECTION
Gemspec/DeprecatedAttributeAssignment:
22 changes: 22 additions & 0 deletions db/migrations/20250327142351_bigint_migration_events_step1.rb
@@ -0,0 +1,22 @@
require 'database/bigint_migration'

Sequel.migration do
  up do
    if database_type == :postgres && !VCAP::BigintMigration.opt_out?
      if VCAP::BigintMigration.table_empty?(self, :events)
        VCAP::BigintMigration.change_pk_to_bigint(self, :events)
      else
        VCAP::BigintMigration.add_bigint_column(self, :events)
        VCAP::BigintMigration.create_trigger_function(self, :events)
      end
    end
  end

  down do
    if database_type == :postgres
      VCAP::BigintMigration.revert_pk_to_integer(self, :events)
      VCAP::BigintMigration.drop_trigger_function(self, :events)
      VCAP::BigintMigration.drop_bigint_column(self, :events)
    end
  end
end
70 changes: 35 additions & 35 deletions decisions/0013-migrating-int-to-bigint-for-primary-keys.md
@@ -1,40 +1,50 @@
# 13: Migrating `int` to `bigint` for `id` Primary Keys

Date: 2025-04-04

## Status

Draft :construction:

## Context

The primary key `id` columns in all database tables use the integer type, which has a maximum value of 2,147,483,647.
As foundations grow over time, the `id` values in some of these tables (e.g., events) are approaching this limit.
If the limit is reached, the cloud controller will be unable to insert new records, leading to critical failures in the CF API.
E.g.:
```
PG::SequenceGeneratorLimitExceeded: ERROR: nextval: reached maximum value of sequence "events_id_seq"
```
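As an illustration (not part of this change), an operator can check how much headroom a sequence-backed `id` still has, assuming the sequence name from the error above:

```sql
-- Illustrative check: remaining headroom of the events id sequence before the int limit.
SELECT last_value, 2147483647 - last_value AS remaining_ids
FROM events_id_seq;
```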
The goal is to migrate these primary key `id` columns from `int` to `bigint` without causing downtime.
This migration must:
- Avoid downtime since the CF API is actively used in production.
- Handle tables with millions of records efficiently.
- Provide a safe rollback mechanism in case of issues during the migration.
- Be reusable for other tables in the future.
- Ensure that the migration is only executed when the new `id_bigint` column is fully populated.

The largest tables in a long-running foundation are `events`, `delayed_jobs`, `jobs`, and `app_usage_events`.

## Decisions
### PostgreSQL Only
We will implement the migration exclusively for PostgreSQL databases.
The reasons for this decision are:
- **Organizational Usage**: Our organization exclusively uses PostgreSQL, not MySQL. This allows us to test the migration with copies of large production databases and perform a step-wise rollout from test environments to production.
- **Support and Contribution**: Focusing on PostgreSQL enables us to identify and address any issues during the migration process. We can contribute solutions back to the migration procedure, benefiting the broader community.
- **Deployment Environments**: We operate PostgreSQL instances across various hyperscalers. Successful migration in our environments increases confidence that it will work for others using PostgreSQL.
- **Limited MySQL Exposure**: We lack access to production environments using MySQL and have limited expertise with it. Testing would be confined to community-owned test foundations, which do not reflect real-world production data. Additionally, the community uses a limited set of MySQL variants, reducing our ability to detect and resolve issues during a production rollout.
- **Community Feedback**: Feedback from other organizations operating Cloud Foundry on MySQL indicates they would opt-out of this migration, as their foundations are smaller and unlikely to encounter the issues this migration addresses.

While this approach results in somewhat inconsistent schemas between PostgreSQL and MySQL — specifically regarding the data types of primary and foreign keys — the application layer does not depend on these specifics.
Therefore, no additional application logic is required to handle these differences.

By concentrating our efforts on PostgreSQL, we can ensure a robust and thoroughly tested migration process, leveraging our expertise and infrastructure to maintain the stability and scalability of the Cloud Controller database.

### Opt-Out Mechanism
Operators of smaller foundations, which are unlikely to ever encounter the integer overflow issue, may wish to avoid the risks and complexity associated with this migration.
They can opt out of the migration by setting the `skip_bigint_id_migration` flag in the CAPI-Release manifest.
When this flag is set, all migration steps will result in a no-op but will still be marked as applied in the `schema_versions` table.
*Important*: Removing the flag later will *not* re-trigger the migration. Operators must handle the migration manually if they choose to opt out.

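A minimal sketch of the corresponding Cloud Controller config entry (the exact property path in the CAPI-Release manifest or ops file may differ per deployment):

```yaml
# Illustrative cc config fragment: opt out of the bigint id migration.
skip_bigint_id_migration: true
```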
### Scope
The `events` table will be migrated first as it has the most significant growth in `id` values.
Other tables will be migrated at a later stage.

@@ -44,6 +54,7 @@ This will be implemented with migration step 1 and will be only applied, if the

### Phased Migration
The migration will be conducted in multiple steps to ensure minimal risk.

#### Step 1 - Preparation
- If the opt-out flag is set, this step will be a no-op.
- In case the target table is empty, the type of the `id` column will be set to `bigint` directly.
@@ -57,19 +68,21 @@ The migration will be conducted in multiple steps to ensure minimal risk.
- If the `id_bigint` column does not exist, backfill will be skipped or result in a no-op.
- Use a batch-processing script (e.g. a delayed job) to populate `id_bigint` for existing rows in both the primary table and, if applicable, all foreign key references.
- Table locks will be avoided by using a batch processing approach.
- In case the table has a configurable cleanup duration (e.g. events), no backfill job will be created. The recommended procedure is to delay the next migration step until the cleanup job has removed all records without `id_bigint` from the table. Additionally, there is a Rake task (`bundle exec rake 'db:bigint_backfill[table]'`) to manually force the backfill, e.g. if the next migration step should be applied before the cleanup duration has passed (see the invocation sketch after this list). Performance implications on the database should be considered when running this task.
- Backfill will be executed outside the migration due to its potentially long runtime.
- If necessary, the backfill will run for multiple weeks to ensure all records are processed.

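Illustrative invocations of the Rake task added with this change; the optional second and third arguments map to `batch_size` (default 10,000) and `iterations` (default -1, i.e. run until all rows are backfilled):

```shell
# Backfill the events table with the default batch size until no rows are left.
bundle exec rake 'db:bigint_backfill[events]'

# Backfill with an explicit batch size (50,000 rows) and at most 10 batches.
bundle exec rake 'db:bigint_backfill[events,50000,10]'
```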
#### Step 3 - Migration
- The migration is divided into two parts, a pre-check and the actual migration, but both will be stored in a single migration script.
- This step will be a no-op if the opt-out flag is set or the `id` column is already of type `bigint`.
- All SQL statements will be executed in a single transaction to ensure consistency.

##### Step 3a - Migration Pre Check
- In case the `id_bigint` column does not exist, the migration will fail with a clear error message.
- Add a `CHECK` constraint to verify that `id_bigint` is fully populated (`id_bigint IS NOT NULL AND id_bigint = id`).
- In case the backfill is not yet complete or the `id_bigint` column is not fully populated, the migration will fail.
- If the pre-check fails, operators might need to take manual action to ensure all preconditions are met, as the migration will be retried during the next deployment.

##### Step 3b - Actual Migration
- Remove the `CHECK` constraint once verified.
- Drop the primary key constraint on id.
@@ -87,11 +100,6 @@ The default value of the `id` column could be either a sequence (for PostgreSQL
This depends on the version of PostgreSQL which was used when the table was initially created.
The migration script needs to handle both cases.

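One illustrative way to distinguish the two cases on PostgreSQL (not part of this change):

```sql
-- Sequence-backed default: column_default = nextval('events_id_seq'::regclass), is_identity = 'NO'
-- Identity column:         column_default IS NULL, is_identity = 'YES'
SELECT column_default, is_identity
FROM information_schema.columns
WHERE table_name = 'events' AND column_name = 'id';
```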
### Rollback Mechanism
The old `id` column is no longer retained, as the `CHECK` constraint ensures correctness during migration.
Step 3b (switch over) will be executed in a single transaction and will be rolled back if any issues occur.
@@ -103,22 +111,21 @@ Write reusable scripts for adding `id_bigint`, setting up triggers, backfilling
These scripts can be reused for other tables in the future.

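As an illustration of the reuse, a step-1 migration for another table (here `app_usage_events`, one of the large tables named above, purely as an example) could mirror the `events` migration added with this change:

```ruby
require 'database/bigint_migration'

Sequel.migration do
  up do
    if database_type == :postgres && !VCAP::BigintMigration.opt_out?
      if VCAP::BigintMigration.table_empty?(self, :app_usage_events)
        # Empty table: switch the primary key type directly.
        VCAP::BigintMigration.change_pk_to_bigint(self, :app_usage_events)
      else
        # Populated table: add id_bigint and keep it in sync via trigger until the switch-over step.
        VCAP::BigintMigration.add_bigint_column(self, :app_usage_events)
        VCAP::BigintMigration.create_trigger_function(self, :app_usage_events)
      end
    end
  end

  down do
    if database_type == :postgres
      VCAP::BigintMigration.revert_pk_to_integer(self, :app_usage_events)
      VCAP::BigintMigration.drop_trigger_function(self, :app_usage_events)
      VCAP::BigintMigration.drop_bigint_column(self, :app_usage_events)
    end
  end
end
```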
### Release Strategy

Steps 1-2 will be released as a cf-deployment major release to ensure that the database is prepared for the migration.
Steps 3-4 will be released as a subsequent cf-deployment major release to complete the migration.
Between these releases there should be a reasonable time to allow the backfill to complete.

For the `events` table there is a default cleanup interval of 31 days.
Therefore, for the `events` table the gap between the releases should be around 60 days.

## Consequences
### Positive Consequences
- Future-proofing the schema for tables with high record counts.
- Locking during step 3b (actual migration) is minimal; at worst it could result in briefly slower queries or minimal downtime.
- A standardized process for similar migrations across the database.

### Negative Consequences

- Increased complexity in the migration process.
- Potentially long runtimes for backfilling data in case tables have millions of records.
- Requires careful coordination across multiple CAPI/CF-Deployment versions.
@@ -127,31 +134,26 @@ For the `events` table there is a default cleanup interval of 31 days. Therefore
## Alternatives Considered

### Switching to `guid` Field as Primary Key

Pros: Provides globally unique identifiers and eliminates the risk of overflow.

Cons: Might decrease query and index performance, requires significant changes for foreign key constraints, and introduces non-sequential keys.

Reason Rejected: The overhead and complexity outweighed the benefits for our use case.

### Implementing Rollover for `id` Reuse

Pros: Delays the overflow issue by reusing IDs from deleted rows. Minimal schema changes.

Cons: Potential issues with foreign key constraints and increased complexity in the rollover process. Could be problematic for tables which do not have frequent deletions.

Reason Rejected: Might work well for tables like events, but not a universal solution for all tables where there is no guarantee of frequent deletions.


### Direct Migration of `id` to `bigint` via `ALTER TABLE` Statement

Pros: One-step migration process.

Cons: Requires downtime, locks the table for the duration of the migration, and can be slow for tables with millions of records.

Reason Rejected: Downtime is unacceptable for production foundations.


## Example Migration Scripts With PostgreSQL Syntax For `events` Table

### Step 1 - Preparation
@@ -178,7 +180,6 @@ CREATE TRIGGER trigger_events_set_id_bigint
EXECUTE FUNCTION events_set_id_bigint_on_insert();

COMMIT;

```

### Step 2 - Backfill
@@ -196,10 +197,9 @@ FROM batch
WHERE events.id = batch.id;
```


### Step 3a - Migration Pre Check
```sql
ALTER TABLE events ADD CONSTRAINT check_id_bigint_matches CHECK (id_bigint IS NOT NULL AND id_bigint = id);

-- Alternative:
SELECT COUNT(*) FROM events WHERE id_bigint IS DISTINCT FROM id;
@@ -229,7 +229,7 @@ ALTER TABLE events RENAME COLUMN id_bigint TO id;
ALTER TABLE events ADD PRIMARY KEY (id);

-- Set `id` as IDENTITY with correct starting value
DO $$
DECLARE max_id BIGINT;
BEGIN
SELECT COALESCE(MAX(id), 1) + 1 INTO max_id FROM events;
@@ -256,10 +256,10 @@ SELECT COUNT(*) FROM events WHERE id_bigint IS DISTINCT FROM id;
```

## References
WIP - e.g.:
Migration scripts in the repository.
Backfill script documentation.
Trigger functions for PostgreSQL and MySQL.

### Helpful Links
- [Stack Overflow: Migrating int to bigint](https://stackoverflow.com/questions/33504982/postgresql-concurrently-change-column-type-from-int-to-bigint)
1 change: 1 addition & 0 deletions lib/cloud_controller/config_schemas/base/api_schema.rb
@@ -100,6 +100,7 @@ class ApiSchema < VCAP::Config
optional(:max_migration_statement_runtime_in_seconds) => Integer,
optional(:migration_psql_concurrent_statement_timeout_in_seconds) => Integer,
optional(:migration_psql_worker_memory_kb) => Integer,
optional(:skip_bigint_id_migration) => bool,
db: {
optional(:database) => Hash, # db connection hash for sequel
max_connections: Integer, # max connections in the connection pool
1 change: 1 addition & 0 deletions lib/cloud_controller/config_schemas/base/migrate_schema.rb
@@ -10,6 +10,7 @@ class MigrateSchema < VCAP::Config
optional(:max_migration_statement_runtime_in_seconds) => Integer,
optional(:migration_psql_concurrent_statement_timeout_in_seconds) => Integer,
optional(:migration_psql_worker_memory_kb) => Integer,
optional(:skip_bigint_id_migration) => bool,

db: {
optional(:database) => Hash, # db connection hash for sequel
83 changes: 83 additions & 0 deletions lib/database/bigint_migration.rb
@@ -0,0 +1,83 @@
module VCAP::BigintMigration
  class << self
    def opt_out?
      opt_out = VCAP::CloudController::Config.config&.get(:skip_bigint_id_migration)
      opt_out.nil? ? false : opt_out
    rescue VCAP::CloudController::Config::InvalidConfigPath
      false
    end

    def table_empty?(db, table)
      db[table].empty?
    end

    def change_pk_to_bigint(db, table)
      db.set_column_type(table, :id, :Bignum) if column_type(db, table, :id) != 'bigint'
    end

    def revert_pk_to_integer(db, table)
      db.set_column_type(table, :id, :integer) if column_type(db, table, :id) == 'bigint'
    end

    def add_bigint_column(db, table)
      db.add_column(table, :id_bigint, :Bignum, if_not_exists: true)
    end

    def drop_bigint_column(db, table)
      db.drop_column(table, :id_bigint, if_exists: true)
    end

    def create_trigger_function(db, table)
      drop_trigger_function(db, table)

      function = <<~FUNC
        BEGIN
          NEW.id_bigint := NEW.id;
          RETURN NEW;
        END;
      FUNC
      db.create_function(function_name(table), function, language: :plpgsql, returns: :trigger)
      db.create_trigger(table, trigger_name(table), function_name(table), each_row: true, events: :insert)
    end

    def drop_trigger_function(db, table)
      db.drop_trigger(table, trigger_name(table), if_exists: true)
      db.drop_function(function_name(table), if_exists: true)
    end

    def backfill(logger, db, table, batch_size: 10_000, iterations: -1)
      raise "table '#{table}' does not contain column 'id_bigint'" unless column_exists?(db, table, :id_bigint)

      logger.info("starting bigint backfill on table '#{table}' (batch_size: #{batch_size}, iterations: #{iterations})")
      loop do
        updated_rows = db.
                       from(table, :batch).
                       with(:batch, db[table].select(:id).where(id_bigint: nil).order(:id).limit(batch_size).for_update.skip_locked).
                       where(Sequel.qualify(table, :id) => :batch__id).
                       update(id_bigint: :batch__id)
        logger.info("updated #{updated_rows} rows")
        iterations -= 1 if iterations > 0
        break if updated_rows < batch_size || iterations == 0
      end
      logger.info("finished bigint backfill on table '#{table}'")
    end

    private

    def column_type(db, table, column)
      db.schema(table).find { |col, _| col == column }&.dig(1, :db_type)
    end

    def function_name(table)
      :"#{table}_set_id_bigint_on_insert"
    end

    def trigger_name(table)
      :"trigger_#{function_name(table)}"
    end

    def column_exists?(db, table, column)
      db[table].columns.include?(column)
    end
  end
end
15 changes: 15 additions & 0 deletions lib/tasks/db.rake
@@ -177,6 +177,21 @@ namespace :db do
end
end

desc 'Backfill id_bigint column for a given table'
task :bigint_backfill, %i[table batch_size iterations] => :environment do |_t, args|
  args.with_defaults(batch_size: 10_000, iterations: -1)
  raise ArgumentError.new("argument 'table' is required") if args.table.nil?

  RakeConfig.context = :migrate

  require 'database/bigint_migration'
  logging_output
  logger = Steno.logger('cc.db.bigint_backfill')
  RakeConfig.config.load_db_encryption_key
  db = VCAP::CloudController::DB.connect(RakeConfig.config.get(:db), logger)
  VCAP::BigintMigration.backfill(logger, db, args.table.to_sym, batch_size: args.batch_size.to_i, iterations: args.iterations.to_i)
end

namespace :dev do
desc 'Migrate the database set in spec/support/bootstrap/db_config'
task migrate: :environment do
@@ -0,0 +1,16 @@
require 'spec_helper'
require 'migrations/helpers/bigint_migration_step1_shared_context'

RSpec.describe 'bigint migration - events table - step1', isolation: :truncation, type: :migration do
  include_context 'bigint migration step1' do
    let(:migration_filename) { '20250327142351_bigint_migration_events_step1.rb' }
    let(:table) { :events }
    let(:insert) do
      lambda do |db|
        db[:events].insert(guid: SecureRandom.uuid, timestamp: Time.now.utc, type: 'type',
                           actor: 'actor', actor_type: 'actor_type',
                           actee: 'actee', actee_type: 'actee_type')
      end
    end
  end
end