Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Migrate tooling off of gs://kubernetes-jenkins #33381

Open
6 of 10 tasks
michelle192837 opened this issue Aug 28, 2024 · 7 comments
Open
6 of 10 tasks

Migrate tooling off of gs://kubernetes-jenkins #33381

michelle192837 opened this issue Aug 28, 2024 · 7 comments
Assignees
Labels
kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. sig/testing Categorizes an issue or PR as relevant to SIG Testing.

Comments

@michelle192837
Copy link
Contributor

michelle192837 commented Aug 28, 2024

Follow-up to the control plane migration (#33350).

Tools like Kettle, TestGrid, and others read from gs://kubernetes-jenkins, but should switch to using the new bucket (gs://kubernetes-ci-logs) or something else if needed.

  • Ensure results from kubernetes-jenkins are synced to kubernetes-ci-logs (believe this is already done as part of the control plane migration?)
  • Migrate Kettle
  • Migrate TestGrid
  • Migrate other miscellaneous use
  • K8s Infra Prow writes to kubernetes-ci-logs instead of kubernetes-jenkins
  • Remove access permissions from kubernetes-jenkins
  • Update references to kubernetes-jenkins across repos
    • kubernetes-sigs / prow
    • (any others?)
  • Ensure TTL for gs://kubernetes-jenkins
    • NOTE: Leave gs://kubernetes-jenkins (and its project) around to reserve the bucket name.

/assign
/sig testing

@michelle192837 michelle192837 added the kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. label Aug 28, 2024
@k8s-ci-robot k8s-ci-robot added the sig/testing Categorizes an issue or PR as relevant to SIG Testing. label Aug 28, 2024
@jbpratt
Copy link
Contributor

jbpratt commented Sep 17, 2024

/assign

@michelle192837
Copy link
Contributor Author

michelle192837 commented Oct 11, 2024

Other updates (copying from Slack):

  • Purge messages on k8s-infra-kettle subscription
  • Stop the kettle deployment in kubernetes-public temporarily
  • Update k8s.io deployment.yaml
    • Auto-deploy should handle it once submitted?
    • Update deployment.yaml in test-infra as well, but nothing should care about that file

Kettle is updated, looks like it is now pulling from the new subscription successfully according to logs. (I'm not sure if it's correctly acking finished.json files since I don't know if we have an example of that yet 🤔 So still confirming that)

Okay good, it's acknowledging finished.json.
I think we're good to go on the Prow change then? I'm going to let the bucket sync run one more time then stop it temporarily, then we can submit the prow change and hopefully it all works smoothly?
Run has complete and the sync job is disabled!

@michelle192837
Copy link
Contributor Author

michelle192837 commented Oct 23, 2024

Kettle has had recent issues (see Slack threads in testing-ops and sig-k8s-infra)

Ref: #33628

kubernetes/k8s.io#7454 should hopefully fix this. (And I'll make the change in test-infra to match just in case, though it shouldn't change anything).

@BenTheElder
Copy link
Member

https://cs.k8s.io/?q=kubernetes-jenkins&i=nope&files=&excludeFiles=&repos=

there's a lot more of these than I hoped, but most of them are just links in docs. a few are pulling data. a few more don't need changing and are discussing the migration.

@BenTheElder
Copy link
Member

I manually set a matching TTL on the new GCS bucket to match the old one (90d+ Age => delete)

We still need to set this in the terraform but I wanted to make sure we started aging out those old logs now as it's already like 200 TB.

@BenTheElder
Copy link
Member

@upodroid plans to migrate the TTL to terraform (thanks!)

@michelle192837
Copy link
Contributor Author

Existing lifecycle on gs://kubernetes-jenkins is also 90 days:

gcloud storage buckets describe gs://kubernetes-jenkins --format="default(lifecycle_config)"
lifecycle_config:
  rule:
  - action:
      type: Delete
    condition:
      age: 90

Migration happened August 21-ish, but log bucket migration happened later (largely early October) so things haven't hit TTL yet. We'll probably need to wait until ~January.

I don't have permissions to update gs://kubernetes-jenkins (just to view), so if we want to restrict permissions on it someone else will need to do it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. sig/testing Categorizes an issue or PR as relevant to SIG Testing.
Projects
None yet
Development

No branches or pull requests

4 participants