-
Notifications
You must be signed in to change notification settings - Fork 658
Noble cut over #2655
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Noble cut over #2655
Conversation
- replace ruby with bash - fail if more than one IP addr is found for "OUTER_CONTAINER_IP" - sundry bash lint cleanup
- rename some job - remove (ubuntu-)jammy references - replace jammy with noble - replace concours `var` with YAML anchor(s)
- addresses ruby style / deprecation issues
- noble behavior is alreayd the bosh-deployment default - docker-cpi image has `docker-env` for debug use - docker-cpi cleanup `start-bosh` script
This is required because the FIPS stemcell is currently Jammy and the compiled release is for Noble.
Based on this commit[1] we are mutating the in-container garden.ini setting to make Warden use systemd instead of the garden-provide init binary. [1] cloudfoundry/bosh-warden-cpi-release@434738f#diff-f3d9c00d365d08274b8e73e1dc4fc4b2d38a92a654d4d2b27f4ffdc01730576bR1-R8
|
One notable help in debugging was installing the gaol CLI for warden. This might be something that should be installed on the on the
cc @mkocher who may be able to provide more context |
|
A quick scan of Concourse shows that there have been problems with cgroups v2[1] in the past. These may have been resolved. Leaving this link here as a breadcrumb in case there are issues running a Concourse worker on an ubuntu-noble stemcell. |
|
Manually attempting to deploy a single noble based worker tagged with
The diff above is against the concourse deployment manifest which can be acquired via: The current FIWG concourse deployment now has a single noble-based Concourse worker. This change is ephemeral and will be removes when the periodic redeploy happens. |
|
Presumably the Build running docker-cpi on Noble concourse worker: https://bosh.ci.cloudfoundry.org/teams/main/pipelines/bosh-director/jobs/brats-acceptance/builds/459 |
|
warden-cpi image Possible root error message from We thing this corresponds to the following invocation chain, starting with The start-bosh script: Calls The Which eventually invokes the script And this script then invokes Then within
It seems possible that the method We believe that mount -o remount,loop,pquota,noatime -t xfs /var/vcap/data/grootfs/store/unprivileged.backing-store /var/vcap/data/grootfs/store/unprivilegedFrom the execution if this ... and we are not sure why |
Set the env var DEBUG to any non-empty value and both bash and bosh debug logging will be enabled.
|
…cli/bosh-gcscli-0.0.351-linux-amd64
… -> azure-storage-cli/azure-storage-cli-0.0.204-linux-amd64
|
@rkoster / @beyhan - do either of you understand why the @cf-rabbit-bot and CI Bot commits are being flagged by the CLA checker? |
…64 -> verify-multidigest/verify-multidigest-0.0.580-linux-amd64
|
Hi @aramprice, usually, @christopherclark is helping us to register bots with EasyCLA. Did we start using those bots shortly and they need now EasyCLA? |
|
@beyhan - it might be because CI is currently running on this PR's branch ( There is a CI failure in the bbl pipeline which also appears to be related to the CI bot: Perhaps something has changed with Bots in general? |
Part of: #2655
Local diff for
fly'ing thebosh-directorpipeline to consume this branch:Known issues
There are three failing jobs in CI, two are blocking and one is not:
warden-cpi, likely relies on persistent disk which is not supported bydocker-cpidocker-cpi, tightly coupleddocker-cpi, tightly coupledNeither
docker-cpi, norwarden-cpican boot a "VM" (container image) on the existing Concourse workers (jammy stemcell based) because (at least) Noble stemcells rely on Linux cgroupsv2 which are not present in jammy-based stemcells - for the "protect monit" behavior.The fix to allow the
warden-cpito successfully boot an agent is in ci/dockerfiles/warden-cpi/start-bosh.sh:7-9, see comment for context on making warden boot containers with systemd as PID 1. Based on this bosh-warden-cpi-release commit, thanks @jpalermo 🎉 .Todos
Make the following CI jobs pass
bosh-disaster-recovery-acceptance-tests
warden-cpi, likely relies on persistent disk which is not supported bydocker-cpibrats-acceptance
docker-cpi, tightly coupledUn-pause the cut-release jobs in ci