I think we need some sort of unit testing for our flag generation logic to prevent issues such as https://github.com/bazel-contrib/bazelrc-preset.bzl/pull/56.