ContainerImage.Pinniped

Author	SHA1	Message	Date
Matt Moyer	58bbffded4	Switch to a slimmer distroless base image. At a high level, it switches us to a distroless base container image, but that also includes several related bits: - Add a writable /tmp but make the rest of our filesystems read-only at runtime. - Condense our main server binaries into a single pinniped-server binary. This saves a bunch of space in the image due to duplicated library code. The correct behavior is dispatched based on `os.Args[0]`, and the `pinniped-server` binary is symlinked to `pinniped-concierge` and `pinniped-supervisor`. - Strip debug symbols from our binaries. These aren't really useful in a distroless image anyway and all the normal stuff you'd expect to work, such as stack traces, still does. - Add a separate `pinniped-concierge-kube-cert-agent` binary with "sleep" and "print" functionality instead of using builtin /bin/sleep and /bin/cat for the kube-cert-agent. This is split from the main server binary because the loading/init time of the main server binary was too large for the tiny resource footprint we established in our kube-cert-agent PodSpec. Using a separate binary eliminates this issue and the extra binary adds only around 1.5MiB of image size. - Switch the kube-cert-agent code to use a JSON `{"tls.crt": "<b64 cert>", "tls.key": "<b64 key>"}` format. This is more robust to unexpected input formatting than the old code, which simply concatenated the files with some extra newlines and split on whitespace. - Update integration tests that made now-invalid assumptions about the `pinniped-server` image. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-08-09 15:05:13 -04:00
Ryan Richard	2bba39d723	TestAgentController unit test is flaky, try to add workaround TestAgentController really runs the controller and evaluates multiple calls to the controller's Sync with real informers caching updates. There is a large amount of non-determinism in this unit test, and it does not always behave the same way. Because it makes assertions about the specific errors that should be returned by Sync, it was not accounting for some errors that are only returned by Sync once in a while depending on the exact (unpredictable) order of operations. This commit doesn't fix the non-determinism in the test, but rather tries to work around it by also allowing other (undesired but inevitable) error messages to appear in the list of actual error messages returned by the calls to the Sync function. Signed-off-by: Margo Crawford <margaretc@vmware.com>	2021-07-15 13:41:31 -07:00
Matt Moyer	657488fe90	Create CredentialIssuer at install, not runtime. Previously, our controllers would automatically create a CredentialIssuer with a singleton name. The helpers we had for this also used "raw" client access and did not take advantage of the informer cache pattern. With this change, the CredentialIssuer is always created at install time in the ytt YAML. The controllers now only update the existing CredentialIssuer status, and they do so using the informer cache as much as possible. This change is targeted at only the kubecertagent controller to start. The impersonatorconfig controller will be updated in a following PR along with other changes. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-05-19 17:15:25 -05:00
Matt Moyer	b80cbb8cc5	Run kube-cert-agent pod as Concierge ServiceAccount. Since `0dfb3e95c5`, we no longer directly create the kube-cert-agent Pod, so our "use" permission on PodSecurityPolicies no longer has the intended effect. Since the deployments controller is now the one creating pods for us, we need to get the permission on the PodSpec of the target pod instead, which we do somewhat simply by using the same service account as the main Concierge pods. We still set `automountServiceAccountToken: false`, so this should not actually give any useful permissions to the agent pod when running. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-05-03 16:20:13 -05:00
Matt Moyer	e532a88647	Add a new "legacy pod cleaner" controller. This controller is responsible for cleaning up kube-cert-agent pods that were deployed by previous versions. They are easily identified because they use a different `kube-cert-agent.pinniped.dev` label compared to the new agent pods (`true` vs. `v2`). Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-04-26 08:19:45 -06:00
Matt Moyer	54a8297cc4	Add generated mocks for kubecertagent. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-04-26 08:19:45 -06:00
Matt Moyer	2843c4f8cb	Refactor kube-cert-agent controllers to use a Deployment. This is a relatively large rewrite of much of the kube-cert-agent controllers. Instead of managing raw Pod objects, they now create a single Deployment and let the builtin k8s controller handle it from there. This reduces the amount of code we need and should handle a number of edge cases better, especially those where a Pod becomes "wedged" and needs to be recreated. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-04-26 08:19:45 -06:00
Monis Khan	00694c9cb6	dynamiccert: split into serving cert and CA providers Signed-off-by: Monis Khan <mok@vmware.com>	2021-03-15 12:24:07 -04:00
Monis Khan	2d28d1da19	Implement all optional methods in dynamic certs provider Signed-off-by: Monis Khan <mok@vmware.com>	2021-03-11 16:24:08 -05:00
Ryan Richard	d8c6894cbc	All controller unit tests should not cancel context until test is over All controller unit tests were accidentally using a timeout context for the informers, instead of a cancel context which stays alive until each test is completely finished. There is no reason to risk unpredictable behavior of a timeout being reached during an individual test, even though with the previous 3 second timeout it could only be reached on a machine which is running orders of magnitude slower than usual, since each test usually runs in about 100-300 ms. Unfortunately, sometimes our CI workers might get that slow. This sparked a review of other usages of timeout contexts in other tests, and all of them were increased to a minimum value of 1 minute, under the rule of thumb that our tests will be more reliable on slow machines if they "pass fast and fail slow".	2021-03-04 17:26:01 -08:00
Matt Moyer	2a29303e3f	Fix label handling in kubecertagent controllers. These controllers were a bit inconsistent. There were cases where the controllers ran out of the expected order and the custom labels might not have been applied. We should still plan to remove this label handling or move responsibility into the middleware layer, but this avoids any regression. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-03-02 13:59:46 -06:00
Matt Moyer	643c60fd7a	Drop NewKubeConfigInfoPublisherController, start populating strategy frontend from kubecertagent execer controller. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-03-02 13:09:25 -06:00
Matt Moyer	c94ee7188c	Factor out issuerconfig.UpdateStrategy helper. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-03-01 16:21:10 -06:00
Matt Moyer	6565265bee	Use new 'go.pinniped.dev/generated/latest' package. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-02-16 13:00:08 -06:00
Monis Khan	0a9f446893	Update credential issuer logic to use status subresource Signed-off-by: Monis Khan <mok@vmware.com>	2021-02-10 21:52:10 -05:00
Monis Khan	89b00e3702	Declare war on namespaces Signed-off-by: Monis Khan <mok@vmware.com>	2021-02-10 21:52:07 -05:00
Margo Crawford	5611212ea9	Changing references from 1.19 to 1.20	2021-01-07 15:25:47 -08:00
Monis Khan	15a5332428	Reduce log spam Signed-off-by: Monis Khan <mok@vmware.com>	2020-11-10 10:22:27 -05:00
Monis Khan	418f4d20ae	Use parent func to indicate when the controller queue is a singleton This prevents unnecessary sync loop runs when the controller is running with a single worker. When the controller is running with more than one worker, it prevents subtle bugs that can cause the controller to go "back in time." Signed-off-by: Monis Khan <mok@vmware.com> Signed-off-by: Matt Moyer <moyerm@vmware.com>	2020-11-04 11:08:10 -06:00
Matt Moyer	59263ea733	Rename CredentialIssuerConfig to CredentialIssuer. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2020-11-02 17:39:42 -06:00
Ryan Richard	75c35e74cc	Refactor and add unit tests for previous commit to run agent pod as root	2020-11-02 15:03:37 -08:00
Ryan Richard	a01921012d	kubecertagent: explicitly run as root We need root here because the files that this pod reads are most likely restricted to root access. Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-11-02 16:33:46 -05:00
Ryan Richard	ab5c04b1f3	Merge pull request #176 from vmware-tanzu/agent_pod_additional_label_handling Handle custom labels better in the agent pod controllers	2020-11-02 09:08:42 -08:00
Ryan Richard	7597b12a51	Small unit test changes for deleter_test.go	2020-11-02 08:40:39 -08:00
Ryan Richard	f76b9857da	Don't use custom labels when selecting an agent pod And delete the agent pod when it needs its custom labels to be updated, so that the creator controller will notice that it is missing and immediately create it with the new custom labels.	2020-10-30 17:41:17 -07:00
Matt Moyer	9e1922f1ed	Split the config CRDs into two API groups. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2020-10-30 19:22:46 -05:00
Ryan Richard	94f20e57b1	Concierge controllers add labels to all created resources	2020-10-15 10:14:23 -07:00
Andrew Keesler	b21b43c654	Fix expected CIC status message on non-hosted control planes	2020-09-24 17:56:55 -04:00
Andrew Keesler	9e0195e024	kubecertagent: use initial event for when key can't be found This should fix integration tests running on clusters that don't have visible controller manager pods (e.g., GKE). Pinniped should boot, not find any controller manager pods, but still post a status in the CIC. I also updated a test helper so that we could tell the difference between when an event was not added and when an event was added with an empty key. Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-09-24 16:54:20 -04:00
Andrew Keesler	d853cbc7ff	Plumb through ImagePullSecrets to agent pod Right now in the YTT templates we assume that the agent pods are gonna use the same image as the main Pinniped deployment, so we can use the same logic for the image pull secrets. Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-09-24 15:52:05 -04:00
Andrew Keesler	fab36c55f5	inernal/controller/kubecertagent: fix some godoc's Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-09-24 13:53:06 -04:00
Ryan Richard	253d3bb36f	Remove an accidentally committed `it.Focus`	2020-09-24 08:15:10 -07:00
Andrew Keesler	9f80b0ea00	Set CIC error statuses in kubecertagent annotater and creater Also fix an instance where we were using an informer in a retry loop. Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-09-24 10:40:50 -04:00
Ryan Richard	381811b36f	Refactor constructor params of the kubecertagent pkg's controllers - Only inject things through the constructor that the controller will need - Use pkg private constants when possible for things that are not actually configurable by the user - Make the agent pod template private to the pkg - Introduce a test helper to reduce some duplicated test code - Remove some `it.Focus` lines that were accidentally committed, and repair the broken tests that they were hiding	2020-09-23 17:30:22 -07:00
Andrew Keesler	6d047c151f	Fix kubecertagent deleter test to reconcile on pod template fields I think we want to reconcile on these pod template fields so that if someone were to redeploy Pinniped with a new image for the agent, the agent would get updated immediately. Before this change, the agent image wouldn't get updated until the agent pod was deleted.	2020-09-23 11:30:13 -04:00
Andrew Keesler	9735122db9	Wire in kubecertagent.NewExecerController() to server Also fill in a couple of low-hanging unit tests. Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-09-23 11:01:41 -04:00
Andrew Keesler	6c555f94e3	internal/provider -> internal/dynamiccert 3 main reasons: - The cert and key that we store in this object are not always used for TLS. - The package name "provider" was a little too generic. - dynamiccert.Provider reads more go-ish than provider.DynamicCertProvider. Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-09-23 08:29:35 -04:00
Andrew Keesler	a55e9de4fc	Use existing clock test double to get kubecertagent units passing Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-09-23 07:50:45 -04:00
Ryan Richard	eb0d9a15fc	WIP: start replacing the kubecertauthority pkg with a new controller - Lots of TODOs added that need to be resolved to finish this WIP - execer_test.go seems like it should be passing, but it fails (sigh) Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-09-22 17:45:20 -07:00
Ryan Richard	526be79b11	Finish WIP from previous commits: agent pods created in install namespace	2020-09-21 17:15:36 -07:00
Ryan Richard	820f1e977e	Continue the WIP from the previous commit: finish adding second informer - All of the `kubecertagent` controllers now take two informers - This is moving in the direction of creating the agent pods in the Pinniped installation namespace, but that will come in a future commit	2020-09-21 16:37:22 -07:00
Andrew Keesler	50258fc569	WIP: start to create kube-cert-agent pods in namespace Signed-off-by: Andrew Keesler <akeesler@vmware.com>	2020-09-21 16:27:00 -04:00
Andrew Keesler	5a608cc84c	Add kube-cert-agent controller for getting kube API keypair	2020-09-21 14:16:14 -04:00

43 Commits