ContainerImage.Pinniped

Author	SHA1	Message	Date
Monis Khan	c71ffdcd1e	leader election: use better duration defaults OpenShift has good defaults for these duration fields that we can use instead of coming up with them ourselves: `e14e06ba8d/pkg/config/leaderelection/leaderelection.go (L87-L109)` Copied here for easy future reference: // We want to be able to tolerate 60s of kube-apiserver disruption without causing pod restarts. // We want the graceful lease re-acquisition fairly quick to avoid waits on new deployments and other rollouts. // We want a single set of guidance for nearly every lease in openshift. If you're special, we'll let you know. // 1. clock skew tolerance is leaseDuration-renewDeadline == 30s // 2. kube-apiserver downtime tolerance is == 78s // lastRetry=floor(renewDeadline/retryPeriod)retryPeriod == 104 // downtimeTolerance = lastRetry-retryPeriod == 78s // 3. worst non-graceful lease acquisition is leaseDuration+retryPeriod == 163s // 4. worst graceful lease acquisition is retryPeriod == 26s if ret.LeaseDuration.Duration == 0 { ret.LeaseDuration.Duration = 137 time.Second } if ret.RenewDeadline.Duration == 0 { // this gives 107/26=4 retries and allows for 137-107=30 seconds of clock skew // if the kube-apiserver is unavailable for 60s starting just before t=26 (the first renew), // then we will retry on 26s intervals until t=104 (kube-apiserver came back up at 86), and there will // be 33 seconds of extra time before the lease is lost. ret.RenewDeadline.Duration = 107 * time.Second } if ret.RetryPeriod.Duration == 0 { ret.RetryPeriod.Duration = 26 * time.Second } Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-24 16:21:53 -04:00
Monis Khan	c0617ceda4	leader election: in-memory leader status is stopped before release This change fixes a small race condition that occurred when the current leader failed to renew its lease. Before this change, the leader would first release the lease via the Kube API and then would update its in-memory status to reflect that change. Now those events occur in the reverse (i.e. correct) order. Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-24 15:02:56 -04:00
Mo Khan	3077034b2d	Merge branch 'main' into oidc_password_grant	2021-08-24 12:23:52 -04:00
Ryan Richard	211f4b23d1	Log auth endpoint errors with stack traces	2021-08-20 14:41:02 -07:00
Monis Khan	c356710f1f	Add leader election middleware Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-20 12:18:25 -04:00
Ryan Richard	6239a567a8	remove one nolint:unparam comment	2021-08-19 10:57:00 -07:00
Ryan Richard	e4d418a076	Merge branch 'main' into oidc_password_grant	2021-08-19 10:55:54 -07:00
Ryan Richard	b4a39ba3c4	Remove `unparam` linter We decided that this linter does not provide very useful feedback for our project.	2021-08-19 10:20:24 -07:00
Ryan Richard	61c21d2977	Refactor some authorize and callback error handling, and add more tests	2021-08-18 12:06:46 -07:00
Ryan Richard	04b8f0b455	Extract Supervisor authorize endpoint string constants into apis pkg	2021-08-18 10:20:33 -07:00
Ryan Richard	0089540b07	Extract Supervisor IDP discovery endpoint string constants into apis pkg	2021-08-17 17:50:02 -07:00
Ryan Richard	62c6d53a21	Merge branch 'main' into oidc_password_grant	2021-08-17 15:23:29 -07:00
Ryan Richard	96474b3d99	Extract Supervisor IDP discovery endpoint types into apis package	2021-08-17 15:23:03 -07:00
Ryan Richard	964d16110e	Some refactors based on PR feedback from @enj	2021-08-17 13:14:09 -07:00
Monis Khan	e0901f4fe5	dynamiccert: prevent misuse of NewServingCert The Kube API server code that we use will cast inputs in an attempt to see if they implement optional interfaces. This change adds a simple wrapper struct to prevent such casts from causing us any issues. Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-17 12:58:32 -04:00
Ryan Richard	52409f86e8	Merge branch 'main' into oidc_password_grant	2021-08-16 15:17:55 -07:00
Ryan Richard	91c8a3ebed	Extract private helper in auth_handler.go	2021-08-16 15:17:30 -07:00
Ryan Richard	52cb0bbc07	More unit tests and small error handling changes for OIDC password grant	2021-08-16 14:27:40 -07:00
Monis Khan	7a812ac5ed	impersonatorconfig: only unload dynamiccert when proxy is disabled In the upstream dynamiccertificates package, we rely on two pieces of code: 1. DynamicServingCertificateController.newTLSContent which calls - clientCA.CurrentCABundleContent - servingCert.CurrentCertKeyContent 2. unionCAContent.VerifyOptions which calls - unionCAContent.CurrentCABundleContent This results in calls to our tlsServingCertDynamicCertProvider and impersonationSigningCertProvider. If we Unset these providers, we subtly break these consumers. At best this results in test slowness and flakes while we wait for reconcile loops to converge. At worst, it results in actual errors during runtime. For example, we previously would Unset the impersonationSigningCertProvider on any sync loop error (even a transient one caused by a network blip or a conflict between writes from different replicas of the concierge). This would cause us to transiently fail to issue new certificates from the token credential require API. It would also cause us to transiently fail to authenticate previously issued client certs (which results in occasional Unauthorized errors in CI). Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-16 16:07:46 -04:00
Ryan Richard	50085a505b	First unit test for auth endpoint's password grant and related refactor	2021-08-12 17:53:14 -07:00
Ryan Richard	5b96d014b4	Merge branch 'main' into oidc_password_grant	2021-08-12 11:12:57 -07:00
Ryan Richard	84c3c3aa9c	Optionally allow OIDC password grant for CLI-based login experience - Add `AllowPasswordGrant` boolean field to OIDCIdentityProvider's spec - The oidc upstream watcher controller copies the value of `AllowPasswordGrant` into the configuration of the cached provider - Add password grant to the UpstreamOIDCIdentityProviderI interface which is implemented by the cached provider instance for use in the authorization endpoint - Enhance the IDP discovery endpoint to return the supported "flows" for each IDP ("cli_password" and/or "browser_authcode") - Enhance `pinniped get kubeconfig` to help the user choose the desired flow for the selected IDP, and to write the flow into the resulting kubeconfg - Enhance `pinniped login oidc` to have a flow flag to tell it which client-side flow it should use for auth (CLI-based or browser-based) - In the Dex config, allow the resource owner password grant, which Dex implements to also return ID tokens, for use in integration tests - Enhance the authorize endpoint to perform password grant when requested by the incoming headers. This commit does not include unit tests for the enhancements to the authorize endpoint, which will come in the next commit - Extract some shared helpers from the callback endpoint to share the code with the authorize endpoint - Add new integration tests	2021-08-12 10:45:39 -07:00
Monis Khan	4a17e1e736	impersonator: update tests for new Impersonate-Uid code Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-09 19:16:54 -04:00
Monis Khan	a027f1ae2c	jwtcachefiller: update to use CAContentProvider Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-09 19:16:25 -04:00
Matt Moyer	58bbffded4	Switch to a slimmer distroless base image. At a high level, it switches us to a distroless base container image, but that also includes several related bits: - Add a writable /tmp but make the rest of our filesystems read-only at runtime. - Condense our main server binaries into a single pinniped-server binary. This saves a bunch of space in the image due to duplicated library code. The correct behavior is dispatched based on `os.Args[0]`, and the `pinniped-server` binary is symlinked to `pinniped-concierge` and `pinniped-supervisor`. - Strip debug symbols from our binaries. These aren't really useful in a distroless image anyway and all the normal stuff you'd expect to work, such as stack traces, still does. - Add a separate `pinniped-concierge-kube-cert-agent` binary with "sleep" and "print" functionality instead of using builtin /bin/sleep and /bin/cat for the kube-cert-agent. This is split from the main server binary because the loading/init time of the main server binary was too large for the tiny resource footprint we established in our kube-cert-agent PodSpec. Using a separate binary eliminates this issue and the extra binary adds only around 1.5MiB of image size. - Switch the kube-cert-agent code to use a JSON `{"tls.crt": "<b64 cert>", "tls.key": "<b64 key>"}` format. This is more robust to unexpected input formatting than the old code, which simply concatenated the files with some extra newlines and split on whitespace. - Update integration tests that made now-invalid assumptions about the `pinniped-server` image. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-08-09 15:05:13 -04:00
Matt Moyer	5f679059d5	Add ClusterIP service to impersonator-config-controller informer. Prior to this fix, this controller did not correctly react to changes to the ClusterIP service. It would still eventually react with a long delay due to our 5 minute resync interval. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-28 11:57:18 -05:00
Monis Khan	8b4ed86071	certs_expirer: be specific about what secret to delete This change fixes a race that can occur because we have multiple writers with no leader election lock. 1. TestAPIServingCertificateAutoCreationAndRotation/automatic expires the current serving certificate 2. CertsExpirerController 1 deletes expired serving certificate 3. CertsExpirerController 2 starts deletion of expired serving certificate but has not done so yet 4. CertsManagerController 1 creates new serving certificate 5. TestAPIServingCertificateAutoCreationAndRotation/automatic records the new serving certificate 6. CertsExpirerController 2 finishes deletion, and thus deletes the newly created serving certificate instead of the old one 7. CertsManagerController 2 creates new serving certificate 8. TestAPIServingCertificateAutoCreationAndRotation/automatic keeps running and eventually times out because it is expecting the serving certificate created by CertsManagerController 2 to match the value it recorded from CertsManagerController 1 (which will never happen since that certificate was incorrectly deleted). Signed-off-by: Monis Khan <mok@vmware.com>	2021-07-28 09:56:05 -04:00
Matt Moyer	727035a2dc	Fix form_post CSS styling in Firefox and Safari. This functioned fine, but did not have the intended visual appearance when it came to how the text of the auth code wrapped inside the copy button in the manual flow. The new styling behaves correctly on at least Chrome, Firefox, and Safari on macOS. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-28 08:09:20 -05:00
Ryan Richard	f17f7c0c6a	Small refactors in impersonator_config.go suggested by @mattmoyer	2021-07-26 17:46:06 -07:00
Ryan Richard	708164b878	Carefully merge desired annotations into impersonation proxy Service Don't overwrite annotations that might have come from a human user or from some other non-Pinniped controller.	2021-07-22 17:09:50 -07:00
Ryan Richard	2bba39d723	TestAgentController unit test is flaky, try to add workaround TestAgentController really runs the controller and evaluates multiple calls to the controller's Sync with real informers caching updates. There is a large amount of non-determinism in this unit test, and it does not always behave the same way. Because it makes assertions about the specific errors that should be returned by Sync, it was not accounting for some errors that are only returned by Sync once in a while depending on the exact (unpredictable) order of operations. This commit doesn't fix the non-determinism in the test, but rather tries to work around it by also allowing other (undesired but inevitable) error messages to appear in the list of actual error messages returned by the calls to the Sync function. Signed-off-by: Margo Crawford <margaretc@vmware.com>	2021-07-15 13:41:31 -07:00
Matt Moyer	71d4e05fb6	Add custom response_mode=form_post HTML template. This is a new pacakge internal/oidc/provider/formposthtml containing a number of static files embedded using the relatively recent Go "//go:embed" functionality introduced in Go 1.16 (https://blog.golang.org/go1.16). The Javascript and CSS files are minifiied and injected to make a single self-contained HTML response. There is a special Content-Security-Policy helper to calculate hash-based script-src and style-src rules. This new code is covered by a new integration test that exercises the JS/HTML functionality in a real browser outside of the rest of the Supervisor. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:43 -05:00
Matt Moyer	674cd4a88c	Adjust our securityheader pkg to support form_post. Our Supervisor callback handler now needs to load JS and CSS from the provider endpoint, and this JS needs to make a `fetch()` call across origins (to post the form to the CLI callback). This requires a custom Content-Security-Policy compared to other pages we render. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:43 -05:00
Matt Moyer	2823d4d1e3	Add "response_modes_supported" to Supervisor discovery response. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:43 -05:00
Matt Moyer	6d83ecb420	Unit test response_mode=form_post in internal/oidc/callback. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:43 -05:00
Matt Moyer	c27eb17f23	Add "response_mode=form_post" to CLI client. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:42 -05:00
Matt Moyer	9f91c6c884	Merge branch 'main' into oidc-upstream-watcher-supports-proxy	2021-07-09 07:24:52 -07:00
Ryan Richard	e130da6daa	Add unit test assertion for new OIDC client request timeout	2021-07-08 11:47:49 -07:00
Ryan Richard	f0d120a6ca	Fix broken upstream OIDC discovery timeout added in previous commit After noticing that the upstream OIDC discovery calls can hang indefinitely, I had tried to impose a one minute timeout on them by giving them a timeout context. However, I hadn't noticed that the context also gets passed into the JWKS fetching object, which gets added to our cache and used later. Therefore the timeout context was added to the cache and timed out while sitting in the cache, causing later JWKS fetchers to fail. This commit is trying again to impose a reasonable timeout on these discovery and JWKS calls, but this time by using http.Client's Timeout field, which is documented to be a timeout for each request/response cycle, so hopefully this is a more appropriate way to impose a timeout for this use case. The http.Client instance ends up in the cache on the JWKS fetcher object, so the timeout should apply to each JWKS request as well. Requests that can hang forever are effectively a server-side resource leak, which could theoretically be taken advantage of in a denial of service attempt, so it would be nice to avoid having them.	2021-07-08 09:44:02 -07:00
Ryan Richard	f1e63c55d4	Add `https_proxy` and `no_proxy` settings for the Supervisor - Add new optional ytt params for the Supervisor deployment. - When the Supervisor is making calls to an upstream OIDC provider, use these variables if they were provided. - These settings are integration tested in the main CI pipeline by sometimes setting them on deployments in certain cases, and then letting the existing integration tests (e.g. TestE2EFullIntegration) provide the coverage, so there are no explicit changes to the integration tests themselves in this commit.	2021-07-07 12:50:13 -07:00
Matt Moyer	562951b77a	Merge branch 'main' into oidc-upstream-watcher-supports-proxy	2021-07-06 11:30:09 -07:00
Ryan Richard	629bf61655	Extract some trivial helpers for identical code usages	2021-06-30 15:02:14 -07:00
Guangyuan Wang	76dc39ac2d	Use hostname instead of host and split on ":" Co-authored-by: Christian Ang <angc@vmware.com> Co-authored-by: Tyler Schultz <tschultz@vmware.com>	2021-06-28 23:03:05 +00:00
Guangyuan Wang	d19d63ad7d	Set Proxy on oidc upstream watcher transport - this allows the oidc upsream watcher to honor the HTTP_PROXY,HTTPS_PROXY,NO_PROXY environment variables Co-authored-by: Christian Ang <angc@vmware.com>	2021-06-24 22:35:16 +00:00
Monis Khan	5ff2be973c	credentialrequest: use safer approximation for ExpirationTimestamp We want the value of time.Now() to be calculated before the call to IssueClientCertPEM to prevent the ExpirationTimestamp from being later than the notAfter timestamp on the issued certificate. Signed-off-by: Monis Khan <mok@vmware.com>	2021-06-23 11:07:00 -04:00
Monis Khan	d78b845575	Fix bad test package name Signed-off-by: Monis Khan <mok@vmware.com>	2021-06-22 11:23:19 -04:00
Matt Moyer	551249fb69	Use a custom type for our static CLI client (smaller change). Before this change, we used the `fosite.DefaultOpenIDConnectClient{}` struct, which implements the `fosite.Client` and `fosite.OpenIDConnectClient` interfaces. For a future change, we also need to implement some additional optional interfaces, so we can no longer use the provided default types. Instead, we now use a custom `clientregistry.Client{}` struct, which implements all the requisite interfaces and can be extended to handle the new functionality (in a future change). There is also a new `clientregistry.StaticRegistry{}` struct, which implements the `fosite.ClientManager` and looks up our single static client. We could potentially extend this in the future with a registry backed by Kubernetes API, for example. This should be 100% refactor, with no user-observable change. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-06-15 15:31:48 -05:00
Monis Khan	269db6b7c2	impersonator: always authorize every request This change updates the impersonator to always authorize every request instead of relying on the Kuberentes API server to perform the check on the impersonated request. This protects us from scenarios where we fail to correctly impersonate the user due to some bug in our proxy logic. We still rely completely on the API server to perform admission checks on the impersonated requests. Signed-off-by: Monis Khan <mok@vmware.com>	2021-06-14 12:53:09 -04:00
Monis Khan	addf632e7c	impersonator: add docs regarding limited serivce account Signed-off-by: Monis Khan <mok@vmware.com>	2021-06-11 15:37:55 -04:00
Monis Khan	898f2bf942	impersonator: run as a distinct SA with minimal permissions This change updates the impersonation proxy code to run as a distinct service account that only has permission to impersonate identities. Thus any future vulnerability that causes the impersonation headers to be dropped will fail closed instead of escalating to the concierge's default service account which has significantly more permissions. Signed-off-by: Monis Khan <mok@vmware.com>	2021-06-11 12:13:53 -04:00

1 2 3 4 5 ...

735 Commits