ContainerImage.Pinniped

Author	SHA1	Message	Date
Monis Khan	e2cf9f6b74	leader election test: approximate that followers have observed change Instead of blindly waiting long enough for a disruptive change to have been observed by the old leader and followers, we instead rely on the approximation that checkOnlyLeaderCanWrite provides - i.e. only a single actor believes they are the leader. This does not account for clients that were in the followers list before and after the disruptive change, but it serves as a reasonable approximation. Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-26 12:59:52 -04:00
Monis Khan	74daa1da64	test/integration: run parallel tests concurrently with serial tests Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-26 12:59:52 -04:00
Ryan Richard	d20cab10b9	Replace one-off usages of busybox and debian images in integration tests Those images that are pulled from Dockerhub will cause pull failures on some test clusters due to Dockerhub rate limiting. Because we already have some images that we use for testing, and because those images are already pre-loaded onto our CI clusters to make the tests faster, use one of those images and always specify PullIfNotPresent to avoid pulling the image again during the integration test.	2021-08-25 15:12:07 -07:00
Monis Khan	c71ffdcd1e	leader election: use better duration defaults OpenShift has good defaults for these duration fields that we can use instead of coming up with them ourselves: `e14e06ba8d/pkg/config/leaderelection/leaderelection.go (L87-L109)` Copied here for easy future reference: // We want to be able to tolerate 60s of kube-apiserver disruption without causing pod restarts. // We want the graceful lease re-acquisition fairly quick to avoid waits on new deployments and other rollouts. // We want a single set of guidance for nearly every lease in openshift. If you're special, we'll let you know. // 1. clock skew tolerance is leaseDuration-renewDeadline == 30s // 2. kube-apiserver downtime tolerance is == 78s // lastRetry=floor(renewDeadline/retryPeriod)*retryPeriod == 104 // downtimeTolerance = lastRetry-retryPeriod == 78s // 3. worst non-graceful lease acquisition is leaseDuration+retryPeriod == 163s // 4. worst graceful lease acquisition is retryPeriod == 26s if ret.LeaseDuration.Duration == 0 { ret.LeaseDuration.Duration = 137 * time.Second } if ret.RenewDeadline.Duration == 0 { // this gives 107/26=4 retries and allows for 137-107=30 seconds of clock skew // if the kube-apiserver is unavailable for 60s starting just before t=26 (the first renew), // then we will retry on 26s intervals until t=104 (kube-apiserver came back up at 86), and there will // be 33 seconds of extra time before the lease is lost. ret.RenewDeadline.Duration = 107 * time.Second } if ret.RetryPeriod.Duration == 0 { ret.RetryPeriod.Duration = 26 * time.Second } Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-24 16:21:53 -04:00
Monis Khan	c0617ceda4	leader election: in-memory leader status is stopped before release This change fixes a small race condition that occurred when the current leader failed to renew its lease. Before this change, the leader would first release the lease via the Kube API and then would update its in-memory status to reflect that change. Now those events occur in the reverse (i.e. correct) order. Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-24 15:02:56 -04:00
Mo Khan	3077034b2d	Merge branch 'main' into oidc_password_grant	2021-08-24 12:23:52 -04:00
Monis Khan	132ec0d2ad	leader election test: fix flake related to invalid assumption Even though a client may hold the leader election lock in the Kube lease API, that does not mean it has had a chance to update its internal state to reflect that. Thus we retry the checks in checkOnlyLeaderCanWrite a few times to allow the client to catch up. Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-20 17:04:26 -04:00
Monis Khan	c356710f1f	Add leader election middleware Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-20 12:18:25 -04:00
Ryan Richard	62c6d53a21	Merge branch 'main' into oidc_password_grant	2021-08-17 15:23:29 -07:00
Monis Khan	cf25c308cd	test/integration: ignore restarts associated with test pods Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-17 12:57:41 -04:00
Ryan Richard	3fb683f64e	Update expected error message in e2e integration test	2021-08-16 15:40:34 -07:00
Ryan Richard	52409f86e8	Merge branch 'main' into oidc_password_grant	2021-08-16 15:17:55 -07:00
Monis Khan	7a812ac5ed	impersonatorconfig: only unload dynamiccert when proxy is disabled In the upstream dynamiccertificates package, we rely on two pieces of code: 1. DynamicServingCertificateController.newTLSContent which calls - clientCA.CurrentCABundleContent - servingCert.CurrentCertKeyContent 2. unionCAContent.VerifyOptions which calls - unionCAContent.CurrentCABundleContent This results in calls to our tlsServingCertDynamicCertProvider and impersonationSigningCertProvider. If we Unset these providers, we subtly break these consumers. At best this results in test slowness and flakes while we wait for reconcile loops to converge. At worst, it results in actual errors during runtime. For example, we previously would Unset the impersonationSigningCertProvider on any sync loop error (even a transient one caused by a network blip or a conflict between writes from different replicas of the concierge). This would cause us to transiently fail to issue new certificates from the token credential request API. It would also cause us to transiently fail to authenticate previously issued client certs (which results in occasional Unauthorized errors in CI). Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-16 16:07:46 -04:00
Ryan Richard	5b96d014b4	Merge branch 'main' into oidc_password_grant	2021-08-12 11:12:57 -07:00
Ryan Richard	84c3c3aa9c	Optionally allow OIDC password grant for CLI-based login experience - Add `AllowPasswordGrant` boolean field to OIDCIdentityProvider's spec - The oidc upstream watcher controller copies the value of `AllowPasswordGrant` into the configuration of the cached provider - Add password grant to the UpstreamOIDCIdentityProviderI interface which is implemented by the cached provider instance for use in the authorization endpoint - Enhance the IDP discovery endpoint to return the supported "flows" for each IDP ("cli_password" and/or "browser_authcode") - Enhance `pinniped get kubeconfig` to help the user choose the desired flow for the selected IDP, and to write the flow into the resulting kubeconfig - Enhance `pinniped login oidc` to have a flow flag to tell it which client-side flow it should use for auth (CLI-based or browser-based) - In the Dex config, allow the resource owner password grant, which Dex implements to also return ID tokens, for use in integration tests - Enhance the authorize endpoint to perform password grant when requested by the incoming headers. This commit does not include unit tests for the enhancements to the authorize endpoint, which will come in the next commit - Extract some shared helpers from the callback endpoint to share the code with the authorize endpoint - Add new integration tests	2021-08-12 10:45:39 -07:00
Monis Khan	34fd0ea2e2	impersonation proxy: assert nested UID impersonation is disallowed Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-10 00:03:33 -04:00
Monis Khan	724acdca1d	Update tests for new CSR duration code Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-09 19:16:50 -04:00
Matt Moyer	58bbffded4	Switch to a slimmer distroless base image. At a high level, it switches us to a distroless base container image, but that also includes several related bits: - Add a writable /tmp but make the rest of our filesystems read-only at runtime. - Condense our main server binaries into a single pinniped-server binary. This saves a bunch of space in the image due to duplicated library code. The correct behavior is dispatched based on `os.Args[0]`, and the `pinniped-server` binary is symlinked to `pinniped-concierge` and `pinniped-supervisor`. - Strip debug symbols from our binaries. These aren't really useful in a distroless image anyway and all the normal stuff you'd expect to work, such as stack traces, still does. - Add a separate `pinniped-concierge-kube-cert-agent` binary with "sleep" and "print" functionality instead of using builtin /bin/sleep and /bin/cat for the kube-cert-agent. This is split from the main server binary because the loading/init time of the main server binary was too large for the tiny resource footprint we established in our kube-cert-agent PodSpec. Using a separate binary eliminates this issue and the extra binary adds only around 1.5MiB of image size. - Switch the kube-cert-agent code to use a JSON `{"tls.crt": "<b64 cert>", "tls.key": "<b64 key>"}` format. This is more robust to unexpected input formatting than the old code, which simply concatenated the files with some extra newlines and split on whitespace. - Update integration tests that made now-invalid assumptions about the `pinniped-server` image. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-08-09 15:05:13 -04:00
Monis Khan	ac7d65c4a8	concierge_impersonation_proxy_test: run slowly for EKS Signed-off-by: Monis Khan <mok@vmware.com>	2021-08-01 18:19:53 -04:00
Matt Moyer	1e32530d7b	Fix broken TTY after manual auth code prompt. This may be a temporary fix. It switches the manual auth code prompt to use `promptForValue()` instead of `promptForSecret()`. The `promptForSecret()` function no longer supports cancellation (the v0.9.2 behavior) and the method of cancelling in `promptForValue()` is now based on running the blocking read in a background goroutine, which is allowed to block forever or leak (which is not important for our CLI use case). This means that the authorization code is now visible in the user's terminal, but this is really not a big deal because of PKCE and the limited lifetime of an auth code. The main goroutine now correctly waits for the "manual prompt" goroutine to clean up, which now includes printing the extra newline that would normally have been entered by the user in the manual flow. The text of the manual login prompt is updated to be more concise and less scary (don't use the word "fail"). Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-30 12:45:44 -05:00
Monis Khan	22be97eeda	concierge_impersonation_proxy_test: check all forms of DNS Signed-off-by: Monis Khan <mok@vmware.com>	2021-07-29 13:35:37 -04:00
Ryan Richard	d73093a694	Avoid failures due to impersonation Service having unrelated annotations	2021-07-28 14:19:14 -07:00
Matt Moyer	b42b1c1110	Relax the timeout for TestLegacyPodCleaner a bit. This test is asynchronously waiting for the controller to do something, and in some of our test environments it will take a bit longer than we'd previously allowed. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-28 13:08:57 -05:00
Matt Moyer	48c8fabb5c	Fix backwards condition in E2E test assertion. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-28 12:40:07 -05:00
Ryan Richard	71cae75758	Merge branch 'main' into merge_impersonator_service_annotations	2021-07-27 11:57:16 -07:00
Ryan Richard	58ab57201f	Suppress lint errors	2021-07-26 17:20:49 -07:00
Ryan Richard	9e27c28b39	Fix TestImpersonationProxy integration test changes from previous commit Forgot to account for our new bookkeeping annotation on the impersonator's Service.	2021-07-23 14:23:24 -07:00
Ryan Richard	ac4bc02817	Enhance integration test for CredentialIssuer spec annotations	2021-07-23 09:46:40 -07:00
Ryan Richard	e30cf6e51a	Merge branch 'main' into cli_username_password_env_vars	2021-07-22 09:29:03 -07:00
Matt Moyer	ae72d30cec	Switch to GHCR tools images for local tests, with `imagePullPolicy: IfNotPresent`. This is more consistent with our CI environment. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-21 09:21:05 -05:00
Ryan Richard	deb699a84a	e2e test: PINNIPED_USERNAME/PINNIPED_PASSWORD env vars during LDAP login	2021-07-19 17:08:52 -07:00
Ryan Richard	b3208f0ca6	wait for lb dns to resolve in the impersonation proxy integration test this will hopefully fix some flakes where aws provisioned a host for the load balancer but the tests weren't able to resolve it. Signed-off-by: Margo Crawford <margaretc@vmware.com>	2021-07-15 16:39:15 -07:00
Ryan Richard	48b58e2fad	Clear the browser cookies between each TestE2EFullIntegration test It seems like page.ClearCookies() only clears cookies for the current domain, so there doesn't seem to be a function to clear all browser cookies. Instead, we'll just start a whole new browser each test. They start fast enough that it shouldn't be a problem.	2021-07-13 16:20:02 -07:00
Ryan Richard	33461ddc14	Merge branch 'main' into deflake-serving-certificate-rotation-test	2021-07-13 14:04:34 -07:00
Matt Moyer	5527566a36	Fix TestCLILoginOIDC when running directly against Okta. Our actual CLI code behaved correctly, but this test made some invalid assumptions about the "upstream" IDP we're testing. It assumed that the upstream didn't support `response_mode=form_post`, but Okta does. This means that when we end up on the localhost callback page, there are no URL query parameters. Adjusting this regex makes the test pass as expected. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 16:29:42 -05:00
Matt Moyer	43f66032a9	Extend TestE2EFullIntegration to test manual OIDC flow. Using the same fake TTY trick we used to test LDAP login, this new subtest runs through the "manual"/"jump box" login flow. It runs the login with a `--skip-listen` flag set, causing the CLI to skip opening the localhost listener. We can then wait for the login URL to be printed, visit it with the browser and log in, and finally simulate "manually" copying the auth code from the browser and entering it into the waiting CLI prompt. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:45 -05:00
Matt Moyer	d0b37a7c90	Adjust TestFormPostHTML to work on Linux chromedriver. For some reason our headless Chrome test setup behaves slightly differently on Linux and macOS hosts. On Linux, the emoji characters are not recognized as valid text, so they are URL encoded. This change updates the test to cope with both cases correctly. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:44 -05:00
Matt Moyer	5029495fdb	Add manual paste flow to `pinniped login oidc` command. This adds a new login flow that allows manually pasting the authorization code instead of receiving a browser-based callback. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:44 -05:00
Matt Moyer	9fba8d2203	Adjust TestE2EFullIntegration for new form_post flow. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:44 -05:00
Matt Moyer	428f389c7d	Add missing t.Helper() on RequireEventuallyf(). This gives us nicer test assertion failure messages. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:44 -05:00
Matt Moyer	71d4e05fb6	Add custom response_mode=form_post HTML template. This is a new package internal/oidc/provider/formposthtml containing a number of static files embedded using the relatively recent Go "//go:embed" functionality introduced in Go 1.16 (https://blog.golang.org/go1.16). The JavaScript and CSS files are minified and injected to make a single self-contained HTML response. There is a special Content-Security-Policy helper to calculate hash-based script-src and style-src rules. This new code is covered by a new integration test that exercises the JS/HTML functionality in a real browser outside of the rest of the Supervisor. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:43 -05:00
Matt Moyer	1904f8ddc3	In browsertest.Open(), capture console INFO logs. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:43 -05:00
Matt Moyer	6b801056b5	Add testlib.RandBytes() helper. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:43 -05:00
Matt Moyer	2823d4d1e3	Add "response_modes_supported" to Supervisor discovery response. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 12:08:43 -05:00
Matt Moyer	3a840cee76	Make TestAPIServingCertificateAutoCreationAndRotation less flaky. This test would occasionally flake for me when running locally. This change moves more of the assertions into the "eventually" loop, so they can temporarily fail as long as they converge on the expected values. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 11:29:02 -05:00
Matt Moyer	04e9897d51	Make TestImpersonationProxy less flaky. This test did not tolerate this connection failing, which can happen for any number of flaky networking-related reasons. This change moves the connection setup into an "eventually" retry loop so it's allowed to fail temporarily as long as it eventually connects. Signed-off-by: Matt Moyer <moyerm@vmware.com>	2021-07-09 11:28:33 -05:00
Ryan Richard	74f3ce5dcd	Merge branch 'main' into ldap-client-int-tests-only-on-kind	2021-07-08 12:54:56 -07:00
Ryan Richard	2f7dbed321	Try increasing the "eventually" timeouts in one integration test There were 10 second timeouts in `TestAPIServingCertificateAutoCreationAndRotation` which fail often on CI. Maybe increasing the timeouts will help?	2021-07-08 11:17:22 -07:00
Ryan Richard	709c10227f	Run the LDAP client's integration tests only on Kind TestSimultaneousLDAPRequestsOnSingleProvider proved to be unreliable on AKS due to some kind of kubectl port-forward issue, so only run the LDAP client's integration tests on Kind. They are testing the integration between the client code and the OpenLDAP test server, not testing anything about Kubernetes, so running only on Kind should give us sufficient test coverage.	2021-07-08 11:10:53 -07:00
Monis Khan	d78b845575	Fix bad test package name Signed-off-by: Monis Khan <mok@vmware.com>	2021-06-22 11:23:19 -04:00

619 Commits
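
Commit 58bbffded4 describes the kube-cert-agent output as a JSON object of the form `{"tls.crt": "<b64 cert>", "tls.key": "<b64 key>"}`. As a rough illustration of why that is more robust than splitting concatenated files on whitespace, here is a minimal Go sketch of encoding and decoding such a payload. The `certKeyPair` type and the `encodePair` and `decodePair` helpers are hypothetical names for this example and are not Pinniped's actual implementation; the sketch assumes standard base64, which is what `encoding/json` uses for `[]byte` fields.

```go
package main

import (
	"encoding/json"
	"errors"
	"fmt"
)

// certKeyPair mirrors the JSON shape quoted in the commit message.
// encoding/json base64-encodes []byte fields when marshaling and
// base64-decodes them when unmarshaling.
type certKeyPair struct {
	Cert []byte `json:"tls.crt"`
	Key  []byte `json:"tls.key"`
}

// encodePair renders the pair as a single JSON object (hypothetical helper).
func encodePair(cert, key []byte) (string, error) {
	out, err := json.Marshal(certKeyPair{Cert: cert, Key: key})
	if err != nil {
		return "", err
	}
	return string(out), nil
}

// decodePair parses the JSON object and rejects malformed or incomplete
// input instead of guessing at where one file ends and the next begins.
func decodePair(s string) (certKeyPair, error) {
	var p certKeyPair
	if err := json.Unmarshal([]byte(s), &p); err != nil {
		return certKeyPair{}, fmt.Errorf("invalid agent output: %w", err)
	}
	if len(p.Cert) == 0 || len(p.Key) == 0 {
		return certKeyPair{}, errors.New("invalid agent output: missing tls.crt or tls.key")
	}
	return p, nil
}

func main() {
	// Placeholder bytes stand in for real PEM data.
	s, err := encodePair([]byte("cert"), []byte("key"))
	if err != nil {
		panic(err)
	}
	fmt.Println(s) // {"tls.crt":"Y2VydA==","tls.key":"a2V5"}

	p, err := decodePair(s)
	if err != nil {
		panic(err)
	}
	fmt.Printf("%s %s\n", p.Cert, p.Key) // cert key
}
```

Because the payload is a single JSON value, stray newlines or whitespace inside the certificate or key cannot shift the boundary between the two fields, and a truncated payload fails to parse instead of yielding a partial result.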