elegant-florist-94385
12/06/2024, 10:37 PMcruizba/ubuntu-dind:noble-27.1.1
docker image.
However, my other CI job for the integration tests has some different constraints since it needs to run a very large docker compose stack. I run this "raw" on the github runner (an ARC runner provided by my organization). Running directly on the runner as opposed to inside a container avoids dealing with complicated docker-in-docker issues. Anyways, I get a number of 403 errors when trying to access the cache in this job. Having a hard time debugging this..
Details in thread.elegant-florist-94385
12/06/2024, 10:38 PM21:51:06.14 [INFO] Counters:
backtrack_attempts: 0
docker_execution_errors: 0
docker_execution_requests: 0
docker_execution_successes: 0
local_cache_read_errors: 0
local_cache_requests: 112
local_cache_requests_cached: 0
local_cache_requests_uncached: 112
local_cache_total_time_saved_ms: 0
local_cache_write_errors: 0
local_execution_requests: 112
local_process_total_time_run_ms: 16083220
remote_cache_read_errors: 39
remote_cache_request_timeouts: 0
remote_cache_requests: 66
remote_cache_requests_cached: 0
remote_cache_requests_uncached: 23
remote_cache_speculation_local_completed_first: 4
remote_cache_speculation_remote_completed_first: 0
remote_cache_total_time_saved_ms: 0
remote_cache_write_attempts: 66
remote_cache_write_errors: 23
remote_cache_write_successes: 43
remote_execution_errors: 0
remote_execution_requests: 0
remote_execution_rpc_errors: 0
remote_execution_rpc_execute: 0
remote_execution_rpc_retries: 0
remote_execution_rpc_wait_execution: 0
remote_execution_success: 0
remote_execution_timeouts: 0
remote_process_total_time_run_ms: 0
remote_store_exists_attempts: 770
remote_store_exists_errors: 88
remote_store_exists_successes: 317
remote_store_missing_digest: 0
remote_store_read_attempts: 0
remote_store_read_cached: 0
remote_store_read_errors: 0
remote_store_read_uncached: 0
remote_store_request_timeouts: 0
remote_store_write_attempts: 212
remote_store_write_errors: 0
remote_store_write_successes: 212
elegant-florist-94385
12/06/2024, 10:39 PM22:28:52.35 [WARN] Failed to write to remote cache (1 occurrences so far): failed to query pants_ci_cache/byte-store/21/c7/21c741a7e0a9441673601c3b947bea357d5754824234584970308bf78375e3d3: PermissionDenied (persistent) at stat, context: { uri: <https://ghe-actions-prod-qhqyjglk.s3.amazonaws.com/actions-69c8c8939b70/9bb02394953d4d45a28a6ccad6554933/5b57473ef36b141086d40095c0eef846?AWSAccessKeyId=AKIAQ3EGVTWOBBE5CS57&Expires=1733527732&Signature=24h6W5N7uX%2FJLKZ%2BQTHr7CtJhbY%3D>, response: Parts { status: 403, version: HTTP/1.1, headers: {"x-amz-request-id": "50E4VFZEM4E4GT66", "x-amz-id-2": "7MwRomL34rFYOzzquN0+SG2N0e1kqCbR32VDqdC5Z6aJK5EOpnmF9J3g9iRX/3TrRbElYhRhdTY=", "content-type": "application/xml", "transfer-encoding": "chunked", "date": "Fri, 06 Dec 2024 22:28:51 GMT", "server": "AmazonS3"} }, service: ghac, path: pants_ci_cache/byte-store/21/c7/21c741a7e0a9441673601c3b947bea357d5754824234584970308bf78375e3d3 }
elegant-florist-94385
12/06/2024, 10:40 PMelegant-florist-94385
12/06/2024, 10:41 PMfierce-truck-19259
12/07/2024, 1:49 AMelegant-florist-94385
12/07/2024, 11:28 AMfierce-truck-19259
12/07/2024, 11:30 AMelegant-florist-94385
12/07/2024, 12:13 PMelegant-florist-94385
12/07/2024, 12:44 PMfierce-truck-19259
12/07/2024, 1:00 PM-ldebug
to get some remote cache logselegant-florist-94385
12/07/2024, 2:30 PMfierce-truck-19259
12/07/2024, 2:36 PMRemote cache lookup for:
and
[DEBUG] Using [remote_cache::CommandRunner
or along those lines, they should log request errors towards the store IME as wellelegant-florist-94385
12/07/2024, 2:50 PM13:04:47.08 [DEBUG] Using [cache::CommandRunner { inner: remote_cache::CommandRunner { inner: bounded::CommandRunner { inner: SwitchedCommandRunner { .. }, .. }, .. }, .. }, cache::CommandRunner { inner: remote_cache::CommandRunner { inner: bounded::CommandRunner { inner: SwitchedCommandRunner { .. }, .. }, .. }, .. }] for process execution.
just once near the start while initializing scheduler/pantsd/etc.
I do also see the Remote cache lookup for: <task>
lines, but the only lines relating to the http error are the warning level logs:
2024-12-07T12:50:42.4182878Z 12:50:42.41 [33m[WARN][0m Failed to read from remote cache (1 occurrences so far): failed to read pants_ci_cache/action-cache/ca/4b/ca4be85b2bd62dd89d13365599166268394a755205426cd5ae6a5a4a16a0a9a7: Unexpected (persistent) at read, context: { uri: <https://ghe-actions-prod-qhqyjglk.s3.amazonaws.com/actions-69c8c8939b70/9bb02394953d4d45a28a6ccad6554933/2287473ef36b141086d40095c0eef846?AWSAccessKeyId=AKIAQ3EGVTWOBBE5CS57&Expires=1733579442&Signature=kJL4aUwFQOjU1v2b9zDTS3IlV8I%3D>, response: Parts { status: 400, version: HTTP/1.1, headers: {"x-amz-request-id": "XFDEE0WMTXQ7AK17", "x-amz-id-2": "rhAXrDD1An2kfE87JPAHRYIPbPtBvTLQi/yQCDTVEKdvw51WNUHS3PllWt/MeBDNgJXuMYKBe/M=", "x-amz-region": "us-east-1", "content-type": "application/xml", "transfer-encoding": "chunked", "date": "Sat, 07 Dec 2024 12:50:41 GMT", "connection": "close", "server": "AmazonS3"} }, service: ghac, path: pants_ci_cache/action-cache/ca/4b/ca4be85b2bd62dd89d13365599166268394a755205426cd5ae6a5a4a16a0a9a7, range: 0- } => <?xml version="1.0" encoding="UTF-8"?>
elegant-florist-94385
12/07/2024, 2:57 PMfierce-truck-19259
12/07/2024, 3:05 PMfierce-truck-19259
12/07/2024, 3:07 PMelegant-florist-94385
12/07/2024, 3:09 PMPANTS_REMOTE_STORE_ADDRESS: <https://github.software.gevernova.com/_services/artifactcache/ocrHE8ZScyxq8pTiT1IyKRHNwdDtYL7HOvJRB2oXuk7JCGeFXS/>
Its going through the experimental-github-actions-cache, but beyond that, I don't know. (Org is migrating us from gitlab to an enterprise github instance, so I'm rewriting all CI for github.)elegant-florist-94385
12/07/2024, 3:09 PMfierce-truck-19259
12/07/2024, 3:11 PMfierce-truck-19259
12/07/2024, 3:13 PMelegant-florist-94385
12/07/2024, 3:14 PMbazel-remote-cache
up in the cloud somewhere then haha.elegant-florist-94385
12/07/2024, 3:14 PMfierce-truck-19259
12/07/2024, 3:15 PMelegant-florist-94385
12/07/2024, 4:31 PM2024-12-07T15:52:18.4765380Z <Error><Code>InvalidArgument</Code><Message>Requests specifying Server Side Encryption with AWS KMS managed keys require AWS Signature Version 4.</Message><ArgumentName>Authorization</ArgumentName><ArgumentValue>null</ArgumentValue><RequestId>ATSBDNV56PRJE6D2</RequestId><HostId>wRkPUK3v7M8Njes5T/+JNBxYU1LD7odDa2fEXG1uGXAuM+Zw/97NPxSma8ELujp6uYY9ulbJP+8=</HostId></Error>
that is showing up as a new line, but always immediately after the Failed to read
warning lines.
This looks much more like an actionable error messages. Seems to imply that the Token being provided by github doesn't have the remote read permissions.
I've been using actions/cache
already and it is working, and these are supposed to be the same URLs. Maybe I can figure it out from there. Need to update AWS sdk version somewhere or somethingelegant-florist-94385
12/07/2024, 5:07 PM