I ve got rancher stable version `2 12 1` installed via Helm Rancher Users #general

I’ve got rancher (stable version `2.12.1`) install...

numerous-agency-66232

09/08/2025, 3:45 PM

I’ve got rancher (stable version

2.12.1

) installed via Helm Chart deployed w/ ArgoCD. This lives in a lower EKS environment. • This is working well and I intend for this to be my management cluster for the time being • We’ll call this environment

source

I’m trying to import another EKS cluster (diff AWS account + region) • We’ll call this environment

target

• I’ve allowed NAT Gateway IPs at the eks level + SG level where relevant (on both

source

and

target

eks clusters • However, I’m still getting the error:

Copy code

failed to communicate with cluster: Get "<https://MY_TARGET_CLUSTER.gr7.us-west-2.eks.amazonaws.com/api/v1/namespaces/cattle-system>": dial tcp TARGET_EKS_PUBLIC_IP:443: i/o timeout

When I check the pod logs of the

target

cluster I see the following

Copy code

INFO: <https://rancher>.<MY_DOMAIN>.com/ping is accessible
...
time="2025-09-04T20:17:18Z" level=info msg="Listening on /tmp/log.sock"                                                                                                           │
│ time="2025-09-04T20:17:18Z" level=info msg="starting cattle-credential-cleanup goroutine in the background"                                                                       │
│ time="2025-09-04T20:17:18Z" level=info msg="Rancher agent version v2.12.1 is starting"                                                                                            │
│ time="2025-09-04T20:17:18Z" level=error msg="unable to read CA file from /etc/kubernetes/ssl/certs/serverca: open /etc/kubernetes/ssl/certs/serverca: no such file or directory"  │
│ time="2025-09-04T20:17:18Z" level=info msg="Connecting to <wss://rancher>.<MY_DOMAIN>.com/v3/connect/register with token starting with TOKEN_STRING"
time="2025-09-06T01:49:06Z" level=info msg="Connecting to proxy" url="<wss://rancher>.<MY_DOMAIN>.com/v3/connect"

I think we can ignore those cert errors as I’ve already set

agentTLSMode: "system-store"

• When i fixed this, it proceeded beyond the cert errors -> to the connecting to proxy msg Further, I’ve added the following to NO_PROXY on

source

cluster

Copy code

,.<http://eks.amazonaws.com|eks.amazonaws.com>,<http://eks.amazonaws.com|eks.amazonaws.com>,<http://TARGET_CLUSTER.gr7.us-west-2.eks.amazonaws.com|TARGET_CLUSTER.gr7.us-west-2.eks.amazonaws.com>

and the following on the

target

cluster

Copy code

rancher.<MY_DOMAIN>.com,.svc,.cluster.local,10.0.0.0/8,172.16.0.0/12,192.168.0.0/16

I am at a loss now for why this isn’t connecting. A timeout implies an SG / network issue I would guess My understanding is these 2 AWS accounts do not need network connectivity i.e via VPC peering. They should only need api access to the cluster URL, but I could be wrong there • the Public IP for the

target

eks cluster it times out on is AWS owned and not anything I have access to tl;dr why is my import of existing EKS cluster failing with a timeout error?

Open in Slack

Previous Next