This message was deleted Rancher Users #rke2

Join Slack

This message was deleted.

# rke2

adamant-kite-43734

02/09/2023, 12:43 PM

This message was deleted.

eager-london-83975

02/09/2023, 12:54 PM

are the logs coming from the node agent you are trying to add or the master ?

stocky-article-82001

02/09/2023, 12:54 PM

The node I’m trying to add.

stocky-article-82001

02/09/2023, 12:55 PM

Copy code

{"level":"warn","ts":"2023-02-09T12:33:59.607Z","logger":"etcd-client","caller":"v3@v3.5.4-k3s1/retry_interceptor.go:62","msg":"retrying of unary invoker failed","target":"<etcd-endpoints://0xc001504000/127.0.0.1:2379>","attempt":0,"error":"rpc error: code = DeadlineExceeded desc = latest balancer error: last connection error: connection error: desc = \"transport: Error while dialing dial tcp 172.30.12.3:2379: connect: no route to host\""}
Feb 09 12:33:59 <MASTER HOSTNAME> rke2[13805]: time="2023-02-09T12:33:59Z" level=warning msg="Learner <HOSTNAME OF NEW NODE>-9ad37932 stalled at RaftAppliedIndex=0 for 5m0.607218804s"
Feb 09 12:33:59 <MASTER HOSTNAME> rke2[13805]: time="2023-02-09T12:33:59Z" level=warning msg="Removed learner <HOSTNAME OF NEW NODE>-9ad37932 from etcd cluster"

stocky-article-82001

02/09/2023, 12:55 PM

These are some logs from the master which is weird.

eager-london-83975

02/09/2023, 12:55 PM

Your network is not routed correctly

eager-london-83975

02/09/2023, 12:56 PM

Your masternodes cannot communicate with each other

stocky-article-82001

02/09/2023, 12:57 PM

That’s bizarre, this new one has the exact same config as the other 2 masters.

eager-london-83975

02/09/2023, 12:57 PM

Copy code

transport: Error while dialing dial tcp 172.30.12.3:2379: connect: no route to host

this happens when the ip is not routed properly

eager-london-83975

02/09/2023, 12:57 PM

It's easy to miss, but check your security groups/firewall

eager-london-83975

02/09/2023, 12:57 PM

Or that your subnet is routed properly, or even if they are in the same network/vpc

stocky-article-82001

02/09/2023, 12:58 PM

Yeah ok, I’ll have a dig around. Thanks!

eager-london-83975

02/09/2023, 1:00 PM

Please don't forget to notify if you manage to solve it, might help others!

stocky-article-82001

02/09/2023, 3:16 PM

The nodes can communicate (I’ve confirmed) but it is still not working.

stocky-article-82001

02/09/2023, 3:17 PM

I can curl the :6443 on the master from the new node fine

eager-london-83975

02/09/2023, 3:17 PM

but is 2379 open, namely ETCD running

stocky-article-82001

02/09/2023, 3:18 PM

Hmm, seems I’ve spoken too soon. It is working now, however there were some 500 errors at the start

stocky-article-82001

02/09/2023, 3:18 PM

it has joined to the cluster successfully now, let me wait for it to fully reconcile.

stocky-article-82001

02/09/2023, 3:22 PM

Yeah it still seems to be fucking up with networking.

stocky-article-82001

02/09/2023, 3:23 PM

~~~scratch that~~~

eager-london-83975

02/09/2023, 5:31 PM

So it's working good now ?

stocky-article-82001

02/09/2023, 5:36 PM

No, we’ve found some networking issues that we’re currently working through.

eager-london-83975

02/09/2023, 5:36 PM

Is it on-premises or what cloud are you using to manage your network and nodes ?

1867 Views

Open in Slack

Previous Next