This message was deleted Rancher Users #k3s

Join Slack

This message was deleted.

# k3s

adamant-kite-43734

06/29/2022, 1:35 PM

This message was deleted.

creamy-pencil-82913

06/29/2022, 3:37 PM

Do you have more than one interface with a default route, or no default route at all?

melodic-hamburger-23329

06/29/2022, 3:40 PM

The issue apparently went away after I passed

--node-ip

--node-external-ip

, and

--flannel-iface

with the IP address assigned by the cloud env. However, now getting following errors:

Copy code

time="2022-06-30T00:36:18+09:00" level=info msg="Cluster-Http-Server 2022/06/30 00:36:18 http: TLS handshake error from 127.0.0.1:50128: remote error: tls: bad certificate"
time="2022-06-30T00:36:18+09:00" level=info msg="Waiting to retrieve agent configuration; server is not ready: unable to find interface: route ip+net: no such network interface"

creamy-pencil-82913

06/29/2022, 3:42 PM

Same question

creamy-pencil-82913

06/29/2022, 3:43 PM

You may need to add a dummy default route. Kubernetes can be run airgapped but it still expects to have a default route for a bunch of the interface selection logic.

melodic-hamburger-23329

06/29/2022, 3:44 PM

ip route

shows one default route:

default nhid 12 via inet6 f0::...... proto bgp src <internal ipv4 address> metric 20

melodic-hamburger-23329

06/29/2022, 4:01 PM

changed --flannel-iface to interface name instead of ip. Seems to work now.

melodic-hamburger-23329

06/29/2022, 4:06 PM

Now getting following errors:

Copy code

E0630 01:03:27.777938  182940 resource_quota_controller.go:413] unable to retrieve the complete list of server APIs: <http://metrics.k8s.io/v1beta1|metrics.k8s.io/v1beta1>: the server is currently unable to handle the request

W0630 01:03:28.050326  182940 garbagecollector.go:707] failed to discover some groups: map[<http://metrics.k8s.io/v1beta1:the|metrics.k8s.io/v1beta1:the> server is currently unable to handle the request]

E0630 01:03:33.398952  182940 pod_workers.go:951] "Error syncing pod, skipping" err="[failed to \"StartContainer\" for \"lb-tcp-80\" with CrashLoopBackOff: \"back-off 20s restarting failed container=lb-tcp-80 pod=svclb-traefik-dc729b50-vrfdm_kube-system(0e80293c-9e5c-4a08-9126-1e4515fed5fd)\", failed to \"StartContainer\" for \"lb-tcp-443\" with CrashLoopBackOff: \"back-off 20s restarting failed container=lb-tcp-443 pod=svclb-traefik-dc729b50-vrfdm_kube-system(0e80293c-9e5c-4a08-9126-1e4515fed5fd)\"]" pod="kube-system/svclb-traefik-dc729b50-vrfdm" podUID=0e80293c-9e5c-4a08-9126-1e4515fed5fd

E0630 01:03:40.610544  182940 available_controller.go:524] <http://v1beta1.metrics.k8s.io|v1beta1.metrics.k8s.io> failed with: failing or missing response from <https://10.42.0.13:4443/apis/metrics.k8s.io/v1beta1>: Get "<https://10.42.0.13:4443/apis/metrics.k8s.io/v1beta1>": proxy error from 127.0.0.1:6443 while dialing 10.42.0.13:4443, code 500: 500 Internal Server Error

creamy-pencil-82913

06/29/2022, 4:20 PM

If you describe the crashing pods, and look at the logs, what does it say?

melodic-hamburger-23329

06/29/2022, 11:44 PM

@creamy-pencil-82913 `kubectl logs --namespace=kube-system coredns-d76bd69b-qnvxd`:

Copy code

[WARNING] No files matching import glob pattern: /etc/coredns/custom/*.server
.:53
[WARNING] No files matching import glob pattern: /etc/coredns/custom/*.server
[INFO] plugin/reload: Running configuration SHA512 = b941b080e5322f6519009bb49349462c7ddb6317425b0f6a83e5451175b720703949e3f3b454a24e77f3ffe57fd5e9c6130e528a5a1dd00d9000e4afd6c1108d
CoreDNS-1.9.1
linux/amd64, go1.17.8, 4b597f8
[ERROR] plugin/errors: 2 3162228087811675511.9106375430785523333. HINFO: read udp 10.42.0.24:36527->8.8.8.8:53: i/o timeout

`kubectl logs --namespace=kube-system traefik-df4ff85d6-4jd6w`:

Copy code

time="2022-06-29T23:40:50Z" level=info msg="Configuration loaded from flags."
time="2022-06-29T23:41:02Z" level=error msg="accept tcp [::]:9100: use of closed network connection" entryPointName=metrics
time="2022-06-29T23:41:02Z" level=error msg="accept tcp [::]:9000: use of closed network connection" entryPointName=traefik
time="2022-06-29T23:41:02Z" level=error msg="accept tcp [::]:8443: use of closed network connection" entryPointName=websecure
time="2022-06-29T23:41:02Z" level=error msg="accept tcp [::]:8000: use of closed network connection" entryPointName=web
time="2022-06-29T23:41:02Z" level=error msg="close tcp [::]:9000: use of closed network connection" entryPointName=traefik
time="2022-06-29T23:41:02Z" level=error msg="close tcp [::]:8443: use of closed network connection" entryPointName=websecure
time="2022-06-29T23:41:02Z" level=error msg="close tcp [::]:9100: use of closed network connection" entryPointName=metrics
time="2022-06-29T23:41:02Z" level=error msg="close tcp [::]:8000: use of closed network connection" entryPointName=web

creamy-pencil-82913

06/29/2022, 11:47 PM

looks like it can’t talk to DNS servers…

creamy-pencil-82913

06/29/2022, 11:47 PM

for coredns at leastm not sure why traefik is crashing

melodic-hamburger-23329

06/29/2022, 11:47 PM

the environment has limitations regarding internet access, so in case it tries to access internet, that might be cause

melodic-hamburger-23329

06/29/2022, 11:51 PM

hmm, https://platform9.com/kb/kubernetes/why-is-coredns-resolving-via-upstream-google-dns-8-8-8-8

melodic-hamburger-23329

06/29/2022, 11:51 PM

How can I override this? I don’t want k3s to connect to google dns, nor any other external DNS for that matter

melodic-hamburger-23329

06/29/2022, 11:58 PM

how do I configure coredns/k3s to use specific DNS server(s)?

melodic-hamburger-23329

06/30/2022, 12:27 AM

hmm, this is getting tricky.. I need to also pull all the images from internal Harbor registry

melodic-hamburger-23329

06/30/2022, 12:31 AM

by specifying resolv.conf, it seems all pods are now up and running

70 Views

Open in Slack

Previous Next