# general
p
Check also the rke2-server.service logs on the node.
Nothing in what you've shared sounds like a breakage; that reflector error isn't very telling on its own
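(A typical way to tail those logs on a systemd host — a generic sketch, not specific to this setup:)
Copy code
# follow the rke2-server unit logs, starting with the last 200 lines
sudo journalctl -u rke2-server.service -n 200 -f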
f
These are the logs from the rke2-server.service
p
I think it's just still booting
wait no
can you restart rancher-system-agent by any chance?
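(A minimal restart plus log check, assuming a standard systemd install of the agent:)
Copy code
sudo systemctl restart rancher-system-agent.service
sudo journalctl -u rancher-system-agent.service -f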
f
I have tried to leave this for more than 24 hours and it stays in this state.
I've tried to reboot the system 4 times; there was a thread somewhere that suggested it could resolve the issue.
Tried to restart that service now again
p
There may be something weird going on with fleet, although the error should be different... go to your local cluster, enable "show all namespaces" at the top, go to More Resources > Fleet > Clusters, then force update your non-working cluster (URL: (rancherurl)/dashboard/c/local/explorer/fleet.cattle.io.cluster)
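(The rough CLI equivalent for inspecting Fleet cluster state, assuming kubectl access to the upstream cluster:)
Copy code
# list Fleet cluster objects and their readiness across all namespaces
kubectl get clusters.fleet.cattle.io -A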
f
I've set restrictedAdmin on the helm deploy so that rancher does not manage my local k8s cluster, as I don't want rancher to manage my EKS cluster.
p
you can also check the logs of the fleet-controller-manager => fleet-agentmanagement ( /dashboard/c/local/explorer/apps.deployment/cattle-fleet-system/fleet-controller )
Fleet will still be used to manage your remote cluster
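(Roughly the same logs via kubectl, assuming the default cattle-fleet-system namespace; the container layout depends on the Fleet version:)
Copy code
kubectl -n cattle-fleet-system logs deploy/fleet-controller -c fleet-agentmanagement --tail=100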
f
okay. let me see
p
If fleet fails to communicate, you get a cluster that works from the Kubernetes POV but a rancher UI showing non-working things
To be more precise, I don't see a fleet agent pod in the list you shared earlier, so that could be a lead
f
I don't see a fleet-controller-manager, only a fleet-controller
p
yes, sorry
click on it (not view logs directly) and check logs for the 'fleet-agentmanagement' one
f
Thank you. I see some connection errors here. It's trying to connect to the public endpoint, and I have security-group rules there that only allow specific sources. Let me fix that.
Copy code
fleet-agentmanagement time="2024-10-22T14:03:49Z" level=error msg="error syncing 'fleet-local/local': handler import-cluster: Get \"https://rancher.a.b.c/k8s/clusters/local/version?timeout=15s\": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers), requeuing"
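(A quick reachability check from the node side; rancher.a.b.c stands in for the real Rancher URL, as in the log above:)
Copy code
# Rancher answers /ping with "pong" when it is reachable
curl -vk --max-time 15 https://rancher.a.b.c/ping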
p
oooh that sounds bad
It looks like fleet has failed to initialize upstream, but it might be because you're using the restrictedAdmin mode
do you have a cattle-fleet-local-system namespace in the upstream cluster?
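(Quick check, assuming kubectl access to the upstream cluster:)
Copy code
kubectl get namespace cattle-fleet-local-system
kubectl -n cattle-fleet-local-system get pods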
m
@fancy-art-11312 Because you only have one node, it might be running only some, not all, of the control plane, etcd, and worker roles. Once the cluster has at least one control plane, one etcd, and one worker node, it will be ready. Nothing is wrong... please add your worker, etcd, or control plane node
p
I don't think that's the issue
f
I did select all the roles
Copy code
.... --etcd --controlplane --worker
m
If you selected all roles, that would not be the case.
p
but your kubectl get nodes output doesn't show edge-node-1 as a worker
Copy code
kubectl get nodes
NAME         STATUS   ROLES                              AGE    VERSION
gra-node-1   Ready    control-plane,etcd,master,worker   172d   v1.30.4+rke2r1
gra-node-2   Ready    control-plane,etcd,master,worker   98d    v1.30.4+rke2r1
rbx-node-1   Ready    control-plane,etcd,master,worker   176d   v1.30.4+rke2r1
sbg-node-1   Ready    control-plane,etcd,master,worker   168d   v1.30.4+rke2r1
sbg-node-2   Ready    control-plane,etcd,master,worker   99d    v1.30.4+rke2r1
mine for example
It might be because fleet doe
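(The ROLES column is derived from node-role.kubernetes.io/* labels; a quick way to inspect them on the new node — edge-node-1 is the node name from this thread:)
Copy code
kubectl get node edge-node-1 --show-labels | tr ',' '\n' | grep node-role.kubernetes.io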
m
@fancy-art-11312 please add one worker node; your cluster might then be ready.
f
Should I maybe try to recreate the cluster? I fixed the fleet-controller connection issue
p
If you fixed it, your cluster might appear online all by itself
m
Your cluster does not have a worker node.
f
the roles on the node still do not include worker
m
when you add a worker, the cluster will be ready
f
or I can try to just add a worker
let me do that
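(That would be the same registration command with only the worker role; placeholders as in the original, the real URL and token come from the cluster's registration page in the Rancher UI:)
Copy code
curl -fL https://a.b.c/system-agent-install.sh | sudo sh -s - --server https://rancher.a.b.c --token abc --worker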
m
please
f
I got some new errors; it might be because I set
restrictedAdmin=true
in the helm chart
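(For reference, that chart value can be toggled on an existing install — a sketch assuming the standard release name, chart repo, and namespace:)
Copy code
helm upgrade rancher rancher-latest/rancher \
  --namespace cattle-system \
  --reuse-values \
  --set restrictedAdmin=true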
p
where is that error coming from..?
f
I see it on the UI
p
is that cluster upstream or the downstream one you tried to create?
f
Not really sure what you mean. I created a new cluster in the UI, and then ran the registration command on the nodes that are on the edge.
p
mtn-cluster
Is that upstream (the rancher cluster) or downstream (the one you're trying to make) ?
f
The one I am trying to make
p
oh. Weird indeed then
f
Im going to try to create a new cluster. Maybe there's something not configured correctly due to the connection issue it had.
Yay
Thank you sooo much @powerful-librarian-10572. Really appreciate your help troubleshooting my issue!
p
I have no idea why you had to recreate the whole cluster, but you're welcome
f
🙇
m
@fancy-art-11312 By the way, did you do any extra configuration on your fresh Ubuntu 24 VM, or just register the node?
f
Only updates, and then registered the node
m
Cooooool👍. Thanks @fancy-art-11312
f
Copy code
#cloud-config
package_update: true
package_upgrade: true
package_reboot_if_required: true
packages:
  - vim

# Manage /etc/hosts with cloud-init.
# On every boot, /etc/hosts will be re-written from
# ``/etc/cloud/templates/hosts.tmpl``.
manage_etc_hosts: true

# Setting hostname
preserve_hostname: false
hostname: edge-node-2

users:
  - name: maarten
    sudo: ALL=(ALL) NOPASSWD:ALL
    shell: /bin/bash
    ssh_authorized_keys:
      - ecdsa-sha2-nistp521 abc

runcmd:
  - curl -fL https://a.b.c/system-agent-install.sh | sudo sh -s - --server https://rancher.a.b.c --label 'cattle.io/os=linux' --token abc --etcd --controlplane --worker
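(On recent cloud-init releases the user-data can be validated before use; user-data.yaml is a hypothetical local copy of the config above:)
Copy code
cloud-init schema --config-file user-data.yaml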