# rke2
a
This message was deleted.
c
It was chosen by Product Management for alignment with RKE.
t
That makes sense; it sort of falls under "historical circumstances", which is what I suspected, but thanks for confirming.
c
We do not support changing CNIs on an existing cluster. It just doesn’t work well and is a supportability nightmare. If you had a non-prod cluster to test it on you might be able to come up with some steps that work for you, but we don’t test it and don’t have any documentation that covers that.
t
Sure, I definitely wouldn't expect explicit "support", and I would be doing this at my own risk. It's more that I wanted to confirm I understood what rke2 is doing under the umbrella of "installing a CNI", and what I would need to undo.
c
yeah, rke2 just installs different charts for each CNI. You might have to go as far as deleting and re-registering the nodes to get all the various bits tracked properly.
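For concreteness, a rough sketch of what "different charts for each CNI" looks like in practice; the chart names below are illustrative assumptions and should be checked against the manifests your rke2 version actually drops into /var/lib/rancher/rke2/server/manifests:

```go
package main

import "fmt"

// Illustrative only: a rough mapping of CNI choice to the bundled Helm charts
// rke2 deploys for it. Exact chart names (and any CRD companion charts) vary
// by rke2 version, so verify against the server manifests directory.
var cniCharts = map[string][]string{
	"canal":  {"rke2-canal"},
	"calico": {"rke2-calico-crd", "rke2-calico"},
	"cilium": {"rke2-cilium"},
}

func main() {
	for cni, charts := range cniCharts {
		fmt.Printf("cni=%s -> charts=%v\n", cni, charts)
	}
}
```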
t
I'll keep that in mind. Thanks again.
Hey, sorry to bother you again, but in doing some more homework on this, it seems like the Calico project's position is that new clusters should use Calico alone (e.g. the note at the top of https://docs.tigera.io/calico/latest/getting-started/kubernetes/flannel/install-for-flannel). Is Canal being the default purely for continuity/alignment, or is it Rancher's position that, unless there is a specific requirement otherwise, new clusters should still use Canal for the foreseeable future? The reason I ask is that Calico themselves have an automated migration tool (https://docs.tigera.io/calico/latest/getting-started/kubernetes/flannel/migration-from-flannel); is Rancher aware of this? While I totally sympathize with not wanting to support arbitrary CNI switches, this specific migration path seems like something more and more clusters will want/need as time goes on, especially if Calico's position ever shifts from "you should use Calico instead" to "do not use Canal". Or would Rancher provide a stop-gap, a la dockershim, in that case?
c
There’s no position on one CNI over another. We support all of them equally. The default is just the default because that’s what folks were used to, and we wanted to make it easy for folks to make the switch from RKE to RKE2.
historically Calico was required for Windows support, but the network policy stuff on Windows is super broken because of some issues that Microsoft is apparently unable to fix, so we are now recommending use of Flannel, with the caveat that network policies are not supported.
Windows basically drops all existing connections every time the network policy changes, and they can’t or won’t fix that.
t
Interesting, that's good to know, thank you. Unfortunately, I believe network policies are required for this cluster, Windows or not, so we'll just have to put up with that; if nothing else it's more justification we can give to this one customer to not use Windows anymore. Unless, of course, it's possible to use flannel for Windows and can{al,ico} for Linux and bridge them, but I've never heard of anything like that being possible. I'm going to try out Calico's migration tool with some devtools I have for creating temporary rke2 clusters. If I get it working consistently and wrapped as a HelmRelease object, do you think Rancher would be interested in accepting it as a contribution?
c
from what I’ve heard the netpol stuff makes windows basically unusable because clients are constantly disconnected whenever pods change. @bland-account-99790 might be able to link to the issue in question.
t
Is it unusable just w/ NetworkPolicies, or unusable in general?
c
it is specifically network policies that do it.
t
Hmm, okay. I think that'd be fine for development, but production is another story, we may have to convince them to re-vector. Thank you, this has been very informative.
c
any time the policies that handle traffic to or from a pod change, Windows reloads the whole HNS ACL policy engine, which causes all the existing connections to be reset. It is super dumb and it is apparently just how Windows works.
the issue has been open, without MS being able to deliver a fix, for long enough that we have added standalone flannel support to RKE2 and are now recommending use of that for Windows instead of Calico
b
MS is supposed to deliver a fix soon. According to the information I have, the problem is addressed in OS builds with build number >=25922.1000, and the plan was to deliver that in the March/April release of Windows Server 2022, but I would not trust that 100%; it might take a bit longer
t
Thanks again for all the info. If you don't mind me bothering you again: we took your advice and wrangled enough hardware for a separate cluster w/ flannel, but we're seeing some very bizarre behavior. Running the agent quickstart on a fresh copy of Server 2019 Standard has the node show up as Ready, but running a test pod results in flannel complaining about "duplicate allocations" (sorry, it's in a place that'd be hard to copy/paste from), followed quickly by the node losing network connectivity. Once we wrestled control back, the logs show that something called "host-local.exe" is upset about the X.Y.Z.2 address of our service CIDR already being allocated to "dummy" (no idea what this is, we didn't do anything with that name), and this results in an empty string being passed as the "source-vip" to kube-proxy. Have you seen anything like this before? This is the edge of my expertise, and neither Google nor GitHub yields any useful results.
b
When rke2 starts on Windows, before handing out any IPs, it reserves an IP for the source-vip of kube-proxy. It does this using host-local, and it reserves the IP to a pod called dummy (which does not exist; it's just there because we need to reserve one IP for kube-proxy).
We are slightly changing this in the next release (end of April), but I am surprised by the error.
host-local saves the IPs it hands out under C:\var\lib\cni\networks\, could you check what you have there?
This was our first release with rke2+flannel on Windows and we are hitting some bugs. That's why we consider it "experimental" for the time being. Thanks for reporting it!
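In case it helps with the "check what you have there" step: a minimal diagnostic sketch (not an official tool) that walks host-local's state directory and dumps each allocation file. The path is the one mentioned above; the assumption that each allocation file is named after the allocated IP and holds the owning reservation is my reading of host-local's disk backend, so verify against your plugin version.

```go
package main

import (
	"fmt"
	"net"
	"os"
	"path/filepath"
	"strings"
)

func main() {
	// Directory mentioned above; host-local keeps one file per allocated IP
	// under a subdirectory named after the CNI network (e.g. vxlan0).
	root := `C:\var\lib\cni\networks`

	err := filepath.Walk(root, func(path string, info os.FileInfo, err error) error {
		if err != nil || info.IsDir() {
			return err
		}
		// Skip lock/bookkeeping files: allocation files are named after the IP.
		if net.ParseIP(info.Name()) == nil {
			return nil
		}
		data, readErr := os.ReadFile(path)
		if readErr != nil {
			return readErr
		}
		fmt.Printf("%s (allocated IP %s):\n", path, info.Name())
		for _, line := range strings.Split(strings.TrimSpace(string(data)), "\n") {
			// Contents identify the reservation that owns this IP,
			// e.g. the "dummy" placeholder described above.
			fmt.Printf("  %s\n", strings.TrimSpace(line))
		}
		return nil
	})
	if err != nil {
		fmt.Fprintln(os.Stderr, "walk failed:", err)
	}
}
```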
c
@bland-account-99790 maybe the dummy pod reservation could be named something that better ties it to kube-proxy somehow? just for better troubleshooting.
b
yes, that's one of the changes coming in the April release. Already merged: https://github.com/rancher/rke2/blob/master/pkg/windows/flannel.go#L349-L355
I reproduced your issue. I am working on a fix. The problem I see is that if I run rke2 on Windows, it reserves an IP for kube-proxy. If I stop rke2 and restart it (without deleting anything), host-local tries reserving another IP instead of reusing the one already reserved. I thought host-local was intelligent enough to pick the already reserved IP address.
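Not rke2's actual fix (that is in the repo linked above), just a sketch of the idea being described: before asking host-local for a fresh IP, look for an existing allocation file that already belongs to the kube-proxy reservation and reuse that address. The directory, the "dummy" reservation ID, and the one-file-per-IP layout are taken from this thread and are assumptions otherwise.

```go
package main

import (
	"fmt"
	"net"
	"os"
	"path/filepath"
	"strings"
)

// findExistingReservation scans a host-local state directory (one file per
// allocated IP, containing the reservation ID) and returns the IP already
// reserved under the given ID, if any. This mirrors the "reuse instead of
// re-reserve" idea discussed above; it is a sketch, not rke2's implementation.
func findExistingReservation(dir, id string) (net.IP, bool, error) {
	entries, err := os.ReadDir(dir)
	if err != nil {
		return nil, false, err
	}
	for _, e := range entries {
		ip := net.ParseIP(e.Name())
		if e.IsDir() || ip == nil {
			continue // skip lock/bookkeeping files that are not IP allocations
		}
		data, err := os.ReadFile(filepath.Join(dir, e.Name()))
		if err != nil {
			return nil, false, err
		}
		if strings.HasPrefix(strings.TrimSpace(string(data)), id) {
			return ip, true, nil
		}
	}
	return nil, false, nil
}

func main() {
	ip, found, err := findExistingReservation(`C:\var\lib\cni\networks\vxlan0`, "dummy")
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		return
	}
	if found {
		fmt.Println("reusing existing source-vip reservation:", ip)
		return
	}
	fmt.Println("no existing reservation; would ask host-local for a new IP here")
}
```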
t
Wow, you guys are fast; how nice to wake up to problems being solved. Do you still need the contents of our cni directory, or do you have it from here? You mentioned restarting without deleting things; is there a workaround we can try, or things we need to do between restarts, or do we just have to sit tight and wait for your fix?
b
The workaround for your current problem is easy: go to /var/lib/cni/networks/vxlan0 and remove the file there. Before removing it, check its content and verify it has the dummy string, then restart the RKE2 service.
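If it is easier to script than to do by hand, a minimal sketch of that workaround, assuming the /var/lib/cni/networks/vxlan0 path and the "dummy" marker exactly as described above (stopping the RKE2 service first and restarting it afterwards still needs to be done separately):

```go
package main

import (
	"fmt"
	"net"
	"os"
	"path/filepath"
	"strings"
)

func main() {
	// Path from the workaround above, rooted at C:\ on this Windows node.
	dir := `C:\var\lib\cni\networks\vxlan0`

	entries, err := os.ReadDir(dir)
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	for _, e := range entries {
		// Only consider per-IP allocation files, not lock/bookkeeping files.
		if e.IsDir() || net.ParseIP(e.Name()) == nil {
			continue
		}
		path := filepath.Join(dir, e.Name())
		data, err := os.ReadFile(path)
		if err != nil {
			fmt.Fprintln(os.Stderr, err)
			continue
		}
		// Per the advice above: confirm the file really holds the "dummy"
		// reservation before deleting it.
		if strings.Contains(string(data), "dummy") {
			fmt.Println("removing stale reservation:", path)
			if err := os.Remove(path); err != nil {
				fmt.Fprintln(os.Stderr, err)
			}
		}
	}
	// After this, restart the RKE2 service, as suggested above.
}
```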
t
Indeed it does; however, it has it twice, with a newline in between. Is that expected?
In any case, the event viewer shows kube-proxy as having an actual VIP this time, so I think we're in business, thanks again!
b
yes, that's expected 🙂
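For what it's worth, my reading of host-local's on-disk format (an assumption based on recent containernetworking/plugins releases, not something stated above) is that each allocation file stores the reservation ID on the first line and the interface name on the second, which would account for the placeholder string appearing twice:

```go
package main

import (
	"fmt"
	"strings"
)

// parseAllocation splits a host-local allocation file into its two fields:
// reservation ID (first line) and interface name (second line, if present).
// Assumption: this matches the format of recent host-local releases.
func parseAllocation(contents string) (id, ifname string) {
	lines := strings.SplitN(strings.TrimSpace(contents), "\n", 2)
	id = strings.TrimSpace(lines[0])
	if len(lines) > 1 {
		ifname = strings.TrimSpace(lines[1])
	}
	return id, ifname
}

func main() {
	// The file observed above: "dummy", a newline, then "dummy" again.
	id, ifname := parseAllocation("dummy\ndummy")
	fmt.Printf("reservation id=%q ifname=%q\n", id, ifname)
}
```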
t
Looks like the actual host network is still cycling on and off once it allocates an IP for the pod. Is that part of the expected fix, or is that an indication something is wrong on our end?
b
when flannel creates the network you might see a brief connectivity blip, but after that it should work well
t
Hmm, this doesn't look like a short bump or hiccup. I created a pod ~20m ago, the node stopped heartbeating to the control plane within ~1m, I deleted the pod shortly after, and I still can't get a reliable RDP/SSH connection to it (it will start a connection, then die within a few seconds). Unfortunately, I'm not a network engineer, much less a Windows one, but I'll see if we have some boots on the ground that can make sense of it.
b
can you check the rke2 logs in the node and see what you get there?
t
Nothing out of the ordinary in the event viewer or the log files under /var/lib/rancher/rke2/agent/logs, outside of connection errors back to the control plane and image registry, consistent with it losing network access while a container is running.
I did notice that the etc/resolv.conf it generated under /var/lib/rancher was set to just use 8.8.8.8 instead of the internal DNS we have set in the node's Windows networking config
This might be relevant: This machine has two ethernet NICs (though only one is enabled and plugged in)
Both of my colleagues that work in the datacenter were out Friday, so this had to be put on hold, but now that we have eyes on the machine in this bad state, it looks like as soon as the first pod IP is allocated by flannel, the physical ethernet device disappears, leaving only the vEthernet device. Is this intended behavior? We also removed the second NIC, with identical results.
b
Interesting, I have never seen that. What I have seen is that when the hns network gets created/removed, the physical interface bound to that hns network disappears for some seconds (~20s). But if I understand correctly, in your case, everything is fine: hns is created, physical NICs are available... then you create a pod and the physical interface bound to the hns network disappears for some seconds?
t
It disappears and never comes back
b
I have never seen that before, sorry