# harvester
p
Does the VM run on the node that the VIP is on? @faint-art-23779 @red-king-19196 is this the same as https://github.com/harvester/harvester/issues/3960?
s
yeah
r
Probably hitting issue 3960. Have you created any new cluster network and network config other than the mgmt one? Or just created the L2Vlan VM Network associated with the mgmt cluster network?
s
Yeah, we have a separate NIC for the cluster network.
@red-king-19196 what can we do to fix this?
@adventurous-portugal-91104 cc
a
OK? If this is true, that's not very good. Though we have a 32 GB node we could use in this role and rebuild our Harvester cluster (again..... sigh).
We don't have this kind of setup, @some-addition-13540. We have a dedicated VM network on a totally different VLAN ID. Remember, we had a similar issue earlier when we did something like this, when we ran VMs on the mgmt network.
r
Can you directly curl port 6443 with any of the three VMs’ IP addresses from the bastion VM?
s
Yes I can
@red-king-19196 I also created a plain Ubuntu VM with nginx; the proxy pod gives:
time="2024-07-10T09:54:55Z" level=info msg="probe error, I/O timeout, address: 10.0.0.227:80, timeout: 3s"
But I can curl 10.0.0.227:80 totally fine from the bastion itself.
LB doesn't work
This seems wrong? mgmt-br has all of the IP addresses associated with it, not hypervisor-br, which I thought was the one I had assigned them to in the LB IPAM pool.
8: mgmt-br: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000                       
    link/ether 1c:98:ec:5c:18:28 brd ff:ff:ff:ff:ff:ff                                                                                                         
    inet 10.0.2.10/24 brd 10.0.2.255 scope global mgmt-br                                                                                                      
       valid_lft forever preferred_lft forever                                                                                                                 
    inet 10.0.2.20/32 scope global mgmt-br                                                                                                                     
       valid_lft forever preferred_lft forever                                                                                                                 
    inet 10.0.0.30/32 scope global mgmt-br                                                                                                                     
       valid_lft forever preferred_lft forever                                                                                                                 
    inet 10.0.0.31/32 scope global mgmt-br                                                                                                                     
       valid_lft forever preferred_lft forever
@red-king-19196 any idea?
f
Seems there are 3 LB IPs associated: 10.0.2.20, 30, 31. Only 10.0.2.10 can be pinged from an IP in the same subnet. Can you run ip route show and ip neigh on the node and provide the output?
s
10.0.2.10 is the Harvester node, 10.0.2.30 is the Harvester VIP on the management net, 10.0.0.30 is a Talos cluster LB VIP on the hypervisor / VM network, and 10.0.0.31 is an nginx test on the hypervisor / VM network.
f
Sorry, I was confused by the IPs 10.0.0.30 and 10.0.2.30. The IP addresses with the /32 suffix imply they're LB IPs and there's no subnet for them, which means they can't be found by L2 broadcast (ARP). The VIP is reached by some magic iptables rules that route traffic destined for the VIP via mgmt-br's IP (10.0.2.10). So you mean curl to 10.0.2.10 is OK but curl to 10.0.2.30 fails?
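One way to confirm this from the bastion is to check whether the /32 LB IPs ever answer at L2; a minimal sketch, assuming arping is installed on the bastion and eth0 is its interface on the 10.0.0.0/24 network:
ip neigh show 10.0.0.30              # look for a resolved MAC vs FAILED/INCOMPLETE
sudo arping -c 3 -I eth0 10.0.0.30   # expect no replies if the IP is never announced on this segment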
s
10.0.2.10 is the node IP of the node, 10.0.2.30 is Harvester's VIP, and 10.0.0.30 and .31 are other VM / k8s-based LBs that don't work.
f
Sounds like a problem with the LB. Does ping work and only HTTP(S) fail? Can you run iptables-save, save the output to a file, and post the file here?
s
sure..
And no, ping to the VM LBs doesn't work either.
f
The command sysctl -a | grep bridge-nf-call should show net.bridge.bridge-nf-call-iptables=0 in your configuration. If so, please run sysctl -w net.bridge.bridge-nf-call-iptables=1 and check whether the ping/curl starts working again. Thanks
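A minimal sketch of verifying the change on the node (assuming root; the ip6tables/arptables values are shown only for context):
sysctl net.bridge.bridge-nf-call-iptables net.bridge.bridge-nf-call-ip6tables net.bridge.bridge-nf-call-arptables   # current values
sysctl -w net.bridge.bridge-nf-call-iptables=1   # applies at runtime, no reboot required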
s
do I need to reboot or something after that?
f
No.
The sysctl should work at runtime.
s
did that now, still can't ping etc
f
Then I'll need the output file of iptables-save to check how the ping/HTTP packets are being redirected....
f
-A INPUT -d 10.0.0.31/32 -p tcp -m tcp --dport 80 -m comment --comment "default/webtest kube-vip load balancer IP" -j ACCEPT
-A INPUT -d 10.0.0.30/32 -p tcp -m tcp --dport 6443 -m comment --comment "cluster-capi-mgmt-p-01/cluster-capi-mgmt-p-01-capi-mgmt-p-01-lb kube-vip load balancer IP" -j ACCEPT
-A INPUT -d 10.0.0.30/32 -p udp -m udp --dport 68 -m comment --comment "cluster-capi-mgmt-p-01/cluster-capi-mgmt-p-01-capi-mgmt-p-01-lb kube-vip load balancer IP" -j ACCEPT
It seems that you are able to curl 10.0.0.31 (the nginx). 10.0.0.30 is only open for port 6443 (apiserver), not port 80.
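A quick way to pull just the relevant rules out of an iptables-save dump, assuming it was saved to a file named iptables.txt (the filename is only an example) and the LB IPs are 10.0.0.30/31 as above:
grep -E '10\.0\.0\.3[01]' iptables.txt   # only rules touching the two LB IPs
iptables-save | grep 'kube-vip'          # or filter the live ruleset by the kube-vip comments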
s
No I am not able to curl 10.0.0.31
curl --connect-timeout 1 10.0.0.31:80
curl: (28) Failed to connect to 10.0.0.31 port 80 after 1001 ms: Timeout was reached
curl --connect-timeout 1 10.0.0.30:6443
curl: (28) Failed to connect to 10.0.0.30 port 6443 after 1001 ms: Timeout was reached
that's from the bastion on the same network as the LB
Also, as a note: if I set sysctl -w net.bridge.bridge-nf-call-iptables=0, then the console / Harvester VIP is available from the bastion; if I set it to 1, it isn't..
With sysctl -w net.bridge.bridge-nf-call-iptables=1 I do, however, reach the VIP:6443 / kube API fine.
r
Let me recap the current information:
• The target VMs are on the 10.0.0.0/24 subnet
• The LB IP address for the target VMs is 10.0.0.30
• The bastion VM is also on the 10.0.0.0/24 subnet but outside the Harvester cluster
• The Harvester nodes are on the 10.0.2.0/24 subnet
• The Harvester cluster VIP address is 10.0.2.30
• The VM Network for the 10.0.0.0/24 subnet is associated with the hypervisor Cluster Network using a secondary NIC on each Harvester node
And you’re unable to access the LB IP address on port 6443 from the bastion VM, while direct access to the target VMs’ port 6443 is okay. Is that correct?
If that’s the case, the Harvester Load Balancer currently does not support assigning LB IP addresses to secondary interfaces. It always binds the LB IP addresses to the management interface, i.e., mgmt-br, regardless of which subnets they belong to. So, for this kind of VM-type load balancer usage, you need to create LBs from an IP pool in the same subnet range as the Harvester nodes’ management network. The other way is to move the LB IP address inside the target VMs: if they form a guest cluster that has the Harvester Cloud Provider running, you can create an LB-type Service to announce the LB IP address on the 10.0.0.0/24 subnet.
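As a rough illustration of that second option, here is a minimal sketch of an LB-type Service created inside the guest cluster; the annotation name and the pool value are assumptions based on the Harvester Cloud Provider and may differ in your version:
kubectl apply -f - <<'EOF'
apiVersion: v1
kind: Service
metadata:
  name: webtest-lb
  namespace: default
  annotations:
    cloudprovider.harvesterhci.io/ipam: pool   # assumption: pool-backed IPAM; dhcp may also be supported
spec:
  type: LoadBalancer
  selector:
    app: webtest
  ports:
    - name: http
      port: 80
      targetPort: 80
EOF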
a
How could we get this kind of setup to work in Harvester? It is very limiting to only run LBs on the mgmt interface.
That's correct.
I think we need to get someone from SUSE to answer us on this.
p
@adventurous-portugal-91104 @red-king-19196 is the main developer of the load balancer feature.
👍 1
a
Oh, I am sorry, he didn't have the SUSE employee tag on his name 🙂
@red-king-19196 So here's what Endre and I did.
Let me summarize:
• mgmt network: 10.0.2.0/24 (untagged) on the first Harvester NIC
• vm-network (called hypervisor) tagged with VLAN ID 4 on the secondary NIC: 10.0.0.0/24
• Bastion host running on the vm-network (10.0.0.5/24)
• For the vm-network I have of course also created a cluster network and the network config in Harvester
• I have a workstation that can also reach all Harvester nodes from a different network; only Endre remotes into my environment through the bastion host
• Harvester node mgmt IP: 10.0.2.10
• Harvester mgmt cluster VIP: 10.0.2.20
What we have done until now
We have tried to create multiple Kubernetes clusters with @cool-thailand-26552's Cluster API module. It creates the machines successfully and an LB in Harvester. The VMs are placed in the hypervisor network (10.0.0.0/24), with an LB set to an IP pool with range 10.0.0.31-40 and gateway 10.0.0.1. The machines boot and start as expected, but the LB won't accept or forward any traffic to the VMs in hypervisor (vm-network / VLAN ID 4).
We have been puzzled by this, so I started to retrace my steps and rethink. Does it forward any traffic at all if I just create a VM in the hypervisor network, install nginx, and create an LB in hypervisor (VLAN ID 4) with the same IP pool we tried with the Kubernetes clusters? No, it doesn't forward it.
though here is the interesting part
so if I create another LB, but this LB is set to the mgmt network 10.0.2.0/24, THEN it transfers that traffic to the VM in the hypervisor network as expected though this LB is on the mgmt network and not hypervisor network.
If that’s the case, the Harvester Load Balancer currently does not support assigning LB IP addresses to secondary interfaces. It always binds the LB IP addresses to the management interface, i.e., mgmt-br, regardless of which subnets they belong to. So, for this kind of VM-type load balancer usage, you need to create LBs from an IP pool in the same subnet range as the Harvester nodes’ management network. The other way is to move the LB IP address inside the target VMs: if they form a guest cluster that has the Harvester Cloud Provider running, you can create an LB-type Service to announce the LB IP address on the 10.0.0.0/24 subnet.
So, to understand you correctly: to get this to work we would need to use a Rancher server, connect it to Harvester, and use that path (with the Harvester Cloud Provider) to provision a Kubernetes cluster in order to get a working LB in a different network than the mgmt network?
r
so if I create another LB, but this LB is set to the mgmt network 10.0.2.0/24, THEN it transfers that traffic to the VM in the hypervisor network as expected though this LB is on the mgmt network and not hypervisor network.
This is the way to go. Traffic destined for 10.0.0.30 will never be routed to the mgmt-br interface.
Or like I mentioned, with Harvester Cloud Provider, you can bind the VIP address in the guest cluster. But I’m unsure how you provisioned the Kubernetes clusters. I guess you need to manually install it.
a
I could do this in my homelab, but if my workplace were to consider Harvester in our datacenters, this is something we would never accept when we need to replace VMware vSphere.
c
What exactly is not acceptable @adventurous-portugal-91104 ?
a
If we are forced to use the mgmt network of the physical nodes to load-balance any workload (say, anything that is not a Kubernetes guest LB) into the same network the nodes live on, that would expose us to very high risk.
c
My understanding is that using ipPools would circumvent that ... am I right @red-king-19196 ?
a
My network and security engineer at my current company would never accept that, just saying.
If you guys want to target all the ex-VMware customers to grab that business, this needs to be fixed.
Though, in a way, for just the Kubernetes API it could be acceptable for my home lab to run this in the mgmt network, but a serious enterprise would never accept it. I do have one question though: if I get this cluster up and I want a couple of Services (of course) for some workload to get an L2 VIP for exposure, would this work as a guest Kubernetes workload LB in, say, my vm-network (hypervisor: 10.0.0.0/24)?
Or would it end up hitting the same issue and have to have an endpoint in the mgmt network as well?
s
What is the use of the feature to select alternative IP pools that can be bound to alternate NICs when creating an LB, then?
r
My understanding is that using ipPools would circumvent that
It wouldn’t. It’s a connectivity issue. Packets won’t be routed to the right interface where the LB IP address is bound.
What I suggested is like below:
c
Ok, so it looks like it worked for me on Equinix Metal using Metal Gateways because, somehow, the mgmt-br interface was routed to the Metal gateway subnets....
r
Sorry, this one is clearer (not in dark mode).
Ok, so it looks like it worked for me on Equinix Metal using Metal Gateways because, somehow, the mgmt-br interface was routed to the Metal gateway subnets....
If the VLANs and routing rules are configured appropriately, it would work 👍
a
What is needed to configure the VLANs and routing appropriately? I run a Ubiquiti network stack at home with decent VLAN and networking configuration options.
Btw, your drawing is exactly what I want, but it doesn't work using an LB IP on 10.0.0.30/32; I would have to put it on 10.0.2.x/32 for it to work.
r
You’ll need to add a static route for network 10.0.0.30/32 to go to the first NIC of the Harvester node on the router. I don’t have a complex environment to check if that’s all you need to do.
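On a Linux-based router that would be something like the sketch below; 10.0.2.10 is the node's mgmt IP from earlier in the thread, and on Ubiquiti gear the equivalent is a static route entry in the UI:
ip route add 10.0.0.30/32 via 10.0.2.10   # send traffic for the LB IP to the Harvester node's mgmt address
ip route get 10.0.0.30                    # verify which route the kernel would now use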
a
As Endre also wrote, it's weird that we can create a vm-network on 10.0.0.x/24 with VLAN tag 4 on my secondary NIC, and all VMs get correct IP config and routing works, but when you try to add an LB the way you do with a VM, it doesn't work. Could an idea here be to let you assign which NIC/network an LB should be published on, like with a VM, and not only an IP pool?
You’ll need to add a static route for network 10.0.0.30/32 to go to the first NIC of the Harvester node on the router. I don’t have a complex environment to check if that’s all you need to do.
I will check up on this.
r
The drawing is to use the “cluster-type” LB instead of the “VM-type” LB. The orange-dotted arrow means moving the LB IP address into the guest cluster.
a
aaaah
right
r
Is that acceptable?
a
Sure, I will try to implement this with an RKE2 / Rancher-type deployment of Kubernetes and see if it works 🙂
r
This should work out of the box if you use the Rancher Integration to spin up an RKE2 guest cluster.
s
What does setting the type really do under the hood?
r
Do you mean the LB type or the IPAM mode?
s
lb type
r
The difference is that the “cluster-type” LB exists so that LB-type Service objects on the guest cluster work; the actual LB IP address lives on the guest cluster nodes (the VMs). The “VM-type” LB is more general-purpose; its LB IP address is bound to the Harvester nodes, not inside the VMs.
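If you want to see what that looks like on the Harvester side, here is a sketch of inspecting the generated objects; the CRD group loadbalancer.harvesterhci.io is an assumption and may vary by version:
kubectl get loadbalancers.loadbalancer.harvesterhci.io -A   # the Harvester LB objects
kubectl get ippools.loadbalancer.harvesterhci.io            # the IP pools they allocate from
kubectl get svc -A | grep LoadBalancer                      # the backing Services on the Harvester cluster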
s
Inside the VM network you mean?
Does that mean that a cluster-type LB needs to have kube-vip inside a given guest cluster?
r
Exactly
s
that is what RKE2 does?
r
The Harvester cloud provider comes with kube-vip as a dependency.
Do you have the Harvester cluster imported into the external Rancher?
a
No not yet, that is the next step
One question: are we able to use, say, Cilium as the CNI here? I know it's possible for a standard Rancher deployment not on Harvester.
s
I don't get why you need to have the CPI if you have Cilium, kube-vip, or MetalLB running in L2 mode here?
c
CPI (in general, not only for Harvester) is also needed for node ProviderID tagging and labelling with availability zones. For instance, without a CPI on a workload cluster, CAPI is not able to recognize a Machine as being Ready (CAPI connects to the workload cluster API, checks the Node objects, matches them with the Machine objects in the management cluster, and bubbles up the ProviderID to the Machine object).
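A quick way to see this on a workload cluster is to check whether the nodes have a providerID set, which is what CAPI matches against its Machine objects (a sketch, run with the workload cluster's kubeconfig):
kubectl get nodes -o custom-columns=NAME:.metadata.name,PROVIDER_ID:.spec.providerID   # an empty PROVIDER_ID keeps the Machine from becoming Ready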
👍 1
r
One question: are we able to use, say, Cilium as the CNI here? I know it's possible for a standard Rancher deployment not on Harvester.
Yes, you can choose Cilium as the CNI plugin during cluster creation.
I don't get why you need to have the CPI if you have Cilium, kube-vip, or MetalLB running in L2 mode here?
From the LB point of view, it also allows you to allocate IP addresses for LB-type Services from the IP pools managed by the underlying Harvester cluster. You can still use kube-vip or MetalLB in the guest Kubernetes cluster; it's just that you'll then need to decide and manage which IP addresses are allocated for LB Services yourself.
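For completeness, a minimal sketch of that self-managed alternative, using MetalLB in L2 mode inside the guest cluster with a manually chosen range on the VM network (the pool range and names are made up for illustration):
kubectl apply -f - <<'EOF'
apiVersion: metallb.io/v1beta1
kind: IPAddressPool
metadata:
  name: vm-network-pool
  namespace: metallb-system
spec:
  addresses:
    - 10.0.0.41-10.0.0.50   # assumption: an unused range on the 10.0.0.0/24 VM network
---
apiVersion: metallb.io/v1beta1
kind: L2Advertisement
metadata:
  name: vm-network-l2
  namespace: metallb-system
spec:
  ipAddressPools:
    - vm-network-pool
EOF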