# general
c
this is usually controlled by the externalTrafficPolicy
https://kubernetes.io/docs/tasks/access-application-cluster/create-external-load-balancer/#preserving-the-client-source-ip
.spec.externalTrafficPolicy denotes if this Service desires to route external traffic to node-local or cluster-wide endpoints. There are two available options: Cluster (default) and Local. Cluster obscures the client source IP and may cause a second hop to another node, but should have good overall load-spreading. Local preserves the client source IP and avoids a second hop for LoadBalancer and NodePort type Services, but risks potentially imbalanced traffic spreading.
If you set it to Local, it will only send traffic to nodes that have a pod for that Service
if you're manually configuring the load balancer, it's up to you to handle that yourself.
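for reference, a minimal sketch of setting it on a Service (the name, selector, and ports here are illustrative, not from your chart):
apiVersion: v1
kind: Service
metadata:
  name: my-app                   # hypothetical Service name
spec:
  type: LoadBalancer
  externalTrafficPolicy: Local   # default is Cluster
  selector:
    app: my-app
  ports:
    - port: 80
      targetPort: 8080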
a
hmm.... ok thank you. I'm looking at the ingress that this app created (deployed via helm) and at the bottom it has
status:
  loadBalancer:
    ingress:
      <all k8s worker node IPs>
I assume that's what the above link is referencing? Pods belonging to that app could be running on any one of those nodes, although we only have about a third of those nodes in the target group for our AWS load balancer. It's worked mostly fine up until this point, but we've grown a lot and are seeing more latency issues in the app itself. I know I'm being light on the details but the rest is probably more specific to the app itself and out of scope here. The nginx ingress is created by the helm chart so it's nothing we manually configure.
c
what are you using for your LoadBalancer controller?
a
Actually yes, I see in the helm charts where we can set the externalTrafficPolicy, which is still the default Cluster. Sorry, I'm not a networking person at all, so just to summarize: if the externalTrafficPolicy is set to Cluster, then as long as the AWS external load balancer gets the traffic to any node in the k8s cluster, it will find its way to the correct node/pod with an extra hop. If it is set to Local, then presumably the nodes running the pods must be part of the target group, since traffic won't be forwarded on to the right node in the cluster
c
the policy on the service tells the LoadBalancer controller how to route traffic
in the case of the AWS LB controller, it will change whether all nodes go in the target group, or just nodes with pods for that service.
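to make that concrete, here's a hedged sketch of how a controller can tell which nodes have pods when the policy is Local: Kubernetes allocates a healthCheckNodePort on the Service, and kube-proxy only reports healthy on that port from nodes with a local endpoint (the field is real Kubernetes API; the name and port value below are illustrative):
apiVersion: v1
kind: Service
metadata:
  name: my-app                   # hypothetical name
spec:
  type: LoadBalancer
  externalTrafficPolicy: Local
  # allocated automatically if unset; the load balancer health-checks
  # this port, and only nodes actually running a pod respond healthy
  healthCheckNodePort: 32000
  selector:
    app: my-app
  ports:
    - port: 80
      targetPort: 8080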
a
Where would I find which LoadBalancer controller it is using?
c
generally there is only one per cluster. What have you deployed?
a
This is our older RKE1 cluster running in EC2. We have an NLB for Rancher itself, and an ALB for the specific app we're running. We actually have a couple of app deployments in different namespaces, each with their own ingress. This is the values file and helm chart we're using; the only difference is we set the type to ClusterIP https://github.com/coder/enterprise-helm/blob/main/values.yaml#L22
c
sounds like you're not using an LB controller and are just manually configuring the ALB to point at your service?
although you said you have IPs under status.loadBalancer.ingress, so I suppose you must have something, as that would otherwise remain pending
a
in AWS? Yes we're manually configuring that
we do not have the aws-cloud-controller configured in this cluster
c
the cloud controller is separate from the AWS lb controller
you can run one without the other, if you set things up right
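as an example of that split, the AWS LB controller can bind a Service to an existing ALB target group on its own, with no cloud controller involved, via its TargetGroupBinding CRD. A hedged sketch, with placeholder names and ARN (not something from your cluster):
apiVersion: elbv2.k8s.aws/v1beta1
kind: TargetGroupBinding
metadata:
  name: my-app-tgb               # hypothetical
  namespace: my-namespace        # hypothetical
spec:
  serviceRef:
    name: my-app                 # the Service the ALB should reach
    port: 80
  targetGroupARN: arn:aws:elasticloadbalancing:...   # your existing ALB target group
  targetType: instance           # register nodes; "ip" would register pods directly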
a
we have our AWS Autoscale groups set to register the instances with the ALB target group. Although we have a couple different ASGs for nodes within that cluster, and only one of them is registering instances with the ALB target group. That's an easy fix but I'm just trying to determine if that is what's causing our issues, or if it's not a problem
yea we don't have any aws integration with this cluster
I don't know how the status.loadBalancer.ingress is updated; it's possible it just adds every k8s worker node. Or maybe the service itself adds them when a pod is spun up on a node? I have no idea how that works though. This isn't a case of something just not working, it's intermittent, which is so damn annoying to troubleshoot, so I'm trying to rule out everything I can
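for what it's worth, one thing that would explain seeing every worker node there: the nginx ingress controller publishes either the node IPs of its own pods or, if started with --publish-service, the address of the Service that flag names, and RKE1 (if I recall correctly) runs the controller as a DaemonSet on every worker. A hedged sketch of the relevant controller args (image tag and names are illustrative; check your actual deployment):
# excerpt from an ingress-nginx controller DaemonSet/Deployment
containers:
  - name: controller
    image: registry.k8s.io/ingress-nginx/controller:v1.9.4  # tag illustrative
    args:
      - /nginx-ingress-controller
      # without this flag, the controller writes the node IPs of its own
      # pods into each Ingress status; a DaemonSet on every worker would
      # therefore list every worker node IP
      - --publish-service=$(POD_NAMESPACE)/ingress-nginx-controller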