abundant-apple-86556
03/27/2025, 4:37 PMFailed to put key - error=Put "<http://127.0.0.1:8500/v1/kv/object/host/agent/>...": dial tcp 127.0.0.1:8500: connect: connection refused
We see this on enforcer startup, and when it happens the enforcer fails to cluster up and show as connected in the manager/web console. For a while we just patched a liveness + readiness probe on the pods to check 8500 but with the latest version (5.4.3) it's happening a lot more often.clean-magazine-25026
03/27/2025, 4:56 PMabundant-apple-86556
03/27/2025, 5:00 PMabundant-apple-86556
03/27/2025, 5:04 PMclean-magazine-25026
03/27/2025, 5:05 PMclean-magazine-25026
03/27/2025, 5:06 PMabundant-apple-86556
03/27/2025, 5:07 PMabundant-apple-86556
03/27/2025, 6:01 PMclean-magazine-25026
03/27/2025, 6:03 PMabundant-apple-86556
03/31/2025, 10:48 PMneuvector.role
label being required on the container image. The Ironbank image is not built with that label so it was getting stuck on the affinity check and not starting up consul.abundant-apple-86556
03/31/2025, 10:50 PMabundant-apple-86556
04/02/2025, 10:06 PMneuvector.role
on the other images? We got it added to the enforcer image (and that resolved the enforcer issue) but experiencing some weirdness on controllers in some clusters and trying to debug if its related.
I skimmed through the code and didn't see anything obvious where the label would be needed on the controller - working to get logs here to better debug...clean-magazine-25026
04/02/2025, 10:12 PMabundant-apple-86556
04/02/2025, 10:19 PMclean-magazine-25026
04/02/2025, 10:26 PMabundant-apple-86556
04/02/2025, 10:27 PMabundant-apple-86556
04/03/2025, 2:57 PMWarning Unhealthy 3m20s (x341 over 32m) kubelet Readiness probe failed:
Controller logs appear normal, we even see the readiness log (which should be when the /tmp/ready
file gets created/updated?):
❯ kl -n neuvector neuvector-controller-pod-xxx | grep "ctrl init done"
2025-04-03T14:10:15.407|INFO|CTL|utils.SetReady: - value=ctrl init done
Is it possible that the probe is getting killed by the enforcers? I was seeing this in enforcer logs:
2025-04-03T14:53:15.843|DEBU|AGT|main.reportIncident: - eLog={LogUID: ID:11 HostID:xxx HostName:xxx AgentID:xxx AgentName:xxx WorkloadID:xxx WorkloadName: ReportedAt:2025-04-03 14:53:15.843069795 +0000 UTC ProcName:cat ProcPath:/usr/bin/busybox ProcCmds:[cat /tmp/ready ] ProcRealUID:0 ProcEffUID:0 ProcRealUser: ProcEffUser:root FilePath: Files:[] LocalIP:<nil> RemoteIP:<nil> EtherType:0 LocalPort:0 RemotePort:0 IPProto:0 ConnIngress:false LocalPeer:false ProcPName:runc ProcPPath:/usr/bin/runc Count:15 StartAt:2025-04-03 14:52:10.811128089 +0000 UTC m=+2506.654157597 Action:deny RuleID:00000000-0000-0000-0000-000000000006 Group:NV.Protect Msg:Process profile violation, not from its root process: execution denied}
abundant-apple-86556
04/03/2025, 3:45 PMcat
being in a different place 😞