This message was deleted Rancher Users #harvester


# harvester

adamant-kite-43734

08/01/2024, 11:38 AM

This message was deleted.

bland-article-62755

08/01/2024, 2:02 PM

yuuup

bland-article-62755

08/01/2024, 2:03 PM

https://github.com/harvester/harvester/issues/5251

bland-article-62755

08/01/2024, 2:03 PM

rebooting is only a temp fix.

bland-article-62755

08/01/2024, 2:03 PM

at least for me

bland-article-62755

08/06/2024, 12:19 AM

Did you dig into this any more, @ambitious-knife-27333? Were you ever able to grab logs of the serial port write failures? I had a ticket open with SUSE, but they're giving me crap, saying it works in openSUSE so I should take it to Alma.

bland-article-62755

08/06/2024, 6:00 PM

Actually, it seems like 9.3 is the only one affected? Ugh. We'll see.

ambitious-knife-27333

08/07/2024, 2:07 PM

@bland-article-62755 afraid we’ve not dug in more. We’re definitely seeing this behaviour in Alma 9.4 as well.

bland-article-62755

08/07/2024, 2:33 PM

Granted, I'm on Harvester 1.2.2, which is actually newer than 1.3.0

bland-article-62755

08/07/2024, 2:33 PM

But I downloaded the 9.4 Alma image from a few days ago, and I can't get it to reproduce currently.

bland-article-62755

08/07/2024, 2:34 PM

With our older 9.3 image, it's easy to replicate (takes less than 5 minutes).

bland-article-62755

08/07/2024, 2:37 PM

I thought it might be related to the qemu-guest-agent version (8.2.2) but Fedora 40 and Ubuntu 24.04 LTS are on that version too, and I can't replicate it there either.

bland-article-62755

08/07/2024, 2:40 PM

My current thought on how it might be different is the virtual kernel serial port. I'm guessing that's a kernel function, so maybe the issue is there? IDK. I'm still running tests, and I'm kinda hopeful I can get it to trigger on one of the other OSes.

bland-article-62755

08/07/2024, 2:52 PM

Can you tell me what version you get from `qemu-ga --version`?

bland-article-62755

08/07/2024, 2:56 PM

Ok, turns out the ones that are wacky are version 8.2.0 and it seems to be fixed in 8.2.2
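
For anyone comparing guests, the version check above can be scripted. This is a minimal sketch, not something from the thread: it assumes `qemu-ga --version` prints a line containing a dotted version such as `QEMU Guest Agent 8.2.0`, and it uses 8.2.2 as the threshold only because that is the version reported as behaving at this point in the conversation. The helper names `parse_ga_version` and `version_lt` are made up for illustration.

```shell
# Extract the first dotted version number (e.g. 8.2.0) from a string.
# Assumption: `qemu-ga --version` prints something like "QEMU Guest Agent 8.2.0".
parse_ga_version() {
    printf '%s\n' "$1" | grep -oE '[0-9]+\.[0-9]+\.[0-9]+' | head -n 1
}

# Succeed (exit 0) when version $1 is strictly older than version $2.
# Relies on `sort -V` (version sort) from GNU coreutils.
version_lt() {
    [ "$1" != "$2" ] &&
        [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$1" ]
}

# Usage inside a guest (commented out so the helpers can be sourced safely):
# v=$(parse_ga_version "$(qemu-ga --version)")
# if version_lt "$v" 8.2.2; then echo "agent $v predates 8.2.2"; fi
```

The version number alone may not be the whole story, so treat the result as a hint for which guests to test rather than a verdict.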

bland-article-62755

08/07/2024, 3:04 PM

SUSE Support said 8.2.4, but that's not what I'm seeing in my testing.

bland-article-62755

08/07/2024, 3:40 PM

Ok... I'm about ready to call it quits. After running `dnf update -y ; systemctl reboot`, how often the IP is present versus missing has flipped: after updating and rebooting, it's there most of the time instead of gone most of the time in my cluster. I haven't been able to get the IP to vanish on any of the Ubuntu or Fedora boxes running `8.2.2`. @ambitious-knife-27333 I'm happy to help with testing if you have another idea.

ambitious-knife-27333

08/09/2024, 12:11 PM

So I pulled the latest version of Alma (5th Aug) and OOTB this does not appear to be a problem. This version has agent version `8.2.0-11` in it. However, you can make the issue occur by restarting `qemu-guest-agent`. If you restart the VM, then it seems to be okay again.
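
The restart-then-observe reproduction described above could be timed with a small polling loop. This is a rough sketch under several assumptions: that the symptom is the VM's IP disappearing from what the cluster reports, that `kubectl` has access to the Harvester cluster, and that the address is exposed on the KubeVirt VMI object at `.status.interfaces[0].ipAddress`. The function names, the namespace `default`, and the VM name `alma-test` are hypothetical.

```shell
# Classify the address string the cluster reports for a VM.
# Prints "present" when it is non-empty, "missing" otherwise.
classify_ip() {
    if [ -n "$1" ]; then echo present; else echo missing; fi
}

# Poll a VMI and log, once per interval, whether an IP is reported.
# Arguments: namespace, VM name, interval in seconds, number of samples.
watch_vmi_ip() {
    ns=$1; vm=$2; interval=${3:-5}; samples=${4:-60}
    i=0
    while [ "$i" -lt "$samples" ]; do
        # Assumption: the first interface carries the address of interest.
        ip=$(kubectl -n "$ns" get vmi "$vm" \
            -o jsonpath='{.status.interfaces[0].ipAddress}' 2>/dev/null)
        printf '%s %s %s\n' "$(date +%T)" "$(classify_ip "$ip")" "$ip"
        i=$((i + 1))
        sleep "$interval"
    done
}

# Usage (hypothetical namespace and VM name):
#   in the guest:    systemctl restart qemu-guest-agent
#   on the cluster:  watch_vmi_ip default alma-test 5 60
```

Logging timestamps this way makes it easier to tell a brief dropout from one that persists until the VM is restarted.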

ambitious-knife-27333

08/09/2024, 12:12 PM

In essence, I think it is the agent restart that causes this. Are you using the default `cloud-init` config in Harvester?

bland-article-62755

08/09/2024, 2:09 PM

I am. I did get 1/15 instances using that same version to replicate the issue, but it was very brief.

bland-article-62755

08/09/2024, 2:11 PM

The IP was missing for less than 30 seconds. I'm guessing they patched it somehow, but I have no idea how, since the RPM version seems to be the same as before.
