This message was deleted Rancher Users #longhorn-storage

Join Slack

This message was deleted.

# longhorn-storage

adamant-kite-43734

08/23/2023, 1:13 AM

This message was deleted.

blue-kitchen-51801

08/23/2023, 6:24 AM

I'm also interested for an answer

narrow-egg-98197

08/23/2023, 7:05 AM

The approach I'm currently thinking of is indeed the approach you said, and it's similar to the approach mentioned in the document: Updating the Node OS or Container Runtime.

narrow-egg-98197

08/23/2023, 7:06 AM

The method that can speed up the rebuilding first is to adjust the

Replica Replenishment Wait Interval

(doc ) first.

narrow-egg-98197

08/23/2023, 7:08 AM

In addition, in 1.4.0, there is also a performance improvement for rebuilding(#4783) , and there is a new improvement(#5002) plan to replace the original HTTP method with other data transfer protocols to improve the performance of rebuilding.

narrow-egg-98197

08/23/2023, 7:08 AM

What version number are you currently using?

nice-tent-65195

08/23/2023, 1:40 PM

we are experiencing the same problem with 1.5.1 LH in our case, we do not exceed the default Replica Replenishment Wait Interval (with the node drain state) so increasing it would not help with the problem

abundant-hair-58573

08/23/2023, 2:20 PM

We are at 1.3.2. I actually reduced the replenishment wait interval so the new replicas would start building sooner, since I was waiting for that to finish before draining and rebooting the next node. Many of our nodes are pretty full and Unschedulable, if I get the node back up within the Replinishment wait interval will it still rebuild the replica on that same node even if it's unschedulable?

abundant-hair-58573

08/23/2023, 5:23 PM

With a little testing it looks like Longhorn will not rebuild the failed replica on the same node when it comes back up if it is Unschedulable. At least half of our nodes are unschedulable so that's unfortunate.

4 Views

Open in Slack

Previous Next