Revert "W/A NFS server becoming unreachable mid run" #14362

jean-edouard · 2025-03-27T15:26:28Z

What this PR does

This reverts the workaround added by #13443, as I believe it is now hurting us more than helping.

Fixes #

Why we need it and why it was done in this way

The following tradeoffs were made:

The following alternatives were considered:

Links to places where the discussion took place:

Special notes for your reviewer

Checklist

This checklist is not enforcing, but it's a reminder of items that could be relevant to every PR.
Approvers are expected to review this list.

Design: A design document was considered and is present (link) or not required
PR: The PR description is expressive enough and will help future contributors
Code: Write code that humans can understand and Keep it simple
Refactor: You have left the code cleaner than you found it (Boy Scout Rule)
Upgrade: Impact of this change on upgrade flows was considered and addressed if required
Testing: New code requires new unit tests. New features and bug fixes require at least on e2e test
Documentation: A user-guide update was considered and is present (link) or not required. You want a user-guide update if it's a user facing feature / API change.
Community: Announcement to kubevirt-dev was considered

Release note

NONE

jean-edouard · 2025-03-27T15:26:57Z

/cc @akalenyu
/cc @fossedihelm

fossedihelm · 2025-03-27T15:30:22Z

/lgtm
/cc @brianmcarey

enp0s3 · 2025-03-27T15:39:49Z

/approve

kubevirt-bot · 2025-03-27T15:39:57Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: enp0s3

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~hack/OWNERS~~ [enp0s3]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

kubevirt-commenter-bot · 2025-03-27T15:40:03Z

Required labels detected, running phase 2 presubmits:
/test pull-kubevirt-e2e-windows2016
/test pull-kubevirt-e2e-kind-1.30-vgpu
/test pull-kubevirt-e2e-kind-sriov
/test pull-kubevirt-e2e-k8s-1.32-ipv6-sig-network
/test pull-kubevirt-e2e-k8s-1.30-sig-network
/test pull-kubevirt-e2e-k8s-1.30-sig-storage
/test pull-kubevirt-e2e-k8s-1.30-sig-compute
/test pull-kubevirt-e2e-k8s-1.30-sig-operator
/test pull-kubevirt-e2e-k8s-1.31-sig-network
/test pull-kubevirt-e2e-k8s-1.31-sig-storage
/test pull-kubevirt-e2e-k8s-1.31-sig-compute
/test pull-kubevirt-e2e-k8s-1.31-sig-operator

brianmcarey

Looks good to me - no issues with it. Can you just expand on why you think it is hurting more than helping?

Its a little too soon to say but it looks like we found a way of getting stable compute-migrations lanes again - #14354

I would prefer to take in #14354 before merging this just to limit the change see and to see if it helps.

brianmcarey · 2025-03-27T15:42:46Z

/hold

jean-edouard · 2025-03-27T15:45:01Z

Looks good to me - no issues with it. Can you just expand on why you think it is hurting more than helping?

Just in case the nfs server pod ever restarts

Its a little too soon to say but it looks like we found a way of getting stable compute-migrations lanes again - #14354

That's not a bad idea, many tests there are quite resource-heavy, but I doubt that will solve the NFS issues we're seeing...

I would prefer to take in #14354 before merging this just to limit the change see and to see if it helps.

Absolutely, 1 test at a time is the way to go

brianmcarey · 2025-03-27T15:48:55Z

Looks good to me - no issues with it. Can you just expand on why you think it is hurting more than helping?

Just in case the nfs server pod ever restarts

+1 - that would be a problem

Its a little too soon to say but it looks like we found a way of getting stable compute-migrations lanes again - #14354

That's not a bad idea, many tests there are quite resource-heavy, but I doubt that will solve the NFS issues we're seeing...

I have 10 runs so far without hitting the nfs timeouts - at the current failure rate on main I would expect to hit it at least once or twice in 10 but you're right - proof is in the pudding - will have to see how it goes on main.

I would prefer to take in #14354 before merging this just to limit the change see and to see if it helps.

Absolutely, 1 test at a time is the way to go

Cheers - thanks.

But yeah we should look at removing this WA as soon as we can.

jean-edouard · 2025-03-27T16:23:19Z

See also kubevirt/kubevirtci#1407

brianmcarey

/hold cancel

This workaround did not improve the compute-migrations stability as we had hoped and there is a risk that this can break if pod restarts occur.

This reverts commit 24872e9. Signed-off-by: Jed Lejosne <jed@redhat.com>

kubevirt-commenter-bot · 2025-03-30T09:55:17Z

Required labels detected, running phase 2 presubmits:
/test pull-kubevirt-e2e-windows2016
/test pull-kubevirt-e2e-kind-1.30-vgpu
/test pull-kubevirt-e2e-kind-sriov
/test pull-kubevirt-e2e-k8s-1.32-ipv6-sig-network
/test pull-kubevirt-e2e-k8s-1.30-sig-network
/test pull-kubevirt-e2e-k8s-1.30-sig-storage
/test pull-kubevirt-e2e-k8s-1.30-sig-compute
/test pull-kubevirt-e2e-k8s-1.30-sig-operator
/test pull-kubevirt-e2e-k8s-1.31-sig-network
/test pull-kubevirt-e2e-k8s-1.31-sig-storage
/test pull-kubevirt-e2e-k8s-1.31-sig-compute
/test pull-kubevirt-e2e-k8s-1.31-sig-operator

kubevirt-bot added release-note-none dco-signoff: yes sig/buildsystem size/S labels Mar 27, 2025

kubevirt-bot requested review from enp0s3 and xpivarc March 27, 2025 15:26

kubevirt-bot requested review from akalenyu and fossedihelm March 27, 2025 15:26

akalenyu approved these changes Mar 27, 2025

View reviewed changes

kubevirt-bot assigned akalenyu Mar 27, 2025

kubevirt-bot added the lgtm label Mar 27, 2025

kubevirt-bot requested a review from brianmcarey March 27, 2025 15:30

kubevirt-bot assigned fossedihelm Mar 27, 2025

kubevirt-bot added the approved label Mar 27, 2025

brianmcarey reviewed Mar 27, 2025

View reviewed changes

kubevirt-bot added the do-not-merge/hold label Mar 27, 2025

brianmcarey reviewed Mar 28, 2025

View reviewed changes

kubevirt-bot removed the do-not-merge/hold label Mar 28, 2025

Revert "W/A NFS server becoming unreachable mid run"

Loading
Loading status checks…

f006b9b

This reverts commit 24872e9. Signed-off-by: Jed Lejosne <jed@redhat.com>

jean-edouard force-pushed the warevert branch from af38f7c to f006b9b Compare March 28, 2025 19:48

kubevirt-bot removed the lgtm label Mar 28, 2025

akalenyu approved these changes Mar 30, 2025

View reviewed changes

kubevirt-bot added the lgtm label Mar 30, 2025

kubevirt-bot merged commit 60d8d14 into kubevirt:main Mar 30, 2025
38 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Revert "W/A NFS server becoming unreachable mid run" #14362

Revert "W/A NFS server becoming unreachable mid run" #14362

Uh oh!

jean-edouard commented Mar 27, 2025

Uh oh!

jean-edouard commented Mar 27, 2025

Uh oh!

fossedihelm commented Mar 27, 2025

Uh oh!

enp0s3 commented Mar 27, 2025

Uh oh!

kubevirt-bot commented Mar 27, 2025

Uh oh!

kubevirt-commenter-bot commented Mar 27, 2025

Uh oh!

brianmcarey left a comment

Uh oh!

brianmcarey commented Mar 27, 2025

Uh oh!

jean-edouard commented Mar 27, 2025

Uh oh!

brianmcarey commented Mar 27, 2025 •

edited

Loading

Uh oh!

jean-edouard commented Mar 27, 2025

Uh oh!

brianmcarey left a comment

Uh oh!

kubevirt-commenter-bot commented Mar 30, 2025

Uh oh!

Uh oh!

Revert "W/A NFS server becoming unreachable mid run" #14362

Revert "W/A NFS server becoming unreachable mid run" #14362

Uh oh!

Conversation

jean-edouard commented Mar 27, 2025

What this PR does

Why we need it and why it was done in this way

Special notes for your reviewer

Checklist

Release note

Uh oh!

jean-edouard commented Mar 27, 2025

Uh oh!

fossedihelm commented Mar 27, 2025

Uh oh!

enp0s3 commented Mar 27, 2025

Uh oh!

kubevirt-bot commented Mar 27, 2025

Uh oh!

kubevirt-commenter-bot commented Mar 27, 2025

Uh oh!

brianmcarey left a comment

Choose a reason for hiding this comment

Uh oh!

brianmcarey commented Mar 27, 2025

Uh oh!

jean-edouard commented Mar 27, 2025

Uh oh!

brianmcarey commented Mar 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jean-edouard commented Mar 27, 2025

Uh oh!

brianmcarey left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kubevirt-commenter-bot commented Mar 30, 2025

Uh oh!

Uh oh!

brianmcarey commented Mar 27, 2025 •

edited

Loading