[BGP] Fix FRRConfiguration cleanup #340

Merged

Conversation

@stuggi (Contributor) commented Jan 24, 2025

Currently, the FRRConfigurations are processed as follows:

  • get a list of all pods in the ctlplane namespace
  • filter that list down to the pods which have a secondary IF configured
  • iterate over that list of pods with an IF attached and check whether the node each pod runs on has changed

While running through that nested loop to check whether an update is needed, we also check whether an FRRConfiguration still exists for an already deleted pod.

This works as long as there is at least one pod with a secondary IF; if there is none, the cleanup never runs, which is exactly the situation for the FRRConfiguration of the last deleted pod.
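A minimal, self-contained Go sketch of that problematic shape (every name below is invented for illustration and does not mirror the operator's actual code):

```go
package main

import "fmt"

type pod struct{ name string }
type frrConfig struct{ ownerPod string }

// ownerExists reports whether any pod in the list still owns the config.
func ownerExists(cfg frrConfig, pods []pod) bool {
	for _, p := range pods {
		if p.name == cfg.ownerPod {
			return true
		}
	}
	return false
}

// cleanupNested models the buggy shape: the stale-config check sits
// inside the loop over pods with a secondary IF, so it never executes
// once that list is empty.
func cleanupNested(podsWithIF []pod, cfgs []frrConfig) []frrConfig {
	kept := cfgs
	for range podsWithIF { // never entered after the last pod is gone
		var next []frrConfig
		for _, c := range kept {
			if ownerExists(c, podsWithIF) {
				next = append(next, c) // keep configs with a live owner
			}
		}
		kept = next
	}
	return kept
}

func main() {
	stale := []frrConfig{{ownerPod: "deleted-pod"}}
	// No pods with a secondary IF remain, yet the stale config survives:
	fmt.Println(cleanupNested(nil, stale)) // prints [{deleted-pod}]
}
```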

This issue was only seen intermittently because the functional test did not wait for the pod and its FRRConfiguration to exist before it started the delete test.
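A hedged sketch of the test-side ordering fix, assuming a Ginkgo/Gomega envtest setup; frrConfigName, pod, timeout, interval and the frrk8sv1/k8serrors imports are placeholders, not the repo's actual helpers:

```go
It("removes the FRRConfiguration of a deleted pod", func() {
	// Wait until the reconciler has actually created the config;
	// without this wait the delete test raced the reconciler and
	// only failed intermittently.
	Eventually(func(g Gomega) {
		cfg := &frrk8sv1.FRRConfiguration{}
		g.Expect(k8sClient.Get(ctx, frrConfigName, cfg)).To(Succeed())
	}, timeout, interval).Should(Succeed())

	// Only now delete the pod and assert the config goes away.
	Expect(k8sClient.Delete(ctx, pod)).To(Succeed())
	Eventually(func(g Gomega) {
		cfg := &frrk8sv1.FRRConfiguration{}
		err := k8sClient.Get(ctx, frrConfigName, cfg)
		g.Expect(k8serrors.IsNotFound(err)).To(BeTrue())
	}, timeout, interval).Should(Succeed())
})
```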

This change fixes the functional test and moves the cleanup out of the mentioned loop, so the check also runs properly when the last pod gets deleted.
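Hoisted out of the per-pod loop, the cleanup then runs on every reconcile regardless of how many pods still have a secondary IF; a sketch using the invented types from the first example:

```go
// cleanupHoisted models the fix: stale configs are pruned before, and
// independently of, the per-pod node-change validation, so the check
// also fires when podsWithIF is empty.
func cleanupHoisted(podsWithIF []pod, cfgs []frrConfig) []frrConfig {
	var kept []frrConfig
	for _, c := range cfgs {
		if ownerExists(c, podsWithIF) {
			kept = append(kept, c)
		}
	}
	for range podsWithIF {
		// the per-pod node-change validation would still happen here
	}
	return kept
}
```

With this shape, cleanupHoisted(nil, stale) returns an empty slice, so the FRRConfiguration of the last deleted pod is cleaned up as well.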

Signed-off-by: Martin Schuppert <[email protected]>
@openshift-ci openshift-ci bot requested review from abays and lewisdenny January 24, 2025 12:58
@stuggi stuggi requested review from lmiccini and removed request for lewisdenny January 24, 2025 13:32

openshift-ci bot commented Jan 24, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: lmiccini, stuggi

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment


Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://softwarefactory-project.io/zuul/t/rdoproject.org/buildset/163374cbedcc480a8a04255da14ca84b

✔️ openstack-k8s-operators-content-provider SUCCESS in 1h 39m 36s
❌ podified-multinode-edpm-deployment-crc FAILURE in 1h 21m 09s
❌ cifmw-crc-podified-edpm-baremetal FAILURE in 51m 38s

@lmiccini (Contributor)

recheck


Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://softwarefactory-project.io/zuul/t/rdoproject.org/buildset/3ee459fefc6c4e0f8deda6e0d370de3d

✔️ openstack-k8s-operators-content-provider SUCCESS in 1h 44m 08s
❌ podified-multinode-edpm-deployment-crc FAILURE in 1h 25m 37s
❌ cifmw-crc-podified-edpm-baremetal FAILURE in 51m 44s

@stuggi (Contributor, Author) commented Jan 27, 2025

recheck


Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://softwarefactory-project.io/zuul/t/rdoproject.org/buildset/c6e466a3021f4734991caafeab482523

✔️ openstack-k8s-operators-content-provider SUCCESS in 2h 13m 37s
❌ podified-multinode-edpm-deployment-crc FAILURE in 1h 23m 50s
❌ cifmw-crc-podified-edpm-baremetal FAILURE in 1h 21m 48s

@abays (Contributor) commented Jan 27, 2025

Same error here as [1]:

2025-01-27 12:18:11.971 26 ERROR tempest.lib.common.utils.linux.remote_client [-] (TestNetworkBasicOps:test_connectivity_between_vms_on_different_networks) Executing command on 192.168.122.248 failed. Error: Command 'set -eu -o pipefail; PATH=$PATH:/sbin:/usr/sbin; ping -c1 -w1 -s56 10.100.0.20' failed, exit status: 1, stderr:

Looks like we have a cross-repo Zuul Tempest problem.

[1] openstack-k8s-operators/openstack-operator#1278 (comment)

@stuggi (Contributor, Author) commented Feb 4, 2025

recheck

@openshift-merge-bot openshift-merge-bot bot merged commit ae8379c into openstack-k8s-operators:main Feb 4, 2025
7 checks passed