Wednesday, 2023-09-27

opendevreviewMohammed Naser proposed openstack/neutron master: Skip trunk sub-ports in "update_virtual_port_host"  https://review.opendev.org/c/openstack/neutron/+/89659001:33
opendevreviewNatz cheng proposed openstack/os-vif master: cleanup py27 support  https://review.opendev.org/c/openstack/os-vif/+/89659801:57
*** ralonsoh_afk is now known as ralonsoh06:21
slaweqralonsoh hi, please check mnaser's patch https://review.opendev.org/c/openstack/neutron/+/896590 and his findings related to the bug https://bugs.launchpad.net/neutron/+bug/2024160 which he talked about yesterday evening our time06:43
opendevreviewSlawek Kaplonski proposed openstack/neutron master: [OVN] Add baremetal support without Neutron DHCP agent for IPv6  https://review.opendev.org/c/openstack/neutron/+/89068307:05
slaweqralonsoh and please also check my last comment in https://review.opendev.org/c/openstack/neutron/+/890683 - thx in advance07:05
ralonsohsure07:08
ralonsohslaweq, https://review.opendev.org/c/openstack/neutron/+/890683/comments/2c953a3b_9761428307:34
ralonsohI think that makes sense (but please check it)07:34
opendevreviewMerged openstack/neutron master: contributor docs: Architectural overview for metadata  https://review.opendev.org/c/openstack/neutron/+/89491108:06
opendevreviewElvira García Ruiz proposed openstack/neutron stable/2023.1: [OVN] Fix rate and burst for stateless security groups  https://review.opendev.org/c/openstack/neutron/+/89661408:19
slaweqralonsoh thx, I will address Your comment and add this upgrade check to this patch08:23
ralonsohslaweq, thanks a lot!08:44
ralonsohplease, check https://review.opendev.org/c/openstack/neutron/+/893447 if you have time08:44
opendevreviewStefan Hoffmann proposed openstack/neutron stable/2023.1: Reduce lock contention on subnets  https://review.opendev.org/c/openstack/neutron/+/89308209:14
opendevreviewStefan Hoffmann proposed openstack/neutron stable/zed: Reduce lock contention on subnets  https://review.opendev.org/c/openstack/neutron/+/89308409:15
opendevreviewStefan Hoffmann proposed openstack/neutron stable/yoga: Reduce lock contention on subnets  https://review.opendev.org/c/openstack/neutron/+/88923809:17
opendevreviewMerged openstack/networking-bagpipe master: Update master for stable/2023.2  https://review.opendev.org/c/openstack/networking-bagpipe/+/89508309:43
MaxLamprecht[m]ralonsoh: hi, I checked your MySQL snippet on https://bugs.launchpad.net/neutron/+bug/2037107 and actually it proves the subnet X networkrbacs problem to me. Maybe you can have a look at that topic and my new comments again. thx10:09
ralonsohMaxLamprecht[m], how is that?10:14
ralonsohwhy does this query prove the problem?10:14
MaxLamprecht[m]because the group by happens only in a subquery (not sure why SQLAlchemy is behaving like that). Afterwards we can see again a left outer join to networkrbacs in the query which is not needed (IMO)10:18
ralonsohMaxLamprecht[m], what exactly is this subquery retrieving?10:19
MaxLamprecht[m]the 2nd outer join happens due to the orm/ovo model with the lazy='joined' option10:19
MaxLamprecht[m]but the subquery is retrieving the network.id10:21
ralonsohexactly and that subquery is returning just one ID10:21
ralonsohthis is why the group-by clause 10:21
ralonsohthat is explained in the series of patches I pushed10:21
MaxLamprecht[m]yeah and all the networkrbacs and subnets have the same network_id -> that's why the combinatorial blow-up happens for subnets X networkrbacs10:23
ralonsohagain, the subquery is returning one network ID10:24
ralonsohthen the outer query is retrieving the subnets from this network ID10:25
ralonsohthe inner query is optimized with the group-by clause and returns one value10:25
ralonsohthe outer query selects the subnets from this single ID10:25
ralonsohsubnets don't have RBACs, the query filters first the network using the RBAC registers10:26
ralonsohthen retrieves the subnets from the filtered network IDs10:26
MaxLamprecht[m]ralonsoh: exactly, but why do we then have the left outer join to networkrbacs in the outer query10:27
ralonsohyou can't execute the inner query without adding the networkrbacs to the query10:28
ralonsohbut your assumption that this is combining both is wrong10:29
ralonsohdid you perform these queries manually?10:29
MaxLamprecht[m]but in the inner query there is also a left outer join to networkrbacs10:29
ralonsohdid you execute this query manually?10:30
ralonsohdid you check how many results it returns?10:30
MaxLamprecht[m]16588 rows in set (0.06 sec)10:30
ralonsohplease post the query10:30
MaxLamprecht[m]I used the non admin query from your comment #3 (link [3])10:31
MaxLamprecht[m]just replaced the project_id10:32
ralonsohMaxLamprecht[m], no, I don't have this result10:35
ralonsohso the reproducer script is not correct and something is missing10:35
ralonsohplease check the networkrbac table and what other registers have10:36
MaxLamprecht[m]but you see the 2 left outer joins to networkrbacs in this query?10:36
ralonsohI do, yes10:36
MaxLamprecht[m]I will try to reproduce it in a new devstack10:36
ykarelbhaley, lajoskatona ralonsoh, ovs/linuxbridge job fails even with os-vif reverted patch so it's something else10:40
ralonsohykarel, when the issue started?10:40
ykarelralonsoh, 22nd evening10:41
lajoskatonaykarel: the traceback was the same as I remember, but possible that we have to make the lb job non-voting?10:46
lajoskatonaralonsoh: see the opensearch link in the comment: https://bugs.launchpad.net/neutron/+bug/2037239/comments/210:46
ykarellajoskatona, ovs jobs also impacted10:46
lajoskatonaykarel: I mean with os-vif fix/revert10:47
ralonsohthere is nothing relevant merged recently in the Neutron code10:47
ralonsohare we using a different OVS version?10:47
ykarellajoskatona, yes with os-vif revert issue still reproduces with both ovs/lb jobs10:47
lajoskatonaralonsoh: that would not affect linuxbridge jobs10:48
ykarellast night run had ovs jobs passing, but i rechecked today and it reproduces even in ovs jobs so os-vif patch is not related10:48
MaxLamprecht[m]<MaxLamprecht[m]> "but you see the 2 left outer..." <- The 1st join joins (**networks** X networkrbacs). The 2nd join joins (**subnets** X networkrbacs). 10:52
MaxLamprecht[m]The group by/rbac filtering happens only for (**network** X networkrbacs) but not for the 2nd join. So due to that 2nd join we have the combinatorial blow-up between **subnets** and networkrbacs in one network.10:52
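
The row multiplication described above can be reproduced on a toy schema. The following is only a minimal sketch using SQLite: the three tables, their columns, and the `count_rows` helper are simplified, hypothetical stand-ins for illustration, not the real Neutron models or the query from the bug report. The inner subquery is grouped and returns a single network ID, while the second left outer join in the outer query still pairs every subnet with every RBAC entry of that network.

```python
import sqlite3

# Hypothetical, heavily simplified schema (not the real Neutron tables).
SCHEMA = """
CREATE TABLE networks (id TEXT PRIMARY KEY);
CREATE TABLE subnets (id TEXT PRIMARY KEY, network_id TEXT);
CREATE TABLE networkrbacs (id TEXT PRIMARY KEY, object_id TEXT);
"""

# Inner query: networks joined to their RBAC entries, collapsed by GROUP BY,
# so it yields each network ID only once.
INNER = """
SELECT networks.id FROM networks
LEFT OUTER JOIN networkrbacs ON networkrbacs.object_id = networks.id
GROUP BY networks.id
"""


def count_rows(n_subnets, n_rbacs):
    """Return (rows without the 2nd join, rows with the 2nd join)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO networks VALUES ('net-1')")
    conn.executemany(
        "INSERT INTO subnets VALUES (?, 'net-1')",
        [(f"subnet-{i}",) for i in range(n_subnets)],
    )
    conn.executemany(
        "INSERT INTO networkrbacs VALUES (?, 'net-1')",
        [(f"rbac-{i}",) for i in range(n_rbacs)],
    )
    # Outer query selecting only the subnets of the filtered network IDs.
    plain = conn.execute(
        f"SELECT subnets.id FROM subnets WHERE subnets.network_id IN ({INNER})"
    ).fetchall()
    # Same outer query plus an ungrouped left outer join to networkrbacs:
    # every subnet row is repeated once per RBAC entry of its network.
    joined = conn.execute(
        f"""
        SELECT subnets.id, networkrbacs.id FROM subnets
        LEFT OUTER JOIN networkrbacs
            ON networkrbacs.object_id = subnets.network_id
        WHERE subnets.network_id IN ({INNER})
        """
    ).fetchall()
    conn.close()
    return len(plain), len(joined)


if __name__ == "__main__":
    print(count_rows(4, 5))  # prints (4, 20)
```

With 4 subnets and 5 RBAC entries on one network, the ungrouped join returns 4 x 5 = 20 rows instead of 4, so the result size grows with the product of both counts.
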
ralonsohMaxLamprecht[m], ok, now I can reproduce it. I'll check how to introduce a new group-by in the subnet query11:07
ralonsohykarel, the GW (HA) port of the L3 agent receives an empty host11:12
ralonsohSep 22 21:26:41.021000 np0035327482 neutron-server[57688]: DEBUG neutron.db.l3_hamode_db [None req-c532b479-c945-41c8-81ef-e6fb0530d8e2 None None] HA router e8e3b263-ff89-4e6d-b1c0-44abd2403282 gateway port 6d7bf419-238a-4af4-8f78-9c459e82e22c binding host: , status: DOWN {{(pid=57688) _get_gateway_port_host /opt/stack/neutron/neutron/db/l3_hamode_db.py:784}}11:12
ralonsohfrom https://89ad67c64b91ca5d8e4c-59f0897e687e7dfcd443cecd2cd3f6cf.ssl.cf1.rackcdn.com/896299/1/check/neutron-tempest-plugin-openvswitch/61cc665/job-output.txt11:13
ralonsohwell, the server logs11:13
opendevreviewElvira García Ruiz proposed openstack/neutron stable/2023.1: [OVN] Fix rate and burst for stateless security groups  https://review.opendev.org/c/openstack/neutron/+/89346011:16
MaxLamprecht[m]<ralonsoh> "Max Lamprecht, ok, now I can..." <- ahh nice, thought already that I'm doing something heavily wrong with my setup :D... (full message at <https://matrix.org/_matrix/media/v3/download/matrix.org/pccEHcGrurFBgriAJquxbjUD>)11:20
opendevreviewFelipe Figueroa Vergara proposed openstack/tap-as-a-service master: Enables external network mirroring  https://review.opendev.org/c/openstack/tap-as-a-service/+/89651511:21
ContinuityHey all, ran across something over the last week or so with openvswitch on zed. Deployed using kolla ansible, with ubuntu containers. When making use of hardware offloading, we have been hitting this bug https://bugs.launchpad.net/ubuntu/+source/openvswitch/+bug/201850011:39
Continuityutil(handler27)|EMER|../include/openvswitch/ofpbuf.h:194: assertion offset + size <= b->size failed in ofpbuf_at_assert() shown in the log11:40
Continuitycauses OVS to crash and restart causing a slight drop in connectivity. 11:40
Continuityjust thought I would mention it if anyone else has/is seeing the issue. 11:40
ContinuityIt's fixed in yoga and antelope, I have requested a backport fix to zed11:40
Continuityfor reference. https://bugs.launchpad.net/cloud-archive/+bug/203753211:41
opendevreviewAnton Vazhnetsov proposed openstack/ovsdbapp master: nb: add 'nexthop' argument to 'lr_route_del'  https://review.opendev.org/c/openstack/ovsdbapp/+/89664511:52
opendevreviewReason li proposed openstack/neutron master: Parameter filters may be None, which cannot be called with **  https://review.opendev.org/c/openstack/neutron/+/89664912:23
opendevreviewFelipe Figueroa Vergara proposed openstack/tap-as-a-service master: Enables external network mirroring  https://review.opendev.org/c/openstack/tap-as-a-service/+/89651512:46
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Allow multiple VIPs per LB  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/88511113:08
opendevreviewMohammed Naser proposed openstack/neutron master: [OVN] Match LSP_TYPE_VIRTUAL in PortBindingUpdateVirtualPortsEvent  https://review.opendev.org/c/openstack/neutron/+/89659013:11
mnaserralonsoh: thank you for your review, your fix is a lot cleaner lol13:12
ralonsohchecking now13:12
mnaserbut hey i'm improving my ovn chops13:12
ralonsohmnaser, do you feel you can implement a FT to avoid future regressions?13:13
ralonsohif not, I can push a new PS with this FT13:13
mnaserralonsoh: i tried to play around with that (i saw a `test_virtual_port_host_update`) but i think its a (little) outside of it13:14
mnaseri would appreciate if you could13:15
ralonsohok13:15
mnaserthank you!13:16
opendevreviewRodolfo Alonso proposed openstack/neutron master: WIP == Add policy enforcer for "tags" service plugin  https://review.opendev.org/c/openstack/neutron/+/89650913:19
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Match LSP_TYPE_VIRTUAL in PortBindingUpdateVirtualPortsEvent  https://review.opendev.org/c/openstack/neutron/+/89659013:48
ralonsohslaweq, hi! if you have 1 min: https://review.opendev.org/c/openstack/neutron/+/89344713:56
ralonsohthanks!13:56
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2023.2: Revert "[OVN][Trunk] Add port binding info on subport when parent is bound"  https://review.opendev.org/c/openstack/neutron/+/89589914:38
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2023.1: Revert "[OVN][Trunk] Add port binding info on subport when parent is bound"  https://review.opendev.org/c/openstack/neutron/+/89479114:38
opendevreviewRodolfo Alonso proposed openstack/neutron stable/zed: Revert "[OVN][Trunk] Add port binding info on subport when parent is bound"  https://review.opendev.org/c/openstack/neutron/+/89479214:39
opendevreviewRodolfo Alonso proposed openstack/neutron stable/yoga: Revert "[OVN][Trunk] Add port binding info on subport when parent is bound"  https://review.opendev.org/c/openstack/neutron/+/89479314:39
opendevreviewRodolfo Alonso proposed openstack/neutron stable/xena: Revert "[OVN][Trunk] Add port binding info on subport when parent is bound"  https://review.opendev.org/c/openstack/neutron/+/89479414:39
opendevreviewLajos Katona proposed openstack/networking-bagpipe master: DNM: test master  https://review.opendev.org/c/openstack/networking-bagpipe/+/89667714:44
opendevreviewAnton Vazhnetsov proposed openstack/ovsdbapp master: nb: add 'nexthop' argument to 'lr_route_del'  https://review.opendev.org/c/openstack/ovsdbapp/+/89664514:55
opendevreviewAnton Vazhnetsov proposed openstack/ovsdbapp master: venv: unit tests don't work for 'hardware_vtep'  https://review.opendev.org/c/openstack/ovsdbapp/+/89668215:36
ralonsohykarel, hi, did you find something?15:42
ralonsohthe only difference I see between a working job and a failing one is that the ovs agent (I know it is failing too in ML2/LB) is not binding the GW port15:43
ralonsohand thus the port is not set to up and the agent is not declared as active15:44
opendevreviewMerged openstack/ovn-octavia-provider stable/yoga: Check multiple address of a LRP plugged to LS  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/89626915:44
opendevreviewJakub Libosvar proposed openstack/neutron master: Introduce ovn_nb_global config section  https://review.opendev.org/c/openstack/neutron/+/89654516:29
mnaserralonsoh: seems like the original test has failed, i marked where it failed, i _suspect_ the old test was maybe not correct16:30
opendevreviewAnton Vazhnetsov proposed openstack/ovsdbapp master: nb: allow to add route even if learned route exists  https://review.opendev.org/c/openstack/ovsdbapp/+/89668616:30
ralonsohmnaser, I tested it manually and wasn't failing16:31
mnaserthe new test or the old one16:31
ralonsohboth16:31
mnaserah ok16:31
mnaserovn version dependent ?16:31
mnaserwell, we know the behaviour does depend on the ovn version actually16:31
mnaserseems like that job used v23.09.016:33
ralonsohthat patch was backported up to ovn-nbctl 22.0316:34
ralonsohbut what version in particular?16:34
haleybhttps://review.opendev.org/c/openstack/neutron/+/895849 just changed functional job to v23.09.016:35
ralonsohI'm running with the jammy package16:35
mnaseri think jammy with uca is different than jammy without uca16:35
ralonsohmnaser, ok, with this check and the new version, this check does not apply16:36
ralonsohbecause the port_binding is deleted but the port is not yet virtual16:36
ralonsohit is in the next event call16:36
mnaseryeah that's what i saw, you can see in the issue i put some logs of the events from neutron debug16:37
mnaserthat's kinda why i resorted to my solution cause i couldn't think of a way of detecting a "deleted because being recreated"16:37
ralonsohI'll reinstall my dev env with v23.09.016:37
mnaseryeah in my case I encountered this in 23.03.016:39
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Match LSP_TYPE_VIRTUAL in PortBindingUpdateVirtualPortsEvent  https://review.opendev.org/c/openstack/neutron/+/89659016:39
opendevreviewRodolfo Alonso proposed openstack/neutron master: WIP == Add policy enforcer for "tags" service plugin  https://review.opendev.org/c/openstack/neutron/+/89650916:56
opendevreviewMerged openstack/ovn-octavia-provider stable/zed: Cover the use case of a member non existing  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/89492617:02
opendevreviewAnton Vazhnetsov proposed openstack/ovsdbapp master: venv: unit tests don't work for 'hardware_vtep'  https://review.opendev.org/c/openstack/ovsdbapp/+/89668217:02
*** ralonsoh is now known as ralonsoh_ooo17:02
ykarelralonsoh, no haven't got the root cause for it17:04
ykareljust Error while reading HA state for <router id> Error while reading HA state for [Errno 2] No such file or directory: '/opt/stack/data/neutron/ha_confs/<router id>/state'17:05
ykarelUpdating server with state unknown for router <router id>17:05
ykarelbut maybe that's just the side effect of gw port not bound ?17:06
*** ykarel is now known as ykarel|away17:16
haleybykarel|away: that error might be a red herring, if keepalived has never been started for a router there should be no files. The errors about missing pid files are just as annoying and printed multiple times :-/17:38
mnaserralonsoh_ooo: wooo it passed this time around (well the functional test at least)17:49
mnaserhttps://review.opendev.org/c/openstack/neutron/+/896590 this is ready to review19:15
*** JayF is now known as Guest150520:10
*** JasonF is now known as jayf20:10
*** jayf is now known as JayF20:11
*** jlibosva is now known as Guest150620:11
opendevreviewJakub Libosvar proposed openstack/neutron master: functional: Enable test_fip_connection_for_address_scope test  https://review.opendev.org/c/openstack/neutron/+/89672820:54
opendevreviewAnton Vazhnetsov proposed openstack/ovsdbapp master: venv: unit tests don't work for 'hardware_vtep'  https://review.opendev.org/c/openstack/ovsdbapp/+/89668220:57
*** kopecmartin is now known as kopecmartin|off20:57