Friday, 2023-03-24

opendevreviewMiguel Lavalle proposed openstack/neutron master: [DNM] Add rate-limiting to metadata agents  https://review.opendev.org/c/openstack/neutron/+/85887900:11
prometheanfireI have a baremetal port that doesn't seem to be getting responses to dhcp requests, I can see the dhcp requests hitting the only chassis on the network, I'm unsure how to check ovn itself01:03
prometheanfirewell, more than checking that the port was assigned a chassis01:05
prometheanfireI see the dataflow but I don't see packets making it to the the namespace running the ovn metadata agent03:54
prometheanfireI don't think it's related to https://bugs.launchpad.net/neutron/+bug/2007167 but it's hard to tell, I'm not sure where the packet is getting dropped03:56
prometheanfireoffhand, the tftp next server is set correctly, but I don't see any dhcp responses in the first place, so dunno03:57
prometheanfireI do have enable_distributed_floating_ip enabled for ovn, but don't think that's related either03:59
prometheanfiremaybe because the network is external?04:06
tkajinamhi. is there any good documentation to understand the features implemented in ovn-agent (not ovn-metadata-agent) as of 2023.1 ?06:39
opendevreviewLuis Tomas Bolivar proposed openstack/neutron master: WIP: ensure redirect-type=bridged not used for geneve networks  https://review.opendev.org/c/openstack/neutron/+/87845006:51
hjensasdtantsur: afict the job does not use ML2 baremetal, mechanism_drivers = openvswitch in /etc/neutron/plugins/ml2/ml2_conf.ini. i.e there is no mechanism driver that bind vnic type: baremetal in this case.07:07
opendevreviewLuis Tomas Bolivar proposed openstack/neutron master: Add support for localnet_learn_fdb OVN option  https://review.opendev.org/c/openstack/neutron/+/87767507:08
opendevreviewLuis Tomas Bolivar proposed openstack/neutron master: Add support for localnet_learn_fdb OVN option  https://review.opendev.org/c/openstack/neutron/+/87767507:08
hjensasdtantsur: I don't think the failed binding is the issue, i.e like you mention this used to work without the port being bound properly before ML2 baremetal. Unless something changed, is wiring the FIP now depending on the port being bound?07:11
ralonsohtkajinam, hi, this is now an empty shell. The plan during this cycle is to move every agent related stuff to this agent08:29
*** elvira2 is now known as elvira08:30
ralonsohthere is one feature implemented but only applies for HWOL environments that need QoS08:30
ralonsohvery specific08:30
tkajinamralonsoh, ah, ok. thanks.08:51
slaweqralonsoh lajoskatona ykarel hi, please check https://review.opendev.org/c/openstack/neutron/+/876556 when You will have some time09:02
ralonsohsure09:02
slaweqit seems it helps with memory consumption in CI jobs in many projects so we can use it too09:02
opendevreviewLuis Tomas Bolivar proposed openstack/neutron master: Ensure redirect-type=bridged not used for geneve networks  https://review.opendev.org/c/openstack/neutron/+/87845009:18
opendevreviewLuis Tomas Bolivar proposed openstack/neutron master: Ensure redirect-type=bridged not used for geneve networks  https://review.opendev.org/c/openstack/neutron/+/87845009:28
opendevreviewMerged openstack/ovn-octavia-provider stable/yoga: Fix broken pep8 jobs due to bandit 1.7.5 updated version  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/87746410:46
opendevreviewRodolfo Alonso proposed openstack/neutron master: [sqlalchemy-20] Do not use strings for aatribute names in loader options  https://review.opendev.org/c/openstack/neutron/+/87848010:47
opendevreviewFrode Nordahl proposed openstack/neutron-lib master: ext-gw-multihoming: api-def and api-ref  https://review.opendev.org/c/openstack/neutron-lib/+/87088711:44
opendevreviewRodolfo Alonso proposed openstack/neutron master: [sqlalchemy-20] Provide SQL "case" expression correct input paremeters  https://review.opendev.org/c/openstack/neutron/+/87852611:48
opendevreviewLuis Tomas Bolivar proposed openstack/neutron master: Ensure redirect-type=bridged not used for geneve networks  https://review.opendev.org/c/openstack/neutron/+/87845011:50
opendevreviewFrode Nordahl proposed openstack/neutron master: Allow Multiple External Gateways  https://review.opendev.org/c/openstack/neutron/+/87359311:54
opendevreviewFrode Nordahl proposed openstack/neutron master: Add extra router attributes for ECMP and BFD  https://review.opendev.org/c/openstack/neutron/+/87479711:54
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Add end to end test for QosExtension  https://review.opendev.org/c/openstack/neutron/+/87760311:54
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Drop use of OVN_GW_PORT_EXT_ID_KEY  https://review.opendev.org/c/openstack/neutron/+/87783111:54
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Drop use of LR OVN_GW_NETWORK_EXT_ID_KEY  https://review.opendev.org/c/openstack/neutron/+/87771211:54
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Implement support for external-gateway-multihoming extension  https://review.opendev.org/c/openstack/neutron/+/87419911:54
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Allow L3 scheduler to be aware of current transaction  https://review.opendev.org/c/openstack/neutron/+/87476011:54
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Add helper for retrieving LR associated with LRP  https://review.opendev.org/c/openstack/neutron/+/87369811:54
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Apply soft anti-affinity for LRs with multiple LRPs when scheduling  https://review.opendev.org/c/openstack/neutron/+/87369911:54
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] OVNClient._get_router_ports: Drop unused parameter  https://review.opendev.org/c/openstack/neutron/+/87852711:54
opendevreviewLuis Tomas Bolivar proposed openstack/neutron master: Ensure redirect-type=bridged not used for geneve networks  https://review.opendev.org/c/openstack/neutron/+/87845011:54
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Implement support for external-gateway-multihoming extension  https://review.opendev.org/c/openstack/neutron/+/87419913:52
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Allow L3 scheduler to be aware of current transaction  https://review.opendev.org/c/openstack/neutron/+/87476013:52
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Add helper for retrieving LR associated with LRP  https://review.opendev.org/c/openstack/neutron/+/87369813:52
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Apply soft anti-affinity for LRs with multiple LRPs when scheduling  https://review.opendev.org/c/openstack/neutron/+/87369913:52
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Honor `enable_default_route_ecmp` attribute  https://review.opendev.org/c/openstack/neutron/+/87853113:52
opendevreviewRodolfo Alonso proposed openstack/neutron master: Remove the ``OVNSqlFixture`` class workaround  https://review.opendev.org/c/openstack/neutron/+/87466914:15
opendevreviewRodolfo Alonso proposed openstack/neutron master: Replace "tenant_id" with "project_id" in IPAM engine  https://review.opendev.org/c/openstack/neutron/+/87753314:17
opendevreviewRodolfo Alonso proposed openstack/neutron master: Improve "sync_ha_chassis_group" method  https://review.opendev.org/c/openstack/neutron/+/87202314:18
opendevreviewRodolfo Alonso proposed openstack/neutron master: WIP - Add ``OVNGatewayHAChassisGroup`` scheduler class  https://review.opendev.org/c/openstack/neutron/+/87203314:18
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Remove backwards compatibility with OVN < v20.09  https://review.opendev.org/c/openstack/neutron/+/87062114:23
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] OVN agent should register "Chassis_Private" by default  https://review.opendev.org/c/openstack/neutron/+/87853514:34
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2023.1: [OVN] OVN agent should register "Chassis_Private" by default  https://review.opendev.org/c/openstack/neutron/+/87853614:35
fricklerin my downstream CI, I'm seeing neutron startup issues with yoga for some time now. log looks like https://paste.opendev.org/show/bfIlfudaBGuZUsTSBoAf/ after the crash, neutron restarts and is working fine, but in the meantime CI has failed14:38
fricklerif wait and retry things like port creation some minutes later, everything is fine. any clue on this?14:38
fricklernow happening 100%, mayby 30% or so, likely some kind of race. deployment is on 3 nodes if that matters14:39
fricklers/now/not14:39
opendevreviewLuis Tomas Bolivar proposed openstack/neutron master: Ensure redirect-type=bridged not used for geneve networks  https://review.opendev.org/c/openstack/neutron/+/87845014:42
ykarel_frickler, what's is the neutron version there, latest yoga?14:51
ykarel_20.3.014:51
opendevreviewMohammed Naser proposed openstack/neutron master: fix: add log message for periodic_sync_routers_task fullsync  https://review.opendev.org/c/openstack/neutron/+/87824814:53
fricklerykarel_: latest stable/yoga14:53
ykarel_frickler, i recalled there were couple of fixes in 20.3.0 but if latest then it can be something else15:06
opendevreviewMerged openstack/neutron master: [OVN] Remove "update_port_qos_with_external_ids_reference"  https://review.opendev.org/c/openstack/neutron/+/87410515:06
prometheanfirefor ovn, is there an issue with baremetal ports not being responded to for dhcp requests on external networks?  I see requests but no responses from the only chassis on the network's physical interface15:06
ykarel_like https://review.opendev.org/c/openstack/neutron/+/865159, https://review.opendev.org/c/openstack/neutron/+/85777315:07
ykarel_also seeing the hash empty, seems there is some issue with database15:08
ralonsohykarel_, well, this is the ovsdbapp result15:10
ralonsohcause: Result queue is empty"15:10
ralonsohthis could be an issue with the transaction15:11
fricklercan there be an issue with 865159 if multiple neutron-servers do their initial startup in parallel? just reading the commit message that sounds like a possible issue15:11
fricklerI don't think that scenario is tested in CI, either15:11
ralonsohfrickler, when the Neutron server is started, is the OVN DB up?15:16
ralonsohprometheanfire, the DHCP for baremetal ports is provided in the Neutron controllers. Is there an ovn metadata agent running there?15:17
ralonsohand do you see the corresponding namespace (with the network name)?15:17
ralonsohnetwork ID15:17
sahidbtw ralonsoh are you ok to unblock https://review.opendev.org/c/openstack/neutron/+/871113 ?15:20
ralonsohyes, done15:20
sahidralonsoh: thank you :-)15:21
fricklerralonsoh: how could I check that?15:23
ralonsohfrickler, is the ovsdb service running?15:25
prometheanfireralonsoh: ya, the namespace is there, running haproxy15:25
ralonsohprometheanfire, and do you see the packets arriving to the namespace?15:26
prometheanfireralonsoh: no packets from the external network, only packets from the VMs on the host (controler is on a hype)15:26
ralonsohthat's a problem, let me check with lucasagomes if I'm right on this: the metadata agent port in the Neutron controller should be the one providing the IP for baremetal ports15:28
fricklerralonsoh: ovn and ovs should be started well before neutron, yes, like some minutes. nothing obvious in their logs, too15:28
ralonsohprometheanfire, in any case, please report a launchpad bug describing the issue (and the version used)15:29
ralonsohwe'll triage it ASAP15:29
prometheanfireralonsoh: ya, it's something that's getting beyond my ability to troubleshoot lol, tried looking at flows, etc15:29
prometheanfirekk15:29
ralonsohfrickler, let me check if this is a possible error in ovsdbapp15:32
ralonsohfrickler, what version of ovsdbapp do you have?15:33
lucasagomesralonsoh, hi there, lemme read15:35
lucasagomesralonsoh, yes, that sounds correct to me15:36
ralonsohok, so maybe we have a bug there. Now the point is why the metdata namespace is not receiving the dhcp requests15:36
lucasagomesralonsoh, cause for ports with VNIC_BAREMETAL, what ML2/OVN does is to create a port of type "external" which will be bond to a controller instead of the compute node15:37
lucasagomesI say controller, but it will be bond to a chassis in OVN with the "enable-gw-as-chassis" flag15:37
lucasagomesenable-chassis-as-gw* sorry15:37
ralonsohprometheanfire, ^^ do you have this flag?15:37
frickleractually I just notice that the deployment uses 20.2.0, so it doesn't have the patch ykarel_ mentioned. I need to check that, sorry for the confusion15:38
lucasagomeshttps://docs.openstack.org/neutron/latest/admin/ovn/external_ports.html15:38
prometheanfireralonsoh: yes, I checked that :D15:38
prometheanfirechecked via 'ovs-vsctl get open . external-ids:ovn-cms-options'15:39
ralonsohfrickler, that makes more sense because we use "ovsdb-client" commands15:39
ralonsohprometheanfire, and can you dump the traffic up to the controller? just to know where this traffic is dropped15:39
ralonsohthe dhcp request I mean15:39
prometheanfiretcpdump on the controler shows the dhcp request hitting the expected interface15:40
prometheanfirenot sure where the next place to dump would be15:40
ralonsohyou mean the interface inside the metadata namespace?15:41
prometheanfireno, outside the namespace15:41
ralonsohok, the baremetal port dhcp request should reach the Neutron controller15:41
prometheanfireinside the namespace I just see a bunch of arp-who-has and nothing else (which is odd because ssh is running on that network)15:42
ralonsohand the first interface should be the external bridge interface15:42
lucasagomesprometheanfire, so you are usign ML2/OVN with Neutron DHCP for baremetal ? 15:42
lucasagomesprometheanfire, make sure u have the disable_ovn_dhcp_for_baremetal_ports config option set to True for that15:42
prometheanfireml2/ovn with ovn dhcp15:42
lucasagomesah ok15:43
prometheanfirenot using the 'old' neutron dhcp15:43
lucasagomessure15:43
ykarel_frickler, so then would be better to update to 20.3.0 as that contains couple of those fixes15:44
ralonsohand can you check in the NB database that you have a logical_switch_port with type "external" that matches the ID of the neutron port15:44
prometheanfireralonsoh: ya, nb database had a switch port labeled external15:46
ralonsohand what about the dhcp request in the external bridge interface?15:47
prometheanfireI think I see a configuration problem on my end15:48
prometheanfiremy network mappings map to non-existant bridges/veth pairs15:48
opendevreviewRodolfo Alonso proposed openstack/neutron master: Revert "Ensure vlan network traffic is not centralized"  https://review.opendev.org/c/openstack/neutron/+/87844115:49
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2023.1: Revert "Ensure vlan network traffic is not centralized"  https://review.opendev.org/c/openstack/neutron/+/87844215:49
prometheanfirefat finger, probably...15:49
prometheanfirelet me fix that then try again (making sure the correct veth pair is used at least15:49
opendevreviewRodolfo Alonso proposed openstack/neutron stable/zed: Revert "Ensure vlan network traffic is not centralized"  https://review.opendev.org/c/openstack/neutron/+/87844315:50
prometheanfirethe ovs side name being wrong is a naming issue as long as the data flows into it correctly15:50
opendevreviewRodolfo Alonso proposed openstack/neutron stable/yoga: Revert "Ensure vlan network traffic is not centralized"  https://review.opendev.org/c/openstack/neutron/+/87844415:50
opendevreviewRodolfo Alonso proposed openstack/neutron master: Revert "Ensure vlan network traffic is not centralized"  https://review.opendev.org/c/openstack/neutron/+/87844115:50
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2023.1: Revert "Ensure vlan network traffic is not centralized"  https://review.opendev.org/c/openstack/neutron/+/87844215:50
opendevreviewRodolfo Alonso proposed openstack/neutron stable/xena: Revert "Ensure vlan network traffic is not centralized"  https://review.opendev.org/c/openstack/neutron/+/87844515:51
opendevreviewRodolfo Alonso proposed openstack/neutron stable/wallaby: Revert "Ensure vlan network traffic is not centralized"  https://review.opendev.org/c/openstack/neutron/+/87844615:51
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVS][QoS] Add QoS support for Trunk service, OVS driver  https://review.opendev.org/c/openstack/neutron/+/83952315:56
prometheanfireralonsoh: ok, with that fixed I do see dhcp requests in that namespace, but only for other interfaces, the one configured got filtered out, trying with the mac I saw in the namespace (updated the baremetal port address)16:04
prometheanfireralonsoh: switching the port address makes that address be filtered, could it be a security group filtering dhcp?16:07
prometheanfireor port security16:09
ralonsohdhcp packets are accepted always16:09
ralonsohso now you see the DHCP requests reaching the namespace interface, right?16:09
prometheanfirehmm, whatever the (mac) address I configure for the baremetal port gets filtered and do not reach the namespace interface16:10
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Implement support for external-gateway-multihoming extension  https://review.opendev.org/c/openstack/neutron/+/87419916:10
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Honor `enable_default_route_ecmp` attribute  https://review.opendev.org/c/openstack/neutron/+/87853116:10
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Allow L3 scheduler to be aware of current transaction  https://review.opendev.org/c/openstack/neutron/+/87476016:10
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Add helper for retrieving LR associated with LRP  https://review.opendev.org/c/openstack/neutron/+/87369816:10
opendevreviewFrode Nordahl proposed openstack/neutron master: [ovn] Apply soft anti-affinity for LRs with multiple LRPs when scheduling  https://review.opendev.org/c/openstack/neutron/+/87369916:10
opendevreviewFrode Nordahl proposed openstack/neutron master: WIP [ovn] Add support for enable_default_route_bfd attribute  https://review.opendev.org/c/openstack/neutron/+/87854316:10
prometheanfirethe namespace interface sees DHCP requests for other mac addresses16:10
ralonsohbut the mac address should be the same as the one in the neutron port database16:10
ralonsohfor other mac address? 16:11
prometheanfirethe mac address it sees is the one not configured for the neutron port16:11
prometheanfirethe mac address configured for the neutron port seems like it's dropped before reaching the namespace16:11
ralonsohthe baremetal port mac address and the DB neutron port mac address must be the same16:11
ralonsohno no16:11
prometheanfirethey are the same16:11
ralonsohthe dhcp request cannot have other mac16:12
ralonsohif so, dhcp server won't reply to the correct mac16:12
prometheanfireI'm saying that the mac address configured dhcp request is not seen, only ports that are not managed by openstack packets get forwared to the namespace16:12
prometheanfirethe server sending requests tries one interface, fails to get a reply then tries the other interface16:13
prometheanfireserver sends dhcp request from ironic/neutron port (mac address matches), packet reaches ovn controller host but does not reach the namespace16:14
ralonsohso the packet is dropped in ovs16:14
prometheanfireyes, that's what it seems like16:15
prometheanfireok, did a tcpdump all along the path, a dhcp reply is being sent before hitting the namespace, so maybe ovs handles it before putting the packet on the namespace network16:19
prometheanfirenow to watch closer...16:19
prometheanfirehmm, ok, I think we are good?  the host isn't responding to whatever is given as the dhcp response (it gets a response that it just may not be happy with16:23
ralonsohso the dhcp reply is reaching the baremetal port16:24
ralonsohlet me check one patch related to this16:24
prometheanfireyes, I think it's short-circuited in ovs somewhere, it's the correct response though, options are set16:24
ralonsohprometheanfire, what version are you running?16:25
ralonsohdo you have this patch: https://review.opendev.org/q/I59038639a8411c11c5fb8b366d9c858ef3db4f7016:26
prometheanfireof what?16:26
ralonsohNeutron version16:26
prometheanfireyes16:26
prometheanfireoption 150 is sent16:26
ralonsohso at this point you need to check why the baremetal server is not accepting this dhcp reply16:27
prometheanfireI'm thinking the hardware doesn't like ipxe, neutron part seems like it's working now from what I can see16:27
prometheanfireyep16:27
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Remove backwards compatibility with OVN < v20.09  https://review.opendev.org/c/openstack/neutron/+/87062116:59
opendevreviewMiro Tomaska proposed openstack/neutron master: Fix intermittent failures in finding metada port in SB DB  https://review.opendev.org/c/openstack/neutron/+/87854917:03
*** ministry is now known as __ministry17:09
opendevreviewRodolfo Alonso proposed openstack/neutron master: Increase port name size and type to internal  https://review.opendev.org/c/openstack/neutron/+/87311817:37
*** Guest8553 is now known as atmark17:48
*** atmark is now known as Guest875517:49
*** Guest8755 is now known as atmark17:58
opendevreviewMerged openstack/neutron master: [OVS] Parse the "permitted_ethertypes" at the FW initialization  https://review.opendev.org/c/openstack/neutron/+/87699719:19

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!