Friday, 2020-07-17

*** hamalq_ has quit IRC00:05
*** artom has quit IRC00:11
openstackgerritMerged openstack/nova master: libvirt: Add vTPM config support  https://review.opendev.org/73920900:17
openstackgerritMerged openstack/nova master: zuul: remove legacy-tempest-dsvm-neutron-dvr-multinode-full  https://review.opendev.org/74044700:31
*** xek_ has quit IRC00:32
*** gyee has quit IRC00:33
*** tetsuro has joined #openstack-nova00:33
*** rcernin has quit IRC00:34
*** rcernin has joined #openstack-nova00:39
*** zhanglong has joined #openstack-nova01:01
*** Liang__ has joined #openstack-nova01:10
*** huaqiang has quit IRC01:14
sean-k-mooneymnaser: that general indeicates an issue with neutron or rabbitmq performance01:29
mnasersean-k-mooney: yeah, so based on my log tracing, the PUT is sent at :33 but n-ovs-agent doesnt actually plug until :3701:29
sean-k-mooneythe PUT?01:30
mnasersorry, the PUT from n-cpu to neutron to plug the port01:30
sean-k-mooneythat is not pluging01:30
sean-k-mooneythat is port binding01:31
sean-k-mooneythey are two differnt things01:31
sean-k-mooneybinding happens before pluging starts01:31
mnasersean-k-mooney: what triggers the plug?01:31
sean-k-mooneyafter teh ports are bound nova plugs the interface which in the case of ovs is the act of createing the ovs port01:32
*** yaawang has quit IRC01:33
sean-k-mooneywhen the port is created the external_id in other_cofnig on the ovs interface is set to the neutron port uuid01:33
sean-k-mooneythat triggeres an event on a unix socket that the l2 agent is monitoring01:33
sean-k-mooneyand then the l2 agent start wiring up the port01:33
mnaserthe PUT is the only operation i'm seeing towards neutron-server which isn't read-only (aka not a GET)01:33
*** yaawang has joined #openstack-nova01:33
sean-k-mooneywhen the port is wired up and the dhcps agnet has also updated the dhcp config the pluging evnet is sent01:34
sean-k-mooneymnaser: nova does not talk to neutron during plugin01:34
sean-k-mooneyit configure the network backend and neutron watches for that01:34
sean-k-mooneywhen the l2 agent finsihs it send a rpc to the neutron server01:34
sean-k-mooneyand that trivers teh event to nova01:35
sean-k-mooney*triggers01:35
mnaserso n-ovs-agent waits for os_vif to do its thing, and then when it sees a new port, it starts the process of wiring it up (aka the whole vlan dance), tells neutron its plugged and then neutron-server does the notify http call?01:35
sean-k-mooneyyes01:36
mnaser(fwiw this is stein)01:36
sean-k-mooneyyep this dance has not change since neutron was qunantum01:36
sean-k-mooneyat least not for spwan01:36
mnaserTIL a lot01:36
sean-k-mooneylive migration has changed but spawn is still the same01:37
mnaserok so once the port is setup by nova and it starts waiting for external event, n-ovs-agent should technically start acting right away01:37
sean-k-mooneyyes on older version when the l2 agent was using vsctl instead of the ovs python binding01:37
sean-k-mooneyit used to poll for the port updates01:37
sean-k-mooneyso there was more latency but it was minimal01:38
mnasersean-k-mooney: well in that case, the theory is that n-ovs-agent is taking a long time to notice the port appearing01:38
sean-k-mooneymnaser: the condition for sending the network_vif_plugged event have 3 parts01:39
sean-k-mooney1 the prot is bound and active on a host01:39
sean-k-mooney2 the agent has finishiend its vlan dance01:39
sean-k-mooney3 the dhcp agent has complted configing dhcp01:39
mnasersean-k-mooney: because when i grep n-ovs-agent logs, i see the first 'record' of something happening with that port id is 21:37, few seconds before nova gets notified.  that tells me that n-ovs-agent didn't see the port right away?01:39
sean-k-mooney3 can get very slow at scale01:39
mnaserah01:40
mnaserso maybe #3 is the issue here01:40
mnaserfwiw, this is doing like01:40
mnaseralmost 250 near-concurrent port plugs01:40
sean-k-mooneymnaser: i would the time stamps for 1 os-vif adding the port to ovs, 2 l2 agent seeign the port, 3 dhcp agent seting it up01:41
sean-k-mooneymnaser: ah ha... and that causes issues :)01:41
sean-k-mooneylet me see if i have any of the down stream bugs01:41
sean-k-mooneyi know our networkg folk have had issue form customer related to the dhcp agent01:41
sean-k-mooneyit might not be the same but it sound similar01:42
mnaseroh this is interesting01:42
mnaserhttps://www.irccloud.com/pastebin/vjt0J52B/01:43
mnasersean-k-mooney: almost 3 minutes for os_vif to plug the port?01:43
mnaserbut i guess, "Plugging" is the step 101:43
mnaserand then "Plugged" happens after getting the external event01:43
sean-k-mooneyyes i think so but i cna check quickly where that log is01:44
mnaserthe first 'reference' to that port id in the neutron logs is at 2020-07-16 21:37:27.517 (port xx updated)01:44
mnaser(neutron ovs logs that is)01:44
mnaser2020-07-16 21:37:34.415 3862298 INFO neutron.plugins.ml2.drivers.openvswitch.agent.ovs_neutron_agent [req-bda6aed7-6f44-474a-9e4a-ff6082e7bc9e - - - - -] Configuration for devices up ['66278068-7509-40cb-8773-ed4d36801723'] and devices down [] completed.01:45
mnaserso the 'successfully plugged' comes in 15 seconds after neutron says im all good01:45
sean-k-mooneyso pluging started at 40,342,70401:46
sean-k-mooneythat didnt work01:47
sean-k-mooney2020-07-16 21:34:21.72601:47
sean-k-mooneyso ya the l2 agent too 3 minutes01:47
mnaseros_vif considers it plugged when the external id gets filled in?01:48
sean-k-mooneyno01:49
sean-k-mooneyhttps://github.com/openstack/os-vif/blob/8042e41f1bfc4b4d279934a0bed94f77454faeff/os_vif/__init__.py#L7801:49
sean-k-mooneythat message gets retruned when we finish adding the port to ovs01:49
mnaserso it took us 3 minutes to simply add a port into ovs o_O01:50
sean-k-mooneythat is what it looks like01:50
sean-k-mooneymnaser: are you using the ovs-vsctl backend or python bindings01:50
mnasersean-k-mooney: i haven't touched it -- but looking at the code, os_vif defaults to ovs-vsctl but neutron switched to python bindings a while back01:51
sean-k-mooneythe default is vsctl01:51
sean-k-mooneyhttps://github.com/openstack/os-vif/blob/stable/stein/vif_plug_ovs/ovs.py#L65-L6801:51
sean-k-mooneyin stien01:51
mnaserim looking at the ovs logs01:52
sean-k-mooneyactully its still the defaul... i really though i changed that before neutron did01:52
mnaseri guess that means01:53
mnaseri can grep the logs to find an ovs-vsctl command01:53
sean-k-mooneyyes but i know where to check the code :) im wondering if we are using --no-wait. i think we are01:54
mnaserthis is interesting though https://www.irccloud.com/pastebin/DERdqYGW/01:55
sean-k-mooneywell its making an external call so eventlit will context switch and start running other code01:56
mnasersean-k-mooney: with debug=true, im not seeing any executions grepping vsctl or ovs01:56
sean-k-mooneyhttps://github.com/openstack/os-vif/blob/stable/ussuri/vif_plug_ovs/ovsdb/impl_vsctl.py#L333-L33501:57
sean-k-mooneyim not sure we are using --no-wait01:57
sean-k-mooney--no-wait return when the port is added to the db01:57
sean-k-mooneywithout it ovs-vsctl blocks for the dataplane to acknolage that it has create teh port01:58
mnaseri guess i have to find privsep logs01:58
mnasercause n-cpu doesnt contain any execution logs01:59
sean-k-mooneythey are part of the nova compute logs01:59
*** yaawang has quit IRC01:59
mnaserps show 25% cpu usage for the privsep proc for ovs02:00
sean-k-mooneywe reduce the privsep logs to info by default https://github.com/openstack/nova/blob/master/nova/config.py#L5802:00
mnaserah that's probably why it's not visible02:01
sean-k-mooneyya prvisep in debug mode dupms sensitive info live vm console to the logs02:01
sean-k-mooneyso we do not enable privsep debug logs wehn you enable nova's debug logging02:01
sean-k-mooneyanyway there are a few things that could be happening.02:02
*** yaawang has joined #openstack-nova02:02
*** mriedem has left #openstack-nova02:02
sean-k-mooney1 nova could be bussy and wehn we context switch away it might take a while before we get back02:02
sean-k-mooney2 ovs could be taking a long time to add the port02:03
sean-k-mooneyif it is you would see this in the ovs db log02:03
sean-k-mooneyit will have a log of when the port add is started and complete i belive02:03
mnaserovsdb-server has no output (well, almost nothing) but vswitchd has a bunch02:04
melwittI thought it was pluggin'02:04
sean-k-mooneymelwitt: it is02:04
sean-k-mooneybut there is a 3 minitu gap betwen it starting and finishing02:05
melwittI know, just replying super late02:05
mnaseri see a bunch of this: `ovs_rcu(urcu5)|WARN|blocked 1000 ms waiting for main to quiesce`02:05
sean-k-mooneymnaser: in the vswitchd log you should seee somthign like this02:06
sean-k-mooney2020-05-06T19:52:07.865Z|00031|bridge|INFO|bridge br-int: added interface tap36660a66-00 on port 302:06
mnaserah, yes02:06
sean-k-mooneyi think you are looking for tap66278068-7502:07
mnaser2020-07-16T21:37:25.283Z|11608|bridge|INFO|bridge br-int: added interface qvo66278068-75 on port 3230702:07
sean-k-mooneyah you are using iptables02:08
sean-k-mooneynot the contrack security group driver?02:08
mnaseryes, there's a plan to move towards openvswitch driver02:08
mnaseriptables_hybrid right now02:08
sean-k-mooneyyep that is why the interface name is diffrent02:08
sean-k-mooneygiven it started plugging at 21:34:21.72602:09
sean-k-mooneythat took a while02:09
sean-k-mooneyplugging finished at 21:37:49.94302:10
sean-k-mooneythere is still a 24 second gap but that is more resonably02:11
mnaserso either: nova is 'distracted' doing something else and not actually running ovs-vsctl commands?02:11
mnaseror the ovs-vsctl command is actually taking 3 minutes02:11
sean-k-mooneyit could be that its in the queue of peneding funciton to execute in privsep for a while too02:12
mnaserthat's very likely because02:13
mnaserhttps://www.irccloud.com/pastebin/hv0DDE8m/02:13
mnaser25% cpu usage for privsep sounds off02:13
*** markvoelker has joined #openstack-nova02:13
sean-k-mooneyi dont know didnt you say your creating 250 port/vms at the same time02:14
sean-k-mooneybut i think stein predates privsep multi threading02:14
mnaseri mean, they're not all 250 ports there, but i've seen a case of 15 vms spawning at teh same time here02:14
mnaserso 15 vms * 5 ports each = 75 ports being attached together at elast02:15
sean-k-mooneyso there are a few things you could try02:18
sean-k-mooneyif i am remeberign correctly the native ovsdb interface in os-vif does not use privsep02:18
*** markvoelker has quit IRC02:18
mnaserhttps://docs.openstack.org/releasenotes/oslo.privsep/stein.html -- "Privsep now uses multithreading to allow concurrency in executing privileged commands. The number of concurrent threads defaults to the available CPU cores, but can be adjusted by the new thread_pool_size config option."02:19
mnaserand i have privsep 1.32.1 -- cool, so that's taken care of -- i think moving to native ovsdb is probably much better/cleaner02:20
sean-k-mooneyyou can set ovsdb_interface="native" in the nova.conf02:20
sean-k-mooneybut im just checkign the group02:20
sean-k-mooneyi think its somthign like vif_plug_ovs but give me a sec02:21
sean-k-mooneyi also need to chagne the default on master02:21
*** zhanglong has quit IRC02:21
mnasersean-k-mooney: https://github.com/openstack/os-vif/blob/d588708f2149b4503e63a2c2165201c9fe399bdb/vif_plug_ovs/tests/functional/ovsdb/test_ovsdb_lib.py#L54 tests show os_vif_ovs02:21
sean-k-mooney ah yes [os_vif_ovs]02:22
*** zhanglong has joined #openstack-nova02:22
mnaserok, setting to native02:23
*** sapd__x has joined #openstack-nova02:23
mnaserthe plugs that happen on start up were also not very fast, taking ~5s each02:23
mnaserperhaps this will show difference02:23
mnaser2020-07-17 02:23:44.773 1550960 DEBUG ovsdbapp.backend.ovs_idl.vlog [-] tcp:127.0.0.1:6640: entering ACTIVE _transition /openstack/venvs/nova-19.0.8/lib/python2.7/site-packages/ovs/reconnect.py:48502:24
sean-k-mooneyah this is is where that gets generateed https://github.com/openstack/os-vif/blob/master/os_vif/plugin.py#L7902:24
sean-k-mooneymnaser: that looks like its connect with the native backend alright02:24
mnaseryeah but nova-compute is pegged at 100% cpu now on start up02:25
mnasertime to see what its doing02:25
sean-k-mooneywell it does a lot02:25
sean-k-mooneyincluding pluging all the port of every vm on the host02:25
sean-k-mooneybut its going to run all the reousce tracker stuff before that form init_host02:25
mnaserjust sent USR202:26
sean-k-mooneyit that the guru meditation report02:26
mnaseryeah02:27
sean-k-mooneyya i have no idea how to read those02:27
mnaseroddly enough though, its still taking 5 seconds to plug a port on start up02:27
sean-k-mooneyhas the privsep load dropped02:28
sean-k-mooneyi dont know if ovsdbapp uses privsep internaly but os-vif nolonger need to use prvisep for the ovs db updates at least02:29
mnaseroooou i have an idea02:29
mnaseri think ipv6 being enabled is hurting this host02:29
sean-k-mooneyit still needs privsep for other thngs02:29
sean-k-mooneyoh hum02:29
mnaserseems like it was stuck on /openstack/venvs/nova-19.0.8/lib/python2.7/site-packages/nova/virt/libvirt/driver.py:649 in _check_my_ip02:29
mnaseron start up02:29
mnaserim going to time that02:30
sean-k-mooneyya maybe02:30
mnaserspecifically `ips = compute_utils.get_machine_ips()`02:30
mnaseryep, just ran that and its eating up a lot of cpu time02:30
sean-k-mooneythat might be trying to do a revers dns lookup02:30
mnaser# ip addr | grep inet | wc -l => 74602:31
sean-k-mooneyam im going to have to call it a night but ill be around in a coule of hour.02:31
sean-k-mooneyoh all the link local addresses02:31
sean-k-mooneyon all the tap devices02:31
sean-k-mooneymnaser: artom saw an issue with this before02:32
mnaserlen(netifaces.interfaces()) => 97302:32
mnaseryeah, i think i'm onto something here02:32
sean-k-mooneyi think he fixed it but maybe not on master02:32
sean-k-mooney* stien02:32
sean-k-mooneymaybe we didnt backport it02:32
mnasert = time.time(); netifaces.ifaddresses('qvo64d61663-06'); print(time.time() - t) => 133ms02:34
mnaser973*133ms = 129s02:35
sean-k-mooneyhttps://review.opendev.org/#/c/672161/02:35
mnaserwe should probably just skip qbr/qvo/qvb/tap in there02:35
sean-k-mooneyya proably02:36
sean-k-mooneyso artom moved it to init_host to fix https://bugs.launchpad.net/nova/+bug/183707502:36
openstackLaunchpad bug 1837075 in OpenStack Compute (nova) stein "Evacuation takes too long when destination host has a large number of NICs" [Low,Fix committed] - Assigned to Artom Lifshitz (notartom)02:36
mnaserbut instead it takes a long time to init_host :)02:36
sean-k-mooneyyes but we used to call it at runtime several times02:37
sean-k-mooneyso we look it up once now and cache it02:37
sean-k-mooneywe used to call it every time we called get_host_ip_addr02:38
sean-k-mooneywhich we did for all move operations02:38
mnaseryeah, perhaps that function could use skipping nova-managed devices02:38
sean-k-mooneythis is all its used for02:39
*** zhanglong has quit IRC02:39
sean-k-mooney  ips = compute_utils.get_machine_ips()02:39
sean-k-mooney        if CONF.my_ip not in ips:02:39
sean-k-mooney            LOG.warning('my_ip address (%(my_ip)s) was not found on '02:39
sean-k-mooney                        'any of the interfaces: %(ifaces)s',02:39
sean-k-mooney                        {'my_ip': CONF.my_ip, 'ifaces': ", ".join(ips)})02:39
sean-k-mooneyit validating that the ip we set in the config if its set is on the host02:39
sean-k-mooneythere is proably a better way to do that02:39
* sean-k-mooney that might be an understatement...02:40
mnaseryeah i cant find a better thing but i think skipping qbr/qvo/qvb/tap might speed it up.. a lot02:40
sean-k-mooneywhere you debuging startup latency02:40
*** zhanglong has joined #openstack-nova02:41
sean-k-mooneyor were you debuging an ongoing issue with new vms02:41
sean-k-mooneythis will be a nice fix but it wont affect the later02:41
mnaseryeah i agree it wont affect the latter, i just switched to native and running the 'benchmark' now against the cloud02:41
sean-k-mooneyok well if it helps or not let me know. ill be back in 8 hours or so after i get some sleep02:43
sean-k-mooneyo/02:43
mnaseryeah, i think i'll leave it to get soak tested overnight with those changes02:44
mnaserbut we're onto something ,thanks sean-k-mooney02:44
sean-k-mooneyno worries02:45
*** mkrai has joined #openstack-nova02:54
*** tetsuro has quit IRC03:13
*** psachin has joined #openstack-nova03:37
*** zhanglong has quit IRC03:47
*** markvoelker has joined #openstack-nova03:48
*** markvoelker has quit IRC03:53
*** vishalmanchanda has joined #openstack-nova04:01
*** huaqiang has joined #openstack-nova04:12
openstackgerritHuaqiang Wang proposed openstack/nova master: hardware: create 'mixed' instance for realtime CPUs  https://review.opendev.org/72848504:21
openstackgerritHuaqiang Wang proposed openstack/nova master: compute: bump nova-compute version and check in API  https://review.opendev.org/74093304:21
openstackgerritHuaqiang Wang proposed openstack/nova master: metadata: export the vCPU IDs that are pinning on the host CPUs  https://review.opendev.org/68893604:21
openstackgerritHuaqiang Wang proposed openstack/nova master: release note: add new features introduces by use-pcpu-and-vcpu-in-one-instance  https://review.opendev.org/73699704:21
openstackgerritHuaqiang Wang proposed openstack/nova master: test: add some notes and new tests for mixed instance  https://review.opendev.org/74139504:21
*** bbowen has quit IRC04:30
*** bbowen has joined #openstack-nova04:30
*** rcernin has quit IRC04:47
*** tetsuro has joined #openstack-nova04:50
*** Liang__ has quit IRC04:54
*** KeithMnemonic has quit IRC04:55
*** Liang__ has joined #openstack-nova04:55
*** udesale has joined #openstack-nova04:57
*** rcernin has joined #openstack-nova04:57
*** Liang__ has quit IRC05:01
*** Liang__ has joined #openstack-nova05:02
*** yaawang has quit IRC05:04
*** yaawang has joined #openstack-nova05:05
*** rcernin has quit IRC05:12
*** suryasingh has joined #openstack-nova05:18
*** rcernin has joined #openstack-nova05:20
*** tetsuro has quit IRC05:21
*** ratailor has joined #openstack-nova05:29
*** links has joined #openstack-nova05:37
*** zhanglong has joined #openstack-nova05:44
*** yaawang has quit IRC05:45
*** yaawang has joined #openstack-nova05:45
*** Liang__ has quit IRC05:47
*** Liang__ has joined #openstack-nova05:48
*** markvoelker has joined #openstack-nova05:49
*** JamesBen_ has quit IRC05:51
*** JamesBenson has joined #openstack-nova05:53
*** markvoelker has quit IRC05:54
*** xiaolin has joined #openstack-nova05:54
*** JamesBenson has quit IRC05:58
openstackgerritHuaqiang Wang proposed openstack/nova master: hardware: create 'mixed' instance for realtime CPUs  https://review.opendev.org/72848506:07
openstackgerritHuaqiang Wang proposed openstack/nova master: compute: bump nova-compute version and check in API  https://review.opendev.org/74093306:07
openstackgerritHuaqiang Wang proposed openstack/nova master: metadata: export the vCPU IDs that are pinning on the host CPUs  https://review.opendev.org/68893606:07
openstackgerritHuaqiang Wang proposed openstack/nova master: release note: add new features introduces by use-pcpu-and-vcpu-in-one-instance  https://review.opendev.org/73699706:07
openstackgerritHuaqiang Wang proposed openstack/nova master: test: add some notes and new tests for mixed instance  https://review.opendev.org/74139506:07
openstackgerritWenping Song proposed openstack/nova-specs master: Add no user token when get Cyborg client  https://review.opendev.org/74018406:12
*** rcernin has quit IRC06:19
*** jsuchome has joined #openstack-nova06:25
*** rcernin has joined #openstack-nova06:32
*** yaawang has quit IRC06:32
*** yaawang has joined #openstack-nova06:33
*** JamesBenson has joined #openstack-nova06:41
*** tetsuro has joined #openstack-nova06:43
*** JamesBenson has quit IRC06:45
*** dklyle has quit IRC06:46
*** rcernin has quit IRC06:46
*** tetsuro has quit IRC06:48
*** tetsuro has joined #openstack-nova06:50
*** rcernin has joined #openstack-nova06:53
*** tetsuro has quit IRC06:55
*** xiaolin has quit IRC06:55
*** maciejjozefczyk has joined #openstack-nova06:56
*** zhanglong has quit IRC07:04
*** yaawang has quit IRC07:04
*** yaawang has joined #openstack-nova07:05
*** zhanglong has joined #openstack-nova07:05
*** rcernin has quit IRC07:06
*** tesseract has joined #openstack-nova07:15
*** udesale has quit IRC07:17
*** mkrai has quit IRC07:21
*** ralonsoh has joined #openstack-nova07:23
*** Liang__ has quit IRC07:25
*** brinzhang0 has joined #openstack-nova07:26
*** brinzhang_ has quit IRC07:29
*** tosky has joined #openstack-nova07:33
*** rcernin has joined #openstack-nova07:35
openstackgerritKevin Zhao proposed openstack/nova master: [WIP] CI: add tempest-integrated-compute-aarch64 job  https://review.opendev.org/71443907:38
*** tetsuro has joined #openstack-nova07:41
*** rcernin has quit IRC07:42
*** nightmare_unreal has joined #openstack-nova07:43
openstackgerritLuigi Toscano proposed openstack/nova stable/ussuri: zuul: remove legacy-tempest-dsvm-neutron-dvr-multinode-full  https://review.opendev.org/74159007:45
*** rcernin has joined #openstack-nova07:46
*** tetsuro has quit IRC07:47
*** jsuchome has quit IRC07:48
*** zhanglong has quit IRC07:50
*** rcernin has quit IRC07:50
*** zhanglong has joined #openstack-nova07:51
*** mkrai has joined #openstack-nova07:52
*** jsuchome has joined #openstack-nova07:52
*** udesale has joined #openstack-nova07:57
*** brinzhang_ has joined #openstack-nova07:59
*** jangutter has joined #openstack-nova08:01
*** tetsuro has joined #openstack-nova08:02
*** brinzhang0 has quit IRC08:02
*** jangutter_ has quit IRC08:03
*** ociuhandu has joined #openstack-nova08:08
*** k_mouza has joined #openstack-nova08:09
openstackgerritKevin Zhao proposed openstack/nova master: [WIP] CI: add tempest-integrated-compute-aarch64 job  https://review.opendev.org/71443908:14
*** tetsuro has quit IRC08:23
*** derekh has joined #openstack-nova08:30
openstackgerritBalazs Gibizer proposed openstack/nova-specs master: Support SRIOV interface attach and detach  https://review.opendev.org/74160108:37
*** JamesBenson has joined #openstack-nova08:42
*** JamesBenson has quit IRC08:47
*** brinzhang0 has joined #openstack-nova08:47
*** spatel has joined #openstack-nova08:49
*** brinzhang_ has quit IRC08:49
*** xek_ has joined #openstack-nova08:51
*** spatel has quit IRC08:53
openstackgerritTobias Urdin proposed openstack/nova master: [PoC] libvirt: Broadcast RARP post live migration  https://review.opendev.org/74152909:02
openstackgerritTobias Urdin proposed openstack/nova master: [PoC] libvirt: Send announce-self post live migration  https://review.opendev.org/74152909:03
*** zhanglong has quit IRC09:04
tobias-urdinsean-k-mooney: yes I agree, it's a dirty workaround but until it is solved I will need to carry this downstream09:04
openstackgerritBalazs Gibizer proposed openstack/nova master: Remove broken legacy zuul jobs  https://review.opendev.org/74160309:05
*** zhanglong has joined #openstack-nova09:05
gibigmann, tosky: I proposed some clean of our jobs ^^09:07
toskygibi: as long as it works for you, I'm fine :)09:08
gibitosky: they never passed09:08
gibitosky: and they are really old jobs09:08
*** brinzhang_ has joined #openstack-nova09:09
gibiand anyhow if somebody needs them and cares about them then we can resurrect them in zuul v3 form later09:09
toskyI don't remember if I mentioned it yesterday: even though it's not part of the main goal (just stretch), if you plan to backport the fixes to the older branches, that will help cleaning up openstack-zuul-jobs09:09
toskythat's why I proposed a backport of the fix that was merged yesterday09:09
gibitosky: I'm happy to backport these job deletes to older branches09:09
toskythanks :)09:10
*** brinzhang0 has quit IRC09:12
*** sapd__x has quit IRC09:12
*** martinkennelly has joined #openstack-nova09:14
*** zhanglong has quit IRC09:14
*** zhanglong has joined #openstack-nova09:17
*** sapd1 has joined #openstack-nova09:18
*** martinkennelly has quit IRC09:19
*** dtantsur|afk is now known as dtantsur09:19
gibilyarwood, melwitt: if you have time could you hit this pike backport? https://review.opendev.org/#/q/topic:bug/1862633+status:open09:23
*** rcernin has joined #openstack-nova09:29
*** zhanglong has quit IRC09:30
*** zhanglong has joined #openstack-nova09:31
*** Liang__ has joined #openstack-nova09:36
*** Liang__ has quit IRC09:41
*** rcernin has quit IRC09:48
lyarwoodgibi: ack I can look today09:52
gibithanks09:52
*** tkajinam has quit IRC09:52
gibisean-k-mooney, stephenfin: fyi I pushed a small spec for SRIOV attach https://review.opendev.org/#/c/741601/09:55
*** udesale_ has joined #openstack-nova10:01
*** yaawang has quit IRC10:02
*** zhanglong has quit IRC10:02
*** zhanglong has joined #openstack-nova10:03
*** udesale has quit IRC10:04
*** yaawang has joined #openstack-nova10:04
*** dtantsur is now known as dtantsur|brb10:07
*** yaawang has quit IRC10:14
*** zhanglong has quit IRC10:15
*** yaawang has joined #openstack-nova10:21
*** brinzhang0 has joined #openstack-nova10:22
openstackgerritWenping Song proposed openstack/nova master: Cyborg shelve/unshelve support  https://review.opendev.org/72956310:22
*** brinzhang_ has quit IRC10:25
brinzhang0gibi: hi, I update the nova runway, added the cyborg-shelve-and-unshelve patches to the queue10:26
*** brinzhang0 has quit IRC10:30
*** JamesBenson has joined #openstack-nova10:43
stephenfingibi: I lost the race for some o.vo versions. Could I get your +2 on this again? https://review.opendev.org/#/c/739210/10:46
*** JamesBenson has quit IRC10:47
stephenfingibi: -1 on spec for nits but otherwise perfect10:51
openstackgerritStephen Finucane proposed openstack/nova master: libvirt: Re-enable live snapshot for paused instances  https://review.opendev.org/74033410:53
openstackgerritStephen Finucane proposed openstack/nova master: libvirt: Remove workaround for really old QEMU  https://review.opendev.org/74033510:53
openstackgerritStephen Finucane proposed openstack/nova master: libvirt: Use better variable names for '_create_guest'  https://review.opendev.org/74094510:53
openstackgerritStephen Finucane proposed openstack/nova master: WIP: Attempt to restore some sanity to snapshot  https://review.opendev.org/74033610:53
*** mkrai has quit IRC10:53
stephenfinlyarwood: The first two of those are pretty simple but would benefit from your eyes, IMO. The third is a FUP for your comments ^10:53
kashyapstephenfin: Good catch on re-enabling the 'paused' thing10:56
kashyapI recall the old change10:56
kashyapYep, it makes sense, given how far we've come w/ newer (and largely improved) versions10:57
*** martinkennelly has joined #openstack-nova10:57
*** sapd1 has quit IRC10:57
*** mgariepy has quit IRC10:58
*** martinkennelly has quit IRC11:01
sean-k-mooneygibi: ill read your sepc shortly. did you see my comments on the code change.11:37
sean-k-mooneyyour code change wont fix PF detach either so i might just add a small patch on top to do that11:37
*** raildo has joined #openstack-nova11:41
*** JamesBenson has joined #openstack-nova11:45
*** xek_ has quit IRC11:50
*** mgariepy has joined #openstack-nova12:02
openstackgerritStephen Finucane proposed openstack/nova master: api: Add type hints for 'nova.compute.api'  https://review.opendev.org/74162312:04
openstackgerritStephen Finucane proposed openstack/nova master: WIP: api: Add 'nova.compute.API.migrate'  https://review.opendev.org/74162412:04
openstackgerritElod Illes proposed openstack/nova stable/queens: Reject boot request for unsupported images  https://review.opendev.org/70860912:05
*** dtantsur|brb is now known as dtantsur12:10
*** ratailor has quit IRC12:20
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Handle VIR_ERR_DEVICE_MISSING when detaching devices  https://review.opendev.org/74156112:21
*** haleyb has quit IRC12:21
*** JamesBenson has quit IRC12:25
*** JamesBenson has joined #openstack-nova12:26
*** huaqiang has quit IRC12:39
*** huaqiang has joined #openstack-nova12:40
*** spatel has joined #openstack-nova12:50
*** jangutter_ has joined #openstack-nova12:54
*** spatel has quit IRC12:55
*** jangutter has quit IRC12:58
jkulikHow often and how do you clean up your instances table in the cell0 db?13:01
sean-k-mooneyi think you do it with nova-manage13:01
sean-k-mooneyhow often is up to you13:02
sean-k-mooneyi think downstream we default to every 30days13:02
jkulikhm ... I thought so, but I can't seem to find the docs or the code for that13:03
sean-k-mooneyi think you do "nova-manage db archive_deleted_rows" then "nova-manage db purge"13:05
sean-k-mooneybut you need to us a config file with the cell0 credentials13:05
gibistephenfin: thanks.13:06
jkulikdeleted_at is NULL for the instance I'm looking at. it's over 200 days old13:06
gibisean-k-mooney: I haven't read the comments on the code yet13:06
gibisean-k-mooney: I have some review to make before I can circle back to the sriov code patches13:07
sean-k-mooneygibi: there is no rush13:07
sean-k-mooneyit can wait till next week13:08
gibiack13:08
jkulikSeems to be just that one instance, that's not deleted, though. Thank you.13:08
mnaseri think the archive code didn't cover cell0 before and recently added support to go over all cells13:09
mnaserif i'm remembering right13:09
sean-k-mooneymnaser: so by default nova manage used to only look at cell113:09
sean-k-mooneyyou could create a second config can use that to run the commands on cell013:09
mnaseryeah, i get this feeling i remember there was an improvement that let you run it against all cells more recently13:10
sean-k-mooneybut i dont know if we have now made it do that by default or not13:10
* mnaser checks13:10
sean-k-mooneyit came up as a downstream bug i think and melwitt? might have imporved it13:10
mnasersean-k-mooney: good memory https://review.opendev.org/#/c/507486/13:11
sean-k-mooney11 months so proably train?13:11
mnaseryes, the earliest tag this commit is part of is train13:11
mnaseror 20.0.013:12
mnaserso yeah, train onwards13:12
lyarwoodmelwitt: https://review.opendev.org/#/c/741561/ - would you be able to take a look at this today?13:12
sean-k-mooneyim not sure if this is backportable or not but jkulik are you pre train13:13
jkulikI'm fine for now, thank you. (we're on queens but going to upgrade soon. it can wait. the instance in question just wasn't marked deleted and thus still came up in "server list" and customers wondered ...)13:13
sean-k-mooneyif it was in cell0 then it would have been in error13:14
jkulikIt was actually in state BUILD13:14
sean-k-mooneyreally? that should not happen13:14
sean-k-mooneythat likely means you had an rpc issue at some point that lost a callback to set it to error13:15
jkulikbut since that happened somewhere last year, I also don't have logs anymore. this being the only instance in that state, I think it can be ignored.13:15
sean-k-mooneyya but if you do see this crop up more often let us know13:15
sean-k-mooneyqueens is sufficently old that if you hit an issue its proably already fixed but still good to know13:16
mnaseryeah, i've ran into a few of the isssues that jkulik ran into and cleaned them up in later release13:16
mnasermostly around making things happen in a single transaction to avoid BUILD inside cell013:16
*** k_mouza has quit IRC13:17
sean-k-mooneymaybe that is something we shoudl add to nova audit13:17
mnaserbtw - debug from yesterday's issue exposes this https://www.irccloud.com/pastebin/FdpUKxPC/13:18
sean-k-mooneyset instance in build in cell0 to error after x time13:18
mnaserso it sounds like the actual commit to addport happened 34 seconds after13:18
sean-k-mooneyya so if your using iptables os-vif need to create a linux bridge then a veth pair then add one end of the veth pair to the linux bridge and the other end to ovs13:19
mnaserso maybe that part took a long time so i need to do some more log grepping13:20
sean-k-mooneyso that can take a few second although i would not normally expect it to be that long13:20
mnasersystem has 1157 total devices13:20
sean-k-mooneygoing form vsctl to native seams to have cut it form 3 mins to ~30 seconds so thats an improvement at least13:21
mnaserfrom qvo/qvb/qbr13:21
mnaseryeah -- i still have instances that take 900s to provision so it might still be bad too13:21
sean-k-mooneymnaser: if you change form iptables to conntrack then it elimiandes all the hybrid plug stuff so os-vif just add a port to ovs and is done13:21
mnasersean-k-mooney: yeah i have another one here that took 3 minutes too13:22
mnaserso i suspect native vs ovsdb wasn't the big help here13:22
sean-k-mooneylikely the large number of ports is cause some of the operation to be slower13:22
sean-k-mooneyah ok13:22
mnasersean-k-mooney: but firewall driver is defined in neutron?  and this is happening inside nova?13:23
mnaserdoes nova become somehow aware from what firewall driver is being used?13:23
sean-k-mooneyyes13:24
sean-k-mooneythe vif binding_details has a hybrid_plug atribute13:24
mnaserah gotcha13:25
sean-k-mooneyits true for iptables and false for contrac13:25
sean-k-mooneyos-vif just does what neutron tells it to do13:25
mnaserand so when donig the PUT for the binding13:25
mnaserwe get the info back telling us how to do it13:25
sean-k-mooneyyep13:25
mnaserok i see.  i wonder if i can restart straight into openvswitch firewall driver13:25
sean-k-mooneywe use that respocne both to generate the libvirt xml and pass it to os-vif to tell it how to add the port13:26
sean-k-mooneynot quite13:26
mnaseri've looked at the code and it seems like it does have some code which 'remanages' it fwiw13:26
sean-k-mooneyyou can but the ports on exiting vms wont be rebound13:26
*** artom has joined #openstack-nova13:26
sean-k-mooneyso you need to live migrate teh vms or shelve and unshelve them13:26
sean-k-mooneyi guess any move operation but hardreboot wont fix it13:27
mnasersean-k-mooney: have you seen this code? https://github.com/openstack/neutron/blob/master/neutron/agent/linux/openvswitch_firewall/iptables.py13:27
*** nhicher has joined #openstack-nova13:27
mnaseri guess i don't really care that much if the existing vms stay hybrid plugged13:27
sean-k-mooney so wehn you change it i think that code is nolonger loaded or run13:28
mnaseras long as the new ones don't use hybrid plugging which likely will result in faster os_vif plug time13:28
sean-k-mooneyso i thnk you will lose the ablity to manage security group rules on the exsting vms13:28
sean-k-mooneyi cant remeber if it was stien we added multiple port bining but when we started using that in nova we got the ability to live migrate between hosts with different firewall driver13:29
sean-k-mooneywhich will allow you to change in a rolling upgade style13:30
*** guilhermesp has joined #openstack-nova13:30
mnasersean-k-mooney: i think looking at the code, switching can work, it removes the iptables rules and then just starts applying the rules on the existing qvo iface13:31
mnaserhttps://github.com/openstack/neutron/blob/master/neutron/agent/linux/openvswitch_firewall/firewall.py#L614-L63613:31
sean-k-mooneyah ok ya i just notice it did not change the hybridg plug state13:32
sean-k-mooneyif it actully removes the iptable rules then it will work fine13:32
sean-k-mooneythe existging vm will keep using hybrid_plug untill there ports are rebound13:32
mnaserwe've switched them in the past and it wasn't an issue13:32
mnaseryep, but the firwall rules will apply on the qvo instead of tap13:32
sean-k-mooneycool13:33
mnaserand new plugs will plumb tap in and eliminate qbr/qvo/qvb13:33
sean-k-mooneyyep13:33
mnaserwhich will significantly reduce the number of ports on the system13:33
sean-k-mooneyit will devide it by 313:33
stephenfinlyarwood: replied on https://review.opendev.org/#/c/699291/13:33
mnaseralso i think tap interfaces won't have a random ipv6 addr show up13:34
sean-k-mooneythat is also likely13:34
sean-k-mooneywell maybe not13:34
sean-k-mooneylet me check13:34
mnaserdmesg shows ADDRCONF events for qvb and qvo13:34
mnaserbut none for tap13:34
sean-k-mooneythey still get link local adresses13:35
sean-k-mooneythat siad it is some thing that we could disable in os-vif13:35
stephenfinyo, bauzas. Want some d̶r̶u̶g̶s̶ patches? https://review.opendev.org/#/c/729595/ https://review.opendev.org/#/c/729596/13:35
sean-k-mooneyactully no we cant13:35
sean-k-mooneywhen os-vif creates the port on ovs its before qemu has created the tap13:36
bauzasstephenfin: https://media.tenor.com/images/a2b7c73a67cf6c1a775466e6ad87d8b7/tenor.gif13:36
stephenfin:P13:37
bauzas(you need to know about Weeds, dude)13:37
stephenfinI know GIFs13:37
* bauzas is currently writing a spec, so *any* review ask is good to me :)13:37
bauzascould I provide a French spec ? :p13:38
stephenfinSure! Just hope you're happy with artom being the only one reviewing it13:38
mnasersean-k-mooney: check this out -- https://github.com/openstack/neutron/blob/23e3213a07eb0b0fcdd2a1da36a847dde9beba57/neutron/tests/fullstack/test_firewall.py13:41
mnaserneutron tests switching to openvswitch with agent restart :)13:41
stephenfinlyarwood: Also, in case you didn't know already, I removed the auto-branch naming feature from git-review. If you want topics for e.g. https://review.opendev.org/#/c/741561/ you need to create the branch yourself13:42
sean-k-mooneymnaser: cool13:43
sean-k-mooneymnaser: that makes the upgrade path much smother13:44
stephenfinsean-k-mooney, gibi, (others): Need input of vTPM design. What should we do if we resize and the new flavor has a different vTPM config?13:44
sean-k-mooneythat was in the spec13:44
sean-k-mooneywe have 2 option reject the resize or what the spec says is we recreat it with the new format lossing all data13:45
stephenfinRight, but what about revert?13:45
sean-k-mooneystephenfin: there is no way to convert form one type to the other13:45
stephenfinIs resize expected to be a destructive operation?13:45
stephenfinI know rebuild is13:45
stephenfinbut didn't think resize was13:45
sean-k-mooneyit should not be an issue unless we are talking about same host resize13:45
stephenfinWell it's awkward to implement, hence why I'm asking :)13:46
stephenfinI need to stash the ID of the old key stored in the key manager service13:46
sean-k-mooneywell we dont want to destoy the old tpm untill resize confimr or reviert13:47
sean-k-mooneyyes you would13:47
sean-k-mooneybut you can do it the same way we do for flavors13:47
sean-k-mooneystephenfin: basically what we said in the spec was pretend it really hardware13:48
stephenfinCan't we just block it like we do for NUMA13:49
sean-k-mooneyfor reall hardware if we swapped the mother board which is what a resize is then it would be lose the data13:49
sean-k-mooneystephenfin: yes we could that was option 113:49
stephenfinI'm tempted to suggest we do that anyway, since I think this is unlikely to be used much in practice13:50
openstackgerritMerged openstack/nova stable/queens: libvirt: Don't delete disks on shared storage during evacuate  https://review.opendev.org/73271713:50
stephenfinso long as I explicitly block it like you did for NUMA13:50
sean-k-mooneyif you block it for resize i assume the same will be true for rebuild13:52
sean-k-mooneyrebuild is not ment to be destuctive13:52
stephenfinthat would be my thinking, yes13:52
sean-k-mooneybut it would be if an only if you cnaged the type13:52
stephenfinconfig from flavor + image meta must be identical13:52
sean-k-mooneyyep whcih is exactly what we do for numa13:52
sean-k-mooneyon rebuild at least13:53
sean-k-mooneybut before going down this route13:53
sean-k-mooneydo you need to have 2 keys13:53
sean-k-mooneycould you not just use the same key for both vtpms13:53
stephenfinthat's an interesting point13:54
stephenfinwe could indeed, given the owner has changed13:54
stephenfinlet me see how that works13:54
* stephenfin reverts everything he's done in the past 24 hours \o/13:55
stephenfintwas all garbage anyway13:55
mnasersean-k-mooney: fyi, you might find this interesting -- https://bugzilla.redhat.com/show_bug.cgi?id=1782834 and neutron workaround https://review.opendev.org/#/c/733674/ for ovs 2.1213:55
openstackbugzilla.redhat.com bug 1782834 in openvswitch "Changing protocols in Bridge table doesn't take effect" [High,New] - Assigned to aconole13:55
*** tobias-urdin is now known as tobias-urdin|pto13:55
sean-k-mooneyif that proves difficutl to implement we can use that as justification for blocking and move the two thing you tried into alternitives13:55
sean-k-mooneymnaser: huh that inconveniant13:57
sean-k-mooneymnaser: strangly enough i was asked to look at this patch earlier today13:57
sean-k-mooneyhttps://review.opendev.org/#/c/738551/7/neutron/plugins/ml2/drivers/openvswitch/agent/ovs_neutron_agent.py@216313:57
melwittlyarwood: sure, will do13:57
sean-k-mooneywhich is also setting the openflow versions13:58
*** xek_ has joined #openstack-nova14:01
mnaserovs-vsctl set bridge br-int protocols=OpenFlow10,OpenFlow11,OpenFlow12,OpenFlow13,OpenFlow14,OpenFlow15 doesn't really fix much14:01
mnaserlet me restart the ovs agent after that14:02
*** mlavalle has joined #openstack-nova14:04
*** k_mouza has joined #openstack-nova14:12
*** haleyb has joined #openstack-nova14:20
gibistephenfin: regarding vTPM. We can block the resize and rebuild if that result in a loss of vTPM data as a first step. I'm fine with taht14:31
*** k_mouza has quit IRC14:31
*** READ10 has joined #openstack-nova14:34
*** READ10 has quit IRC14:36
*** xek_ has quit IRC14:41
*** k_mouza has joined #openstack-nova14:42
*** dklyle has joined #openstack-nova14:42
gibidansmith: hi! I talked to the release team about M2 and it turned out that we don't need a nova release just an os-vif and python-novaclient release and those can be made before M2. So I will propose those lib releases next week and then there is nothing to do at M2 from release perspective14:43
dansmithgibi: I saw, cool, I *definitely* volunteer then :)14:43
gibicool :)14:43
gibidansmith: can I ask you to run the weekly meeting on the 30th?14:45
dansmithgibi: yeah14:46
gibithank you14:46
gmanngibi: zero_downtime upgrade job use zero_downtime/hooks/run_tests.sh which i did not find in git history at all when it was added/remvoed - https://opendev.org/openstack/openstack-zuul-jobs/src/branch/master/playbooks/legacy/grenade-dsvm-neutron-multinode-zero-downtime/run.yaml#L4514:49
gmannwe do not test zero downtime upgrade (i think none of project does) in any other job14:50
gmannso i agree to remove this broken job and if we want/do test zero_downtime in future then we write new job on zuulv314:51
gibigmann: thanks for checking14:51
gmannnone of the projects use zero_downtime upgrade TC tag   https://governance.openstack.org/tc/reference/tags/assert_supports-zero-downtime-upgrade.html14:52
lyarwoodkashyap: https://review.opendev.org/#/c/741561/ - if you have time for a review before leaving for the weekend btw14:52
*** READ10 has joined #openstack-nova14:53
gmannI will remove(once all stable branch remove the job ) the job definition also from opensatck-zuul-jobs as this job is only used by nova14:53
kashyaplyarwood: Hiya, definitely14:53
gibigmann: cool14:54
kashyaplyarwood: Looks good; also bonus marks for the nice reproducer write-up!14:57
sean-k-mooneygmann: im not sure that any of the service got to the poitn where they could have contol planes on two differetn versions did they14:58
sean-k-mooneythat would be the main issue for nova14:58
sean-k-mooneyn an n+1 would expect diffferent db version and potatiall rpcs version so we cannot run both in parralel14:59
sean-k-mooneyeven if we can upgrade the contoelr indepently of the comptue nodes14:59
gmannsean-k-mooney: is not load balancer should disable n-api on node which is going to be upgraded and redirect request to other api node. once first api node is upgraded then bring up the n-api and disable 2nd api node ..15:06
gmanni mean HAProxy  should be disable the request on one api node when we upgrade that so that request can be served from other api node and zero downtime on api service15:09
*** bnemec is now known as beekneemech15:10
sean-k-mooneygmann: no that wont work15:11
*** maciejjozefczyk has quit IRC15:14
bauzasgibi: dansmith: fwiw, sorry if I was unclear but i'll be there on week from July 27th to 30th (excluding Friday probably)15:18
gibibauzas: no worries. It turned out that there is nothing pressing to handle at M215:18
bauzasbut either way, all good if dansmith can release m-2 and lead the meeting :)15:18
gibi:)15:18
*** ociuhandu_ has joined #openstack-nova15:20
sean-k-mooneygmann: its not just the api its also the conductor and schuler that need to agree on the db schemes and rpc verions15:20
*** ociuhandu has quit IRC15:24
*** ociuhandu_ has quit IRC15:24
*** gyee has joined #openstack-nova15:26
openstackgerritBalazs Gibizer proposed openstack/nova-specs master: Support SRIOV interface attach and detach  https://review.opendev.org/74160115:29
gibistephenfin, sean-k-mooney: ^^ fixed up the spec15:29
melwittlyarwood: commented, there are some odd things in the tests (args passed but not used) and tests that are simulating non-real situations, not sure if that was intended15:30
lyarwoodmelwitt: ack looking15:32
lyarwoodsean-k-mooney: https://review.opendev.org/#/c/728057/ btw, looks like this change blows up when we log anything in ERROR15:32
stephenfingibi: +2 for me. Seems like a no-brainer if you're able to do the work15:36
sean-k-mooneyit does?15:37
*** udesale_ has quit IRC15:37
stephenfinsean-k-mooney: I thought you were talking to me there for a sec15:37
sean-k-mooneyit is not even specific to the error log15:37
sean-k-mooneyoh no lyarwood ^15:38
sean-k-mooneygibi: looking now15:39
*** mlavalle has quit IRC15:40
gibistephenfin: I think I can do the work. The WIP patch already works for me in devstack so only automatic testing and documentation is missing.15:40
gibistephenfin, sean-k-mooney: thanks15:40
stephenfin\o/15:40
lyarwoodsean-k-mooney: yeah I assume py27 doesn't like the filter you provided?15:41
sean-k-mooneygibi: +115:41
*** mlavalle has joined #openstack-nova15:41
sean-k-mooneylyarwood: ah yes that is possible15:41
gibiawesome, thanks15:41
sean-k-mooneylyarwood: its a generator expression15:42
lyarwoodhttps://docs.python.org/2/library/logging.html#filter-objects15:42
openstackgerritStephen Finucane proposed openstack/nova master: mypy: Add type annotations to 'nova.pci'  https://review.opendev.org/67620915:42
openstackgerritStephen Finucane proposed openstack/nova master: mypy: Add nova.cmd, nova.conf, nova.console  https://review.opendev.org/70565715:42
openstackgerritStephen Finucane proposed openstack/nova master: mypy: Add type annotations to top-level modules  https://review.opendev.org/70565815:42
openstackgerritStephen Finucane proposed openstack/nova master: rpc: Rework 'get_notifier', 'wrap_exception'  https://review.opendev.org/74166315:42
melwittlyarwood: fyi guh, the ceph job looks like it's been angry on stable branches15:42
sean-k-mooneyreturn not any(msg in log_record.msg for msg in messages)15:42
lyarwoodit needs to be in a filter method15:42
lyarwoodmelwitt: yeah I have a few changes open to look at15:42
melwittack15:42
stephenfinsean-k-mooney, lyarwood: Looks like it needs to be a specific class in Python 2? https://docs.python.org/2/library/logging.html#filter15:43
stephenfinhttps://docs.python.org/2/library/logging.html#filter-objects15:44
lyarwoodright15:44
stephenfinactually idk, nothing in the docs by way of versionedchanged admonitions15:44
stephenfinso if it's a change, it's not well documented15:45
lyarwoodbetween 3 and 2?15:45
lyarwoodha15:45
*** hamalq has joined #openstack-nova15:45
sean-k-mooneystephenfin: ah ok15:45
sean-k-mooneyill go fix that so15:46
stephenfinit might be worth checking what you've done *definitely* works on Python 315:46
stephenfinif it does, I'd be game to submit a PR to the Python docs15:46
lyarwoodstephenfin: I'm pretty sure we've logged stuff at ERROR in CI ;)15:46
lyarwoodstephenfin: oh but you mean actually works15:46
sean-k-mooneyhttps://docs.python.org/3.8/library/logging.html#logging.Filter15:47
sean-k-mooney"You don’t actually need to subclass Filter: you can pass any instance which has a filter method with the same semantics."15:47
*** hamalq_ has joined #openstack-nova15:47
stephenfinsean-k-mooney++ Good spot15:47
stephenfinpython-docs++15:47
sean-k-mooneyi didnt see that was py3 only15:48
sean-k-mooneybut its easy to fix15:48
sean-k-mooneythat said i have dinner/breakfast cooking so ill do it in an hour or so15:48
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Handle VIR_ERR_DEVICE_MISSING when detaching devices  https://review.opendev.org/74156115:50
lyarwood^ melwitt ; hopefully sorted now thanks again15:50
*** hamalq has quit IRC15:50
melwittlyarwood: I didn't catch you with my latest comment but it's just a thought anyway15:52
lyarwoodmelwitt: yeah I'd like to do that in a FUP if possible while simplifying the logic15:53
melwittk15:53
*** derekh has quit IRC15:55
kashyapmelwitt: Thanks for tha close reading of the tests review.  I wasn't as careful16:00
stephenfinneither was I :(16:00
*** psachin has quit IRC16:00
melwittthose tests are confusing 😝16:02
kashyapmelwitt: Yes!  I bow to your eagle eyes.  Sometimes, I give up in despair16:02
*** k_mouza has quit IRC16:02
*** JamesBenson has quit IRC16:02
melwittnot by fault of lyarwood but just the number of combos needed. I'm almost thinking ddt would have helped here16:03
lyarwoodmelwitt: this needs to go back to stable/queens, did we have ddt back then?16:03
stephenfinyup16:04
stephenfinwe've had that since Grizzly or something, afaik16:04
lyarwoodah cool16:04
lyarwoodkk well I can respin again if it makes it easier to read and update in the future16:05
*** k_mouza has joined #openstack-nova16:05
*** READ10 has quit IRC16:06
*** martinkennelly has joined #openstack-nova16:06
melwittlyarwood: well, if you're refactoring and making those error catchings go away, then it doesn't matter [after this] right? but I'm afraid some combinations are still missing so brace yourself. I might be wrong though, somehow looking at these makes me really confused16:07
openstackgerritBalazs Gibizer proposed openstack/nova master: Remove broken legacy zuul jobs  https://review.opendev.org/74160316:11
*** mgariepy has quit IRC16:13
*** dtantsur is now known as dtantsur|afk16:13
*** k_mouza has quit IRC16:15
*** nightmare_unreal has quit IRC16:16
openstackgerritStephen Finucane proposed openstack/nova master: libvirt: Use better variable names, types for '_create_guest'  https://review.opendev.org/74094516:23
openstackgerritStephen Finucane proposed openstack/nova master: WIP: Attempt to restore some sanity to snapshot  https://review.opendev.org/74033616:23
dansmithgah, stephenfin can you tell me what the magic oslo config reno linky thing is?16:30
dansmithmy grepping is failing16:30
dansmithor is that docs only?16:30
stephenfinthe sphinx roles? You can't use those in renos16:30
stephenfinjust use ``literals``16:31
*** links has quit IRC16:31
dansmithack, that must be why I can't find it16:31
stephenfinit's :oslo.config:option:`group.opt` for stuff in doc/source16:32
dansmithyeah okay, I was only grepping the reno tree thinking I'd find examples in there16:33
dansmithbut makes sense16:33
*** JamesBenson has joined #openstack-nova16:47
*** markvoelker has joined #openstack-nova16:58
*** artom has quit IRC17:02
*** maciejjozefczyk has joined #openstack-nova17:04
*** mgariepy has joined #openstack-nova17:09
*** vishalmanchanda has quit IRC17:20
*** owalsh has quit IRC17:21
*** jsuchome has quit IRC17:30
openstackgerritsean mooney proposed openstack/nova stable/train: Silence amqp heartbeat warning  https://review.opendev.org/72805717:35
sean-k-mooneystephenfin: lyarwood that should be the python 2 compatiable way to do the filtering17:36
sean-k-mooneybut we will see what the ci says17:36
stephenfinsean-k-mooney: Can you add a NOTE explaining what you've changed?17:37
*** tesseract has quit IRC17:37
sean-k-mooneysure one sec ill do it now17:37
stephenfinand normally I'd suggest a 'Changes:' section, but seeing as there's only one file it's probably not necessary :)17:37
*** owalsh has joined #openstack-nova17:42
*** artom has joined #openstack-nova17:43
openstackgerritsean mooney proposed openstack/nova stable/train: Silence amqp heartbeat warning  https://review.opendev.org/72805717:44
*** maciejjozefczyk has quit IRC17:44
artommordred, in my continuing sweep of sdk I stumbled upon https://review.opendev.org/#/c/741688/ - it's kinda touchy because we're breaking backwards compat (except not really?), otoh leaving it as is is overly restrictive too...17:45
*** JamesBenson has quit IRC17:45
*** JamesBenson has joined #openstack-nova17:49
*** martinkennelly has quit IRC17:52
mnasersean-k-mooney: have a look at what moving to openvswitch firewall driver did => https://docs.google.com/spreadsheets/d/14AzjZ8SaA6e-WOJ_0BDYAKVHtUIEobhyFWOZQPhN9s0/edit#gid=176910004317:52
mnaserswitch was done at around 15UTC17:52
sean-k-mooney it looks like all the long tails are gone17:54
*** JamesBenson has quit IRC17:54
sean-k-mooneywell maybe not from 1600 yes17:55
sean-k-mooneyi think there is still an improvement?17:56
sean-k-mooneybut from the graph alone its hare to tell17:57
mnasersean-k-mooney: yeah, it's still early but the results look much better so far17:58
sean-k-mooneymnaser: i think the median and mean times has droped and the make latencies are better ? is that correct17:58
mnaseryes, and also max time dropped (with noe or two odd ones)17:58
sean-k-mooneymnaser: in this config os-vif does not need to use privesep at all i belive if you are also using native17:58
mnasersean-k-mooney: its using native but it still does because i still see a privsep process up17:58
mnaserbut i cant imagine why it's needed17:59
sean-k-mooneythat is proably novas17:59
mnasersean-k-mooney: /openstack/venvs/nova-19.0.8/bin/python2 /openstack/venvs/nova-19.0.8/bin/privsep-helper --config-file /etc/nova/nova.conf --privsep_context vif_plug_ovs.privsep.vif_plug --privsep_sock_path /tmp/tmpD6Vj2G/privsep.sock17:59
sean-k-mooneyif there is an os vif on i can check quickly17:59
mnasermaybe we should look into taht17:59
sean-k-mooneywell my guess that privsep would not be need is just i dont htink we need to do most of the ops that need it in this config18:00
sean-k-mooneyit actuly was never a goal18:00
mnasersean-k-mooney: oh its used for a few funcs that we need for hybrid plug18:00
sean-k-mooney yes18:01
mnasersean-k-mooney: i suspect because this machine has a mix of both now, it is why privsep-helper went up18:01
sean-k-mooneyah ok that would make sense ya18:01
mnaserif there is no hybrid ports plugged anymore and i restart nova i suspect it would disappear18:01
sean-k-mooneywe need it to create he linux bridges18:01
mnaserya i found those refs here https://github.com/openstack/os-vif/blob/master/vif_plug_ovs/linux_net.py18:01
mnaserif i find a box with only taps, ill try and restart and theoretically it should not go up18:02
sean-k-mooneyyep its also need for ovs-vsctl to acess the ovs-db unix sockt but the python lib connect over tcp and does not need it18:02
sean-k-mooneyi can check my home deployment18:03
mnaseryeah in my case im using tcp (which is also the default)18:03
sean-k-mooneyyep tcp to localhost18:04
sean-k-mooneywell 127.0.0.118:04
sean-k-mooneysame thing18:04
sean-k-mooneyroot        48  0.0  0.0 3099668 61044 ?       Sl   Jun20   0:00 /var/lib/kolla/venv/bin/python /var/lib/kolla/venv/bin/privsep-helper --config-file /etc/nova/nova.conf --privsep_context vif_plug_ovs.privsep.vif_plug --privsep_sock_path /tmp/tmp1n9arrof/privsep.sock18:05
sean-k-mooneyso i still have it but i suspec its basically unused18:05
*** vdrok has quit IRC18:07
*** vdrok has joined #openstack-nova18:08
sean-k-mooneyah18:09
sean-k-mooneymnaser: https://github.com/openstack/os-vif/blob/989f5a7d92b2459444e6956facb332f19e7c6537/vif_plug_ovs/ovsdb/ovsdb_lib.py#L5018:10
sean-k-mooneywe set the mtu18:10
sean-k-mooneyfor the bridge18:10
*** JamesBenson has joined #openstack-nova18:11
mnaserah18:11
sean-k-mooneyas part of create port https://github.com/openstack/os-vif/blob/989f5a7d92b2459444e6956facb332f19e7c6537/vif_plug_ovs/ovsdb/ovsdb_lib.py#L10718:11
sean-k-mooneyso ya that is the only privadge call which is a significat improvment18:11
*** NobodyCam has quit IRC18:12
*** NobodyCam has joined #openstack-nova18:12
openstackgerritStephen Finucane proposed openstack/nova master: tests: Add helpers for suspend, resume and reboot of server  https://review.opendev.org/74128518:15
openstackgerritStephen Finucane proposed openstack/nova master: libvirt: Pass context, instance to '_create_domain'  https://review.opendev.org/74128618:15
openstackgerritStephen Finucane proposed openstack/nova master: api: Reject non-spawn operations for vTPM  https://review.opendev.org/74150018:15
openstackgerritStephen Finucane proposed openstack/nova master: libvirt: Add emulated TPM support to Nova  https://review.opendev.org/63136318:15
openstackgerritStephen Finucane proposed openstack/nova master: docs: Add docs for vTPM support  https://review.opendev.org/73921318:15
openstackgerritStephen Finucane proposed openstack/nova master: WIP: Add support for resize and cold migration of emulated TPM files  https://review.opendev.org/63993418:15
stephenfindansmith: I won't be doing this tonight, but you might be interested in https://review.opendev.org/#/c/639934/40//COMMIT_MSG@3718:16
dansmithomg18:16
dansmithscary18:16
dansmithI18:16
stephenfintl;dr: I need access at the virt driver layer to both the old and new flavor when confirming/reverting a resize, and I don't have that at the moment because we're zeroing out those attributes before said calls to virt driver18:17
dansmitham covered up in other stuff (some of which I broke) but .. yeah maybe I can look next week18:17
dansmithyeah18:17
stephenfinI'm planning to just delay that zeroing out 'til after, but I don't know if there'll be knock on effects or not18:17
*** bbowen has quit IRC18:17
dansmithyeah18:17
stephenfinseeing as your name is beside all the zeroing out, I figure you're the person to ask :)18:18
stephenfinnext week is good though18:18
sean-k-mooneystephenfin: do you want me to take a look at that too or should i start from the begining of the seriese18:18
stephenfinsean-k-mooney: If you can follow the flow of that code, sure :)18:19
stephenfinYou can do earlier but this is the only bit I personally consider somewhat tricky now. The rest of the series is mostly wiring up stuff18:20
sean-k-mooneywell i have not looke at the code in about 2 months so that is why i was hesitent to jump into the end of the series18:20
sean-k-mooneyi think i need to start form the start to understand all the dettails18:20
stephenfinSure, whatever works. It's big. You can ignore the functional test cleanup patches, obv18:21
*** TheJulia has quit IRC18:22
*** TheJulia has joined #openstack-nova18:23
*** johnsom has quit IRC18:24
*** johnsom has joined #openstack-nova18:24
dansmithstephenfin: just skimming it looks like a lot is missing from that patch you linked18:25
dansmithlike, it doesn't seem to do much of what the tin describes18:25
stephenfin \o/18:25
dansmithlike, it removes some exceptions and adds some privsep things that don't otherwise get used18:25
dansmithjust wondering if you forgot to git add or something18:25
stephenfinmaybe I did18:26
stephenfinchecking18:26
dansmithohh, wait, sorry18:26
dansmiththe sorted-last v-for-virt stuff was getting hidden under tests18:26
dansmithsorry18:26
stephenfinah, yes18:26
stephenfinthis is all libvirt-specific so nothing happens higher than that18:27
sean-k-mooneystephenfin: oh am ill respin the backport after ci completes or monday18:28
sean-k-mooneybut ya i can just move the not to the commit and list the updated file18:28
stephenfinyup, that ought to do the trick18:28
mnasersean-k-mooney: could we not use libvirt to set mtu and avoid that privsep call?18:29
mnaseror maybe we already use it? https://www.irccloud.com/pastebin/ho53ZFfZ/18:30
mnaserso that mtu set is.. probably not super useful when we're not doing a hybrid plug18:30
sean-k-mooneywe do but its needed when using hybrid plug18:31
sean-k-mooneyi dont think we need to do it for non hybrid plug18:31
sean-k-mooneymnaser: that said i really want to stop libvirt doing anything related to networking18:32
mnaserok so architecturally we want os_vif to do the work18:32
sean-k-mooneymnaser: it casues an unavoidble race18:32
sean-k-mooneyyes18:32
mnaserthat's a fair point, so mtu stays then i guess18:32
sean-k-mooneyhttps://review.opendev.org/#/c/602432/18:33
sean-k-mooneyyour familar with https://bugs.launchpad.net/neutron/+bug/1734320 and https://bugs.launchpad.net/neutron/+bug/181598918:33
openstackLaunchpad bug 1734320 in OpenStack Compute (nova) "Eavesdropping private traffic" [Undecided,In progress] - Assigned to sean mooney (sean-k-mooney)18:33
openstackLaunchpad bug 1815989 in OpenStack Compute (nova) "OVS drops RARP packets by QEMU upon live-migration causes up to 40s ping pause in Rocky" [Medium,In progress]18:33
mnaserah yes18:33
mnaserso our lack of control when tap goes up18:33
sean-k-mooneywhat happens today is os-vif set up the port and the l2 agent start to wire it up18:34
sean-k-mooneythen libvirt deletes an recreates it18:34
sean-k-mooneythat is only a problem when using contrack firewall18:35
mnaseroh yikes, i thought it was os-vif that was wiring up what libvirt creates (but then that's just a race in another direction too)18:35
sean-k-mooneynoop18:35
sean-k-mooneylibvirt basicaly does if exist delete and recreate18:35
sean-k-mooneyundoing all our careful external event stuff18:36
sean-k-mooneyfor hybrid plug/iptables-firewall, ovs-dpdk and hardware offload18:36
sean-k-mooneyos-vif and neutron work togeter18:37
sean-k-mooneybut when you use libvirt in the non hybridg plug case it just breaks everything18:37
sean-k-mooneyi really wish i or someone lese had time to finish off https://review.opendev.org/#/c/64025818:38
sean-k-mooneybut i dont have time to work on that at the moement18:39
mnaseri share that sentiment :)18:40
sean-k-mooneythe main proable is while i got it work 15 months ago i dont follow neutron dev that closely anymore so i would basicaly have to start from scrat to figure out why its not working now18:42
*** masayukig has quit IRC18:58
*** masayukig has joined #openstack-nova18:59
*** owalsh has quit IRC19:20
*** owalsh has joined #openstack-nova19:29
*** damien_r has quit IRC19:32
*** mloza has quit IRC19:35
*** mloza has joined #openstack-nova19:36
*** mriedem has joined #openstack-nova19:39
*** mriedem is now known as cremefraiche19:40
*** ralonsoh has quit IRC19:53
*** ralonsoh has joined #openstack-nova19:53
*** johnthetubaguy has quit IRC19:56
*** johnthetubaguy has joined #openstack-nova19:58
*** hemna has quit IRC19:59
*** hemna has joined #openstack-nova19:59
*** bbowen has joined #openstack-nova20:00
openstackgerritGhanshyam Mann proposed openstack/nova master: Add test coverage of baremetal nodes policies  https://review.opendev.org/74170720:01
*** johnthetubaguy has quit IRC20:05
*** johnthetubaguy has joined #openstack-nova20:08
*** johnthetubaguy has quit IRC20:14
*** johnthetubaguy has joined #openstack-nova20:17
openstackgerritGhanshyam Mann proposed openstack/nova master: Introduce scope_types in baremetal node  https://review.opendev.org/74170920:23
*** mloza has quit IRC20:27
*** johnthetubaguy has quit IRC20:32
*** sapd1_x has quit IRC20:32
*** sapd1_x has joined #openstack-nova20:33
*** johnthetubaguy has joined #openstack-nova20:35
openstackgerritAlex Deiter proposed openstack/nova master: Detach is broken for multi-attached fs-based volumes  https://review.opendev.org/74171220:41
*** markvoelker has quit IRC20:42
*** johnthetubaguy has quit IRC20:44
*** johnthetubaguy has joined #openstack-nova20:48
openstackgerritMerged openstack/nova stable/ussuri: zuul: remove legacy-tempest-dsvm-neutron-dvr-multinode-full  https://review.opendev.org/74159020:59
openstackgerritGhanshyam Mann proposed openstack/nova master: Add new default roles in baremetal ndoes policies  https://review.opendev.org/74171621:06
openstackgerritGhanshyam Mann proposed openstack/nova master: Pass the actual target in baremetal nodes policy  https://review.opendev.org/74171721:09
*** raildo has quit IRC21:11
*** rpittau has quit IRC21:12
*** rpittau has joined #openstack-nova21:13
openstackgerritLuigi Toscano proposed openstack/nova stable/train: zuul: remove legacy-tempest-dsvm-neutron-dvr-multinode-full  https://review.opendev.org/74172021:18
*** hamalq has joined #openstack-nova21:28
*** hamalq_ has quit IRC21:31
*** gyee has quit IRC21:40
*** gyee has joined #openstack-nova21:45
*** melwitt is now known as jgwentworth21:49
*** lbragstad has quit IRC21:56
openstackgerritMerged openstack/nova stable/pike: Reproduce bug 1862633  https://review.opendev.org/74063421:56
openstackbug 1862633 in OpenStack Compute (nova) "unshelve leak allocation if update port fails" [Medium,Fix released] https://launchpad.net/bugs/1862633 - Assigned to Balazs Gibizer (balazs-gibizer)21:56
*** lbragstad has joined #openstack-nova21:56
*** ralonsoh has quit IRC22:11
*** k_mouza has joined #openstack-nova22:17
*** k_mouza has quit IRC22:22
*** markvoelker has joined #openstack-nova22:22
*** markvoelker has quit IRC22:27
*** cremefraiche has left #openstack-nova22:33
*** ociuhandu has joined #openstack-nova22:39
openstackgerritAlex Deiter proposed openstack/nova master: Detach is broken for multi-attached fs-based volumes  https://review.opendev.org/74171222:43
*** ociuhandu has quit IRC22:44
*** markvoelker has joined #openstack-nova22:47
*** dklyle has quit IRC22:50
*** markvoelker has quit IRC22:52
*** tosky_ has joined #openstack-nova22:52
*** tosky has quit IRC22:53
*** mlavalle has quit IRC22:59
*** gyee has quit IRC23:01
*** tosky_ is now known as tosky23:01
*** dklyle has joined #openstack-nova23:05
*** tosky has quit IRC23:38
*** hamalq has quit IRC23:38

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!