Monday, 2020-03-16

*** tetsuro has joined #openstack-nova00:05
*** tosky has quit IRC00:14
*** brinzhang has joined #openstack-nova00:18
*** tetsuro_ has joined #openstack-nova00:23
*** tetsuro has quit IRC00:25
*** dave-mccowan has joined #openstack-nova00:35
*** lbragstad has joined #openstack-nova00:38
*** xek has quit IRC00:47
*** xek has joined #openstack-nova00:47
*** brinzhang has quit IRC00:52
*** brinzhang has joined #openstack-nova00:52
openstackgerritBrin Zhang proposed openstack/nova master: Add new default roles in os-instance-actions policies  https://review.opendev.org/70647001:00
brinzhanggmann: done, thanks. And I replied to your question; you can review again when you are free.01:03
*** nweinber has joined #openstack-nova01:27
*** Liang__ has joined #openstack-nova01:27
*** yaawang has quit IRC01:40
*** yaawang has joined #openstack-nova01:40
*** dave-mccowan has quit IRC01:43
*** dave-mccowan has joined #openstack-nova01:52
*** ociuhandu has joined #openstack-nova02:04
*** tetsuro has joined #openstack-nova02:08
*** ociuhandu has quit IRC02:10
*** tetsuro_ has quit IRC02:11
*** dave-mccowan has quit IRC02:29
*** yaawang has quit IRC02:39
*** yaawang has joined #openstack-nova02:39
openstackgerritBrin Zhang proposed openstack/nova master: Store instance action event exc_val fault details  https://review.opendev.org/69442802:47
openstackgerritBrin Zhang proposed openstack/nova master: Expose instance action event details out of the API  https://review.opendev.org/69443002:47
openstackgerritBrin Zhang proposed openstack/nova master: Add instance actions v283 samples test  https://review.opendev.org/70625102:47
*** ociuhandu has joined #openstack-nova02:54
*** tetsuro_ has joined #openstack-nova02:59
*** tetsuro has quit IRC03:03
*** ociuhandu has quit IRC03:04
*** ociuhandu has joined #openstack-nova03:06
*** sapd1 has joined #openstack-nova03:08
*** ociuhandu has quit IRC03:11
openstackgerritKevin Zhao proposed openstack/nova master: Add default cpu model for aarch64  https://review.opendev.org/70949403:14
*** tetsuro_ has quit IRC03:16
*** mkrai has joined #openstack-nova03:18
*** tetsuro has joined #openstack-nova03:32
*** tetsuro_ has joined #openstack-nova03:42
*** tetsuro has quit IRC03:45
*** brinzhang_ has joined #openstack-nova03:47
*** brinzhang has quit IRC03:51
*** damien_r has joined #openstack-nova03:51
*** damien_r has quit IRC03:56
*** udesale has joined #openstack-nova04:25
*** ratailor has joined #openstack-nova04:40
*** nweinber has quit IRC04:41
*** tetsuro_ has quit IRC05:00
*** slaweq has joined #openstack-nova05:13
openstackgerritKevin Zhao proposed openstack/nova master: fix ut error on arm64  https://review.opendev.org/71316305:31
*** evrardjp has quit IRC05:35
*** evrardjp has joined #openstack-nova05:36
*** links has joined #openstack-nova05:37
openstackgerritKevin Zhao proposed openstack/nova master: Add default cpu model for aarch64  https://review.opendev.org/70949405:42
*** ociuhandu has joined #openstack-nova05:43
*** ociuhandu has quit IRC05:44
*** ociuhandu has joined #openstack-nova05:44
*** ircuser-1 has quit IRC05:50
*** ociuhandu has quit IRC05:54
*** ociuhandu has joined #openstack-nova05:55
*** ociuhandu has quit IRC06:01
*** irclogbot_0 has quit IRC06:29
*** lbragstad has quit IRC06:37
*** dpawlik has joined #openstack-nova06:58
*** ratailor has quit IRC07:05
*** ratailor has joined #openstack-nova07:08
*** tetsuro has joined #openstack-nova07:19
*** damien_r has joined #openstack-nova07:27
*** irclogbot_2 has joined #openstack-nova07:30
*** damien_r has quit IRC07:33
*** tetsuro_ has joined #openstack-nova07:41
*** tetsuro has quit IRC07:44
*** ociuhandu has joined #openstack-nova07:47
openstackgerritKevin Zhao proposed openstack/nova master: Add default cpu model for aarch64  https://review.opendev.org/70949407:54
*** iurygregory has joined #openstack-nova08:02
*** dpawlik has quit IRC08:07
*** dpawlik has joined #openstack-nova08:07
*** maciejjozefczyk has joined #openstack-nova08:11
*** tesseract has joined #openstack-nova08:12
*** damien_r has joined #openstack-nova08:13
*** rcernin has quit IRC08:14
*** rpittau|afk is now known as rpittau08:15
*** damien_r has quit IRC08:18
*** dpawlik has quit IRC08:18
*** amoralej|off is now known as amoralej08:20
*** ociuhandu has quit IRC08:20
*** tkajinam has quit IRC08:23
*** breizhkoala has joined #openstack-nova08:26
*** dpawlik has joined #openstack-nova08:28
*** sapd1 has quit IRC08:39
*** sapd1 has joined #openstack-nova08:39
*** ralonsoh has joined #openstack-nova08:42
*** elod has joined #openstack-nova08:46
*** ociuhandu has joined #openstack-nova08:50
openstackgerritwaleed mousa proposed openstack/os-vif master: [Follow Up] OVS DPDK port representors support  https://review.opendev.org/70501808:56
*** ccamacho has joined #openstack-nova08:57
*** tetsuro_ has quit IRC09:00
*** jaosorior has joined #openstack-nova09:01
brinzhang_stephenfin: can you review the os-volume-attachments refresh default policy patch (the end)? https://review.opendev.org/#/c/710190/  it prevents me from adding the completed destroy-instance-with-datavolume implementation09:02
*** aarents has quit IRC09:03
brinzhang_stephenfin: as the same as os-instance-actions policies https://review.opendev.org/#/c/706470/, it prevents the action-event-fault-details feature09:03
lyarwoodhttps://zuul.opendev.org/t/openstack/build/790d84d24ac946ec90712585a301ba3f/log/job-output.txt#6750 <- anyone with more bash-fu than me able to tell me why ceph.sh is exiting with 1 here? This works locally and I can't see anything useful dumped with set -o xtrace09:03
brinzhang_johnthetubaguy: also need you check09:04
brinzhang_johnthetubaguy: stephenfin: thanks09:04
openstackgerritLee Yarwood proposed openstack/nova master: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71303509:06
*** tosky has joined #openstack-nova09:07
*** ociuhandu has quit IRC09:10
*** ociuhandu has joined #openstack-nova09:11
*** Liang__ has quit IRC09:15
*** mkrai has quit IRC09:21
*** mkrai has joined #openstack-nova09:22
*** aarents has joined #openstack-nova09:32
*** martinkennelly has joined #openstack-nova09:40
openstackgerritLee Yarwood proposed openstack/nova master: images: Move qemu-img info calls into privsep  https://review.opendev.org/70689709:45
openstackgerritLee Yarwood proposed openstack/nova master: images: Allow the output format of qemu-img info to be controlled  https://review.opendev.org/70689809:45
openstackgerritLee Yarwood proposed openstack/nova master: virt: Pass request context to extend_volume  https://review.opendev.org/70689909:45
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Correctly resize encrypted LUKSv1 volumes  https://review.opendev.org/70690009:45
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Use oslo.utils >= 4.1.0 to fetch format-specific image data  https://review.opendev.org/71078509:45
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Always provide the size in bytes when calling virDomainBlockResize  https://review.opendev.org/70759009:45
openstackgerritLee Yarwood proposed openstack/nova master: images: Remove Libvirt specific configurable use from qemu_img_info  https://review.opendev.org/70759109:45
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Remove QEMU_VERSION_REQ_SHARED  https://review.opendev.org/71023909:45
openstackgerritLee Yarwood proposed openstack/nova master: images: Make JSON the default output format of calls to qemu-img info  https://review.opendev.org/71167909:45
lyarwoodhmmm do I need to be part of a particular group to triage bugs? https://bugs.launchpad.net/nova/+bug/186402010:07
openstackLaunchpad bug 1864020 in OpenStack Compute (nova) "libvirt.libvirtError: Requested operation is not valid: format of backing image %s of image %s was not specified in the image metadata (See https://libvirt.org/kbase/backing_chains.html for troubleshooting)" [Undecided,Fix committed] - Assigned to Lee Yarwood (lyarwood)10:07
lyarwoodI thought I was but I can't seem to set the importance of the above bug10:07
*** nightmare_unreal has joined #openstack-nova10:08
donnydnightmare_unreal:  check this out https://docs.openstack.org/nova/train/contributor/index.html10:09
nightmare_unrealsure10:09
donnydthe nova contributor guide is pretty rich in detail on what it takes to get up and running10:10
donnydhttps://docs.openstack.org/nova/train/contributor/how-to-get-involved.html10:10
*** ratailor has quit IRC10:11
*** ratailor has joined #openstack-nova10:13
openstackgerritLee Yarwood proposed openstack/nova master: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71303510:29
*** nightmare_unreal has quit IRC10:38
*** zigo has joined #openstack-nova10:52
openstackgerritBalazs Gibizer proposed openstack/nova stable/rocky: Reproduce bug 1862633  https://review.opendev.org/71318710:56
openstackbug 1862633 in OpenStack Compute (nova) "unshelve leak allocation if update port fails" [Medium,Fix released] https://launchpad.net/bugs/1862633 - Assigned to Balazs Gibizer (balazs-gibizer)10:56
*** ociuhandu has quit IRC11:02
*** ociuhandu has joined #openstack-nova11:02
openstackgerritLuyao Zhong proposed openstack/nova master: bug-fix: Reject live migration with vpmem  https://review.opendev.org/70811011:04
openstackgerritLuyao Zhong proposed openstack/nova master: address specific resources cleanup issue  https://review.opendev.org/69914811:04
openstackgerritLuyao Zhong proposed openstack/nova master: support live migration with vpmems  https://review.opendev.org/68785611:04
openstackgerritLuyao Zhong proposed openstack/nova master: Track orphan instances and error migrations in resource tracker  https://review.opendev.org/67845111:04
*** ociuhandu has quit IRC11:07
*** ratailor_ has joined #openstack-nova11:07
*** ratailor has quit IRC11:10
openstackgerritJohn Garbutt proposed openstack/nova master: Add unified limits configuration  https://review.opendev.org/71213711:18
openstackgerritJohn Garbutt proposed openstack/nova master: Add logic to enforce local api and db limits  https://review.opendev.org/71213911:18
openstackgerritLee Yarwood proposed openstack/nova master: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71303511:19
elodlyarwood: this is the team you need if i'm not mistaken: https://launchpad.net/~nova-bugs11:22
*** eharney has joined #openstack-nova11:23
*** sean-k-mooney has joined #openstack-nova11:25
lyarwoodelod: thanks, I was sure I was already but nvm11:27
*** tkajinam has joined #openstack-nova11:27
*** tkajinam has quit IRC11:29
lyarwoodah membership expires11:29
lyarwoodweird11:29
elodyeah. You are member since 2016 it says11:29
lyarwoodyeah just renewed11:30
lyarwoodno excuses not to triage now I guess /o\11:30
elod:]11:30
sean-k-mooneylyarwood: do you know if there is an issue with the internal vpn11:31
lyarwoodsean-k-mooney: nope I'm on AFAICT11:31
* lyarwood checks11:31
sean-k-mooneylyarwood: the amsterdam site keeps kicking my connection11:31
*** rpittau is now known as rpittau|bbl11:31
*** ociuhandu has joined #openstack-nova11:31
lyarwoodsean-k-mooney: try FAB11:31
sean-k-mooneyok i have to go find the config file and update my user name but ill give it a try11:32
* lyarwood assumes every corps VPN endpoints are getting hammered at the moment11:32
luyaolyarwood: I addressed the 'do_cleanup' flag issue according to your and alex_xu's comments, could you look at it again?  https://review.opendev.org/#/c/687856/1211:34
lyarwoodluyao: ack will look shortly11:35
luyaolyarwood: thanks11:35
openstackgerritBalazs Gibizer proposed openstack/nova stable/rocky: Clean up allocation if unshelve fails due to neutron  https://review.opendev.org/71319611:40
*** jangutter has joined #openstack-nova11:40
luyaobrinzhang_: your comments were addressed, thanks https://review.opendev.org/#/c/687856/1211:40
luyaostephenfin: Hi, are you around11:45
stephenfinyup11:46
brinzhang_luyao: thanks, I have not reviewed all, I will do continue while I am free :)11:46
luyaostephenfin: Do you have time to review vpmem live migration support? I believe you have been familiar with the vpmem feature. :). https://review.opendev.org/#/q/topic:support-live-migration-with-virtual-persistent-memory+(status:open+OR+status:merged)11:46
luyaobrinzhang_: Cool, Thanks11:47
brinzhang_luyao: because of some works later to continue, sorry11:47
stephenfinluyao: Oh, I meant to take a look at that. Can do11:47
brinzhang_stephenfin: do you see my comments above?11:48
luyaostephenfin: Thanks a lot11:48
stephenfinyup, also on my list11:48
brinzhang_stephenfin: thanks11:48
brinzhang_these features are all ready for review; the api change and novaclient change are all done :)11:49
*** mkrai has quit IRC12:02
openstackgerritwaleed mousa proposed openstack/os-vif master: [Follow Up] OVS DPDK port representors support  https://review.opendev.org/70501812:03
*** Luzi has joined #openstack-nova12:07
openstackgerritwaleed mousa proposed openstack/os-vif master: [Follow Up] OVS DPDK port representors support  https://review.opendev.org/70501812:07
brinzhang_dansmith: I have added the nova and non-nova exception functional tests for instance action events fault details, pls see https://review.opendev.org/#/c/694430/7/nova/tests/functional/test_instance_actions.py12:09
*** rcernin has joined #openstack-nova12:10
openstackgerritBalazs Gibizer proposed openstack/nova stable/rocky: Reproduce bug 1862633  https://review.opendev.org/71318712:13
openstackbug 1862633 in OpenStack Compute (nova) "unshelve leak allocation if update port fails" [Medium,Fix released] https://launchpad.net/bugs/1862633 - Assigned to Balazs Gibizer (balazs-gibizer)12:13
openstackgerritBalazs Gibizer proposed openstack/nova stable/rocky: Clean up allocation if unshelve fails due to neutron  https://review.opendev.org/71319612:13
*** breizhkoala has quit IRC12:16
*** mkrai has joined #openstack-nova12:18
*** udesale_ has joined #openstack-nova12:26
*** ratailor__ has joined #openstack-nova12:27
*** udesale has quit IRC12:28
*** nweinber has joined #openstack-nova12:29
*** ratailor_ has quit IRC12:30
*** jraju__ has joined #openstack-nova12:33
*** links has quit IRC12:34
openstackgerritLee Yarwood proposed openstack/nova master: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71303512:38
*** ratailor__ has quit IRC12:39
*** ociuhandu has quit IRC12:43
*** ociuhandu has joined #openstack-nova12:43
*** mkrai has quit IRC12:47
*** tkajinam has joined #openstack-nova12:48
*** mgariepy has joined #openstack-nova12:49
*** nicolasbock has joined #openstack-nova12:56
*** damien_r has joined #openstack-nova13:01
*** ociuhandu has quit IRC13:02
*** artom has joined #openstack-nova13:03
*** ociuhandu has joined #openstack-nova13:03
*** amoralej is now known as amoralej|lunch13:07
*** lbragstad has joined #openstack-nova13:08
*** ociuhandu has quit IRC13:08
*** ociuhandu has joined #openstack-nova13:09
*** rpittau|bbl is now known as rpittau13:13
openstackgerritJohn Garbutt proposed openstack/nova master: Assert API behavior for noop quota driver  https://review.opendev.org/71214013:17
openstackgerritJohn Garbutt proposed openstack/nova master: Make unified limits APIs return reserved of 0  https://review.opendev.org/71214113:17
openstackgerritJohn Garbutt proposed openstack/nova master: Add logic to enforce local api and db limits  https://review.opendev.org/71213913:17
openstackgerritJohn Garbutt proposed openstack/nova master: Enforce api and db limits  https://review.opendev.org/71214213:17
*** beekneemech is now known as bnemec13:17
*** CeeMac has joined #openstack-nova13:18
*** kaisers_ has joined #openstack-nova13:18
*** elod has quit IRC13:21
*** elod has joined #openstack-nova13:21
*** ociuhandu has quit IRC13:26
dansmithbrinzhang_: okay, after coffee13:26
*** ociuhandu has joined #openstack-nova13:26
brinzhang_dansmith: good morning ^^13:27
*** dpawlik has quit IRC13:29
*** ociuhandu has quit IRC13:31
*** mgariepy has quit IRC13:39
*** tkajinam has quit IRC13:44
*** mkrai has joined #openstack-nova13:48
*** mgariepy has joined #openstack-nova13:48
*** mgariepy has quit IRC13:54
*** ociuhandu has joined #openstack-nova14:01
*** nightmare_unreal has joined #openstack-nova14:06
*** mgariepy has joined #openstack-nova14:08
openstackgerritBalazs Gibizer proposed openstack/nova master: Fix intermittently failing regression case  https://review.opendev.org/71324314:09
*** Luzi has quit IRC14:11
*** haleyb has joined #openstack-nova14:13
*** amoralej|lunch is now known as amoralej14:14
openstackgerritJohn Garbutt proposed openstack/nova master: Update quota_class APIs for db and api limits  https://review.opendev.org/71214314:15
sean-k-mooneybauzas: would you have time to look at https://review.opendev.org/#/c/666914/21 and the follow up patches. it would be nice to be able to close that out. gibi maybe you could take a look too since efried_gone is no longer here to review. since stephen and i are the authors we need a non redhat person to review unless we are moving away from that requirement. alex_xu or johnthetubaguy would also work if14:17
sean-k-mooneythey are around.14:17
*** mriedem has joined #openstack-nova14:19
openstackgerritLee Yarwood proposed openstack/nova master: DNM - Test TEMPEST_EXTEND_ATTACHED_ENCRYPTED_VOLUME  https://review.opendev.org/70759314:20
*** udesale_ has quit IRC14:23
*** Luzi has joined #openstack-nova14:24
gibisean-k-mooney: ack. I cannot promise too much (have a long review queue atm) but added it to my queue14:26
gibisean-k-mooney: regarding the non-RH core requirement. I see this requirement as something that will be very problematic due to less diversity in the core team14:28
sean-k-mooneygibi: ya. dont worry if you cant review but the trifecta rule is likely to become a problem unless the diversity of the core team can be restored. as someone who likes that rule i would be sad to see it go but i guess we will see how it goes14:29
openstackgerritBalazs Gibizer proposed openstack/nova master: Fix intermittently failing regression case  https://review.opendev.org/71324314:32
brinzhang_gmann: would like check os-instance-action policy again? https://review.opendev.org/#/c/706470/14:33
gmannbrinzhang_: yeah, i will check after my internal meeting14:33
brinzhang_gmann: yeah, if it is ok, I think I can continue to do the os-instance-action (bp/action-event-fault-details) feature, there is a new policy based on the default policy change14:35
brinzhang_gmann: thanks14:36
gmannbrinzhang_: yeah, you can continue on that, make dependency.14:37
brinzhang_gmann: Adding a default policy is only the remaining part, everything else is ready(TODO in it). So I want to do it after this done.14:40
openstackgerritMerged openstack/nova master: Cleanup test for system reader and reader_or_owner rules  https://review.opendev.org/71251514:40
*** ociuhandu has quit IRC14:41
*** mkrai has quit IRC14:52
*** ociuhandu has joined #openstack-nova14:54
*** gyee has joined #openstack-nova14:54
*** TxGirlGeek has joined #openstack-nova14:55
openstackgerritKevin Zhao proposed openstack/nova master: Add default cpu model for aarch64  https://review.opendev.org/70949414:57
bauzassean-k-mooney: I'll try but tbh, those next weeks will be crazy14:58
bauzasbut adding it to my queue14:59
*** ociuhandu has quit IRC14:59
sean-k-mooneybauzas: yep i understand just trying to see if we can complete some blueprints that are close to being done.14:59
bauzassure15:00
openstackgerritBrin Zhang proposed openstack/nova master: Add new default roles in os-instance-actions policies  https://review.opendev.org/70647015:01
kashyaplyarwood: Hiya, hope the renewed comments make sense: forgot that we also need MIN_QEMU_BLOCKDEV15:01
kashyaplyarwood: And a different, newer version for MIN_LIBVIRT_BLOCKDEV.  (Notes in the review :))15:02
artomRemind me again what are the criteria for reporting a VM status as UNKNOWN? Host being down is one of them, right?15:05
artommelwitt, ^^ if you're awake15:05
dansmithyeah or cell down15:05
artomdansmith, aha, ack, thanks!15:05
dansmithit started at cell down only, but I think melwitt changed that recently15:05
artomdansmith, that's the impression I got - not super important, just clearing out old (downstream) BZs15:06
*** nicolasbock has quit IRC15:08
*** nicolasbock has joined #openstack-nova15:09
*** spatel has joined #openstack-nova15:11
spatelsean-k-mooney: Good morning, if i want to disable Hyper Threading without BIOS setting how do i do that?  I was reading AWS article here using offline cpu threads, do you think this is valid way to do that - https://aws.amazon.com/blogs/compute/disabling-intel-hyper-threading-technology-on-amazon-linux/15:12
sean-k-mooneyspatel: yes you can do it via the sysfs virtual file system15:14
sean-k-mooneythe bios is the best way to do it as offline cores will not unpartition the l1 cache15:14
spatelsean-k-mooney: does that perform as good as BIOS setting ?15:15
sean-k-mooneynot quite15:15
sean-k-mooneyif you disable it in the bios it doubles the l1 cache available to the core15:15
sean-k-mooneywhen ht is enabled the l1 cache is partitioned so that each ht has its own region of the l1 cache15:15
spatelWe found erlang doing a pretty good job when HT is disabled but again i don't want to do that setting from BIOS (its painful) i want to give that control to end users15:16
sean-k-mooneybut with it disabled at the bios level all the l1 cache is available to the physical core. at least on older intel architectures15:16
sean-k-mooneyspatel: are you using cpu pinning15:17
spatelYes CPU pinning15:17
sean-k-mooneyif so you can use the cpu_thread_policy15:17
sean-k-mooneycreate multiple flavors for the erlang instance and let them choose15:17
spatelsean-k-mooney: cpu_thread_policy=isolate ?15:18
sean-k-mooneye.g. hw:cpu_thread_policy=prefer vs hw:cpu_thread_policy=isolate15:18
sean-k-mooneyyes15:18
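The "multiple flavors" suggestion above can be sketched in Python. This is a minimal illustration, not Nova code: the flavor name `erlang.isolate` is hypothetical, and the sketch assumes the thread policy is paired with `hw:cpu_policy=dedicated`, since the discussion is about pinned instances. It only builds the extra specs and the matching `openstack flavor set` command line; it does not talk to any cloud.

```python
# Thread policies accepted for the hw:cpu_thread_policy flavor extra spec.
VALID_THREAD_POLICIES = {"prefer", "isolate", "require"}


def pinned_flavor_extra_specs(thread_policy):
    """Return extra specs for a pinned flavor with the given thread policy."""
    if thread_policy not in VALID_THREAD_POLICIES:
        raise ValueError(f"unknown thread policy: {thread_policy!r}")
    return {
        # Assumption: pinned CPUs, as stated by spatel in the conversation.
        "hw:cpu_policy": "dedicated",
        "hw:cpu_thread_policy": thread_policy,
    }


def flavor_set_command(flavor_name, extra_specs):
    """Render an `openstack flavor set` command for the given extra specs."""
    props = " ".join(
        f"--property {key}={value}" for key, value in sorted(extra_specs.items())
    )
    return f"openstack flavor set {props} {flavor_name}"


if __name__ == "__main__":
    # "erlang.<policy>" are hypothetical flavor names used for illustration.
    for policy in ("prefer", "isolate"):
        specs = pinned_flavor_extra_specs(policy)
        print(flavor_set_command(f"erlang.{policy}", specs))
```

Offering one flavor per policy lets end users pick the behaviour per instance instead of changing a host-wide BIOS setting.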
spatelsean-k-mooney: I have tried all kind of combination but erlang doesn't like it.15:18
spatelwhen i run VM on single NUMA performance is really good15:19
sean-k-mooneythe sysfs performance delta is pretty small since normally your app/data wont fit in l1 anyway15:19
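The sysfs approach mentioned above (taking sibling hyperthreads offline instead of changing the BIOS) can be sketched as follows. This is a rough sketch under stated assumptions: it relies on the standard Linux files `/sys/devices/system/cpu/cpuN/topology/thread_siblings_list` and `/sys/devices/system/cpu/cpuN/online`; the helper names are hypothetical; and the choice to keep the lowest-numbered thread of each core is an illustrative convention. Writing to `online` needs root, so `offline_cpu` is defined but never called here.

```python
import glob
import os
import re


def parse_cpu_list(text):
    """Parse a sysfs CPU list such as '0,24' or '0-1' into sorted ints."""
    cpus = set()
    for part in text.strip().split(","):
        if not part:
            continue
        if "-" in part:
            low, high = part.split("-")
            cpus.update(range(int(low), int(high) + 1))
        else:
            cpus.add(int(part))
    return sorted(cpus)


def cpus_to_offline(siblings_by_cpu):
    """Keep the lowest-numbered thread per core; return the other threads."""
    offline = set()
    for text in siblings_by_cpu.values():
        group = parse_cpu_list(text)
        offline.update(group[1:])
    return sorted(offline)


def read_siblings(sysfs_root="/sys/devices/system/cpu"):
    """Map each online CPU number to its thread_siblings_list contents."""
    result = {}
    pattern = os.path.join(sysfs_root, "cpu[0-9]*", "topology", "thread_siblings_list")
    for path in glob.glob(pattern):
        match = re.search(r"cpu(\d+)", path[len(sysfs_root):])
        with open(path) as handle:
            result[int(match.group(1))] = handle.read()
    return result


def offline_cpu(cpu, sysfs_root="/sys/devices/system/cpu"):
    """Take one CPU offline (requires root; not called in this sketch)."""
    with open(os.path.join(sysfs_root, f"cpu{cpu}", "online"), "w") as handle:
        handle.write("0")


if __name__ == "__main__":
    print("would offline:", cpus_to_offline(read_siblings()))
```

As noted in the conversation, this leaves the L1 cache partitioning as the hardware decides, so it is not fully equivalent to disabling HT in the BIOS.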
spatelsean-k-mooney: look at this - https://imgur.com/a/8zapZ8x15:21
spatelTo understand better CPU topology i am comparing them with AWS and here what i found15:22
spatelOn my openstack VM CPU topo looks very strange15:22
sean-k-mooneythe lower image looks like what i would expect15:23
spatell1d & l1i cache is shared15:23
spatelwhy aws has l1d and l1i outside15:23
sean-k-mooneyi dont know15:23
sean-k-mooneybut you can alter this in the libvirt xml i think15:24
spatelThat is the problem, I have check with Alicloud and aws and both has perfect CPU topo but my openstack has very odd output15:24
sean-k-mooneythis is not something we would expose however15:24
spatelI think it could be QEMU version or bug15:24
sean-k-mooneywell what do you mean by odd15:25
spatelI am planning to upgrade my qemu to 4.2  (currently running 2.12)15:25
sean-k-mooneythe image you provided showing the kvm instance15:25
sean-k-mooneylooks like real hardware would15:25
spatelqemu-kvm15:25
spatelboth are virtual machine15:25
sean-k-mooneysure but looking at https://imgur.com/a/8zapZ8x the bottom image looks correct the top looks incorrect15:26
spatelyou are saying AWS instance looks incorrect?15:27
sean-k-mooneyyes15:27
sean-k-mooneythat is the topology that we should see if and only if you had HT disabled15:27
spatelIf i run same command on my host compute it looks exactly like AWS one15:27
spatelsean-k-mooney: no15:27
spatelLet me show you my two physical compute topo (HT vs non-HT)15:28
spatelhold on..15:28
sean-k-mooneyok so looking locally they have changed how this works in later versions15:30
sean-k-mooneythe view that you see in openstack is how it used to work in nehalem and i believe up to sandybridge or ivybridge15:30
sean-k-mooneyspatel: as i said the bios setting used to change the topology between the aws one and the openstack one at the hardware level15:32
spatelsean-k-mooney: https://imgur.com/a/at3WBBf15:32
spatelThese are my two compute hosts (one has HT enabled and the second has HT disabled)15:32
sean-k-mooneyspatel: yep as i said this has changed with different hardware microarchitectures15:33
spatelIf you look at the bottom picture (its very similar to the AWS virtual instance, that means the AWS virtual machine is correctly exposing physical topology including cache)15:33
sean-k-mooneyspatel: openstack/nova is not currently setting the cpu cache topology, its decided by libvirt15:33
sean-k-mooneyspatel: openstack is not meant to expose the host topology by default15:34
spateleven in host-passthrough ?15:34
sean-k-mooneycorrect15:34
sean-k-mooneyopenstack does not specify the cache topology at all15:34
spatelhmmm15:34
sean-k-mooneythat is left entirely to libvirt today15:34
sean-k-mooneylibvirt allows us to set this but we dont so you get whatever libvirt/qemu decides to provide15:35
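For reference, the libvirt knob alluded to above is, as far as I know, the `<cache>` child of the domain `<cpu>` element, with `mode` set to `emulate`, `passthrough`, or `disable`. The sketch below only builds such a fragment with the standard library; it is not something Nova generated at the time of this conversation, the helper name is hypothetical, and whether it would reproduce the AWS-like topology in the screenshots is untested here.

```python
import xml.etree.ElementTree as ET

# Cache modes documented for the libvirt <cache> element.
CACHE_MODES = {"emulate", "passthrough", "disable"}


def cpu_with_cache(cache_mode="passthrough"):
    """Build a <cpu mode='host-passthrough'> fragment with a <cache> child."""
    if cache_mode not in CACHE_MODES:
        raise ValueError(f"unknown cache mode: {cache_mode!r}")
    # Assumption: host-passthrough CPU mode, as raised by spatel above.
    cpu = ET.Element("cpu", {"mode": "host-passthrough"})
    ET.SubElement(cpu, "cache", {"mode": cache_mode})
    return ET.tostring(cpu, encoding="unicode")


if __name__ == "__main__":
    print(cpu_with_cache("passthrough"))
```

A fragment like this would have to be applied to the domain XML by hand (for example with `virsh edit`), and Nova may regenerate the domain XML on operations such as a hard reboot.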
spatelhmm! how is ALI cloud doing this, even though they are running openstack15:35
spatelmaybe they have a hacked version of the software designed by them15:35
spatelAnyway so you think BIOS level is the best way, then i will go with that but it will be a growing pain as my cloud grows :(15:37
*** breizhkoala has joined #openstack-nova15:38
openstackgerritJohn Garbutt proposed openstack/nova master: Update limit APIs  https://review.opendev.org/71270715:39
openstackgerritJohn Garbutt proposed openstack/nova master: Update quota sets APIs  https://review.opendev.org/71274915:39
openstackgerritJohn Garbutt proposed openstack/nova master: WIP: Enforce unified limits using oslo.limit  https://review.opendev.org/61518015:39
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Use virDomainBlockCopy to swap volumes when using -blockdev  https://review.opendev.org/69683415:48
lyarwoodkashyap: ^ updated btw15:48
openstackgerritBrin Zhang proposed openstack/nova master: Add new default roles in os-volumes-attachments policies  https://review.opendev.org/71019015:52
openstackgerritBrin Zhang proposed openstack/nova master: Add PATCH volume attachments api to os-volume_attachments  https://review.opendev.org/69382815:52
openstackgerritBrin Zhang proposed openstack/nova master: Add new policy to PATCH update volume API  https://review.opendev.org/71119415:52
openstackgerritBrin Zhang proposed openstack/nova master: Add functional tests for PATCH volume attachments API  https://review.opendev.org/71096515:52
*** ociuhandu has joined #openstack-nova15:55
dansmithsean-k-mooney: AFAIK, the cyborg patch that generates the libvirt xml hasn't changed much, and you've tested that at some point with real devices (right?) so we can assume it works without much fanfare?15:57
*** iurygregory is now known as iurygregory|brb15:57
kashyaplyarwood: Will check; thx15:59
*** ociuhandu has quit IRC16:00
sean-k-mooneydansmith: i havent tested with real device no16:00
dansmithoh I thought you had okay16:01
sean-k-mooneydansmith: i can go specifically review that patch however16:01
dansmithpresumably sundar has16:01
sean-k-mooneyi believe you are correct in that it does not change much16:01
sean-k-mooneydansmith: yes sundar has apparently tested it with the rushcreak fpga card16:01
dansmithI looked over it a while back and I think the only way I'd be able to find stuff really wrong with it is through log examination16:02
dansmithit's pretty straightforward16:02
*** ivve has joined #openstack-nova16:03
sean-k-mooneydansmith: i rebased the cyborg devstack plugin multinode this morning by the way. just to resolve the merge conflict16:05
dansmithI saw, thanks16:05
sean-k-mooneyif i rebase it again do you want me to move your host name fix patch lower? hopefully they will merge soon anyway but that is useful outside of multinode testing16:06
dansmithit's not super critical unless it's blocking people.. I put it later just to avoid messing up your series, but obviously it's probably an easy merge.. your call16:06
*** mmethot_ has quit IRC16:07
sean-k-mooneyok if i need to respin i can move it down. i dont think others have really complained about it plus you can always override the host via the local.conf anyway16:08
dansmithyup.. I imagine that's because most people are using more throwaway machines for their testing (and it's probably not getting a very wide audience anyway) but.. yep, not critical and there is a workaround16:11
melwittartom, dansmith: fyi I didn't change the meaning of the UNKNOWN status, it originally was only for host down and then when the down cells handling was added, it was used for that as well. my change was just a new policy rule to allow UNKNOWN status to be seen by non-admin if indicated by policy16:12
dansmithmelwitt: it was originally for cell down only, AFAIR16:13
melwittthis is the logic for host down https://github.com/openstack/nova/blob/master/nova/compute/api.py#L5339-L535116:13
dansmithmark-host-down left the status in place16:13
dansmiththat's host status16:13
sean-k-mooneydansmith: on a slightly different topic do you have interest in/ time to review the provider.yaml series? just trying to figure out which redhat cores to bug as a reviewer when its ready.16:14
dansmithhe's talking about instance status right?16:14
melwittwasn't that the question?16:14
sean-k-mooneymelwitt: since your here same question ^16:14
melwittoh, sorry. sigh16:14
*** sapd1 has quit IRC16:14
dansmithmelwitt: vm status16:14
melwittwell, either way I didn't change the meaning of vm status either16:14
dansmithmelwitt: I thought you were proposing the vm status change but okay16:15
* sean-k-mooney brb16:15
melwittdansmith: I did but you explained why it wouldn't be a good idea and I agreed with your reasoning and updated the spec to stop proposing it. the spec was approved some time after that16:16
*** mmethot has joined #openstack-nova16:16
dansmithsean-k-mooney: I dunno, I don't have a huge interest in reviewing that16:16
melwitt*I did originally16:16
dansmithmelwitt: ack, I didn't remember, I thought you had kept that in.16:16
*** derekh has joined #openstack-nova16:17
melwittartom: sorry I got vm status and host status mixed up. UNKNOWN vm status is for down cell only and was not changed as a result of my adding a host_status:unknown-only policy rule16:21
artommelwitt, oh? So what happens when the host isn't reachable? We report the last recorded status from the DB?16:22
melwittartom: correct. only host status will say UNKNOWN16:22
dansmithhost_status on the instance gives you a sanitized "don't expect this instance to be actionable because the host is not healthy"16:22
dansmithnormally only admins can see info about hosts, so that field is the indicator to the user that "things are not as they appear" without exposing too much16:23
melwittartom: and host status is normally admin-only, so I added a new policy rule host_status:unknown-only that defaults to admin-only intended for operators who want to let non-admin users see UNKNOWN host status16:23
melwitthost_status policy rule includes showing UP, DOWN, MAINTENANCE, UNKNOWN and host_status:unknown-only shows only UNKNOWN16:24
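The two rules described above can be modelled with a few lines of Python. This is a simplified model of the behaviour as described in the conversation, not the Nova implementation: the fully qualified rule names are assumed, the function name is hypothetical, and real policy checks go through oslo.policy rather than a plain set of allowed rule names.

```python
# Assumed full policy names for the two rules discussed above.
HOST_STATUS_RULE = "os_compute_api:servers:show:host_status"
UNKNOWN_ONLY_RULE = "os_compute_api:servers:show:host_status:unknown-only"


def visible_host_status(actual_status, allowed_rules):
    """Return the host_status a caller may see, or None if it is hidden.

    allowed_rules is the set of policy rule names the caller passes.
    """
    if HOST_STATUS_RULE in allowed_rules:
        # Full rule: UP, DOWN, MAINTENANCE and UNKNOWN are all visible.
        return actual_status
    if UNKNOWN_ONLY_RULE in allowed_rules and actual_status == "UNKNOWN":
        # Narrow rule: only the UNKNOWN state is revealed.
        return "UNKNOWN"
    return None


if __name__ == "__main__":
    for status in ("UP", "DOWN", "MAINTENANCE", "UNKNOWN"):
        print(status, "->", visible_host_status(status, {UNKNOWN_ONLY_RULE}))
```

With only the narrow rule granted, a non-admin user learns that an instance's host is unreachable without seeing any other host state.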
artommelwitt, aha, thanks :) Can you find a link to that patch/spec/whatever?16:24
melwittartom: https://review.opendev.org/67918116:24
*** sapd1 has joined #openstack-nova16:27
*** tesseract has quit IRC16:33
sean-k-mooneydansmith: no worries. gibi and eric were the main reviewers so im just thinking of who can take over from eric. when its ready ill add it to the runway list and see who is interested16:34
*** ociuhandu has joined #openstack-nova16:34
gibisean-k-mooney: the provider config series are also on my radar16:43
evrardjphttps://review.opendev.org/711950 has merged. Congratulations!16:44
sean-k-mooneygibi: yep im addressing the feedback on the last patch currently16:44
gibisean-k-mooney: cool. thanks16:44
sean-k-mooneyill take a look at the unit tests once i have that done16:45
*** ociuhandu has quit IRC16:53
*** ociuhandu has joined #openstack-nova16:53
*** brtknr_ has quit IRC16:57
*** brtknr has joined #openstack-nova16:57
gibicores, there is a trivial, functional test only change: https://review.opendev.org/#/c/713243/16:58
*** ociuhandu has quit IRC16:58
*** rpittau is now known as rpittau|afk17:07
*** iurygregory|brb is now known as iurygregory17:08
*** Luzi has quit IRC17:14
openstackgerritmelanie witt proposed openstack/nova master: Follow-ups for host_status:unknown-only policy rule  https://review.opendev.org/71329517:21
openstackgerritMerged openstack/nova master: Fix intermittently failing regression case  https://review.opendev.org/71324317:23
melwittdansmith: pedantic correction to what I said earlier, the spec was abandoned bc it was decided to no longer need a spec when it became policy rule only. and then blueprint was approved after it became policy rule only17:28
melwitt(in case you ever go looking at the spec and can't find it, like I just did)17:29
melwitt*for17:29
*** evrardjp has quit IRC17:35
*** evrardjp has joined #openstack-nova17:36
openstackgerritJohn Garbutt proposed openstack/nova master: WIP: Enforce resource limits using oslo.limit  https://review.opendev.org/61518017:49
openstackgerritJohn Garbutt proposed openstack/nova master: WIP: Tell oslo.limit how to count nova resources  https://review.opendev.org/71330117:49
melwittgibi: I dunno if you saw my comment in https://review.opendev.org/712674 test just needs a tweak to handle kwarg vs positional arg17:54
dansmithmelwitt: heh okay..17:55
melwittjust in case you opened it and went, hWHAT! abandoned! ?!?!17:57
*** dtantsur is now known as dtantsur|afk17:57
*** jangutter has quit IRC17:57
*** derekh has quit IRC18:00
*** damien_r has quit IRC18:06
*** ociuhandu has joined #openstack-nova18:07
openstackgerritDan Smith proposed openstack/nova master: Remove non-optional kwarg for virt block_device_info  https://review.opendev.org/71331018:09
*** damien_r has joined #openstack-nova18:10
*** maciejjozefczyk has quit IRC18:13
*** ociuhandu has quit IRC18:23
*** ociuhandu has joined #openstack-nova18:24
*** ociuhandu has quit IRC18:29
*** maciejjozefczyk has joined #openstack-nova18:29
gibimelwitt: ack, now see it. thanks. I will get back to that tomorrow18:36
*** breizhkoala has quit IRC18:37
melwittgibi: k. I didn't want to update it, so I will be able to +218:38
gibimelwitt: sure18:39
openstackgerritBalazs Gibizer proposed openstack/nova master: [Community goal] Update contributor documentation  https://review.opendev.org/71242018:42
gibistephenfin, brinzhang_: fixed the comments ^^18:43
*** CeeMac has quit IRC18:48
sean-k-mooneygibi are you done for the day or are you still around18:58
gibisean-k-mooney: I'm here for quick questions, not for longer things18:59
sean-k-mooneygibi: its related to the provider.conf19:00
sean-k-mooneybasically the way i was asserting that the traits dont conflict with the virt driver traits does not quite work19:00
sean-k-mooneywhat happens is it works on the first iteration then fails on the second as the trait is already there19:01
sean-k-mooneythe end to end functional test you asked for found the issue19:01
sean-k-mooneyim just wondering what the best way to address that is19:01
sean-k-mooneygibi: this is what im doing which works fine for inventories as we start from scratch each time https://review.opendev.org/#/c/676522/44/nova/compute/resource_tracker.py@175119:03
sean-k-mooneybut for traits we start with the traits from placement19:03
sean-k-mooneyi guess i need to think about it again19:03
gibisean-k-mooney: let me sleep on it19:04
sean-k-mooneyya no worries19:04
gibisean-k-mooney: can it be that we say CUSTOM traits are always overwritten by the provider config as we don't expect that a virt driver reports CUSTOM traits anyhow19:04
sean-k-mooneyit might be as simple as if we have provider.yaml remove all custom traits19:04
sean-k-mooneygibi: ya so i think that is what erric wanted19:05
sean-k-mooneyeither you manage it from the api and dont use the provider.yaml19:05
*** lbragstad_ has joined #openstack-nova19:05
sean-k-mooneyor you use the provider.yaml in which case we can reset the traits and build them up again19:05
gibisean-k-mooney: yeah, this make senese19:07
gibisense19:07
sean-k-mooneyill give that a try if you think that is valid19:07
gibiwe just need to document it carefully19:07
sean-k-mooneyya ok ill see if i can make that work and get back to you.19:07
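The "reset the custom traits and build them up again" idea discussed above could look roughly like the following. This is a minimal sketch under stated assumptions: the function name and the example trait names are hypothetical, and it is not the code in the provider config patch. It only shows why dropping every CUSTOM_ trait before re-adding the ones from provider.yaml makes the second iteration behave like the first.

```python
CUSTOM_PREFIX = "CUSTOM_"


def rebuild_traits(placement_traits, driver_traits, config_traits):
    """Hypothetical helper: recompute a provider's trait set.

    Custom traits already stored in placement are discarded, so the
    provider config is the only source of CUSTOM_ traits each time.
    """
    kept = {t for t in placement_traits if not t.startswith(CUSTOM_PREFIX)}
    return kept | set(driver_traits) | set(config_traits)


driver = {"HW_CPU_X86_AVX2"}           # example standard trait
config = {"CUSTOM_FAST_DISK"}          # example trait from provider.yaml

first = rebuild_traits(set(), driver, config)
# Second periodic run starts from what placement already has.
second = rebuild_traits(first, driver, config)
print(first == second)
```

The trade-off gibi points out still applies: custom traits set through the API would be overwritten in this scheme, which is why it would need to be documented carefully.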
*** lbragstad has quit IRC19:08
sean-k-mooneythe parades may be canceled but its still st patricks day tomorrow so ill be off until wednesday19:08
gibisean-k-mooney: sure. happy st patricks day!19:09
*** ociuhandu has joined #openstack-nova19:10
openstackgerritBalazs Gibizer proposed openstack/nova master: Ensures that COMPUTE_RESOURCE_SEMAPHORE usage is fair  https://review.opendev.org/71267419:11
gibimelwitt: fixed it up real quick19:12
gibiand now I'm gone for today19:12
melwittgibi: awesome thanks19:13
*** ralonsoh has quit IRC19:15
*** ociuhandu has quit IRC19:20
*** martinkennelly has quit IRC19:21
*** ociuhandu has joined #openstack-nova19:21
*** maciejjozefczyk has quit IRC19:24
*** ociuhandu has quit IRC19:26
*** CeeMac has joined #openstack-nova19:29
*** jraju__ has quit IRC19:31
*** lbragstad_ is now known as lbragstad19:32
*** amoralej is now known as amoralej|off19:40
melwittdansmith: test coverage for fair locking is ready https://review.opendev.org/71267419:50
*** ccamacho has quit IRC19:50
dansmithah, I was like "this is dumb you're just testing your fixture" but I see now19:51
dansmithlike I said, I'm not really sure it's that important, but as long as it doesn't get in the way too much..19:51
melwittyeah, I thought it's a nice way to cover this and catch any future uses without fair=True for the compute semaphore19:57
*** ociuhandu has joined #openstack-nova20:04
openstackgerritsean mooney proposed openstack/nova master: Provider Config File: Enable loading and merging of provider configs  https://review.opendev.org/69346020:04
*** ociuhandu has quit IRC20:08
*** mgariepy has quit IRC20:35
openstackgerritMerged openstack/nova stable/train: Functional test for UnexpectedDeletingTaskStateError  https://review.opendev.org/71121020:36
*** nweinber has quit IRC20:38
*** mgariepy has joined #openstack-nova20:49
melwittartom: I just hit https://bugs.launchpad.net/nova/+bug/1813789 intermittent gate failure on one of my patches and saw you have patches/comments in the lp bug. do you have any idea where this is at right now? I saw you landed https://review.opendev.org/644881 9 months ago but it wasn't for this bug. just wondering if you happen to know anything about the current bug we have in the gate21:11
openstackLaunchpad bug 1813789 in OpenStack Compute (nova) "Evacuate test intermittently fails with network-vif-plugged timeout exception" [Medium,In progress] - Assigned to Artom Lifshitz (notartom)21:11
*** ociuhandu has joined #openstack-nova21:11
melwittmriedem: thought your ghost might find this interesting https://review.opendev.org/71303521:13
artommelwitt, IIRC my patch only addresses the revert-resize case21:13
artommelwitt, so any other race was outside of its scope21:14
mriedemspooky21:14
melwittartom: oh, I see. thanks, helps to know that. I wonder if the same pattern could be applied to the evacuate case21:14
mriedemso that's why the changes for that job on pike always failed?21:14
melwittmriedem: yeah21:15
artommelwitt, at first pass I'd say no - I have to reload context, but it was a *really* specific scenario with revert resize21:15
melwittmriedem: lyarwood figured it out. it's failing nearly 100% on the pike branch21:15
mriedemyeah i gave up on https://review.opendev.org/#/c/700072/ and thought it was due to some other pike thing that was fixed by QA awhile back21:16
melwittartom: ah, k. yeah even that vague info helps. I know nothing about it till now21:16
mriedembut obviously not21:16
mriedemartom: at the time we had talked about the same issue in evacuate21:16
artommelwitt, like, the source host had to have the NIC already wired, and it had to be OVS21:16
mriedemthere is an old gate bug for that race21:16
artommriedem, yeah, but it can't have been the same root cause21:16
mriedemhttp://status.openstack.org/elastic-recheck/#181378921:16
mriedemno it's not due to using OVN or whatever :)21:17
*** ociuhandu has quit IRC21:17
artommriedem, right, which is why I ended up filing https://bugs.launchpad.net/nova/+bug/1832028 and using that in my patch21:17
openstackLaunchpad bug 1832028 in OpenStack Compute (nova) stein "revert resize: vif-plugged external event sent too soon if Neutron is using OVS hybrid plug" [Medium,Fix committed] - Assigned to Artom Lifshitz (notartom)21:17
artomBecause turns out my thing from downstream was different than the intermittent upstream evacuate failures21:18
melwittdang21:18
mriedemunless i was wrong on https://bugs.launchpad.net/nova/+bug/1813789 i had left comments about the order of events that showed the race21:18
openstackLaunchpad bug 1813789 in OpenStack Compute (nova) "Evacuate test intermittently fails with network-vif-plugged timeout exception" [Medium,In progress] - Assigned to Artom Lifshitz (notartom)21:18
artommelwitt, brutal honesty: stay away :P It's not a can of worms you want to open21:19
mriedemi think by "the same" i meant the fix for evacuate is similar, we need to register the callback before plugging vifs21:19
mriedembecause right now for evacuate we bind ports to the new host and then spawn the guest and it's the low level spawn in the driver that registers the callback21:19
artommriedem, yeah, but did we ever work out *why* that was necessary?21:19
mriedemand we could have already gotten the response from the port bind21:19
melwittartom: yeah. I already got my ass kicked looking at http://status.openstack.org/elastic-recheck/#1844929 spent days digging in and no dice so far21:19
artommriedem, wouldn't that depend on the Neutron backend though?21:20
mriedemartom: i think i just said why :) and it's in the bug21:20
artommriedem, I'm pretty sure at least some of them would wait until libvirt plugs the VIF before sending out the event21:20
melwittartom: but anecdotally I see http://status.openstack.org/elastic-recheck/#1813789 fail really often in my gerrit notifications so argh ... just want to fix some of these21:20
mriedemwe use ovs in the gate and for that backend neutron sends the event when the port binding host changes21:21
sean-k-mooneyartom: ovs waits to send the event yes21:21
mriedemwhich is why we *don't* get the event for things like hard reboot21:21
artommriedem, sean-k-mooney, get your stuff in line and stop contradicting yourselves ;)21:21
* mriedem sharpens knife21:22
artommelwitt, I get you - it's annoying and you want to fix it21:22
artomIt's just such a mess21:22
sean-k-mooneymriedem: do you know if we create a second port binding and activate it or just update the host21:22
sean-k-mooneymriedem: that would change when the event is sent21:22
mriedemevacuate doesn't use multiple port bindings like live migration21:22
*** ociuhandu has joined #openstack-nova21:22
mriedemwhen i left the only things that used multiple port bindings were live migration and cross-cell resize21:23
sean-k-mooneyok well the condition to send the event is the port must be in the active state and be bound to a host21:23
sean-k-mooneyso since its already in an active state when we bind it in the evacuate it might send the event immediately21:23
artomsean-k-mooney, so what mriedem was saying then21:23
sean-k-mooneyyep i havent checked it but i would guess that he is correct21:23
artommelwitt, well, if you want to take a whack at it, you could probably use the bind-time stuff I added to the model to change when evacuate starts listening for the event21:24
melwittok so the main idea is find a way to register the callback earlier on21:24
artomBut there be dragons21:24
sean-k-mooneyartom: if we created a second port binding it would not send it until we activate it21:24
melwittartom: thanks21:25
artommelwitt, well, yes and no - looking at my own code, depending on whether the port has what I called "bind_time_events", you wait in the compute manager when you send the Neutron request21:26
*** xek has quit IRC21:26
artomAnd if they're "plug_time_events", you wait in the virt driver when you plug the VIFs21:26
melwittahhh ok21:26
sean-k-mooneymriedem: yes those are still the only things that use the multiple port bindings21:26
artommelwitt, also, ask sean-k-mooney ;)21:27
artom(Bus, meet Sean :D )21:27
melwittlol21:27
sean-k-mooneyartom:  then you just delete the code and start again21:27
artomCan we do that for all of Nova? ;)21:27
melwittI think I vaguely get it. I can read through your patch, just knowing a generic idea of what's going on helps a lot. saves a lot of time21:28
mriedem"if we created a second port binding it would not send it until we activate it" is a bigger non-backportable change most likely because of the behavior changes between compute and conductor21:29
artommelwitt, ping me if you have questions / need review / whatever21:29
melwittthanks ++21:29
artom(/me needs pressure to "re-join" upstream)21:29
artomI've been neglecting y'all21:29
mriedemdon't forget to loop dansmith into this when you want to talk about it, i'm sure he'd love to21:30
artomYou're such a good friend21:30
melwittartom: heh. I might end up running away from this screaming after I try to work on it, so if that doesn't happen maybe I'll ping you21:30
artommelwitt, screaming would be a good sign, actually21:31
sean-k-mooneyya we cant backport adopting multiple port bindings for evacuate21:31
artomMeans you're sane (inasmuch as that's still possible)21:31
sean-k-mooneyim not sure if we want to do that or not in general21:31
melwittlol21:31
sean-k-mooneyit might be useful but its a lot of work to untangle things and make sure it works21:31
artomYep. We'd also need to run it with a couple of other Neutron backends21:32
artomIIRC we created a DNM job to run against... OVS? OVN?21:32
artomOVB? OG?21:32
sean-k-mooneyovs ovn and lb21:32
artomRunDMC?21:32
melwitt😂21:33
melwitthad to use an emoji for that one21:33
sean-k-mooneysure you "had too" :P21:34
melwittyeah, it wouldn't let me type anything else until I posted the emoji21:34
sean-k-mooneydid https://bugs.launchpad.net/nova/+bug/1813789 come up recently downstream or in relation to the nova-livemigation job21:35
openstackLaunchpad bug 1813789 in OpenStack Compute (nova) "Evacuate test intermittently fails with network-vif-plugged timeout exception" [Medium,In progress] - Assigned to Artom Lifshitz (notartom)21:35
sean-k-mooneyi remember talking about it a few days ago21:35
sean-k-mooneyi just dont recall the context21:35
melwittlyarwood mentioned it in the nova meeting21:35
sean-k-mooneyah ya it was in context of the zuul v3 migration21:36
melwittjust saying he hit that bug and http://status.openstack.org/elastic-recheck/#1844929 a bunch of times while trying to get some work done21:36
sean-k-mooneyyep21:36
melwittso I started looking at http://status.openstack.org/elastic-recheck/#1844929 and got nowhere. and now I know http://status.openstack.org/elastic-recheck/#1813789 is also horrid21:37
sean-k-mooneyso given we dont use the multiple port bindings flow i think we can assume that as long as the port is still active it will receive a bind time event rather than plug time21:38
mriedemthose probably aren't even in the same ballpark of terrible21:38
sean-k-mooneybut i would need to think that through more carefully to make sure that is correct21:39
mriedemin the bug i linked in logs where the things were happening so it's not really a question of where the race is21:39
mriedemthough those log links are going to be dead by now21:39
mriedemunshelve has the same issue21:40
*** ociuhandu has quit IRC21:40
mriedem1. bind to new host triggers async network-vif-plugged event, 2. driver.spawn plugs vifs which sets up the callback handler21:40
mriedemif you get the event before 2 you're stuck21:40
*** ociuhandu has joined #openstack-nova21:40
melwittright, ok21:40
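The two-step race mriedem lays out above can be reproduced with a toy model. This is a sketch only: the ExternalEvents class and the evacuate function are hypothetical stand-ins, not nova's event handling, and the model assumes that an event arriving with no registered waiter is simply dropped.

```python
import threading


class ExternalEvents:
    """Toy event registry: events with no registered waiter are lost."""

    def __init__(self):
        self._waiters = {}

    def prepare(self, name):
        # Register interest in an event and return something to wait on.
        waiter = threading.Event()
        self._waiters[name] = waiter
        return waiter

    def deliver(self, name):
        # Called when the (simulated) network service sends the event.
        waiter = self._waiters.pop(name, None)
        if waiter is None:
            return False  # nobody was listening yet, event is dropped
        waiter.set()
        return True


def evacuate(register_before_bind, timeout=0.05):
    events = ExternalEvents()
    name = "network-vif-plugged"
    if register_before_bind:
        waiter = events.prepare(name)   # callback registered first
        events.deliver(name)            # binding the port triggers the event
    else:
        events.deliver(name)            # 1. bind triggers the event early
        waiter = events.prepare(name)   # 2. spawn registers the callback
    # True if the event was seen, False if the wait timed out.
    return waiter.wait(timeout)


print(evacuate(register_before_bind=False))
print(evacuate(register_before_bind=True))
```

In the first call the event is delivered before anything is listening, so the wait times out, which corresponds to the "if you get the event before 2 you're stuck" case. Registering the callback before the bind avoids that in this model.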
sean-k-mooneymriedem: in the unshelve case you could argue that when we go to shelve offloaded the status of the port should be down21:41
sean-k-mooneywhich would prevent the event being sent in the ovs case at least until its plugged in the driver21:42
mriedemyup, i opened an old bug for that as well21:42
sean-k-mooneybut we would need to still use the bind_time vs plugtime thing to check21:42
sean-k-mooneyis the state of the port something we can control from nova?21:43
sean-k-mooneyor rather are we allowed to set it21:43
sean-k-mooneyif so we could set it to down when we do an evacuate21:43
melwittwell, I'll try looking at it, see how it goes21:44
*** ociuhandu has quit IRC21:45
mriedemthis is the thing i was thinking about for a shelve related bug https://github.com/openstack/nova/blob/stable/stein/nova/network/neutronv2/api.py#L333221:47
mriedemlooks like i updated that as part of the cross-cell series https://github.com/openstack/nova/blob/master/nova/network/neutron.py#L333921:47
mriedemhttps://review.opendev.org/#/c/697162/21:48
mriedemit's all coming back to me21:48
sean-k-mooneyso in that function we would just set the binding host to None21:48
sean-k-mooneywhich would unbind it21:49
sean-k-mooneyand the status should go to down as a result21:49
mriedemyeah like how _unbind_ports works21:50
mriedemexcept you can't clear the device_owner on the port when shelve offloading21:50
mriedemthe nova instance needs to continue to "own" the port21:50
sean-k-mooneywe need to keep  device_id which is the nova instance uuid too but ya21:51
mriedemlet us *shelve* this discussion for 6 months from now when it comes up again :)21:51
sean-k-mooney:)21:51
mriedemo/21:51
*** mriedem has left #openstack-nova21:51
*** ircuser-1 has joined #openstack-nova21:55
sean-k-mooneymelwitt: so calling self.network_api.cleanup_instance_network_on_host on the source node during an evacuate might allow us to use the bind vs plug time event code to determine when to wait in the evacuate case too21:57
sean-k-mooneymelwitt: it is currently only called for cross cell resize21:57
sean-k-mooneybut ya we should be doing that definitely during shelve offload to fix the shelve case21:58
melwittsean-k-mooney: so does that mean that you think evacuate is pretty "easy" but shelve will be more difficult? or are they a similar level of complexity22:01
*** ociuhandu has joined #openstack-nova22:02
sean-k-mooneyshelve should be easy, evacuate might be more difficult but i think we just need to call cleanup_instance_network_on_host in the right place22:02
sean-k-mooneyso in shelve case we should be calling it in shelve offload22:03
sean-k-mooneyand in evacuate we need to do it before we call spawn on the dest host22:03
sean-k-mooneyso pretty early on22:03
sean-k-mooneythat will put the port into an unbound state which will set the port status to down22:04
melwittok. just saying it sounds like the utilities are available, just have to leverage them22:04
melwittwith the model bind_time stuff22:04
sean-k-mooneythey were not in place until recently22:04
melwittI'm not saying they were, just trying to understand what's the landscape today22:05
sean-k-mooneyyep22:05
sean-k-mooneyi think they were both added last cycle so they are there from train on22:05
sean-k-mooneythey should be backportable too i think22:06
*** ociuhandu has quit IRC22:07
melwittok22:08
*** slaweq has quit IRC22:09
*** bbowen has quit IRC22:10
*** bbowen has joined #openstack-nova22:10
*** damien_r has quit IRC22:13
*** slaweq has joined #openstack-nova22:21
*** slaweq has quit IRC22:26
*** spatel has quit IRC22:27
openstackgerritMerged openstack/nova stable/train: Unplug VIFs as part of cleanup of networks  https://review.opendev.org/71125122:31
openstackgerritsean mooney proposed openstack/nova master: [WIP] unbind port before evacuate and shelve offload  https://review.opendev.org/71334222:38
sean-k-mooneymelwitt: im not sure if ^ will work but i think it would be something like that, at least as a start22:39
melwittcool thanks sean-k-mooney22:39
*** threestrands has joined #openstack-nova22:51
*** tkajinam has joined #openstack-nova22:51
*** CeeMac has quit IRC22:59
*** lbragstad has quit IRC23:08
*** zzzeek has quit IRC23:21
*** zzzeek has joined #openstack-nova23:22
*** gyee has quit IRC23:38
*** eharney has quit IRC23:45
*** Wellie has quit IRC23:57
