Thursday, 2019-06-06

*** frankwang has quit IRC00:06
*** markvoelker has quit IRC00:07
*** markvoelker has joined #openstack-nova00:07
*** _hemna has quit IRC00:08
*** frankwang has joined #openstack-nova00:10
*** markvoelker has quit IRC00:12
*** jaypipes has quit IRC00:15
*** spatel has joined #openstack-nova00:21
*** tetsuro has joined #openstack-nova00:43
*** rcernin has joined #openstack-nova00:45
*** rcernin has quit IRC00:45
*** rcernin has joined #openstack-nova00:45
*** lbragstad has quit IRC00:58
*** spsurya has joined #openstack-nova01:01
*** spatel has quit IRC01:01
*** slaweq has quit IRC01:07
*** rcernin has quit IRC01:21
*** rcernin has joined #openstack-nova01:21
*** itlinux has joined #openstack-nova01:25
*** guozijn has joined #openstack-nova01:27
*** guozijn has quit IRC01:28
*** dave-mccowan has joined #openstack-nova01:31
*** guozijn has joined #openstack-nova01:33
*** mriedem has quit IRC01:38
*** hongbin has joined #openstack-nova01:55
*** boxiang_ has quit IRC01:59
*** boxiang_ has joined #openstack-nova01:59
*** _hemna has joined #openstack-nova02:04
*** Sundar has joined #openstack-nova02:07
*** whoami-rajat has joined #openstack-nova02:07
*** markvoelker has joined #openstack-nova02:08
*** guozijn has quit IRC02:10
*** takashin has joined #openstack-nova02:12
alex_xusean-k-mooney: thanks02:17
yonglihesean-k-mooney: thanks02:18
*** _hemna has quit IRC02:38
*** markvoelker has quit IRC02:42
*** Sundar has quit IRC02:44
*** JamesBenson has joined #openstack-nova02:44
*** _hemna has joined #openstack-nova02:59
*** Sundar has joined #openstack-nova03:03
*** frankwang has quit IRC03:07
*** frankwang has joined #openstack-nova03:07
openstackgerritXiaojueGuan proposed openstack/nova master: Fix code intendent of file wsgi.py  https://review.opendev.org/66348703:09
*** brinzhang has quit IRC03:09
*** _hemna has quit IRC03:14
*** tetsuro has quit IRC03:22
*** BjoernT has quit IRC03:31
*** dikonoor has joined #openstack-nova03:41
*** _hemna has joined #openstack-nova03:41
*** dave-mccowan has quit IRC03:50
*** frankwang has quit IRC03:53
*** tetsuro has joined #openstack-nova03:56
*** udesale has joined #openstack-nova04:00
*** brinzhang has joined #openstack-nova04:06
*** hongbin has quit IRC04:11
*** _hemna has quit IRC04:15
*** Sundar has quit IRC04:27
*** pcaruana has joined #openstack-nova04:50
*** dikonoor has quit IRC04:59
*** tkajinam has quit IRC05:00
*** tetsuro has quit IRC05:08
*** dikonoor has joined #openstack-nova05:12
*** ratailor has joined #openstack-nova05:12
*** boxiang_ has quit IRC05:13
*** boxiang_ has joined #openstack-nova05:14
*** _hemna has joined #openstack-nova05:29
*** janki has joined #openstack-nova05:30
*** luksky has joined #openstack-nova05:34
openstackgerritMerged openstack/nova master: Hide hypervisor id on windows guests  https://review.opendev.org/57989705:35
*** frankwang has joined #openstack-nova05:48
openstackgerritTakashi NATSUME proposed openstack/nova master: Remove an unused method  https://review.opendev.org/66350205:48
*** janki has quit IRC05:49
*** dtantsur|afk is now known as dtantsur05:49
*** janki has joined #openstack-nova05:49
*** janki has quit IRC05:50
*** janki has joined #openstack-nova05:51
*** luksky has quit IRC05:53
*** damien_r has joined #openstack-nova05:56
*** damien_r has quit IRC06:00
*** tkajinam has joined #openstack-nova06:00
*** _hemna has quit IRC06:03
*** lpetrut has joined #openstack-nova06:03
*** liuyulong_ has joined #openstack-nova06:07
*** slaweq has joined #openstack-nova06:14
*** xek has joined #openstack-nova06:16
*** Spencer_Yu has joined #openstack-nova06:18
*** udesale has quit IRC06:21
*** udesale has joined #openstack-nova06:21
*** dpawlik has joined #openstack-nova06:22
*** maciejjozefczyk_ has joined #openstack-nova06:30
*** dklyle has quit IRC06:38
*** dklyle has joined #openstack-nova06:38
*** maciejjozefczyk_ is now known as maciejjozefczyk06:48
*** udesale has quit IRC06:49
*** udesale has joined #openstack-nova06:50
*** tetsuro has joined #openstack-nova06:55
*** tetsuro has quit IRC07:00
*** gyee has quit IRC07:04
*** Spencer_Yu has quit IRC07:06
*** frankwang has quit IRC07:07
*** damien_r has joined #openstack-nova07:09
*** damien_r has quit IRC07:10
*** _hemna has joined #openstack-nova07:12
*** tesseract has joined #openstack-nova07:12
*** damien_r has joined #openstack-nova07:13
*** rcernin has quit IRC07:14
*** frankwang has joined #openstack-nova07:15
*** _hemna has quit IRC07:16
*** brault has joined #openstack-nova07:17
*** tssurya has joined #openstack-nova07:17
*** ratailor has quit IRC07:18
*** ratailor has joined #openstack-nova07:18
*** rpittau|afk is now known as rpittau07:19
*** rnoriega has quit IRC07:22
*** weshay has quit IRC07:22
*** weshay has joined #openstack-nova07:23
*** rnoriega has joined #openstack-nova07:23
*** tetsuro has joined #openstack-nova07:25
*** janki has quit IRC07:27
*** janki has joined #openstack-nova07:27
*** janki has quit IRC07:28
openstackgerritSharat Sharma proposed openstack/nova master: Modifying install-guide to include public endpoint for identity service  https://review.opendev.org/66353007:32
*** ttsiouts has joined #openstack-nova07:43
*** brault has quit IRC07:56
*** ratailor_ has joined #openstack-nova07:56
*** brault has joined #openstack-nova07:57
*** helenafm has joined #openstack-nova07:58
*** ralonsoh has joined #openstack-nova07:58
*** ttsiouts has quit IRC07:58
*** markvoelker has joined #openstack-nova07:58
*** ratailor has quit IRC07:59
*** ttsiouts has joined #openstack-nova07:59
*** takashin has left #openstack-nova08:00
*** udesale has quit IRC08:01
*** ratailor__ has joined #openstack-nova08:01
*** brault has quit IRC08:01
*** udesale has joined #openstack-nova08:03
*** ttsiouts has quit IRC08:03
*** ratailor_ has quit IRC08:03
*** ttsiouts has joined #openstack-nova08:07
*** ociuhandu has joined #openstack-nova08:07
*** ociuhandu has quit IRC08:07
*** ociuhandu has joined #openstack-nova08:08
*** kashyap has joined #openstack-nova08:14
*** tetsuro has quit IRC08:15
kashyapalex_xu: Want to put this through?  I already had your +2 on it (and it now has +2 from Eric): https://review.opendev.org/#/c/661574/08:19
*** luksky has joined #openstack-nova08:20
alex_xukashyap: checking now08:22
lyarwoodmdbooth: morning, think I've worked out the test_show_update_rebuild_list_server RDO failure. It's a simple race: when it passes, the test isn't verifying the image because other tests have already downloaded it.08:22
*** ttsiouts has quit IRC08:22
mdboothlyarwood: Ouch!08:23
lyarwoodmdbooth: I can reproduce easily by running tempest.api.compute.servers.test_servers and everything passes and tempest.api.compute.servers.test_servers.ServerShowV263Test where the test fails08:23
mdboothlyarwood: Also, that's awesome sleuthing08:23
*** ttsiouts has joined #openstack-nova08:23
lyarwoodmdbooth: ta, the DNM change helped by not actually printing anything in the logs08:23
lyarwooda true W T F moment again this morning08:23
* lyarwood writes this up in a bug08:24
*** udesale has quit IRC08:24
*** ttsiouts has quit IRC08:27
*** tetsuro has joined #openstack-nova08:28
*** ratailor__ has quit IRC08:30
*** ratailor has joined #openstack-nova08:31
*** markvoelker has quit IRC08:32
*** brault has joined #openstack-nova08:34
*** ratailor_ has joined #openstack-nova08:36
*** ratailor has quit IRC08:38
*** derekh has joined #openstack-nova08:41
*** ttsiouts has joined #openstack-nova08:42
*** ociuhandu has quit IRC08:44
openstackgerritLiang Fang proposed openstack/nova master: [WIP] Leverage OCF cache framework for VM disks  https://review.opendev.org/66354208:46
*** brault has quit IRC08:47
*** udesale has joined #openstack-nova08:51
*** davidsha has joined #openstack-nova08:58
*** liuyulong_ has quit IRC09:00
*** tetsuro has quit IRC09:01
*** tkajinam has quit IRC09:01
openstackgerritBalazs Gibizer proposed openstack/nova master: allow getting resource request of every bound ports of an instance  https://review.opendev.org/65511009:04
openstackgerritBalazs Gibizer proposed openstack/nova master: Pass network API to the conducor's MigrationTask  https://review.opendev.org/65511109:04
openstackgerritBalazs Gibizer proposed openstack/nova master: Add request_spec to server move RPC calls  https://review.opendev.org/65572109:04
openstackgerritBalazs Gibizer proposed openstack/nova master: re-calculate provider mapping during migration  https://review.opendev.org/65511209:04
openstackgerritBalazs Gibizer proposed openstack/nova master: update allocation in binding profile during migrate  https://review.opendev.org/65642209:04
openstackgerritBalazs Gibizer proposed openstack/nova master: Extend NeutronFixture to handle migrations  https://review.opendev.org/65511409:04
openstackgerritBalazs Gibizer proposed openstack/nova master: prepare func test env for moving servers with bandwidth  https://review.opendev.org/65510909:04
openstackgerritBalazs Gibizer proposed openstack/nova master: func test for migrate server with ports having resource request  https://review.opendev.org/65511309:04
openstackgerritAlvaro Lopez Garcia proposed openstack/nova master: Ensure that periodic reclaim cleans DB deleted instances  https://review.opendev.org/32325009:07
bauzaslyarwood: FWIW, I'm categorizing https://bugs.launchpad.net/nova/+bug/1831538 as High since 'q35' isn't the gate default machine type09:09
openstackLaunchpad bug 1831538 in OpenStack Compute (nova) "IDE config drive CDROM doesn't work with q35 machine type" [High,In progress] - Assigned to Lee Yarwood (lyarwood)09:09
lyarwoodbauzas: ack that's fair09:10
bauzaslyarwood: but if we consider 'q35' to be more useful to our users than 'pc', I think https://review.opendev.org/#/c/662887/ is much appreciated09:10
kashyapbauzas: It's not necessarily "more useful"; in some cases 'pc' chipset may be what the user precisely wants09:11
lyarwoodbauzas: yeah I spoke to artom yesterday about potentially dropping DNM from that09:11
*** tetsuro has joined #openstack-nova09:11
bauzaslyarwood: https://review.opendev.org/#/c/662887/ is still getting -1 from Zuul09:11
lyarwoodbauzas: it's an unrelated failure09:11
kashyapbauzas: But in general, for new guests, we recommend 'q35'.09:11
lyarwoodbauzas: I rechecked this morning09:11
bauzasoh sorry09:12
*** _hemna has joined #openstack-nova09:12
bauzaskashyap: yeah i don't disagree, that's why we leave machine types to be configurable09:12
*** priteau has joined #openstack-nova09:13
bauzaskashyap: here I'm talking of choosing a default value for most users, and it looks to me 'q35' gives more benefits than 'pc'09:13
bauzasso, technically, we should have it in the gate09:13
kashyapYep; also one reason I avoided changing the default in Nova (https://bugs.launchpad.net/nova/+bug/1780138), because we've delegated that decision to the orchestrator for now.09:13
openstackLaunchpad bug 1780138 in OpenStack Compute (nova) "Don't assume the guest machine type to be of 'pc'" [Medium,Confirmed] - Assigned to Kashyap Chamarthy (kashyapc)09:13
kashyap[We may revisit it later]09:13
kashyapbauzas: Yeah, agreed -- on testing in Gate.09:14
openstackgerritMerged openstack/nova master: Document mitigation for Intel MDS security flaws  https://review.opendev.org/66157409:16
*** tetsuro has quit IRC09:20
*** jistr is now known as jistr|lnl09:28
*** luksky has quit IRC09:28
*** markvoelker has joined #openstack-nova09:29
*** janki has joined #openstack-nova09:29
*** abhishekk has joined #openstack-nova09:31
openstackgerritBrin Zhang proposed openstack/nova-specs master: Add flavor metadata or metadata group  https://review.opendev.org/66356309:35
openstackgerritBrin Zhang proposed openstack/nova-specs master: Add flavor metadata or metadata group  https://review.opendev.org/66356309:37
*** boxiang_ has quit IRC09:39
*** boxiang_ has joined #openstack-nova09:40
*** tetsuro has joined #openstack-nova09:43
*** _hemna has quit IRC09:46
openstackgerritDongcan Ye proposed openstack/nova master: Raise BuildAbortException while updating instance task_state conflict  https://review.opendev.org/63316009:47
*** luksky has joined #openstack-nova10:00
*** ociuhandu has joined #openstack-nova10:01
*** markvoelker has quit IRC10:03
*** brinzhang has quit IRC10:06
*** ttsiouts has quit IRC10:14
*** ttsiouts has joined #openstack-nova10:14
*** dikonoor has quit IRC10:17
*** ttsiouts has quit IRC10:19
*** sapd1_x has joined #openstack-nova10:22
*** abhishekk has quit IRC10:26
*** ivve has joined #openstack-nova10:27
*** frankwang has quit IRC10:33
*** frankwang has joined #openstack-nova10:33
openstackgerritSharat Sharma proposed openstack/nova master: "SUSPENDED" description changed in server_concepts guide and API REF  https://review.opendev.org/66359010:34
*** priteau has quit IRC10:35
*** bbowen has quit IRC10:42
*** tetsuro has quit IRC10:45
*** tbachman has quit IRC10:47
mdboothlyarwood: Re: https://review.opendev.org/#/c/663596/ has that test ever worked?10:53
mdboothI mean, it sorta worked, but only because we weren't doing image validation, right?10:53
lyarwoodmdbooth: correct10:55
*** brault has joined #openstack-nova10:56
lyarwoodmdbooth: so that change disables it by default and leaves some configurables we can wire up over in https://review.opendev.org/#/c/515210/ if we wanted to10:56
lyarwoodmdbooth: just respinning to make the requirements more clear and link https://review.opendev.org/#/c/515210/ in the commit message10:58
*** frankwang has quit IRC10:58
mdboothlyarwood: Even with that change though, the test is still broken right?10:58
mdboothBecause if the image has been previously cached by another test it won't actually run.10:59
lyarwoodmdbooth: if a valid image and trusted certs are provided then no10:59
mdbooths/run/validate/10:59
*** markvoelker has joined #openstack-nova10:59
lyarwoodmdbooth: it's no longer using the default image that wouldn't work10:59
mdboothIt will pass, but it won't have tested image validation.10:59
mdboothAh... you also updated the ref10:59
lyarwoodmdbooth: it should10:59
lyarwoodyeah11:00
lyarwoodit should cause a download etc11:00
mdboothlyarwood: Sorry, was being sloppy didn't read that far.11:00
lyarwoodthe test isn't specifically looking at that btw11:00
lyarwoodthat's more to just avoid failures11:00
lyarwoodthe barbican plugin is really testing all of that11:00
openstackgerritSharat Sharma proposed openstack/nova master: [Docs] Update the confusing console output  https://review.opendev.org/58900411:00
*** brault has quit IRC11:01
*** takamatsu has quit IRC11:01
*** takamatsu has joined #openstack-nova11:02
*** rafaelweingartne has joined #openstack-nova11:04
*** jistr|lnl is now known as jistr11:18
kashyapsean-k-mooney: So, for the PCIe root ports, default to 32 / max 32 -- yeah?  Based on DanPB's tests?11:22
sean-k-mooneyyes11:23
*** udesale has quit IRC11:23
kashyapThx11:27
openstackgerritEdward Hope-Morley proposed openstack/nova master: Fix python3 compatibility of rbd get_fsid  https://review.opendev.org/66360711:28
*** markvoelker has quit IRC11:32
*** _hemna has joined #openstack-nova11:42
*** ttsiouts has joined #openstack-nova11:47
kashyapgit fetch gerrit11:49
kashyapOops11:50
openstackgerritKashyap Chamarthy proposed openstack/nova master: [WIP] libvirt: Update the default number of PCIe root ports to 32  https://review.opendev.org/66361411:50
kashyapsean-k-mooney: ^ Still need to reword the commit, and tweak the test, perhaps11:50
kashyap(And also to see if any other code path is affected)11:50
*** ratailor_ has quit IRC11:51
sean-k-mooneykashyap: ya, just commented on the test; you should leave it at 8 as you are testing non-default config values11:54
kashyapsean-k-mooney: Yeah, you're right.  (Aside: Instead of "Update", I'd use the phrase "Preallocate" -- as that captures the intention more correctly?)11:55
kashyapsean-k-mooney: I stole the preallocate word from you11:55
sean-k-mooneyi didn't look into the test fully but i suspect we want to test the default, when it's set to 0, and when it's set to a non-default, non-0 value11:56
kashyapRight, let me twiddle11:57
openstackgerritVlad Gusev proposed openstack/nova stable/stein: Hide hypervisor id on windows guests  https://review.opendev.org/66361611:57
sean-k-mooneywell we are updating the config value to preallocate the pcie-ports so they are available for hotplug when needed11:57
kashyap(Nod)11:58
*** dikonoor has joined #openstack-nova11:59
*** lpetrut has quit IRC12:00
*** lpetrut has joined #openstack-nova12:00
*** jaypipes has joined #openstack-nova12:01
*** tetsuro has joined #openstack-nova12:02
*** tetsuro has quit IRC12:03
openstackgerritLee Yarwood proposed openstack/nova master: Fix python3 compatibility of rbd get_fsid  https://review.opendev.org/63522012:06
lyarwoodmdbooth: ^ FYI just stumbled across this12:07
mdboothlyarwood: Interesting. I wonder why we haven't hit that, yet.12:09
artomlyarwood, yeah, a ML post needs to happen about that12:10
artom(that = q35)12:10
*** bbowen has joined #openstack-nova12:12
*** tbachman has joined #openstack-nova12:13
*** eharney has quit IRC12:14
mdboothlyarwood: I wonder if we've already pulled in the ceph fix downstream?12:14
kashyapartom: I have a half draft sitting; will send something "soon"12:14
artomkashyap, appreciated :)12:14
kashyapartom: Based on the description notes here: https://bugs.launchpad.net/nova/+bug/178013812:14
openstackLaunchpad bug 1780138 in OpenStack Compute (nova) "Don't assume the guest machine type to be of 'pc'" [Medium,Confirmed] - Assigned to Kashyap Chamarthy (kashyapc)12:14
lyarwoodmdbooth: it would be pretty transparent if we hadn't12:15
mdboothPresumably snapshot would be failing on ceph12:15
mdboothWhy isn't the upstream gate broken?12:15
lyarwoodupstream gate is py212:15
* mdbooth guesses no py3/ceph testing12:15
lyarwoodright, actually ceph is still nv on py212:16
*** lbragstad has joined #openstack-nova12:16
lyarwoodceph py3 is in the experimental queue12:16
mdboothlyarwood: Ack. So this looks like a fix we approve of, but probably not a downstream blocker.12:16
mdboothShould be high priority upstream, though.12:17
*** _hemna has quit IRC12:17
*** claudiub has joined #openstack-nova12:20
*** derekh has quit IRC12:21
*** trident has quit IRC12:21
*** trident has joined #openstack-nova12:26
*** dave-mccowan has joined #openstack-nova12:28
*** markvoelker has joined #openstack-nova12:29
*** jchhatbar has joined #openstack-nova12:36
*** janki has quit IRC12:38
*** jchhatbar has quit IRC12:49
kashyapsean-k-mooney: So I dug the upstream logs of #virt channel, and the "max 28" thing for PCIe ports came from this:12:51
*** pcaruana has quit IRC12:52
kashyap[For aarch64, *apparently*:]12:52
kashyap<paste>12:53
kashyap13:23 < hrw> ok. 28 pcie-root-port entries are maximum12:53
kashyap13:23 < hrw> 29 == uefi dumps to shell instead of booting12:53
kashyap13:25 < abologna> hrw: that might be a bug rather than an actual limit, but there's a limit of 256 on... something? possibly devices, each pcie-root-port has 8 functions so that would be 31 to get to 256 devices12:53
kashyap13:25 < abologna> hrw: except pcie-root-ports are of course devices themselves12:53
kashyap13:25 < abologna> hrw: and even after plugging in 28 pcie-root-ports some of pcie-root's slots and functions will be empty12:53
kashyap13:26 < hrw> will discuss that with our uefi developers ;D12:53
kashyap</end-paste-spam>12:53
kashyap[That snippet is from Feb 2018, BTW]12:53
sean-k-mooneyok so we should leave it at 28 and just set the default to 2812:53
kashyapsean-k-mooney: I'm wondering if I should step into the rabbit hole to test with an AArch64 guest -- to see what is the current limitation?12:54
kashyapsean-k-mooney: As abologna, the author of https://libvirt.org/pci-hotplug.html, seems to imply the 28 is not even a valid limit.12:55
sean-k-mooneykashyap: probably not, as we can't assume they are using a new version of qemu or whatever imposes the 28 limit on aarch6412:55
kashyapsean-k-mooney: The thing is, we're not sure that 28 is an _actual_ limit or not.  Nobody seems to have confirmed.12:56
sean-k-mooneyif you want to grab an aarch64 guest image and test then sure12:57
sean-k-mooneybut we should play it safe with the default12:57
kashyapYeah, let me do the test.  I want to be sure.12:57
*** brault has joined #openstack-nova12:59
*** dikonoor has quit IRC13:01
*** markvoelker has quit IRC13:03
*** brault has quit IRC13:03
*** derekh has joined #openstack-nova13:06
stephenfingibi: Any chance you could take a look at https://review.opendev.org/#/c/660774/ again today and hit me up if you need more info?13:07
gibistephenfin: give me an hour13:07
stephenfinta13:08
gibistephenfin: I'm on a meeting now13:08
stephenfinAll good. Thanks :)13:08
*** bnemec has joined #openstack-nova13:11
kashyapDoes anyone here (or know someone who) uses OpenStack on AArch64?13:22
* kashyap will expect crickets to chirp13:22
sean-k-mooneythere are some folk from Linaro that hang out on the openstack-kolla irc from time to time13:23
*** mriedem has joined #openstack-nova13:23
stephenfinkashyap: Might want to talk to tonyb13:25
kashyapstephenfin: Yep, noted13:26
kashyapsean-k-mooney: Yeah, I was looking for 'hrw'13:27
kashyapsean-k-mooney: What a beast this whole PCIe saga is13:29
*** mloza has joined #openstack-nova13:29
*** udesale has joined #openstack-nova13:29
*** tbachman has quit IRC13:29
*** BjoernT has joined #openstack-nova13:30
openstackgerritStephen Finucane proposed openstack/nova master: api: Remove 'Debug' middleware  https://review.opendev.org/66250613:30
sean-k-mooneykashyap: well it's in good hands. we could just do nothing and leave it to the operator to configure the config for their needs, which is what nova originally opted to do13:32
sean-k-mooneybut if there is a sane default then that is also a good outcome too13:33
sean-k-mooneybrb13:33
kashyapsean-k-mooney: Yeah, indeed.  But was just exclaiming about the general subtlety involved here...13:33
*** Sundar has joined #openstack-nova13:34
*** ttsiouts has quit IRC13:35
*** ttsiouts has joined #openstack-nova13:36
*** ttsiouts has quit IRC13:40
lyarwoodmriedem / efried ; https://review.opendev.org/#/c/663011/ - morning, if you have time today I'm looking for some non-RH core review on this libvirt specific bugfix. There's a change on top of this that's testing the q35 machine type in the gate. I'm looking into the extend volume failure at the moment.13:41
efriedlyarwood: looking13:41
*** ttsiouts has joined #openstack-nova13:47
*** whoami-rajat has quit IRC13:47
*** jaosorior has joined #openstack-nova13:47
*** jaosorior has quit IRC13:51
efriedlyarwood: Are you looking to backport this?13:52
lyarwoodefried: only to stable/stein13:53
lyarwoodand the last I checked it was still clean13:53
lyarwoodjust13:53
efriedorly?13:54
lyarwoodyeah I know right13:54
*** frankwang has joined #openstack-nova13:54
efriedI thought I'd seen a bunch of twiddling of test_driver that seems unavoidable considering how many places you hit13:54
openstackgerritLee Yarwood proposed openstack/nova stable/stein: libvirt: Use SATA bus for cdrom devices when using Q35 machine type  https://review.opendev.org/66367713:55
lyarwood^ just to show I'm not making it up13:55
stephenfinlyarwood: comments incoming on that, btw13:57
stephenfinefried: too13:57
lyarwoodstephenfin: thanks13:57
*** whoami-rajat has joined #openstack-nova13:57
*** ricolin has joined #openstack-nova13:57
*** frankwang has quit IRC13:58
stephenfinlyarwood: done13:59
*** markvoelker has joined #openstack-nova13:59
stephenfinApologies if you explored that idea already but I couldn't see any comments suggesting it13:59
openstackgerritArnaud Morin proposed openstack/nova master: Refresh instance network info on deletion  https://review.opendev.org/66076114:01
*** tbachman has joined #openstack-nova14:01
efriedlyarwood: It's early, but I think you changed the logic of the extracted method.14:02
*** ttsiouts has quit IRC14:05
*** brault has joined #openstack-nova14:06
*** ttsiouts has joined #openstack-nova14:06
gibiexit14:07
dansmithno14:08
gibi:)14:09
efriedyou can check out any time you like,14:09
efriedbut you can never leeeeaaave14:09
gibisorry14:09
dansmithefried: kinda infringing on mriedem's turf there buddy14:10
gibistephenfin: which test case was too brittle to change in https://review.opendev.org/#/c/660774 ?14:10
*** ttsiouts has quit IRC14:11
stephenfingibi:14:11
stephenfin* tbachman (~tbachman@128.107.241.188) has joined #openstack-nova14:11
stephenfin<efried> lyarwood: It's early, but I think you changed the logic of the extracted method.14:11
stephenfin* ttsiouts has quit (Remote host closed the connection)14:11
stephenfin* brault (~brault@lfbn-1-9197-156.w86-238.abo.wanadoo.fr) has joined #openstack-nova14:11
stephenfin* ttsiouts (~ttsiouts@2001:1458:204:1::101:9145) has joined #openstack-nova14:11
stephenfindamn you HexChat14:11
*** brault has quit IRC14:11
stephenfingibi: https://review.opendev.org/#/c/660774/3/nova/tests/unit/compute/test_compute.py@1287514:11
tbachmanstephenfin: accidental beep?14:11
stephenfintbachman: Yup. Sorry for the noise14:12
lyarwoodefried: yeah you're right but I'm not sure the original logic was there tbh14:12
tbachmanno worries!14:12
lyarwoodefried: that seems to suggest that the config would overwrite the image metadata14:12
lyarwoodefried: shouldn't it be (mach_type or libvirt_utils.get_default_machine_type(caps.host.cpu.arch))?14:12
efriedlyarwood: No, the image meta took first priority, then the config, then the caps (IIUC)14:13
*** _hemna has joined #openstack-nova14:13
lyarwoodah that diff was all messed up on my screen14:13
efriedI don't know what it *should* be. I'm just parsing what it *was* vs what it is in your patch.14:13
lyarwoodack thanks14:13
*** dikonoor has joined #openstack-nova14:14
efriedif image_meta... is not None:14:15
efried    mach_type = image_meta...14:15
efried    ==> and the rest of the logic was in the `else`, so would be skipped and we go right to the return14:15
*** cmart has joined #openstack-nova14:15
lyarwoodyup sorry the indentation was all messed up in gerrit so I missed that before14:16
*** lpetrut has quit IRC14:16
efriedin the `else` we did:14:16
efriedif caps.onething:14:16
efried    mach_type = onething14:16
efriedif caps.anotherthing14:16
efried    mach_type = anotherthing14:16
efriedand then the weirdness of:14:16
efriedmach_type = get_default_machine_type... or mach_type <== i.e. get_default_machine_type gets priority here14:16
efriedThis has struck me as pretty tough to follow every time I've looked at this method, so any refactor to make it more explicit would be welcomed :)14:17
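[Editor's sketch] The priority order efried describes above (image metadata first, then the configured value, then whatever was derived from the host capabilities) can be written out explicitly. This is only an illustration of that ordering, not the code in nova's libvirt driver; the function and parameter names are hypothetical.

```python
from typing import Optional


def pick_machine_type(
    image_machine_type: Optional[str],
    configured_default: Optional[str],
    caps_derived_default: Optional[str],
) -> Optional[str]:
    """Return the machine type using the priority discussed in the log.

    All three parameter names are made up for this sketch.
    """
    # 1. A machine type set on the image always wins.
    if image_machine_type is not None:
        return image_machine_type
    # 2. Otherwise the operator-configured default takes priority over
    # 3. the value derived from the host capabilities.
    return configured_default or caps_derived_default
```

Spelling the three sources out as separate arguments makes the precedence visible at the call site, which is the kind of explicitness the refactor discussion is asking for.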
gibistephenfin: OK. I don't think I should block your patch. We can revisit the removal of _resize() at a later point14:18
gibistephenfin: I will play a bit with that test case but I put a +2 on your patch14:19
*** tssurya has quit IRC14:20
*** ttsiouts has joined #openstack-nova14:23
*** Sundar has quit IRC14:28
lyarwoodstephenfin: okay, another way of doing this with less crazy passing of _host14:32
lyarwoodstephenfin: actually defining the two arch:machine_type configs we have in code here in nova.conf14:32
*** markvoelker has quit IRC14:33
lyarwoodstephenfin: they are the only reason we need _host to fetch things anyway14:33
stephenfinlyarwood: Left more comments there but probably easier discuss here14:33
stephenfinAlso, thanks gibi :)14:33
stephenfinlyarwood: To which two configs do you refer?14:33
lyarwoodstephenfin: virt for fields.Architecture.ARMV7 & fields.Architecture.AARCH6414:35
lyarwoodstephenfin: and s390-ccw-virtio for fields.Architecture.S390 & fields.Architecture.S390X14:35
stephenfinGotcha. Yeah, it's weird that they're there14:35
lyarwoodhmm we still pass caps.host.cpu.arch to get_default_machine_type so nvm14:36
stephenfinAt the very least, that should probably be done in the 'libvirt_utils.machine_type_mappings' function instead14:36
*** priteau has joined #openstack-nova14:36
stephenfinlyarwood: We pass an arch14:36
stephenfinwhich we're getting from caps.host.cpu.arch but I don't think that's necessary14:36
stephenfinlyarwood: Any reason we couldn't pass the arch from 'libvirt_utils.get_arch' instead?14:37
*** jaosorior has joined #openstack-nova14:37
stephenfinThat would probably be more correct since we surely want to retrieve the machine type for the _guest_ architecture14:38
* lyarwood *slams head into desk*14:39
kashyapefried: Actually _very_ good observation on the priority of 'image_meta'...14:39
lyarwoodstephenfin:yeah that works14:39
lyarwoodstephenfin: totally missed that as an option14:39
*** jaosorior has quit IRC14:39
efriedkashyap: Does it actually wind up mattering? I'll feel less nitpicky if it does.14:39
stephenfinlyarwood: That's got to be a bug too, right?14:39
kashyapefried: I'm not 100% sure; but from my reading, it doesn't.  (Sorry for the weasel words.)14:40
lyarwoodstephenfin: hmm it's inefficient but I don't think it was a bug14:41
stephenfinkashyap, lyarwood: So from https://github.com/openstack/nova/blob/master/nova/virt/libvirt/driver.py#L4327-L4355 we'll try to retrieve the machine type from image metadata14:41
stephenfinIf we can't do that, we'll fall back to using something based on the host architecture14:41
*** mlavalle has joined #openstack-nova14:41
kashyapstephenfin: Correct14:41
*** jaosorior has joined #openstack-nova14:41
stephenfinBut if the guest is e.g. x86 running on an ARMV7 host, we'll return a machine type of 'virt'14:42
kashyapAm I wrong in insisting to do this extraction thingie in a separate change?14:42
*** xek_ has joined #openstack-nova14:42
kashyapstephenfin: You mean, an emulated x86 guest running on an ARMv7 host?14:42
stephenfinSo the guest would have an x86 architecture but an ARM'y machine type14:42
stephenfinkashyap: Yeah14:42
lyarwoodthat would be a bug14:42
kashyapYes, that's a bug.14:43
*** xek has quit IRC14:43
stephenfinSweet :)14:43
kashyapHOWEVER14:43
stephenfinI imagine no one would ever see this because what sane operator would run fully-emulated guests14:43
kashyapstephenfin: Who in their right mind would do that for any production workload?14:43
stephenfinkashyap: Jinx14:44
kashyap_Exactly_, that was my "HOWEVER"14:44
kashyapSo, we can't have 100% guards for people willingly sticking knives in their necks.14:44
kashyapIf that's an analogy at all :D14:44
*** maciejjozefczyk has quit IRC14:45
*** rpittau is now known as rpittau|brb14:45
mriedemstephenfin: i know of an operator in the ML asking for that, to run powervm guests on an x86 host14:45
kashyaplyarwood: Sorry, your good deed is getting punished, is it?14:45
stephenfinUltimately though, this seems to suggest we can remove the 'caps' argument to '_get_machine_type' and retrieve the *guest* architecture via the 'libvirt_utils.get_arch' function instead14:45
stephenfinand lyarwood gets to fix two bugs in one14:45
stephenfinfor the win14:45
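[Editor's sketch] The host-versus-guest mix-up discussed above can be shown with a small lookup. The mapping mirrors the two built-in defaults lyarwood lists ('virt' for ARMv7/AArch64, 's390-ccw-virtio' for S390/S390X); the architecture strings and the function name are assumptions made for this sketch and are not nova's actual helpers.

```python
from typing import Optional

# Built-in per-architecture defaults as described in the log.
# The key strings are illustrative architecture names.
ARCH_DEFAULT_MACHINE_TYPES = {
    "armv7l": "virt",
    "aarch64": "virt",
    "s390": "s390-ccw-virtio",
    "s390x": "s390-ccw-virtio",
}


def default_machine_type(arch: str) -> Optional[str]:
    """Look up the built-in default for an architecture, if any."""
    return ARCH_DEFAULT_MACHINE_TYPES.get(arch)


# An emulated x86 guest on an ARMv7 host, the case raised in the log.
host_arch = "armv7l"
guest_arch = "x86_64"

# Keying on the host architecture hands the x86 guest an ARM machine type.
from_host = default_machine_type(host_arch)
# Keying on the guest architecture yields no ARM-specific default.
from_guest = default_machine_type(guest_arch)
```

Looking the default up by guest architecture is the direction stephenfin proposes; whether the real fix lands exactly this way is for the review to decide.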
sean-k-mooneystephenfin: operators that want to support cross-arch development14:45
mriedemi'm pretty sure danpb was also ok (in the ML) with the fully emulated thing14:45
kashyapBut in two separate changes, obviously.14:45
kashyapsean-k-mooney: Yeah, for development, yes...14:46
stephenfinmriedem: In production??14:46
sean-k-mooneyyes so its also a valid usecase for build farms14:46
mriedemlet me get the thread14:46
stephenfinIf so, can they share some of the cash they're burning with me?14:46
* stephenfin wants a new bike14:46
sean-k-mooneye.g. if you are a software company using the cloud to build your product for multiple target architectures, full emulation is totally valid14:47
mriedemthis guy is funded by the US military so they have infinite funds14:47
kashyapsean-k-mooney: I contend that anyone who *REALLY* cares about multiple archs, they will get a devel box for that arch.14:47
*** _hemna has quit IRC14:47
stephenfinCalled it :)14:47
sean-k-mooneykashyap: that is not what i think the majority of people do14:47
mriedemhttp://lists.openstack.org/pipermail/openstack-operators/2018-August/thread.html#1561714:48
mriedem^ is the thread14:48
sean-k-mooneymost use qemu to develop and test locally and only get real hardware if they are writing low level software14:48
mriedemdanpb's reply is in this one http://lists.openstack.org/pipermail/openstack-operators/2018-August/015653.html14:49
kashyapYeah, I'm not denying there's a devel/test use case14:49
mriedem"> Yes, it should do exactly that IMHO !"14:49
stephenfinmriedem: How'd you manage to link to a specific email on the list page? Manually adding the anchor (or whatever the #foo part of a URL is called)?14:49
kashyapI totally missed that thread :-(14:50
kashyapBecause of e-mail filtering snafu14:50
mriedemstephenfin: "thread" under "Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]"14:50
stephenfinooh, nifty14:51
stephenfinmriedem++14:51
sean-k-mooneymriedem: was a blueprint ever filed for that14:52
kashyapYeah, the "thread" view is easy to miss if you're not often parsing the archives :-)14:52
kashyapmriedem: Some elephant-like memory you've got there!14:53
*** luksky has quit IRC14:53
kashyapWhat is it that you munch on for breakfast?14:53
kashyapsean-k-mooney: No, it wasn't filed, near as I see.14:54
*** tbachman has quit IRC14:54
* kashyap adds it to the KM-long TODO list this month; will get to it14:54
*** luksky has joined #openstack-nova14:55
*** jaosorior has quit IRC14:55
*** brault has joined #openstack-nova14:56
sean-k-mooneythat's a shame, it would be nice to be able to handle cross arch emulation properly in openstack14:56
mriedemsean-k-mooney: not that i'm aware of14:57
kashyapsean-k-mooney: Filing it right now...14:57
*** jaosorior has joined #openstack-nova14:57
mriedemheh so you guys went from "this is a bug burn it burn it!" to "hey it's a feature let's support it!"14:58
mriedemkashyap: i remember the thread because i was the only one engaging chris on it14:58
mriedemand it took me awhile to understand what he was trying to do14:59
kashyapmriedem: I completely missed it due to filtering :-(  Normally anything with 'qemu' or 'libvirt' in the thread, I make it a point to engage14:59
*** cmart has quit IRC14:59
kashyapThe case is valid for *test* / *devel*: because as of a couple of hours ago, I was running an AArch64 guest on x86_64 -- to test some PCIe stuff15:00
openstackgerritLee Yarwood proposed openstack/nova master: WIP libvirt: Use SATA bus for cdrom devices when using Q35 machine type  https://review.opendev.org/66301115:00
openstackgerritLee Yarwood proposed openstack/nova master: DNM: Run tempest-full-py3 with q35 machine type  https://review.opendev.org/66288715:00
lyarwoodstephenfin: ^ can you take a look at that during the team call and I'll sort tests out in the background.15:00
kashyapBut even for that test to be reliable, I had to ask a person with actual AArch64 hardware15:00
mriedemcouldn't the same be said for nested virt?15:00
sean-k-mooneykashyap: test and dev are two of the larger use cases for openstack15:01
sean-k-mooneynot everything is a long lived NFV app :)15:01
kashyapmriedem: Somewhat; some people use nested for real workloads15:01
mriedemthat's my point15:02
kashyapAs that still is using hardware extensions15:02
kashyaps/that/that's/15:02
*** sapd1_x has quit IRC15:02
kashyapFor pure emulation (or "TCG") -- no, every instruction is emulated15:02
sean-k-mooneysome people use emulation for real workload too15:02
* kashyap back in a bit; meeting15:02
*** _hemna has joined #openstack-nova15:15
*** gyee has joined #openstack-nova15:16
*** tbachman has joined #openstack-nova15:19
stephenfinlyarwood: Done. Think there's _another_ bug here. Might be helpful to get aspiers input on it15:22
stephenfinaspiers: Referring to https://review.opendev.org/#/c/663011/9/nova/virt/libvirt/utils.py@56315:23
*** rpittau|brb is now known as rpittau15:26
lyarwoodstephenfin: yup that's true15:27
lyarwoodstephenfin: and when you say dedent?15:27
stephenfinthe opposite of indent?15:28
kashyapI think he means to unindent15:28
stephenfinoh, I thought dedent was a word15:28
stephenfinTIL15:28
kashyapI got used to "stephenfin speak" on that :D15:28
lyarwoodstephenfin: right it is, but I didn't think that was valid tbh15:28
stephenfinthe word or what I'm suggesting?15:28
lyarwoodstephenfin: what you're suggesting15:29
lyarwoodstephenfin: if pep8 is happy then I'm happy, I just didn't think it would be ;)15:29
stephenfinThink I suggested two dedents. Which one are you asking about?15:29
*** markvoelker has joined #openstack-nova15:29
lyarwoodstephenfin: the comment is the one I'm looking at15:30
stephenfinthe docstring for get_disk_bus_for_device_type? It's indented by 8 but it should only be indented by 415:31
lyarwoodkk15:31
kashyapsean-k-mooney: On "emulation for real workloads": it is completely and utterly baloney, because TCG is completely insecure and upstream provides no guarantee whatsoever.15:32
kashyap[It is to be limited to test/dev/CI; that's it.]15:32
*** aloga has quit IRC15:32
sean-k-mooneykashyap: ci is a real workload15:33
kashyapOn the "security" bit, of course, some people will come back with: "but, we're in a 'trusted network' [whatever that means]"15:33
kashyapsean-k-mooney: Right.  I didn't define the word "real", though :-)15:33
*** aloga has joined #openstack-nova15:34
kashyapsean-k-mooney: Fresh off the oven:15:39
kashyap < abologna> danpb, kashyap: I tried starting an aarch64/virt guest with 32 pcie-root-ports with ,io-reserve=0 (hacked libvirt) and it boots fine15:39
aspiersstephenfin, lyarwood: dedent is definitely a word ;-) https://docs.python.org/3/library/textwrap.html#textwrap.dedent15:43
aspierseven though maybe not in the official dictionaries ...15:44
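For reference, a minimal sketch of what the linked textwrap.dedent does, applied to the "indented by 8 but should be 4" case discussed above. The docstring text here is made up for illustration and is not the real docstring of get_disk_bus_for_device_type.

```python
import textwrap

# Hypothetical docstring body, indented by 8 spaces as in the review comment.
over_indented = (
    "        Determine the disk bus for a device type.\n"
    "\n"
    "        Returns the bus name as a string.\n"
)

# dedent() strips the leading whitespace common to all non-blank lines.
dedented = textwrap.dedent(over_indented)

# indent() puts a prefix back on non-blank lines, giving the 4-space form.
reindented = textwrap.indent(dedented, "    ")
print(reindented)
```

Blank lines are left empty by both calls, so the result keeps pep8 happy about trailing whitespace.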
*** damien_r has quit IRC15:44
*** rpittau is now known as rpittau|afk15:46
*** jaosorior has quit IRC15:50
*** _hemna has quit IRC15:54
*** wwriverrat has joined #openstack-nova15:59
*** luksky has quit IRC16:00
*** helenafm has quit IRC16:01
*** markvoelker has quit IRC16:02
*** tesseract has quit IRC16:03
*** whoami-rajat has quit IRC16:07
*** ttsiouts has quit IRC16:07
*** ttsiouts has joined #openstack-nova16:08
*** ttsiouts has quit IRC16:12
*** whoami-rajat has joined #openstack-nova16:12
openstackgerritJohn Garbutt proposed openstack/nova master: Admin password check for project or system scope  https://review.opendev.org/66371516:13
openstackgerritJohn Garbutt proposed openstack/nova master: WIP: Add new style rule for admin_password  https://review.opendev.org/66371616:13
openstackgerritJohn Garbutt proposed openstack/nova master: WIP: test admin_password with opt out  https://review.opendev.org/66371716:13
openstackgerritMatt Riedemann proposed openstack/nova master: Update quota known issues docs  https://review.opendev.org/66257016:22
openstackgerritMatt Riedemann proposed openstack/nova master: Cleanup quota user docs  https://review.opendev.org/66257316:22
*** tjgresha has joined #openstack-nova16:25
*** dikonoor has quit IRC16:27
mriedemartom: i dumped some thoughts into your bottom change that renames the nova-live-migration job16:43
mriedemkind of scatterbrained though16:43
*** xek_ has quit IRC16:45
*** mgoddard has quit IRC16:48
openstackgerritsean mooney proposed openstack/nova master: extend libvirt video model support  https://review.opendev.org/64773316:49
openstackgerritJohn Garbutt proposed openstack/nova master: WIP: stop admin_password information leakage  https://review.opendev.org/66372116:50
sean-k-mooneyjohnthetubaguy: sorry for the delay but i have added the extra testing to ^16:50
*** mgoddard has joined #openstack-nova16:50
artommriedem, thank you for your dump (should TYFYD become a thing?)16:50
johnthetubaguysean-k-mooney: I have to run now, tuba rehearsal later, putting it on the pile16:50
sean-k-mooneyno worries16:51
sean-k-mooneyum, i think it's more or less ready to go but we can probably drop it out of the runway at this point16:51
* artom is confused about cold migration and resize16:53
artomI thought they were different operations?16:53
*** ricolin has quit IRC16:54
sean-k-mooneythey are but they share a lot of the same code16:54
sean-k-mooneythe only real difference is a cold migrate does not change the flavor16:54
sean-k-mooneyeverything else is the same16:54
artomAha...16:55
sean-k-mooneyi think there is some special casing for resize to the same host but again it's minimal16:55
artommriedem, so, my first thought is, if we already have it in tempest-slow, then keep it there16:56
artomSo I'd just abandon https://review.opendev.org/#/c/653498/ entirely16:56
artomBut since I don't want to block on https://review.opendev.org/#/c/663405/1, I'd put the fix patch below it16:57
artomSo we'd get the revert test coverage with the fix16:57
*** davidsha has quit IRC16:58
mriedemartom: cold migrate and resize are the same except for the flavor16:58
mriedemfor the intent of your bug fix, it doesn't matter which one runs16:58
artomConversely, skip the revert tests in tempest-slow entirely, and keep my multinode patch. Because of the -slow part16:58
mriedemartom: i was also thinking that if i can get that cold migrate resize revert test unskipped then yeah we don't need to add new tests to the nova-live-migration job16:59
kashyapstephenfin: mriedem: sean-k-mooney: As promised, I bring you https://blueprints.launchpad.net/nova/+spec/pick-guest-arch-based-on-host-arch-in-libvirt-driver16:59
kashyap[Corrections, snide remarks, rotten tomatoes welcome.]16:59
*** markvoelker has joined #openstack-nova16:59
mriedemartom: i'm not sure how you'd depend on https://review.opendev.org/#/c/663405/ if you drop your job rename patch16:59
* kashyap AFK; back later16:59
*** derekh has quit IRC17:00
artommriedem, I'd "rebase" my fix below it in order to, in the top patch (aka 663405), still have a revert test with my fix17:00
sean-k-mooneykashyap: there is a way to select host that have hardware accleration already17:01
*** mrhillsman is now known as openlab17:02
mriedemartom: you can't rebase a nova change "below" https://review.opendev.org/#/c/663405/ since it's in a different repo...17:02
sean-k-mooneykashyap: you can use the vm_mode image property and specify hvm https://github.com/openstack/glance/blob/master/etc/metadefs/compute-hypervisor.json#L30-L4117:03
artom*facepalm*17:03
*** damien_r has joined #openstack-nova17:03
artommriedem, make yours depend on mine then, same effect, right?17:03
*** dpawlik has quit IRC17:04
sean-k-mooneykashyap: that said we never test that in the ci as far as im aware so no idea if it really works17:04
*** dpawlik has joined #openstack-nova17:04
*** openlab is now known as codebauss17:05
mriedemartom: that's one option yeah. i'm not sure it's the right option...since unskipping that test should be possible without your change - although i guess one could argue that maybe the test isn't safe if you're using ovs hybrid plugging w/o your fix17:05
*** dtantsur is now known as dtantsur|afk17:06
artommriedem, unskipping that change *is* possible without my fix (upstream gate doesn't use hybrid plug)17:06
artomMy idea was more to have integration test coverage of my code, at least in the non-hybrid-plug case17:06
mriedemartom: i think you missed my point,17:08
mriedemtempest can be run anywhere on any config,17:08
artomIt happens a lot17:08
mriedemsince it's just hitting APIs,17:08
artomOOOIC17:08
mriedemso while unskipping it could be ok in the gate b/c of how most of the devstack jobs are setup (ovs w/o hybrid plug or lb),17:08
artomUpstream gate would be fine, but we'd break someone else who runs it with hybrid plug17:08
mriedemthat doesn't mean someone, like your downstream ci, couldn't break on it17:08
mriedemcorrect17:09
mriedemi'm basically convincing myself on your behalf17:09
artomOur downstream CI is permanently on fire anyways17:09
mriedemb/c of tripleo17:09
mriedemsure17:09
mriedem:)17:09
artom(Did I say that out loud? It's not true, obviously)17:09
sean-k-mooneysure...17:09
*** dpawlik has quit IRC17:10
artommriedem, what I selfishly *don't* want to do is have my fix depend on your tempest patch17:10
artomBecause your tempest patch might be stuck in recheck hell for a while, and I'm in a *massive* hurry ;)17:11
artomBut... I think you made the case for me that it's better the other way around anyways17:11
efriedkashyap, mriedem: https://blueprints.launchpad.net/nova/+spec/pick-guest-arch-based-on-host-arch-in-libvirt-driver for train?17:11
sean-k-mooneyefried: proably not17:12
mriedemefried: no17:12
efriedight, thx17:12
mriedemimo17:12
efriedjust wanted to tag series goal if so, so it shows up on dashboard things.17:12
efriedNOT tagging.17:12
sean-k-mooneydo we have a NEXT tag or something we can use17:13
mriedemartom: i have a hard time knowing what your team considers a priority ever really...17:13
sean-k-mooneye.g. after train17:13
mriedemuntil the boss (mdbooth) shows up asking for reviews17:13
*** codebauss is now known as openlab17:13
sean-k-mooneymriedem: our release folks consider it a blocker for the osp 15 beta17:13
artommriedem, speaking for myself I try not to push that angle too much, because what's important for upstream isn't necessarily what's important for us17:13
artomOr vice versa17:14
mriedemsean-k-mooney: did those people never realize this has always been broken?17:14
sean-k-mooneybut its a beta so its not going to break production deployments17:14
artomI just feel weird saying "well this thing is super important for *us*, so dump everything else and work on my thing"17:14
mriedemunless you guys just changed to hybrid plug17:14
*** openlab is now known as codebauss17:14
artomBut if you'd rather I just be up front about it, I can do that too17:14
mriedembtw, whatever happened with russell ovn'ing the world at rh?17:15
sean-k-mooneymriedem: its because its showing up in the downstream ci and that makes them unhappy17:15
mriedemsean-k-mooney: oh easy, just stop doing testing :)17:15
*** codebauss is now known as openlab17:15
mriedemthen you're in the same state as before17:15
mriedemheads in the sand and whatnot17:15
melwittI think we switched our default network backend to ovn recently. sean-k-mooney correct me17:16
*** openlab is now known as codebauss17:16
mriedemartom: if you're cool with dropping the nova-live-migration rename + resize revert stuff (i still think the rename is good at some point, just not while hurrying for a fix) i'm ok with adding a depends-on from my unskip to your fix17:16
sean-k-mooneyhehe well for 15 we were planning to test with ovn but it has other issues17:16
mriedemooo the plot thickens17:16
artommriedem, 🤝17:16
artom(That's a handshake shaped like a yellow heart)17:17
mriedemok - yeah pidgin doesn't render those17:17
melwittI can see it. looks cool17:17
artomsean-k-mooney, so what's geneve? another name for OVN?17:17
sean-k-mooneymelwitt: yes but ovn only works when we disable waiting for the network events on live migration so i think they wanted to test ml2/ovn with iptables as a fallback plan17:17
artom'cuz that's the default networking thingee in our ospd custom job17:17
sean-k-mooneygeneve is a l3 tunneling protocol like vxlan17:17
sean-k-mooneyand it's the protocol ovn uses for its networking by default17:18
sean-k-mooneyartom: you are referring to the neutron type driver which for ovn would be geneve17:19
*** _hemna has joined #openstack-nova17:19
artomsean-k-mooney, "Choose the network variant (default for OSP15 is 'geneve')17:19
artomFor the geneve option set the NETWORK_OVN paremeter to yes."17:19
artom(From the Jenkins job build page)17:19
artomSo yeah, looks like OSP15 default is OVN17:19
sean-k-mooneymriedem: but to your point yes this has always been broken in one way or another17:20
openstackgerritArtom Lifshitz proposed openstack/nova master: Revert resize: wait for events according to hybrid plug  https://review.opendev.org/64488117:21
sean-k-mooneybut i think we only started waiting in stein so it's only showing up now as they are testing Stein/osp1517:21
artomsean-k-mooney, I guess you can abandon https://review.opendev.org/#/c/660782/ now, since we're settling on https://review.opendev.org/#/c/663405/117:22
sean-k-mooneyum, sure, i never intended it to merge anyway so if you don't need it for testing then sure i'll abandon17:23
artomWe could replace it with another DNM on top of mriedem's tempest patch to run tempest-slow with hybrid plug17:24
artomFor the extra confidence17:24
artomBut we've seen it pass already, not sure how necessary a new patch would be17:24
*** codebauss is now known as mrhillsman17:24
sean-k-mooneywell i can add a depends-on to matt's change too but ok i'll just abandon it17:24
artomWell no you can't, because your change was in the multinode job17:25
artomWhereas mriedem's change unskips a test in tempest-slow17:25
sean-k-mooneyyes but my change didn't disable tempest-slow17:25
sean-k-mooneybut anyway its abandoned17:26
sean-k-mooneyyou know how to reproduce this if you need to so i don't need to keep a patch open17:26
*** dpawlik has joined #openstack-nova17:26
artomI need lunch17:26
* artom -> pheeding17:26
*** dpawlik has quit IRC17:31
*** markvoelker has quit IRC17:32
*** damien_r has quit IRC17:33
*** ociuhandu_ has joined #openstack-nova17:39
*** _hemna has quit IRC17:39
*** spsurya has quit IRC17:40
*** udesale has quit IRC17:42
*** dpawlik has joined #openstack-nova17:42
*** ociuhandu has quit IRC17:42
*** ociuhandu_ has quit IRC17:43
*** dpawlik has quit IRC17:46
*** jdillaman has joined #openstack-nova17:49
*** _hemna has joined #openstack-nova17:56
*** priteau has quit IRC17:56
*** panda has quit IRC17:59
*** lennyb has quit IRC18:00
*** panda has joined #openstack-nova18:01
openstackgerritMatt Riedemann proposed openstack/nova master: Add functional recreate test for bug 1829479 and bug 1817833  https://review.opendev.org/66373718:02
openstackbug 1829479 in OpenStack Compute (nova) "The allocation table has residual records when instance is evacuated and the source physical node is removed" [Medium,Triaged] https://launchpad.net/bugs/182947918:02
mriedemcfriesen: sean-k-mooney: ^18:02
openstackbug 1817833 in OpenStack Compute (nova) "Check compute_id existence when nova-compute reports info to placement" [Undecided,In progress] https://launchpad.net/bugs/1817833 - Assigned to xulei (605423512-j)18:02
mriedembugtastic18:02
*** mriedem is now known as mriedem_lunch18:02
*** mgoddard has quit IRC18:03
*** mgoddard has joined #openstack-nova18:03
*** ralonsoh has quit IRC18:04
sean-k-mooneymriedem_lunch: the comments help but ya that looks correct18:05
*** bbowen_ has joined #openstack-nova18:07
*** bbowen has quit IRC18:09
*** bbowen__ has joined #openstack-nova18:09
*** bbowen_ has quit IRC18:12
*** _hemna has quit IRC18:16
*** damien_r has joined #openstack-nova18:22
*** mvkr has quit IRC18:28
*** markvoelker has joined #openstack-nova18:29
efriedI wonder, if we made a bug whose title referenced another bug, and that bug's title referenced the first, could we make patchbot spin forever?18:32
*** dpawlik has joined #openstack-nova18:37
openstackgerritEric Fried proposed openstack/nova master: Introduces the openstacksdk to nova  https://review.opendev.org/64366418:41
openstackgerritEric Fried proposed openstack/nova master: Use OpenStack SDK for placement  https://review.opendev.org/65602318:41
openstackgerritEric Fried proposed openstack/nova master: Introduces SDK to IronicDriver and uses for node.get  https://review.opendev.org/64289918:41
openstackgerritEric Fried proposed openstack/nova master: Use SDK instead of ironicclient for node.list  https://review.opendev.org/65602718:41
openstackgerritEric Fried proposed openstack/nova master: Use SDK instead of ironicclient for validating instance and node  https://review.opendev.org/65602818:41
openstackgerritEric Fried proposed openstack/nova master: Use SDK instead of ironicclient for setting instance id  https://review.opendev.org/65969018:42
*** dpawlik has quit IRC18:42
openstackgerritMerged openstack/nova master: Add testing guide for down cells  https://review.opendev.org/65016718:50
*** dpawlik has joined #openstack-nova18:53
*** dpawlik has quit IRC18:58
*** markvoelker has quit IRC19:02
*** mriedem_lunch is now known as mriedem19:13
*** owalsh has quit IRC19:18
*** bnemec has quit IRC19:23
*** d34dh0r53 has quit IRC19:25
*** zbr has quit IRC19:25
*** bnemec has joined #openstack-nova19:25
*** bnemec has quit IRC19:31
*** d34dh0r53 has joined #openstack-nova19:31
*** owalsh has joined #openstack-nova19:34
*** bnemec has joined #openstack-nova19:34
*** bnemec has quit IRC19:41
*** bnemec has joined #openstack-nova19:42
*** luksky has joined #openstack-nova19:49
*** bbowen__ has quit IRC19:53
*** imacdonn has quit IRC19:53
*** imacdonn has joined #openstack-nova19:54
*** markvoelker has joined #openstack-nova19:59
*** panda has quit IRC20:04
mriedemartom: i guess you missed these comments https://review.opendev.org/#/c/644881/19//COMMIT_MSG@1120:04
*** panda has joined #openstack-nova20:05
*** ccamacho has quit IRC20:08
*** hongbin has joined #openstack-nova20:14
*** bnemec has quit IRC20:16
*** bnemec has joined #openstack-nova20:17
*** tjgresha has quit IRC20:21
mriedemdansmith et al, I'm +2 on the nova/cyborg spec now https://review.opendev.org/#/c/603955/20:27
*** markvoelker has quit IRC20:32
*** mdbooth_ has joined #openstack-nova20:40
*** mdbooth has quit IRC20:41
*** damien_r has quit IRC20:46
*** takashin has joined #openstack-nova20:48
*** damien_r has joined #openstack-nova20:49
mriedemmelwitt: see what you think about my proposed wording in this "specify az on unshelve" spec and if you agree i'll update it https://review.opendev.org/#/c/624689/10/specs/train/approved/support-specifying-az-when-restore-shelved-server.rst@6220:49
*** dpawlik has joined #openstack-nova20:54
*** damien_r has quit IRC20:55
*** BjoernT has quit IRC20:56
efriednova meeting in 3 minutes in #openstack-meeting20:58
*** dpawlik has quit IRC20:59
melwittmriedem: ack20:59
efriednova meeting now21:02
mriedemno meeting?!21:07
*** tbachman has quit IRC21:07
mriedemwas reviewing something...21:07
*** tbachman has joined #openstack-nova21:08
*** dpawlik has joined #openstack-nova21:10
*** dpawlik has quit IRC21:14
*** mriedem is now known as mriedem_afk21:26
*** whoami-rajat has quit IRC21:27
melwittmriedem_afk: +1 to proposed wording21:28
*** markvoelker has joined #openstack-nova21:29
*** JamesBenson has quit IRC21:35
openstackgerritDustin Cowles proposed openstack/nova master: WIP: Use SDK instead of ironicclient for add/remove instance info from node  https://review.opendev.org/65969121:43
tonybkashyap: I don't but it's on my list of things to get working this year21:53
tonybkashyap: 'sup?21:53
*** bbowen__ has joined #openstack-nova21:54
*** d34dh0r53 has quit IRC22:01
*** markvoelker has quit IRC22:03
*** JamesBenson has joined #openstack-nova22:07
*** JamesBenson has quit IRC22:12
openstackgerritMerged openstack/nova master: [Docs] Update the confusing console output  https://review.opendev.org/58900422:14
*** rcernin has joined #openstack-nova22:23
*** rcernin has quit IRC22:23
*** rcernin has joined #openstack-nova22:24
*** rcernin has quit IRC22:26
*** slaweq has quit IRC22:27
*** rcernin has joined #openstack-nova22:28
*** d34dh0r53 has joined #openstack-nova22:28
*** JamesBenson has joined #openstack-nova22:30
*** JamesBenson has quit IRC22:35
*** mlavalle has quit IRC22:41
mnaseralright.  has anyone ever ran into this?  i'm convinced this feels like an oslo bug or something.22:47
mnaseri've noticed that if i have a 3 node rabbitmq cluster, and the cluster loses a node and regains it, for some reason, its like nothing gets routed across rabbitmq22:47
mnaserin this case, an instance gets booted and goes into SCHEDULING state and gets stuck forever22:47
mnaserthis is with rabbitmq 3.6.16 -- the only way i've solved it often is by deleting all the queues and restarting nova22:48
mnaserthe one 'odd' thing is with osa, we deploy a cluster with ssl for rabbitmq22:48
mnaseri have a (non-prod) cluster stuck at this state right now and i really want to get to the bottom of what/why22:49
mnaserall queues listing 0 messages.. except for notifications22:51
*** luksky has quit IRC22:51
*** tkajinam has joined #openstack-nova22:53
*** slaweq has joined #openstack-nova22:55
melwittmnaser: feel like you're going to need to get some other operator input about that one.. I haven't personally heard of it22:59
*** markvoelker has joined #openstack-nova22:59
melwittimacdonn ^23:00
mnasermelwitt: i've ran into it a bunch of times and had others who ran to it and the "drop all queues" thing fixed it..23:00
*** _hemna has joined #openstack-nova23:00
melwittI also asked penick to look at your messages here and let us know if he's seen it23:00
mnasermelwitt: yay cool thanks, i'm going to continue digging, this is a non prod environment so i have the time to dig into it23:01
mnaserusually things are under fire when i need to do this23:01
melwittyeah, that's awesome you got a repro in non-prod23:01
mnaserfeels like it's gonna be a really annoying one though23:02
*** penick has joined #openstack-nova23:02
melwittyeah. another person we could ask is kgiusti, he knows oslo.messaging23:03
mnaserso, instance in scheduling state, 7 messages in a bunch of scheduler_fanout_* queues23:04
mnaserbut nothing happening, so i guess the conductor is doing what it's supposed to do23:04
penick@mnaser we've had weird issues with rabbitmq and ssl. Mostly with performance and the queue faceplanting though. I don't think we've ever had to wipe and recreate all queues.23:04
mnaser:( sads, yeah, it's pretty destructive but not sure..23:05
penickThough the way we worked around the ssl issue is we switched from native SSL to using stunnel. Which is awful23:05
penicksomething about native rabbit ssl is borked23:05
mnaseroh wow, is the native rabbit ssl that bad23:05
mnasercan i ask what release were you running then (or now)?23:05
penickOcata23:05
penickoh, of rabbit23:05
penickuh, one sec23:05
mnaserthanks :>23:06
mnaserthere is exactly 48 scheduler_fanout_* queues23:06
mnasereach controller has 16 nova-scheduler processes, which adds up to 4823:06
mnaserso the theory is nova-conductor is doing its job but the scheduler just isn't picking up work off the queue..23:07
penick3.6.1523:07
melwittpenick: I wondered whether it might be related to queue mirroring or non, clustered or non, how it behaves when a node leaves and rejoins23:08
mnaseryeah we're at 3.6.16 so we're not far away23:08
*** slaweq has quit IRC23:08
mnasermelwitt: yeah so actually i wondered if that was maybe the issue..23:08
imacdonnFWIW, I don't run clustered rabbitmq - just a single instance23:08
melwittI had thought there might have been some configuration where that doesn't work well, but I can't remember anything more about it23:09
mnaserimacdonn: you're living life23:09
* mnaser would like to just buy a big box with a zillion cores and memory and run a single instance of rabbit23:09
melwitt:D23:09
imacdonnI think so :) My career experience with clustering amounts to "it usually causes more problems than it solves"23:09
mnaserthanks penick for the insights, i guess we might be hitting the weird ssl things you might have ran into23:09
mnaseri guess i can bring the log levels up on the mq stuff and see how/why its getting bound and why its not picking up any messages23:10
imacdonnyeah, there's stuff you can turn on with default_log_levels23:10
mnaseri mean i think the api is doing its job pushing things out to the scheduler fanout, but then things faceplant there23:11
*** slaweq has joined #openstack-nova23:11
* mnaser always hates dealing with default_log_levels23:11
*** dpawlik has joined #openstack-nova23:11
melwittlog levels in python are totally easy to control (not)23:12
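As a rough illustration of the per-logger control being joked about here, a minimal sketch using only the stdlib logging module. It assumes the "logger=LEVEL" pair form used by default_log_levels; the helper apply_log_levels and the logger names chosen are hypothetical and this is not how oslo.log itself is implemented.

```python
import logging


def apply_log_levels(pairs):
    """Apply 'logger=LEVEL' pairs with the stdlib logging module."""
    applied = {}
    for pair in pairs:
        name, _, level_name = pair.partition("=")
        name = name.strip()
        # getLevelName() maps a known level name to its numeric value and
        # returns a string such as 'Level FOO' for an unknown name.
        level = logging.getLevelName(level_name.strip().upper())
        if not isinstance(level, int):
            raise ValueError("unknown log level in %r" % pair)
        logging.getLogger(name).setLevel(level)
        applied[name] = level
    return applied


# Illustrative values only: raise messaging-related loggers to DEBUG.
levels = apply_log_levels(["oslo.messaging=DEBUG", "amqp=DEBUG", "kombu=INFO"])
```

Each named logger gets its own threshold, so the messaging loggers can be made noisy without turning on DEBUG for the whole service.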
mnaseri'll at least shut down the other 2 schedulers so i can debug on one host23:12
*** dpawlik has quit IRC23:15
mnaseroh that's interesting23:16
mnaserit actually schedules it23:16
mnaserit even puts allocations in placement23:16
mnasersends notifications..23:16
melwittso, it doesn't get past the scheduler? doesn't move on back to conductor and then to compute?23:18
mnaserit gets to the scheduler, then the scheduler sends something, gets a reply, and it seems to stall out there23:20
*** brinzhang has joined #openstack-nova23:22
mnaseri guess i need to find out what happens after an instance is scheduled, it sends something back to the conductor (or it casts to that cell?)23:22
penick@mnaser if it helps we don't use durable queues23:22
melwittmnaser: this is old but sounds kind of the same? https://github.com/rabbitmq/rabbitmq-server/issues/22423:22
mnaserhmm, wonder what the defaults are23:23
mnaseroslo_messaging_rabbit.amqp_durable_queues = False yeah same here23:23
mnaseroh interesting23:24
*** slaweq has quit IRC23:24
melwittare you using mirrored queues?23:25
melwittpenick doesn't use mirrored and I think that's the thing that had issues that I'm vaguely remembering. and that issue I linked from rabbitmq also seems to have to do with mirrored queues23:26
mnaseri think we mirror a subset of queues in OSA23:26
*** dpawlik has joined #openstack-nova23:27
melwittthat issue was fixed eons ago but was about "some kind of race with mirrored, auto_delete queues"23:29
mnaserhttps://github.com/openstack/openstack-ansible/blob/dc5729ad6f10b7e6083d60c2270f75090cf3d5f4/inventory/group_vars/all/infra.yml#L27-L3123:29
melwittI wonder if a new race could have cropped up recently23:29
mnaserso question, after an instance is scheduled, does it then contact the rabbitmq cluster @ the cell?23:31
*** markvoelker has quit IRC23:31
*** dpawlik has quit IRC23:31
*** jaypipes has quit IRC23:33
melwittwe do a sync call over rpc from conductor to scheduler, get the result in conductor, then conductor async casts to compute over rpc23:33
melwittit sounds like what you're saying is scheduler does fine, creates allocs in placement etc, but then the reply never goes back to conductor over rpc23:34
mnaserit looks like the reply to the conductor does get sent out.. so i need to maybe go back and check the conductor this time i guess23:35
melwittif that is the case, then it's never getting to the cell mq part23:35
melwittif the super conductor receives the reply, the next step is super conductor casts to the cell mq23:35
mnaserok, ill move away from scheduler and start looking at conductor now23:36
mnaserconductor, the worlds quietest service23:37
melwitt:)23:37
melwittyou know.... we do cache mq connection info. so if you find it's hanging up at the step where super conductor is trying to drop the "build instance" message onto the cell mq and that cached info is no longer valid, that could be the culprit23:38
melwittbut no, you said things return to normal if you kill the rabbit queues and recreate them without restarting any nova services yeah?23:38
*** claudiub has quit IRC23:38
melwittthat would mean the nova cache thing doesn't matter23:38
mnaserok so schedule_and_build_instances gets to the point where it gets an assigned host23:40
mnaserand it creates a block device mapping in the db for the cell23:42
melwittok, so it's getting the reply from the scheduler. so it must be that the message going onto the cell mq is going into the ether23:43
melwittwhich makes me suspicious about the mq connection caching23:43
melwittmnaser: when you do the queues recreate thing, are you restarting nova services after?23:44
mnaseryes i have to restart them too after (but i do that just to keep things clean rather than make it work)23:45
melwitt*do you restart nova services after?23:45
melwittok, then it might be a problem with that cache (it never refreshes)23:45
melwitthave you ever tried restarting services without recreating the queues?23:46
mnaseryep, no bueno23:46
melwittrestarting would only help new server creates though, not the stuck ones23:47
openstackgerritTakashi NATSUME proposed openstack/nova master: Fix cleaning up console tokens  https://review.opendev.org/63771623:47
mnaserso i know i at least get to here: https://github.com/openstack/nova/blob/3370f0f03ce17aaf3a7ebaa95d497f62bef238c0/nova/conductor/manager.py#L1400-L140123:47
melwittmnaser: does recreating the queues and restarting services make the stuck servers finish building?23:48
mnasermelwitt: yeah but ive restarted them all in this case and no bueno23:48
openstackgerritTakashi NATSUME proposed openstack/nova master: Add a live migration regression test  https://review.opendev.org/64120023:48
mnasernope, all the stuck ones stay stuck23:48
melwittok, so restarting services only will NOT make new server creates work?23:48
openstackgerritTakashi NATSUME proposed openstack/nova master: Remove mox in unit/network/test_neutronv2.py (13)  https://review.opendev.org/57602023:48
openstackgerritTakashi NATSUME proposed openstack/nova master: Remove mox in unit/network/test_neutronv2.py (14)  https://review.opendev.org/57602723:48
openstackgerritTakashi NATSUME proposed openstack/nova master: Remove mox in unit/network/test_neutronv2.py (15)  https://review.opendev.org/57603123:49
mnasermelwitt: correct23:49
melwittif restarting services only doesn't make new server creates work, then I'd think the issue has to be with rabbit or oslo.messaging23:49
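The caching theory above can be illustrated with a toy model. This is only a sketch: `CellConnectionCache`, `connect`, and `broker_generation` are hypothetical names invented for illustration and are not nova's actual caching code. It assumes a per-process cache that stores one connection per cell and never refreshes it.

```python
class CellConnectionCache:
    """Toy per-process cache: one connection per cell, never refreshed."""

    def __init__(self, connect):
        self._connect = connect  # callable: cell_id -> connection object
        self._cache = {}

    def get(self, cell_id):
        # Only the first lookup for a cell actually connects.
        if cell_id not in self._cache:
            self._cache[cell_id] = self._connect(cell_id)
        return self._cache[cell_id]


# Hypothetical stand-in for "which incarnation of the broker state is live".
broker_generation = {"cell1": 1}


def connect(cell_id):
    # A real connection would be a transport object; a tuple is enough here.
    return (cell_id, broker_generation[cell_id])


cache = CellConnectionCache(connect)
first = cache.get("cell1")

# The queues are recreated on the broker side while the service keeps running.
broker_generation["cell1"] = 2
stale = cache.get("cell1")  # still the generation-1 connection

# Restarting the service means starting over with an empty cache.
restarted = CellConnectionCache(connect).get("cell1")
```

In this model a restart always clears the stale entry, which is why a failure that survives a service restart points away from the in-process cache and toward rabbit or oslo.messaging.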
mnaseryeah my feeling is this is oslo.messaging23:50
mnaserhttps://github.com/openstack/nova/blob/3370f0f03ce17aaf3a7ebaa95d497f62bef238c0/nova/conductor/manager.py#L1462-L1473 -- i get here, i see it connect to rabbit, i see it cast on compute.<hostname> under exchange nova23:50
mnaserand then nothing23:50
melwittcould also be a mirrored queues thing in rabbit, there's been that similar bug in the past23:51
melwittthat I linked earlier23:51
mnaseryeah.. it also connects to rabbitmq successfully... well the weird thing is i think maybe what is broken is casts only?23:51
mnaserbecause the normal rpc stuff works23:51
mnaserits the compute cast that fails23:51
melwittyou could try something that is a call to compute to see if that works. that would be an interesting test23:52
melwittlike getting vnc console url23:52
mnaserthe service does report as up too inside nova23:52
mnaserdont have any vms unfortunately :P23:52
melwittlemme see if there's something else23:52
mnaserhypervisor-stats maybe23:52
melwittget_diagnostics23:53
mnasererr -uptime23:53
mnaserisnt diag per vm23:53
mnasernova hypervisor-uptime <hypervisor-uuid> hangs23:54
melwittoh yeah, sorry23:54
openstackgerritMatt Riedemann proposed openstack/nova-specs master: Specifying az when restore shelved server  https://review.opendev.org/62468923:54
*** mriedem_afk has quit IRC23:54
melwittok, so it doesn't have to do with cast vs call, it's an issue with trying to get to the cell message queue in general23:54
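The cast-versus-call distinction tested above can be sketched with an in-memory stand-in for the message queue. This is a minimal illustration under stated assumptions, not oslo.messaging: `ToyBroker`, `cast`, `call`, and `consume_one` are hypothetical helpers, and the topic and method strings are made up. It shows why a cast to a topic nobody consumes appears to succeed and then "nothing" happens, while a call to the same topic blocks until it times out.

```python
import queue
import threading
import uuid


class ToyBroker:
    """In-memory stand-in for a message queue (not oslo.messaging)."""

    def __init__(self):
        self.topics = {}   # topic -> queue.Queue of pending messages
        self.replies = {}  # msg_id -> queue.Queue holding the reply

    def publish(self, topic, message):
        self.topics.setdefault(topic, queue.Queue()).put(message)


def cast(broker, topic, method):
    """Fire-and-forget: returns as soon as the message is published."""
    broker.publish(topic, {"method": method, "msg_id": None})


def call(broker, topic, method, timeout=0.2):
    """Publish, then block until a reply arrives or the timeout expires."""
    msg_id = uuid.uuid4().hex
    reply_q = broker.replies[msg_id] = queue.Queue()
    broker.publish(topic, {"method": method, "msg_id": msg_id})
    try:
        return reply_q.get(timeout=timeout)
    except queue.Empty:
        raise TimeoutError(f"no reply to {method} on {topic}")
    finally:
        del broker.replies[msg_id]


def consume_one(broker, topic):
    """Toy consumer: handle one message and reply if the sender is waiting."""
    msg = broker.topics.setdefault(topic, queue.Queue()).get()
    reply_q = broker.replies.get(msg["msg_id"])
    if reply_q is not None:
        reply_q.put(f"done: {msg['method']}")


# No consumer on this topic: the cast returns normally, the call times out.
broker = ToyBroker()
cast_result = cast(broker, "compute.host1", "build_instance")
try:
    call(broker, "compute.host1", "host_uptime")
    call_outcome = "replied"
except TimeoutError:
    call_outcome = "timed out"

# With a consumer listening, the same call gets its reply.
live = ToyBroker()
worker = threading.Thread(target=consume_one, args=(live, "compute.host2"))
worker.start()
live_reply = call(live, "compute.host2", "host_uptime", timeout=2.0)
worker.join()
```

Because a cast never waits for a reply, it cannot reveal whether anything consumed the message; a call is the more useful probe, which is why the hanging hypervisor-uptime request narrows the problem to message delivery in general rather than to casts.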
*** rcernin has quit IRC23:55
*** rcernin has joined #openstack-nova23:56
mnaseri mean the configs match.. in terms of cell_mapping23:56
mnaserand it successfully connects..23:56
mnaseri guess now that i know its just reproducible with something far more basic..23:56
melwittyeah... true23:56
sean-k-mooneymelwitt: are you suspecting this is similar to the wsgi issue?23:58
mnaserCALL msg_id: 7dc9fa9615ed4ef8bf083ac84e40beef exchange 'nova' topic 'compute.<hostname>' _send23:59
melwittsean-k-mooney: no, this is something different. I wondered if it's something like this old bug that was fixed in the past, if another race like this has re-emerged https://github.com/rabbitmq/rabbitmq-server/issues/22423:59
mnaserthat's how far it'll get and then hang, same way it hung for the new compute23:59
mnaseri mean let me restart the nova-compute... but still feels like it should be recovering on its own..23:59
