Friday, 2019-05-24

*** ttsiouts has quit IRC00:03
*** itlinux_ has joined #openstack-nova00:04
*** rcernin has quit IRC00:07
*** itlinux has quit IRC00:07
*** frankwang has joined #openstack-nova00:08
*** samueldmq has quit IRC00:12
*** markvoelker has quit IRC00:31
*** brinzhang has joined #openstack-nova00:34
*** ttsiouts has joined #openstack-nova00:35
*** gyee has quit IRC00:39
*** ttsiouts has quit IRC00:40
*** sapd1_x has quit IRC00:41
*** rcernin has joined #openstack-nova01:05
*** ykarel has joined #openstack-nova01:11
*** ykarel has quit IRC01:15
*** ttsiouts has joined #openstack-nova01:15
*** bhagyashris has joined #openstack-nova01:19
*** itlinux_ has quit IRC01:26
*** itlinux has joined #openstack-nova01:30
*** cfriesen has quit IRC01:47
*** minmin has joined #openstack-nova01:48
*** ttsiouts has quit IRC01:49
*** ykarel has joined #openstack-nova01:55
*** itlinux has quit IRC01:56
openstackgerritmelanie witt proposed openstack/nova master: Use instance mappings to count server group members  https://review.opendev.org/63832402:03
openstackgerritmelanie witt proposed openstack/nova master: Add documentation for counting quota usage from placement  https://review.opendev.org/65384502:03
*** itlinux has joined #openstack-nova02:04
*** altlogbot_0 has quit IRC02:12
*** altlogbot_2 has joined #openstack-nova02:13
*** tbachman has quit IRC02:14
*** ykarel has quit IRC02:15
*** markvoelker has joined #openstack-nova02:32
*** gmann has joined #openstack-nova02:42
openstackgerritMerged openstack/nova stable/stein: Disable limit if affinity(anti)/same(different)host is requested  https://review.opendev.org/65923902:42
*** tbachman has joined #openstack-nova02:46
*** ttsiouts has joined #openstack-nova02:55
*** markvoelker has quit IRC03:06
*** nicolasbock has quit IRC03:09
*** whoami-rajat has joined #openstack-nova03:11
*** dave-mccowan has quit IRC03:14
yaawangbauzas: Hi, in spec expose-auto-converge-post-copy(https://review.opendev.org/#/c/651681/), what do you mean by "preffering"?03:23
*** ykarel has joined #openstack-nova03:26
*** ttsiouts has quit IRC03:29
*** psachin has joined #openstack-nova03:31
*** ricolin has joined #openstack-nova03:31
*** itlinux has quit IRC03:40
*** itlinux has joined #openstack-nova03:43
*** boxiang has joined #openstack-nova03:59
*** frankwang has quit IRC04:00
*** ttsiouts has joined #openstack-nova04:02
*** markvoelker has joined #openstack-nova04:02
*** itlinux has quit IRC04:03
*** itlinux has joined #openstack-nova04:06
*** ttsiouts has quit IRC04:07
*** ykarel has quit IRC04:08
*** tbachman has quit IRC04:14
*** boxiang has quit IRC04:15
*** boxiang has joined #openstack-nova04:16
*** ykarel has joined #openstack-nova04:24
*** markvoelker has quit IRC04:35
*** ttsiouts has joined #openstack-nova04:39
*** itlinux has quit IRC04:42
*** itlinux has joined #openstack-nova04:49
openstackgerritSundar Nadathur proposed openstack/nova-specs master: Nova Cyborg interaction specification.  https://review.opendev.org/60395504:58
*** itlinux has quit IRC05:04
*** itlinux has joined #openstack-nova05:05
*** pcaruana has joined #openstack-nova05:08
openstackgerritSundar Nadathur proposed openstack/nova master: ksa auth conf and client for cyborg access  https://review.opendev.org/63124205:10
openstackgerritSundar Nadathur proposed openstack/nova master: WIP: Add Cyborg device profile groups to request spec.  https://review.opendev.org/63124305:10
openstackgerritSundar Nadathur proposed openstack/nova master: WIP: Create and bind Cyborg ARQs.  https://review.opendev.org/63124405:10
openstackgerritSundar Nadathur proposed openstack/nova master: WIP: Get resolved Cyborg ARQs and add PCI BDFs to VM's domain XML.  https://review.opendev.org/63124505:10
*** ttsiouts has quit IRC05:11
*** pcaruana has quit IRC05:16
*** dpawlik has joined #openstack-nova05:16
*** pcaruana has joined #openstack-nova05:17
*** sapd1_x has joined #openstack-nova05:23
*** markvoelker has joined #openstack-nova05:32
*** tkajinam has quit IRC05:32
*** tkajinam has joined #openstack-nova05:34
*** sapd1_x has quit IRC05:35
*** imacdonn has quit IRC05:42
*** imacdonn has joined #openstack-nova05:42
*** frankwang has joined #openstack-nova05:43
*** itlinux has quit IRC05:44
*** tbachman has joined #openstack-nova05:44
*** lpetrut has joined #openstack-nova05:45
*** lpetrut has quit IRC05:46
*** lpetrut has joined #openstack-nova05:46
*** tbachman has quit IRC05:49
eanderssonWhat is the general recommended value for heal_instance_info_cache_interval on a large deployment (~1k nodes)?05:49
eanderssonWith the default we couldn't scale neutron fast enough to actually handle the load.05:50
*** Luzi has joined #openstack-nova06:01
*** sapd1_x has joined #openstack-nova06:05
*** markvoelker has quit IRC06:05
*** kaisers has quit IRC06:13
*** kaisers has joined #openstack-nova06:16
openstackgerritBoxiang Zhu proposed openstack/nova master: Add compute_nodes_uuid field to Destination object  https://review.opendev.org/66118806:18
*** sapd1_x has quit IRC06:18
*** stakeda has joined #openstack-nova06:19
*** ttsiouts has joined #openstack-nova06:27
mnasermelwitt, efried: wrt nova-consoleauth, we've killed it in production (and we've killed it in OSA and looks like Matt proposed a change to remove the cleanup code) -- https://review.opendev.org/#/c/661126/06:31
*** slaweq has joined #openstack-nova06:35
openstackgerritLeehom Li proposed openstack/nova master: RT keep report usage in case CPUPinning conflict  https://review.opendev.org/66120806:42
openstackgerritLeehom Li proposed openstack/nova master: RT keep report usage in case CPUPinning conflict  https://review.opendev.org/66120806:44
*** ivve has joined #openstack-nova06:52
*** dpawlik has quit IRC06:56
*** ttsiouts has quit IRC06:58
*** markvoelker has joined #openstack-nova07:03
*** ratailor has joined #openstack-nova07:06
*** awalende has joined #openstack-nova07:08
*** minmin has quit IRC07:11
*** minmin has joined #openstack-nova07:12
*** tssurya has joined #openstack-nova07:15
jangutterIf only this comic came out at the start of the month: https://xkcd.com/2153/07:24
*** dpawlik has joined #openstack-nova07:26
*** gmann has quit IRC07:28
*** ttsiouts has joined #openstack-nova07:30
*** dtantsur|afk is now known as dtantsur07:30
*** donnyd has quit IRC07:34
*** icey has quit IRC07:34
*** markvoelker has quit IRC07:35
*** mnaser has quit IRC07:35
*** helenafm has joined #openstack-nova07:36
*** fyx has quit IRC07:36
*** rcernin has quit IRC07:36
*** guilhermesp has quit IRC07:37
*** seyeongkim has quit IRC07:39
openstackgerritYongli He proposed openstack/nova master: clean up orphan instances  https://review.opendev.org/62776507:39
*** jungleboyj has quit IRC07:40
*** tkajinam has quit IRC07:41
*** fyx has joined #openstack-nova07:43
*** jungleboyj has joined #openstack-nova07:43
*** seyeongkim has joined #openstack-nova07:43
*** guilhermesp has joined #openstack-nova07:43
*** mnaser has joined #openstack-nova07:43
*** ralonsoh has joined #openstack-nova07:50
*** mnaser has quit IRC08:02
*** tkajinam has joined #openstack-nova08:04
*** jungleboyj has quit IRC08:04
*** mnaser has joined #openstack-nova08:05
*** geekinutah has quit IRC08:06
*** jungleboyj has joined #openstack-nova08:07
*** bbowen_ has joined #openstack-nova08:07
*** rpittau|afk is now known as rpittau08:08
*** rm_work has quit IRC08:09
*** rm_work has joined #openstack-nova08:10
*** geekinutah has joined #openstack-nova08:10
*** fyx has quit IRC08:10
*** bbowen has quit IRC08:11
*** boxiang has quit IRC08:11
*** boxiang has joined #openstack-nova08:11
*** fyx has joined #openstack-nova08:13
*** yankcrime has joined #openstack-nova08:16
*** icey has joined #openstack-nova08:17
*** zbr has joined #openstack-nova08:20
*** ykarel is now known as ykarel|lunch08:21
lyarwoodganso: LGTM but as tonyb said, as this is stable-only we typically get full Nova Core members to also sign off on things08:25
*** davidsha has joined #openstack-nova08:26
lyarwooddansmith / efried ; https://review.opendev.org/#/c/659338 - as above, as this is stable-only would you mind giving this a once over?08:26
lyarwoodmdbooth: https://review.opendev.org/#/c/658903/ - tgif, am I missing the point here?08:28
mdboothlyarwood: looking08:28
lyarwoodah nvm I think I see it now08:31
lyarwoodthey want to pull directly from rbd when using a file based backend08:31
*** awalende has quit IRC08:32
*** awalende has joined #openstack-nova08:32
*** markvoelker has joined #openstack-nova08:32
*** trident has quit IRC08:33
mdboothlyarwood: Right08:34
*** trident has joined #openstack-nova08:34
*** ttsiouts has quit IRC08:35
mdboothlyarwood: I hesitate to say it, but I think we want an os-glance library for this08:35
mdboothOtherwise we're embedding details of glance backend drivers in nova08:36
mdboothI suspect cinder would also use it08:36
lyarwoodwe already have rbd_utils.py08:36
mdboothThat's generic08:36
mdboothAnd glance presumably has other backend drivers08:36
lyarwoodas is this stuff tbh08:36
mdboothwhich aren't ceph08:37
lyarwoodthere's nothing glance specific you'd need here08:37
lyarwoodyou already have the location08:37
*** awalende has quit IRC08:37
lyarwoodif anything rbd_utils etc could all live in os-brick08:37
mdboothlyarwood: See: file, http, rbd, swift, sheepdog, cinder, vmware08:38
openstackgerritBoxiang Zhu proposed openstack/nova master: Add valid compute nodes with host and/or node in api layer  https://review.opendev.org/66123708:42
*** tesseract has joined #openstack-nova08:48
lyarwoodmdbooth: right, I just don't think we need yet another lib for this08:53
*** janki has joined #openstack-nova08:53
*** ccamacho has joined #openstack-nova08:56
*** derekh has joined #openstack-nova08:57
*** bhagyashris has quit IRC09:03
*** markvoelker has quit IRC09:06
*** ttsiouts has joined #openstack-nova09:11
*** ykarel|lunch is now known as ykarel09:11
*** sapd1_x has joined #openstack-nova09:14
*** ttsiouts has quit IRC09:15
*** tkajinam has quit IRC09:36
*** cdent has joined #openstack-nova09:44
*** lpetrut has quit IRC09:46
*** sapd1_x has quit IRC09:46
*** ttsiouts has joined #openstack-nova09:52
*** awalende has joined #openstack-nova09:52
*** awalende has quit IRC09:56
*** tesseract has quit IRC09:56
*** tesseract has joined #openstack-nova09:59
*** awalende has joined #openstack-nova09:59
*** tesseract has quit IRC10:00
*** tesseract has joined #openstack-nova10:01
*** markvoelker has joined #openstack-nova10:03
openstackgerritSilvan Kaiser proposed openstack/nova stable/stein: Added mount fstype based validation of Quobyte mounts  https://review.opendev.org/66070610:08
openstackgerritMerged openstack/nova stable/stein: Fix target used in nova.policy.check_is_admin  https://review.opendev.org/66033010:23
openstackgerritMerged openstack/nova stable/queens: Define irrelevant-files for tempest-full-py3 job  https://review.opendev.org/65046010:23
openstackgerritMerged openstack/nova stable/queens: Error out migration when confirm_resize fails  https://review.opendev.org/65215010:23
*** ttsiouts has quit IRC10:25
openstackgerritMerged openstack/nova stable/queens: Teardown networking when rolling back live migration even if shared disk  https://review.opendev.org/65814910:32
openstackgerritMerged openstack/nova master: Link versioned notification talk into docs  https://review.opendev.org/66111510:32
*** markvoelker has quit IRC10:35
*** panda has quit IRC10:38
*** stakeda has quit IRC10:40
*** brinzhang has quit IRC10:42
*** boxiang has quit IRC10:45
*** panda has joined #openstack-nova10:46
*** luksky has quit IRC10:48
*** panda is now known as panda|rover10:48
openstackgerritAdam Spiers proposed openstack/nova master: Move patch_exists() to nova.test.TestCase for reuse  https://review.opendev.org/66050010:57
*** gmann has joined #openstack-nova11:02
openstackgerritAdam Spiers proposed openstack/nova master: Move patch_exists() to nova.test.TestCase for reuse  https://review.opendev.org/66050011:07
*** tesseract has quit IRC11:10
*** luksky has joined #openstack-nova11:17
*** psachin has quit IRC11:29
*** markvoelker has joined #openstack-nova11:32
*** ttsiouts has joined #openstack-nova11:33
*** cdent has quit IRC11:35
*** cdent has joined #openstack-nova11:36
*** psachin has joined #openstack-nova11:38
lyarwoodhas anyone ever seen LM try to claim resources on the source at the start of a migration?11:39
lyarwoodIHAC suggesting that's happening but that seems totally wrong11:39
*** cdent has quit IRC11:40
*** dave-mccowan has joined #openstack-nova11:46
*** ratailor has quit IRC11:47
*** ratailor has joined #openstack-nova11:48
*** amorin has quit IRC11:49
*** dave-mccowan has quit IRC11:50
*** amorin has joined #openstack-nova11:51
*** cdent has joined #openstack-nova11:55
*** markvoelker has quit IRC11:57
*** xek has joined #openstack-nova12:02
openstackgerritAdam Spiers proposed openstack/nova master: Move patch_exists() to nova.test.TestCase for reuse  https://review.opendev.org/66050012:03
*** jaosorior has joined #openstack-nova12:06
*** ttsiouts has quit IRC12:06
*** psachin has quit IRC12:08
efriedstephenfin: in case you didn't see http://eavesdrop.openstack.org/irclogs/%23openstack-nova/%23openstack-nova.2019-05-24.log.html#t2019-05-24T06:31:3912:09
openstackgerritAdam Spiers proposed openstack/nova master: Move patch_exists() to nova.test.TestCase for reuse  https://review.opendev.org/66050012:09
*** derekh has quit IRC12:15
aspiersgod sometimes Python really pisses me off12:15
aspiersPython 2/3 unicode stuff is a never-ending nightmare12:16
aspiersI'm literally seeing heisenbugs with it12:17
aspiersone minute everything's fine, the next it magically breaks12:17
cdenti've never met a language that does it in a way that makes me happy. python is one of many that make me want to flip tables12:17
cdentaspiers: bugs on your own machine or remote nodes? if the latter there are so many variables involved (system encoding, python build, blah blah)12:17
aspierslocal12:18
* cdent flips all the tables12:18
aspiersI'm on the verge12:18
aspiersI just want to slurp in the contents of __file__ (sounds weird I know, but there is a good reason) and then do a substring match on it12:18
aspiersif I try to decode('utf-8') on each line I get UnicodeDecodeError: 'utf8' codec can't decode byte 0xf3 in position 1: unexpected end of data12:19
aspiersdespite it being an ASCII file (nova/nova/tests/unit/test_test.py)12:19
*** dims has left #openstack-nova12:20
aspiersascii is no better: UnicodeDecodeError: 'ascii' codec can't decode byte 0xf3 in position 1: ordinal not in range(128)12:20
aspiersWTF!!  I literally ran the test twice in a row and it only failed the second time!!12:24
aspiersOK, I've spotted a pattern12:27
aspiersIt always passes on py3612:27
aspiersIt *only* passes on py27 if I run it after editing and saving the file12:28
aspiersIf I re-run it, subsequent runs fail12:28
* aspiers <- head explodes12:28
aspiersWT actual F12:28
*** eharney has joined #openstack-nova12:30
cdentif __file__ always the .py file and never the .pyc file?12:31
aspiersohhhhhhh12:31
cdents/if/is/12:31
aspiersyou genius12:31
aspiersthat will be it12:31
cdentit was a guess12:31
aspiersthat would perfectly explain it though12:32
cdentmy genius is never expecting things to be sensible12:32
aspiershaha12:32
cdent"what if it's this totally stupid thing that shouldn't happen"12:32
aspiersmy folly is always expecting things to be sensible12:32
cdenthow long have you been in this business ? :)12:32
aspiers(except in politics. I'm not *that* stupid)12:32
*** ttsiouts has joined #openstack-nova12:35
aspierscdent: works! __file__.rstrip('c')12:35
cdentwacky12:35
aspiersyou just saved me bashing my head over this for hours - THANK YOU12:35
cdentyou're very welcome12:35
aspiersno amount of rubber-ducking would have helped on that one, I fear12:36
* cdent quacks12:36
aspiers:)12:36
openstackgerritAdam Spiers proposed openstack/nova master: Move selective patching of open() to nova.test for reuse  https://review.opendev.org/66126612:38
aspierscdent: ^^^12:38
aspiersBTW I think I fixed the contextlib issue in the parent of that one12:39
*** ttsiouts has quit IRC12:39
cdent\o/12:41
efriedtonyb: done12:51
aspierscdent: just noticed some functional tests are calling os.path.exists('/etc/policy.d')12:54
aspierspresumably that's not good in a functional test environment12:55
cdentThere's a few bleeds like that in the nova functional tests12:55
aspiersalso .netrc, yikes12:55
cdentI remember there were some (since fixed) I think, which would be change behavior by using the /etc/nova/nova.conf12:56
aspiersmy new helpers might help clean some of those cases up12:58
*** tbachman has joined #openstack-nova13:05
openstackgerritSilvan Kaiser proposed openstack/nova stable/stein: Fixes multi-registry config in Quobyte driver  https://review.opendev.org/66070613:06
*** jistr is now known as jistr|afk13:11
*** mriedem has joined #openstack-nova13:12
*** ttsiouts has joined #openstack-nova13:12
*** awalende has quit IRC13:16
*** awalende has joined #openstack-nova13:16
*** awalende_ has joined #openstack-nova13:20
*** awalende has quit IRC13:21
*** dpawlik has quit IRC13:23
openstackgerritEric Fried proposed openstack/nova master: Clarify --before help text in nova manage  https://review.opendev.org/66128913:24
*** awalende_ has quit IRC13:24
*** boxiang has joined #openstack-nova13:29
*** ykarel is now known as ykarel|meeting13:29
stephenfinefried: I did not. Thanks for the heads up :)13:29
*** luksky has quit IRC13:31
stephenfinefried: So TripleO is being handled (mschuppert is on the case) and I'm working through the Kolla change. That's pretty much all we need, right?13:35
*** cmart has quit IRC13:35
efriedstephenfin: IIUC from yesterday's meeting, yes.13:37
lyarwoodmriedem: prior to https://review.opendev.org/#/q/topic:bug/1469179+(status:open+OR+status:merged) the only real way to avoid over allocation issues when booting from volumes was to use flavors with a disk size of 0 right?13:37
efriedstephenfin: Assume you caught up on those logs (which prompted your ML thread)?13:38
*** ttsiouts has quit IRC13:40
*** tbachman has quit IRC13:40
*** ratailor has quit IRC13:40
*** ttsiouts has joined #openstack-nova13:40
mriedemlyarwood: sounds right13:49
openstackgerritDan Smith proposed openstack/nova master: Make nova-next archive using --before  https://review.opendev.org/66100213:49
*** mlavalle has joined #openstack-nova13:50
mriedemcan i get a novaclient core to review https://review.opendev.org/#/c/659886/ since it's holding up a release? both tssurya and i have things in the upcoming novaclient release that are needed for enabling changes in other projects (osc and watcher)13:50
*** lorenjan has quit IRC13:50
*** betherly has joined #openstack-nova13:55
efrieddansmith:13:55
efriedefried@efried-ThinkPad-W520:~/openstack/nova$ f() {13:55
efried> echo $113:55
efried> }13:55
efriedefried@efried-ThinkPad-W520:~/openstack/nova$ f $(date)13:55
efriedFri13:55
efriedefried@efried-ThinkPad-W520:~/openstack/nova$ f "$(date)"13:55
efriedFri May 24 08:53:45 CDT 201913:55
efriedunless argparse is doing something magical...13:55
dansmithefried: what?13:56
*** jistr|afk is now known as jistr13:56
efriedsee comment on https://review.opendev.org/66100213:56
aspiersYeah, that comment looks right13:56
dansmithoh yeah13:56
openstackgerritDan Smith proposed openstack/nova master: Make nova-next archive using --before  https://review.opendev.org/66100213:57
aspiersefried: BTW I can't believe you are still using a W52013:57
aspiersI mean, I am too, but only as a VPN server hidden under my desk13:57
efriedaspiers: Heh, it was my IBM-provided workstation, did a nice SSD and memory upgrade, it was really stable and solid, so I bought it back when I left.13:58
aspiersMine weighed a ton and the battery used to last 2 hours on a good day13:58
efriedMy Intel-provided HP thingy has roughly equivalent specs, weighs about a quarter as much... and performs like a dog.13:58
efried...and won't run not-Windows.13:58
efried(because of Intel gorp)13:59
efried(without major hoops of fire)13:59
*** ttsiouts has quit IRC13:59
*** betherly has quit IRC14:00
*** whoami-rajat has quit IRC14:00
*** hemna has quit IRC14:05
openstackgerritSilvan Kaiser proposed openstack/nova stable/stein: Fixes multi-registry config in Quobyte driver  https://review.opendev.org/66070614:05
*** hemna has joined #openstack-nova14:05
*** hongbin has joined #openstack-nova14:06
*** tbachman has joined #openstack-nova14:08
mriedemefried: we might want to see https://review.opendev.org/#/c/661002/ working in action before +2ing :)14:10
efriedmriedem: It worked at a previous PS14:11
efriedas in, didn't blow up14:11
efriedsyntax weirdness notwithstanding14:11
efriedi.e. archived rows older than [earlier date than intended]14:11
efriedpresumably all the rows in that job are newer than "yesterday" anyway, so the change is a no-op?14:12
mriedemhttp://logs.openstack.org/02/661002/1/check/nova-next/d88596c/logs/devstack-gate-post_test_hook.txt.gz#_2019-05-23_23_53_30_03114:12
mriedemnova-manage: error: unrecognized arguments: --max-rows14:12
efriedoh, that doesn't cause the test to fail? Guess that should be fixed :)14:12
mriedemwe should probably have a set -e around this b/c it's blowing up but not failing14:12
efriedokay14:13
*** bnemec is now known as beekneemech14:14
*** stephenfin is now known as finucannot14:14
mriedemset -e in a separate change so we can backport it imo14:14
finucannotefried: I did, yup, cheers. The kolla patch has a +2 already too. Anything else needed to get that approved now? :)14:15
openstackgerritDan Smith proposed openstack/nova master: Make nova-next archive using --before  https://review.opendev.org/66100214:21
*** _erlon_ has joined #openstack-nova14:22
*** jaosorior has quit IRC14:23
*** Anticime1 is now known as Anticimex14:26
melwittdansmith: do you recall whether we exclude certain exceptions from incrementing the build_failed counter for the auto-disabling of computes? I thought it was discussed in the past but looking at the code I don't see anything like that https://github.com/openstack/nova/blob/stable/queens/nova/compute/manager.py#L178914:30
dansmithyes, there are two classes14:31
dansmithso things we catch and handle in a certain way don't count,14:32
dansmithand anything we don't catch counts14:32
dansmithand then some gray in the middle14:32
dansmithand part of the problem is we don't do nearly as good at distinguishing as we thought we did14:33
dansmithhmm, although looking again, maybe we lumped them all together in the end after that realization14:34
*** jchhatbar has joined #openstack-nova14:34
melwittyeah, I don't remember this, I thought there was some kind of whitelist that wouldn't increment the counter but not seeing it now14:34
dansmithI think it was the failed/rescheduled result14:35
dansmithbut we increment for both now14:35
melwittI see14:35
*** janki has quit IRC14:36
openstackgerritEric Fried proposed openstack/nova master: Clarify --before help text in nova manage  https://review.opendev.org/66128914:36
mriedemthere is no whitelist14:36
mriedemwe talked about it14:36
mriedembut it got out of hand14:36
mriedemi had a wip patch for a bit for one type of exception but it was gross and dropped it14:37
*** liuyulong has joined #openstack-nova14:37
mriedemthere are about a billion ways you can get an exception in that build path in the compute14:37
dansmithyeah, that's what I mean by we thought it would be clearer which things are terminal failures or not, but it's not14:37
mriedemand since it's python and not strictly typing any of that stuff, we can't really just catch one14:37
dansmithwell, I dunno about _that_ :)14:38
mriedemwell, meaning like in java where you have to declare and explicitly handle exceptions that get raised up during compiling14:39
dansmithah sure14:39
dansmithi.e. "the most annoying part of java"14:39
*** ttsiouts has joined #openstack-nova14:39
mriedemzvm driver could add a new MainframeKaputError to raise and we'd not handle it14:39
mriedemif you're ocd like me it's not so bad :)14:39
mriedemi'd like some credit for my witty use of a german word in that zvm joke please...14:40
melwittok, this is sounding familiar now, I vaguely remember your wip patch about it14:40
melwittoh yeah, kaputt, nicely done14:40
mriedemmelwitt: so is your customer on just a release that is too old to have the weigher?14:41
efriedfinucannot: If two out of three of melwitt, mriedem, and sean-k-mooney ack it, I'll be happy to approve it (nova-consoleauth removal bp)14:41
melwittno, they have the weigher, it's just incrementing the counter for user-caused build failures we think14:41
dansmiththat's the whole thing14:42
dansmiththat's why we moved it to a weigher14:42
dansmithbecause we suck so bad at knowing why a thing failed that auto-disable isn't really sane14:42
mriedemmelwitt: malicious user?14:42
dansmithso we made it a weigher so it doesn't take it out of rotation and you can adjust how much score you give to that decision14:42
melwittmriedem: I doubt it but not sure14:42
mriedemso jed at the bank wrote a script to boot 100 vms at once and it's DoSing their private cloud14:43
melwittdansmith: got it. yeah, IIUC this customer is trying to pack instances and they see it start spreading instances when they don't want it to, because of the user-caused failures14:43
*** jchhatbar has quit IRC14:44
*** ttsiouts has quit IRC14:44
dansmithmelwitt: right so null out that weigher and move on14:44
*** ivve has quit IRC14:45
melwittmdbooth: turns out I was mistaken. we had tried some exception whitelist patches but it got unwieldy, so that idea was abandoned. the recommended way to handle this scenario is to disable the failed build weigher14:45
mdboothmelwitt: Ack, thanks.14:46
*** cdent has quit IRC14:46
*** Sundar has joined #openstack-nova14:47
*** ykarel|meeting is now known as ykarel14:48
mriedemonce again let me state i dislike the non-border table rendering in our docs now https://docs.openstack.org/nova/latest/admin/configuration/schedulers.html#id1714:48
gibimriedem: do you have opinion about the CLI syntaxt propsal in https://review.opendev.org/#/c/651783/3/osc_placement/resources/resource_provider.py@130 ?14:51
*** boxiang has quit IRC14:53
*** KH-Jared has joined #openstack-nova14:54
mriedemsec14:54
gibithanks14:54
finucannotmriedem: I'll trade you a fix for that table issue for your ack on my remove-consoleauth bp15:02
finucannotbeen meaning to get to that for weeks now15:03
*** itlinux has joined #openstack-nova15:03
*** markmcclain has joined #openstack-nova15:07
*** jangutter has quit IRC15:10
*** gyee has joined #openstack-nova15:13
*** itlinux has quit IRC15:15
aspiersefried: asking on #openstack-requirements about the contextlib2 issue, I have not much clue what's going on15:17
*** itlinux has joined #openstack-nova15:17
aspiersbizarrely it's moaning about sphinx too15:17
aspierswhere's stephenfin when you need him?15:17
*** hemna has quit IRC15:17
*** hemna has joined #openstack-nova15:17
finucannotaspiers: I'm here (it's Friday)15:18
aspiershah15:18
* aspiers learns his nova IRC nicks15:18
openstackgerritArnaud Morin proposed openstack/nova master: Force refresh instance network info on deletion  https://review.opendev.org/66076115:18
finucannotWhat's the specific Sphinx issue?15:19
finucannotaspiers: ^15:19
aspiersfinucannot: see #openstack-requirements15:19
*** itlinux has quit IRC15:19
*** ttsiouts has joined #openstack-nova15:20
finucannotMan, I wish we'd drop Python 2 this cycle instead of in U :(15:20
*** luksky has joined #openstack-nova15:21
*** itlinux has joined #openstack-nova15:21
amorinhi efried, mriedem and others. I updated the patch to refresh instance network cache on deletion, thanks for commenting/reviewing!15:22
gansodansmith: Hi! When you have a few minutes, could you please take a look at and sign off https://review.opendev.org/#/c/659338/ if everything is ok? thanks in advance =)15:23
openstackgerritAdam Spiers proposed openstack/nova master: Move patch_exists() to nova.test.TestCase for reuse  https://review.opendev.org/66050015:24
dansmithganso: I was going to challenge the directness of your query, but I see tonyb threw me under the bus, so.. okay :)15:24
mriedemefried: finucannot: i've put my ack in https://blueprints.launchpad.net/nova/+spec/remove-consoleauth15:25
*** owalsh has quit IRC15:25
mriedemamorin: questions in there,15:30
mriedembut without the force will this even fix the case in your bug? have you tested it without the force_refresh=True?15:31
mriedemb/c the point of the force flag was to rebuild the cache from neutron rather than the cache itself15:31
*** owalsh has joined #openstack-nova15:32
*** cdent has joined #openstack-nova15:33
amorinmriedem: good point15:33
amorinhum15:34
amorinI need to test that, it worked in my dev infra, but I need to double check that15:34
*** macza has joined #openstack-nova15:34
amorinI wont have time today, but I will do next week15:35
mriedem"it" being the original patch with force_refresh=True?15:35
mriedemor the current version of https://review.opendev.org/#/c/660761/3 ?15:35
mriedemif terminate is just racing with build and the copy of the instance during terminate just doesn't have the info cache data from the db, then the refresh should fix your problem i think,15:36
mriedembut if the info cache in the db is empty when you refresh, that bottom change isn't going to help15:36
amorinthe cache is populated in the DB15:36
mriedemit won't make things worse of course15:36
mriedemok, well in that case you might be saved,15:36
amorinbut after the copy of instance is used for deletion15:37
amorinyup, anyway I will double check that point15:37
melwittdansmith: do you remember where we landed when we were talking about disabling oslo.messaging heartbeats in nova-api, as far as whether we would recommend running that configuration in our upstream docs?15:37
mriedembecause https://github.com/openstack/nova/blob/976d1b89c2b754729903291050e01c9bf49704b9/nova/network/neutronv2/api.py#L175715:37
sean-k-mooneyefried: sorry was on a call for the last 90 mins15:37
amorin_gather_port_ids_and_networks is using the nova cache DB right?15:37
sean-k-mooneyefried: did ye reach an agreement on the nova-console auth blueprint15:37
openstackgerritAdam Spiers proposed openstack/nova master: Move selective patching of open() to nova.test for reuse  https://review.opendev.org/66126615:37
melwittdansmith: this is about the idle nova-api wsgi app heartbeat error messages stuff15:38
mriedemamorin: yeah15:38
amorinmriedem, by the way, thanks for the tip about mocking in setup, I was pretty sure that this was possible15:38
mriedemamorin: yeah so tl;dr if you're just racing to refresh from the db on delete then your fix is probably ok15:38
amorinbut I am beginning in that subject15:38
mriedemnp15:38
mriedemi've only recently-ish started using that trick15:39
efriedsean-k-mooney: It sounds like all the deployment projects are either sorted or have patches in flight to sort. I was looking for you & others to vet and +1 basically so I could approve the bp15:39
melwittefried, finucannot: I'm also +1 on removing nova-consoleauth since it looks like everything is sorted with deployment tools15:39
sean-k-mooneyefried: kolla has an open bug but no progess in a while15:40
sean-k-mooneyif that is the only blocker i can go write the patches needed15:40
sean-k-mooneyit is already optional but it would jsut invovle deleting the container on upgrade an removing the deployment code15:40
efriedfinucannot: do you have any further words about kolla ^ ?15:41
sean-k-mooneyso yes im +1 on removing it and can help the kolla folks as needed since i used to work on kolla and kolla ansibel a few release ago15:41
finucannotsean-k-mooney: So there's more to it than this? https://review.opendev.org/#/c/661251/15:42
finucannotefried: ^15:42
sean-k-mooneythat removed it from kolla15:42
sean-k-mooneybut they also need to remove it in kolla ansible15:42
sean-k-mooneye.g. you just deleted teh container15:42
*** helenafm has quit IRC15:43
sean-k-mooneybut the upgrade playbook need to koll all running instance of the contain15:43
sean-k-mooney*kill15:43
finucannotAh, I figured there was more to it than that. I can try drafting those patches based on what they did in OSA. I'd need one to kill the running instances (which would be backported to stable/stein) then a follow up to remove all references entirely15:43
sean-k-mooneywe need to kill this handeler https://github.com/openstack/kolla-ansible/blob/master/ansible/roles/nova/handlers/main.yml#L112-L13615:44
efriedfinucannot: Okay, mriedem, melwitt, and sean-k-mooney are all +1, I'm approving the bp assuming the above is fairly easy and will be taken care of before we pull the trigger.15:44
sean-k-mooneyand remove like 2 or 3 other things but its a fairly small patch15:44
*** rpittau is now known as rpittau|afk15:45
finucannotsean-k-mooney: I'll do that before I clock off15:46
sean-k-mooneyok you need to kill the service default https://github.com/openstack/kolla-ansible/blob/master/ansible/roles/nova/defaults/main.yml#L83-L93 and as i sad just update https://github.com/openstack/kolla-ansible/blob/master/ansible/roles/nova/tasks/rolling_upgrade.yml and https://github.com/openstack/kolla-ansible/blob/master/ansible/roles/nova/tasks/legacy_upgrade.yml to nuke the running container on15:47
sean-k-mooneyupgrade15:47
*** itlinux has quit IRC15:49
sean-k-mooneylet me know if you need any help navagaing kolla-ansible15:49
*** Luzi has quit IRC15:50
*** itlinux has joined #openstack-nova15:50
mriedemgibi: replied, i think your suggestion is ok15:50
finucannotmriedem, efried: https://review.opendev.org/66134415:52
dansmithmelwitt: I think disabling heartbeats will increase recovery time if the connection goes stale, and will cause stale connections to stay open on the rabbit side15:53
dansmithmelwitt: so I think threads=1, leave heartbeat behavior15:53
*** ttsiouts has quit IRC15:54
melwittsean-k-mooney, mdbooth ^15:54
aspiersefried: I'm pondering whether my helpers fall foul of https://docs.python.org/3/library/unittest.mock.html#where-to-patch15:54
melwittdansmith: thanks15:54
*** itlinux has quit IRC15:55
*** ivve has joined #openstack-nova15:55
aspiersefried: although that wouldn't explain why they have been working so far15:56
*** ykarel is now known as ykarel|away15:56
gibimriedem: thanks.15:56
sean-k-mooneydansmith: if we leave teh hearbeat enable should we at least change the log level in the nova-api for the oslo messaging logs?15:57
sean-k-mooneyor the heart beat on in partiaclar15:57
efriedfinucannot: why does starlingx get different zebra stripe?15:59
dansmithI dunno, but we might want to soften the log level in o.msg where it raises holy hell about the reconnct15:59
efried*I* want special zebra stripe15:59
efriedaspiers: I'm going to have to get back to you (possibly much) later16:00
aspiersefried: it's OK. Attacking it with a debugger now16:00
sean-k-mooneydansmith: so we suggested that and got some push back to droping it to info in oslo16:00
*** dtantsur is now known as dtantsur|afk16:01
finucannotefried: Whoops, missed that16:01
finucannotfixed16:01
dansmithsean-k-mooney: so what is your thought, to specifically set o.msg level to error in nova-api?16:01
*** wwriverrat has joined #openstack-nova16:01
sean-k-mooneydansmith: effectivly but i think we can be more granular then that. we do this for some privesep stuff right?16:02
dansmithsean-k-mooney: we can set defaults on the logger name level, I think16:02
sean-k-mooneydansmith: actully setting it to error wont help as o.msg is currently loggin it as an error16:02
sean-k-mooneyso more we woudl have to install a log filter to speficaly drop that error16:03
dansmithsean-k-mooney: well, I dunno, it *really* seems like that shouldn't be an error, it's just a reconnect, which is exactly what it's supposed to be doing16:04
dansmithso whatever.. however you make it not show up, or leave it and document it or whatever, I care about that less16:04
dansmithwhat I do care about is that we not disable heartbeats just because it's making a statement in the logs we don't want16:04
mriedemmonkeypatch the oslo.messaging code!16:04
beekneemechmriedem: Crazy talk! :-P16:05
sean-k-mooneydansmith: ya i agree it not an error. maybe we could make it a warning in o.mesg and then set the log level to error for o.msg in nova16:05
dansmithsean-k-mooney: yeah16:05
mriedemif only DEFAULT_LOG_LEVELS wasn't a gd list16:07
*** betherly has joined #openstack-nova16:07
*** ykarel|away has quit IRC16:07
mriedemmakes it a pita to parse and override from code16:08
melwittdansmith: IIUC from past conversation, the ideal way to handle the eventlet monkey patch issue longer term would be to separate the wsgi app part of nova-api from the rest of it, so we don't have behavior that doesn't fit in with the wsgi app happening inside of it. what would that look like? a separate process for each of nova-api-wsgi and nova-api?16:08
dansmithmelwitt: yeah16:10
dansmiththat's how wsgi apps area really supposed to be done, AFAIK,16:11
dansmiththe wsgi bit is just the "view" and you delegate everything to the controller16:11
openstackgerritMatt Riedemann proposed openstack/nova stable/queens: Delete allocations even if _confirm_resize raises  https://review.opendev.org/65215316:11
openstackgerritMatt Riedemann proposed openstack/nova stable/queens: Add functional confirm_migration_error test  https://review.opendev.org/65813616:11
openstackgerritMatt Riedemann proposed openstack/nova stable/queens: [stable-only] Delete allocations even if _confirm_resize raises (part 2)  https://review.opendev.org/66134916:11
mriedemdelicious MVC16:12
*** betherly has quit IRC16:12
mriedemganso: ^ likely needs some work b/c the functional test in queens isn't failing when i remove the fix16:13
* cdent can smell mriedem yearning for java again16:13
mriedemstruts baby!16:13
*** BjoernT has joined #openstack-nova16:13
mriedemthat's the last time i did UI Work16:13
mriedem*work16:13
mriedemFaces and AJAX was a brand new world to me when i left that dept16:13
melwittdansmith: ok, cool. I wanted to get an idea about what's involved to see if we could make that happen this cycle.16:14
melwittto do it properly for future16:14
cdentmelwitt, dansmith : are you talking about a "controller" as a separate process?16:15
dansmithmelwitt: uh, that would be a significant undertaking16:15
dansmithcdent: yeah16:15
melwittoh :(16:15
cdentyeah, that's why I ask. It would be significant16:15
dansmithcdent: definitely16:15
*** brault has joined #openstack-nova16:16
cdenta somewhat shorter step would be to make a "scatter-gatther-cells-agent" that can be threaded (in whatever way) that the wsgi app talks to over a unix socket16:16
dansmithmelwitt: going from a thing designed to be eventlet-based to just slapping a wsgi frontend on it is the kind of thing that doesn't end with a small refactor to substantially change how it works :)16:16
*** hongbin has quit IRC16:17
cdentbut even that agent idea would be an undertaking16:17
melwittthe concern was, the near term things we can do are to somehow hide the log message and set threads=1. but what's the long term thing to do, or is the near term fix going to be the forever fix16:17
dansmithcdent: yeah, could do that, but that's a lot of deployment change for just that thing16:17
* mriedem goes to solve his short-term problem of an empty stomach16:17
cdentdansmith: yeah, no doubt. (I guess you could also fork it as needed)16:17
cdentcall scatter-gather-cells over rpc to the conductor ?16:18
* cdent is spitballing16:19
dansmithcdent: that will slow down instance list quite a bit16:19
* cdent nods16:19
*** hongbin has joined #openstack-nova16:19
cdentYeah, I don't really have much in the way of ideas. We've got impedance mismatches galore.16:19
cdents/ideas/ideas that can be realized/16:20
melwittyeah... just wanted to understand whether we'll have a next step here or if this is it. sounds like realistically, there's not more we could do16:20
dansmithyep.. IMHO, fix the short term and then let's properly design the bigger change after we see how that goes16:20
dansmithand not armchair quarterback it the day before a holiday weekend (for some of us) :)16:20
melwitthaha, no, I was only looking for a high-level ballpark idea of what it would look like16:21
*** brault has quit IRC16:21
* cdent was the one playing football16:21
melwittbecause when you said "separate wsgi part" initially, I didn't really know what that meant16:21
* cdent goes outside not for football16:25
*** cdent has quit IRC16:25
*** ccamacho has quit IRC16:30
*** itlinux has joined #openstack-nova16:32
*** davidsha has quit IRC16:32
*** tssurya has quit IRC16:44
openstackgerritMerged openstack/nova master: Add --before to nova-manage db archive_deleted_rows  https://review.opendev.org/55675116:50
*** manjeets has quit IRC16:51
*** ttsiouts has joined #openstack-nova16:56
*** hongbin has quit IRC16:58
aspiersgibi: you around? I'm trying to get a functional test to restart the compute service in a way which triggers init_host()16:58
aspiersgibi: and I see you worked on restart_compute_service()16:59
aspiersunfortunately that only restarts the Host, not the Service16:59
*** itlinux has quit IRC17:01
aspiersI tried self.compute.stop(); self.start_compute() but the latter invokes nova.test.TestCase.start_service() which borks by trying to create a duplicate HostMapping17:02
*** ricolin has quit IRC17:03
aspiersdansmith: I see your fingerprints on this code too :)17:03
aspiersThis is about the point where my newbie-ness makes it pretty hard to figure out a way forward17:03
*** ralonsoh has quit IRC17:06
openstackgerritAdam Spiers proposed openstack/nova master: Provide HW_CPU_AMD_SEV trait when SEV is supported  https://review.opendev.org/63868017:10
melwittaspiers: I don't know much about that but you reminded me of a patch of gibi's that might be of help https://review.opendev.org/51255217:11
aspiersmelwitt: thanks! yeah that's the exact area of code I'm talking about17:12
aspiersmaybe that change will yield some clues17:13
melwittthat's what I'm hoping :)17:13
*** ykarel|away has joined #openstack-nova17:14
dansmithaspiers: "restart the host not the service" does not make sense to me, fwiw17:14
*** itlinux has joined #openstack-nova17:15
dansmithaspiers: calling start_service() again is not the right thing to do, because it's really create_and_start_service() I think17:15
aspiersyes, I think that's the problem17:15
dansmithaspiers: if you want to re-run init_host() I would just run it17:15
*** itlinux has quit IRC17:16
*** itlinux_ has joined #openstack-nova17:16
dansmithaspiers: the restart_compute_service() comment explains why doing a restart in a functional test isn't super realistic, and that's why it saves/restores the RT across the start/stop17:17
dansmithso yeah, I think you just want to re-run init if that's what you're trying to do17:18
aspiersOK, thanks17:18
aspiersI wasn't sure if init_host() depended on a bunch of other stuff being run before it in the same context17:18
dansmithwell, you can see what it's doing17:19
aspiersI guess self.basic_config_check() is the only real thing done before17:19
aspiersand I don't need to re-run that17:19
*** itlinux has joined #openstack-nova17:19
dansmithinit_host doesn't really do much other than run the driver's init really.. what about it do you need to restart?17:20
dansmithre-run the init_instance parts?17:20
*** itlinux_ has quit IRC17:20
aspiersthe SEV check I've added to it17:20
dansmithoh okay17:20
aspiersoh, actually I don't think this will be good enough by itself17:20
aspiersbecause the SEV check only sets an instance variable17:20
aspierswhich is later consumed in u_p_t() to provide the trait17:21
*** itlinux_ has joined #openstack-nova17:21
aspiersbut maybe _run_periodics() will take care of that17:21
*** BjoernT has quit IRC17:22
aspiersah, looks like my previous confusion was an incorrect assumption that in functional tests, self.compute is a Host object, but it's actually a Service or something17:23
dansmith...as in a real deployment17:24
aspiersI think I just misread the code somewhere17:24
aspiersor had a stack overflow in my brain17:24
*** itlinux has quit IRC17:25
*** ttsiouts has quit IRC17:30
aspiersinteresting LibvirtDriver.init_host() has an unused host parameter17:33
*** cmart has joined #openstack-nova17:39
*** JamesBenson has joined #openstack-nova17:46
*** itlinux_ has quit IRC17:47
artomaspiers, probably another virt driver needs it17:48
aspiersyeah probably17:48
aspiersartom: nope, none of them do17:50
artomaspiers, heh, you actually checked :)17:50
aspiersjust17:50
artomHistoric then, maybe? Still needed by something out of tree?17:51
aspiershttps://opendev.org/openstack/nova/commit/f51526b596f3d89cda2ec4501e117:51
aspiershistoric17:51
aspiersI think someone probably forgot to remove it during some redesign17:52
aspiersit was used here https://opendev.org/openstack/nova/src/commit/f51526b596f3d89cda2ec4501e19baf085c534e0/nova/virt/libvirt_conn.py#L16317:52
aspierswould be a bit of a pain to trawl through the git history to find out what removed it17:53
aspiersor maybe not17:53
aspierslet's see17:53
gansomriedem: hmm this is weird, as it failed in the gate on PS-3 (parent was fix part 1 only)17:53
*** pcaruana has quit IRC17:53
mriedemganso: i rebased the queens backport series so there is the (1) original fix, (2) the part 2 fix which makes the functional test pass and then (3) the functional test17:55
*** bbowen_ has quit IRC17:55
mriedemthe problem is, i commented out the fix from (2) and (3) still passed on queens17:55
mriedemjust got back to my desk though so haven't investigate17:55
mriedem*investigated17:55
aspiersartom: found it https://opendev.org/openstack/nova/commit/8d97118be776fcaad3053d1f93f61d339685a4ae17:56
gansomriedem: hmm this is weird, as it failed in the gate on PS-3 (parent was fix part 1 only)17:57
*** ttsiouts has joined #openstack-nova17:57
artomaspiers, OK :)17:57
aspiersartom: should I submit a patch to remove it? seems like technical debt17:57
aspiersah, would that break out-of-tree drivers?17:58
aspiersin fact, are there any?17:58
mriedemseems like ever since we enabled novnc in the multi-cell job there is a novnc test that continues to fail17:58
aspierslol, this is how little I know about nova17:58
artomaspiers, https://github.com/openstack/nova-powervm17:58
mriedemwhich is maybe bad for the consoleauth removal stuff...17:58
aspiersartom: weird, so why is powervm also in-tree?17:59
mriedemaspiers: there are several out of tree drivers17:59
artomaspiers, ¯\_(ツ)_/¯17:59
aspiers:)17:59
aspiersmriedem: thanks17:59
mriedemthe out of tree drivers are maintained by vendors that push more features into them than what's in the in-tree versions17:59
aspiersgotcha17:59
mriedemsome aren't in tree at all, like lxd17:59
aspiersthought it might be something like that17:59
aspiersOK, so just randomly removing parameters from the ComputeDriver interface isn't cool18:00
mriedemsome we could probably deprecate from in-tree like xen, zvm and powervm18:00
mriedemthere are no guarantees on the virt driver interface since it's internal and not versoined18:01
mriedem*versioned18:01
mriedemwe generally try to be nice and at least email to the list if we're changing an interface18:01
aspiersright18:01
mriedemalso, yup, enabling n-novnc in the multi-cell job on 5/20 is when the test started failing https://github.com/openstack/nova/commit/c5b83c3fbca83726f4a956009e1788d26bcedde0#diff-7415f5ff7beee2cdf9ffe31e12e4c08618:02
*** ttsiouts has quit IRC18:02
mriedemmelwitt: finucannot: ^18:02
mriedemi'm going to report a bug since i don't have time to dig in,18:02
mriedembut that could put a damper on the remove-consoleauth shindig18:02
aspierswell I suppose I could at least add a NOTE to the base class referencing https://opendev.org/openstack/nova/commit/8d97118be776fcaad3053d1f93f61d339685a4ae and https://opendev.org/openstack/nova/commit/f51526b596f3d89cda2ec4501e118:02
aspiersto save anyone else from needing ancient archaelogy to understand why the parameter is there18:03
mriedemi've walked backward into this conversation so i don't know what the context is18:03
aspiersmriedem: I just noticed that the host parameter to ComputeDriver.init_host() is never used18:03
mriedemi'd say we want the host arg in init_host18:03
aspiersat least, not by anything in-tree18:03
mriedembecause we can use that for our fake virt driver to avoid global config18:03
mriedemin fact i have a patch doing that i think18:04
aspiersoh, I didn't think to check the fakevirt driver18:04
mriedemCONF.host doesn't work in tests when you have multiple computes18:04
mriedemhttps://review.opendev.org/#/c/656709/2/nova/virt/fake.py@14918:04
artomWait, so we officially have a precedent for writing code *just* for CI to work?18:05
finucannotmriedem: I'll stick that on my queue to investigate early next week18:05
*** finucannot is now known as stephenfin18:05
aspiersmriedem: OK, that's a bit over my head at this stage so I'll just defer to your judgement and not do anything18:06
*** wwriverrat has quit IRC18:06
mriedemmelwitt: stephenfin: https://bugs.launchpad.net/nova/+bug/183041718:07
openstackLaunchpad bug 1830417 in OpenStack Compute (nova) "NoVNCConsoleTestJSON.test_novnc fails in nova-multi-cell job since 5/20" [Medium,Confirmed]18:07
aspiersat least the history is now in eavesdrop if we ever need it again18:07
artomaspiers, the premier searchable, accessible, information repository ;)18:08
aspiersartom ;-)18:08
*** cmart has quit IRC18:11
mriedemmelwitt: in this case i don't see anything going on in the novnc service for cell2 http://logs.openstack.org/73/638073/31/check/nova-multi-cell/6ea3306/controller/logs/screen-n-novnc-cell2.txt.gz18:16
mriedemand that's where the instance is18:16
openstackgerritMatt Riedemann proposed openstack/nova master: Skip novnc tests in multi-cell job until bug 1830417 is fixed  https://review.opendev.org/66137118:24
openstackbug 1830417 in OpenStack Compute (nova) "NoVNCConsoleTestJSON.test_novnc fails in nova-multi-cell job since 5/20" [Medium,Confirmed] https://launchpad.net/bugs/183041718:24
mriedemefried: you said in the nova meeting yesterday that the multi-cell job was wonky, well, that's why ^18:24
mriedemoooo this is great http://logs.openstack.org/04/656304/1/check/nova-grenade-live-migration/083d5ec/logs/subnode-2/screen-n-cpu.txt.gz?level=TRACE#_May_24_16_55_08_21908318:26
mriedemMay 24 16:55:08.219083 ubuntu-bionic-rax-ord-0006536941 nova-compute[867]: ERROR nova.virt.libvirt.driver [-] [instance: da0c957c-7a7d-4673-bd2d-0336d22f6fff] Live Migration failure: internal error: process exited while connecting to monitor: Failed to initialize module: /usr/lib/x86_64-linux-gnu/qemu/block-rbd.so May 24 16:55:08.219083 ubuntu-bionic-rax-ord-0006536941 nova-compute[867]: Note: only modules from the same build18:26
mriedem be loaded.18:26
*** ttsiouts has joined #openstack-nova18:29
mriedemi now realize there was no release note with the eventlet monkeypatch change in stable/stein https://review.opendev.org/#/c/647310/18:36
mriedemi wonder if we should have had a release note on that18:36
mriedemto at least mention overriding that with OS_NOVA_DISABLE_EVENTLET_PATCHING18:36
melwittmriedem: gah, looking18:37
sean-k-mooneymriedem: well OS_NOVA_DISABLE_EVENTLET_PATCHING is not ment to be a public thing for people to use18:37
mriedemiow, because of bug 1829062 on stable/stein, should we have a release note for ^18:37
openstackbug 1829062 in StarlingX "nova placement api non-responsive due to eventlet error" [Critical,In progress] https://launchpad.net/bugs/1829062 - Assigned to Gerry Kopec (gerry-kopec)18:37
mriedemsean-k-mooney: starlingx is using it18:37
sean-k-mooneyit was intended for use in tox18:37
sean-k-mooneymriedem: well i know mdbooth never intended it to be used by anything in production18:38
mriedemok no reno it is18:39
sean-k-mooneyhave we told starling x that they can work around the issue by seting the wsgi server threads=1 instead18:40
mriedemi haven't18:40
mriedemand there is nothing in that bug18:40
mriedemas far as i know when you guys all talk about this every other week none of the results of the conversation are written down publicly18:40
sean-k-mooneyok... ill try and write something up as a post to the mailing list on monday18:42
*** Sundar has quit IRC18:43
*** xek has quit IRC18:47
melwittmriedem: I think what's wrong is that the vnc proxy urls in the nova-cpu.conf files are both using port 6080 when one of them (the compute in cell2) should be using 608418:47
melwitthttp://logs.openstack.org/73/638073/31/check/nova-multi-cell/6ea3306/controller/logs/etc/nova/nova-cpu_conf.txt.gz vs http://logs.openstack.org/73/638073/31/check/nova-multi-cell/6ea3306/compute1/logs/etc/nova/nova-cpu_conf.txt.gz18:48
*** pcaruana has joined #openstack-nova18:48
*** xek has joined #openstack-nova18:48
*** pcaruana has quit IRC18:48
mriedemah yup18:48
melwittand from what I recall in devstack, I don't know how we can configure separate ones and use them in an easy way18:49
mriedemwe can pass that information into each node from the job config if we want to hard-code them18:49
melwittoh, I see, so if we know the multi-cell job only has two cells and we know what cell2 port is going to be, hard-code it into the job config18:50
mriedemright we have NOVA_SERVICE_LISTEN_ADDRESS in devstack18:51
mriedemfor the host18:51
mriedemwhich points back at the controller18:51
melwittbecause having devstack properly set up a separate nova-cpu.conf per cell and then getting the right ones passed in job configs would be a pretty big change18:51
mriedemi think we'd add a devstack variable alongside NOVA_SERVICE_LISTEN_ADDRESS but for NOVNC_PROXY_PORT or something and default to '' but allow the job to set that via zuul/ansible on the subnode18:52
melwittI think anyway. I wondered about this when I did the patch for the different ports to avoid collisions but then it was working in your devstack env so I thought I was missing something. I briefly looked through thinking how different nova-cpu.conf could be made and it didn't look straightforward18:53
*** xek_ has joined #openstack-nova18:53
*** xek has quit IRC18:54
mriedemit might just be a matter of taking the NOVA_CPU_CELL variable into account on the subnode18:55
mriedemhttps://github.com/openstack/nova/blob/master/.zuul.yaml#L27518:55
melwittoh, and letting devstack configure nova-cpu.conf accordingly? that would be a better way, if we can. because I'm thinking, in theory, there's more than just the novnc port, there's all the other possible console proxy ports too18:56
sean-k-mooneymriedem: wait that bug was for the placemnet api18:56
mriedemso rather than this https://review.opendev.org/#/c/649473/3/lib/nova@61318:56
sean-k-mooneywe dont monkeypatch the placmenet api?18:57
mriedemmaybe configure_console_proxies should just be using NOVA_CPU_CELL as the offset18:57
mriedembut...18:57
melwittoh, hmm18:57
mriedemthen we won't have separate n-novnc services for cell1 and cell2 on the controller18:57
mriedemsean-k-mooney: in stein, placement is in nova18:58
mriedemfor some18:58
sean-k-mooneystarlingx is not using extracted plamcent?18:58
*** lbragstad has joined #openstack-nova18:59
sean-k-mooneyeventls and wsgi shoudl normally be ok if you dont have long lived threads like the heartbeat18:59
sean-k-mooneyso even using the intree placment they should have been fine18:59
sean-k-mooneyoh they are hitting the other issue19:00
sean-k-mooneywith the cannot switch to a different trhead19:00
mriedemmost people are probably not going to be using extracted placement at stein ga19:01
sean-k-mooneymy comment still applies then19:01
*** luksky has quit IRC19:01
mriedemyour comment about threads=1?19:01
sean-k-mooneyya https://bugs.launchpad.net/nova/+bug/1829062/comments/719:01
openstackLaunchpad bug 1829062 in StarlingX "nova placement api non-responsive due to eventlet error" [Critical,In progress] - Assigned to Gerry Kopec (gerry-kopec)19:01
mriedemif so, then my comment about a known issue release note applies :)19:02
sean-k-mooneyif they set wsgi to use 1 thread per process then they wont get the context swtich error19:02
sean-k-mooney*thread switch19:02
melwittmriedem: I'm actually thinking that in devstack lib/nova, we could use NOVA_CPU_CELL to mimic the other offset code to set the ports properly in the one nova-cpu.conf it writes. I'll give it a try and see if it could work19:02
*** ttsiouts has quit IRC19:03
sean-k-mooneyalso i spend the last 5 minutes trying to figure out how the starling x repos work and i am more confused then when i started19:03
openstackgerritAdam Spiers proposed openstack/nova master: Provide HW_CPU_AMD_SEV trait when SEV is supported  https://review.opendev.org/63868019:03
melwittI didn't realize there was a variable that would tell us what cell we (nova-compute) we are19:03
melwitt-we19:04
aspiersefried: this might be ready to go now ^^^19:04
sean-k-mooneymelwitt: we do?19:04
mriedemmelwitt: ok, yeah that's probably easy to test with a nova patch depending on it since nova-multi-cell doesn't run on devstack changes (we could add it to the devstack experimental queue for testing stuff like this on-demand)19:04
mriedemi'd just add nova-multi-cell to devstack experimental, i can push a change for that19:04
melwittsean-k-mooney: yeah NOVA_CPU_CELL, zuul job configs can set it19:04
sean-k-mooneymelwitt: oh a devstack variable19:04
melwittyes19:04
melwittmriedem: ok, sounds good19:05
sean-k-mooneyah ok i taught you ment in the nova code19:05
melwittheh, yeah no19:05
openstackgerritAdam Spiers proposed openstack/nova master: Reduce logging of host hypervisor capabilities to DEBUG level  https://review.opendev.org/66137919:05
efriedaspiers: I was partway through PS12, was going to suggest moving your trait setting into _get_cpu_traits itself. And also, where's the bit where the SEV trait was going to be moved under the X86 namespace?19:09
aspiersefried: I did consider putting it in _get_cpu_traits but that already seemed pretty over-grown19:10
openstackgerritMerged openstack/nova stable/rocky: [stable-only] Delete allocations even if _confirm_resize raises (part 2)  https://review.opendev.org/65933819:11
openstackgerritMerged openstack/nova stable/rocky: Add functional confirm_migration_error test  https://review.opendev.org/65883419:11
aspiersefried: maybe it can be split up19:11
*** xek__ has joined #openstack-nova19:11
aspiersefried: oh, I remember now - that wasn't the only reason19:11
aspiersefried: I'm still behind with the latest news on the great CPU trait taxonomy debate19:13
aspiersefried: this None check gave me pause https://opendev.org/openstack/nova/src/branch/master/nova/virt/libvirt/driver.py#L677719:14
*** xek_ has quit IRC19:14
aspiersefried: *theoretically* all the traits could vanish from a host, and then this code would fail to remove them19:14
aspiersbut if that actually happened, maybe it would be a sure sign things have gone very badly wrong, in which case maybe that's a desirable accident anyway19:15
dansmithefried: mriedem I'll admit I had my head on backwards with what I was expecting, but: http://logs.openstack.org/02/661002/4/check/nova-next/39819ea/job-output.txt.gz#_2019-05-24_18_14_23_95446919:16
dansmithseems to work to make sure we don't archive recent stuff19:16
mriedemi just -1ed that :)19:16
mriedemwe are sympatico19:16
dansmithmy intent was to make the job actually call this with "tomorrow" or something19:16
mriedemheh yeah19:16
efriedokay, that was clear to me, thought it was intentional. But I guess if this is the only place we test the script, it makes sense to invoke it in a way that does something.19:18
*** whoami-rajat has joined #openstack-nova19:18
dansmithefried: I was expecting to actually land this, so it needs to not not run19:18
dansmithI was expecting to make it do what we do today, but with a date, which would be tomorrow not yesterday19:19
dansmithbut in reality, making sure it doesn't eff up today's records when called for yesterday is what we wanted to sanity check19:19
mriedemplus note that purge failed19:19
mriedemas a result19:19
mriedembut b/c no set -e the job doesn't fail19:19
efrieddansmith: What about running it twice, once like this to make sure it outputs "Nothing was archived" and then the second time without --before (or with --before $(date -d tomorrow)) to make sure it works?19:19
mriedemdansmith: you could run it both ways19:19
mriedemjinx19:19
dansmithyep, doing that now19:19
efriedwhat went wrong with purge?19:20
efriedAnd is there a fup change to turn on -e?19:20
mriedempurge fails with rc=3 if nothing was purged19:20
mriedem"fails"19:20
mriedemi'm all fup'ed out19:20
efriedyeah, so 3 shouldn't be a failure condition, from the pov of the calling script, right?19:22
openstackgerritDan Smith proposed openstack/nova master: Make nova-next archive using --before  https://review.opendev.org/66100219:24
dansmithI'm just about done for the holiday, we can finish this up next week19:24
*** kaiokmo has joined #openstack-nova19:24
efriedaspiers: If the traits all vanished, that dict would still not be None. It would just have all False values.19:25
*** bbowen has joined #openstack-nova19:26
aspiersefried: OK, because CPU_TRAITS_MAPPING is hardcoded. Makes sense19:28
efriedyah19:28
aspiers_get_cpu_traits() is still too big though19:28
efriedaspiers: When mriedem was doing the trait reporting in the RT, I made sure we were covering that corner case19:28
aspiersnice19:29
efriedaspiers: Sure, I would be fine splitting that up.19:29
aspiersefried: well, shouldn't be too hard to do that and then move the SEV bit in there19:29
aspiersefried: other than that and maybe some trait taxonomy thing which I don't yet know about, hopefully everything else is in order with this patch19:30
efriedaspiers: https://review.opendev.org/#/c/538498/6/nova/compute/resource_tracker.py@904 (patch look familiar?)19:30
aspiersefried: yeah, I do remember reading that and vaguely understanding19:31
aspiersI'm sure a re-read will make a lot more sense now19:31
*** ykarel|away has quit IRC19:35
*** panda|rover has quit IRC19:36
*** panda has joined #openstack-nova19:40
mriedemganso: ok so i think things in queens are ok except the functional test is failing because of some missing test handling stuff on teardown in queens, working on that now19:47
gansomriedem: thanks Matt! =D19:48
openstackgerritMatt Riedemann proposed openstack/nova stable/queens: Add functional confirm_migration_error test  https://review.opendev.org/65813619:52
mriedemganso: i think you can drop the -1 on https://review.opendev.org/#/c/652153/ now19:52
gansomriedem: yup, dropped =)19:53
kaiokmohey nova folks! I was wondering if something changed regarding how metadata are defined/handled by nova-scheduler20:07
kaiokmospecifically on the aggregate_instance_extra_specs filter20:08
kaiokmoI have a Pike deployment in which this filter works properly20:08
kaiokmolast week I deployed a environment on Rocky (tag is 18.1.7), and this filters is not working as expected20:10
*** ttsiouts has joined #openstack-nova20:10
kaiokmonova-scheduler is ignoring the filter and all hosts are passing the filtering phase20:10
kaiokmonova.conf is configured as follows (only the aggregate_instance_extra_specs is enabled): http://paste.openstack.org/show/752054/20:12
kaiokmoI appreciate any help.20:12
kaiokmothanks in advance20:13
*** slaweq has quit IRC20:19
aspiersalright, I'm off. happy weekend o/20:22
melwittkaiokmo: not aware of any change off the top of my head. will look around the code to see if anything has changed in that area20:28
kaiokmomelwitt: thank you very much20:32
melwittI'm not seeing that anything has changed in the area20:36
melwittkaiokmo: have you enabled debug logging in nova-scheduler and look at what's going on? if so, do you see DEBUG messages like these? https://github.com/openstack/nova/blob/master/nova/scheduler/filters/aggregate_instance_extra_specs.py#L6420:37
*** aspiers has quit IRC20:38
*** amodi has quit IRC20:38
melwittinteresting, my devstack change started running on a node within 14 minutes of uploading it20:39
melwittreminds me of something in a ML thread where things in the nova queue have to wait longer because of something to do with zuul scheduling?20:40
efriedkaiokmo: Does this happen to be on a rebuild?20:41
kaiokmomelwitt: debug is set to True on nova.conf but can't see messages like that on nova-scheduler log20:42
kaiokmoefried: no. RUN_ON_REBUILD is set to false on the filter, and I'm only spawning new instances, not rebuilding an existing one20:43
*** ttsiouts has quit IRC20:43
melwittkaiokmo: hm, did you restart nova-scheduler after adding debug=True? that config should work. we'll probably need to look at the debug logs to get to the bottom of this because I didn't find any open bugs mentioning the AggregateInstanceExtraSpecsFilter and no recent code changes20:44
efriedkaiokmo: I'm looking specifically at https://review.opendev.org/#/c/523212/ which introduced RUN_ON_REBUILD and wondering if you're running afoul of the logic at https://review.opendev.org/#/c/523212/2/nova/scheduler/host_manager.py20:45
efriedthat's some complex stuff with check type and force hosts/nodes, I don't really understand it.20:46
efriedkaiokmo: If you felt like experimenting, you could try reverting (pieces of) that change and see if the problem goes away.20:46
kaiokmomelwitt: yes, I did restart the service. right now I can see some DEBUG messages on the logs20:47
melwittoh, good. argh, I just realized you had said the problem is all hosts are included, not the other way around20:48
kaiokmoefried: I can test reverting this change, although I don't feel like it is going to change anything for me20:48
melwittso you wouldn't see messages like that if it thinks the hosts are good when they should not be20:48
*** aspiers has joined #openstack-nova20:49
kaiokmoI see. I'm wondering if even the host_passes function is being called. I managed to put some LOG.debug myself inside the function, but none of my messages are displayed on the logs20:50
efriedkaiokmo: It's just the only thing that changed on pike in that particular filter, so it's the first suspect for root cause.20:50
efriedright20:50
melwittkaiokmo: does your flavor have extra_specs in it? because the filter code is showing it will "pass" the host if the flavor does not have any extra_specs20:50
melwittoh really, ok that's not good. lemme see...20:50
*** BjoernT has joined #openstack-nova20:51
kaiokmobtw, the is being loaded correctly by the scheduler. LOG.debug placed before host_passes appear on the logs20:51
melwittkaiokmo: this might be a problem with your config, the enabled_filters and available_filters are supposed to be lists. I wonder if that's what's wrong?20:53
melwittI can never remember how to format lists in ini conf20:56
melwittlooks like it's just comma separated: enabled_filters = RetryFilter,AvailabilityZoneFilter20:57
efriediiuc you're using the same config file as before?20:58
efriedkaiokmo: Oh, disregard the thing about reverting that patch. I misread your original issue, you said this was working on pike and broke on rocky. That patch went back to pike, so probably isn't to blame.20:59
melwittI was wondering the same thing. because according to this, it should be the class name only, not the whole path. but I don't know whether having the whole path would hurt or not20:59
kaiokmomelwitt: yes. metadata on aggregate is custom: "aggregate" with "test" value20:59
kaiokmometadata on flavor is "aggregate_instance_extra_specs:aggregate" with "test" value21:00
melwittkaiokmo: when you added your own debug messages, did you put one here? https://github.com/openstack/nova/blob/master/nova/scheduler/filters/aggregate_instance_extra_specs.py#L48 wondering if you would see that one?21:01
kaiokmomelwitt: yes, it is LOG.debug("***OMG***") :)21:02
melwitthaha xD21:03
melwittdid you put one right at the beginning of host_passes before anything else? because if that's not showing, then you're right the filter isn't running21:04
melwittif the filter isn't running, I'd try to use config options like available_filters = AggregateInstanceExtraSpecsFilter enabled_filters = AggregateInstanceExtraSpecsFilter and then see if you see the filter run21:05
kaiokmoyes. I put one on the beginning of the function, and it is not showing. but, I also did put one right before the function and this one appears on the logs when the nova-scheduler service is restarted (which I think means that the filter was loaded)21:06
kaiokmothat's what I did: this is the filter section on nova.conf for nova-scheduler http://paste.openstack.org/show/752054/21:07
melwittyeah, I agree, the message you do see indicates the class was loaded by something21:07
melwittI'd try also changing available_filters to match enabled_filters, i.e. use class name only21:08
kaiokmook. trying that now21:08
melwittbecause available_filters is a superset of enabled_filters21:08
melwittif that doesn't work to get the filter to run, then I will be lost again21:09
openstackgerritMatt Riedemann proposed openstack/nova master: Fix hard-delete of instance with soft-deleted referential constraints  https://review.opendev.org/66139821:10
*** slaweq has joined #openstack-nova21:11
*** mriedem has quit IRC21:13
*** ttsiouts has joined #openstack-nova21:13
kaiokmomelwitt: well, if available_filter is class only, oslo importutils (called by nova-scheduler) doesn't seem to be able to load the filter21:14
kaiokmoValueError: Empty module name21:14
melwitt:\21:14
melwittok, then I guess the config help on that one is wrong. these available_filters and enabled_filters are supposed to work the same way21:16
melwittnext I would try commenting out the available_filters setting altogether and let it default to all filters21:16
melwittit will still only use what you have in enabled_filters and won't use any additional filters21:16
melwittand if that enables the filter to run, then it sounds like we have some kind of bug or config help doc problem with the available_filters option21:17
*** ttsiouts has quit IRC21:17
melwittI can't find any examples of it being used and not being left as the default, so I don't know it's intended to be set as a list21:18
melwitt*how it's intended21:19
*** slaweq has quit IRC21:24
*** whoami-rajat has quit IRC21:28
*** xek__ has quit IRC21:31
kaiokmomelwitt: did as you said. the filter was loaded (like before), but it did not run (also like before)21:34
melwitt... I don't understand what is going on21:36
melwittI'm going to try on my devstack21:37
*** yankcrime has quit IRC21:37
kaiokmome neither21:38
melwittok, when I appended AggregateInstanceExtraSpecsFilter to enabled_filters in devstack it runs the filter21:40
kaiokmoruns it or loads it? did you created a host aggregate and a flavor (both with the proper metadata)?21:42
*** ttsiouts has joined #openstack-nova21:42
melwittruns it, I added a log message right at the beginning inside the host_passes method21:42
melwittno, I didn't do anything with aggregate and flavor. the filter not running would be unrelated to aggregate/flavor AFAIK21:43
melwittI'll try it21:44
kaiokmodoesn't seem to work for me. I configured nova.conf as you suggested, with enabled_filters=AggregateInstanceExtraSpecsFilter and I left available_filters commented to load all filters by default21:44
kaiokmoin which tag are your nova/devstack on?21:45
openstackgerritDustin Cowles proposed openstack/nova master: Introduces SDK to IronicDriver and uses for node.get  https://review.opendev.org/64289921:45
openstackgerritDustin Cowles proposed openstack/nova master: Use SDK instead of ironicclient for node.list  https://review.opendev.org/65602721:45
openstackgerritDustin Cowles proposed openstack/nova master: Use SDK instead of ironicclient for validating instance and node  https://review.opendev.org/65602821:45
openstackgerritDustin Cowles proposed openstack/nova master: Use SDK instead of ironicclient for setting instance id  https://review.opendev.org/65969021:45
openstackgerritDustin Cowles proposed openstack/nova master: WIP: Use SDK instead of ironicclient for add/remove instance info from node  https://review.opendev.org/65969121:45
melwittI'm at a5e3054e1d6df248fc4c00b9abd7289dde160393 in train21:46
melwittwhen I set enabled_filters = AggregateInstanceExtraSpecsFilter I get only the one filter running21:47
*** lbragstad has quit IRC21:52
melwittgonna try now with aggregate and flavor21:52
kaiokmook, thank you. it is probably gonna work, since the filter is running21:53
kaiokmoI'm at 18.0.0, deployed with OSA21:54
*** JamesBenson has quit IRC22:00
melwittok, booting with matching metadata works. now I'll try no match22:00
kaiokmomy knowledge about nova behaviou (api and etcetera) is limited. could this be related with the placement service or something?22:01
melwittnonmatch fails to boot22:01
kaiokmos/behaviou/behavior22:01
melwittI thought about whether placement could be involved, but that would *not* affect whether your filter runs22:02
melwittwhen something changes related to placement, it would a change where the host candidates coming back from placement are problematic in some way22:03
melwittplacement will pre-filter hosts before they run through the nova scheduler filters22:03
kaiokmoah. I see22:03
melwittthe fact that your filter isn't running at all is bizarre and I can't think of how that could be happening other than a config problem22:04
*** BjoernT has quit IRC22:06
*** slaweq has joined #openstack-nova22:11
melwittkaiokmo: when the nova-scheduler service starts up, the DEBUG log will show what values it picked up for available_filters and enabled_filters. you might take a look at that to see if it yields any clues22:14
openstackgerritMerged openstack/nova master: Move get_pci_mapping_for_migration to MigrationContext  https://review.opendev.org/64302322:14
openstackgerritMerged openstack/nova master: Allow driver to properly unplug VIFs on destination on confirm resize  https://review.opendev.org/64302422:14
melwittfor example May 24 21:46:45 ubuntu-xenial nova-scheduler[15474]: DEBUG oslo_service.service [None req-3351fca8-ca06-4931-bebb-e1f40771a478 None None] filter_scheduler.enabled_filters = ['AggregateInstanceExtraSpecsFilter']22:14
melwittMay 24 21:46:45 ubuntu-xenial nova-scheduler[15474]: DEBUG oslo_service.service [None req-3351fca8-ca06-4931-bebb-e1f40771a478 None None] filter_scheduler.available_filters = ['nova.scheduler.filters.all_22:15
melwittfilters']22:15
kaiokmomine is filter_scheduler.available_filters = ['nova.scheduler.filters.all_filters']22:16
kaiokmoand filter_scheduler.enabled_filters = ['AggregateInstanceExtraSpecs Filter']22:16
kaiokmowhich seems correct22:17
melwittis the space in the middle of ['AggregateInstanceExtraSpecs Filter'] just a typo in chat or?22:17
melwittbecause it should not have a space22:17
kaiokmotypo in chat.22:19
kaiokmofor a moment I thought "omg, it can't be" :)22:20
melwittscheduler.driver = filter_scheduler should be in the log too22:23
*** slaweq has quit IRC22:25
kaiokmoyeah, it is here22:25
melwittw t f22:26
kaiokmodoesn't make sense for me either22:27
kaiokmobeen trying to get this working since yesterday.22:28
melwittyeah, I dunno what to tell you. I'd be putting prints all over the code and just try to trace it at this point22:30
melwittdo you see a log like this at least? DEBUG nova.filters [None req-4d2d1a33-c0da-4285-a500-3806a3313c4d admin admin] Starting with 1 host(s)22:30
melwittthat's what it says before it starts running filters22:30
kaiokmonot really. only a bunch of "Lock acquired...", "Running periodic tasks...", and "Succesfully synced instances...22:33
kaiokmoso, the filters are not running at all? that's not good22:33
melwittbacking up, you are seeing a server successfully boot right? and it's going onto hosts you don't want?22:34
melwittif so, are you running one scheduler or multiple? at least one of them should be showing filtering related messages if debug=True in the nova-scheduler config22:35
melwittunless you are forcing to specific host or something in your nova boot command22:35
*** _erlon_ has quit IRC22:36
kaiokmoyeah, I can boot servers but they are going to whatever host chosen by scheduler. I'm running three schedulers on separated infra nodes, behind a haproxy.22:39
kaiokmoyes, I'm looking for the messages on the three of them, and replicating the same configuration.22:40
melwittO.o22:40
kaiokmothis is the entire nova.conf I'm using http://paste.openstack.org/show/752060/22:42
kaiokmodoesn't seem to be anything wrong with scheduler and filter_scheduler sections22:42
melwittI see a few config options whose names have changed in later releases but nothing that should be hurting22:45
melwittshould also see a log message like this on at least one scheduler22:46
melwittDEBUG nova.scheduler.filter_scheduler [None req-4d2d1a33-c0da-4285-a500-3806a3313c4d admin admin] Filtered22:46
kaiokmonone of the sort. nothing like "nova.scheduler.filter_scheduler"22:49
melwittis your deployment custom patched or anything?22:53
tonybdansmith: thanks! and sorry22:56
*** macza has quit IRC23:02
*** luksky has joined #openstack-nova23:04
*** rcernin has joined #openstack-nova23:05
openstackgerritMerged openstack/nova master: Move patch_exists() to nova.test.TestCase for reuse  https://review.opendev.org/66050023:07
*** slaweq has joined #openstack-nova23:11
*** rcernin has quit IRC23:11
*** KH-Jared has quit IRC23:16
*** mkarpiarz has quit IRC23:24
*** slaweq has quit IRC23:24
openstackgerritAdam Spiers proposed openstack/nova master: Move selective patching of open() to nova.test for reuse  https://review.opendev.org/66126623:39
openstackgerritAdam Spiers proposed openstack/nova master: Provide HW_CPU_AMD_SEV trait when SEV is supported  https://review.opendev.org/63868023:46
openstackgerritAdam Spiers proposed openstack/nova master: Reduce logging of host hypervisor capabilities to DEBUG level  https://review.opendev.org/66137923:47
openstackgerritAdam Spiers proposed openstack/nova master: Add <launchSecurity> element to libvirt guest XML for AMD SEV  https://review.opendev.org/63631823:49
openstackgerritAdam Spiers proposed openstack/nova master: Extract SEV-specific bits on host detection  https://review.opendev.org/63633423:52
*** ttsiouts has quit IRC23:53
openstackgerritMerged openstack/nova master: Skip novnc tests in multi-cell job until bug 1830417 is fixed  https://review.opendev.org/66137123:58
openstackbug 1830417 in devstack "NoVNCConsoleTestJSON.test_novnc fails in nova-multi-cell job since 5/20" [Undecided,In progress] https://launchpad.net/bugs/1830417 - Assigned to melanie witt (melwitt)23:58

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!