Thursday, 2020-11-05

*** jangutter has joined #openstack-nova00:06
*** jangutter_ has quit IRC00:07
*** tosky has quit IRC00:14
*** macz_ has quit IRC00:17
*** martinkennelly has quit IRC00:21
*** martinkennelly has joined #openstack-nova00:21
*** tbachman has joined #openstack-nova00:29
*** ccstone has quit IRC00:57
*** eandersson has quit IRC00:57
*** ccstone has joined #openstack-nova00:57
*** eandersson has joined #openstack-nova00:57
*** whoami-rajat___ has joined #openstack-nova01:06
openstackgerritBrin Zhang proposed openstack/nova master: Add os-volume_attachments reference docs  https://review.opendev.org/76097101:09
openstackgerritMaAoyu proposed openstack/os-traits master: bump py37 to py38 in tox.ini  https://review.opendev.org/75743201:24
*** macz_ has joined #openstack-nova01:27
*** macz_ has quit IRC01:32
*** k_mouza has joined #openstack-nova01:35
*** Liang__ has joined #openstack-nova01:37
*** Liang__ has quit IRC01:39
*** k_mouza has quit IRC01:40
*** sapd1 has joined #openstack-nova01:41
*** rcernin_ has joined #openstack-nova01:42
*** rcernin has quit IRC01:43
*** LinPeiWen has joined #openstack-nova01:48
*** martinkennelly has quit IRC01:58
*** spatel has joined #openstack-nova02:10
*** spatel has quit IRC02:11
*** spatel has joined #openstack-nova02:12
*** spatel has quit IRC02:14
*** kaisers has quit IRC02:45
*** swp20 has joined #openstack-nova02:49
*** xinranwang has joined #openstack-nova03:03
*** xinranwang has quit IRC03:04
*** xinranwang has joined #openstack-nova03:04
*** hamalq has quit IRC03:21
*** rcernin_ has quit IRC03:24
*** rcernin_ has joined #openstack-nova03:34
*** rcernin_ has quit IRC03:47
*** ratailor has joined #openstack-nova03:56
*** sapd1 has quit IRC03:57
*** k_mouza has joined #openstack-nova04:11
*** Liang__ has joined #openstack-nova04:13
*** k_mouza has quit IRC04:15
*** rcernin_ has joined #openstack-nova04:19
*** rcernin_ has quit IRC04:19
*** rcernin has joined #openstack-nova04:19
*** vishalmanchanda has joined #openstack-nova04:31
*** rcernin has quit IRC05:13
*** rcernin has joined #openstack-nova05:14
*** whoami-rajat___ is now known as whoami-rajat__05:32
*** evrardjp has quit IRC05:33
*** evrardjp has joined #openstack-nova05:33
openstackgerritWenping Song proposed openstack/nova-specs master: Support vGPU management by Cyborg  https://review.opendev.org/75011605:43
*** ratailor has quit IRC06:33
*** rcernin has quit IRC06:37
*** rcernin has joined #openstack-nova06:50
*** Liang__ has quit IRC07:06
*** Liang__ has joined #openstack-nova07:07
*** lpetrut has joined #openstack-nova07:11
*** rcernin has quit IRC07:12
*** melwitt has quit IRC07:20
*** melwitt has joined #openstack-nova07:21
*** dklyle has quit IRC07:24
*** swp20 has quit IRC07:30
*** rcernin has joined #openstack-nova07:33
*** rcernin has quit IRC07:43
xinranwanggibi:  Hi gibi, I have replied to your comment about smartnic support,  and there are some open question need your suggestion, please check it when you got time. Thanks in advance.  https://review.opendev.org/#/c/742785/6/specs/wallaby/approved/support-sriov-smartnic.rst07:47
*** slaweq has joined #openstack-nova07:54
*** swp20 has joined #openstack-nova07:56
*** ralonsoh has joined #openstack-nova07:59
gibixinranwang: ack, I will look at it today08:04
xinranwanggibi:  great, thanks08:04
*** luksky has joined #openstack-nova08:09
*** tosky has joined #openstack-nova08:13
bauzasgood morning Nova08:13
*** andrewbonney has joined #openstack-nova08:14
gibibauzas: o/08:17
*** hoonetorg has quit IRC08:21
*** tesseract has joined #openstack-nova08:21
openstackgerritMerged openstack/nova master: Use subqueryload() instead of joinedload() for (system_)metadata  https://review.opendev.org/75892808:22
*** hoonetorg has joined #openstack-nova08:26
openstackgerritSylvain Bauza proposed openstack/nova master: Add a regression test for 5.12 compute API issue  https://review.opendev.org/76145708:36
openstackgerritSylvain Bauza proposed openstack/nova master: Fix the compute RPC 5.12 issue  https://review.opendev.org/76145808:36
*** macz_ has joined #openstack-nova08:37
*** rpittau|afk is now known as rpittau08:39
bauzasgibi: you probably missed my pings yesterday night, but I spotted a critical upgrade issue in victoria08:40
bauzashttps://bugs.launchpad.net/nova/+bug/190292508:41
openstackLaunchpad bug 1902925 in OpenStack Compute (nova) "Upgrades to compute RPC API 5.12 are broken" [Critical,In progress] - Assigned to Sylvain Bauza (sylvain-bauza)08:41
*** macz_ has quit IRC08:42
bauzasmade a better explanation of the impact https://bugs.launchpad.net/nova/+bug/1902925/comments/308:47
openstackLaunchpad bug 1902925 in OpenStack Compute (nova) "Upgrades to compute RPC API 5.12 are broken" [Critical,In progress] - Assigned to Sylvain Bauza (sylvain-bauza)08:47
*** songwenping_ has joined #openstack-nova08:51
*** swp20 has quit IRC08:54
gibibauzas: thanks, now I read back. good catch08:57
bauzaswell, just found it when writing the RPC major bump08:58
bauzaswhen you know the RPC usage, it's simple08:58
bauzasoh shit, I forgot to add the conditional I promised to dansmith ^_^08:59
*** ociuhandu has joined #openstack-nova09:06
bauzasactually, we don't need it \o/09:06
*** ociuhandu has quit IRC09:16
*** jangutter_ has joined #openstack-nova09:22
*** ociuhandu has joined #openstack-nova09:22
*** martinkennelly has joined #openstack-nova09:25
*** ociuhandu has quit IRC09:25
*** jangutter has quit IRC09:25
*** ociuhandu has joined #openstack-nova09:26
gibibauzas: I'm confused about the naming here https://review.opendev.org/#/c/761457/2/nova/tests/functional/regressions/test_bug_1902925.py@3109:26
bauzasthat's what happens when you copy/paste some methods...09:27
*** k_mouza has joined #openstack-nova09:34
gibibauzas: when you respin it, could you update the doc here too https://review.opendev.org/#/c/761458/2/nova/compute/manager.py@335509:35
gibibesides these, the fix looks good to me09:37
*** suryasingh has joined #openstack-nova09:40
*** macz_ has joined #openstack-nova09:43
gibilyarwood, elod: when the bugfix ^^ is merged to victoria we need to push a point release09:44
gibias this is a critical upgrade issue to V09:44
*** derekh has joined #openstack-nova09:45
gibibauzas: btw, one more request, could you add an upgrade reno to the fix? It would help making visible that upgrading to V needs this fix09:46
bauzasgibi: sure for both09:47
gibithanks09:47
bauzasI was just about to upload but I killed it09:47
*** macz_ has quit IRC09:48
lyarwoodgibi: ack09:51
*** ociuhandu has quit IRC09:57
*** ociuhandu has joined #openstack-nova09:58
*** ratailor has joined #openstack-nova10:00
*** ociuhandu has quit IRC10:01
*** ociuhandu has joined #openstack-nova10:02
*** kaisers has joined #openstack-nova10:02
lyarwoodso are we not testing rebuild in grenade?10:02
openstackgerritSylvain Bauza proposed openstack/nova master: Add a regression test for 5.12 compute API issue  https://review.opendev.org/76145710:02
openstackgerritSylvain Bauza proposed openstack/nova master: Fix the compute RPC 5.12 issue  https://review.opendev.org/76145810:02
bauzasgibi: done ^10:03
elodgibi: thx, I've planned to propose release patches for stein + train + ussuri + victoria today, but then I'll wait with the victoria release patch :]10:03
elodlyarwood: fyi ^^^10:03
bauzaselod: hopefully, we'll merge it today10:03
bauzasthe fix is simple10:04
elodbauzas: \o/10:04
elodthen we just have to wait the gate :]10:04
lyarwoodelod: ack thanks10:04
lyarwoodI guess we don't test rebuilds in a mixed upgrade state10:04
lyarwoodand that's why grenade multinode didn't hit this10:05
gibibauzas: looking10:05
bauzaslyarwood: stephenfin: gibi: I'm not telling you were bad about reviewing (i also sometimes misses some issues), but maybe it would be nice for you to look at both the fix https://review.opendev.org/#/c/761458/ but also to review https://review.opendev.org/#/c/761452/ to understand how RPC API works10:06
bauzasagain, no worries at all10:06
bauzasit's more for providing a knowledge help for you folks about how RPC versions work10:07
stephenfinah, I knew that and forgot about it :(10:07
bauzasif you knew it, all good then10:07
gibibauzas: yeah, thanks for the pointers. I'm wondering if we can make some test enhancements to catch these in the future10:08
lyarwoodI didn't even review the broken patch here so I'm not sure what you're trying to say10:08
lyarwood....10:08
bauzasbut hopefully the proxy change I'm providing is nice for knowing how to have a major version10:08
bauzaslyarwood: not about any previous reviews, just for helping you to know what to review when you have a change with a RPC modification10:09
lyarwoodsure10:09
*** Liang__ has quit IRC10:10
lyarwoodbauzas: look forward to your reference docs patches10:11
stephenfingibi: We could probably hash the signature or something?10:14
bauzasactually, I could write something in https://docs.openstack.org/nova/latest/contributor/code-review.html10:14
bauzasstephenfin: gibi: testing it is not simple10:14
stephenfini.e. identify all the position, non-optional arguments and generate/save a hash for those10:14
stephenfinthen compare each time, like we do for o.vos10:15
bauzassince the arguments are different between RPC versions10:15
gibistephenfin: that would be the ovo way yes10:15
stephenfin*positional10:15
lyarwoodbauzas: https://docs.openstack.org/nova/latest/reference/rpc.html I was thinking more in here10:15
lyarwoodbauzas: but either way10:15
bauzasstephenfin: we have non-positional arguments that are unrelated to RPC versions10:15
bauzaswe just keep them optional10:16
bauzaslyarwood: oh, TIL this page was existing10:16
* gibi needs to think about the testing10:16
bauzasgibi: we fixed it by code reviews10:17
bauzasah, this is already documented https://docs.openstack.org/nova/latest/contributor/code-review.html#rpc-api-versions10:17
gibibauzas: sure, code review is the fallaback, human intelligence is king, but if we can automate it then we could avoid failing humans like me at the original code rview10:17
lyarwoodbauzas: ah cool10:18
bauzasbut I guess "The manager-side method needs to tolerate older calls as well as newer calls" is maybe too much overall, and we need to explain it more10:18
*** songwenping_ has quit IRC10:18
bauzasgibi: we could enforce owners to propose functional tests10:18
bauzasfor testing the RPC pins10:18
bauzaslike I did in my regression test10:19
bauzasthis would be a simpliest approach10:19
gibiI guess enforce by code review10:19
bauzasthat, yeah10:20
gibiI agree10:20
bauzasbut from what I've seen, nobody is really doing it10:20
gibistill I want to automate it if possible :D10:20
lyarwoodshouldn't we cover mixed compute upgrades in the multinode grenade job?10:20
bauzasgibi: well, we don't really set new versions a lot right?10:20
gibibecuase all are code review rules are as good as the way we enforce them10:20
gibibauzas: we do it less and less, I agree10:21
bauzaslike, we only had one rpc minor bump per release since a while10:21
bauzasgibi: well, we have a code review documentation10:21
bauzasand I expect cores to know it at least10:21
bauzasI mean, that's a breaking change to accept a RPC change10:21
bauzasmaybe we rushed over accepting some feature that was long overdue, but maybe considering to require a functest would ensure that we would put the burden on code owners10:22
gibibauzas: you are correct that we assume that core reviews catch these kind of problems, but they don't as you found10:22
*** CeeMac has joined #openstack-nova10:23
gibiif we do these thing less and less it means that we will easier to forget what to look at in these changes10:23
gibias we don't excersize this knowledge10:24
*** jangutter has joined #openstack-nova10:25
gibiso I agree that one thing is to raise awerness for this issue as you did.10:26
gibibut also I will think about some kind of automation as I cannot promise I won't forget this rule again 6 months from now when we bump the next10:27
*** jangutter_ has quit IRC10:29
*** noonedeadpunk has quit IRC10:32
*** noonedeadpunk has joined #openstack-nova10:32
bauzasgibi: ahah lol, i had to rush off home because I forgot my kids at the school :whoops:10:44
bauzasgibi: fwiw, the change we merged was a bit hairy, so I do understand that it was difficult to find the problem10:44
bauzasgibi: that's why I said we should at least ask to provide a functional test, that's it10:45
gibiyeah, I should not forget to ask a functional test pining to old RPC version when a new RPC version is proposed10:45
*** jangutter_ has joined #openstack-nova10:58
*** ratailor_ has joined #openstack-nova10:58
*** jangutter has quit IRC11:01
*** ratailor has quit IRC11:01
*** ratailor__ has joined #openstack-nova11:08
*** ratailor_ has quit IRC11:11
*** ratailor__ has quit IRC11:17
*** ratailor has joined #openstack-nova11:17
*** ratailor has quit IRC11:18
*** ratailor has joined #openstack-nova11:21
*** noonedeadpunk has quit IRC11:21
*** ratailor has quit IRC11:21
*** noonedeadpunk has joined #openstack-nova11:25
*** ratailor has joined #openstack-nova11:26
*** dtantsur|afk is now known as dtantsur11:26
*** ratailor has quit IRC11:27
*** ratailor has joined #openstack-nova11:27
*** ratailor has quit IRC11:31
*** ratailor has joined #openstack-nova11:31
*** ratailor_ has joined #openstack-nova11:40
*** ratailor has quit IRC11:43
*** ratailor__ has joined #openstack-nova11:47
*** ratailor_ has quit IRC11:50
*** xinranwang has quit IRC11:53
*** tbachman has quit IRC11:58
*** ociuhandu has quit IRC12:00
*** ratailor_ has joined #openstack-nova12:00
*** ratailor_ has quit IRC12:01
*** ratailor has joined #openstack-nova12:01
*** ociuhandu has joined #openstack-nova12:01
*** ratailor__ has quit IRC12:03
brinzhang0gibi: hi good morning12:05
brinzhang0gibi: Hope you can review12:05
brinzhang0Cyborg shelve/unshelve support patch https://review.opendev.org/#/c/729563/ :D12:06
*** ociuhandu has quit IRC12:06
gibibrinzhang0: add to my queue12:06
gibiadded12:06
brinzhang0gibi: thanks12:06
*** JamesBenson has joined #openstack-nova12:10
*** raildo has joined #openstack-nova12:23
*** ratailor has quit IRC12:25
*** jangutter has joined #openstack-nova12:59
*** jangutter_ has quit IRC13:03
*** ociuhandu has joined #openstack-nova13:14
*** sapd1 has joined #openstack-nova13:18
*** tbachman has joined #openstack-nova13:28
*** nweinber has joined #openstack-nova13:28
*** suryasingh has quit IRC13:33
bauzasbrinzhang0: gibi: hah, this time the new argument is nullable :p13:36
bauzasbut maybe it's time to ask for a functional testclass verifying the RPC API ? :)13:36
*** diconico07 has joined #openstack-nova13:43
*** derekh has quit IRC13:49
*** derekh has joined #openstack-nova13:50
*** jangutter_ has joined #openstack-nova14:02
*** jangutter has quit IRC14:06
*** arxcruz has joined #openstack-nova14:08
openstackgerritBalazs Gibizer proposed openstack/nova master: Bump the lowest eventlet version to 0.26.1  https://review.opendev.org/76142714:11
*** dtantsur has quit IRC14:16
*** vishalmanchanda has quit IRC14:30
*** dtantsur has joined #openstack-nova14:37
*** dtantsur has quit IRC14:37
*** dtantsur has joined #openstack-nova14:38
*** jangutter_ is now known as jangutter14:48
*** belmoreira has joined #openstack-nova14:56
*** dtantsur has quit IRC14:56
*** dtantsur has joined #openstack-nova14:56
*** macz_ has joined #openstack-nova15:07
*** macz_ has quit IRC15:12
iurygregoryHi nova folks, a friend of mine using openstack queens asked me if it's possible to update the config drive of an instance?15:12
bauzasgibi: the next nova meeting is in 45 mins, right?15:12
bauzastz change15:12
*** otubo has joined #openstack-nova15:13
gibibauzas: yes, the meeting is at 16:00 UTC which is 17:00 CET15:14
bauzascool cool15:14
bauzasnicer for us :)15:14
gibi:)15:15
sean-k-mooneyopenstack meetings are alwasy utc and never move15:18
sean-k-mooneyits other that do15:18
sean-k-mooneyfortunetlly DLS will not be a thing in europe after 202115:19
dansmithhopefully not on the west coast either, but it's not set yet15:20
sean-k-mooneyit was ment to happen this year but got delayed so this was ment to be the last switch15:20
*** ociuhandu has quit IRC15:21
sean-k-mooneythe current plan is contries adopting permenatn summer time will swap for the last time in the spring15:21
sean-k-mooneyand the rest will swap for the last time in the fall15:21
gibistephenfin, bauzas: spent some time thinking automating to catch bugs like https://bugs.launchpad.net/nova/+bug/1902925 . Besides code review (that fails some time like in this case) what we can do is to extend the grenade testing.15:32
openstackLaunchpad bug 1902925 in OpenStack Compute (nova) "Upgrades to compute RPC API 5.12 are broken" [Critical,In progress] - Assigned to Sylvain Bauza (sylvain-bauza)15:32
gibiAs far as I understand it run livemigration between mixed computes15:32
bauzassean-k-mooney: I'm against summer time15:32
bauzasgibi: you need then two compute services15:33
gibibauzas: we have multinode grenade15:33
bauzasand a rolling upgrade scenario15:33
bauzasbecause the rpc pins will automatically set the version to the oldest compute one15:33
bauzas(if set to 'auto')15:33
gibiI think nova-grenade-multinode does what we need15:34
bauzasand again, tbh, I wonder whether it's just a code review usage15:34
gibias per https://github.com/openstack/nova/blob/d25bc07d26212408211b64953af7ef6047ca3d9d/playbooks/legacy/nova-grenade-multinode/run.yaml#L47-L5015:34
bauzasdansmith: your thoughts on it ? tl;dr: automatical uprade testing vs. asking for functional tests that would verify a RPC version minor bump15:34
bauzasgibi: if we run two computes, then okay, we don't need them to be on separate nodes but the other services15:35
bauzasie. aio+compute15:35
bauzaswhich is what grenade-multinode is doing AFAIR15:35
bauzasso you're right15:35
bauzasgibi: but then we need to test all the RPC calls in tempest15:36
bauzasgood luck with this15:36
bauzasI just feel the simpliest is just to ask for functests15:36
bauzasI wrote them yesterday night and it took me 20 mins15:36
*** dklyle has joined #openstack-nova15:37
gibibauzas: never said that we should not ask for a func test. I'm saying that we tend to forget about it as the current bug shows15:37
dansmithyeah, so we could always pin the version to .0,15:37
dansmithbut coverage in tempest will be hard,15:37
dansmithplus tempest needs to be graceful as some api calls will fail expectedly if the version doesn't support the new feature15:37
bauzastechnically, we need to set the pin to the previous release version15:38
sean-k-mooneybauzas: same i want to stick on utc in my case15:38
dansmithI'd prefer some test that ensures we've hit all the versions for each call in unit/func or something15:38
bauzasdansmith: that's my thoughts15:38
gibidansmith: I thinked about that angle as that would be a good thing in my eys15:38
bauzaschecking it thru tempest is something I'd love, but I'm pragmativ15:38
bauzaspragmatic15:39
dansmithgibi: yeah15:39
bauzaswe honestly have the pattern to ask with my functest15:39
bauzasit's just a simple request15:39
bauzasand I guess (or I hope) none of the cores to miss this15:39
gibiI have my doubt about my memory15:39
sean-k-mooneytesting rpc versions? trying to catch up on the converstation15:40
gibiso I won't promise I will always remember15:40
gibisean-k-mooney: basically avoiding the bug https://bugs.launchpad.net/nova/+bug/190292515:40
openstackLaunchpad bug 1902925 in OpenStack Compute (nova) "Upgrades to compute RPC API 5.12 are broken" [Critical,In progress] - Assigned to Sylvain Bauza (sylvain-bauza)15:40
bauzasgibi: well, RPC and DB upgrades are possibly the hugest changes we could review, right?15:40
gibiright15:40
bauzasI could understand this for a simple method15:40
bauzasbut for all the manager services, the risk is present15:41
bauzasbut either way, the meeting is in 20 mins15:41
bauzasprobably the best is to discuss it there15:41
sean-k-mooneythat had functional test for what it was worth15:41
bauzassean-k-mooney: the cyborg patches ? nope15:41
sean-k-mooneymy orginial one did https://review.opendev.org/#/c/715326/15:42
sean-k-mooneyhttps://review.opendev.org/#/c/715326/29/nova/tests/functional/test_servers.py15:42
dansmithgibi: details of db, rpc, and general upgrade issues in patches have always required lots of human review to get right.. in the early days when we went from not-upgradeable to where we are now.. we added lots of tests where we could, and developer traps like the required db migration tests,15:43
dansmithgibi: but automating all the things is hard and the issues are complex15:43
bauzassean-k-mooney: you won't catch this error then15:43
dansmithgibi: so I'm all for trying to catch more stuff, especially in a case like this where we just lacked such a test, but ... human review is not replaceable, obviously15:43
bauzassean-k-mooney: see my regression test, it does capture the bug https://review.opendev.org/#/c/761457/1/nova/tests/functional/regressions/test_bug_1902925.py15:43
bauzasactually https://review.opendev.org/#/c/761457/3/nova/tests/functional/regressions/test_bug_1902925.py15:44
*** ociuhandu has joined #openstack-nova15:45
bauzasstephenfin: I don't get your -1 https://review.opendev.org/#/c/761458/3/releasenotes/notes/bug_1902925-351f563340a1e9a5.yaml@1115:45
bauzasstephenfin: the 'fixes' reno section is purposed to show the fixed bugs15:45
bauzasso that's normal we won't show this note until we merge the patch15:45
stephenfinI'm saying that the docs job won't pass until you do what gibi suggested15:46
sean-k-mooneybauzas: becaue it need the version cap to trigger it15:46
stephenfinyou need to add a leading to underscore '.. bug 1902925:', i.e. '.. _bug 1902925:'15:47
openstackbug 1902925 in OpenStack Compute (nova) "Upgrades to compute RPC API 5.12 are broken" [Critical,In progress] https://launchpad.net/bugs/1902925 - Assigned to Sylvain Bauza (sylvain-bauza)15:47
bauzasstephenfin: ah that, no worries I'll fix it15:47
gibidansmith: I agree that we need human review. All I want is to aid that huma review _if possible_15:48
dansmithgibi: for sure15:48
sean-k-mooneyok i see what you have changed. hum ok15:49
bauzassean-k-mooney: I explained the issue in https://bugs.launchpad.net/nova/+bug/1902925/comments/315:51
openstackLaunchpad bug 1902925 in OpenStack Compute (nova) "Upgrades to compute RPC API 5.12 are broken" [Critical,In progress] - Assigned to Sylvain Bauza (sylvain-bauza)15:51
sean-k-mooneyya i just wanted to read the repoducer and code fix15:51
sean-k-mooneyi guess we have not extended this api since we did the 5.0 rpc bump15:52
sean-k-mooneynone of the other fileds are optional15:52
tobias-urdinany good (and somewhat "supported" way) to extend the nova metadata API to include some custom paths? IIRC some ways of extending nova has been deprecated/removed over the years15:53
sean-k-mooneyyes15:53
sean-k-mooneytobias-urdin: https://docs.openstack.org/nova/latest/admin/vendordata.html15:53
sean-k-mooneythat or i guess you coudl use middleware15:54
sean-k-mooneybut in general nova is not extensible in this way intentionally15:54
noonedeadpunksean-k-mooney: I think I might figued out why isolated aggregates got instances from time to time. any reason not to pass rebuilds through scheduler? https://opendev.org/openstack/nova/src/branch/master/nova/scheduler/manager.py#L14615:55
sean-k-mooneynoonedeadpunk: rebuilds cant change host15:55
sean-k-mooneynoonedeadpunk: they are not move operations15:55
sean-k-mooneyand they do go to the schduler if the image changes15:55
noonedeadpunkuh.... I see15:55
noonedeadpunkwell, I continue get 1 instance per month or smth like that on the isolated aggregate15:56
sean-k-mooneyto validate that the current host is still ok with the new image15:56
noonedeadpunkand have no clue how that might happen...15:56
sean-k-mooneyare you using the placment way fo doing it15:57
sean-k-mooneyor the filter15:57
sean-k-mooneythe placement way that should not happen as the traits request should block it but the filter reuires all tenanats to be mapped to an aggreate15:58
sean-k-mooneyor unmpped tenants can go to any host15:58
openstackgerritSylvain Bauza proposed openstack/nova master: Fix the compute RPC 5.12 issue  https://review.opendev.org/76145815:58
bauzasdansmith: gibi: stephenfin: last round, hopefully15:59
bauzasand then I'll backport the changes15:59
gibithanks15:59
*** belmoreira has quit IRC16:09
noonedeadpunksean-k-mooney: do placement traits, exactly like specified in https://docs.openstack.org/nova/latest/reference/isolate-aggregates.html which you gave me one day16:11
noonedeadpunkout of code I see no reason why this might happen16:11
noonedeadpunkbut it does16:11
sean-k-mooneyya im not sure either16:12
sean-k-mooneyunless you have multiple schduler adn one of them has a differnt config16:13
noonedeadpunkand if previously it was only during resizes or smth like that I found just new VM created a week ago...16:13
sean-k-mooneye.g. on one of them you dont have the prefilter enabled16:13
noonedeadpunkwell, I think I've checked that.... in terms of prefilter you mean scheduler.enable_isolated_aggregate_filtering ?16:14
sean-k-mooneybauzas: so that is what was breakign the grenade jobs? good find16:15
bauzasonly for the rebuild case16:16
bauzasso, maybe...16:16
bauzasidk16:16
gibigrenade does not run evacuate or grenade tests as far as I know16:16
gibiit runs live migration16:16
sean-k-mooneyit runs full tempest before and after16:16
sean-k-mooneyi think16:17
gibireally? I only found a smoke result16:17
sean-k-mooneyim thinging that if the vm that landed on the unupgraded node was rebuilt it would fail16:17
gibi+ the live migration16:17
sean-k-mooneymaybe im wrong16:17
sean-k-mooneyhttps://github.com/openstack/nova/blob/master/playbooks/legacy/nova-grenade-multinode/run.yaml#L4016:20
sean-k-mooneyits running the compute api tests and senario tests16:20
sean-k-mooneyoh just the smoke subset of those?16:21
dansmithjust smoke before, not sure about full after t hough16:21
sean-k-mooneyif its runnign rebuild after then if it booted on the upgraded node we would get teh type error16:22
sean-k-mooneyif it booted on the un upgraded node it would have rebuilt fine16:22
sean-k-mooneywhich would have made the test failure intermitent16:23
dansmithI dunno why you say that,16:23
dansmiththe control plane would be upgraded,16:23
dansmithoh you mean because the pin is set to auto and the presence of an old compute would keep it pinned I guess?16:23
sean-k-mooneyyes16:24
sean-k-mooneyit would be pinned but the old nova code would not expect the parmater and the new code would16:24
dansmiththat only works for U->V jobs, since V supported it, it'll be using the new version16:24
dansmithyou need to be looking at U->V grenade multinode jobs I'd expect right?16:24
dansmithalso, as bad as the gate has been lately, it wouldn't surprise me if people have just been rechecking past that occasional fail16:25
sean-k-mooneywell v->master woudl work since they woudl both use 5.12+16:25
dansmiththat's my point16:25
sean-k-mooneyu->v would (posssible) be intermitent16:25
sean-k-mooneyso yes16:25
dansmithright, very many fewer things running that configuration16:25
*** jangutter has quit IRC16:26
sean-k-mooneyi have just been seeign some intermitent grendade job failure before the ptg so was wondering if this was the issue or if there are others16:26
*** jangutter has joined #openstack-nova16:26
sean-k-mooneymost of the issue seam to be realted to volumes however rather then rebuild16:27
dansmithcould be.. so many CI fails lately, I expect people are doing a lot of recheck grinding16:27
*** macz_ has joined #openstack-nova16:27
gibithis is a recent grenade multinode run from stable/victoria https://1cc2260295ba1f69c29d-8ad4cd99420b0d8b2b27089e00008c76.ssl.cf1.rackcdn.com/761424/1/check/nova-grenade-multinode/e3cf1bf/logs/index.html16:32
gibiI see two test reports16:32
gibihttps://1cc2260295ba1f69c29d-8ad4cd99420b0d8b2b27089e00008c76.ssl.cf1.rackcdn.com/761424/1/check/nova-grenade-multinode/e3cf1bf/logs/old/testr_results.html16:32
gibiand16:32
gibihttps://1cc2260295ba1f69c29d-8ad4cd99420b0d8b2b27089e00008c76.ssl.cf1.rackcdn.com/761424/1/check/nova-grenade-multinode/e3cf1bf/logs/testr_results.html16:32
gibiis there a 3rd report somewhere in the tree?16:32
sean-k-mooneynope16:33
sean-k-mooneyjust those two16:33
sean-k-mooneyso we are not running rebuild in the grenade job16:33
sean-k-mooneyi tought we were but i guess not16:34
sean-k-mooneythe grenade failures i was seeing were likely something else so16:35
sean-k-mooneyits been like 2 weeks so all that is left in my brain on the topic is "i have seen more grenade failures lately then i normally do"16:36
openstackgerritSylvain Bauza proposed openstack/nova stable/victoria: Add a regression test for 5.12 compute API issue  https://review.opendev.org/76163816:48
openstackgerritSylvain Bauza proposed openstack/nova stable/victoria: Fix the compute RPC 5.12 issue  https://review.opendev.org/76163916:48
bauzaselod: stable changes are up there ^16:50
bauzashopefully master changes will be merged tonight so we could move on tomorrow16:51
*** sapd1 has quit IRC16:51
bauzasand ideally release subsequently16:51
bauzas(release stable/victoria)16:51
elodbauzas: thx, looking :)16:51
bauzaselod: don't16:52
bauzasthe master change isn't merged yet so I -2 it16:52
eloddon't worry I'll wait with the +2 until master is merged ;)16:52
elod(if I don't find any mistake with the backport, ofc)16:53
elod:]16:53
openstackgerritMerged openstack/nova master: Add a regression test for 5.12 compute API issue  https://review.opendev.org/76145717:00
bauzaselod: heh ^17:03
*** jamesden_ has joined #openstack-nova17:08
*** JamesBen_ has joined #openstack-nova17:11
elodbauzas: ok, so the regression test part is ready and looks OK. +2'd17:11
bauzas<317:11
elodone more to go :)17:11
*** JamesBenson has quit IRC17:14
elodthe backport of the fix also looks good to me and the fix is on the gate in master, so we just have to wait.17:16
elodi'll prepare a release patch tomorrow for victoria if the fix gets merged17:17
*** k_mouza has quit IRC17:18
*** k_mouza has joined #openstack-nova17:18
*** rpittau is now known as rpittau|afk17:21
*** k_mouza has quit IRC17:23
*** gyee has joined #openstack-nova17:35
*** hamalq has joined #openstack-nova17:38
*** derekh has quit IRC18:00
*** ociuhandu has quit IRC18:04
*** ociuhandu has joined #openstack-nova18:12
*** ociuhandu has quit IRC18:16
*** haleyb has quit IRC18:18
*** haleyb has joined #openstack-nova18:19
*** andrewbonney has quit IRC18:29
*** ralonsoh has quit IRC18:41
*** k_mouza has joined #openstack-nova18:44
*** k_mouza has quit IRC18:48
*** dtantsur is now known as dtantsur|afk18:53
*** lpetrut has quit IRC19:16
*** _mlavalle_2 has quit IRC19:16
*** lbragstad has quit IRC19:18
*** tesseract has quit IRC19:31
*** bbowen has quit IRC20:02
*** rchurch has quit IRC20:22
*** ociuhandu has joined #openstack-nova20:57
*** ociuhandu has quit IRC20:58
*** ociuhandu_ has joined #openstack-nova20:58
*** bbowen has joined #openstack-nova21:13
*** ociuhandu_ has quit IRC21:23
*** ociuhandu has joined #openstack-nova21:25
*** rcernin has joined #openstack-nova21:27
*** ociuhandu has quit IRC21:35
*** k_mouza has joined #openstack-nova21:45
*** nweinber has quit IRC21:48
*** k_mouza has quit IRC21:49
*** rcernin has quit IRC21:53
*** rcernin has joined #openstack-nova21:54
openstackgerritMerged openstack/nova master: Fix the compute RPC 5.12 issue  https://review.opendev.org/76145821:55
*** slaweq has quit IRC22:06
*** mlavalle has joined #openstack-nova22:08
*** hamalq has quit IRC22:11
*** jangutter_ has joined #openstack-nova22:23
*** jangutter has quit IRC22:25
*** jamesden_ has quit IRC22:33
*** rcernin has quit IRC22:57
*** rcernin has joined #openstack-nova23:01
*** spatel has joined #openstack-nova23:20
openstackgerritMerged openstack/nova stable/victoria: Add a regression test for 5.12 compute API issue  https://review.opendev.org/76163823:20
*** spatel has quit IRC23:25
*** ociuhandu has joined #openstack-nova23:36
*** ociuhandu has quit IRC23:40
*** luksky has quit IRC23:46
*** tosky has quit IRC23:55

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!