Friday, 2018-11-09

*** tetsuro has joined #openstack-placement00:40
openstackgerrit98k proposed openstack/os-traits master: Add python 3.6 unit test job  https://review.openstack.org/61674903:18
*** openstackstatus has quit IRC04:59
*** openstack has joined #openstack-placement07:11
*** ChanServ sets mode: +o openstack07:11
*** e0ne has quit IRC07:31
openstackgerritBrin Zhang proposed openstack/nova-specs master: Support admin to specify project to create snapshot  https://review.openstack.org/61684307:52
*** takashin has left #openstack-placement08:03
*** tssurya has joined #openstack-placement08:07
*** helenaAM has joined #openstack-placement08:31
*** tetsuro has quit IRC09:20
*** e0ne has joined #openstack-placement09:39
*** ttsiouts has joined #openstack-placement10:05
*** ttsiouts has quit IRC10:10
*** ttsiouts has joined #openstack-placement10:11
*** ttsiouts has quit IRC10:15
*** ttsiouts has joined #openstack-placement10:20
*** ttsiouts has quit IRC10:21
*** ttsiouts has joined #openstack-placement10:22
*** ttsiouts has quit IRC10:26
*** ttsiouts has joined #openstack-placement10:30
*** e0ne has quit IRC10:56
*** ttsiouts has quit IRC10:59
*** ttsiouts has joined #openstack-placement11:00
*** tssurya has quit IRC11:04
*** e0ne has joined #openstack-placement11:44
*** rubasov has quit IRC13:07
*** mriedem has joined #openstack-placement14:08
*** tssurya has joined #openstack-placement14:57
openstackgerritMerged openstack/placement master: Harden placement init under wsgi  https://review.openstack.org/61244414:57
*** tssurya has quit IRC16:22
*** e0ne has quit IRC16:35
*** helenaAM has quit IRC16:59
*** ttsiouts has quit IRC17:51
*** e0ne has joined #openstack-placement18:09
*** e0ne has quit IRC18:33
*** e0ne has joined #openstack-placement18:36
mriedemso i'm wondering if delete_consumers_if_no_allocations is broken18:40
mriedembig ass triage in https://bugs.launchpad.net/nova/+bug/179868818:40
openstackLaunchpad bug 1798688 in OpenStack Compute (nova) "AllocationUpdateFailed_Remote: Failed to update allocations for consumer. Error: another process changed the consumer after the report client read the consumer state during the claim" [High,Triaged]18:40
mriedemtl;dr tempest has a test which shelve offloads a server and then immediately unshelves it,18:40
mriedemand we're hitting consumer generation conflicts during the unshelve18:40
mriedempresumably because the consumer/allocations aren't gone by the time we unshelve18:40
mriedemeven though shelve offload deletes the allocations for the server18:41
mriedemand by "delete" I mean PUT /allocations/consumer with an empty allocations dict18:41
mriedemhttps://review.openstack.org/#/c/591597/8/nova/scheduler/client/report.py18:41
*** mriedem is now known as mriedem_hangry18:55
*** e0ne has quit IRC18:58
*** mriedem_hangry is now known as mriedem19:34
*** e0ne has joined #openstack-placement20:01
*** e0ne has quit IRC20:04
*** efried is now known as fried_rice20:09
fried_ricemriedem: Shouldn't we have a retry covering that race?20:09
mriedemfried_rice: we used to, before the consumer generatoin stuff20:19
fried_riceconsumer generation stuff should allow us to do a *better* retry :(20:20
mriedemhttps://review.openstack.org/#/c/583667/29/nova/scheduler/client/report.py20:20
fried_riceBogus, who would approve THAT patch??20:20
mriedemi have a debug patch up to try and figure out what the existing allocations GET response looks like20:20
mriedemhard to know whats going on without that20:20
mriedemhttps://review.openstack.org/61701620:20
mriedemalso, i think tihs is always true https://review.openstack.org/#/c/583667/29/nova/scheduler/client/report.py@180520:22
mriedemi don't see a consumer generation in the allocation candidates response20:22
mriedemprobably b/c those are not suppose to be specific to a consumer20:22
mriedemhow/where does the root_provider_uuid get set on a provider?20:30
mriedemi'm looking at https://review.openstack.org/#/c/613304/20:30
mriedemand trying to figure out how that could happen - old computes?20:31
mriedemoh so if i create a resource provider but omit parent_provider_uuid, then it's automatically the root, right?20:32
mriedemb/c it doesn't have a parent20:32
fried_riceshould be, yeah20:32
fried_riceI don't think you're allowed to set the root provider UUID on a provider. It happens automatically.20:33
fried_riceIf you don't set the parent ID, the node is a root node, period.20:33
mriedemright so https://bugs.launchpad.net/nova/+bug/179989220:34
openstackLaunchpad bug 1799892 in OpenStack Compute (nova) "Placement API crashes with 500s in Rocky upgrade with downed compute nodes" [Undecided,New]20:34
fried_riceand yes, the test case in that patch is theoretically old computes, or manual diddling.20:34
mriedemany providers created before queens https://review.openstack.org/#/c/377138/ wouldn't have parent_provider_uuid or root_provider_id set20:34
mriedemand we apparently didn't have an online data migration to set the root_provider_uuid for existing providers...20:35
mriedemand at some point along the way to supported nested resource providers in the scheduler, allocation candidates assumes all providers have a root_provider_uuid set20:35
fried_ricewe *do* have a data migration20:36
fried_riceWe even set the root provider ID on reads if we find it unset20:36
fried_riceBut the hole here may have been that we're looking at this particular provider in a place that doesn't have that migration thing going.20:37
mriedemoh i see https://review.openstack.org/#/c/377138/62/nova/objects/resource_provider.py@91720:39
mriedemyeah so when we get the provider, we migrate the data20:39
mriedembut maybe when getting allcoation candidates, we're going straight through the db rather than the objects20:39
fried_ricejust so20:39
mriedemthis is where placement-manager db online_data_migrations is going to be useful20:41
mriedemstuff like this i mean20:42
mriedemfried_rice: btw, are you berlin bound or didn't get approval?20:42
fried_riceI got approval a week ago. Also saw an ENT this morning who told me my eyeball won't explode out of my head on ascent. I leave Sunday, arrive Monday, leave Friday.20:43
mriedemhaving an eyeball will be good20:44
mriedemthe bruise will also be great if you're giving a talk20:44
fried_riceI'm not giving a talk, but the bruise is still nice.20:44
mriedemdid Kevin hook you up with that placement / public clouds one?20:44
mriedemoh20:44
mriedemmaybe he was shy20:44
fried_riceIt's actually disappointingly non-spectacular. I don't bruise very easily.20:44
mriedemso full of iron youare20:45
mriedemlike a human bean20:45
fried_riceBut between that and my hair being longer, I look even more like Ash Williams :P20:45
mriedemi'm also growing my hair out20:45
mriedemgets real big and wavy20:45
fried_riceTired of people comparing you to Doogie Howser?20:45
mriedemlike eric from that 70s show20:45
mriedemi don't look anything like doogie20:45
mriedemkevin bacon on the other hand..20:45
fried_riceit's more of a general mien20:46
openstackgerritMatt Riedemann proposed openstack/placement master: Add recreate test for bug 1799892  https://review.openstack.org/61703120:52
openstackbug 1799892 in OpenStack Compute (nova) rocky "Placement API crashes with 500s in Rocky upgrade with downed compute nodes" [Medium,Triaged] https://launchpad.net/bugs/179989220:52
mriedemfried_rice: jaypipes: i've got a question in https://review.openstack.org/#/c/613305/ which maybe you can answer21:02
* fried_rice reads21:02
fried_ricemriedem: Which question specifically?21:03
mriedemhttps://review.openstack.org/#/c/613305/2/nova/api/openstack/placement/objects/resource_provider.py@300221:04
fried_riceoy, that's not going to be a quick answer, with my sandbox in its current state21:05
fried_ricesomething tickling my brain says I looked at that when I first reviewed this patch...21:06
fried_ricemebbe not21:07
fried_riceI was in a chatty mood at that time, I would have said something.21:07
fried_ricemriedem: Okay, I get it. Have you looked at the other usage of that method via get_trees_matching_all ?21:10
mriedemi didn't dig into all that uses it no21:13
mriedemmost of this giant resource_provider.py module is greek to me by now21:13
fried_ricemriedem: Replied.21:13
fried_ricemriedem: I suspect we can't get there because we only hit that "trees" path if we find proper tree-ness in the initial stages of GET /a_c. But I suppose it may be possible.21:15
mriedemidk21:16
mriedemwithout a test for that scenario it's hard to know what could break21:16
fried_riceright. If anyone can swing that test, tetsuro can.21:16
fried_riceI think it was here, I found a hole in the test coverage that he closed up.21:17
*** mriedem has quit IRC22:29
*** s10 has joined #openstack-placement23:28

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!