Thursday, 2020-03-19

artomalex_xu, sure, I'll take another look tomorrow00:08
*** tetsuro has joined #openstack-nova00:10
*** tetsuro has quit IRC00:12
*** tetsuro has joined #openstack-nova00:13
openstackgerritGhanshyam Mann proposed openstack/nova master: Add new default roles in os-flavor-access policies  https://review.opendev.org/71369700:18
*** tetsuro has quit IRC00:19
*** tetsuro has joined #openstack-nova00:20
*** sean-k-mooney2 has joined #openstack-nova00:26
*** sean-k-mooney1 has quit IRC00:27
*** nicolasbock has quit IRC00:28
*** tetsuro_ has joined #openstack-nova00:30
*** tetsuro has quit IRC00:33
*** mlavalle has quit IRC00:40
*** martinkennelly has quit IRC00:48
*** tosky has quit IRC00:50
*** brinzhang has joined #openstack-nova00:52
*** sean-k-mooney2 has quit IRC00:53
*** TxGirlGeek has quit IRC00:53
*** sean-k-mooney2 has joined #openstack-nova00:55
*** brinzhang has quit IRC00:57
*** brinzhang has joined #openstack-nova00:57
openstackgerritMerged openstack/nova master: Refine and introduce correct parameters for test_get_guest_config_numa_host_instance_topo_cpu_pinning  https://review.opendev.org/71335101:05
*** sean-k-mooney2 has quit IRC01:08
*** Liang__ has joined #openstack-nova01:15
openstackgerritmelanie witt proposed openstack/nova master: DNM: try to get some debug info for bug 1844929  https://review.opendev.org/70147801:18
openstackbug 1844929 in OpenStack Compute (nova) "grenade jobs failing due to "Timed out waiting for response from cell" in scheduler" [High,Confirmed] https://launchpad.net/bugs/184492901:18
*** brinzhang_ has joined #openstack-nova01:20
*** brinzhang has quit IRC01:23
openstackgerritBrin Zhang proposed openstack/nova master: Add new default roles in os-instance-actions policies  https://review.opendev.org/70647001:30
*** liuyulong has quit IRC01:33
*** vishalmanchanda has joined #openstack-nova01:33
openstackgerritBrin Zhang proposed openstack/nova master: Add new default roles in os-instance-actions policies  https://review.opendev.org/70647001:36
*** mgoddard has quit IRC01:49
*** sapd1 has joined #openstack-nova01:50
*** sapd1_x has joined #openstack-nova01:52
*** mgoddard has joined #openstack-nova01:57
*** ociuhandu has joined #openstack-nova02:04
*** ociuhandu has quit IRC02:09
*** zhanglong has joined #openstack-nova02:28
*** psachin has joined #openstack-nova02:30
*** mkrai has joined #openstack-nova03:17
*** tetsuro_ has quit IRC03:51
*** gyee has quit IRC04:08
*** tetsuro has joined #openstack-nova04:16
*** udesale has joined #openstack-nova04:32
openstackgerritmelanie witt proposed openstack/nova master: Synchronize sqlalchemy models with migrations for alembic 1.4.1  https://review.opendev.org/71377804:39
*** tetsuro_ has joined #openstack-nova04:49
*** tetsuro has quit IRC04:52
*** ratailor has joined #openstack-nova04:54
*** tetsuro has joined #openstack-nova05:03
*** tetsuro_ has quit IRC05:06
*** dave-mccowan has quit IRC05:10
*** tetsuro_ has joined #openstack-nova05:15
*** tetsuro has quit IRC05:18
*** macz_ has joined #openstack-nova05:19
*** macz_ has quit IRC05:24
*** links has joined #openstack-nova05:32
*** evrardjp has quit IRC05:36
*** evrardjp has joined #openstack-nova05:36
*** zhanglong has quit IRC05:57
*** zhanglong has joined #openstack-nova05:57
*** macz_ has joined #openstack-nova06:28
*** macz_ has quit IRC06:33
*** mkrai has quit IRC06:35
*** mkrai has joined #openstack-nova06:40
*** yedongcan has joined #openstack-nova06:47
*** damien_r has quit IRC06:51
*** dpawlik has joined #openstack-nova06:53
*** lbragstad has quit IRC07:01
*** mkrai has quit IRC07:02
*** mkrai_ has joined #openstack-nova07:02
openstackgerritKevin Zhao proposed openstack/nova master: fix scsi disk unit number of the attaching volume when cdrom bus is scsi  https://review.opendev.org/71260707:10
*** ociuhandu has joined #openstack-nova07:28
*** amoralej|off is now known as amoralej07:32
*** ociuhandu has quit IRC07:33
*** damien_r has joined #openstack-nova07:39
*** damien_r has quit IRC07:44
*** damien_r has joined #openstack-nova07:47
*** tetsuro_ has quit IRC07:49
*** tesseract has joined #openstack-nova07:51
*** jawad_axd has joined #openstack-nova07:51
*** maciejjozefczyk has joined #openstack-nova07:58
*** klindgren has quit IRC08:01
*** klindgren has joined #openstack-nova08:01
*** threestrands has quit IRC08:01
gibigood morning nova08:02
gibistephenfin: hi! dansmith +2 all over the qos remaining patches, could you check back to those? https://review.opendev.org/#/q/topic:bp/support-move-ops-with-qos-ports-ussuri08:03
gibistephenfin: the major change since you looked at is a compute service version check in the API08:04
gibito ensure the computes are on Ussuri version before we start moving the servers around as the feature needs support from the compute service08:05
gibidue to the PCI claim magic08:05
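A minimal sketch of the kind of gate gibi describes: refuse the move operation unless every compute service reports at least a required service version. The function name, the constant and the version number below are hypothetical placeholders for illustration, not nova's actual API or values.

```python
# Hypothetical placeholder value; the real minimum service version is not
# stated in this log.
MIN_COMPUTE_VERSION_FOR_QOS_MOVE = 100


def all_computes_support_qos_move(service_versions,
                                  minimum=MIN_COMPUTE_VERSION_FOR_QOS_MOVE):
    """Return True only if every compute service is at or above `minimum`.

    `service_versions` is an iterable of integer service versions, one per
    compute service. An empty deployment is treated as unsupported.
    """
    versions = list(service_versions)
    if not versions:
        return False
    # The oldest compute decides: one old service is enough to block the move.
    return min(versions) >= minimum
```

With this shape the API can reject the request early instead of failing later on a compute host that lacks the feature.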
*** slaweq has joined #openstack-nova08:06
*** portdirect has quit IRC08:08
*** kevinz has quit IRC08:08
*** guilhermesp has quit IRC08:09
*** fyx has quit IRC08:09
*** portdirect has joined #openstack-nova08:10
*** mnaser has quit IRC08:10
*** kevinz has joined #openstack-nova08:11
*** fyx has joined #openstack-nova08:11
*** mnaser has joined #openstack-nova08:11
*** rpittau|afk is now known as rpittau08:11
*** guilhermesp has joined #openstack-nova08:11
*** tkajinam has quit IRC08:12
*** lpetrut has joined #openstack-nova08:18
*** tosky has joined #openstack-nova08:21
*** zhanglong has quit IRC08:22
*** zhanglong has joined #openstack-nova08:27
*** ygk_12345 has joined #openstack-nova08:28
*** macz_ has joined #openstack-nova08:29
*** klindgren has quit IRC08:31
*** klindgren has joined #openstack-nova08:31
*** macz_ has quit IRC08:33
*** tetsuro has joined #openstack-nova08:34
*** klindgren has quit IRC08:38
*** klindgren has joined #openstack-nova08:39
*** tetsuro_ has joined #openstack-nova08:40
*** tetsuro has quit IRC08:43
*** ralonsoh has joined #openstack-nova08:53
*** xek has joined #openstack-nova08:55
openstackgerritLee Yarwood proposed openstack/nova master: gate: Ensure subnode is fenced during evacuation testing  https://review.opendev.org/71367408:55
nightmare_unrealhello is there specific channel for python-openstackclient08:56
brinzhang_It seems there is not, you can send to ML, and add [openstackclient] [your_project] tag09:00
brinzhang_nightmare_unreal09:00
*** psachin has quit IRC09:01
*** psachin has joined #openstack-nova09:02
*** yoctozepto has quit IRC09:07
*** klindgren has quit IRC09:10
*** ociuhandu has joined #openstack-nova09:10
*** klindgren has joined #openstack-nova09:11
*** yoctozepto0 has joined #openstack-nova09:16
*** ociuhandu has quit IRC09:22
*** yoctozepto9 has joined #openstack-nova09:27
*** yoctozepto0 has quit IRC09:27
nightmare_unrealthanks09:28
*** dtantsur|afk is now known as dtantsur09:39
*** jangutter has joined #openstack-nova09:40
*** tetsuro has joined #openstack-nova09:40
*** zhanglong has quit IRC09:42
*** tetsuro_ has quit IRC09:43
luyaodansmith, artom: Thanks for your comments on https://review.opendev.org/#/c/687856, I replied to them, and thanks to alex_xu for helping me clarify.09:43
fricklernightmare_unreal: brinzhang_: #openstack-sdks09:43
*** derekh has joined #openstack-nova09:43
luyaoartom: FYI. I also run your live migration functional tests on top of my patch, they all passed09:44
*** Liang__ has quit IRC09:45
*** ociuhandu has joined #openstack-nova09:54
*** tetsuro has quit IRC09:56
*** vishalmanchanda has quit IRC10:03
*** jangutter has quit IRC10:07
*** amoralej is now known as amoralej|afk10:13
*** kaisers_ has joined #openstack-nova10:16
*** mkrai_ has quit IRC10:19
openstackgerritArthur Dayne proposed openstack/nova master: libvirt:driver:Disallow AIO=native when 'O_DIRECT' is not available  https://review.opendev.org/68277210:20
*** macz_ has joined #openstack-nova10:20
*** vishalmanchanda has joined #openstack-nova10:22
openstackgerritArthur Dayne proposed openstack/nova master: libvirt:driver:Disallow AIO=native when 'O_DIRECT' is not available  https://review.opendev.org/68277210:22
*** macz_ has quit IRC10:25
*** mkrai_ has joined #openstack-nova10:30
openstackgerritMerged openstack/nova master: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71303510:30
*** mkrai has joined #openstack-nova10:32
*** mkrai_ has quit IRC10:35
*** tesseract has quit IRC10:36
*** yoctozepto9 is now known as yoctozepto10:37
*** tesseract has joined #openstack-nova10:42
*** tesseract has quit IRC10:46
*** tesseract has joined #openstack-nova10:48
*** ivve has joined #openstack-nova10:55
*** ociuhandu has quit IRC10:58
*** tesseract-RH has joined #openstack-nova11:00
lyarwoodwow it finally landed11:01
openstackgerritLee Yarwood proposed openstack/nova master: gate: Ensure subnode is fenced during evacuation testing  https://review.opendev.org/71367411:01
openstackgerritLee Yarwood proposed openstack/nova master: nova-live-migration: Ensure subnode is fenced during evacuation testing  https://review.opendev.org/71367411:03
lyarwood^ sorry just trying to get this some additional runs before review to ensure we don't need the VIF timeout again11:03
*** tesseract has quit IRC11:03
openstackgerritLee Yarwood proposed openstack/nova stable/train: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71383611:03
*** ociuhandu has joined #openstack-nova11:03
openstackgerritLee Yarwood proposed openstack/nova stable/stein: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71383711:04
*** sapd1_x has quit IRC11:04
*** ociuhandu has quit IRC11:04
*** ociuhandu has joined #openstack-nova11:05
openstackgerritLee Yarwood proposed openstack/nova stable/rocky: Replace ansible --sudo with --become in live_migration/hooks scripts  https://review.opendev.org/71383911:05
openstackgerritLee Yarwood proposed openstack/nova stable/rocky: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71384011:05
*** sapd1 has quit IRC11:05
*** nicolasbock has joined #openstack-nova11:06
kashyaplyarwood: Why sorry?11:07
kashyaplyarwood: You're doing the donkey work, people better be grateful ;-)11:07
kashyapI mean to say it in a _good sense_, BTW!11:08
lyarwoodthanks :)11:10
openstackgerritLee Yarwood proposed openstack/nova stable/queens: Stop setting glance_api_version in cinder.conf in nova-live-migration  https://review.opendev.org/71384311:10
openstackgerritLee Yarwood proposed openstack/nova stable/queens: Replace ansible --sudo with --become in live_migration/hooks scripts  https://review.opendev.org/71384411:10
openstackgerritLee Yarwood proposed openstack/nova stable/queens: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71384511:10
openstackgerritLee Yarwood proposed openstack/nova stable/pike: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71303611:15
openstackgerritLee Yarwood proposed openstack/nova stable/pike: Replace ansible --sudo with --become in live_migration/hooks scripts  https://review.opendev.org/71384811:15
*** dtantsur is now known as dtantsur|afk11:20
*** klindgren has quit IRC11:23
*** klindgren has joined #openstack-nova11:24
*** rpittau is now known as rpittau|bbl11:30
openstackgerritLee Yarwood proposed openstack/nova master: nova-live-migration: Ensure subnode is fenced during evacuation testing  https://review.opendev.org/71367411:34
brinzhang_dansmith: I have replied to your question in https://review.opendev.org/#/c/693828/11:34
*** ccamacho has quit IRC11:38
*** ccamacho has joined #openstack-nova11:44
*** ociuhandu has quit IRC11:50
*** udesale_ has joined #openstack-nova12:19
*** udesale has quit IRC12:20
*** ociuhandu has joined #openstack-nova12:22
openstackgerritJohannes Kulik proposed openstack/nova master: Don't recompute weighers' minval/maxval attributes  https://review.opendev.org/71386312:27
*** amoralej|afk is now known as amoralej12:28
*** PetrTuma has joined #openstack-nova12:28
openstackgerritLee Yarwood proposed openstack/nova stable/pike: Replace ansible --sudo with --become in live_migration/hooks scripts  https://review.opendev.org/71384812:28
openstackgerritLee Yarwood proposed openstack/nova stable/pike: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71303612:28
*** ociuhandu has quit IRC12:28
openstackgerritJohannes Kulik proposed openstack/nova master: Don't recompute weighers' minval/maxval attributes  https://review.opendev.org/71386312:34
*** PetrTuma has quit IRC12:39
*** jraju__ has joined #openstack-nova12:43
*** links has quit IRC12:44
*** PetrTuma has joined #openstack-nova12:46
*** sean-k-mooney2 has joined #openstack-nova12:53
openstackgerritMerged openstack/nova master: images: Allow the output format of qemu-img info to be controlled  https://review.opendev.org/70689812:53
*** sean-k-mooney2 is now known as sean-k-mooney12:53
*** rpittau|bbl is now known as rpittau12:59
*** jangutter has joined #openstack-nova13:07
*** nweinber has joined #openstack-nova13:07
*** mkrai has quit IRC13:09
lyarwoodsean-k-mooney: when you're around, PetrTuma had some questions about https://bugs.launchpad.net/nova/+bug/1763766 and stable/rocky13:12
openstackLaunchpad bug 1763766 in OpenStack Compute (nova) queens "nova needs to disallow resource consumption changes on image rebuild" [Medium,In progress] - Assigned to sean mooney (sean-k-mooney)13:12
sean-k-mooneylyarwood: i can take a look now13:12
PetrTumahello, hold on :)13:12
PetrTumaI'm wondering about state of https://bugs.launchpad.net/nova/+bug/1763766 in Rocky release, from the bug report it would seem Rocky should be affected by this (there is fix proposed to Rocky). I'm trying to reproduce the issue, but no matter what combination of flavors / images with different sets of properties I use, the resource consumption is13:12
PetrTumanot changed at all during rebuild (all properties on image I'm rebuilding to are ignored). This is follow up on https://bugs.launchpad.net/nova/+bug/1804502. We had to include the fix in our Rocky (disable the filter on rebuild), because we were having the issue similar to the one described in the bug report (i.e. rebuild failing on compute host13:12
openstackLaunchpad bug 1804502 in OpenStack Compute (nova) rocky "Rebuild server with NUMATopologyFilter enabled fails (in some cases)" [Medium,In progress] - Assigned to Lee Yarwood (lyarwood)13:12
PetrTumanearly filled up to its capacity by NUMA instances, because the NUMATopologyFilter returns empty set of hosts). I'm not sure what I might be doing wrong that I don't see the resource change during rebuild or did I miss some special condition?13:12
sean-k-mooneyPetrTuma: during a rebuild you should _NOT_ be able to change the resource usage13:13
*** lbragstad has joined #openstack-nova13:13
sean-k-mooneyso you should not be able to rebuild from an image with 1 numa node to 2 numa nodes13:14
sean-k-mooneyor from a pinned image to an unpinned one13:14
sean-k-mooneyso the fact that the topology you are rebuilding to is ignored is partly correct13:14
sean-k-mooneythe fix on master is to block any rebuild attempt that would alter the numa topology requested13:15
sean-k-mooneyso with this backported the expected behavior would be that the rebuild is rejected by the api13:15
sean-k-mooneywe can only fix https://bugs.launchpad.net/nova/+bug/180450213:16
openstackLaunchpad bug 1804502 in OpenStack Compute (nova) rocky "Rebuild server with NUMATopologyFilter enabled fails (in some cases)" [Medium,In progress] - Assigned to Lee Yarwood (lyarwood)13:16
sean-k-mooneyby addressing https://bugs.launchpad.net/nova/+bug/1763766 and explicitly blocking the case where the topology would change13:17
openstackLaunchpad bug 1763766 in OpenStack Compute (nova) queens "nova needs to disallow resource consumption changes on image rebuild" [Medium,In progress] - Assigned to sean mooney (sean-k-mooney)13:17
sean-k-mooneyin doing so we can disable the numa topology filter safely on rebuild13:17
sean-k-mooneyit is not correct to disable the numa topology filter on rebuild without blocking numa topology changes first13:17
PetrTumawell, that's the thing, if I try to rebuild instance with hw_numa_node=1 to image with hw_numa_node=2 I have same result, resources of the instance are unchanged (this is the case where I don't have the API change to block this in my nova)13:17
sean-k-mooneyPetrTuma: does ^ make sense13:17
sean-k-mooneyPetrTuma: yes that is because rebuild uses a noop claim13:18
sean-k-mooneyso the resources claimed on the host are not updated13:18
*** nicolasbock has quit IRC13:18
sean-k-mooneybut we considered it to be a bug to allow you to request a numa topology and not honour that request13:19
sean-k-mooneywhich is why we block it13:19
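A minimal sketch of the check sean-k-mooney describes: reject a rebuild when the new image would request a different NUMA topology than the current one. The property subset and function name are assumptions for illustration, and plain dicts stand in for image metadata; this is not the code of the actual fix.

```python
# Illustrative subset of image properties that affect NUMA topology or
# pinning; the real fix may consider more properties than these two.
NUMA_RELATED_PROPS = ("hw_numa_nodes", "hw_cpu_policy")


def rebuild_changes_numa_topology(current_props, new_props):
    """Return True if any NUMA-related property differs between images.

    Both arguments are plain dicts of image properties. A property missing
    from one image and present in the other counts as a change.
    """
    return any(
        current_props.get(key) != new_props.get(key)
        for key in NUMA_RELATED_PROPS
    )
```

An API layer could call this before accepting the rebuild and return an error when it is True, which matches the "rejected by the api" behaviour mentioned above.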
PetrTumayes, I understand that, I was trying to actually somehow recreate the issue, but so far I failed utterly13:19
sean-k-mooneywell recreating is simple you just boot a vm with 1 numa node and rebuild to 213:20
sean-k-mooneyon rocky without any backport that will result in the image stating the vm has 2 numa nodes but it will only have 113:20
sean-k-mooneyuntil you cold migrate and then it will change to two13:20
*** nicolasbock has joined #openstack-nova13:21
sean-k-mooneyif you live migrate it will schedule based on 2, claim 2 and pin to one.13:21
*** psachin has quit IRC13:21
sean-k-mooneyso live migration won't fix the inconsistency and will likely cause others13:21
PetrTumaah, so that's what I misunderstood13:22
PetrTumait's not the issue with rebuild itself, but any subsequent action that might be run on the instance13:22
sean-k-mooneyyes13:23
sean-k-mooneycold migrate and shelve/unshelve could fix the instance. as would evacuate i guess13:23
sean-k-mooneybut live migrate, hard reboot or rescue would not help13:23
PetrTumaok, thanks for clarification, I'm going to run some more tests then (I have the fix backported to my nova)13:24
sean-k-mooneysure let me know if you have any specific questions.13:24
sean-k-mooneywe technically could allow rebuild to change resource usage but it's invasive as we would have to alter rebuild to not use a noop claim and instead use a move claim. basically allowing a rebuild to change host13:25
sean-k-mooneylike a resize13:25
lyarwoodsean-k-mooney: https://review.opendev.org/#/c/713674/ is ready for review again btw, had to remember to escape a few things.13:26
*** derekh has quit IRC13:26
sean-k-mooneythat is not something that would be backportable and im not sure its something we want to do13:27
sean-k-mooneylyarwood: cool13:27
sean-k-mooneyso ya the virsh list was not evaluated on the target host13:27
sean-k-mooneyi was assuming it would not be13:27
sean-k-mooneylooks like the job passed which is a good sign13:28
lyarwoodyeah I haven't seen it fail yet since we stopped all of the devstack@* services on the subnode13:28
lyarwoodI also don't see the ceph errors in n-cpu on the primary host13:29
sean-k-mooneypresumably because the vm is actually stopped on the source host13:29
lyarwoodyeah13:29
lyarwoodjust confirming13:29
sean-k-mooneylyarwood: ya so i think that looks good13:31
openstackgerritStephen Finucane proposed openstack/nova master: libvirt: Use domain capabilities to get supported device models  https://review.opendev.org/66691513:32
openstackgerritStephen Finucane proposed openstack/nova master: Add transform_image_metadata request filter  https://review.opendev.org/66577513:32
sean-k-mooneyhttps://review.opendev.org/#/c/713342/1 isn't doing what i wanted it to. i might take a look at it again but i think i'm going to rebase it on top of your change13:33
sean-k-mooneyi'm seeing some port not found messages with it13:33
stephenfingibi, alex_xu: Addressed your comments in that disk image metadata series ^13:33
sean-k-mooneywhich makes me think it's doing more than removing the binding host13:34
gibistephenfin: ack13:34
alex_xustephenfin: got it13:35
alex_xusean-k-mooney: stephenfin just replied https://review.opendev.org/#/c/665775/25/nova/conf/scheduler.py@203, maybe I misunderstood sean-k-mooney's comment13:35
sean-k-mooneyalso just replied13:35
alex_xusean-k-mooney: yes, that is what I said13:36
stephenfinalex_xu: Yeah, you're correct. That's what I've said in the new patchset I think13:36
sean-k-mooneyyou are saying we should not set it to true until all host are upgraded yes13:36
alex_xuyes13:36
stephenfinIn addition to saying this only works with libvirt13:36
stephenfinfor now, anyway13:36
alex_xuat least not suggest to set as true13:36
sean-k-mooneyalex_xu: ya so it defaults to false for two reasons.13:37
sean-k-mooneyone not all virt drivers support it13:37
alex_xuah, right13:37
sean-k-mooneyand second because on upgrade you want to finish the upgrade before enabling it13:37
alex_xuack13:38
sean-k-mooneywe won't be able to change that default until all the in tree virt drivers support this13:38
sean-k-mooneyso it will probably always default to false13:38
sean-k-mooneyat least in the short to medium term13:38
alex_xuagree with that13:38
openstackgerritLee Yarwood proposed openstack/nova stable/pike: nova-live-migration: Wait for n-cpu services to come up after configuring Ceph  https://review.opendev.org/71303613:39
openstackgerritLee Yarwood proposed openstack/nova stable/pike: pike-only: remove broken non-voting ceph jobs  https://review.opendev.org/70007213:46
openstackgerritLee Yarwood proposed openstack/nova stable/pike: Init HostState.failed_builds  https://review.opendev.org/66953813:46
openstackgerritLee Yarwood proposed openstack/nova stable/pike: Mask the token used to allow access to consoles  https://review.opendev.org/70887613:46
openstackgerritLee Yarwood proposed openstack/nova stable/pike: Remove exp legacy-tempest-dsvm-full-devstack-plugin-nfs  https://review.opendev.org/70206113:46
openstackgerritLee Yarwood proposed openstack/nova stable/pike: Improve metadata server performance with large security groups  https://review.opendev.org/69752313:46
lyarwoodgibi / stephenfin: https://review.opendev.org/#/c/713674/ - can you take a look at this nova-live-migration fix today please.13:47
*** mkrai has joined #openstack-nova13:48
*** StevenK has quit IRC13:49
gibilyarwood: do you need to make sure that the stopped services started later? or the whole subnode will be rebooted?13:50
*** StevenK has joined #openstack-nova13:50
lyarwoodgibi: evacuation is the final thing that happens in the job at the moment13:50
sean-k-mooneygibi: we do the evacuation tests after all the others13:50
gibiOK. thanks. then it makes sense13:51
lyarwoodyeah the previous behaviour was to leave libvirt stopped on the subnode anyway13:51
lyarwoodI'm still planning to break this out into separate jobs btw13:51
*** ygk_12345 has left #openstack-nova13:51
lyarwoodJust want to get it stable again before I start with that13:51
sean-k-mooneyyep makes sense13:51
gibilyarwood: thank you for taking care of this. +213:52
lyarwoodnp it has stopped my stuff from landing so this is the least I could do ;)13:52
*** amoralej is now known as amoralej|lunch13:53
*** mkrai has quit IRC13:54
gibilyarwood: you are unblocking everyone and that is highly appreciated :)13:54
*** udesale_ has quit IRC14:00
*** derekh has joined #openstack-nova14:00
*** dklyle has joined #openstack-nova14:15
*** amoralej|lunch is now known as amoralej14:22
sean-k-mooneygibi: is the nova team meeting happening now by the way14:33
sean-k-mooneyoh it's later i need to update my calendar14:33
gibisean-k-mooney: each occasion moved to 16:00 UTC14:33
sean-k-mooneyyep14:33
sean-k-mooneyical file for anyone else http://eavesdrop.openstack.org/calendars/nova-team-meeting.ics14:37
*** jraju__ has quit IRC14:42
*** mriedem has joined #openstack-nova14:45
*** macz_ has joined #openstack-nova14:55
gibidansmith: I went through the cyborg patch series. left comments. I did not find any ground breaking problem.15:02
dansmithgibi: okay I haven't even looked at my reviews box yet this morning but will try to do that soon15:02
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Use virDomainBlockCopy to swap volumes when using -blockdev  https://review.opendev.org/69683415:03
gibidansmith: ack15:04
gmannsean-k-mooney: a bug related to flavor. id with name or vice versa are not unique. If we do some change in flavor things what we discussed yesterday, i think we can stop taking flavor id in request and always generate one- https://bugs.launchpad.net/nova/+bug/185801915:08
openstackLaunchpad bug 1858019 in OpenStack Compute (nova) "The flavor id is not limited when creating a flavor" [Wishlist,Triaged] - Assigned to Choi-Sung-Hoon (knu-cse)15:08
gmannbecause i do not think fixing 1858019 alone is worth for microversion bump15:08
sean-k-mooneygmann: im not sure about that15:08
gmannany specific reason for flavor id in request ?15:08
sean-k-mooneywell people often just use the id15:08
sean-k-mooneyi know we frequently do that in ci15:09
sean-k-mooneyso we know that the m1.nano flavor is 42 and that is just used in ci jobs15:09
sean-k-mooneygmann: the flavor id is always unique15:09
sean-k-mooneybut the name is not15:09
gmannname is also unique to each other but id with name is not15:10
sean-k-mooneyyes but you can have two flavors with the same name15:10
sean-k-mooneyyou cant have two flavors with the same id15:10
sean-k-mooneyright?15:10
gmannno, it will be 409 i think15:10
gmannin name case also15:10
sean-k-mooneyi would have to go check15:11
gmannlet me confirm quickly15:11
sean-k-mooneyi know we look for deleted status15:11
sean-k-mooneygmann: i guess we could stop taking the id in the new microversion i just know i often specify the id if i am using ci jobs or locally so i dont know how wide spread that is15:12
sean-k-mooneygmann: you are suggesting blocking it on flavor create right15:12
openstackgerritKevin Zhao proposed openstack/nova master: fix scsi disk unit number of the attaching volume when cdrom bus is scsi  https://review.opendev.org/71260715:12
sean-k-mooneynot for other operations15:12
gmannyeah, in create15:12
gmanncreate request only15:12
sean-k-mooneyhttps://github.com/openstack/nova/blob/master/nova/db/sqlalchemy/models.py#L415-L41815:13
sean-k-mooneyim not sure if we still use the instance_types table15:13
sean-k-mooneybut the unique constraint for that is just flavor id and deleted15:14
sean-k-mooneyoh and name15:14
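A small sketch of the uniqueness behaviour being discussed, using an in-memory SQLite table as a stand-in. The schema is an assumption modelled on the constraints mentioned above (flavor id with deleted, and name with deleted); it is not nova's actual model definition. It also shows the case from bug 1858019: nothing stops one flavor's name from equalling another flavor's id.

```python
import sqlite3


def make_flavor_table():
    """Create a toy flavors table with the two per-column unique constraints."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE instance_types (
            id INTEGER PRIMARY KEY,
            name TEXT NOT NULL,
            flavorid TEXT NOT NULL,
            deleted INTEGER NOT NULL DEFAULT 0,
            UNIQUE (flavorid, deleted),
            UNIQUE (name, deleted)
        )
        """
    )
    return conn


def try_create_flavor(conn, name, flavorid):
    """Insert a flavor; return False where an API would answer 409 Conflict."""
    try:
        conn.execute(
            "INSERT INTO instance_types (name, flavorid) VALUES (?, ?)",
            (name, flavorid),
        )
        return True
    except sqlite3.IntegrityError:
        return False


conn = make_flavor_table()
print(try_create_flavor(conn, "m1.nano", "42"))   # first flavor is accepted
print(try_create_flavor(conn, "m1.nano", "43"))   # duplicate name is rejected
print(try_create_flavor(conn, "m1.tiny", "42"))   # duplicate id is rejected
print(try_create_flavor(conn, "42", "m1.nano"))   # name/id cross-collision is accepted
```

Each column is unique on its own, so the last insert succeeds even though a lookup by "42" or "m1.nano" is now ambiguous between name and id.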
gmannonly thing will be create will generate the uuid. current  id in request can be just anything easy to remember like number of string15:15
gmannnumber or string15:15
sean-k-mooneywell it's meant to be an int or a uuid15:15
sean-k-mooneybut a sting can work15:16
sean-k-mooney*string15:16
sean-k-mooneythe idiomatic usage is uuid or int15:16
sean-k-mooneyi guess we would make it just a uuid if we auto generated it. i think that is what we do today correct15:17
gmannyeah, if people find uuid difficult to use than int15:17
gmannyes, https://github.com/openstack/nova/blob/master/nova/compute/flavors.py#L6915:17
sean-k-mooneyya if i set it i always set it to an int15:17
sean-k-mooneyif i dont set it i use the name15:17
sean-k-mooneythe only reason i have ever used the id is when it's shorter than typing the name15:18
sean-k-mooneyi guess since you can always use an older microversion if you need to set the id then i would be ok with the change15:18
gmannsame. i do not know why the bug reporter wants to give an existing flavor's name as id for a new one15:19
gmanni do not think that is normal usage that is why i marked that bug as wishlist but we could say would not fix or somethings.15:19
sean-k-mooneyso the microversion would make the flavor_id auto generated and then either make is_public mutable or remove is_public right?15:19
sean-k-mooneythose are the two changes you were suggesting combining?15:20
gmannyeah and anything else there in flavor side cleanup. other way is to do another single cleanup microversion we did in stein or train and combine other API cleanup which are worth to do.15:21
*** ratailor has quit IRC15:21
sean-k-mooneyya i think that makes sense. im not sure if there are other change we were thinking of for the flavor15:21
openstackgerritLee Yarwood proposed openstack/nova master: virt: Provide block_device_info during rescue  https://review.opendev.org/70081115:22
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Add support for stable device rescue  https://review.opendev.org/70081215:22
openstackgerritLee Yarwood proposed openstack/nova master: compute: Report COMPUTE_RESCUE_BFV and check during rescue  https://review.opendev.org/70142915:22
openstackgerritLee Yarwood proposed openstack/nova master: api: Introduce microverion 2.83 allowing boot from volume rescue  https://review.opendev.org/70143015:22
openstackgerritLee Yarwood proposed openstack/nova master: compute: Extract _get_bdm_image_metadata into nova.utils  https://review.opendev.org/70521215:22
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Support boot from volume stable device instance rescue  https://review.opendev.org/70143115:22
openstackgerritLee Yarwood proposed openstack/nova master: DNM - Test stable device rescue tests with BFV instances  https://review.opendev.org/71005015:22
sean-k-mooneywe have talked about things like removing rxtx_factor and vcpu_weight in the past15:22
sean-k-mooneyi think those are both for vmware but they should have been extraspecs15:22
gmannlet me find the etherpad of cleanup things if we have those already15:22
*** lpetrut has quit IRC15:23
sean-k-mooneythe disabled field also has some overlap with private/public https://github.com/openstack/nova/commit/f371198b843ba17ad6a6e4bc77a58afb006ab67715:23
gmannhttps://etherpad.openstack.org/p/nova-api-cleanup15:25
openstackgerritLee Yarwood proposed openstack/nova master: Add config option for neutron client retries  https://review.opendev.org/71222615:25
gmannyeah, only admin in access list make it disable15:26
sean-k-mooneyah its an internal field15:26
sean-k-mooneyi was looking at the client and did not see how to set it15:27
gmannI will add these two candidates in etherpad.15:28
sean-k-mooneyok. i assume this would all be done in Victoria too right15:29
sean-k-mooneye.g. we cant approve a spec for this at this point and this would not qualify for a bug fix right15:29
*** jawad_axd has quit IRC15:31
gmannyeah, not before Victoria. or i will say wait till W cycle also to collect more.15:32
openstackgerritKevin Zhao proposed openstack/nova master: fix unit test error block info on non x86 architecture  https://review.opendev.org/71390515:36
lyarwoodhmm so zuul doesn't kick a failing change out of the gate until all jobs have returned?15:36
*** mriedem has quit IRC15:37
gmannlyarwood: yeah.15:38
lyarwoodgmann: kk, this might take some time then :D15:38
sean-k-mooneyzuul will kick it out if you propose a new version15:39
lyarwoodit's not my change15:39
gmannone trick is if you have Depends-On then failure or change on Depends-On15:39
lyarwoodhttps://review.opendev.org/#/c/704759/ is first in the queue but failing15:40
lyarwoodI was just checking what zuul was going to do in this case15:40
gibilyarwood: yeah that is mine, failing on some neutron tempest test case :/15:41
sean-k-mooneylyarwood: is this related to your evacuate change15:41
*** jawad_axd has joined #openstack-nova15:42
*** vishalmanchanda has quit IRC15:43
lyarwoodsean-k-mooney: no, my change is below it15:44
lyarwoodactually that doesn't matter does it15:44
lyarwoodgibi: was that the subnet creation bug I linked in my email?15:44
* lyarwood looks15:44
gibilyarwood: there was some timeout but did not look deeper15:45
gibilyarwood: nvm, I looked at a different job15:46
lyarwoodhttps://4cc1d8431535199f530e-fcfc85786424b7f5a3fcadff9da35f94.ssl.cf2.rackcdn.com/704759/10/gate/grenade-py3/9dc93d2/logs/grenade.sh.txt - looks like grenade failed to update libvirt/qemu15:47
gibiyep15:47
*** spatel has joined #openstack-nova15:48
gibinova team meeting in 10 minutes on #openstack-meeting-315:50
*** spatel has quit IRC15:51
stephenfinCan someone != gibi take a look at this, the final remove-nova-network patch? https://review.opendev.org/#/c/703974/15:52
* gibi will look elsewhere :)15:52
stephenfinI don't want your kind round here :P15:52
gibinova meeting is running on #openstack-meeting-316:01
*** ociuhandu has joined #openstack-nova16:04
*** ociuhandu has quit IRC16:06
*** ociuhandu_ has joined #openstack-nova16:06
*** maciejjozefczyk has quit IRC16:10
*** yedongcan has left #openstack-nova16:15
openstackgerritStephen Finucane proposed openstack/nova master: libvirt: Fix unit test error block info on non x86 architecture  https://review.opendev.org/71390516:24
*** tesseract-RH has quit IRC16:31
stephenfinCI stability patch here looking for another core https://review.opendev.org/#/c/713611/16:46
*** TxGirlGeek has joined #openstack-nova16:47
openstackgerritMerged openstack/nova master: nova-live-migration: Ensure subnode is fenced during evacuation testing  https://review.opendev.org/71367416:48
openstackgerritMerged openstack/nova master: virt: Pass request context to extend_volume  https://review.opendev.org/70689916:48
lyarwoodstephenfin: nice!16:50
lyarwoodstephenfin: did you end up fixing that upstream?16:50
stephenfinYup, thankfully16:50
sean-k-mooneyso they accepted your change related to async stream handling?16:50
stephenfinMy original fix was correct. It just took a while to figure out why16:51
lyarwoodawesome16:51
stephenfinIt wasn't anything to do with async. Turns out 'read(N)' on anything inheriting from 'RawIOBase' doesn't guarantee N bytes16:52
stephenfinunlike 'BufferedIOBase'16:52
stephenfinlyarwood: is this a real issue? https://zuul.opendev.org/t/openstack/build/9dc93d24e9634d7e95ee6f2763b38613/log/logs/screen-n-cpu.txt#550616:54
stephenfintrying to figure out why https://review.opendev.org/#/c/704759/10 failed16:54
* lyarwood clicks16:57
stephenfinactually, nvm, it's not that. It's a simple 404 https://zuul.opendev.org/t/openstack/build/9dc93d24e9634d7e95ee6f2763b38613/log/logs/grenade.sh.txt#4654216:57
lyarwoodstephenfin: no that's fine16:57
lyarwoodstephenfin: yeah I spoke about it earlier, apt-get install failing right16:58
stephenfinyuup16:58
stephenfinfrom http://mirror.ord.rax.opendev.org/16:58
sean-k-mooneystephenfin: it returns up to N bytes, correct?16:58
stephenfinsean-k-mooney: correct16:59
sean-k-mooneystephenfin: i think that is the same behavior as the read syscall16:59
sean-k-mooneye.g. if you were to invoke it from C16:59
sean-k-mooneyso you have to check how many bytes you actually got16:59
stephenfinalso correct17:00
sean-k-mooneyi'm guessing that was causing the header or something to be truncated17:00
sean-k-mooneyor some other strange, hard-to-diagnose error17:00
sean-k-mooneyin any case i'm glad you managed to fix it17:01
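The short-read behaviour discussed above can be sketched in Python. This is a minimal illustration, not the fix stephenfin landed upstream: `read_exactly` and `ShortReader` are hypothetical names invented here, and `ShortReader` is just a toy `io.RawIOBase` subclass that hands back at most 3 bytes per call so the short reads are visible.

```python
import io


def read_exactly(stream, n):
    """Read exactly n bytes from stream, looping over short reads.

    Hypothetical helper: a raw stream's read(n) may return fewer than
    n bytes, so keep asking until the requested amount has arrived.
    """
    chunks = []
    remaining = n
    while remaining > 0:
        chunk = stream.read(remaining)
        if not chunk:
            # b'' means EOF; None (non-blocking stream with no data yet)
            # is also treated as an error here to keep the sketch simple.
            raise EOFError("stream ended with %d bytes missing" % remaining)
        chunks.append(chunk)
        remaining -= len(chunk)
    return b"".join(chunks)


class ShortReader(io.RawIOBase):
    """Toy raw stream that returns at most max_chunk bytes per read."""

    def __init__(self, data, max_chunk=3):
        super().__init__()
        self._buf = io.BytesIO(data)
        self._max = max_chunk

    def readable(self):
        return True

    def readinto(self, b):
        # RawIOBase.read(n) is implemented on top of readinto().
        chunk = self._buf.read(min(len(b), self._max))
        b[:len(chunk)] = chunk
        return len(chunk)


# A single read(8) is allowed to come back short; the loop is not.
short = ShortReader(b"abcdefghij").read(8)
full = read_exactly(ShortReader(b"abcdefghij"), 8)
```

Wrapping a raw stream in `io.BufferedReader` is the other usual way to get a `read(n)` that keeps reading until it has n bytes or hits EOF, which matches the `BufferedIOBase` contrast drawn above.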
efried_gonesean-k-mooney: re rootwrap, I think there's some vestigial code that we needed to keep because of a weird os-brick thing. I don't remember the details exactly. But we never figured out whether we could safely remove it, or how.17:03
*** adrianc_ has quit IRC17:06
*** adrianc has joined #openstack-nova17:06
sean-k-mooneyefried_gone: did we not remove all usage in nova itself17:10
sean-k-mooneyyou might be very well right about os-brick17:10
sean-k-mooneybut we merged all the patches in the remove root wrap series17:11
stephenfinsean-k-mooney: https://github.com/openstack/nova/tree/master/etc/nova/rootwrap.d/compute.filters17:12
efried_goneyeah17:17
*** gyee has joined #openstack-nova17:19
sean-k-mooneyright, so that is so it can launch its own privsep daemon, right17:20
sean-k-mooneyos-vif also launches its own privsep daemon and we don't have a rootwrap entry17:21
*** mlavalle has joined #openstack-nova17:23
*** evrardjp has quit IRC17:36
*** evrardjp has joined #openstack-nova17:36
sean-k-mooneyis there a reason that os-brick needs to have rootwrap to start privsep?17:40
*** lbragstad_ has joined #openstack-nova17:41
*** ociuhandu_ has quit IRC17:42
*** ociuhandu has joined #openstack-nova17:42
sean-k-mooneyok so it still uses rootwrap directly https://github.com/openstack/os-brick/blob/master/os_brick/privileged/rootwrap.py17:43
*** lbragstad has quit IRC17:44
sean-k-mooneystephenfin: so ya, i think nova no longer needs to have that rootwrap filter. it could be moved to os-brick, but basically if that is the only usage i stand by my comment that nova does not use rootwrap, os-brick does17:46
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Use virDomainBlockCopy to swap volumes when using -blockdev  https://review.opendev.org/69683417:47
sean-k-mooneyit looks like os-brick only has 4 usages of rootwrap remaining too17:47
*** ociuhandu has quit IRC17:48
sean-k-mooneyactually 317:49
sean-k-mooneyactually no, it's 7, i didn't scroll. i'm going to check if those need to be elevated or not17:50
sean-k-mooneyya so all of them pass run_as_root=true which means they actually use privsep to run them17:51
sean-k-mooneybecause we hit here https://github.com/openstack/os-brick/blob/9649f17228203186b523e400080a300f28b7e6ff/os_brick/privileged/rootwrap.py#L16917:52
sean-k-mooneyand execute_root is a privsep function https://github.com/openstack/os-brick/blob/9649f17228203186b523e400080a300f28b7e6ff/os_brick/privileged/rootwrap.py#L190-L19417:52
sean-k-mooneyit looks like oslo rootwrap was removed as an os-brick dependency in liberty too https://github.com/openstack/os-brick/commit/7d341f154397ef38c480bdd9cf6d2ccad6502f13 but https://github.com/openstack/os-brick/commit/dbf77fba1061cb4e93b3db5f8117d6ccc689f702#diff-0d141267b46cdfd7a9dfe6100d79fe33 removed it as an actual dependency in newton17:59
*** ociuhandu has joined #openstack-nova18:01
sean-k-mooneythe way os-brick is using privsep is not secure but i don't think it's using the filters at all18:01
*** derekh has quit IRC18:05
*** rpittau is now known as rpittau|afk18:06
*** jangutter has quit IRC18:20
*** ociuhandu has quit IRC18:27
*** ociuhandu has joined #openstack-nova18:28
*** ociuhandu has quit IRC18:33
openstackgerritLee Yarwood proposed openstack/nova stable/train: nova-live-migration: Ensure subnode is fenced during evacuation testing  https://review.opendev.org/71396118:42
openstackgerritLee Yarwood proposed openstack/nova stable/stein: nova-live-migration: Ensure subnode is fenced during evacuation testing  https://review.opendev.org/71396218:43
*** gyee has quit IRC18:44
*** mriedem has joined #openstack-nova18:48
*** gyee has joined #openstack-nova18:51
*** maciejjozefczyk has joined #openstack-nova19:02
artomdansmith, quick chat about https://review.opendev.org/#/c/705545/4/nova/tests/functional/integrated_helpers.py@370 ?19:03
*** jawad_axd has quit IRC19:04
dansmithartom: just back to my desk, have to do something first and then yeah19:10
artomdansmith, so, I looked around some more, thought some more, and I think I've come around to your way of thinking19:12
artomI gotta go walk the kiddos19:12
dansmithheh okay19:13
*** TxGirlGeek has quit IRC19:14
openstackgerritMerged openstack/nova stable/rocky: Fix os-keypairs pagination links  https://review.opendev.org/71189819:34
*** maciejjozefczyk has quit IRC19:40
*** maciejjozefczyk has joined #openstack-nova19:41
*** mlavalle has quit IRC19:49
*** bbowen has quit IRC19:57
*** bbowen has joined #openstack-nova19:58
*** mlavalle has joined #openstack-nova19:58
*** gyee has quit IRC20:03
*** maciejjozefczyk has quit IRC20:06
*** ralonsoh has quit IRC20:19
*** bbowen has quit IRC20:45
*** nweinber has quit IRC20:46
artomdansmith, hrmm, so in the end maybe talking about https://review.opendev.org/#/c/705545/4/nova/tests/functional/integrated_helpers.py@370 could help21:06
artomSo, places that redefine _setup_compute_service just don't inherit from the ServicesMixing21:07
artomI think I brought about some confusion when I called it the ServicesHelper mixin21:08
artomIt's really the PeriodicHelperMixin21:08
artomIe - you only get run_periodics in it21:08
*** ivve has quit IRC21:08
artomBut because run_periodics depends on self.computes, you get _start_compute as well, which populates self.compute21:08
openstackgerritMerged openstack/nova stable/train: Lowercase ironic driver hash ring and ignore case in cache  https://review.opendev.org/71373921:09
openstackgerritMerged openstack/nova stable/queens: Avoid circular reference during serialization  https://review.opendev.org/71313221:09
openstackgerritMerged openstack/nova stable/ocata: Add functional regression test for bug 1849409  https://review.opendev.org/69074621:09
openstackbug 1849409 in OpenStack Compute (nova) ocata "openstack server list --deleted --limit -1 hangs" [High,In progress] https://launchpad.net/bugs/1849409 - Assigned to Matt Riedemann (mriedem)21:09
openstackgerritMerged openstack/nova stable/ocata: Fix listing deleted servers with a marker  https://review.opendev.org/69074721:09
openstackgerritMerged openstack/nova stable/ocata: Fix incompatible version handling in BuildRequest  https://review.opendev.org/66045621:09
openstackgerritMerged openstack/nova stable/ocata: Hook resource_tracker to remove stale node information  https://review.opendev.org/67646721:09
*** tbachman_ has joined #openstack-nova21:13
*** tbachman has quit IRC21:13
*** tbachman_ is now known as tbachman21:13
openstackgerritmelanie witt proposed openstack/nova stable/stein: Lowercase ironic driver hash ring and ignore case in cache  https://review.opendev.org/71398221:14
openstackgerritMerged openstack/nova master: Switching new default roles in os-volumes-attachments policies  https://review.opendev.org/71019021:17
*** xek has quit IRC21:19
*** mriedem has left #openstack-nova21:32
*** ociuhandu has joined #openstack-nova21:33
melwittzzzeek: I've been looking at some debug logs in gate runs and I notice a lot of ROLLBACK logged, would you think it's concerning? https://zuul.opendev.org/t/openstack/build/b4f6a11e4b0940e0a9513f7113bfb95c/log/logs/screen-n-sch.txt#169121:48
*** PetrTuma has quit IRC21:48
zzzeekmelwitt: when a connection is done being used, it is returned to the pool where a ROLLBACK is emitted.  if you have lots of short single-statement transactions then you'd see a lot of these21:50
melwittzzzeek: oh, I see. thanks21:50
zzzeekmelwitt: these can be a problem performance wise21:50
zzzeekbut only if you're observing that21:51
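The reset-on-return behaviour zzzeek describes can be illustrated with a toy pool built on the standard-library `sqlite3` module. This is a sketch under stated assumptions, not SQLAlchemy's or oslo.db's implementation: `ToyPool` and its `rollbacks` counter are hypothetical names made up here to show why every checkout, however short, leaves one ROLLBACK behind.

```python
import sqlite3
from contextlib import contextmanager


class ToyPool:
    """Hypothetical connection pool that rolls back on every check-in."""

    def __init__(self, size=1):
        # Each in-memory connection is its own database, so the demo
        # below uses size=1 to keep hitting the same one.
        self._idle = [sqlite3.connect(":memory:") for _ in range(size)]
        self.rollbacks = 0

    @contextmanager
    def connect(self):
        conn = self._idle.pop()
        try:
            yield conn
        finally:
            # Reset on return: discard any uncommitted work so the next
            # user of this connection starts from a clean state.
            conn.rollback()
            self.rollbacks += 1
            self._idle.append(conn)


pool = ToyPool(size=1)

with pool.connect() as conn:
    conn.execute("CREATE TABLE t (x INTEGER)")

with pool.connect() as conn:
    # Uncommitted insert: thrown away by the rollback at check-in.
    conn.execute("INSERT INTO t VALUES (1)")

with pool.connect() as conn:
    row_count = conn.execute("SELECT COUNT(*) FROM t").fetchone()[0]
```

Three short checkouts produce three rollbacks, which is the pattern that shows up as a steady stream of ROLLBACK lines in a debug log full of single-statement transactions.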
melwittzzzeek: yeah ... not yet sure what we're seeing, but this is the file where we have the gate bug where we wait 60 seconds without getting rows back from the database. could be a bug on our side. but I'm studying the logs looking for anything that seems like it could cause trouble. what would be a way of avoiding the rollbacks? for the sake of experiment21:53
*** gyee has joined #openstack-nova22:00
zzzeekmelwitt: there's a parameter "rollback_on_return" to create_engine22:02
zzzeekoops, that's not it, one sec22:02
melwittrollback_reader_sessions ?22:03
zzzeekreset_on_return22:03
zzzeekhttps://docs.sqlalchemy.org/en/13/core/engines.html?highlight=reset_on_return#sqlalchemy.create_engine.params.pool_reset_on_return22:03
melwittoh k thanks22:03
zzzeekbut...it's dangerous to use this22:03
zzzeekin fact it can't actually work correctly, because there is always a transaction with MySQL InnoDB22:04
zzzeekso you will have very poor effects from using this22:04
*** tbachman has quit IRC22:04
zzzeekmelwitt: your only option is to try the NullPool and turn off pooling entirely22:04
zzzeekmelwitt: unfortunately oslo.db has no option for that so if you wanted to test it you'd need to monkeypatch oslo.db22:04
*** tbachman has joined #openstack-nova22:05
melwittoh. good to know. I wouldn't want to use it then. I was just curious if there was any good/recommended way to address the performance problem with rollbacks if there were one22:05
melwittit's weird. in the scheduler, there's this constant flow of service select/update/commit happening for the service heartbeats, and when things fail, all of the other selects (for services, compute_nodes) just disappear entirely22:08
melwittand all I see are the steady select/update/commit for the heartbeats22:08
*** ociuhandu has quit IRC22:09
*** ociuhandu has joined #openstack-nova22:10
openstackgerritMerged openstack/nova master: libvirt: Fix unit test error block info on non x86 architecture  https://review.opendev.org/71390522:10
*** lbragstad_ has quit IRC22:13
melwittohhh dang I actually see now that the debug logging I added in my DNM patch is _not_ showing up, meaning we're not even trying to query for compute_nodes. that's bad. /me keeps digging22:15
*** ociuhandu has quit IRC22:15
*** zigo has quit IRC22:22
*** bbowen has joined #openstack-nova22:27
zzzeekmelwitt: sorry I dont know that code very well, keep me in the loop :)22:31
zzzeek(dont know it at all really ... :)  )22:31
*** zigo has joined #openstack-nova22:31
melwittoh, derp, false alarm I think. I only put the debug statements in the master branch and this is a grenade job so if it failed on the old side, that would be stable/train which doesn't have the debug logging on it. I need to cherry pick this DNM to train again and run with Depends-On22:37
*** dpawlik has quit IRC22:38
openstackgerritmelanie witt proposed openstack/nova master: DNM: try to get some debug info for bug 1844929  https://review.opendev.org/70147822:42
openstackbug 1844929 in OpenStack Compute (nova) "grenade jobs failing due to "Timed out waiting for response from cell" in scheduler" [High,Confirmed] https://launchpad.net/bugs/184492922:42
openstackgerritmelanie witt proposed openstack/nova stable/train: DNM: try to get some debug info for bug 1844929  https://review.opendev.org/71311622:42
openstackgerritmelanie witt proposed openstack/nova master: DNM: try to get some debug info for bug 1844929  https://review.opendev.org/70147822:43
*** slaweq has quit IRC22:45
*** CeeMac has quit IRC22:50
*** macz_ has quit IRC23:01
*** spatel has joined #openstack-nova23:42
*** spatel has quit IRC23:46
*** CeeMac has joined #openstack-nova23:49