Monday, 2024-03-18

<opendevreview> Sylvain Bauza proposed openstack/nova master: Add a Caracal prelude section  https://review.opendev.org/c/openstack/nova/+/912973 10:01
<opendevreview> Sylvain Bauza proposed openstack/nova master: Update contributor guide for 2024.2 Dalmatian  https://review.opendev.org/c/openstack/nova/+/913481 10:03
<bauzas> elodilles: the gate failed again for https://review.opendev.org/c/openstack/nova/+/899625 :( 10:06
<bauzas> elodilles: I rechecked it so I hope it will work now 10:06
<elodilles> ACK 10:25
<opendevreview> Merged openstack/nova master: libvirt: Cap with max_instances GPU types  https://review.opendev.org/c/openstack/nova/+/899625 12:31
<bauzas> sean-k-mooney: gibi: could you please look at https://review.opendev.org/c/openstack/nova/+/912973 ? 13:49
<bauzas> thanks 13:49
<artom> Uggla, I think after the weekend and sleeping on it, the least bad solution I could come up with, at least in the short term, is to add a separate helper that "filters out" that new cpu shared set field from dst_numa_info, and make my code use it 14:32
<artom> But medium term we need to get rid of the conditional do_cleanup RPC call 14:32
<artom> I think if we were to test an instance with mdevs or vpmem with Lee's functional test, it would fail as well 14:33
<artom> For the exact same reason: the presence of either one of those causes the cleanup to get called 14:33
<bauzas> artom: Uggla: because today is the last day for creating an RC1, I think we will punt the cpu_shared_set changes to Dalmatian :( 14:34
<bauzas> I'm just waiting for https://zuul.openstack.org/status#902084 + the prelude 14:34
<artom> Err, CPU power management doesn't work properly without it... 14:34
<artom> gibi's going to be very angry at you if we don't fix it entirely 14:35
<artom> ;) 14:35
<artom> I guess we can debate releasing it with a known issue (if you have different cpu_shared_set on source and dest and use libvirt CPU power management, not only does live migration not update the pins, but you might also end up on CPUs that are powered down and the instance just doesn't start)... 14:37
<bauzas> artom: Uggla: so, then tell me about https://review.opendev.org/c/openstack/nova/+/877773 14:37
<artom> IMO it's fixable with a relatively small and clean-ish (as much as you can be clean when the whole basic principle is a mad genius hack) adjustment 14:39
<dansmith> bauzas: will you +W that patch yourself? 14:40
<dansmith> ... or do you need another review? looks like it's 2x+2 already 14:42
<artom> Uggla seems to be AFK at the moment, so I can update it myself since we're in a time squeeze 14:42
<gibi> artom: bauzas: my powermgmt needs are targeting antelope. So if the cpu_shared_set fix only lands in D then that will not prevent the backport to antelope 14:42
<bauzas> dansmith: which change do you mean ? 14:42
<dansmith> I'm talking about 902084 14:42
<bauzas> dansmith: it's already accepted, but it's running on the gate 14:42
<dansmith> 902084 has no +W 14:43
<bauzas> https://zuul.openstack.org/status#902084 14:43
<bauzas> ?* 14:43
<dansmith> it's in check now 14:43
<dansmith> "WorkflowNo votes" 14:43
<bauzas> oh right 14:43
<artom> gibi, fair... when does master switch to D? Tomorrow after today's RC1? 14:43
<bauzas> shit 14:43
<bauzas> dansmith: no worries, I'll +W it 14:43
<dansmith> bauzas: cool 14:43
<bauzas> thanks btw. 14:44
<bauzas> artom: gibi: so you would want https://review.opendev.org/c/openstack/nova/+/877773 to be merged in C ? 14:44
<bauzas> fwiw, we're still missing a second core here https://review.opendev.org/c/openstack/nova/+/903706/2 14:45
<artom> bauzas, I think gibi's saying that as long as it gets backported to Antelope, he's happy, regardless of whether it's merged on master in C or D 14:45
<bauzas> my concern is that we couldn't backport https://review.opendev.org/c/openstack/nova/+/903706/2/nova/objects/migrate_data.py#246 14:45
<gibi> artom: yes, after RC1 we will have master open for D, but I guess bauzas wants to limit the changes on master until final RC to make sure any last minute change to C is easy to backport from master 14:46
<artom> bauzas, right, that's why there's a backportable hack below it in https://review.opendev.org/c/openstack/nova/+/877773/9 14:46
<bauzas> I just +2d both of the changes 14:46
<artom> bauzas, err, want to jump on a call for me to explain stuff? 14:47
<bauzas> guys, I told you about rc1 like 4 days ago 14:48
<bauzas> I wonder what you guys want 14:48
<gibi> I want to enable powermgmt in antelope :) 14:49
<gibi> backporting the fix from C or D does not matter for me 14:51
<dansmith> bauzas: I'm trying to decide if that cpu shared set thing is okay or not 15:13
<dansmith> I don't love deleting a non-versioned property and replacing it with a versioned one like it's doing 15:14
<dansmith> I'm trying to decide if it's actually a problem, or just _very_ confusing (the latter is definitely true, IMHO) 15:14
<dansmith> ah, basically what sean-k-mooney said on PS1 15:16
<bauzas> dansmith: gibi: artom: Uggla: okay, so now I understand more 15:18
<artom> dansmith, heh, bauzas and I were just on a call where I explained the nuances 15:18
<artom> I can do the same for you if you wish :) 15:18
<dansmith> bummer 15:18
<bauzas> so, basically, we already have a bug with live-migration for cpu_shared_set 15:18
<dansmith> yeah I get that 15:19
<dansmith> how far back does it need to be backported? 15:19
<bauzas> now with power management, you can live-migrate an instance that will use a wrong cpu which is a dedicated one, like before, but which will be off 15:19
<bauzas> my point is that I don't think it's a bugfix needed for RC1 15:20
<bauzas> what we can do instead is merge https://review.opendev.org/c/openstack/nova/+/877773 in Dalmatian and then backport it down to Antelope if we want 15:20
<bauzas> https://review.opendev.org/c/openstack/nova/+/903706/ won't be backported 15:21
<bauzas> (as it's a new object version) 15:21
<bauzas> so I'm just asking Uggla to modify https://review.opendev.org/c/openstack/nova/+/877773 to make sure it fixes the wrong functional test which is https://github.com/openstack/nova/blob/master/nova/tests/functional/regressions/test_bug_1899835.py 15:21
<bauzas> (the last assert is wrong in the regression bug) 15:22
* bauzas goes off for 20 mins 15:22
<bauzas> (school) 15:22
<gibi> elodilles: could you look at this b 15:28
<gibi> ... this backport series https://review.opendev.org/c/openstack/nova/+/913198 15:28
<elodilles> gibi: ACK, i'll try to look into it when i get there 16:03
<bauzas> gibi: dansmith: we need approval for the prelude patch https://review.opendev.org/c/openstack/nova/+/912973 16:03
<dansmith> ack 16:04
<bauzas> thanks 16:04
<gibi> elodilles: thanks 16:07
<dansmith> bauzas: done 16:22
<bauzas> cool thanks ! 16:22
* bauzas will rebase now 16:23
<opendevreview> Sylvain Bauza proposed openstack/nova master: Add a Caracal prelude section  https://review.opendev.org/c/openstack/nova/+/912973 16:26
<bauzas> dansmith: ^ (when you have time) 16:26
<opendevreview> Merged openstack/nova master: vgpu: Allow device_addresses to not be set  https://review.opendev.org/c/openstack/nova/+/902084 16:58
<zigo> bauzas: Typo on the last 3 lines: 17:18
<zigo> and yet another one another one (twice "another one") 17:18
<zigo> Live-mig of vGPU is a nice feature, thanks! 17:19
<zigo> Also, good thing for SEV. We were looking at implementing it, now I'll make sure we do that over Caracal! :) 17:20
<opendevreview> Merged openstack/nova stable/2023.2: Add cpuset_reserved helper to instance NUMA topology  https://review.opendev.org/c/openstack/nova/+/913193 18:35
<opendevreview> Merged openstack/nova stable/2023.2: Reproducer for not powering on isolated emulator threads cores  https://review.opendev.org/c/openstack/nova/+/913194 18:35
<opendevreview> Sylvain Bauza proposed openstack/nova master: Add a Caracal prelude section  https://review.opendev.org/c/openstack/nova/+/912973 18:59
<bauzas> dansmith: melwitt: could you please quickly approve the prelude above now so I can provide a new RC1 release patch version tonight? 19:00
<dansmith> bauzas: done 19:10
<bauzas> dansmith: melwitt: ta folks, as soon as it merges, I'll update the RC1 patch 19:17
<opendevreview> Merged openstack/nova master: Add a Caracal prelude section  https://review.opendev.org/c/openstack/nova/+/912973 19:46
<opendevreview> Merged openstack/nova stable/zed: Reproducer for bug 2003991 unshelving offloaded instance  https://review.opendev.org/c/openstack/nova/+/888093 22:15
