Wednesday, 2024-03-20

opendevreviewMerged openstack/osc-placement master: Update master for stable/2024.1  https://review.opendev.org/c/openstack/osc-placement/+/91262302:44
opendevreviewMerged openstack/os-vif master: Update master for stable/2024.1  https://review.opendev.org/c/openstack/os-vif/+/91262002:45
opendevreviewMerged openstack/python-novaclient master: Update master for stable/2024.1  https://review.opendev.org/c/openstack/python-novaclient/+/91262702:48
opendevreviewMerged openstack/placement master: Update master for stable/2024.1  https://review.opendev.org/c/openstack/placement/+/91335003:58
opendevreviewMerged openstack/nova master: Update master for stable/2024.1  https://review.opendev.org/c/openstack/nova/+/91363304:00
*** mklejn_ is now known as mklejn07:55
opendevreviewsean mooney proposed openstack/nova master: Test live migration between hosts with differnet cpu_shared_sets  https://review.opendev.org/c/openstack/nova/+/91374409:55
sean-k-mooney[m]spatel_: no snapshot is just of the root disk.10:03
sean-k-mooney[m]not memory10:04
*** blarnath is now known as d34dh0r5312:49
*** mklejn__ is now known as mklejn13:03
sean-k-mooneydansmith: just an fyi i added you to this anyway https://review.opendev.org/c/openstack/nova/+/913744 but that is proply executing Uggla migration change and testing it in a way we i think can merge and keep with little to no downside13:21
sean-k-mooneyits proably a converstaion for the ptg but i think there are a limited subset of whitebox test we may want to conser adding to exsitng jobs. for example we have a non voting barbican-tempest-plugin-simple-crypto job today.13:23
sean-k-mooneyi would like to either make that voting or have a diffent voting barbican job to test the ephmeral encyption series when taht is complete but i would alos like to enable the whitebox vtpm test in that job too. we dont need to enabel all of whitebox but it hink we can enable select tests in specific jobs if we agree that is desireable13:24
sean-k-mooneyi would like to add a whitebox tempest test to validate Uggla change at some point (i.e. checking the xml is updated correctly) and enable that in the live migration job if peopel were ok with that.13:26
sean-k-mooneygiven whitebox is an offical deliverable of the qa project i dont think that is out of place but if we were to take that step we would certenly need to ensure that whitebox does not break teh nova gate going forward.13:27
sean-k-mooneyto that end if have also been explofing using the rdo zuul instance to instead execute the whitebox tests as a third party ci for nova13:27
sean-k-mooneyi proably wont have time to create a poc of that between now and the ptg13:28
sean-k-mooneybut im going to explore that option regardless of of we do decied to integrate a subset of whitebox in the first party ci or not13:29
dansmithI'm not sure what you're trying to say... this tests mix of shared/pinned that you said we don't want to support anyway, or some other scenario?13:30
sean-k-mooneydansmith: another senairo13:30
sean-k-mooneyspecificly i want a test for two host with diffen cpu_shared_set and numa or cpu pinning13:31
sean-k-mooneyi.e. vanaill vm where the compute node just happen to have diffent cpu_shared_sets perhaps because the host have diffent number of cpus13:31
sean-k-mooneyin fact just to make that extra clear im going to update the patch to remove cpu_dedicated_set 13:32
dansmithI thought we already had that because we flip the shared/dedicated set between the two?13:32
sean-k-mooneyin whitebox we partly do, we just dont have any test that asserts the xml was updated for non numa instnaces13:34
opendevreviewMerged openstack/nova master: Correctly reset instance task state in rebooting hard  https://review.opendev.org/c/openstack/nova/+/86783213:34
sean-k-mooneywe do live migrations with non numa instance we jsut dont check the xml13:34
dansmithah13:34
sean-k-mooneyso i just want to add a dedicated test with a vanial flaovr that checks as a follow up13:34
sean-k-mooneyi did in manually in the modifed ci job13:35
sean-k-mooneybut we can automate that13:35
dansmithwe should do that in a test though and not with a different job right? or is that what you mean by "automate that"13:38
dansmith?13:38
sean-k-mooneyyes not a diffent job13:38
sean-k-mooneyjust a tempest testcase13:38
dansmithack13:41
opendevreviewMerged openstack/os-vif master: Update python classifier in setup.cfg  https://review.opendev.org/c/openstack/os-vif/+/90461313:54
opendevreviewMerged openstack/os-vif master: tox: Drop envdir  https://review.opendev.org/c/openstack/os-vif/+/90760413:57
opendevreviewsean mooney proposed openstack/nova master: Test live migration between hosts with differnet cpu_shared_sets  https://review.opendev.org/c/openstack/nova/+/91374414:09
sean-k-mooneyupdate ^ to make it clear this is testing non numa instances for the vanilla case where the cpu_share_set just happens to be different on each host14:10
dansmithsean-k-mooney: this is modifying our live migration job, but we don't run whitebox there right? so you're thinking this will cause us to run over the code in question and would fail if not on top of the fix?14:16
sean-k-mooneyim tempeted to push a second patch to test numa live migration in one of the other jobs but i wont for now. that litrally just need us to modify the flavor to have hw:mem_page_size=small but im not sure if we currently rely on memory over allcation in theyse job. i belvie we dont given the default for initial_ram_allocation_ratio is 1.0  so it should just work perhaps it worth14:16
sean-k-mooneya try as it would be nice to get that coverage effectivly for free14:16
sean-k-mooneydansmith: it does make use run the code in questio. it will not fail unless the code breaks live migratoin14:16
sean-k-mooneybut it not asserting that the xml is update14:17
sean-k-mooneywhich is why i did that by hand by inspecting the logs14:17
dansmithoh right, because it won't fail it will just be wrong14:17
dansmithright, okay sorry14:17
sean-k-mooneyit would have caught if the int("of random string") was actully exploding14:17
dansmiththe caffeine absorption is slow this morning14:17
sean-k-mooneyya so if we dint have the same number of cpus in both hosts adn we were using one on the source that was not on the dest it would fail14:18
sean-k-mooneybut that is not the case here14:18
sean-k-mooneyit woudl just be runnign on the wrong core 14:18
sean-k-mooneyit woudl get fixed on teh next hard reboot of the vm14:18
sean-k-mooneywhat happens today fi the core does not exist is we get an error for the call to libvirt to start the migration and rollback gracefuly and mark the migration as failed14:21
sean-k-mooneythe same thing happens if you have the power managemnt feature enabeld and the core is offline14:22
sean-k-mooneyqemu check for that and reports an error to libvirt which we then get and we rollback in the same way as if the cpu were not compatiable14:23
opendevreviewMerged openstack/nova stable/2024.1: [stable-only] Update .gitreview for stable/2024.1  https://review.opendev.org/c/openstack/nova/+/91363115:43
opendevreviewMerged openstack/nova stable/2024.1: [stable-only] Update TOX_CONSTRAINTS_FILE for stable/2024.1  https://review.opendev.org/c/openstack/nova/+/91363215:43
zigoWho's trying to use systemctl in unit tests?!?16:19
zigohttps://buildd.debian.org/status/fetch.php?pkg=nova&arch=all&ver=2%3A29.0.0~rc1-1&stamp=1710857860&file=log16:19
zigoNot a good idea, IMO. Or at least, this has to be explicited somewhere.16:20
clarkbthe upstream unittest jobs should remove sudo powers before running the tests. Unprivileged systemctl isn't the worst thing?16:20
zigoIt's not that...16:21
zigoFileNotFoundError: [Errno 2] No such file or directory: 'systemctl'16:21
zigoThere's no systemd in the Debian buildd chroots ...16:21
zigoPlus WTF is it even trying to do?16:21
clarkbits checking if the system is running systemd and assumes if systemd is present then so is systemdctl16:23
zigo        sysdout, sysderr = processutils.execute("systemctl",16:23
zigo                                                "is-system-running",16:23
zigo                                                check_exit_code=[0, 1])16:23
zigoWell, no it's not, thanks ... :)16:23
clarkbdoes seem like the sort of check that could catch exceptions and then mark no to systemd16:24
* zigo will probably open a bug tomorow, now is time for ... a beer ! :)16:24
clarkbzigo: there must be some systemd due to the checks just above that statement.16:24
clarkbso I don't think it is accurate that the build chroots have no systemd16:24
clarkbif psutil.Process(1).name() == "systemd" or os.path.exists(SYSTEMCTL_CHECK_PATH):16:24
zigoI could add a depends on systemd...16:25
zigoNot sure if that will help though.16:25
zigoIt does build fine on my laptop, just not on buildd.16:25
zigoThat's kind of the worse situation ! :)16:25
zigoI probably will end up blacklisting yet another bunch of tests ...16:25
clarkbI think changing that or to an and may fix it for you16:27
opendevreviewMerged openstack/nova master: Fix tipo  https://review.opendev.org/c/openstack/nova/+/90720816:52
opendevreviewribaudr proposed openstack/nova master: Fix: Live migrating to a host with cpu_shared_set configured will now update the VM's configuration accordingly.  https://review.opendev.org/c/openstack/nova/+/90370616:54
opendevreviewribaudr proposed openstack/nova master: Test live migration between hosts with differnet cpu_shared_sets  https://review.opendev.org/c/openstack/nova/+/91374416:54
opendevreviewsean mooney proposed openstack/nova master: enable numa live migration in the ceph job  https://review.opendev.org/c/openstack/nova/+/91384219:34
opendevreviewsean mooney proposed openstack/nova master: enable numa live migration in the ceph job  https://review.opendev.org/c/openstack/nova/+/91384219:37
opendevreviewsean mooney proposed openstack/nova master: imporve nova object logging  https://review.opendev.org/c/openstack/nova/+/89805721:14
opendevreviewsean mooney proposed openstack/nova master: refactor numa claims  https://review.opendev.org/c/openstack/nova/+/89805621:18
sean-k-mooneyartom: ^ those are my patches i sstarted when debuging the numa aware vsiwtch issue as an fyi21:24
sean-k-mooneyi just inverted the order so the logging patch is first and the refactor patch is second21:24
sean-k-mooneymelwitt: ^ i would prefer to use my logging patch instead of https://review.opendev.org/c/openstack/nova/+/907976 by the way just because its more systemtic and general21:25
sean-k-mooneyit would be nice to have your review feedback on that when you have time21:25
opendevreviewsean mooney proposed openstack/nova master: enable numa live migration in the ceph job  https://review.opendev.org/c/openstack/nova/+/91384221:30
artomsean-k-mooney, ack. I've got a couple of appointment on Friday, but between them, if you're up for it, I'll load context from you21:35
sean-k-mooneysure 21:50
melwittsean-k-mooney: ack, ok21:51

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!