Wednesday, 2024-01-24

opendevreview Jie Song proposed openstack/nova master: hardware: Fix image_meta.id within get_mem_encryption_constraint  https://review.opendev.org/c/openstack/nova/+/904197 04:26
opendevreview Takashi Kajinami proposed openstack/nova master: GMR: Omit signal handleris in wsgi app  https://review.opendev.org/c/openstack/nova/+/906279 05:11
opendevreview Takashi Kajinami proposed openstack/nova master: GMR: Omit signal handleris in wsgi app  https://review.opendev.org/c/openstack/nova/+/906279 05:12
opendevreview Takashi Kajinami proposed openstack/nova master: GMR: Omit signal handlers in wsgi app  https://review.opendev.org/c/openstack/nova/+/906279 05:54
opendevreview Jie Song proposed openstack/nova master: hardware: Fix image_meta.id within get_mem_encryption_constraint  https://review.opendev.org/c/openstack/nova/+/904197 07:01
TK__ Hello team, I have an OpenStack environment with 4 compute nodes; however, yesterday, out of the blue, the instances on one of the compute nodes started shutting themselves down. Then I realized that the compute service was disabled when I checked in Horizon. I enabled it and restarted the instances and all was well, but I can't figure out what caused it. I have attached the nova_compute logs here in case someone would like to help 08:05
TK__ https://paste.openstack.org/show/bxqpgjvqT3DlOYWoeejc/ 08:05
bauzas TK__: 2024-01-23 08:27:49.761 6 INFO nova.compute.manager [req-d9713a02-2cfd-4503-9dd8-cce1a0db0c73 - - - - -] [instance: 6d26e7bc-363a-4d0d-90c4-8708332f9621] Destroying instance as it has been evacuated from this host but still exists in the hypervisor 08:28
bauzas TK__: looks like your compute somehow failed and you issued evacuations that were in progress 08:28
bauzas once the compute went back up, nova started to delete the rogue evacuated instances 08:29
TK__ We actually didn't evacuate them.... 08:33
*** LarsErik1 is now known as LarsErikP 08:35
bauzas TK__: you could try to look at the instance actions API 08:35
bauzas pick one instance UUID that's been shut down and see all its actions 08:36
TK__ I see it was evacuated, but I think it was done by Masakari and not by a user 08:41
bauzas doh, I was about to reply 08:47
bauzas TK_: (if you still read the logs) you can find the user/project IDs that issued the evacuations in the instance action API return 08:48
bauzas but tbc, if you're using some top-level API that manages instance HA, you should rather look at their logs to see what happened and you shouldn't IMHO force a restart of the compute instances, if they were evacuated 08:49
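A minimal sketch of the check bauzas describes: scan an instance's actions for evacuations and see which user/project issued them. It assumes the list has already been fetched from Nova's instance actions API (`GET /servers/{server_id}/os-instance-actions`) and decoded from JSON. The sample records and every ID in them are hypothetical, and the helper names `evacuations` and `initiators` are made up for illustration.

```python
from collections import Counter

# Hypothetical sample shaped like the "instanceActions" list in the API
# response; real user_id/project_id values are Keystone IDs, not names.
SAMPLE_ACTIONS = [
    {"action": "create", "request_id": "req-1111",
     "user_id": "alice", "project_id": "demo",
     "start_time": "2024-01-10T09:00:00.000000"},
    {"action": "evacuate", "request_id": "req-2222",
     "user_id": "masakari", "project_id": "service",
     "start_time": "2024-01-23T08:20:00.000000"},
    {"action": "start", "request_id": "req-3333",
     "user_id": "alice", "project_id": "demo",
     "start_time": "2024-01-23T09:10:00.000000"},
]


def evacuations(actions):
    """Return only the actions whose type is 'evacuate'."""
    return [a for a in actions if a.get("action") == "evacuate"]


def initiators(actions):
    """Count evacuations per (user_id, project_id) pair."""
    return Counter(
        (a.get("user_id"), a.get("project_id")) for a in evacuations(actions)
    )


if __name__ == "__main__":
    for (user, project), count in initiators(SAMPLE_ACTIONS).items():
        print(f"{count} evacuation(s) issued by user={user} project={project}")
```

If the user ID belongs to a service account (for example the one an instance-HA service runs as), that points at the HA service rather than a human operator, which matches what TK__ found.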
opendevreview Jie Song proposed openstack/nova master: hardware: Fix image_meta.id within get_mem_encryption_constraint  https://review.opendev.org/c/openstack/nova/+/904197 09:46
pslestang Hello team, do you know if there is a will to make this WIP (https://review.opendev.org/c/openstack/nova/+/508345) work in the cinderclient part of nova? We'd like to get it working to improve request reliability and use options such as connect_retries, connect_retry_delay and so on. 10:10
Roamer` sean-k-mooney, sorry we are only asking this after your zswap patches have been reviewed and merged (and thanks for your work BTW), but, hm, shouldn't configure_zswap() check for ENABLE_ZSWAP and not for ENABLE_KSMTUNED? I mean at https://review.opendev.org/c/openstack/devstack/+/890693/12/lib/host#40 10:16
Roamer` (FTR, this made the StorPool Cinder CI jobs fail because the z3fold kernel module is not loaded by default on Ubuntu Jammy... which might be a different problem related to the way we create our own images, we do not use nodepool-builder; we shall find a way to work around that, but it was kind of surprising that the Cinder jobs tried to enable zswap at all) 10:18
frickler Roamer`: oh, do you happen to be responsible for/related to the StorPool CI? we would really really like it to stop voting on devstack and other unrelated projects, where it only produces garbage. we were pretty close to actually disabling the account for it even 10:21
Roamer` frickler, wait, it is trying to vote?! argh 10:22
Roamer` sorry about that! 10:22
Roamer` yeah, okay, we will look into it ASAP 10:22
bauzas pslestang: that patch is very old, and we changed our opinions in between 10:22
frickler Roamer`: see for example the devstack patch you just mentioned 10:23
bauzas pslestang: now, we'd prefer to use sdk instead of cinderclient 10:23
bauzas pslestang: https://bugs.launchpad.net/nova/+bug/1996210 10:23
pslestang bauzas: ok understood thx 10:30
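To illustrate what options like connect_retries and connect_retry_delay are meant to buy, here is a generic, stdlib-only retry sketch. It is not the cinderclient, keystoneauth or openstacksdk implementation; the function name `call_with_connect_retries` and its defaults are assumptions made for this example, and only the two option names come from the discussion above.

```python
import time


def call_with_connect_retries(func, connect_retries=3,
                              connect_retry_delay=0.5, sleep=time.sleep):
    """Call func(), retrying on ConnectionError.

    connect_retries is the number of extra attempts after the first failure;
    connect_retry_delay is the fixed pause in seconds between attempts.
    The sleep callable is injectable so the pause can be stubbed in tests.
    """
    attempt = 0
    while True:
        try:
            return func()
        except ConnectionError:
            if attempt >= connect_retries:
                raise  # retries exhausted: surface the last error
            attempt += 1
            sleep(connect_retry_delay)


if __name__ == "__main__":
    calls = {"n": 0}

    def flaky():
        # Hypothetical request that fails twice before succeeding.
        calls["n"] += 1
        if calls["n"] < 3:
            raise ConnectionError("connection refused")
        return "ok"

    print(call_with_connect_retries(flaky, sleep=lambda _s: None))
    print("attempts:", calls["n"])
```

Only connection-level failures are retried here; errors returned by the remote service itself would propagate immediately.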
sean-k-mooney Roamer`: yes it should 10:48
sean-k-mooney Roamer`: i can put up a patch to fix that quickly, its just a silly copy-paste error 10:49
sean-k-mooney i have been testing this with everything turned on on 22.04 ubuntu cloud images however so im surprised it caused you issues 10:50
sean-k-mooney i also spot checked it on centos9 stream to test the rpm based distros 10:51
sean-k-mooney Roamer`: https://review.opendev.org/c/openstack/devstack/+/906504 that should fix it. until that is merged you can add ENABLE_KSMTUNED=false as a workaround 10:55
sean-k-mooney frickler: dansmith  ^ if ye have time for that fix please take a look 11:00
frickler sean-k-mooney: +3 already 11:00
sean-k-mooney :) thanks 11:01
bauzas gosh the scheduler manager UTs framework is sooo crazy, I'm buggy from the last hours 12:48
*** blarnath is now known as d34dh0r53 15:17
auniyal bauzas, gibi I understand all conf will be updated under nova/conf dir 15:29
auniyal but is there a way we can update this page as well https://docs.openstack.org/ocata/config-reference/compute/config-options.html 15:29
bauzas the conf doc page is automatically generated 15:29
bauzas you just don't have the right link :) 15:30
auniyal this is the first page I got when I googled for conf option 15:30
bauzas https://docs.openstack.org/nova/latest/configuration/config.html 15:30
auniyal yeah, this one I am aware of, and I understand whatever conf opt we want to update, update under nova/conf, but the link I shared is the first link that comes up when we google nova conf variables 15:33
auniyal so I think this should be updated too 15:33
auniyal or is someone else maintaining it? 15:33
bauzas auniyal: the link you refer to is some Ocata release doc 15:39
bauzas so that's an EOL release 15:40
auniyal yeah, and it's not opening for zed either, but it has a nice table format 15:40
bauzas you can get docs from EOL releases for sure (as people can still operate them) but people looking at docs should target the right release docs 15:41
bauzas auniyal: because we changed the docs structure somewhere around Ussuri-ish 15:41
auniyal it must be just a template, right? 15:41
auniyal okay, so updating like that may be a lot of backports 15:41
auniyal shall we do it? i think it's more informative/user friendly than https://docs.openstack.org/nova/latest/configuration/config.html 15:43
bauzas auniyal: you don't understand, we shouldn't touch the ocata documentation *at all* 16:05
bauzas if we were touching some stable docs, this would be due to some config backport we have in our code 16:05
bauzas but given ocata is EOL, this is now pristine doc 16:05
auniyal ack thanks bauzas for clearing it 16:06
bauzas it's not because we're bad at Google's SEO that we should tamper with our docs 16:06
opendevreview Amit Uniyal proposed openstack/nova master: enforce remote console shutdown  https://review.opendev.org/c/openstack/nova/+/901824 16:37
gibi bauzas: sean-k-mooney: the master revert of https://review.opendev.org/q/topic:%22power-mgmt-fixups%22 is finally grean 17:33
gibi green even 17:33
bauzas +2d 17:35
bauzas I can mark the series as a prio in the etherpad 17:35
bauzas (done) 17:36
gibi thanks 17:38
opendevreview Sylvain Bauza proposed openstack/nova master: Fix verifying all the alloc requests from a multi-create  https://review.opendev.org/c/openstack/nova/+/846786 18:26
opendevreview sean mooney proposed openstack/nova master: add initial healthcheck support  https://review.opendev.org/c/openstack/nova/+/825015 19:43
opendevreview sean mooney proposed openstack/nova master: add healthcheck manager to manager base  https://review.opendev.org/c/openstack/nova/+/827844 19:43
opendevreview sean mooney proposed openstack/nova master: add healthcheck tracker to nova context  https://review.opendev.org/c/openstack/nova/+/829468 19:43
opendevreview sean mooney proposed openstack/nova master: add healthcheck utils and constants  https://review.opendev.org/c/openstack/nova/+/829469 19:43
opendevreview sean mooney proposed openstack/nova master: add healthcheck endpoint to proxy commands  https://review.opendev.org/c/openstack/nova/+/830703 19:43
