Friday, 2024-02-02

*** dtantsur_ is now known as dtantsur01:50
opendevreviewNobuhiro MIKI proposed openstack/nova master: libvirt: Support maxphysaddr.  https://review.opendev.org/c/openstack/nova/+/90751602:38
opendevreviewmelanie witt proposed openstack/nova master: libvirt: Configure and teardown ephemeral encryption secrets  https://review.opendev.org/c/openstack/nova/+/82675406:41
opendevreviewmelanie witt proposed openstack/nova master: imagebackend: Add support to libvirt_info for LUKS based encryption  https://review.opendev.org/c/openstack/nova/+/82675506:41
opendevreviewmelanie witt proposed openstack/nova master: Add encryption support to convert_image  https://review.opendev.org/c/openstack/nova/+/87093406:41
opendevreviewmelanie witt proposed openstack/nova master: Add hw_ephemeral_encryption_secret_uuid image property  https://review.opendev.org/c/openstack/nova/+/87093506:41
opendevreviewmelanie witt proposed openstack/nova master: libvirt: make <encryption> a sub element of <source>  https://review.opendev.org/c/openstack/nova/+/90551506:41
opendevreviewmelanie witt proposed openstack/nova master: Support create with ephemeral encryption for qcow2  https://review.opendev.org/c/openstack/nova/+/87093206:41
opendevreviewmelanie witt proposed openstack/nova master: Support (resize|cold migration) with ephemeral encryption for qcow2  https://review.opendev.org/c/openstack/nova/+/87093306:41
opendevreviewmelanie witt proposed openstack/nova master: Add encryption support to qemu-img rebase  https://review.opendev.org/c/openstack/nova/+/87093606:41
opendevreviewmelanie witt proposed openstack/nova master: Support snapshot with ephemeral encryption for qcow2  https://review.opendev.org/c/openstack/nova/+/87093706:41
opendevreviewmelanie witt proposed openstack/nova master: Support rebuild and unshelve with ephemeral encryption  https://review.opendev.org/c/openstack/nova/+/87093906:41
opendevreviewmelanie witt proposed openstack/nova master: Support rescue with ephemeral encryption  https://review.opendev.org/c/openstack/nova/+/87367506:41
opendevreviewmelanie witt proposed openstack/nova master: WIP Support live migration with ephemeral encryption  https://review.opendev.org/c/openstack/nova/+/90551206:41
opendevreviewmelanie witt proposed openstack/nova master: libvirt: Introduce support for raw with LUKS  https://review.opendev.org/c/openstack/nova/+/88431306:41
opendevreviewmelanie witt proposed openstack/nova master: libvirt: Introduce support for rbd with LUKS  https://review.opendev.org/c/openstack/nova/+/88991206:41
opendevreviewmelanie witt proposed openstack/nova master: DNM test ephemeral encryption + resize: qcow2, raw, rbd  https://review.opendev.org/c/openstack/nova/+/86241606:42
melwittsean-k-mooney: this is a checkpoint ^ there are so many changes already, I wanted to push them for now while I keep working on stuff06:43
opendevreviewMerged openstack/nova master: testing: Use inspect.isfunction() to check signatures  https://review.opendev.org/c/openstack/nova/+/88321706:51
opendevreviewFabian Wiesel proposed openstack/nova master: vmware: Integer division Python 2 -> 3 fix  https://review.opendev.org/c/openstack/nova/+/90744408:25
gibielodilles: when you are up, could you take a quick look at https://review.opendev.org/q/topic:%22power-mgmt-fixups%22+branch:stable/2023.1 thanks!08:29
elodillesgibi: ack, added to my TODO!09:24
gibielodilles: thanks a lot09:24
opendevreviewSylvain Bauza proposed openstack/nova master: Reserve mdevs to return to the source  https://review.opendev.org/c/openstack/nova/+/90420910:35
opendevreviewSylvain Bauza proposed openstack/nova master: Modify the mdevs in the migrate XML  https://review.opendev.org/c/openstack/nova/+/90425810:35
sean-k-mooneygibi: elodilles: im +2w on the 2023.1 gpu backports12:18
sean-k-mooneysorry not gpu backport power management backport12:22
sean-k-mooneyhttps://review.opendev.org/q/topic:%22power-mgmt-fixups%2212:22
sean-k-mooneymelwitt: ack, i'll deploy that in a 2 node environment and start testing it later today12:23
gibisean-k-mooney: thank you 12:26
elodillessean-k-mooney: ack, thank you o:)12:29
opendevreviewRajesh Tailor proposed openstack/nova master: Add support for showing requested az in output  https://review.opendev.org/c/openstack/nova/+/90456812:32
opendevreviewRajesh Tailor proposed openstack/nova master: Add support for showing requested az in output  https://review.opendev.org/c/openstack/nova/+/90456813:15
opendevreviewTakashi Kajinami proposed openstack/placement master: tox: Drop envdir  https://review.opendev.org/c/openstack/placement/+/90759014:41
opendevreviewTakashi Kajinami proposed openstack/nova master: tox: Drop envdir  https://review.opendev.org/c/openstack/nova/+/90759114:42
opendevreviewTakashi Kajinami proposed openstack/osc-placement master: tox: Drop envdir  https://review.opendev.org/c/openstack/osc-placement/+/90759614:49
*** blarnath is now known as d34dh0r5314:53
opendevreviewTakashi Kajinami proposed openstack/os-vif master: tox: Drop envdir  https://review.opendev.org/c/openstack/os-vif/+/90760414:57
noonedeadpunkhey folks. I have something weird going on. In a brand new (still not production) environment I've accidentally spotted, that one of computes is not mapped to the cell15:29
noonedeadpunkso `nova-manage cell_v2 list_hosts` and `openstack compute service list --service nova-compute` differ by 1 compute not added to cell15:29
noonedeadpunk`nova-manage cell_v2 discover_hosts` does not discover it15:30
noonedeadpunkno errors or weird output is seen15:32
noonedeadpunkOr well, scheduler does have `Host mapping not found for host compute01-az2. Not tracking instance info for this host.` -> that's exactly the host that's missing from the cell15:32
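
The mismatch described above (a compute that has a service record but no host mapping) can be checked mechanically once both listings are in hand. A minimal sketch, assuming the host names have already been extracted from the two command outputs; apart from compute01-az2, the host names below are hypothetical placeholders:

```python
def unmapped_computes(service_hosts, mapped_hosts):
    """Return hosts that have a nova-compute service but no cell mapping."""
    return sorted(set(service_hosts) - set(mapped_hosts))


# Hypothetical data: only compute01-az2 is named in the discussion.
service_hosts = ["compute01-az1", "compute01-az2", "compute01-az3"]
mapped_hosts = ["compute01-az1", "compute01-az3"]

print(unmapped_computes(service_hosts, mapped_hosts))
```

Any host this returns is a candidate for the symptom reported here.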
noonedeadpunkonly 1 cell is present, just in case15:34
noonedeadpunkhm... seem like running nova-manage cell_v2 discover_hosts --cell_uuid <UUID> --by-service did discover the host....15:41
noonedeadpunkand it was `--by-service` specifically15:42
noonedeadpunkI can recall now somebody already suggested running by-service previously :)15:43
bauzasnoonedeadpunk: as a reminder, when adding a new compute, you need to use nova-manage for telling which cell you should use for it15:45
bauzashence why you can't find it15:45
noonedeadpunkwell, we have `discover_hosts_in_cells_interval` set 15:46
noonedeadpunkand I was trying to run without --by-service 10 times by now15:47
noonedeadpunkalso running nova-manage cell_v2 discover_hosts --cell_uuid UUID was just giving out "Found 0 unmapped computes in cell: UUID"15:47
noonedeadpunkso eventually I was expecting compute to be auto-discovered...15:49
noonedeadpunk(like all others were)15:49
bauzasah ok15:55
bauzasmaybe this was a bug then 15:56
sean-k-mooneynoonedeadpunk: do you have the same hostname in multiple cells or perhaps is it in a different cell than the others16:27
noonedeadpunksean-k-mooney: no, not that I'm aware of - that was the actual output: https://paste.openstack.org/show/bOGJJWefzMBvhKkdHkiZ/16:32
sean-k-mooneyhum ok ya thats odd16:34
sean-k-mooneythe names all look unique16:34
sean-k-mooneyand they are all in cell 116:34
noonedeadpunkyeah, and there was no weird db records either16:34
sean-k-mooneyi think there is a verbose mode for discover_hosts but now that its fixed im sure it wont reproduce16:35
noonedeadpunkI can imagine there could be some connectivity issues in the region as it's heavily under development... but I've restarted all schedulers and the compute that was not found, without any result16:35
noonedeadpunkverbose was not helpful fwiw16:36
sean-k-mooneyand it worked fine when you mapped it with --by-service16:36
noonedeadpunkhttps://paste.openstack.org/show/bx7p6C37Qv8LHym7Dviq/16:36
noonedeadpunkyup16:36
sean-k-mooneythe urllib3 thing can be ignored by the way 16:36
sean-k-mooneywe should probably remove that warning16:36
noonedeadpunkor well. I ran --by-service without verbose... So I kinda assume it's what worked, as nothing else was done in between....16:36
sean-k-mooneyit was fixed a long time ago16:36
noonedeadpunklike that's the output that was in between basically when I found it fixed: https://paste.openstack.org/show/bvFhJpJyhSv0NYn6WHaF/16:38
sean-k-mooneyi guess if you see this again try logging with --by-service and verbose and let us know if there is anything interesting16:38
noonedeadpunkyeah, sure...16:39
noonedeadpunkbut potentially by-service makes sense, as compute was in service... so I assume it got mapped from there then16:39
noonedeadpunkbut yeah...16:39
sean-k-mooneyi dont remember how that works off the top of my head so ya im not sure 16:41
sean-k-mooney... i forgot when doing multi node devstack you need to sync the data dir to make the ssl ca available if you dont turn that off16:45
* sean-k-mooney has not done multi node devstack by hand in about 2 years. i should probably just use my ansible roles...16:46
sean-k-mooneybasically i need to do https://opendev.org/openstack/devstack/src/branch/master/roles/sync-devstack-data/tasks/main.yaml or the compute wont be able to talk to the controller because the ssl cert will be rejected16:48
noonedeadpunkit's osa aio, so SSL should be taken care of nicely17:19
sean-k-mooneyim not refering to osa17:20
* noonedeadpunk can't even recall how to do devstack manually17:20
sean-k-mooneyi have a repo where i reuse the zuul job playbooks and roles to deploy multi node devstack17:20
noonedeadpunkyeah, ok, gotcha, I guess I just missed the context17:21
sean-k-mooneyhttps://github.com/SeanMooney/ard/blob/6abfcae59013165404ab38ec80fa143a1c96b86a/ansible/deploy_multinode_devstack.yaml17:21
sean-k-mooneyright now im trying to figure out how to fix the ssl issues manually but might swap to ansible17:22
sean-k-mooneybut i havent used my repo in 2 years so i dont know if it still works17:22
sean-k-mooneyi used to just turn off tls when doing multi node to not need to copy the self signed certs17:23
sean-k-mooneythats all that is currently failing17:23
clarkbsean-k-mooney: I think you just need to copy the ca dir. IIRC devstack doesn't centralize the ca. It makes one and then copies it so each host can use it directly17:37
sean-k-mooneyya i did that and it didnt update the ca on the compute17:37
sean-k-mooneythe zuul jobs just copy it to the subnode before it stacks17:38
sean-k-mooneyso im not sure why its different17:38
sean-k-mooneyperhaps permissions but im using the same user on both so im not sure17:38
sean-k-mooneylrwxrwxrwx 1 root root     49 Feb  2 17:02  devstack-int.pem -> /usr/local/share/ca-certificates/devstack-int.crt17:44
sean-k-mooneylrwxrwxrwx 1 root root     50 Feb  2 17:02  devstack-root.pem -> /usr/local/share/ca-certificates/devstack-root.crt17:44
sean-k-mooneythose are broken symlinks so i think if i just remove them17:44
sean-k-mooneyand stack it might fix itself17:44
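
Dangling links like the two listed above can be found without eyeballing `ls -l` output. A minimal sketch, assuming the directory to scan is passed in by the caller (the log does not say which directory holds the `.pem` links, so none is hard-coded here):

```python
import os


def broken_symlinks(directory):
    """Return sorted names of symlinks in `directory` whose targets are missing."""
    broken = []
    for name in os.listdir(directory):
        path = os.path.join(directory, name)
        # islink() inspects the link itself; exists() follows it to the target.
        if os.path.islink(path) and not os.path.exists(path):
            broken.append(name)
    return sorted(broken)
```

Calling `broken_symlinks(path_to_cert_dir)` on the compute node would list the stale entries before removing them; `path_to_cert_dir` is a placeholder for whichever directory the listing above came from.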
sean-k-mooneyah i missed one17:52
sean-k-mooneyi also need /opt/stack/data/devstack-cert.pem17:52
sean-k-mooneyi copied /opt/stack/data/ca-bundle.pem and /opt/stack/data/CA/17:53
sean-k-mooneynow it stacked fine17:53
sean-k-mooneymelwitt: https://termbin.com/ba59 less successful this time18:10
sean-k-mooneythats using https://review.opendev.org/c/openstack/nova/+/88991218:11
sean-k-mooneyso it looks like there is an error here https://review.opendev.org/c/openstack/nova/+/873675/22/nova/virt/libvirt/blockinfo.py#40518:14
sean-k-mooneymy guess is you removed a function or did not do a git add at the right time when rebasing18:15
bauzassean-k-mooney: noonedeadpunk: fwiw, I'm getting the same issue with my own multinode devstack I just installed18:34
sean-k-mooneydiscover hosts worked fine for me18:34
bauzasdiscover_hosts only works if I use --by-service18:34
sean-k-mooneyi had no issue with it using melwitt's series18:35
sean-k-mooneyi wasnt technically running master but close enough18:35
bauzasthis was bizarre, the first compute service was working but I wasn't having any host_mappings values18:37
bauzasoh wait, I found why https://paste.opendev.org/show/br3ghOvqs7NRQAfY80HX/18:41
sean-k-mooneyoh you hit the centos libvirt bug18:41
sean-k-mooneyya so libvirt has been borked on centos for 2 days18:41
bauzasso the service ref was created but not the compute node one18:41
bauzassean-k-mooney: which one ?18:42
sean-k-mooneythat unicode error18:42
sean-k-mooneyUnicodeDecodeError: 'utf-8' codec can't decode byte 0x91 in position 0: invalid start byte18:42
sean-k-mooneythat's from libvirt18:42
sean-k-mooneywe have a cix for the next gen installer18:42
sean-k-mooneywe started seeing that in ci 2 days ago18:42
sean-k-mooneyhttps://issues.redhat.com/browse/OSPCIX-17618:43
sean-k-mooneythat should be public for everyone ?18:43
clarkbit asks me to login18:44
sean-k-mooneyok the ci escalation project must be private...18:44
sean-k-mooneythe ci job is running in rdo based on github patches https://review.rdoproject.org/zuul/build/5cf66b0bd0b6402faaa3306d3b193f8118:45
bauzasthe RHEL bug is talking about the VPD issue, but I don't have this hardware18:47
sean-k-mooneyhttps://logserver.rdoproject.org/27/127/e7992da92f6f67327ffbf593a64b712e36b04cc6/github-check/tcib-podified-multinode-edpm-deployment-crc/5cf66b0/controller/ci-framework-data/logs/192.168.122.100/log/containers/nova/nova-compute.log18:47
sean-k-mooneyi commented that it looks like the same vpd parsing issue18:47
sean-k-mooneybut it might not be18:47
sean-k-mooneybasically the libvirt api is returning non unicode bytes18:48
sean-k-mooneyso we can't decode them into a string properly18:48
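
The decode failure itself is easy to see in isolation. A minimal sketch using only the byte value quoted in the traceback; the actual bytes libvirt returned are not shown in this log, so the single-byte input is an assumption:

```python
def try_decode(raw):
    """Attempt a strict UTF-8 decode; return (ok, text_or_error_message)."""
    try:
        return True, raw.decode("utf-8")
    except UnicodeDecodeError as exc:
        return False, str(exc)


# 0x91 is a UTF-8 continuation byte, so it cannot begin a character.
ok, detail = try_decode(b"\x91")
print(ok, detail)

# Lossy fallback, useful only for inspecting what came back.
print(b"\x91".decode("utf-8", errors="replace"))
```

The strict decode fails with the same "invalid start byte" message, which is consistent with the string being malformed before any nova code touches it.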
sean-k-mooneywe are calling self.get_connection().getType()18:48
sean-k-mooneywhich is exploding in the libvirt python module18:49
sean-k-mooneywhen it calls libvirtmod.virConnectGetType(self._o)18:49
sean-k-mooneyso that's a libvirt or libvirt python bug18:49
bauzasthat said I had no problem with the other compute18:49
bauzaslemme look if I use the same version between the two computes18:50
sean-k-mooneyif you're not, can you record which one works18:50
bauzasnope, I use the same version :(18:51
bauzasthat's bizarre18:51
sean-k-mooneybauzas: can you comment on the downstream bug tracker18:52
sean-k-mooneylets see if we can get some of our virt team folks to have a look on monday before it starts impacting the upstream gate18:53
bauzasthe CIX one ?18:53
sean-k-mooneyya18:53
bauzascool18:53
sean-k-mooneyso it sounds like it sometimes works and sometimes does not18:53
bauzasactually, I can confirm that the other compute got the same exception, hence why I had no compute node18:59
sean-k-mooneyoh ok so both failed18:59
sean-k-mooneymelwitt: i fixed the missing function but then it failed elsewhere on the compute with AttributeError: 'Qcow2' object has no attribute 'disk_encryption'19:00
sean-k-mooneyin disk.libvirt_info()19:01
bauzasapparently, now the call works for the first node19:01
sean-k-mooneymelwitt: ill try and take a look at your code again on monday if you can fix the rebase issues 19:02
sean-k-mooneybauzas: so its flaky?19:02
sean-k-mooneyi.e. on the same node it sometimes works and sometimes does not19:02
bauzasactually, when directly calling libvirt with the python binding, I have the problem19:03
bauzas(for the host that now works)19:03
bauzashttps://paste.opendev.org/show/blEFSbHapI8eCPEMRgyY/19:03
sean-k-mooneyif you had a standalone reproducer that would probably help19:05
sean-k-mooneyi.e. just a python script that imports libvirtmod and calls virConnectGetType19:05
bauzasthat's what I did19:05
sean-k-mooneyoh you didnt use the nova code19:05
sean-k-mooneyok well if you can reproduce just with the libvirt module in a short script19:06
sean-k-mooneyit will be a lot simpler to report this to the libvirt folks19:06
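
A short standalone script along those lines might look like the following. This is only a sketch, not the script bauzas ran: the `qemu:///system` URI and both helper names are assumptions, and it goes through the public `libvirt` module rather than calling `libvirtmod` directly:

```python
def probe_connection_type(open_connection):
    """Open a connection, call getType(), and report what happened."""
    conn = open_connection()
    try:
        return ("ok", conn.getType())
    except UnicodeDecodeError as exc:
        return ("decode-error", str(exc))
    finally:
        conn.close()


def open_libvirt():
    # Imported lazily so the probe above can be exercised without libvirt.
    import libvirt

    # Assumed URI; adjust for the hypervisor on the host under test.
    return libvirt.openReadOnly("qemu:///system")
```

Running `print(probe_connection_type(open_libvirt))` in a loop on an affected compute would show whether the failure is intermittent on a single host, with no nova code involved.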
sean-k-mooneywe dont have  a libvirt bug for this yet19:06
bauzasjust added a comment19:07
bauzaslemme see if virsh gets that too19:08
sean-k-mooneyim not sure what the virsh equivalent command would be19:08
bauzasat least domcapabilities doesn't say anything19:17
melwittsean-k-mooney: ugh, ok, sorry about that :(19:18
bauzasok I need to leave19:19
bauzaseventually I was able to start both of the services19:20
bauzasbut we still have the bug19:20
melwittsean-k-mooney: every time I splice commits apart I manage to f something up 🙄 I'll get it fixed. thanks for trying it out19:20
sean-k-mooneybauzas: i filed a bug with the libvirt folks19:20
bauzascool19:21
* bauzas goes off for the weekend \o19:21
sean-k-mooneymelwitt: its fine that happens to me too19:24
artomdansmith, I think with your stable node UUID series https://bugs.launchpad.net/nova/+bug/1817833 is fixed, no?19:24
sean-k-mooneymelwitt: i usually end up going commit by commit and running a subset of tox -e py3,functional,pep8 19:25
artomI happened upon its functional reproducer doing the downstream backport...19:25
melwittsean-k-mooney: yeah. I do that too ... most of the time 😒 19:28
dansmithartom: um, not sure19:29
dansmithartom: they seem to be complaining that they can't delete the compute to get it to have a new identity and my thing doesn't really fix that, it just means nova-compute will fail to start19:30
sean-k-mooneyartom: dansmith my reading was maybe in some specific cases as a side effect19:30
dansmithif you reset the uuid it will probably be right back to this problem when the name on the RP conflicts19:30
sean-k-mooneyso in the functional tests the reproducer might be impacted because the agent will fail to start or similar19:31
dansmithright, it doesn't solve the problem (if you agree it's a problem) it just refuses to fail at the place it did before, and does so earlier (at start)19:31
sean-k-mooneyya that was my feeling but i didnt fully parse the bug details19:32
artomI wasn't really pushing one way or another, just wanted to check that we don't have stable bug reports laying around19:33
artom(I mean, I'm sure we do, but if we can close _one_... ;)19:33
opendevreviewMerged openstack/nova stable/2023.1: Revert "[pwmgmt]ignore missin governor when cpu_state used"  https://review.opendev.org/c/openstack/nova/+/90567419:34
*** priteau_ is now known as priteau21:44