Monday, 2025-01-13

opendevreviewTakashi Kajinami proposed openstack/os-resource-classes master: Fix outdated envlist  https://review.opendev.org/c/openstack/os-resource-classes/+/93901804:55
opendevreviewTakashi Kajinami proposed openstack/python-novaclient master: Remove environment for Python 3.8  https://review.opendev.org/c/openstack/python-novaclient/+/93902205:04
opendevreviewTakashi Kajinami proposed openstack/nova master: Drop environment for Python 3.8  https://review.opendev.org/c/openstack/nova/+/93902705:11
opendevreviewHemanth N proposed openstack/nova master: Add memory stats to compute monitor  https://review.opendev.org/c/openstack/nova/+/93904405:37
opendevreviewHemanth N proposed openstack/nova master: Add memory stats to compute monitor  https://review.opendev.org/c/openstack/nova/+/93904406:08
opendevreviewHemanth N proposed openstack/nova master: Add memory stats to compute monitor  https://review.opendev.org/c/openstack/nova/+/93904407:18
opendevreviewHemanth N proposed openstack/nova master: Add memory stats to compute monitor  https://review.opendev.org/c/openstack/nova/+/93904407:48
gibibauzas: when you are up. I would like to formally ask for a spec freeze exception for https://review.opendev.org/c/openstack/nova-specs/+/938910 08:40
gibiThis is a spin off of the vTPM live migration spec08:40
gibibut the impact is moved out of it as a) it has API impact b) it is independently useful feature08:41
gibiwe already have plenty of core approval on it so it does not seems to controversial08:41
gibithe assignee of the impl is a questionmark today but just because we want to give the opportunity to auniyal or ratailor to pick it up. But if it does not fit to their timeline for E then I can be the fallback to push the implementation. Fortunately it is a small impact one08:43
gibimore context is in the comment thread https://review.opendev.org/c/openstack/nova-specs/+/938843/1#message-c65ec478107012ab4c141bd79812528a74f7761d08:44
auniyalhey gibi, can you please have a look at  https://bugs.launchpad.net/nova/+bug/2093869/, nova-place placement bug09:10
ratailorgibi, I could try to do that. 09:10
gibiratailor: cool. Thanks for stepping up09:11
gibiauniyal: looking09:12
opendevreviewRajesh Tailor proposed openstack/nova master: Add support for showing finish_time  https://review.opendev.org/c/openstack/nova/+/92893309:38
gibiauniyal: so this is a system with PCI in Placement enabled and after a while, even if the devstack sits idle the compute starts failing with the conflict exception 09:46
gibiauniyal: you wrote that "If we recreate resource provider, then new once does not have DISK_GB resource-class09:53
gibi"09:53
gibidoes it mean you stopped the compute, deleted the RP tree, and then started the compute?09:53
gibidid you have VMs running on this compute?09:54
gibido you just enabled PCI in Placement or also have some devices configured via [pci]device_spec config option?09:56
auniyalnot in right now- but I  encountered the same issue, earlier (do not this VM-setup anymore )09:56
auniyalthat time, I had VM, which was failing to start usin nova cmd, - on checking compute logs I saw this error.09:56
auniyalthat time, I teventually endup deleting R-P and creating new, 09:56
auniyal> enabled PCI in Placement or also have some devices configured via [pci]device_spec config option?09:57
auniyalno09:57
auniyalboth time, there is no PCI changes09:57
auniyalor any update in any service confs, the system is as same as how devstack deploy it09:58
auniyalonly thing is, I have ceph enabled for storage 09:59
opendevreviewDmitriy Chubinidze proposed openstack/nova master: Adding link for RabbitMQ installation during nova deployment on controller node.  https://review.opendev.org/c/openstack/nova/+/93870209:59
auniyal>>> not in right now- but I  encountered the same issue, earlier (do not this VM-setup anymore )10:03
auniyal meant, I do not have that setup anymore, and this is a new setup having similar issue10:03
gibiauniyal: so the error is coming from https://github.com/openstack/nova/blob/a459467899d2b406aa8cf530ae481255eaf3c957/nova/compute/resource_tracker.py#L1360-L1370 Interstingly the comment in the code contradicts with the actual InventoryInUse exception in the logs. So I think the assumption that all the InventoryInUse exception from update_from_provider_tree is realted to PCI in Placement is wrong10:17
gibiso that exception translation needs to be fixed to only react on real exceptions from the PCI in Placement codepath and do not react on other InventoryInUse exceptions. 10:18
gibiStill I'm wondering why the exception is raised in you env10:18
gibicould you replace the raise exception.PlacementPciException(error=str(e))10:19
auniyalyes, I was surprised seeing PlacementPciException !10:19
gibiline with a raise e10:19
gibiand try to start up the compute10:19
auniyalack, please give a min10:20
auniyalgibi, https://paste.openstack.org/show/bqQZYiVU8NA5PdRxn7Rm/10:23
auniyalcompute went uo though, in compute service list10:24
auniyalhttps://paste.openstack.org/show/bL5nGTkXQjNgVMltv6fL/10:25
auniyalsame msg isstill coming on compute restart10:26
gibiyepp10:28
gibiso there are two separate issues at least 1) we made that exception a hard fail since PCI in Placement as we too eagerly translate the exception 2) for some unknown reason in you env nova-compute cannot update the disk inventory at startup10:29
gibiauniyal: could you put your nova-cpu.conf in paste.openstack.org?10:34
auniyalhere https://paste.openstack.org/show/bqpnfND7kpB29US0UA6P/10:37
gibithanks10:37
gibiOK so this is a compute with rbd configured10:39
auniyalcan't create new VM - https://paste.openstack.org/show/b2TQPF1lLwbFj3QZsaeC/10:40
gibifor the 2) I assume, but no proof yet, that for some reason the compute tries to modify the total DISK_GB total inventory to a value that is smaller than the current usage in placement causing placement to reject the change10:40
gibiin the last log you pasted, it seems that the compute try to report an inventory without any DISK_GB meaning that any existing DISK_GB inventory is deleted, and placement rejects that as there is still usage from the RC10:42
gibiis this a devstack that built with rbd in the first place or was it built with local disk first and then reconfigured with rbd on after?10:43
auniyalno reconfiguration, only ceph/rbd10:44
gibihttps://github.com/openstack/nova/blob/a459467899d2b406aa8cf530ae481255eaf3c957/nova/virt/libvirt/driver.py#L9456-L9473 this is the code that causes that DISK_GB is reported. 10:48
gibiso for some reason disk_gb is 0 there10:49
gibibut you could try to prove that by adding some extra logs around that code10:49
gibithat disk_gb is filled in https://github.com/openstack/nova/blob/a459467899d2b406aa8cf530ae481255eaf3c957/nova/virt/libvirt/driver.py#L8306-L8307 in case of rbd10:50
gibibased on the info from the rbd pool10:50
gibiso you should check what the rbd reports for the vms pool10:53
auniyalhttps://paste.openstack.org/show/b6WyabNygyB5hiTBUGdN/10:58
auniyalin between delimeters -- ##10:58
gibiyepp that proofs that compute tries to remove the DISK_GB inventory in placement due to getting 0 total available disk from rbd11:04
gibiso you should check your the vms rbd ceph pool in you deployment11:05
gibiwhy it is reporting 0 disk11:05
auniyalsudo rbd ls volumes hangs11:14
auniyalgibi - https://paste.openstack.org/show/bzYaK4id1g94RyidGpcU/11:16
auniyalno keyring found11:17
auniyalhttps://paste.openstack.org/show/bfgQbk0ZCHeDPzszszSj/11:22
gibiOK. so you have some ceph issues. 12:13
gibiI suggest to ping storage folks to get help debugging it12:14
gibiI filed a separate bug for the missleading error message conflating DISK_GB with PCI in Placement https://bugs.launchpad.net/nova/+bug/209387912:26
gibibauzas: could you check my ping from this morning regarding the spec freeze exception?12:30
opendevreviewBalazs Gibizer proposed openstack/placement stable/2023.2: Add round-robin candidate generation strategy  https://review.opendev.org/c/openstack/placement/+/93894712:33
opendevreviewBalazs Gibizer proposed openstack/nova master: [CI][nova-next]test with placement ac breadth-first  https://review.opendev.org/c/openstack/nova/+/93727512:41
bauzassorry I missed your ping12:43
bauzasgibi: I just +Wd your spec12:45
gibibauzas: thanks12:45
opendevreviewMerged openstack/nova-specs master: Image properties in server show  https://review.opendev.org/c/openstack/nova-specs/+/93891012:57
opendevreviewMerged openstack/nova-specs master: vTPM live migration  https://review.opendev.org/c/openstack/nova-specs/+/93677513:15
auniyalgibi ack, thanks13:23
PrzemekKHow to connect encrypted volume (LUKS) to different VM ? Is it just detach/atach ?13:30
sean-k-mooneyPrzemekK: that depend if the vm is in the same project and the user that issues the attach hass acesses to the encyption secret then i think that shoudl work13:34
sean-k-mooneydetach for encycped vs non encypted volumes is the same more or less13:34
opendevreviewTakashi Natsume proposed openstack/nova-specs master: Create specs directory for 2025.2 Flamingo  https://review.opendev.org/c/openstack/nova-specs/+/93909113:35
PrzemekKok. Same project / user admin. I want backup via Commvault and strange error that unable create snapshot. It creates snapshot and attach it to backup machine13:42
sean-k-mooneyso an admin cannot retrive the secret from barbican, but if the admin user that is doign the backup is added as a member of the project. a generic user with the admin cannot access secrets stored in barbican14:15
sean-k-mooney* is added as a member of the project it shoudl be able to retrive it.14:16
sean-k-mooney*with the admin role cannot ...14:16
sean-k-mooneyPrzemekK: by the way commvaults backup soltuion is a purly out of tree impelmeation and may or may not be supproted by your openstack vendor or work with encypted volumes14:17
sean-k-mooneyif it relies on attaching the volume to a backup vm for exampel that wont work with Boot form volume guests14:18
dansmithsean-k-mooney: did we have a bug for the iso+gpt thing or was it just something we stumbled upon? I can't find it in the bug tracker if so14:36
sean-k-mooneywe have a bug for generic iso support and after we got that working a new oslo.utils releasel with the gpt issue was cut14:37
sean-k-mooneyi can see if i can find the other bug if you want to at least track it as related14:37
dansmithI found the generic iso problem, but it's way old14:37
dansmithI mean, predates and is unrelated to the gpt thing14:38
sean-k-mooneythe multi format issue came out of https://review.opendev.org/c/openstack/nova/+/90961114:38
dansmithright14:38
sean-k-mooneyso i have comemnted on  https://bugs.launchpad.net/nova/+bug/205444614:38
sean-k-mooneythat iso supprot in general was only partly broken i.e. isos that supprot block device booting stilll worked14:39
sean-k-mooneyso its not really the same thing but they are releated14:39
sean-k-mooneyi dont know if you want to file a spereate bug or not14:39
dansmithyeah related by virtue of us causing that fix another issue14:39
dansmithjust wondering what to use for a related-bug on my fix patch.. I can file one with basically the details from the comments on the other patch,14:40
dansmithbut it will be pretty much just paperwork at that point14:40
sean-k-mooneydansmith: the intoduction fo the gpt inspctor came out of the ironic cve is that correct14:40
dansmithno14:40
sean-k-mooneyoh ok i tought it was motivated by that orginally.14:40
dansmithit came out of the original thing, to get us to stop using raw for two things, it just wasn't critical for the qcow fix of course14:40
sean-k-mooneydansmith: paperwork wise i would just tack it however you find simplest unless bauzas has a specific request of for a dedicate bug14:41
dansmithyeah, so I was just going to say:14:41
dansmithbauzas: this fix needs to get in: https://review.opendev.org/c/openstack/nova/+/931833 we noticed it because it caused a problem for the fix of another bug, but we never filed a bug for this specifically,14:41
dansmithsince it was just on the heels of landing the initial gpt detection support14:42
bauzasdansmith: context ?14:42
dansmithso if we need paperwork, please say, but if not, could you review?14:42
bauzaswant me to review ?14:42
bauzasack, ok14:42
bauzasI already added myself for reviewing it indeed14:42
dansmiththanks14:42
dansmithI've got a tangled web of reviews for disk inspection across several projects and I need to start reducing that before I lose it :)14:43
opendevreviewBalazs Gibizer proposed openstack/nova master: [CI][nova-next]test with placement ac breadth-first  https://review.opendev.org/c/openstack/nova/+/93727514:53
*** ykarel_ is now known as ykarel15:02
sean-k-mooneygibi: ^ fyi see comment inline, im pretty sure peers is not the correct group15:02
sean-k-mooneyim gong to see if i can quickly update https://review.opendev.org/c/openstack/nova/+/933365 to adress the review feedback. if i cant get that to a mergable state today however im goign to have to ask someone else to take it over to unblock the requirements patch15:04
gibisean-k-mooney: thanksh15:04
gibisean-k-mooney: thank15:04
gibis15:04
opendevreviewBalazs Gibizer proposed openstack/nova master: [CI][nova-next]test with placement ac breadth-first  https://review.opendev.org/c/openstack/nova/+/93727515:06
opendevreviewsean mooney proposed openstack/nova master: [eventlet] update nova tests for eventlet 0.37.0  https://review.opendev.org/c/openstack/nova/+/93336515:34
opendevreviewsean mooney proposed openstack/nova master: [eventlet] update nova tests for eventlet 0.38.2  https://review.opendev.org/c/openstack/nova/+/93336515:35
sean-k-mooneyhberaud ^ i think that might be enough. the issue we had with the funcitoal test seam to have been resolved with the oslo.log release as well15:36
opendevreviewDr. Jens Harbott proposed openstack/nova master: DNM: Test eventlet bump  https://review.opendev.org/c/openstack/nova/+/93887915:38
sean-k-mooneyfrickler: i rebased ^ to confirm if my other change is enogh but it works locally so i suspect it is15:47
sean-k-mooneylocally im emulating the test env by doing " .tox/functional/bin/python3 -m pip install -U -c https://opendev.org/openstack/requirements/raw/commit/27ded6c22e5f1ea9a16c8d3dabbd7fad72775f00/upper-constraints.txt -r requirements.txt -r test-requirements.txt eventlet"15:48
sean-k-mooneythat the url to the uc bump patch15:48
MengyangZhang[m]sean-k-mooney: Hey sean, are you planning to talk about this https://review.opendev.org/c/openstack/nova-specs/+/932653 on Thur? I see it listed on the agenda of Thu's cinder-nova meeting. Just want to give a quick update about. I already brought it up to cinder last week and they gave the green light since there was no cinder changes needed. 16:43
MengyangZhang[m]I was wondering what's next step regarding submitting the proposed code changes. Since the spec freeze has passed I heard, would it be possible to follow the specless workflow? Happy to discuss it in the next nova meeting.16:43
sean-k-mooneyMengyangZhang[m]: so technially we are past the spec approval deadline for this release. it was last thrusday17:35
sean-k-mooneyso procedurely this would need a spec freeze excption. it might be better however to plan this for next cycle but start on the impementation in parallel17:36
sean-k-mooneyhi folks https://review.opendev.org/c/openstack/nova/+/933365 is green and so is https://review.opendev.org/c/openstack/nova/+/938879/2 so i think we are ok to proceed with merging that18:08
sean-k-mooneygibi: melwitt: would ye be able to review https://review.opendev.org/c/openstack/nova/+/933365/318:09
melwittsean-k-mooney: did you ever find some reason that the new eventlet cause the waitall() calls to change?18:11
sean-k-mooneyno18:12
sean-k-mooneyi dont think its particalarly relevent to the behavior of the test however18:13
sean-k-mooneyso i didnt spend much time trying to find out18:13
JayFNew eventlet fixed a few locking issues around os.read()/os.write() patched interactions aiui; the unit test change made sense to me in that context18:13
JayFbut I'm obviously not a nova or eventlet expert, just someone who has read all the involved code :)18:14
sean-k-mooneyim going to quickly test without the change to make sure it still fails18:28
sean-k-mooneyits possibel that the oslo change make it not required18:28
sean-k-mooneybut i suspect that the extra call to waitall is related to that18:29
sean-k-mooneywith the new oslo release i was able to remove the funcitonal test change18:29
sean-k-mooneybut i didnt test master with the new versions which im doing now18:30
JayFnice, that'll build even more confidence that the oslo.log pipemutex stuff fixed something \o/18:30
* JayF is mainly trying to be the openstack liason for Itamar's efforts on eventlet/oslo compat items18:30
sean-k-mooneyhum ok master apprelty passed. but i need to verify that it has the correct versions fo eventlet. changing branch might have cause the venv to be recreateded 18:33
sean-k-mooneyhttps://paste.opendev.org/show/bUq1UNJTP1NHybsQJHPN/18:34
sean-k-mooneyso ya i actully think the nova change is not requried any more18:34
sean-k-mooneyfrickler: do we want to try droping the depends-on form the requirements change?18:35
sean-k-mooneyoslo.log==7.0.0 might be all tha twas requried to fix nova with eventlet 0.38.218:36
opendevreviewmelanie witt proposed openstack/nova master: Bump requirement to PrettyTable>=2.4.0  https://review.opendev.org/c/openstack/nova/+/93915719:04
fricklersean-k-mooney: ack, confirmed locally, updating the change now19:13
sean-k-mooneycool im goign to call it a night, ill leave my nova change open but ill abandon it if the requiremetn patch passes19:32
opendevreviewmelanie witt proposed openstack/nova stable/2024.2: libvirt: Wrap un-proxied listDevices() and listAllDevices()  https://review.opendev.org/c/openstack/nova/+/93915819:38
opendevreviewDr. Jens Harbott proposed openstack/nova master: DNM: Test eventlet bump  https://review.opendev.org/c/openstack/nova/+/93887920:19
opendevreviewmelanie witt proposed openstack/nova stable/2024.1: libvirt: Wrap un-proxied listDevices() and listAllDevices()  https://review.opendev.org/c/openstack/nova/+/93916220:33
opendevreviewmelanie witt proposed openstack/nova stable/2023.2: libvirt: Wrap un-proxied listDevices() and listAllDevices()  https://review.opendev.org/c/openstack/nova/+/93916320:34

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!