Thursday, 2024-01-18

opendevreviewmelanie witt proposed openstack/nova master: DNM test ephemeral encryption + resize: qcow2, raw, rbd  https://review.opendev.org/c/openstack/nova/+/86241600:09
*** efried1 is now known as efried01:04
opendevreviewmelanie witt proposed openstack/nova master: DNM test ephemeral encryption + resize: qcow2, raw, rbd  https://review.opendev.org/c/openstack/nova/+/86241602:03
opendevreviewlikui proposed openstack/nova master: Add py3.11 in tox file  https://review.opendev.org/c/openstack/nova/+/90158702:10
opendevreviewlikui proposed openstack/nova master: Update python classifier in setup.cfg  https://review.opendev.org/c/openstack/nova/+/90158702:10
tonybAny chance I can get some eyes on: https://review.opendev.org/c/openstack/nova/+/905314 ?02:11
opendevreviewAmit Uniyal proposed openstack/nova master: Fixes: bfv vm reboot ends up in an error state.  https://review.opendev.org/c/openstack/nova/+/90481706:50
bauzastonyb: sent to the gate08:28
opendevreviewMerged openstack/nova master: Fix test failures with oslo.limit 2.3.0  https://review.opendev.org/c/openstack/nova/+/90531409:21
tonybbauzas: thank you09:23
tonybmelwitt: and thank you too for the quick review 09:24
*** priteau_ is now known as priteau12:14
opendevreviewMerged openstack/nova master: Updates glance fixture for create image  https://review.opendev.org/c/openstack/nova/+/90568414:37
opendevreviewSylvain Bauza proposed openstack/nova master: Add a functest for verifying multiple VGPU allocations  https://review.opendev.org/c/openstack/nova/+/84574715:16
opendevreviewSylvain Bauza proposed openstack/nova master: Support multiple allocations for vGPUs  https://review.opendev.org/c/openstack/nova/+/84575715:16
opendevreviewMerged openstack/nova master: Fixes: bfv vm reboot ends up in an error state.  https://review.opendev.org/c/openstack/nova/+/90481716:40
*** tosky_ is now known as tosky16:41
opendevreviewDoug Szumski proposed openstack/nova master: Use dedicated live migration network during pre-migration  https://review.opendev.org/c/openstack/nova/+/90605316:53
_colbybauzas: thanks for your help yesterday. So from what you said do we need to manually create the mdevs now? I was confused on that part. I see all the VF and available_instances for them all. But mdevctl shows no devices and nova does not seem to create them any more. So is manual creation required now?17:16
bauzas(on a meeting now but I'll reply later)17:17
opendevreviewDoug Szumski proposed openstack/nova master: Use dedicated live migration network during pre-migration  https://review.opendev.org/c/openstack/nova/+/90605317:17
bauzas_colby: I'll need to disappear shortly, but please explain a bit more: are you able to see mediated devices in /sys/bus/mdev?17:30
opendevreviewDoug Szumski proposed openstack/nova master: Use dedicated live migration network during pre-migration  https://review.opendev.org/c/openstack/nova/+/90605317:32
_colbybauzas: yes I see 1 here for all the available VFs:  cat /sys/class/mdev_bus/*/mdev_supported_types/nvidia-563/available_instances17:38
bauzasso basically all GPU VFs are available17:39
bauzasdo you have instances on the host ?17:39
_colbyno, this is a fresh host I'm testing with17:39
bauzasand do you have placement allocations ?17:39
bauzas_colby: re: the bug I mentioned yesterday, once you create enough mdevs equal to the max number of vGPUs the type can create, then all other VFs will start having 0 available instances but placement will continue to say 'sure, you can use them'17:40
_colbyyes the resource provider shows all the VF addresses as separate items and looking at inventory of one I see the expected output: VGPU, allocation_ratio=1.0,min_unit=1,max_unit=1,reserved=0,step_size=1,total=1,used=017:41
bauzasthat's why I'm recommending to precreate the mediated devices on the VFs you want and then start nova-compute17:41
bauzasoh, you already started nova-compute17:41
bauzasthen, 17:41
bauzas1/ create the mdevs by hand (or by mdevctl)17:41
bauzas2/ stop nova-compute17:41
bauzas3/ delete the resource providers matching the unused VFs17:42
bauzas4/ restart nova-compute17:42
bauzasyou should then have placement reflecting the correct resources17:42
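
[Editor's sketch] Step 1 above ("create the mdevs by hand") can be illustrated with the kernel's sysfs interface, where writing a UUID to a type's `create` file creates a mediated device. This is a minimal sketch, not nova's own code: the helper name `create_mdev` and its `sysfs_root` parameter are hypothetical, the address and type in the usage note are simply the ones quoted in this log, and running it against real sysfs is assumed to need root.

```python
import uuid
from pathlib import Path


def create_mdev(address, mdev_type, sysfs_root="/sys/class/mdev_bus"):
    """Create one mediated device on a VF and return its UUID.

    Hypothetical helper: it relies only on the sysfs layout quoted in the
    log (<root>/<address>/mdev_supported_types/<type>/...).
    """
    type_dir = Path(sysfs_root) / address / "mdev_supported_types" / mdev_type

    # Refuse to continue when the type reports no remaining capacity.
    available = int((type_dir / "available_instances").read_text().strip())
    if available < 1:
        raise RuntimeError(f"{address}: no available instances for {mdev_type}")

    # Writing a UUID to the 'create' file asks the kernel to create the mdev.
    mdev_uuid = str(uuid.uuid4())
    (type_dir / "create").write_text(mdev_uuid)
    return mdev_uuid
```

For example, `create_mdev("0000:17:00.4", "nvidia-563")` would attempt to create one mdev on the first VF listed later in this log. Steps 2 to 4 (stopping nova-compute, deleting the unused resource providers, restarting) are not covered by this sketch.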
bauzasI need to leave now17:42
_colbyok I can do that. Seems like a bit of heavy admin work every time we want to reconfig the vGPU slices. Nova does not want to create them automatically any more?17:43
bauzasthat's the bug report I mentioned yesterday17:43
_colbybut if that will get it working then we can do that17:44
bauzasnova can continue to create the mediated devices17:44
bauzasbut as soon as you create the Nth mdev from a GPU (where N equals the max number of vGPUs you can create from a type), then you'll have inconsistencies between placement and sysfs17:45
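
[Editor's sketch] The inconsistency described above can be pictured as a comparison between what sysfs reports and what placement still advertises. The function below is an illustration under assumptions, not the fix under review: `stale_vfs` and its three inputs are hypothetical names, and the caller is assumed to have already collected the per-VF `available_instances` values, the set of VFs that hold an mdev, and the VGPU `total` from each resource provider's inventory.

```python
def stale_vfs(available_instances, vfs_with_mdev, placement_totals):
    """Return VF addresses that placement still offers but sysfs cannot serve.

    available_instances: dict of VF address -> int read from sysfs
    vfs_with_mdev:       set of VF addresses that already hold an mdev
    placement_totals:    dict of VF address -> VGPU 'total' in placement
    """
    stale = []
    for address, total in sorted(placement_totals.items()):
        exhausted = available_instances.get(address, 0) == 0
        # A VF with its own mdev is legitimately at 0; only VFs that are
        # empty, exhausted, and still advertised are inconsistent.
        if total > 0 and exhausted and address not in vfs_with_mdev:
            stale.append(address)
    return stale
```

The addresses returned would correspond to the resource providers that step 3 of the earlier list says to delete.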
_colbyI see17:45
_colbybut in our case Nova does not seem to be creating any mdevs17:45
bauzasa fix is there, we're going to review it this cycle17:45
bauzasI need to go but shouldn't be17:45
_colbyno problem thanks17:46
bauzasI can't remember when we added more logs to _allocate_mdevs() but yoga seems recent enough to have those17:46
bauzashttps://github.com/openstack/nova/commit/6feb3350b048606297068841e3feba110bb0b0ab17:48
bauzasnah, antelope17:48
_colbyI can manually patch our system to add these logs for testing17:49
_colbybauzas: whenever you get back I made more progress: https://pastebin.com/Xv2bvaf718:19
_colbyit's complaining about no available instances on that device, but there is one: cat /sys/class/mdev_bus/0000\:17\:02.0/mdev_supported_types/nvidia-563/available_instances18:20
_colbyalso that device is not listed in the config for that type: device_addresses=0000:17:00.4,0000:17:00.5,0000:17:00.6,0000:17:00.718:22
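
[Editor's sketch] The mismatch reported above can be checked mechanically by splitting the `device_addresses` value and testing membership. This is a small illustration that uses only the values quoted in this log; the helper name `parse_device_addresses` is hypothetical and is not part of nova.

```python
def parse_device_addresses(value):
    """Split a comma-separated device_addresses value into a list."""
    return [addr.strip() for addr in value.split(",") if addr.strip()]


configured = parse_device_addresses(
    "0000:17:00.4,0000:17:00.5,0000:17:00.6,0000:17:00.7"
)

# The device named in the error is not among the configured addresses.
print("0000:17:02.0" in configured)  # False
```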
opendevreviewMerged openstack/nova master: Allow config to support virtiofs (driver)  https://review.opendev.org/c/openstack/nova/+/88652221:51
opendevreviewMerged openstack/nova master: Attach Manila shares via virtiofs (db)  https://review.opendev.org/c/openstack/nova/+/83119321:51
