Tuesday, 2022-07-12

sean-k-mooney[m]mnaser: that is something that libvirt not nova has always been in charge of00:31
sean-k-mooney[m]intel can change the cpu flags via microcode and when it comes to tsx they have on several ocations00:31
sean-k-mooney[m]nova for the most part triese to leave all cpu compatiablity checkign to the hyperviors the virt driver is managing00:32
sean-k-mooney[m]the excption to this is the abstction we have via traits00:33
sean-k-mooney[m]nova sepcficaly the libvirt driver uses the libvirt api to introspect the cpu to report traits00:35
sean-k-mooney[m]we do not use cpuid or msrs to detect this as its libvirts jobs to unify the feature flags in a vendor indepentent way00:35
sean-k-mooney[m]and its also libvirts job to determin cpu comparitblity for the most part in live migration.00:36
sean-k-mooney[m]nova has never used the current cpu model or cpu flags to make schduling decision for migrations00:36
sean-k-mooney[m]if you configure required traits in the flavor or image we can take those into account but the current cpu flags of a vm are not an input into the schduleing desicion.00:37
sean-k-mooney[m]and they never have been that has alwasy been delegated to the operator to enforce using host aggreates00:38
sean-k-mooney[m]so nova has not changed in this regard since i started working on openstack for the most part.00:39
opendevreviewSylvain Bauza proposed openstack/nova master: api: Drop generating a keypair and add special chars to naming  https://review.opendev.org/c/openstack/nova/+/84913307:33
bauzasgibi: sean-k-mooney: I'm done with the keypair generation removal07:33
bauzasgiven Uggla's patches seem good for unshelve, I'll rebase my branch up on Uggla's unshelve API change once he rebases 07:34
bauzasgibi: sean-k-mooney: I'll actually be off from tonight to next week (I'll also take Friday)07:35
gibibauzas: thanks for moving you stuff top of Uggla's that is nice selflessness07:38
gibibauzas: have a nice PTO, do you have someting on your PTL table we should keep in mind while you are away?07:39
bauzasgibi: nothing in my mind, I'll abandon the yoga open specs next week then07:39
bauzaswe're on yoga-2 on Thursday07:40
bauzasJuly-1407:40
bauzasthat will mean we won't accept new specs07:40
bauzasbut actually, we don't have a lot of them for zed07:40
gibiOK, so I guess you will do the official freeze mail when you are back on Monday. That is KO07:42
gibiOK07:42
bauzasyup07:42
bauzasunless you wanna use the axe07:42
gibinope07:43
gibithe axe is yours :)07:43
gibiand as you said we don't have much open07:43
gibiso there is no need for the axe07:43
bauzasgibi: yeah, in my email, I'll clarify the situation https://review.opendev.org/q/project:openstack/nova-specs+status:open+file:%255Especs/zed/.*07:45
bauzasabout ironic's discussion, this won't need to be hold by the deadline07:46
bauzasso only artom's spec is impacted... unless he's able to revive it before the deadline07:46
bauzasI don't know if sean-k-mooney had wheels for https://review.opendev.org/c/openstack/nova-specs/+/82141907:47
bauzasgibi: about the API changes, those are stacking07:49
bauzaswith the same API microversion07:49
bauzasI'm not really worried yet 07:50
bauzasbut maybe next week, I'd propose some etherpad for trying to organize series between them07:50
bauzaslike, Uggla would take the 2.91 as he's close to be merged07:50
bauzasmine would take 2.92 as this is a quite self-contained change07:51
bauzasand we would debate on other patches for 2.93 and others07:51
bauzasidea being that owners of those patches would have time in advance to rebase 07:51
opendevreviewSylvain Bauza proposed openstack/nova master: zuul: Put Centos9 Stream job periodic-weekly and experimental  https://review.opendev.org/c/openstack/nova/+/84946308:03
bauzasgibi: sean-k-mooney: ^08:03
bauzasUggla: good morning08:05
gibibauzas: thanks for the summary above. I agree with the plans08:06
Ugglabauzas, o/08:06
gibiI'm +2 on the centos9 patch08:06
gibiUggla: o/08:06
bauzasUggla: as I mentioned above, I'll rebase my keypair generation API change on top of your unshelve API patch08:07
bauzasgibi: do you think we actually need to rebase all our branches ? Can't I just write my patch saying "this is 2.92" ?08:08
bauzasof course, I would get a merge conflict because gerrit wouldn't be able to rebase the rest api microversion list doc08:08
bauzasbut this would waaaaay simplify the merge conflict resolution08:09
Ugglabauzas, regarding unshelve have you entered your comments ?08:10
bauzasUggla: not yet, that's my next move08:10
bauzasUggla: I have a direct interest in merging your branch08:11
gibibauzas: I'm not sure you can verify your code if it is on 2.92 without 2.91 existing08:11
gibibut other than that I'm OK to have 2.92 haning off 2.90 with a merge conflict 08:12
Ugglabauzas, ok I'll wait then I will fix gibi and yours quickly so you could probably merge.08:12
gibiI'm here so I can quickly re-review 08:12
bauzasgibi: yeah the tests will probably fail08:14
bauzasbut I see this as a security layer in case of a distracted core reviewer08:14
bauzaspeople can work on the the latest microversion, wait for Zuul +1ing08:15
bauzasand then modifying their patches with a placeholder microversion08:15
bauzasZuul would say no, but we'd have evidence this was working before08:15
bauzasand a merge resolution would solve it quickier once the concurrent patch merges08:16
bauzasI think I'm fool enough to test it on my series08:16
bauzasonce Zuul blesses my last revision08:17
gibibauzas: yeah if you want then you can test this on the keypair series08:17
opendevreviewAmit Uniyal proposed openstack/nova master: add regression test case for bug 1978983  https://review.opendev.org/c/openstack/nova/+/84910408:23
bauzasUggla: -1 on https://review.opendev.org/c/openstack/nova/+/831507 due to missing UTs on nova.compute.api09:33
bauzasyou wrote excellent conditionals (kudos to gibi and you) but you don't verify them :)09:34
bauzasalso, please help poor reviewers by not reindenting tests, that doesn't help to see the bone of the change :)09:34
gibibauzas: there is a bunch of functional coverage that I felt enough09:34
bauzasgibi: yeah but we already have UTs for az 09:35
gibiack, I'm not against having exta UTs too, just stated why I '09:35
bauzasand the functests are done on the latter patch09:35
gibiwhy I'm OK as is09:35
opendevreviewAmit Uniyal proposed openstack/nova master: Adds check, if admin has set compute service down  https://review.opendev.org/c/openstack/nova/+/84888609:40
Ugglabauzas, you mean the conditionals with host and az ? There are fully tested with functional tests. Am I missing something ? 09:42
bauzasUggla: you test them on the functests in https://review.opendev.org/c/openstack/nova/+/845897/4/nova/tests/functional/test_servers.py09:44
bauzaswhich is the latter patch09:44
bauzasUggla: but you also touch https://review.opendev.org/c/openstack/nova/+/831507/17/nova/tests/unit/compute/test_shelve.py in the compute patch09:44
bauzasyou're actually just reindenting a few calls09:44
bauzasbut you could also test the host param in some other tests09:45
opendevreviewMerged openstack/nova master: Catch an exception in power off procedure  https://review.opendev.org/c/openstack/nova/+/81717609:46
opendevreviewMerged openstack/nova master: Optimize _local_delete calls by compute unit tests  https://review.opendev.org/c/openstack/nova/+/84428509:46
sean-k-mooneybauzas: ill review your api removal patch shortly. i spend a lot of time reviewing this morning before going up to the office so im getting a little burnt out by it but i can do one or two more09:47
sean-k-mooneybauzas: the centos 9 patch is on its way to merging09:48
sean-k-mooneybauzas: gibi  im going to leave the unshleve to host serise to ye. ping me if needed but since ye had open comments on them ill let ye take lead on teh review of that09:49
gibisean-k-mooney: ack, make sense09:49
sean-k-mooneybauzas: regarding the external power manamgment i have not had time to look at it but its on my todo list for today09:49
gibisean-k-mooney: my only concern is that bauzas is off the rest of this week and I'd like to merge the unshelve09:50
sean-k-mooneygibi: i can review just proably not today09:50
gibisean-k-mooney: superb, thanks09:50
bauzasthanks09:51
sean-k-mooneyif ye agree on the path forward ill review when Uggla respins the patch to adress your comments09:51
bauzassean-k-mooney: gibi: we deserve to be humble with Uggla https://review.opendev.org/c/openstack/os-traits/+/83276910:05
gibiUggla, bauzas: I'm -1 on https://review.opendev.org/c/openstack/os-traits/+/832769 10:07
bauzasgibi: excellent point10:08
gibieasy to fix :)10:08
opendevreviewManuel Bentele proposed openstack/nova-specs master: Add configuration options to set SPICE compression settings  https://review.opendev.org/c/openstack/nova-specs/+/84948810:08
opendevreviewManuel Bentele proposed openstack/nova-specs master: Add configuration options to set SPICE compression settings  https://review.opendev.org/c/openstack/nova-specs/+/84948810:11
fricklersean-k-mooney: wow, you really get me wondering now why gerrit is sending me mails about a nova patch, which it usually doesn't. finding out that I reviewed it 5 years ago was ... interesting ;)10:14
opendevreviewManuel Bentele proposed openstack/nova-specs master: Add configuration options to set SPICE compression settings  https://review.opendev.org/c/openstack/nova-specs/+/84948810:15
sean-k-mooneyfrickler: i have a dashboard that i sometimes use when i want to find patches to review10:56
sean-k-mooneyi went througyh some of hte small ones this moringin then looked for ones with one +2 that were not in merge conflict10:57
sean-k-mooneythen looked at a few form my normal todo list10:57
sean-k-mooneyso ya some of those were old10:57
sean-k-mooneyhttps://review.opendev.org/dashboard/?foreach=%28+project%3Aopenstack%2Fnova+OR%0Aproject%3Aopenstack%2Fpython-novaclient+OR%0Aproject%3Aopenstack%2Fnova-specs+OR%0Aproject%3Aopenstack%2Fos-vif+OR%0Aproject%3Aopenstack%2Fos-traits+%29%0Astatus%3Aopen%0ANOT+owner%3Aself%0ANOT+label%3AWorkflow%3C%3D-1%0Alabel%3AVerified%3E%3D1%2Czuul%0ANOT+reviewedby%3Aself%0Abranch%3Amaster&tit10:58
sean-k-mooneyle=Nova+Review+Inbox&Small+patches=%28project%3Aopenstack%2Fnova+OR+project%3Aopenstack%2Fpython-novaclient+OR+project%3Aopenstack%2Fos-vif+OR+project%3Aopenstack%2Fos-traits%29+NOT+label%3ACode-Review%3E%3D2%2Cself+NOT+label%3ACode-Review%3C%3D-1%2Cnova-core+NOT+message%3A%22DNM%22+delta%3A%3C%3D10&Needs+final+%2B2=%28project%3Aopenstack%2Fnova+OR+project%3Aopenstack%2Fpython10:58
sean-k-mooney-novaclient+OR+project%3Aopenstack%2Fos-vif+OR+project%3Aopenstack%2Fos-traits%29+NOT+label%3ACode-Review%3E%3D2%2Cself+label%3ACode-Review%3E%3D2+limit%3A50&Bug+fix%2C+Passed+Zuul%2C+No+Negative+Feedback=NOT+label%3ACode-Review%3E%3D2%2Cself+NOT+label%3ACode-Review%3C%3D-1%2Cnova-core+message%3A%22bug%3A+%22+limit%3A50&Wayward+Changes+%28Changes+with+no+code+review+in+the+las10:58
sean-k-mooneyt+two+days%29=NOT+label%3ACode-Review%3C%3D-1+NOT+label%3ACode-Review%3E%3D1+age%3A2d+limit%3A50&Needs+feedback+%28Changes+older+than+5+days+that+have+not+been+reviewed+by+anyone%29=NOT+label%3ACode-Review%3C%3D-1+NOT+label%3ACode-Review%3E%3D1+age%3A5d+limit%3A50&Passed+Zuul%2C+No+Negative+Feedback=NOT+label%3ACode-Review%3E%3D2+NOT+label%3ACode-Review%3C%3D-1+limit%3A50&Need10:58
sean-k-mooneys+revisit+%28You+were+a+reviewer+but+haven%27t+voted+in+the+current+revision%29=reviewer%3Aself+limit%3A50&Specs=project%3Aopenstack%2Fnova-specs+status%3Aopen+limit%3A2010:58
sean-k-mooneyok thats longer then i tought it was10:58
sean-k-mooneyits also a little buggy sometimes10:58
sean-k-mooneylike it sometimes need to be opened twice to get current data10:59
opendevreviewAmit Uniyal proposed openstack/nova master: add regression test case for bug 1978983  https://review.opendev.org/c/openstack/nova/+/84910411:04
opendevreviewAmit Uniyal proposed openstack/nova master: Adds check, if admin has set compute service down  https://review.opendev.org/c/openstack/nova/+/84888611:04
opendevreviewsean mooney proposed openstack/nova master: Adds check, if admin has set compute service down  https://review.opendev.org/c/openstack/nova/+/84888612:01
sean-k-mooneyauniyal_: ^12:02
sean-k-mooneythat fixes your release note issue12:02
sean-k-mooneybut now i need to rebase them both12:03
opendevreviewsean mooney proposed openstack/nova master: add regression test case for bug 1978983  https://review.opendev.org/c/openstack/nova/+/84910412:03
opendevreviewsean mooney proposed openstack/nova master: Adds check, if admin has set compute service down  https://review.opendev.org/c/openstack/nova/+/84888612:03
sean-k-mooneyauniyal_: so now gerrit sees them both as the most recent revision12:03
auniyal_ack12:03
*** dasm|off is now known as dasm13:02
opendevreviewAmit Uniyal proposed openstack/nova master: Adds link in releasenotes for hw machine type bug  https://review.opendev.org/c/openstack/nova/+/84953213:23
opendevreviewribaudr proposed openstack/os-traits master: Add 'COMPUTE_STORAGE_VIRTIO_FS', 'COMPUTE_MEM_BACKING_FILE'  https://review.opendev.org/c/openstack/os-traits/+/83276913:25
Ugglabauzas, do you have prepared the notes for today's meeting ? 13:28
bauzasUggla: done : https://wiki.openstack.org/wiki/Meetings/Nova#Agenda_for_next_meeting13:57
bauzasand thanks13:57
Ugglabauzas, thx14:03
Ugglagibi, you will start the meeting right ?14:06
bauzasI have to drop by now14:06
bauzassee you folks, you'll be missed14:07
gibibauzas: o/ have a nice one14:07
gibiUggla: as you would like to. I can start and run it until 18:30 CEST and then pass the rest to you.  Or you can start from the beginning and I can be just your support running the meeting14:07
Ugglagibi, option 1 is fine.14:08
gibiOK, then I will start14:08
Ugglagibi, I would rather because sometime I'm not fully available to start at 18h.14:11
gibisure, no problemo :)14:11
ralonsohsean-k-mooney, https://review.opendev.org/c/openstack/releases/+/84954414:45
ralonsohis it ok to have a new os-vif version?14:45
ralonsohwe need the trunks improvement14:45
sean-k-mooneysure we can do one for m214:46
sean-k-mooneyi can propose a patch14:46
sean-k-mooneyralonsoh: oh you already have14:46
ralonsohhehehe yes14:47
opendevreviewAmit Uniyal proposed openstack/nova master: Adds check, if admin has set compute service down  https://review.opendev.org/c/openstack/nova/+/84888615:31
gibifoks, weekly nova meeting starts in 15 minutes here in the channel15:43
gibi#startmeeting nova16:00
opendevmeetMeeting started Tue Jul 12 16:00:38 2022 UTC and is due to finish in 60 minutes.  The chair is gibi. Information about MeetBot at http://wiki.debian.org/MeetBot.16:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.16:00
opendevmeetThe meeting name has been set to 'nova'16:00
*** frickler is now known as frickler_pto16:00
gibi#chairs gibi Uggla 16:00
gibi#chair gibi Uggla 16:00
opendevmeetCurrent chairs: Uggla gibi16:00
gibio/ folks16:01
Ugglao/16:01
gibibauzas is away for the rest of the week so Uggla and I will be your host today16:01
gibilets wait a bit and see if there are others here for the meeting :)16:01
gibireally? only me an Uggla? then it will be a quick meeting :)16:03
gibi#topic Bugs (stuck/critical) 16:03
gibi#info One Critical bug16:04
gibi#link https://bugs.launchpad.net/nova/+bug/1979047 Centos 9 Stream bug failure16:04
gibi#link https://review.opendev.org/c/openstack/nova/+/849463 move the C9S job to both experimental and periodic-weekly16:04
gibi#action bauzas to track results of this job on nova weekly meeting16:04
elodilleso/16:04
gibiI actually closed that critical16:04
gibias we merged the move of the job to our periodic queue today16:04
gibiso no need to track this as a critical bug16:04
gibielodilles: o./16:04
gibi#link https://bugs.launchpad.net/nova/+bugs?search=Search&field.status=New 11 new untriaged bugs (+1 since the last meeting)16:05
gibi#link https://storyboard.openstack.org/#!/project/openstack/placement 27 open stories (+0 since the last meeting) in Storyboard for Placement 16:05
elodilles(sorry for being late)16:05
gibiis there any bug we need to talk about here?16:05
gibiI assume no16:06
gibi#info Add yourself in the team bug roster if you want to help https://etherpad.opendev.org/p/nova-bug-triage-roster16:06
gibi#info Next bug baton is passed to Uggla16:06
gibiUggla: are you OK to take the baton?16:06
Ugglayep16:06
gibiawesome16:06
gibithanks16:06
gibi#topic Gate status 16:06
gibi#link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure Nova gate bugs 16:07
gibiI don't see any new on in that list16:07
gibiis there any gate bug we should discuss?16:07
elodillesnot a bug, but i guess the ovh.net issue ("Payment Required") impacts nova gate as well, doesn't it?16:08
gibiI haven't checked but could be16:08
* gibi was busy hacking k8s operator for the placement service16:09
elodillesit causes POST_FAILURES16:09
gibielodilles: do we have a tracking bug for it?16:09
elodillesoh, i see that is fixed16:09
elodilles2022-07-12 15:02:25 UTC Log uploads to OVH's Swift are resuming and our voucher is renewed; thanks again amorin!16:09
gibiOK, so POST_FAILURES are OK to recheck now if the failure was Payment Required ;)_16:10
elodilles(from here: https://wiki.openstack.org/wiki/Infrastructure_Status )16:10
gibielodilles: thanks for the info16:10
elodillesnp16:10
gibiany other gate issue?16:10
elodillesnothing i'm aware of at master branch16:10
gibithen moving on16:12
gibi#link https://zuul.openstack.org/builds?project=openstack%2Fplacement&pipeline=periodic-weekly Placement periodic job status 16:12
gibi#link https://zuul.opendev.org/t/openstack/builds?job_name=nova-emulation&pipeline=periodic-weekly&skip=0 Emulation periodic job runs16:12
gibiboth placement and nova emulation are green16:12
gibifrom next week we will check centos9 job here as well16:12
gibi#info Please look at the gate failures and file a bug report with the gate-failure tag.16:12
gibi#info STOP DOING BLIND RECHECKS aka. 'recheck' https://docs.openstack.org/project-team-guide/testing.html#how-to-handle-test-failures16:12
gibianything else about the gate before I move on?16:13
elodilles-16:13
gibi#topic Release Planning16:13
gibi#link https://releases.openstack.org/zed/schedule.html16:13
gibi#info Zed-2 is in 2 days16:13
gibi#info we'll stop accepting specs by Monday16:13
gibi#action bauzas to send an email on Monday about specs and abandon the yoga specs16:13
gibiwe have a small amount of open specs16:13
gibiif you have one then this is the last chance for Zed16:14
gibifeel free to ping me if you need help to land them16:14
gibiis there any other Release info to share?16:15
gibi#topic Review priorities 16:16
gibi#link https://review.opendev.org/q/status:open+(project:openstack/nova+OR+project:openstack/placement+OR+project:openstack/os-traits+OR+project:openstack/os-resource-classes+OR+project:openstack/os-vif+OR+project:openstack/python-novaclient+OR+project:openstack/osc-placement)+label:Review-Priority%252B116:16
gibi#link https://review.opendev.org/c/openstack/project-config/+/837595 Gerrit policy for Review-prio contributors flag. We need project-config cores to merge it.16:16
gibi#link https://docs.openstack.org/nova/latest/contributor/process.html#what-the-review-priority-label-in-gerrit-are-use-for Documentation we already have16:16
gibiat some point we should start to review the list of prio marked review here but not today as so few of us here16:16
gibi#topic Stable Branches16:16
gibielodilles: ?16:16
gibior more like <mic> -> elodilles 16:17
elodillesunfortunately not so much news, but let me copy them here16:17
elodilles#info stable/train is blocked, fix exists but hasn't merged yet due to intermittent failures16:17
elodilles#info stable branch status / gate failures tracking etherpad: https://etherpad.opendev.org/p/nova-stable-branch-ci16:17
gibithanks16:17
elodillesso in short, the intermittent failures are still causing pain for us :(16:18
elodillesnp16:18
gibielodilles: so https://review.opendev.org/c/openstack/nova/+/844530 the one we need for train?16:18
elodillesyes16:18
gibiack16:19
gibithanks16:19
gibi#topic Open discussion 16:19
gibi(bauzas) Opportunities for low-hanging-fruits, anyone ? (to be punted to next week)16:19
gibiI guess we punt this to next week again16:19
gibibut if you see low hangig fruits then note them for bauzas 16:19
gibiany other topic to discuss today? 16:19
elodillesnothing from me16:20
gibiit seems we are in summer mode16:20
gibibut at least I can leave in time for a game night ;)16:20
elodilleshahh, have fun then! ;16:21
elodilles:)16:21
gibithanks16:21
gibiso lets close this16:21
gibi#endmeeting16:21
opendevmeetMeeting ended Tue Jul 12 16:21:35 2022 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)16:21
opendevmeetMinutes:        https://meetings.opendev.org/meetings/nova/2022/nova.2022-07-12-16.00.html16:21
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/nova/2022/nova.2022-07-12-16.00.txt16:21
opendevmeetLog:            https://meetings.opendev.org/meetings/nova/2022/nova.2022-07-12-16.00.log.html16:21
Ugglagibi, thx for running the meeting16:21
gibiUggla, elodilles: thanks for joining :)16:21
* gibi logs off16:21
elodilleso/16:22
elodilles:)16:22
Ugglagibi, have fun16:22
colby_Hey All. Ive messaged before about our vGPU issues. Ive been working on trying to get this to work for a while now and we would really like to be able to offer this product to our users. Someone here mentioned that Nova should reuse the mdev devices that get created. The problem is that it is not. After spinning up vgpu instances then removing the mdev devices stay, but when we try to spin up a new instance its trying 17:29
colby_to create another medev device (I think it is since its trying to use a resource provide of a different pci address on the card that already has all the mdevs created). 17:29
colby_mdevctl list output: 67e63f1e-07f2-474f-874c-826a024c10ec 0000:21:01.7 nvidia-563 manual17:30
colby_3a974d13-5dea-4bfc-b034-533f6e754349 0000:21:03.4 nvidia-563 manual17:30
colby_d3586a0a-2e56-421e-923f-20797fe74ab5 0000:21:03.7 nvidia-563 manual17:30
colby_150c155c-da0b-45a6-8bc1-a8016231b100 0000:21:04.1 nvidia-563 manual17:30
colby_but spinning up a new instance tried to use the resource provider _pci_0000_21_02_4 (and pci 21 is already full)17:31
colby_how does nova detect the already created devices and use those? Should it be using the resource provider of those already created mdev (eg _pci_0000_21_01_7)17:32
colby_We are on Victoria release, Centos 8 Stream, Nvidia A40 GPU17:34
opendevreviewMerged openstack/nova master: zuul: Put Centos9 Stream job periodic-weekly and experimental  https://review.opendev.org/c/openstack/nova/+/84946317:54
sean-k-mooneycolby_: there is definetly a bug with this we are hitting it downstream too and still investigating18:36
sean-k-mooneycolby_: i can see if i can get you the link to where we try to reuse the mdev18:36
sean-k-mooneycolby_: https://github.com/openstack/nova/blob/de65131f92ba5ba812e33e6ff63be0991687413a/nova/virt/libvirt/driver.py#L8261-L8278=18:37
colby_sean-k-mooney: oh good glad to know this is not just us. Is there a bug filed yet that I could follow?18:40
sean-k-mooneydownstream definitly ill grab it and see if we have an upstram one. we were still trying to root cause it18:40
sean-k-mooneycolby_: basically we were QEing cold migration and noticed that the devices were not being reused18:41
sean-k-mooneyso depening on the order the test ran it either worked or failed18:41
sean-k-mooneyso we are looking at it as part of https://bugzilla.redhat.com/show_bug.cgi?id=170128118:42
sean-k-mooneybut i think we are going to break this out as a seperate upstream and downstream bug18:42
colby_would it be the same root cause as we are seeing just deleting and trying to create new instances?18:43
sean-k-mooneya host reboot or deleteing the unused mdevs is the work around we are usign right now18:43
sean-k-mooneyso if you loop over the domain xmls and delete any mdev not used by an xml that "fixes it" temporally18:44
colby_yea thats what I ended up having to do manually is remove the mdevs that got created then new instances could be spun up18:44
sean-k-mooneybut thats not the correct fix18:44
colby_I suppose I could create a cron job to do that so we can remove the manual part18:45
sean-k-mooneycolby_: bauzas is on on pto tomorrow and friday but i think they are here tursday18:45
sean-k-mooneythey tought it might be related to who we do the mdev lookup18:46
sean-k-mooneybut since you have the issue18:46
sean-k-mooneycould you compare the list of mdevs returned by mdevctl and libvirt via virsh 18:46
sean-k-mooneylibvirt does some caching so one of the guesses we had is it might be getting out of sync18:47
sean-k-mooneyactully hum18:47
sean-k-mooneyhttps://review.opendev.org/c/openstack/nova/+/832489/1/nova/virt/libvirt/utils.py18:49
sean-k-mooneyi wonder if its this ^18:49
sean-k-mooneyhttps://bugs.launchpad.net/nova/+bug/195165618:49
sean-k-mooneycolby_: do you know what version fo libvirt you are using18:50
colby_7.9.0-118:50
colby_you just want the output form `virsh nodedev-list` ?18:50
sean-k-mooneythat was in 7.7 https://github.com/libvirt/libvirt/commit/3bd8181bc5548a0ce81107cbfb480dfdcba5679d18:50
sean-k-mooneycolby_: yes please nodedev-list shoudl have the names 18:51
sean-k-mooneyand we can check the format18:51
sean-k-mooneyto see if it has the parent info or not18:51
colby_https://pastebin.com/94YfBRH918:54
sean-k-mooneythere is also https://review.opendev.org/c/openstack/nova/+/838976 as another possible fix18:55
sean-k-mooneyodd i dont see any mdevs there18:55
colby_ha woops sorry18:56
colby_wrong machine18:56
colby_https://pastebin.com/NAfjxUt718:56
colby_mdevctl list output: https://pastebin.com/rXmUftzj18:57
sean-k-mooneyyep so virsh has the extended names18:57
sean-k-mooneyso its not a caching issue but it proably is a parsing issue18:58
sean-k-mooneysince both consitent18:58
sean-k-mooney"""Note that the lookup of the mdev device by UUID are needed in order18:59
sean-k-mooneyto keep the ability to recreate assigned mediated devices on a reboot of18:59
sean-k-mooneythe compute node18:59
sean-k-mooney"""18:59
sean-k-mooneybut i bet its also needed to be able to reuse the mdevs at all18:59
sean-k-mooneycolby_: im not 100% sure this will fix it but i have set https://review.opendev.org/c/openstack/nova/+/838976 as a review priority and ill follow up with sylvain when they are back19:01
colby_ok sounds good. Im happy to test out the patches on our system if you want19:03
colby_no one is using this hypervisor right now but the admins19:03
sean-k-mooneyif you wanted to test https://review.opendev.org/c/openstack/nova/+/838976 and provide feedback on the review that is the more compelte fix19:04
colby_sure. Ill get those in place today and let you know if it helps our case at all19:04
sean-k-mooneymost of the opencomment are about updating the doc strings but the patch should work as is19:04
sean-k-mooneywe might also add a functional repoducer if we can recaret the mdev resue issue19:05
sean-k-mooneycolby_: thanks19:05
*** dasm is now known as dasm|off22:14

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!