Tuesday, 2023-11-07

bauzashappy spec review day, everyone :)07:50
gibiI did my first pass on the spec. I skipped re-proposes as those probably doesn't need much discussion anyhow09:21
gibis/spec/specs/09:22
bauzasgibi: I still need to do my duty, still working on f... vgpus09:28
* gibi sending good vibes09:29
opendevreviewBalazs Gibizer proposed openstack/nova master: [libvirt]Add migration_inbound_addr  https://review.opendev.org/c/openstack/nova/+/90020309:35
opendevreviewBalazs Gibizer proposed openstack/nova master: DNM: test hostname based migration URL  https://review.opendev.org/c/openstack/nova/+/90027509:38
opendevreviewSylvain Bauza proposed openstack/nova master: WIP: Support to pass PFs in device_addresses  https://review.opendev.org/c/openstack/nova/+/90025109:42
bauzasthere we go, I'm done09:42
bauzasI'll just await sean-k-mooney and dansmith for discussing about that09:43
dvo-plvbauzas, Hello. I'm currently resolving your comments in the spec file. I have a question regarding dependencies10:46
bauzasdvo-plv: I need to go off in a few mins10:47
dvo-plvSure, I will udpate all what i can, another discuss later10:47
bauzasdvo-plv: ack, I'll ping you this afternoon so10:49
opendevreviewDanylo Vodopianov proposed openstack/nova-specs master: Re-propose using VirtIO PackedRing Configuration support for 2024.1  https://review.opendev.org/c/openstack/nova-specs/+/89592410:49
opendevreviewStephen Finucane proposed openstack/nova master: docs: Add documentation on server groups  https://review.opendev.org/c/openstack/nova/+/89997912:18
opendevreviewDmitriy Rabotyagov proposed openstack/nova-specs master: [spec] Add Cross-AZ scheduling blueprint  https://review.opendev.org/c/openstack/nova-specs/+/90029614:06
opendevreviewDmitriy Rabotyagov proposed openstack/nova-specs master: [spec] Add Cross-AZ scheduling blueprint  https://review.opendev.org/c/openstack/nova-specs/+/90029614:12
noonedeadpunkI guess I'm a bit late for the spec review day, sorry :(14:13
noonedeadpunkhad some off time during previous week so wasn't able to work on it then 14:13
bauzasnoonedeadpunk: no worries, we plan to do a second round of specs in a next review day14:20
bauzaslike mine I didn't had time to write yet :)14:20
dvo-plvbauzas, Hello, I get your latest comments14:24
dvo-plvhttps://review.opendev.org/c/openstack/nova-specs/+/895924/6/specs/2024.1/approved/virtio_packedring_configuration_support.rst#154 Those patches which you mentioned are from another our blueprint14:25
dvo-plvI believe that patches are group by topic14:25
dvo-plvShould I add links on all patches to the spec file ?14:26
bauzasdvo-plv: hey 14:26
bauzasdvo-plv: I don't particularly want to track the implementation patches in a spec, I'd rather prefer if you can write something in the spec saying that the nova changes depend on neutron and os-vif changes that are on the fly14:27
opendevreviewJohn Garbutt proposed openstack/nova-specs master: WIP: Add spec for PCI Groups  https://review.opendev.org/c/openstack/nova-specs/+/89971914:27
dvo-plvI see, okay I will do it14:28
bauzasI see https://review.opendev.org/q/topic:bp%252Fvirtio-packedring-configuration-support14:28
bauzasdoes that mean anything is required on the neutron side ?14:28
bauzasnothing*14:28
bauzasdvo-plv: ^14:28
dvo-plvno, this feature is relayted to the nova14:28
bauzasI'm actually confused14:28
bauzasnova will pick the computes that are recent enough, gotcha14:29
bauzasbut14:29
dvo-plvits about acceleration for qemu14:29
bauzasoh I see14:30
bauzasso we're just exposing the feature as a trait14:30
dvo-plvyes14:30
bauzasokay, then nevermind my comment on dependencies, but there are still some things to write about testing :)14:30
bauzasI'll update 14:31
bauzasdvo-plv: I may propose an alternative, I'm not particarly fond of adding another trait for something only QEMU14:32
bauzaswe already add a lot of those14:33
dvo-plvMaybe we should leave as is. We already did alot of work here and testing for implement this14:34
bauzasdvo-plv: I still don't get why we need a trait, that's the point14:40
bauzashttps://docs.openstack.org/nova/latest/reference/libvirt-distro-support-matrix.html#min-libvirt-qemu-version-and-next-min-libvirt-qemu-version-table14:40
bauzasit says here that Antelope libvirt min is 6.014:40
bauzasand Bobcat 7.014:40
bauzasso basically, the problem we need to solve is only with Antelope computes or I'm stupid14:41
dvo-plvFor scheduling process, when operator makes OpenStack updates 14:42
bauzasyeah, I got it, but look at that table14:42
dvo-plvit case the live migration when one node works with packed ring and another not14:42
dvo-plvwhen user make update of the system 14:42
bauzassure, but Bobcat (and Caracal) require libvirt 7.0 as min14:43
bauzasso you're guaranteed that libvirt is newer than 6.7, which is what you want14:43
bauzasthe only case we need to solve is a very particular case, which is when the operator has a rolling upgrade environment on a SLURP cadence, ie. Caracal controllers and computes + a certain number of Antelope computes14:44
bauzasin that specific case, we don't wanna land on an Antelope compute, for sure14:44
dvo-plvbut when user hsa antelope and do system update, he could faced with the situation when one compute has antelope with old qemu and other already bobcat. to prevent wrong migration we will filter nodes with trait14:45
bauzasthis is why I'm proposing to add a prefilter that would only check the compute version14:45
bauzashow would you do that for antelope computes since you can't backport that detection code ?14:45
bauzaswhat I'm suggesting is to ask this feature to be Bobcat min14:46
bauzasI know I approved https://review.opendev.org/c/openstack/os-traits/+/876069 but I wasn't understand this was just for decorating a libvirt version14:47
dvo-plvit will required to delete it from the os-traits, rewrite nova code https://review.opendev.org/c/openstack/nova/+/876075 and glance14:49
dvo-plvrewrite spec file14:49
dvo-plvor we can move with it and not limit us with bobcat14:50
bauzassure, but we'll keep forever some trait support for something we don't need in the very next future, ie. D :)14:51
bauzasthis is the exact reason why we do specifications, in order to avoid such confusions14:51
bauzasand again, I don't get how antelope computes will magically decorate them with a trait, given the code isn't there14:52
dvo-plvwhen user starts to update the system14:52
bauzasdvo-plv: let me explain you14:53
dvo-plvhe could have situation when one compute old, another has altests release and vm was run on the latest and to avoid issue with qemu we could filter it with trait14:53
bauzasdvo-plv: in https://review.opendev.org/c/openstack/nova/+/876075/25/nova/virt/libvirt/driver.py#904614:53
bauzasyou're adding a trait14:53
bauzasand in the conditional below, you're decorating the compute with that trait14:54
bauzasso, indeed, Caracal computes will be decorated with that trait (since libvirt min is enough), I don't disagre14:55
bauzasdisagree*14:55
bauzasnow, Bobcat computes won't cointain that code, rightN14:55
bauzasand Antelope ones, too, right?14:55
dvo-plvyes14:55
bauzasfor Bobcat, this isn't a problem : since the min is recent enough, you're good14:58
bauzasfor Antelope, indeed, depending on the OS version, you may or may not have the right libvirt version14:58
bauzasbut unfortunately Antelope computes can't decorate themselves with https://review.opendev.org/c/openstack/nova/+/876075/25/nova/virt/libvirt/driver.py since they don't have that code running15:00
dvo-plvyes and in case if Antelope will have old qemu, it will help to avoid wrong qemu command15:01
bauzasdvo-plv: so again, why do we need to use a trait if we just say "we will only send that instance to Bobcat and newer computes" ?15:02
bauzaswe could also do something in the API like https://github.com/openstack/nova/blob/b64ecb0cc776bd3eced674b0f879bb23c8a4b486/nova/compute/api.py#L284-L31215:05
bauzasand return some HTTP40x exception if not all the computes are Bobcat or newer15:05
dvo-plvSo, you suggest to remove trait and add somewhere in the scheduler check that if packed ring was requested and release  of the target node is equal or higher then Bobcat we approve live migration to this to this node ?15:07
bauzasdvo-plv: that's a good question, we should either restrict the placement query to >=Bobcat computes (but I don't know if we have a construct like this) or we could just have some API check that would ensure that all computes are above or equal Bobcat version, like the link I passed to you 15:11
dvo-plvI see your point and it sounds correctly15:12
bauzasdvo-plv: I'm not telling you to drop the libvirt additions you made in https://review.opendev.org/c/openstack/nova/+/87607515:13
bauzasand nothing changes on the UX side (image property or flavor extraspec)15:13
bauzasthat's just the scheduling part15:14
bauzasand we could even lift that restriction in the API once we arrive in D timeframe, since we'll only support N-1 upgrade 15:15
dvo-plvBut maybe we could also invite sean-k-mooney and gibi  who also take a part in the spec file design to make one more common decision in that topic to choose new vector which i have to investigate how and where to implement it15:15
bauzassure15:15
bauzasdvo-plv: just my guess, was that spec originally proposed in Antelope ?15:15
bauzasif so, the libvirt check would sound more important to me by that time15:16
bauzasbut the fact is, we bumped our mins in Bobcat, so that simplifies the logic :)15:16
dvo-plvi'm not familiar with release date, I thought that it was before Zed release May 03 2023 https://review.opendev.org/c/openstack/nova-specs/+/86837715:17
bauzasI need to disappear for 20 mins, but I can read15:18
opendevreviewDmitriy Rabotyagov proposed openstack/nova stable/2023.1: add a regression test for all compute RPCAPI 6.x pinnings for rebuild  https://review.opendev.org/c/openstack/nova/+/90030615:28
opendevreviewDmitriy Rabotyagov proposed openstack/nova stable/zed: add a regression test for all compute RPCAPI 6.x pinnings for rebuild  https://review.opendev.org/c/openstack/nova/+/90030715:28
dvo-plvsean-k-mooney, gibi15:29
dvo-plvSo, in a conclusion we have two open questions. How we need to proceed with scheduling, trait or release filter15:29
dvo-plvDoes tempest test is a required from us ?15:29
bauzasnoonedeadpunk: thanks for proposing the backparts, hadn't time yet15:47
bauzasreminder: nova meeting in 13 mins here.15:47
noonedeadpunkno worries. I will push rest on top in couple of mins15:47
bauzasdvo-plv: I'm not saying a tempest test is mandatory, just a preference15:48
bauzasthat said, at least a functional test seems needed15:48
bauzasshit, forgot the meeting was in another impromptu call :(16:02
bauzas#startmeeting16:02
opendevmeetbauzas: Error: A meeting name is required, e.g., '#startmeeting Marketing Committee'16:02
elodilleso/16:02
bauzas#startmeeting nova16:02
opendevmeetMeeting started Tue Nov  7 16:02:15 2023 UTC and is due to finish in 60 minutes.  The chair is bauzas. Information about MeetBot at http://wiki.debian.org/MeetBot.16:02
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.16:02
opendevmeetThe meeting name has been set to 'nova'16:02
elodilleso/16:02
bauzas#link https://wiki.openstack.org/wiki/Meetings/Nova#Agenda_for_next_meeting16:02
bauzas(this will be live, I haven't updated the wiki yet)16:02
dansmitho/16:03
gibio/16:03
bauzasthere, let's start16:04
bauzas#topic Bugs (stuck/critical) 16:04
bauzas#info No Critical bug16:04
bauzas#link https://bugs.launchpad.net/nova/+bugs?search=Search&field.status=New 32 new untriaged bugs (-4 since the last meeting)16:04
bauzas#info Add yourself in the team bug roster if you want to help https://etherpad.opendev.org/p/nova-bug-triage-roster16:05
bauzasUggla_: any bug you wanted to tell us ?16:05
bauzasanyway, let's move on16:05
bauzaselodilles: fancy taking the baton ?16:06
elodillesbauzas: yepp16:06
elodillesi can take it :)16:06
bauzascool thanks16:06
bauzas#info bug baton is elodilles16:07
bauzaselodilles: ++16:07
Uggla_oh16:08
Uggla_yes16:08
bauzas#topic Gate status 16:08
bauzas#link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure Nova gate bugs 16:08
bauzas#link https://zuul.openstack.org/builds?project=openstack%2Fnova&project=openstack%2Fplacement&pipeline=periodic-weekly Nova&Placement periodic jobs status16:08
Uggla_https://bugs.launchpad.net/nova/+bug/203938116:08
bauzas#info Please look at the gate failures and file a bug report with the gate-failure tag.16:08
bauzasUggla_: oh sorry we moved to the gate section, can we discuss your bug in the open discussion then ?16:09
dansmithmelwitt has a fix up for the vnc ting.. I've asked her a question, but assume we'll get that on the way soon once she's around16:09
bauzasfwiw, all periodics are in green16:09
Uggla_yep sure16:10
bauzasdansmith: oh, nice to hear, any change I could dent ?16:10
dansmithsec16:10
dansmithhttps://review.opendev.org/c/openstack/grenade/+/90025716:11
bauzasdansmith: cool, CCing it16:11
bauzasoh that's why it's failing intermittently16:12
bauzasdepending on the host you land16:12
bauzasI nacked the idea when gibi suggested it because I was too lazy to look at the job and I just trusted the fact we got green runs16:12
bauzaswhat a shame16:12
dansmithbauzas: that's what I'm wondering16:13
dansmithseems like we should be failing a lot more though,16:13
dansmithand why this has become a thing all of a sudden also seems weird if it's been broken like this for a while16:13
dansmithbut otherwise yeah, hopefully an easy fix16:13
bauzasbecause yesterday we merged my patch that added the servers actions checks in grenade16:13
bauzasthis test was flakey but never run, and I just opened the can of worms 16:14
dansmithbut I think it has failed on other patches, not just yours16:14
bauzaswell, then the vnc check itself could be present on other tempest tests16:15
bauzasor we could run the server actions list in other jobs, rather16:15
bauzasanyway16:15
bauzasmoving on16:16
bauzas#topic Release Planning 16:16
bauzas#link https://releases.openstack.org/caracal/schedule.html16:16
bauzas#info Nova deadlines will be proposed in the schedule above16:16
bauzasI just need to file a release patch with the correct dates :)16:17
bauzas#info Caracal-1 milestone in 1 week16:17
bauzas#info Spec review day today16:17
bauzasI think I made a correct round of reviews but there are still some specs I haven't looked yet16:17
bauzasI'll continue my duty until EOB16:18
dansmithany list of specs just waiting for a +W?16:18
bauzassure16:19
opendevreviewSteven Relf proposed openstack/nova master: Adding basic auth to dynamic vendordata api calls  https://review.opendev.org/c/openstack/nova/+/90025216:19
gibithere is a set of reproposals that are easy wins 16:19
bauzasdansmith: you mean a second +2 or just a plain +W ? :)16:19
dansmithbauzas: second review I mean?16:19
bauzashttps://review.opendev.org/q/(project:openstack/nova-specs)+status:open+NOT+owner:self+NOT+label:Workflow%253C%253D-1+label:Verified%253E%253D1%252Czuul+NOT+reviewedby:self+is:mergeable+NOT+label:Code-Review%253C%253D-1%252Cnova-core+label:Code-Review%253E%253D216:19
bauzasnot sure the link will render correctly16:19
bauzasbut tl;dr: dansmith you have the sole spec that needs a second +2 :)16:20
bauzasand yeah, there are reproposals on the way16:20
bauzasbtw. I wonder why they don't show up16:20
dansmithbauzas: ah bummer.. maybe gibi can circle back on that .. that's the only one I can't finish :)16:21
bauzashttps://review.opendev.org/q/project:openstack/nova-specs+status:open+label:Code-Review%253E%253D216:21
dansmithbut yeah I'll look at the re-proposals16:21
bauzassorry, my dash link seems to be wrong16:21
bauzasfwiw, had no time yet to correctly write a mdev live-migration spec 16:21
auniyal6on CI bugs,  nova-emulation job is failing too (I wanted to check if its failing for more patches, but builds page is not opening for me right now ) 16:22
bauzasI'll bug folks directly :)16:22
sean-k-mooney i can also try and loop back16:22
sean-k-mooneyill be doing more review later today16:22
sean-k-mooneydansmith: on your spec 16:22
* gibi will check back on the device alias16:22
johnthetubaguy(I keep trying to do the ironic re-proposal, but I keep getting distracted, sorry)16:22
opendevreviewDmitriy Rabotyagov proposed openstack/nova stable/2023.1: Fix rebuild compute RPC API exception for rolling-upgrades  https://review.opendev.org/c/openstack/nova/+/90033616:22
opendevreviewDmitriy Rabotyagov proposed openstack/nova stable/2023.1: Adding server actions tests to grenade-multinode  https://review.opendev.org/c/openstack/nova/+/90033716:22
bauzasgibi: you left a +1 because of Uggla_'s concern but AFAICR, it was resolved16:22
bauzasso I just went +216:23
gibibauzas: OK, thanks16:23
opendevreviewDmitriy Rabotyagov proposed openstack/nova stable/2023.2: add a regression test for all compute RPCAPI 6.x pinnings for rebuild  https://review.opendev.org/c/openstack/nova/+/90030916:23
bauzasokay, moving on16:23
bauzasdvo-plv wanted us to do a group discussion on https://review.opendev.org/c/openstack/nova-specs/+/895924/ since I asked to *not* use a trait but let's not discuss this now16:24
opendevreviewDmitriy Rabotyagov proposed openstack/nova stable/2023.2: Fix rebuild compute RPC API exception for rolling-upgrades  https://review.opendev.org/c/openstack/nova/+/90033816:24
opendevreviewDmitriy Rabotyagov proposed openstack/nova stable/2023.2: Adding server actions tests to grenade-multinode  https://review.opendev.org/c/openstack/nova/+/90033916:24
bauzasif people want, we could discuss this during open discussion16:24
bauzas(or just read my gerrit comments, you'll get my point, which is please avoid adding traits for just a libvirt version check that's already supported for all computes except Antelope and olders)16:25
bauzasanyway, moving on16:25
dansmithyeah16:26
dansmithneeds to be capability-based, IMHO16:26
dansmith"supports X" not "is version X"16:26
sean-k-mooneyyep16:27
sean-k-mooneyso supprot virtio-packed format 16:27
sean-k-mooneyis a capablity16:27
bauzasI said we could resolve that with a service version check16:27
sean-k-mooneynot a version check16:27
sean-k-mooneyso it shoudl be a trait16:27
sean-k-mooneyand we shoudl not need a comptue service check16:27
bauzassean-k-mooney: that's not exactly what's written in both code and spec16:27
sean-k-mooneyas this is not a feature that is impemnted at the comptue mager level16:27
bauzasthe requirement is "is libvirt >6.7"16:27
sean-k-mooneyright that is the requireemtn for the feature too work16:28
bauzassince bobcat supports 7.0, we're good16:28
sean-k-mooneybut it is modeled by reportign a triat for the host where the capablitys is supproted16:28
sean-k-mooneyyep16:28
sean-k-mooneybut we now need to supprot n-2 and diffent oss16:28
sean-k-mooney*operating systems 16:29
bauzasso it only leaves the rolling upgrade case with a caracal env + an antelope node16:29
bauzaswhich will no longer be a problem with D16:29
sean-k-mooneywell this featur like many other can be docusmeed as only supproted after a full upgrade is complete16:29
bauzasso again, I'm very against adding a trait for modeling a libvirt version that's gonna be minimum for all our computes next cycle anyway16:29
sean-k-mooneybauzas: that not what its modeling16:29
sean-k-mooneyand its required in my view for as long as we have any virt driver (other then ironic and libvirt) that supprot vms16:30
bauzashttps://review.opendev.org/c/openstack/nova/+/876075/25/nova/virt/libvirt/driver.py16:30
bauzasthat exactly models the libvirt version 16:30
dansmithnot really16:31
sean-k-mooneyits the only way to detect it because libvirt does not have a way to detect it without a version check16:31
sean-k-mooneybut16:31
sean-k-mooneysince we now have the min versio nrequiremnet16:31
dansmithit's exposing whether or not the feature is supported, but the way it does that is by the libvirt version internally16:31
sean-k-mooneythis would be a static trait that is exposed by computes using the libvirt driver16:31
dansmiththat's different than exposing a trait of the actual version16:31
sean-k-mooneydansmith: exactly16:31
bauzassean-k-mooney: then I'd suggest a prefilter that would only ensure that we land on a libvirt hostr16:31
dansmiththe prefilter can be the trait though16:32
sean-k-mooneybauzas: i belive the prefilter was aldreay in the spec16:32
sean-k-mooneybauzas: its somethign i have defintly dicussed for this feature previously16:32
bauzasyeah, but again, I don't like us adding yet another trait for this16:32
sean-k-mooneybauzas: i belive this is exactly what traits shoudl be used for16:32
dansmithbut this is what we do right? all the other traits are just like this I think16:32
sean-k-mooneydansmith: more or less16:33
bauzasthen, let's just provide this trait without a conditional16:33
sean-k-mooneybauzas: yes i also said that in my comments16:33
sean-k-mooneyand i said this last cycle too when we origianlly proposed it16:33
bauzasif it's really about directing to libvirt nodes16:33
dansmithwe can do that if we hard-require the min version16:33
sean-k-mooneybauzas: basically the context yoru missing is we had prviously agree that if we enforced the min verison we would update the sepc depending on which feature landed first16:34
sean-k-mooneykasyaps min version bump16:34
sean-k-mooneyor this feature16:34
sean-k-mooneyin the repopoal it should be updated to drop the check because the min verion bump already happend in bobcat16:34
dansmithso to be clear, we're already hard-requiring the min version needed for this, and thus the trait is just for the upgrade case where we might have old computes?16:35
opendevreviewDmitriy Rabotyagov proposed openstack/nova stable/zed: Fix rebuild compute RPC API exception for rolling-upgrades  https://review.opendev.org/c/openstack/nova/+/90034116:35
opendevreviewDmitriy Rabotyagov proposed openstack/nova stable/zed: Adding server actions tests to grenade-multinode  https://review.opendev.org/c/openstack/nova/+/90034216:35
bauzaswhat *you* missed is that I figured this out before the meeting (that spec is older than our min bump) but I missed the fact we want to direct to libvirt-only nodes, hence my mistake16:35
sean-k-mooneydansmith: upgrade case or where your mixing virt drivers16:35
sean-k-mooneydansmith: but use oru current min is 7.0.0 i belvie and the feature is in 6.x16:35
dansmithsean-k-mooney: okay libvirt and ironic being the only possibilities there :)16:35
bauzasdansmith: the former (upgrade case) was my concern16:35
bauzassean-k-mooney: it requires 6.716:35
dansmithso I mean, I would probably lean towards just a service version check to not let this work until everything is upgraded, but the multi-virt driver thing is a fair point16:36
sean-k-mooneybauzas: yep i said 6.x because i know we met the min requirement with our min supported version16:36
bauzasI don't want us to buy a new trait for something that's only because we support N-2 this cycle16:36
dansmithand even though we're dropping basically all the others, we could have a new one in the future without this support, so...16:36
sean-k-mooneydansmith: well we dont need a new compute service verion for this feature in general16:36
sean-k-mooneyaltough i guess we coudl do oen for the new prefilter16:37
bauzasdansmith: so you're on the same page than me, a service version check for upgrades is enough, but if we really want to avoid other hypervisors we could need a trait16:37
dansmithsean-k-mooney: we don't need it, but we could use it (cheaper than a trait) for the usual purpose of not exposing features until everything is upgraded16:37
dansmithbauzas: no, I said I lean that way, but I'm also fine with a trait because of the virt driver possibility16:37
sean-k-mooneydansmith: oh i disagree i alwasy conisderd a comptue version bump more expensive or at most the same16:37
dansmithsean-k-mooney: well, we disagree then.. we bump service versions all the time for stuff like this, where there isn't even a rpc bump to correlate16:38
dansmithand it is a single integer in one tree versus a new enum in traits, a package release, a dep update, and then a nova patch16:38
bauzastbh, I'd rather prefer having the prefilter asking 'get me a libvirt compute' rather than 'get me a compute that supports foo' since this feature is very QEMU-centric16:38
bauzassean-k-mooney: the fact is, this service version check can drop next cycle16:39
sean-k-mooneybauzas: that would be a misuse of traits16:39
bauzassean-k-mooney: starting with D, all computes will support a libvirt recent enough16:39
dansmithsean-k-mooney: how is that a misuse of traits?16:39
sean-k-mooneyif we want to also have a servic verion bump and an additon check in the api prior to the call to the schduler we can16:39
sean-k-mooneydansmith: we previosly said we didnt want to have a trait for which virt driver is in use16:39
bauzasif two virt drivers were proposing the same feature, then yah a trait sounds good to me16:39
sean-k-mooneywe can revert that if we want but i know there was pushback to that in the past16:40
bauzasbut here that capability is purely qemu-based16:40
sean-k-mooneybauzas: so is the only reason your takign this stance because we did the min libvirt bump last cycle16:40
dansmithsean-k-mooney: we have traits for which type of hardware is on the host, this seems similar, but we can also filter on hypervisor type from host state anyway right? so we can do it without a trait16:40
bauzassay some other feature does the same, we gonna add another trait and another prefilter16:40
sean-k-mooneybauzas: yes we should anytime thre is a capablity that is not supproted by all supproted drivers16:41
bauzasand boom, traits explosion, plus the fact we yet again push hypervisor features upfront16:41
sean-k-mooneybauzas: traits are cheap and placment was built to deal with many of them16:41
dansmithhonestly, this particular feature is probably not critical, right? meaning:16:41
bauzasservice version checks that can get rid next cycle are cheaper IMHO and,16:42
bauzasno traits is cheaper than a single one :)=16:42
dansmithif we restrict to libvirt hosts with a filter, then it's easy for us to say in the reno that if you're not upgraded that feature request will not be honored by old computes that don't know about it16:42
dansmithafter the upgrade it's all good16:42
dansmiththis is an optimization not like "I *need* 32G of memory else please reject"16:42
bauzasI just feel we can say in the notes 'please do what you need in case you have mixed hypervisors'16:43
sean-k-mooneybauzas: i really dislike that direction as i feel our schduler shoudl ensure you land on a host that can supprot the request feature without additonal admin intervention16:43
sean-k-mooneydansmith: in this partical case if we dont have the trait it will be non critical only in that on older host it willl be ignored16:44
bauzasonly if you have mixed hypervisors, right?16:44
sean-k-mooneybauzas: no even with a singel hypervior16:44
bauzasand in that case, you probably already did the setup in order to shard your cloiud16:44
dansmithsean-k-mooney: right, it just seems like a best-effort sort of thing compared to some others16:44
bauzassean-k-mooney: still talking of the 'I want a compute with libvirt recent enough' then ? 16:45
bauzasagain, doesn't sound to me worth adding a trait for this16:45
sean-k-mooneydansmith: if its a flavor extra spec we shoudl always guarentee it abel to work on the slected host16:45
sean-k-mooneyso if we are fine with rejecting it on the compute node sure16:45
bauzasanyway, I feel we're arguing right, but the time flies and we're on a meeting16:45
sean-k-mooneyim not ok with allowing the vm to boot16:45
bauzascan we drop this until the end of this meeting 16:46
bauzas?16:46
dansmithyep16:46
sean-k-mooneysure or to thet spec review16:46
bauzasand we could try to find a way forward just after 16:46
bauzascool, moving on then16:46
bauzas#topic Review Priorities16:47
bauzasstill an action item on me16:47
bauzasI have to create an etherpad as we agreed at PTG16:47
bauzasso I'll keep this bullet until I'm set 16:47
dvo-plvI have some issue with electricity, battery dead, so ping me and i will answer later when I will be back online16:47
bauzas#action bauzas to create a tracking etherpad for this cycle16:48
bauzasdvo-plv: np, we also captured this conversation in the meeting logs that'll show up once we end the meeting16:48
bauzas#topic Stable Branches 16:48
bauzaselodilles: your time16:49
elodilles#info stable gates don't seem blocked16:49
elodilles(stable/victoria's nova-ceph-multistore looked suspicious as it failed with POST_FAILURE a couple of times, but it has passed now)16:49
elodilles#info stable release patches proposed: https://review.opendev.org/q/project:openstack/releases+is:open+intopic:nova16:49
elodilleslast time we agreed to wait for some patches to merge ^^^16:50
elodillesfeel free to update the patch when they are merged16:50
bauzasyeah and noonedeadpunk proposed backports 16:50
elodilles++16:50
bauzasrelated to the RPC fixes we wanted to land16:50
bauzasso, we need reviews16:50
bauzasI surely can review things I wrote16:50
elodillesACK, will review them then :)16:51
elodilles#info stable branch status / gate failures tracking etherpad: https://etherpad.opendev.org/p/nova-stable-branch-ci16:51
elodillesand that's all from my side16:51
bauzaselodilles: ping me if you need the change numbers16:51
bauzascool, thanks16:51
bauzasrushing, we have a few time left16:51
bauzas#topic Open discussion 16:51
bauzas(gibi): Seeking for specless approval https://blueprints.launchpad.net/nova/+spec/libvirt-migrate-with-hostname-instead-of-ip based on the PTG discussion https://etherpad.opendev.org/p/nova-caracal-ptg#L815. I have a proposed impl https://review.opendev.org/c/openstack/nova/+/90020316:52
gibiso16:52
gibias we discussed on the PTG I would like to make it possible for the libvirt driver to use hostnames during cold migration instead of IP addresses16:52
noonedeadpunkbauzas: fwiw, backport to Zed is failing unit tests16:52
noonedeadpunkso some look there is appreciated16:52
gibithe live migration already uses hostnames by default16:53
dansmithgibi: ++16:53
gibithis specless bp propose a new config option to make the new behavior opt in16:53
bauzasnoonedeadpunk: I have a clue, but let's not discuss this now (I guess this is the rpc version check unittest that fails)16:53
dansmithgibi: no new RPC or object stuff needed, just a change on which thing we advertise to the remote side right?16:53
sean-k-mooneydansmith: we might need a db change16:53
gibino rpc, no object, just a config option for the libvirt driver16:53
dansmithsean-k-mooney: why/16:53
gibisean-k-mooney: no DB change is needed afaik16:53
bauzasand this is per-computes ?16:54
dansmithgibi: cool, ++ for specless BP from me16:54
gibibauzas: this is per compute for the incoming migrations16:54
sean-k-mooneydont we look up the remote systme "connection" info via fiels in the obejct16:54
sean-k-mooneythat are stored in teh db16:54
gibiit is in the migration object16:54
gibiwe store the IP there today16:54
gibias a string16:54
dansmithbut it's just a string, AFAIK16:54
sean-k-mooneyright so will the content of that change16:54
bauzasyeah16:54
gibinow it will be either IP or a hostname16:54
sean-k-mooneyok that is what i was wondering about16:54
bauzasand we said it was opt-in16:55
sean-k-mooneyok provided we dont assume that its an ip anywhere today16:55
bauzasso this is not a breaking upgrade change16:55
sean-k-mooneyand given its opt in16:55
gibiit is opt-in the default of the new config is to use my_ip as before16:55
sean-k-mooneyi think im fine with that as specless16:55
bauzasyeah, so I'm favor of approving it16:55
bauzasany concerns ? 16:55
sean-k-mooneythe only thing we need to do is docuemnt that you shoudl not set the new config option until the cloud is fully upgraded16:56
sean-k-mooneyit will likely work with the hostname16:56
sean-k-mooneyand old nova16:56
gibiI can add a reno16:56
sean-k-mooneyack then all good form me16:56
gibiand amend the config doc with a warning16:56
dansmithI bet it works even with old ones16:56
gibiit depends16:57
sean-k-mooneyprovided its resolveable it likely will16:57
gibiyeah16:57
sean-k-mooneybut it will depend on dns and /etc/hosts16:57
bauzasI mean, that will depend on how the cloud is configured, but old computes can work 16:57
bauzasfor sure16:57
sean-k-mooneygibi: and just to clarify it can be an fqdn right (ip, hostname or fqdn) 16:57
sean-k-mooneyit being the new config option16:57
gibiit can be IP, it is defulted to my_ip, it can be a user defined FQDN or hostname, or it can be "%s" which will be replaced with the hostname of the node16:58
sean-k-mooney+116:58
dansmithyup16:58
bauzasokay, if you don't mind,16:58
gibithe last is the most useful for me in my deployment work16:58
bauzas#agreed https://blueprints.launchpad.net/nova/+spec/libvirt-migrate-with-hostname-instead-of-ip is approved as a specless BP16:59
bauzaswe're on time16:59
bauzasany last min question ?16:59
Uggla_yep16:59
Uggla_looking at https://bugs.launchpad.net/nova/+bug/203938116:59
Uggla_do you think it could be linked to a configuration (service token) issue ?16:59
bauzasoh dman, forgot17:00
bauzasI'll end the meeting now, we'll discuss this right after17:00
Uggla_ok17:00
bauzasthanks all17:00
bauzas#endmeeting17:00
opendevmeetMeeting ended Tue Nov  7 17:00:48 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)17:00
opendevmeetMinutes:        https://meetings.opendev.org/meetings/nova/2023/nova.2023-11-07-16.02.html17:00
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/nova/2023/nova.2023-11-07-16.02.txt17:00
opendevmeetLog:            https://meetings.opendev.org/meetings/nova/2023/nova.2023-11-07-16.02.log.html17:00
bauzasdvo-plv: you have the logs there^17:00
bauzasUggla_: so17:01
* bauzas looks17:01
bauzasUggla_: did they do the service token thing ?17:02
bauzasnow that's mandatory ?17:02
Uggla_I don't know. But do we do it on devstack ?17:03
*** d34dh0r5- is now known as d34dh0r5317:03
bauzasI bet we do17:04
bauzashttps://docs.openstack.org/nova/latest/admin/configuration/service-user-token.html17:04
bauzasUggla_: can you check ?17:05
gmannbauzas: FYI, with daylight saving things, nova meeting time conflict with my other appointment so I would not be able to join. If anything for me  during meeting, please ping me and i will reply later. 17:05
bauzasgmann: no worries17:05
bauzasDST sucks 17:05
gmannyeah :)17:05
Uggla_bauzas, yes the section is part of my nova.conf (devstack).17:06
bauzasdansmith: sean-k-mooney: very briefly because today is spec review day and you both have other urgencies, please note that none of https://review.opendev.org/q/topic:bug%252F2041519-new actually works since we don't delete existing Resource Providers17:07
bauzasso we need some logic like https://review.opendev.org/c/openstack/nova/+/899625 in order to clean up the no-longer needed RPs 17:07
bauzas(talking of upgrades, for sure a greenfields deployment with no existing RPs will correctly create the only needed RPs)17:08
dansmithbauzas: yeah understand we need to clean up the current situation17:09
bauzasanyway, let's not discuss this for now, this is more a fyi17:09
bauzasand we'll revisit that later17:09
elodillessean-k-mooney: when you'll have time could you please have a quick look at this pip-23.1-support / PBR related patch? https://review.opendev.org/c/openstack/releases/+/900053/717:11
clarkbelodilles: see scrollback in #openstack-infra17:14
dvo-plvbauzas, sean-k-mooney, densmith I read logs and my conclusion that we do not have final decision, how I need to managed this? Ping you tomorrow or add this topic to some another meeting?17:33
bauzasusually we let people chime in on Gerrit17:34
bauzasIMHO we clarified exactly the usecases so all people should understand others' concerns17:35
elodillesclarkb: looking17:35
dvo-plvSo i need just wait for discussion in the spec comments ?17:36
bauzasdvo-plv: fwiw, sean-k-mooney and I already commented17:46
bauzasmaybe dansmith could add his thoughts once he has time (later today or tomorrow)17:46
bauzasin the mean, this doesn't really change what you wrote,17:46
bauzasonly the scheduling part may be reworked17:47
dansmithI thought we were going to have a discussion about it?17:47
bauzasdansmith: you had other prios17:47
bauzasif you have time, sean-k-mooney and I are debating on the need for a trait17:47
bauzasI captured my thoughts in gerrit17:47
bauzasI just can't wait for the next feature coming by and asking for the same17:47
bauzas"hey, I wanna be sure we land on libvirt node"17:48
bauzasso my question is, shall we create a trait for every single image property or flavor extra spec that matches a specific libvirt feature ?17:49
bauzas(and a prefilter of course)17:49
dansmithyeah, honestly for a thing that is just based on libvirt version, and especially a thing in a version from the past, I'd hate to get into the habit of doing a new trait for each of those things because it's such a heavyweight process17:50
dansmithso I think I'm leaning more heavily towards a service version and a hypervisor filter for this kind of stuff17:50
bauzasin the past, we left people to configure their clouds the way they wanted if they had mixed virt drivers17:50
dansmithhowever, gibi also approved the spec with the trait, so might be good to get him to weigh in there17:50
bauzasthe spec never got approved, right?17:51
bauzasoh my bad, no17:51
dansmithbauzas: right the mixed hypervisor thing is definitely a concern, but this is also the sort of thing where a flavor isn't likely to work between ironic and libvirt anyway,17:51
dansmithso I guess I'm less concerned about that specifically17:51
dansmithbauzas: yeah, it's approved17:51
bauzasin the past, we leaned towards saying it's the operator responsibility to make the matching that works17:52
bauzaswe had aggregates 17:52
bauzasso I'm surprised that we now all go full steam on adding traits for such things17:52
dvo-plvyeah, unfortinatelly i did not get finnal decision how I should move with my solution according to your thoughts17:53
dansmithbauzas: yeah I mean sean-k-mooney's point of *not* making it "some assembly required" definitely resonates with me- I want us to do the right thing when we can17:53
dansmithbut a trait is pretty heavy for each "just need a new libvirt version" sort of thing,17:54
bauzasdvo-plv: I'm sorry I went into trampling your previous approval, but I guess my disapproval is more about the pattern we follow than your own spec itself17:54
dansmithand I do not think we assert that all the extra specs are schedulable17:54
bauzasdansmith: yeah, so let's do some kind of automatic filter that would say 'I'm libvirt specific, please give me a libvirt thingies'17:54
bauzasif we reconsider the operator experience and the burden of setting aggregates17:55
dansmithI gotta jump to my next thing, but yeah I think I agree17:56
bauzasfwiw, we have filters that do mappings between flavors/images and aggregates17:57
bauzasoperators could just amend their 'libvirt' aggregates by adding an agg property equal to hw:virtio_packed_ring and problem is solved17:59
bauzasbut that's adding some operator burden I don't disagree17:59
dvo-plvits okay, never mind 17:59
sean-k-mooneybauzas: i think a post schduler filter is far to heavy weight for this19:17
sean-k-mooneywe can do a min compute server version check if we really want19:17
sean-k-mooneyi feel like that is the midel gorund we can all live with but that means makign this feature unaviabel during the upgrade to caracal19:18
sean-k-mooneyi think that is an ok requirement but its somethign we could have supproted via the trait if we wanted too19:18
bauzasyou know what ? I'm tired of this, so let's just use the trait19:28
bauzasI'll just don't +2 the spec19:30
gmannsean-k-mooney: just in case you missed these virt driver deprecation backport https://review.opendev.org/q/topic:deprecate-virt-drivers+status:open21:11

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!