opendevreview | Takashi Kajinami proposed openstack/nova master: Use consistent program name for wsgi scripts and entry points https://review.opendev.org/c/openstack/nova/+/942605 | 03:20 |
---|---|---|
opendevreview | ChungWon Lee proposed openstack/nova master: Remove condition for kvm serial condition https://review.opendev.org/c/openstack/nova/+/947780 | 05:28 |
opendevreview | ChungWon Lee proposed openstack/nova master: Remove condition for kvm serial console https://review.opendev.org/c/openstack/nova/+/947780 | 05:29 |
opendevreview | Arnaud Morin proposed openstack/nova master: Fix limit when instances are stuck in build_requests https://review.opendev.org/c/openstack/nova/+/947804 | 08:48 |
opendevreview | Lajos Katona proposed openstack/nova master: blueprint: iothreads-for-instances https://review.opendev.org/c/openstack/nova/+/939254 | 09:26 |
opendevreview | Kamil Sambor proposed openstack/nova master: Replace eventlet sleep with time.sleep https://review.opendev.org/c/openstack/nova/+/947212 | 10:06 |
*** ralonsoh_ is now known as ralonsoh | 10:19 | |
*** haleyb|out is now known as haleyb | 12:57 | |
gibi | dansmith: welcome back! We made progress with the eventlet removal. You can read up on it here https://gibizer.github.io/categories/eventlet/ or look at the patches here https://review.opendev.org/q/topic:%22eventlet-removal%22+project:openstack/nova | 13:20 |
dansmith | ack! | 13:23 |
bauzas | gibi: nice blog writing on eventlet effort ! If I can help by reviewing patches, lemme know because now I better have context :) | 14:07 |
opendevreview | Thomas Rotsaert proposed openstack/nova master: feat(affinity) - Create Affinity/Anti-Affinity based on a new hypervisor property: hypervisor_rack https://review.opendev.org/c/openstack/nova/+/947833 | 14:16 |
opendevreview | Thomas Rotsaert proposed openstack/nova master: feat(affinity) - Create Affinity/Anti-Affinity based on a new hypervisor property: hypervisor_rack https://review.opendev.org/c/openstack/nova/+/947833 | 14:20 |
andrewbonney | If any cores have a second could I request a review of the small bug fix in https://review.opendev.org/c/openstack/nova/+/919961? Sean has added +2 already | 15:15 |
Uggla | Nova meeting in around 20mn | 15:41 |
kevko | Hi folks, I would like to ask ... if i have auto_converge to false , complete_action to force_complete , timeouts set , post_copy disabled ...why my instance is in infinity live-migrate action ? I would say that if there is instance under heavy LOAD and there are too many dirty pages to transfet which can't be handled ..it should fail because of | 15:43 |
kevko | timeout and then PAUSE trigerred ... I can imagine that auto_converge and post copy can help ..but i can't turn on because of customer ... But i am curious ..why nova is not pausing instance and instead of this ...still trying to copy ram ...of course only last 5 percent which is permanently changing because of heavy load to instanced | 15:43 |
sean-k-mooney | kevko: simple answer is because nova does not contole live migration | 15:59 |
sean-k-mooney | kevko: basically once we call libvirt or any other hypervior to start the migration we are hands off until it completes | 15:59 |
sean-k-mooney | the timeout and timeout action are the excpetion | 15:59 |
kevko | sean-k-mooney: it's normally working for normal instances | 16:00 |
sean-k-mooney | if you set ti to force compelte we can ask libvirt to force compelte the migration which involves pausign the guest on the dest and then compleinting the copy | 16:00 |
kevko | sean-k-mooney: yes, of course | 16:00 |
Uggla | hi guys, sorry to interrupt, we gonna start the meeting. | 16:00 |
kevko | sean-k-mooney: so why it is not paused ? and during that time the RAM copied ? | 16:00 |
Uggla | #startmeeting nova | 16:01 |
opendevmeet | Meeting started Tue Apr 22 16:01:07 2025 UTC and is due to finish in 60 minutes. The chair is Uggla. Information about MeetBot at http://wiki.debian.org/MeetBot. | 16:01 |
opendevmeet | Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. | 16:01 |
opendevmeet | The meeting name has been set to 'nova' | 16:01 |
Uggla | Hello everyone | 16:01 |
sean-k-mooney | kevko: because it would not be a lilve migration any more, but we can talk about that after the meeting | 16:01 |
Uggla | awaiting a moment for people to join. | 16:01 |
kevko | sean-k-mooney: okay, i will wait ... | 16:02 |
gibi | o/ | 16:02 |
bauzas | o/ | 16:02 |
elodilles | o/ | 16:02 |
sean-k-mooney | o/ | 16:04 |
Uggla | #topic Bugs (stuck/critical) | 16:04 |
Uggla | #info No Critical bug | 16:04 |
Uggla | #topic Gate status | 16:04 |
Uggla | #link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure Nova gate bugs | 16:04 |
Uggla | #link https://etherpad.opendev.org/p/nova-ci-failures-minimal | 16:04 |
Uggla | #link https://zuul.openstack.org/builds?project=openstack%2Fnova&project=openstack%2Fplacement&branch=stable%2F*&branch=master&pipeline=periodic-weekly&skip=0 Nova&Placement periodic jobs status | 16:05 |
bauzas | nothing to say | 16:05 |
Uggla | #info Please look at the gate failures and file a bug report with the gate-failure tag. | 16:05 |
Uggla | #info Please try to provide meaningful comment when you recheck | 16:05 |
Uggla | tbh, I have not looked at the jobs this time. Shame on me. | 16:05 |
Uggla | thanks bauzas for the info. | 16:06 |
Uggla | #topic tempest-with-latest-microversion job status | 16:06 |
Uggla | #link https://zuul.opendev.org/t/openstack/builds?job_name=tempest-with-latest-microversion&skip=0 | 16:06 |
Uggla | #link https://zuul.opendev.org/t/openstack/builds?job_name=tempest-with-latest-microversion&skip=0 | 16:07 |
Uggla | Status is the same as I have not checked with Gmann about it. | 16:07 |
Uggla | #topic PTG summary | 16:08 |
Uggla | #link https://etherpad.opendev.org/p/r.bf5f1185e201e31ed8c3adeb45e3cf6d | 16:08 |
Uggla | as a reminder | 16:08 |
Uggla | #topic Release Planning | 16:08 |
Uggla | #link https://releases.openstack.org/flamingo/schedule.html | 16:08 |
Uggla | #info Nova deadlines are set in the above schedule | 16:08 |
Uggla | #topic Review priorities | 16:09 |
Uggla | #link https://etherpad.opendev.org/p/nova-2025.2-status | 16:09 |
Uggla | Still draft. | 16:09 |
Uggla | #topic Stable Branches | 16:10 |
Uggla | elodilles, please go ahead. | 16:10 |
elodilles | thanks o/ | 16:10 |
elodilles | #info stable gates seems to be in good health | 16:10 |
elodilles | #info stable/2023.2 (bobcat) will move to End of Life next week | 16:10 |
Uggla | \o/ | 16:10 |
elodilles | #info final 2023.2 bobcat releases are out: nova 2023.2 (28.3.1), placement 2024.2, 2024.1, 2023.2 (12.0.1, 11.0.1, 10.0.1) | 16:10 |
elodilles | thanks melwitt and sean-k-mooney for the tons of stable reviews o/ | 16:11 |
Uggla | melwitt++ | 16:11 |
gibi | indeed, thanks for making these releases full of backports | 16:11 |
Uggla | sean-k-mooney++ | 16:11 |
elodilles | i think we just have to wait until the deadline and hope we did not introduce regressions o:) | 16:12 |
gibi | nah, what regressions, those are upgrade oportunities to caracal :) | 16:12 |
elodilles | :] | 16:12 |
elodilles | yepp :) | 16:12 |
Uggla | gibi, I like your mindset ! :) | 16:12 |
elodilles | btw, i've prepared nova stable releases for dalmatian and caracal as well | 16:13 |
elodilles | to not leave everything till the end o:) | 16:13 |
elodilles | nova: Release 2024.1 Caracal 29.2.1 https://review.opendev.org/c/openstack/releases/+/947846 | 16:13 |
gibi | early upgrade opportunities :) | 16:13 |
elodilles | nova: Release 2024.2 Dalmatian 30.0.1 https://review.opendev.org/c/openstack/releases/+/947847 | 16:13 |
elodilles | #info stable branch status / gate failures tracking etherpad: https://etherpad.opendev.org/p/nova-stable-branch-ci | 16:14 |
elodilles | please add here if you find any gate issue ^^^ | 16:14 |
elodilles | and that's all about stable from me for now | 16:14 |
elodilles | Uggla: back to you | 16:14 |
Uggla | thx elodilles ! | 16:14 |
elodilles | np | 16:14 |
Uggla | #topic Gibi's news about eventlet removal | 16:15 |
Uggla | #link Series: https://gibizer.github.io/categories/eventlet/ | 16:15 |
Uggla | #link reminder patches to review: https://review.opendev.org/q/topic:%22eventlet-removal%22+project:openstack/nova | 16:15 |
Uggla | gibi, anything new you want to share ? | 16:15 |
gibi | nope, you shared all what I had :) | 16:16 |
bauzas | as I said, I can help for reviewing the series | 16:16 |
gibi | any help is appreciated :) | 16:16 |
bauzas | cool | 16:16 |
Uggla | I know sean-k-mooney already reviewed some of them. | 16:16 |
Uggla | Having someone else would be great. | 16:17 |
sean-k-mooney | some yes melwitt looked at a couple too | 16:17 |
sean-k-mooney | but ya more the merrier | 16:17 |
Uggla | ok moving on to next topic | 16:19 |
Uggla | #topic Bug scrubbing | 16:19 |
Uggla | So I wanted to clean up a bit the new bugs. But I didn't have a chance to to it yet. Anyway I have selected 5 new bugs that looks properly defined. | 16:20 |
Uggla | So first one : https://bugs.launchpad.net/nova/+bug/2104255 | 16:20 |
Uggla | So my expected outcome would be to know if we think it is a valid bug or not. If it is incomplete what to ask. | 16:22 |
sean-k-mooney | its valid. i looked at it breifly and chated about it on irc | 16:22 |
bauzas | Seems a caching issue | 16:23 |
bauzas | so yeah valid | 16:23 |
sean-k-mooney | tl;dr if the nova-comptue agent is started when the vf is attached to the vm | 16:23 |
sean-k-mooney | we cant get the network capablities | 16:23 |
sean-k-mooney | and we cache the vf info on start up in both nova and libvirt | 16:23 |
sean-k-mooney | libvirt invalidates its cache based on udev events | 16:23 |
sean-k-mooney | i.e. when the vf is returned to the host | 16:24 |
sean-k-mooney | nova does nto invlaidate its cache so that part of the problem | 16:24 |
Uggla | ok cool do not worry about settings the bug fields I'll do that later after the call | 16:24 |
sean-k-mooney | lets mark it valid and medium | 16:24 |
Uggla | great | 16:24 |
sean-k-mooney | and we can decied how to fix it later | 16:24 |
Uggla | next one: | 16:24 |
Uggla | https://bugs.launchpad.net/nova/+bug/2105987 | 16:25 |
sean-k-mooney | so melwitt has a patch that might fix this indirectly | 16:26 |
sean-k-mooney | gibi: do you know if we merged the one to auto discover the config options and use the sdk to get the limits? | 16:26 |
gibi | sean-k-mooney: not top of my head | 16:27 |
sean-k-mooney | ack | 16:27 |
gibi | but the bug above links to a separate oslo.limit fix | 16:27 |
sean-k-mooney | so if this config option is not being repected in nova today | 16:27 |
sean-k-mooney | then its valid but we need to confirm that | 16:27 |
gibi | so I'm wondering if just using the latest oslo.limits fixes it for nova | 16:27 |
sean-k-mooney | ya i suspect that might be hte case | 16:27 |
sean-k-mooney | although im a bit confused | 16:28 |
sean-k-mooney | because the also have endpoint_interface | 16:28 |
sean-k-mooney | which i tought was ment to do the same thing as the new filed they just added | 16:28 |
sean-k-mooney | https://review.opendev.org/c/openstack/oslo.limit/+/946128/4/oslo_limit/opts.py | 16:28 |
sean-k-mooney | maybe we should ask if they set endpoint_interface=internal | 16:29 |
sean-k-mooney | in the bug and come back to this | 16:29 |
sean-k-mooney | ill go ask that | 16:29 |
Uggla | ok so we can set it to incomplete and then review it when it will be back in new. | 16:30 |
sean-k-mooney | tkajinam: sicne you wrote https://review.opendev.org/c/openstack/oslo.limit/+/946128 you might be able to add some context to https://bugs.launchpad.net/nova/+bug/2105987 | 16:31 |
sean-k-mooney | Uggla: for what its worht we configure our dwonstream installer to use internal too | 16:31 |
sean-k-mooney | so im not entirly sure why the oslo change is requried | 16:32 |
sean-k-mooney | but we can move on | 16:32 |
Uggla | ok next one: https://bugs.launchpad.net/nova/+bug/2106073 this one was opened by Mohammed Naser, corfirmed on Ironic. And I'm pretty sure Mohammed proposed a fix for it. | 16:33 |
Uggla | btw I have just noticed sean-k-mooney commented on it. | 16:34 |
sean-k-mooney | ya so this is unfixable in nova without rewriting parts of the ironic deriver | 16:35 |
sean-k-mooney | tl;dr ironic is modifying port bidinges, vnic typs and other atributes out of band of nova | 16:36 |
sean-k-mooney | that means that the info we provide in teh configdrive string can never be correct becuase ironic changes things in neutron that its not allowed too after we call provision | 16:36 |
Uggla | ok but that sounds a valid bug despite we rely on Ironic to fix it ? | 16:37 |
sean-k-mooney | so if itgs going to do that it also need ot update the config drive info it has | 16:37 |
sean-k-mooney | ya its somewhat valid to fix in ironic | 16:37 |
sean-k-mooney | again they are modifying neutron ports in a way that only valid wehn using ironic without nova | 16:37 |
sean-k-mooney | that the root of the issue. | 16:37 |
sean-k-mooney | ironic port groups only exixits in the ironic api | 16:38 |
sean-k-mooney | they are not modled in nova or neutron | 16:38 |
sean-k-mooney | so there is no portable way to express nic bonds ectra today | 16:38 |
Uggla | should we set it as "won't fix" on our side ? | 16:39 |
sean-k-mooney | i would proably mark the nova sid as cant fix | 16:39 |
sean-k-mooney | wont fix is fine too | 16:39 |
sean-k-mooney | i think we have invalid? | 16:39 |
sean-k-mooney | not sure which is better | 16:39 |
Uggla | yes invalid is an option too | 16:39 |
sean-k-mooney | anywya the tl;dr is because ironic and the ironic virt driver are doing thign virt drivers are not allowed too it can only be "fixed" in ironic | 16:40 |
Uggla | I vote for invalid on our side. Explaing ^ any objection with that ? | 16:40 |
Uggla | 3... | 16:41 |
Uggla | 2... | 16:42 |
Uggla | 1... | 16:42 |
Uggla | ok let's do that. | 16:42 |
Uggla | next one: https://bugs.launchpad.net/nova/+bug/2106380 | 16:43 |
sean-k-mooney | that seam valid ish | 16:44 |
sean-k-mooney | if we delete a vm we should remove it form instance_group_member in some form | 16:44 |
sean-k-mooney | i suspect that wont happen until we archive it however | 16:44 |
sean-k-mooney | although we dont have shadow tables in the api db | 16:45 |
sean-k-mooney | so perhaps we should just delete it when the vm is deleted | 16:45 |
gibi | yeah it seems like we should remove the vm from instance_group_member during delete, but it is hard as it is in the api db while the vm is in the cell db | 16:46 |
sean-k-mooney | there is a seperate bug for the slownes due to the request spec | 16:46 |
sean-k-mooney | loading | 16:46 |
sean-k-mooney | so we can keep that out of scope | 16:46 |
sean-k-mooney | so dansmith and i were wondering about a related topic. if you do not have api access from teh cell condutor and do local delete beacuse the conenction to the cell is down | 16:47 |
sean-k-mooney | will anything clean up the instnace mapping? | 16:47 |
sean-k-mooney | like to me this feels similar i feel like the instnace group membership is somethign we shoudl eb cleaning up once the instnace goes to delete in the cell db | 16:48 |
gibi | I think it would have the same problem that the VM delete happens in the cell and we don't have an up call | 16:48 |
sean-k-mooney | but i dont know if anythign will actully do it today | 16:48 |
sean-k-mooney | gibi: this kind of feels like something the supercondutor could do as a perodic | 16:48 |
gibi | so the only reliable way would be a periodic | 16:49 |
gibi | yepp | 16:49 |
sean-k-mooney | but i belive that would be the first such perodic in teh super conductor | 16:49 |
gibi | yeah, I think so | 16:49 |
gibi | not impossible to do just not a low hanging bugfiz | 16:49 |
gibi | x | 16:49 |
sean-k-mooney | Uggla: to summerise, this is valid, it proably need a new periodic and there are likely other cases like this that should be fixed | 16:49 |
Uggla | perfect. | 16:50 |
sean-k-mooney | i would also agree not a low hanging fuirt but perhaps low to medium imporantce | 16:50 |
Uggla | Latest one for this week: https://bugs.launchpad.net/nova/+bug/2106817 | 16:52 |
gibi | no stack trace so it is hard to tell | 16:55 |
sean-k-mooney | so neutron supproted adressless port long before nova did | 16:56 |
sean-k-mooney | sylvain added supprot for adressless port but i dont know if that in antelope? | 16:56 |
bauzas | bobcat iirc | 16:56 |
sean-k-mooney | https://specs.openstack.org/openstack/nova-specs/specs/yoga/implemented/boot-vm-with-unaddressed-port.html | 16:57 |
bauzas | or maybe antelope, can't remember this | 16:57 |
sean-k-mooney | it was actully yoga | 16:57 |
bauzas | ohoho | 16:57 |
sean-k-mooney | but i do know there are still some issue with metadta | 16:57 |
sean-k-mooney | so they are aslo using trunk ports | 16:58 |
sean-k-mooney | whic i dont know if that was tested as part of this | 16:58 |
sean-k-mooney | they are also doing this via heat | 16:59 |
sean-k-mooney | which is fun... | 16:59 |
Uggla | maybe we can ask for a reproducer. At least the procedure to reproduce it | 16:59 |
Uggla | without heat | 16:59 |
sean-k-mooney | i knwo that heat uses the same micorverion for all nova calls | 16:59 |
sean-k-mooney | and it default to a pretty old one | 16:59 |
sean-k-mooney | Uggla: yes i think we shoudl ask for a simple repoducer without heat | 17:00 |
sean-k-mooney | what im unsure about is is this a nova or neutron issue | 17:00 |
sean-k-mooney | bauzas: i dotn think we added a an api microversion for this correct | 17:00 |
sean-k-mooney | Uggla: supproting adress port in combiniation with turnk ports was not in socpe fo the spec | 17:01 |
sean-k-mooney | so if this is repoducable without heat | 17:01 |
sean-k-mooney | we have two choices | 17:02 |
sean-k-mooney | treat it as a new feature to actully supprot using both feature togetehr | 17:02 |
sean-k-mooney | or consider it a bug in the existing implimation | 17:02 |
Uggla | Ok, that works for me — incomplete plus a reproducer. That way, we’ll be able to narrow down the possible issue. | 17:03 |
sean-k-mooney | i think to do it properly in nova nova woudl have to check the port bidning of all the sub ports instead fo teh trunk port | 17:03 |
sean-k-mooney | so im inclidne to say this is a feature request but sure lest see wht they come back with | 17:03 |
Uggla | yep I was thinking the same. Maybe that's not a "bug". But it will end up in a feature request. | 17:04 |
Uggla | cool ok thanks all for this triage session. Latest topic | 17:05 |
Uggla | #topic Open discussion | 17:05 |
Uggla | one point from my side | 17:05 |
Uggla | Next week's meeting: I’ll be on vacation and unable to run it. Would someone be available to run it on my behalf? | 17:05 |
Uggla | I'll propably prepare a list of bug to triage. (but that's not mandatory) | 17:06 |
bauzas | Sorry, I can only help for the first 30 mins | 17:07 |
bauzas | (double meeting at the same time with TC one started) | 17:07 |
Uggla | bauzas, ok better than nothing. IMHO. | 17:08 |
bauzas | okay, I'll try to help then | 17:09 |
bauzas | let's then do a short one | 17:09 |
Uggla | ok | 17:10 |
Uggla | Time to close. | 17:10 |
gibi | o/ | 17:10 |
Uggla | sorry for the 10mn overtime. | 17:10 |
sean-k-mooney | o/ | 17:11 |
Uggla | Thanks all | 17:11 |
Uggla | #endmeeting | 17:11 |
opendevmeet | Meeting ended Tue Apr 22 17:11:14 2025 UTC. Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4) | 17:11 |
opendevmeet | Minutes: https://meetings.opendev.org/meetings/nova/2025/nova.2025-04-22-16.01.html | 17:11 |
opendevmeet | Minutes (text): https://meetings.opendev.org/meetings/nova/2025/nova.2025-04-22-16.01.txt | 17:11 |
opendevmeet | Log: https://meetings.opendev.org/meetings/nova/2025/nova.2025-04-22-16.01.log.html | 17:11 |
Uggla | That's in the box. | 17:11 |
Uggla | I hope the bug triage was ok. Feel free to let me know if something is not appreciated. | 17:12 |
sean-k-mooney | Uggla:i think it was effeective | 17:12 |
sean-k-mooney | thats all triage is really ment to be, look at the issue, see if there is enough info to say if its valid or not, ask for it if its missing and mark it triaged if there is | 17:13 |
sean-k-mooney | if we have an idea of how to fix it even better but we dont need to actully come up with the solution but its good to take notes in the bugs if we know of any workarounds | 17:14 |
sean-k-mooney | gibi: i know you less of a fan of doing this on the fly, how did you find it? | 17:14 |
Uggla | yep, but that's the first time I'm doing this with IRC. So I wonder about the timing. | 17:14 |
Uggla | Of course if you are uncomfortable to do it on the fly, I could send the bugs upfront (e.g Friday). Just let me know. | 17:15 |
sean-k-mooney | kevko: so looping back to your live migration quetion | 17:18 |
sean-k-mooney | looking at the code im not sure why you are not seeing the guest pause as expected | 17:18 |
sean-k-mooney | waht i will say is we likely shoudl add some extra logging | 17:18 |
sean-k-mooney | becuase there isnt a debug level log showitn that we took the pause codepath unless there is an excption when we call libvirt | 17:19 |
sean-k-mooney | kevko: it might be worht checkign your libvirt logs for the domain to see if there is a pause request or any error on teh libvirt side | 17:19 |
sean-k-mooney | kevko: it would be also good to confirm thate you do not see teh "failed to pause" warning https://github.com/openstack/nova/blob/master/nova/virt/libvirt/migration.py#L755-L769 | 17:20 |
opendevreview | Merged openstack/nova master: Remove eventlet based WSGI server entry points https://review.opendev.org/c/openstack/nova/+/947240 | 17:22 |
sean-k-mooney | :) | 17:24 |
sean-k-mooney | kevko: at the very minium we should likely imporve debug loging to that its similar to understand why you are not seeing the behvior we expect | 17:25 |
sean-k-mooney | kevko: it may be good to report a bug for this but likely it will take effort to try and repoduce | 17:25 |
sean-k-mooney | kevko: hopefully the issue you are seeign is not disto/version sepciifc. | 17:26 |
opendevreview | Merged openstack/nova master: Remove WSGIService and WSGIServer classes https://review.opendev.org/c/openstack/nova/+/947245 | 17:38 |
sean-k-mooney | Uggla: we triaged some bugs in the meeting but i dont see that reflected in launchpad | 18:37 |
sean-k-mooney | Uggla: are you going to update them async based on what we discussed | 18:37 |
opendevreview | Merged openstack/nova master: Fix description of [pci] alias https://review.opendev.org/c/openstack/nova/+/944611 | 22:28 |
Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!