Tuesday, 2024-11-26

opendevreviewMerged openstack/nova master: Add metadata for shares  https://review.opendev.org/c/openstack/nova/+/85050008:40
opendevreviewRajesh Tailor proposed openstack/nova master: Fix instance vm_state during shelve  https://review.opendev.org/c/openstack/nova/+/93429410:07
opendevreviewIvan Tkachuk proposed openstack/nova master: Reduce calls to qemu-img for disk_info  https://review.opendev.org/c/openstack/nova/+/93624611:34
opendevreviewBalazs Gibizer proposed openstack/nova master: Show candidate combinatorial explosion by dev number  https://review.opendev.org/c/openstack/nova/+/85588512:33
opendevreviewRajesh Tailor proposed openstack/nova-specs master: Show finish_time field in instance action show  https://review.opendev.org/c/openstack/nova-specs/+/92978013:08
greatgatsbyGood day.  I think we've found a bug in os-brick fibre_channel.py and just looking to discuss quickly before I submit a ticket or even a PR (I have a working fix currently).  Is this the best place to ask about it?14:47
fricklergreatgatsby: well technically os-brick belongs to the cinder team, but there might be some overlap. I'd still suggest to start in #openstack-cinder first14:54
greatgatsbythanks, will do14:54
sean-k-mooneygibi: when you have time would you mind reviewing this revert https://review.opendev.org/c/openstack/nova/+/90912215:41
sean-k-mooneyit's related to our scp conversation15:41
gibisean-k-mooney: good point. thanks. +A15:45
sean-k-mooneygibi: that's what is required to fix using ip addresses for migration indirectly15:46
gibiI see15:46
sean-k-mooneythat is why we don't need the blueprint for that feature15:47
sean-k-mooneyit was only failing because of this scp.15:47
sean-k-mooneyif you set the migration/live migration inbound address to an ip it will work once this is reverted15:47
sean-k-mooneythat workaround is hardcoded to use instance.host as the source host to pull the file from15:48
bauzas#startmeeting nova16:00
opendevmeetMeeting started Tue Nov 26 16:00:27 2024 UTC and is due to finish in 60 minutes.  The chair is bauzas. Information about MeetBot at http://wiki.debian.org/MeetBot.16:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.16:00
opendevmeetThe meeting name has been set to 'nova'16:00
bauzas#link https://wiki.openstack.org/wiki/Meetings/Nova#Agenda_for_next_meeting16:01
tkajinamo/16:01
elodilleso/16:01
s3rj1khi all16:01
Ugglao/16:02
bauzashey16:03
bauzasstarting slowly16:04
bauzas#topic Bugs (stuck/critical) 16:04
bauzas#info No Critical bug16:04
bauzas#info Add yourself in the team bug roster if you want to help https://etherpad.opendev.org/p/nova-bug-triage-roster16:05
bauzasany questions about bugs ?16:05
bauzasok moving on16:07
bauzas#topic Gate status 16:07
bauzas#link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure Nova gate bugs 16:07
bauzas#link https://etherpad.opendev.org/p/nova-ci-failures-minimal16:07
bauzas#link https://zuul.openstack.org/builds?project=openstack%2Fnova&project=openstack%2Fplacement&branch=stable%2F*&branch=master&pipeline=periodic-weekly&skip=0 16:07
bauzas#info Please look at the gate failures and file a bug report with the gate-failure tag.16:08
bauzas#info Please try to provide meaningful comment when you recheck16:08
bauzasI saw a couple of failures but those are known issues16:08
bauzasanything about CI failures that is pretty new ?16:08
bauzas(all periodics are green)16:08
bauzaslooks not, moving on16:10
bauzas#topic Release Planning 16:10
bauzas#link https://releases.openstack.org/epoxy/schedule.html16:10
bauzas#action bauzas to add Epoxy nova deadlines in the schedule16:10
bauzasI'm pretty done with the patch proposal but I need to fix something before uploading it16:10
bauzas#topic Review priorities 16:11
bauzas#link https://etherpad.opendev.org/p/nova-2025.1-status16:11
bauzasthe page should be up to date, feel free to use it and amend it16:11
bauzasanything about that ?16:12
gibio/16:13
sean-k-mooneyo/ nothing from me on that topic16:13
bauzascool16:13
bauzas#topic Stable Branches 16:13
bauzaselodilles: shoot16:13
elodilles#info stable/2024.2 gate seems to be OK16:14
elodilles#info stable/2024.1 gate is blocked on grenade-skip-level & stable/2023.2 is blocked on nova-grenade-multinode16:14
elodillesfailure is due to stable/2023.1->unmaintained/2023.1 transition, devstack and grenade fixes are proposed16:14
elodillesand actually the 2024.1 branch fix (grenade) patch is already in the gate queue16:14
elodillesthough: other workaround is to set these jobs as non-voting - given that gate should not rely on an unmaintained branch16:15
elodillessee further details:16:15
elodilles#info stable branch status / gate failures tracking etherpad: https://etherpad.opendev.org/p/nova-stable-branch-ci16:15
elodillesand that's all from me about stable branches now16:15
bauzasthanks16:16
elodillesnp16:16
bauzas#topic vmwareapi 3rd-party CI efforts Highlights 16:16
bauzasfwiesel: around ?16:16
bauzaslooks he's AFK16:17
bauzasno worries, moving on16:17
fwieselSorry, I am here16:17
bauzasah16:17
bauzasanything to raise from your side ?16:17
fwieselThere was a regression in oslo.utils (master) and I have created a change to fix it: https://review.opendev.org/c/openstack/oslo.utils/+/93624716:17
fwieselHopefully the builds will be back to the two failures and I will tackle these then.16:18
fwieselThat's from my side16:18
sean-k-mooneyah is that related to removing netifaces16:18
bauzasokay, gtk16:18
bauzasthanks16:18
* tkajinam is aware of the proposed fix and will ping the other cores to get in16:19
tkajinamget that in16:19
bauzasnice, thanks tkajinam 16:19
tkajinamfwiesel, if you need a new release with the fix early then ping me16:19
tkajinamonce that is merged16:19
fwieseltkajinam: Thanks, I'll let you know16:20
bauzascool16:20
bauzasthen moving to the last item from the agenda16:20
bauzas#topic Open discussion 16:20
bauzasanything in the agenda, so anything anyone ?16:21
s3rj1kthere is this https://bugs.launchpad.net/nova/+bug/208938616:21
sean-k-mooneyi have one followup from last week too16:21
sean-k-mooneylet's start with s3rj1k's topic16:21
bauzasok, s3rj1k, shoot16:22
s3rj1kidea is to allow for host discovery to be concurrent, both cli and internal using distributed locking16:23
sean-k-mooneyso perhaps i can provide some context16:24
s3rj1kthis is mostly needed for k8s-like envs where discovery is run in multiple places 16:24
sean-k-mooneys3rj1k is interested in using the discover hosts periodic in an HA env16:24
bauzass3rj1k: I think that topic requires a proper discussion that can't be done during a meeting16:25
sean-k-mooneycurrently we require that if you use the periodic it's enabled on at most one host16:25
sean-k-mooneythey would like to address that pain point16:25
bauzasif we want to discuss about the design, it has to be an async conversation that has to be in a proper formatted document16:25
bauzasthat's the reason why we introduced our specification program for those kind of feature requests16:26
s3rj1kbauzas: spec? or rfe is enough for this time?16:26
sean-k-mooneyso this would definitely be a spec if you were going to work on it16:26
bauzass3rj1k: are you familiar with spec writing or do you need guidance ?16:26
s3rj1kbauzas: done one for neutron, so all ok16:27
sean-k-mooneyi think before going that far however s3rj1k wanted some initial feedback on whether this is in scope of nova to fix16:27
bauzassean-k-mooney: well, I'm not sure we have a quorum today for such design discussion16:27
bauzasif that was something before the PTG, we would have said "sure, just add that to the PTG and we'll discuss it"16:28
sean-k-mooneythats still an option16:28
sean-k-mooneyi suggested that s3rj1k bring it here to advertise that it exists16:28
bauzashonestly, I haven't yet formally written the nova deadlines for Epoxy but we're already running short in tome16:28
sean-k-mooneyand then start either a mailing list or spec discussion after that16:28
bauzastime*16:28
bauzaswhat exact problem are we trying to solve then ?16:29
sean-k-mooneycurrently if you enable the discover hosts periodic task in more than one scheduler it can get duplicate key errors from the db16:29
bauzasare we speaking of concurrent nova-scheduler services that need to be HA active-active for X reasons ?16:29
sean-k-mooneyas 2 process can race to create the mappings16:29
sean-k-mooneyleading to errors in the logs16:29
sean-k-mooneywe don't actually support that today16:30
bauzasI think we always said that nova-scheduler has to be active-passive16:30
sean-k-mooneybut our documentation on that is kind of lacking16:30
sean-k-mooneyno16:30
bauzasI pretty bet we documented it16:30
sean-k-mooneythe scheduler has been supported in active active for a very long time16:30
bauzasnever16:30
sean-k-mooneyyes16:30
tkajinamas far as I can tell Tripleo in the past deployed it in all controllers16:31
bauzaswith placement, we thought that we /could/ run it active-active but there were reasons not to16:31
sean-k-mooneynope 16:31
bauzastkajinam: which was a bug that we raised a couple of times16:31
sean-k-mooneydownstream it's been active active since like 16 maybe before16:31
bauzasand I think TripleO changed it to A-P16:31
bauzasfor that exact reason16:31
sean-k-mooneynope16:31
tkajinamno16:31
sean-k-mooneyok well i think we need a longer discussion on this RFE request16:32
sean-k-mooneylikely a spec and we probably don't have time to complete it in epoxy16:32
sean-k-mooneybut we should discuss this more async16:32
s3rj1kno prob, thanks sean-k-mooney for taking a lead on explaining16:33
bauzasI have to admit that none of that tribal knowledge is written in https://docs.openstack.org/nova/latest/admin/scheduling.html16:34
sean-k-mooneyits also not in the config option16:34
sean-k-mooneyi left my initial feedback on the bug when i triaged it as opinion16:35
sean-k-mooneyi didn't mark it as invalid as i thought we should at least discuss it more widely first16:35
bauzasfor now, we should document that active-passive HA configuration for sure16:36
sean-k-mooneyfor the periodic only16:36
bauzasbecause indeed, we know that there is no eventual consistency between schedulers16:36
sean-k-mooneythe scheduler should generally be deployed active active16:36
bauzasthat's your opinion :)16:36
sean-k-mooneybut also the periodic has performance issues16:36
sean-k-mooneybauzas: it's what we use in our product16:36
sean-k-mooneyand what almost all installers do by default16:36
bauzashttps://specs.openstack.org/openstack/nova-specs/specs/abandoned/parallel-scheduler.html16:37
tkajinamyeah > almost all installer do by default16:37
sean-k-mooneythat's a different proposal16:38
bauzasI literally quote the first sentence of that spec :16:38
bauzas"If you running two nova-scheduler processes they race each other, they don’t find out about each others choices until the DB gets updated by the nova-compute resource tracker. This has lead to many deployments opting for an Active/Passive HA setup for the nova-scheduler process."16:38
tkajinampeople may not prefer using act-act for simplicity and avoid clustering mechanism to implement active-passive.16:39
tkajinamwithout large warning :-P16:39
sean-k-mooneybauzas: that does not really apply as of placement16:39
sean-k-mooneybauzas: i would consider it to be very incorrect advice to document that active active is not supported16:39
gibiyeah the goal of placement is to shrink the race window between parallel schedulers16:40
gibiit is a solved problem for those resources that are tracked in placement16:41
bauzasI don't disagree with the fact that HA active-active schedulers is a problem to solve16:41
gibifor those that are not tracked there, the compute manager has a lock around claim to prevent overallocation16:41
gibiand we have alternatives to reschedule16:41
bauzasgibi: exactly, hence the A/P mechanism16:41
gibino this is A A16:41
gibithe only A P problem is in the periodic discovery16:42
bauzasin the very early times, we were considering reschedules as a way to address the problem16:42
bauzaswe stopped that tenet by wanting to reduce the reschedules, leading to indeed a broader problem16:42
gibiwe reduced reschedules with placement16:43
bauzasoriginally, the scheduler wasn't intended to provide an exact solution16:43
gibiand we improved reschedules with alternative generation16:43
bauzasright, which is why we never solved that problem16:43
bauzaswe reduced the scope of reschedules, that's it16:43
sean-k-mooneywe solved it to the point that we recommend active active as the default16:43
gibiin a distributed system you have limits on what you can solve exactly16:43
gibiI agree with sean-k-mooney we can recommend A A16:44
gibiactually OSP 18 does A A A 16:44
gibi(or as many As as you want :D)16:44
sean-k-mooneyright our product does not support active passive but i believe that was true in 17 as well16:44
bauzasA A A is OK to me with resources tracked by placement16:44
sean-k-mooneyanyway perhaps we should move on?16:45
bauzasagreed16:45
sean-k-mooneywe can talk about this more but probably don't need to in the meeting16:45
bauzasand agreed on the fact we need a spec16:45
bauzasbut maybe the solution is to add more resources to placement16:45
sean-k-mooneywell that is the general direction anyway16:46
bauzasor consider this as a non-solvable problem and accepting reschedules as a caveart16:46
bauzascaveat16:46
sean-k-mooneybut that does not address the reported problem16:46
gibion the proposal of a distributed discover I can suggest to do the discover outside of a scheduler periodic to avoid the race16:46
sean-k-mooneynova-audit would 16:46
bauzasanyway, moving on16:46
sean-k-mooneygibi: yes it's very different 16:46
bauzass3rj1k: fancy writing a spec ?16:46
sean-k-mooneybauzas: ack so i had one quick topic16:46
bauzassean-k-mooney: shoot16:46
s3rj1kgibi: similar issue would be with CLI, check out RFE16:46
s3rj1kbauzas: will do16:47
sean-k-mooneyso last week i raised adding rodolfo to os-vif core16:47
sean-k-mooneyi sent a mail to the list and no one objected16:47
gibis3rj1k: I mean if you control to only run the discover from a single CLI session at a time then I assume there is no race16:47
sean-k-mooneyso if there is no other objection here i will proceed with that after the call.16:47
s3rj1kgibi: yes, need external control on how CLI gets run16:48
s3rj1klets move on, yes16:48
gibisean-k-mooney: no objection on my side16:48
bauzassean-k-mooney: no objections indeed16:48
sean-k-mooneyack so that is all i had16:49
tkajinamI have no objections but +1 :-) (I'm not a core, though)16:49
sean-k-mooneyi'll send a mail to the list and then i'll add them after that16:50
sean-k-mooneyjust to keep a record of it beyond this meeting16:50
bauzas++16:51
bauzasokay, then I think we're done for today16:52
bauzasanything else ?16:52
bauzaslooks not16:52
bauzashave a good end of day16:52
bauzasthanks all16:52
bauzas#endmeeting16:52
opendevmeetMeeting ended Tue Nov 26 16:52:50 2024 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)16:52
opendevmeetMinutes:        https://meetings.opendev.org/meetings/nova/2024/nova.2024-11-26-16.00.html16:52
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/nova/2024/nova.2024-11-26-16.00.txt16:52
opendevmeetLog:            https://meetings.opendev.org/meetings/nova/2024/nova.2024-11-26-16.00.log.html16:52
s3rj1kthanks all16:52
tkajinamthanks !16:52
tkajinamnothing urgent but I've added a few patches to drop deprecated/unmaintained deps to review priority list just fyi16:53
elodillesthanks o/16:53
gibio/16:54
opendevreviewBalazs Gibizer proposed openstack/nova master: Show candidate combinatorial explosion by dev number  https://review.opendev.org/c/openstack/nova/+/85588517:16
opendevreviewTakashi Kajinami proposed openstack/nova master: Add unit test coverage of get_machine_ips  https://review.opendev.org/c/openstack/nova/+/93628717:26
opendevreviewDouglas Viroel proposed openstack/nova-specs master: Add spec for show scheduler hints in server details  https://review.opendev.org/c/openstack/nova-specs/+/93614019:01
opendevreviewAndrei Yachmenev proposed openstack/nova-specs master: Dynamic disk qos updates support  https://review.opendev.org/c/openstack/nova-specs/+/93630219:14
opendevreviewMerged openstack/nova master: api: Add response body schemas for remaining server action APIs  https://review.opendev.org/c/openstack/nova/+/91574320:24
*** haleyb is now known as haleyb|out21:12
*** iurygregory__ is now known as iurygregory23:37
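
Illustrative note on the race described at 16:29 (two processes racing to create the same host mapping and getting a duplicate key error from the db): a minimal Python sketch, not nova code. The table name host_mappings, the function names and the use of sqlite3 are assumptions made purely for illustration. It only shows how a unique constraint turns the second insert for the same host into an error, and how a caller could treat that error as "already mapped" instead of a failure.

```python
import sqlite3


def create_schema(conn):
    # Hypothetical table: one row per compute host, unique on the host name.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS host_mappings ("
        " host TEXT PRIMARY KEY,"
        " cell TEXT NOT NULL)"
    )


def map_host_naive(conn, host, cell):
    """Plain INSERT: a second writer for the same host raises IntegrityError."""
    with conn:  # commits on success, rolls back if the INSERT fails
        conn.execute(
            "INSERT INTO host_mappings (host, cell) VALUES (?, ?)",
            (host, cell),
        )


def map_host_tolerant(conn, host, cell):
    """Return True if this caller created the mapping, False if it existed."""
    try:
        map_host_naive(conn, host, cell)
        return True
    except sqlite3.IntegrityError:
        # Another discovery run won the race; the mapping is already there.
        return False


def demo():
    conn = sqlite3.connect(":memory:")
    create_schema(conn)
    first = map_host_tolerant(conn, "compute-1", "cell1")
    second = map_host_tolerant(conn, "compute-1", "cell1")
    count = conn.execute("SELECT COUNT(*) FROM host_mappings").fetchone()[0]
    return first, second, count
```

Calling demo() simulates two discovery runs mapping the same host one after the other; the second run loses the race and reports that the mapping already exists. Whether nova should tolerate the error this way, use distributed locking as proposed in the RFE, or move discovery out of the scheduler periodic was left for the spec discussion.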
