Monday, 2024-10-14

opendevreviewMerged openstack/nova master: Update contributor guide for 2025.1 Epoxy  https://review.opendev.org/c/openstack/nova/+/92991810:16
opendevreviewAmit Uniyal proposed openstack/nova master: Update Nova bdm with updated swap info  https://review.opendev.org/c/openstack/nova/+/92985811:12
opendevreviewAmit Uniyal proposed openstack/nova master: Update Nova bdm with updated swap info  https://review.opendev.org/c/openstack/nova/+/92985811:18
opendevreviewTakashi Kajinami proposed openstack/nova master: Remove libvirt tunnelled migration  https://review.opendev.org/c/openstack/nova/+/87902111:26
opendevreviewMerged openstack/nova master: Imported Translations from Zanata  https://review.opendev.org/c/openstack/nova/+/92971911:44
opendevreviewMerged openstack/python-novaclient master: tox: Drop envdir  https://review.opendev.org/c/openstack/python-novaclient/+/93226311:46
opendevreviewTakashi Kajinami proposed openstack/placement master: Drop db migration tool  https://review.opendev.org/c/openstack/placement/+/93232411:59
tkajinam^^^  I wonder if we can drop the migration script directly or we need deprecation step (I feel like no one may use it from master so direct removal doesn't annoy real users)12:02
sean-k-mooneytkajinam: i assuem we have a placement-manage db_sync command so it shoudl not be in use but im not sure. i have never checked that12:22
sean-k-mooneyoh12:23
sean-k-mooneythis is the tool to migrate form in nova placment to split out placemnet12:23
sean-k-mooneythat a good question we have not requried this for so long that enven the unmaintaiend brances shoudl not need it12:23
sean-k-mooneyso i feel like a direct removal would be ok12:24
sean-k-mooneywe are talking about somethign that shoudl have been done when upgrading to stien so i dont think deprecation of a shel script really shoudl be required at this point12:24
opendevreviewBalazs Gibizer proposed openstack/nova master: Revert "Test live migration between hosts with differnet cpu_shared_sets"  https://review.opendev.org/c/openstack/nova/+/93232712:42
gibisean-k-mooney: ^^ revert to stabilize CI12:43
sean-k-mooneygibi: ah yep thanks i ment to do that for two weeks now and have been putting it off12:43
sean-k-mooneyit fiail infrequetly but it will increase as more raxflex host come online12:43
gibiI got hit by it twice in a row so got enough motivation to at least push the revert12:43
sean-k-mooneydo we need another to +w i added +212:44
gibibauzas: could you put a +w to ^^12:44
gibiit is a revert for CI stablization12:44
bauzasokay12:44
sean-k-mooneyill be changign which jobs to what anyway when i get time to work on this again12:45
gibithanks12:45
tkajinamsean-k-mooney, yeah that's the tool to pull placement tables from old nova db.12:55
*** haleyb is now known as haleyb|out16:42
opendevreviewMerged openstack/nova master: Revert "Test live migration between hosts with differnet cpu_shared_sets"  https://review.opendev.org/c/openstack/nova/+/93232717:19
melwittsean-k-mooney: you had mentioned that scatter_gather_cells is not currently waiting for timeout in the main thread but I thought it is? this is using the eventlet.timeout.Timeout context manager in the main thread right? https://github.com/openstack/nova/blob/741e1b38705efecc083f091c51164b3cafb9697c/nova/context.py#L43718:50
dansmithmelwitt: I think in uwsgi mode it can't since we're not running the eventlet hub right?19:07
dansmithIIRC that's what sean-k-mooney was referring to.. it "waits" but the timeout will never fire in fully uwsgi mode19:08
melwittdansmith: oh, ok I didn't realize that was why19:08
melwittI see. thanks19:08
sean-k-mooney[m]it was related to a comment gibi made22:11
sean-k-mooney ... peopel changed my topic22:14
sean-k-mooneymelwitt: i was refering to this comment thread https://review.opendev.org/c/openstack/nova/+/905284/comment/ea51fb75_6013c0cd/22:15
sean-k-mooneyim pretty sure that the entire query_wrapper can go away and the timeout logic can move as we only use it for one thing22:15
sean-k-mooneyits been a month since i looke at it but i think i was leaning towards puting the timeout here https://github.com/openstack/nova/blob/01b207e50d307a7e7650b5839535fbd3ad40bc1b/nova/context.py#L421 or https://github.com/openstack/nova/blob/01b207e50d307a7e7650b5839535fbd3ad40bc1b/nova/compute/multi_cell_list.py#L26222:17
melwittsean-k-mooney: I remember that thread but I didn't realize that today the timeout doesn't work because of no eventlet hub in full uwsgi mode22:17
sean-k-mooneyim not sure if that is true or not22:18
sean-k-mooneywhen run under uwsig today we are monkey patching so i think we have a hub but honestly i have not looked at that in a long time22:19
sean-k-mooneywe were not starting the hub at one point22:19
sean-k-mooneybut i think matt booth changed that as a bugfix?22:19
sean-k-mooneyhttps://github.com/openstack/nova/commit/3c5e2b0e9fac985294a949852bb8c83d4ed77e0422:20
melwittok well ... if the timeout context manager works then it seems to me the timeout is in the right place in today's code (in the main thread and kills the child threads if the timeout fires). and I'm not making an opinion about where the timeout should go as part of eventlet removal22:21
sean-k-mooneyso i htink the issue is the timeout fires on the main thread but does not kill the child threads22:22
sean-k-mooneyagain i dont currently have the context loaded form gibi's testing but i blieve we need the timeout to fire on the child thread?22:23
melwittI thought gibi was testing with native threads not eventlet?22:23
sean-k-mooneyyes but with this patch we are using native treads too22:24
melwittyeah, gibi's comment is about the code proposed in your patch22:24
sean-k-mooneyright that is what i was brignin up today22:24
melwittok, maybe I misunderstood what you said. I thought you said the timeout doesn't work at all today22:24
sean-k-mooneywhen we remove eventelte we need to ensure the timeout still works  for the treads in the tread pool22:25
sean-k-mooneybaisicly what we want to have happen is if the timout expires we want the thread to be killed and clean up by the timeout expction when resumed by the kernel22:26
melwittright. I had thought you were commenting on what's in tree right now. I misunderstood that you were talking about future stuff, my bad22:26
sean-k-mooneyby startign the timer in the child thread if we dont get io before it expires then the kernel shoud resume it whtn the timer interupt fires22:26
sean-k-mooneyno worries, ya i was talking about the unresolved comments in the current series22:27
melwittI agree the timer needs to be in the child thread with native threads22:27
melwittwe don't have the ability anyway to kill them from the parent22:27
melwitt*them = native threads22:27
melwittok, gotcha. sorry about that22:28
sean-k-mooneyright fi we used processes we could but because native thread could be holding locks (even if this is not) it would be unsafe to kill them form the parent even if we coudl do that without hacks22:28
melwittright22:29
sean-k-mooneythe other thing i brought up is https://github.com/openstack/nova/blob/master/nova/console/websocketproxy.py#L169-L17222:30
sean-k-mooneythe nova/cmd moduels for the proxy dont call eventlet but we do import that moduel here https://review.opendev.org/c/openstack/nova/+/904424/13/nova/cmd/standalone/baseproxy.py#2922:31
melwittah, yeah. that one I haven't known what it's for, none of the work I did on the proxy was related to it22:32
sean-k-mooneyso it was added by https://github.com/openstack/nova/commit/cb25bc4530323aaa33d5c42eb01f998d463f210622:32
sean-k-mooneywhich is apprently for https://bugs.launchpad.net/nova/+bug/119303122:33
melwittoh, huh.22:33
sean-k-mooneyThe solution is to either not monkey_patch for /cmd/nova-novncproxy and do the rpc call that validates the token with the consolauth service in a separate process that is monkey_patched (the rpc won't work without it), or to make openstack/common/rpc/amqp/multicall not use green-threads for this particular call (ie. block the caller).22:34
sean-k-mooneyso consolauth is dead22:34
sean-k-mooneyand we are trying to not monkeypatch22:34
sean-k-mooneyso maybe we just delete that and we are good?22:34
melwittdelete all of consoleauth? sure, what could go wrong22:35
sean-k-mooneyi ment the hub22:36
sean-k-mooneybut also didnt you remove the console auth service22:36
sean-k-mooneywe now do auth in the cell db22:36
melwittoh, the console auth service, yeah that is no longer used22:36
melwittfor some reason I was thinking the db call but yeah that doesn't use rpc22:37
sean-k-mooneywell even if it did we could deal with this differently22:38
melwittoh wait, there is an rpc call for "validating the console port22:38
melwitthttps://github.com/openstack/nova/blob/741e1b38705efecc083f091c51164b3cafb9697c/nova/console/websocketproxy.py#L13322:39
melwittthat's the only one AFAICT22:40
sean-k-mooney so we are changing the oslo messaging executor to threading22:40
sean-k-mooneyso im not sure it that will be a problem or not but ok this is somethign we need to test22:40
melwittok, yeah. I would think it should work fine if you delete the hub at the same time as the switch to native threads in oslo.messaging. I agree, definitely something to test22:42
sean-k-mooneythat is tomorrows problem o/22:43
melwittyes o/22:44
sean-k-mooney[m]so i reinstalled my home openstack and entirly redidi my sotrage to use nova lvm instead of cinder lvm, most of the speedu is form other change but i wend form about an 80 minute devstack install to about 21 minutes23:14
sean-k-mooney[m]im hoping with a liitlme more turning to shave another miniute or so but thats closer to the speed i would expect on this host23:15
sean-k-mooney[m]i used to reliable be able to get a fresh vm deployed in about 18-20 mins so that pretty close already23:15
sean-k-mooney[m]although there isnt much standing out as a bottle neck to be honest although i think i can speed up the disk io slightly more. beyond that i would likely need a local mirror/cache but the install time for package is already pretty good https://paste.opendev.org/show/825791/23:19

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!