Tuesday, 2023-11-07

clarkbalmost meeting time18:59
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue Nov  7 19:01:11 2023 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/MRP4DFT7DBT56U56R6LCFHG7X36SS554/ Our Agenda19:01
clarkb#topic Announcements19:01
clarkbI believe that the majority (all?) of us have had DST start or end over the last month. Double check your meeting times :)19:01
clarkbRelated to that the OpenInfra Foundation Board meeting for November will start in 2 hours19:01
clarkbalso I'll be AFK November 10-13 (that's Friday and Monday on both ends of the weekend)19:03
clarkb#topic Mailman 319:03
clarkbAll lists are now hosted on mailman 3, the mailman 3 services are upgraded to their latest versions, and the old mailman2 servers have been deleted19:04
clarkbWe're just about done with this item (thank you fungi!), but there was a django template parsing error during the upgrade we need to run down as we thought that was corrected19:04
clarkbhttps://paste.opendev.org/show/bc7jfeZCt97fZm0dCPKw/ is the paste of that I pulled out of logs when the upgrade occurred19:04
fungiyeah, i need to check whether those show up in the log in zuul19:05
clarkbit doesn't appear to be fatal (probably because we aren't relying on social media logins or similar functionality in django) so I think the bulk of the issue here is to understand why this happened19:05
fungialso whether it was only during the initial restart or whether it recurs19:05
clarkb++19:05
clarkbI think we can probably drop this off of next week's agenda19:06
fungiagreed19:07
clarkbthanks again for getting this over the finish line fungi 19:07
tonyb#GreatSuccess19:07
fungii just hope it keeps working19:07
fungikeep an ear to the ground for people talking about delivery issues19:07
clarkb#topic Upgrading Servers19:08
clarkbtonyb has started pushing on this for mirrors.19:08
clarkbAnd I think he has started investigating meetpad servers19:08
clarkbtonyb: any concerns or items that need review etc?19:08
tonybNope.  I think the mirror servers are ready to launch new versions.  I'm assuming that's paused due to 90022019:09
tonybI think meetpad will be pretty quick19:09
tonybafter that it's just the hard ones, cacti, wiki, translate and storyboard19:10
clarkbyup I think we go through the 900220 stuff and use this all as a good learning experience19:10
clarkblet's move on. We'll discuss 900220 shortly19:10
clarkb#topic Python container updates19:10
clarkbEverything is running python3.11 except for zuul-operator19:10
clarkbThe reason for that is zuul-operator's k8s jobs haven't been working19:11
clarkbdpawlik was poking at it and details ended up in https://review.opendev.org/c/zuul/zuul-operator/+/881245 and its depends-on changes19:11
clarkbWe don't use the zuul-operator so I don't have a ton of context for this stuff. Despite that I've been meaning to try and page it in just haven't had time19:11
tonybI can work with dpawlik to get that all finished.19:12
fungijust in time to start talking about 3.12 ;)19:12
tonyb\o/19:12
clarkbI think the short version is that something with the way k8s is deployed there causes the operator to not function. dpawlik's changes address the k8s issues and now there is maybe a problem in zuul-operator itself that needs fixing19:12
clarkbbut ya start from that change and its depends-on changes and once we can get it green then we should be good to land changes that update the python version for zuul-operator as well19:13
tonyb++19:13
clarkb#topic Gitea 1.2119:13
clarkbI've left this item on the agenda because each week I think "this is the week there will be a release and changelog we can discuss"19:13
clarkbunfortunately this week is not that week19:14
clarkbI saw a message from one of the gitea maintainers on discord/matrix saying that the main release blocker at this point is the blog post. I think this must include writing up a change log because the change log doesn't exist yet19:14
fungichangelog-as-an-afterthought always baffles me19:14
clarkbmaybe next week will be the week :)19:15
clarkb#topic Gerrit 3.8 Planning19:15
clarkb#link https://etherpad.opendev.org/p/gerrit-upgrade-3.819:15
clarkbif others could look over that etherpad I think it is ready for review19:16
clarkbOtherwise I think we are about as ready as we can be. We got the commentlink update in and restarted Gerrit 3.7 to ensure that is working as expected. The downgrade back to 3.7 is tested and the only issue we've found so far is related to a plugin bug in a plugin we don't use19:16
fungi898989 isn't marked as done, should be though, yeah?19:17
fungiwe restarted onto it and manually tested19:17
clarkbyup marked as done now19:17
fungiawesome, just wanted to be sure there wasn't anything outstanding there19:18
clarkbas far as gerrit upgrades go this one seems to be an easy one (I've just jinxed it)19:18
fungiuncool man19:18
clarkbfeel free to review the change log as well to make sure I didn't miss anything19:18
clarkbbut I tried to put the important bits in the etherpad19:19
fungiyeah, seems to me like we're ready for maintenance day19:19
fungi~1.5 weeks out?19:19
clarkbwhich as a reminder is November 17, 2023 at 15:30 UTC19:19
fungijust shy of 10 days now19:19
clarkbI actually failed to remember that I would be on standard time for that day so 15:30 UTC is a bit early for me19:19
clarkbbut I'll be fine, just get up a little early 19:20
fungi07:30 pst i guess19:20
clarkbyup19:20
* tonyb will be around for the morning FWIW19:20
clarkbI thought it was 8:30 am19:20
fungii'm happy to run the maintenance if you want to focus on getting your tea steeped19:20
clarkbtonyb: awesome19:21
tonybDST strikes again19:21
clarkbfungi: cool we can decide when we get closer to the day of19:21
fungiwfm19:21
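The local-time arithmetic discussed above for the maintenance window can be checked with a small sketch. It assumes fixed offsets (PST as UTC-8, PDT as UTC-7) rather than a real time zone database, and the helper name `local_time` is purely illustrative.

```python
from datetime import datetime, timedelta, timezone

# Assumed fixed offsets: PST is UTC-8 (standard time), PDT is UTC-7 (daylight time).
PST = timezone(timedelta(hours=-8), "PST")
PDT = timezone(timedelta(hours=-7), "PDT")

# The maintenance start quoted in the meeting: November 17, 2023 at 15:30 UTC.
maintenance_utc = datetime(2023, 11, 17, 15, 30, tzinfo=timezone.utc)


def local_time(moment: datetime, tz: timezone) -> str:
    """Render an aware datetime as HH:MM plus the zone abbreviation."""
    return moment.astimezone(tz).strftime("%H:%M %Z")


if __name__ == "__main__":
    print(local_time(maintenance_utc, PST))  # standard time, in effect on Nov 17
    print(local_time(maintenance_utc, PDT))  # what the same instant would be under DST
```

The one-hour gap between the two results is the difference between the 07:30 and 8:30 figures mentioned in the discussion.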
clarkb#topic Adding tonyb to infra-root19:22
fungirocketship emoji19:22
clarkbwe've had discussions about this outside of the meeting, but tonyb is willing to be added to infra-root and help us out with even more stuff :)19:22
clarkb#link https://review.opendev.org/c/opendev/system-config/+/900220 Will make it official19:22
clarkbthank you tonyb!19:22
tonybThank you all19:23
corvusyay!19:23
tonybI understand the level of trust that's being shown here19:23
tonybI appreciate that19:23
corvustry not to give away the homeworld19:23
fungifeel free to pester me for access to things as you find you're missing something (we don't really have a checklist of everything)19:23
clarkbthe "plan" I've got here is we can approve this change after the meeting. Then I need to edit gerrit groups and some other things. Maybe tomorrow and/or thursday we can meet up and work through things like server boots and adding a gerrit admin account and so on19:24
tonybfungi: will do.  It will be a slow process as my "comfort zone" increases.19:24
fungiyeah, there's no need to ask for access to stuff until you're ready to do something with it anyway19:24
tonybSounds good.19:24
corvusit doesn't come up that often, because it doesn't change that often, but i do think a lot of the docs are mostly current: https://docs.opendev.org/opendev/system-config/latest/sysadmin.html#root-only-information19:25
clarkbyup I mostly want to make sure we've given a reasonable base line of access so that you aren't in a weird spot of not being able to say approve changes but can ssh into things19:25
tonybIf you want to do that via meetpad or similar I can make sure I'm in a quiet place19:25
clarkbtonyb: ya I was thinking a call like that then we can use shared screen sessions (gnu screen) to share context19:25
tonybperfect19:25
fungialso for stuff like the upcoming gerrit upgrade maintenance we explicitly start a screen session on the server so that other sysadmins can observe or participate as needed19:26
fungi(you'll see it called out in the maintenance plan)19:27
tonybCool.  I'll have to page in my gnu screen keybindings etc19:27
tonybI recently "switched" to tmux/tmate19:27
clarkbfor a long time we used screen because not all the systems (there were old centos systems for cgit) had tmux19:28
clarkband then we never switched19:28
fungii've been using tmux personally for a decade or more, but still fall back on screen for some stuff it does better19:28
fungithese days though, about the only thing screen does better is connect to serial lines19:28
clarkbI've got a usb to rs232 cable I use with screen :)19:28
tonybYeah that was the only thing I really noticed19:29
fungibingo19:29
clarkb#topic Open Discussion19:29
clarkbThat was it for the posted agenda, is there anything else?19:29
fungireminder that there's an openinfra foundation board of directors meeting in 2.5 hours19:29
fungi#link https://board.openinfra.dev/en/meetings/2023-11-0719:30
clarkb1.5 I think19:30
fungi1.5, yep, i can't count19:30
fungi21:00 utc19:30
clarkbyup I've got lunch then that consuming my next 3.5 hours or whatever the scheduled time is19:30
fungispoiler: the budget discussion will probably have nice things to say about our work19:30
tonyb\o/19:31
fungialso there's discussion of upcoming bylaws changes, updating the diversity and inclusion wg's charter, and use of ai in code contributions19:31
fungisomething for everyone19:31
tonybooo that could be fun19:31
clarkbyup I think it will be one where there is a lot of interesting content which isn't always the case19:31
fungias long as you can make it through the first 15 minutes of rollcall19:32
tonybLOL19:32
corvusoh hi19:33
fungiheh19:33
corvusjust a heads up that we merged a nodepool change that is having a small performance impact19:33
clarkbcorvus: is this the ssh keyscanning state machine change?19:33
corvusyep19:33
clarkbI keep meaning to look at what motivated that19:34
corvusi don't think it's user-visible, but i did notice some extra time-to-ready19:34
corvusand some extra launch retries19:34
corvusi have a fix up19:34
clarkbseems like scan in a loop until good or timeout doesn't really need a proper state machine :)19:34
corvusclarkb: parallelization19:34
corvuswe could only do 10 before; get 10 slow machines booting and everything stops19:34
clarkbah is that the size of our threadpool?19:35
corvusyep.  and increasing thread pool workers was :( because it would 2x the threads thanks to paramiko19:35
fungiyeah, i guess you want to be able to have fewer active loops than node requests19:35
corvusso now it's N+1 instead of 2N19:35
corvusthreads19:35
clarkbgot it19:36
fungipolling state machine architecture takes me back to my mud coding days19:36
corvusanyway, i don't think we need to revert or anything, and i'll be monitoring it.  but wanted to bring it up so folks are aware.19:37
clarkbthanks. I'll try to review that change (as well as rereview that one zuul error handling change) this afternoon either during or after the board meeting19:37
corvuscool, thx :)19:37
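The idea discussed above, polling many key scans from one loop instead of tying up a worker thread per blocking scan, can be illustrated with a minimal sketch. This is not nodepool's implementation: the class `KeyscanStateMachine`, the functions `run_all` and `make_probe`, and the simulated probe are all hypothetical, and no real SSH or paramiko calls are made.

```python
import enum
from typing import Callable, Dict, List


class State(enum.Enum):
    SCANNING = "scanning"
    READY = "ready"
    FAILED = "failed"


class KeyscanStateMachine:
    """Tracks one node's key scan; advance() does one step and never blocks."""

    def __init__(self, node: str, probe: Callable[[str], bool], max_attempts: int):
        self.node = node
        self.probe = probe  # hypothetical non-blocking "is the host key available?" check
        self.max_attempts = max_attempts
        self.attempts = 0
        self.state = State.SCANNING

    def advance(self) -> None:
        if self.state is not State.SCANNING:
            return
        self.attempts += 1
        if self.probe(self.node):
            self.state = State.READY
        elif self.attempts >= self.max_attempts:
            self.state = State.FAILED  # stands in for a timeout


def run_all(machines: List[KeyscanStateMachine]) -> Dict[str, State]:
    """Poll every machine from a single loop until each is READY or FAILED.

    A real loop would sleep between passes; that is omitted here so the
    sketch stays deterministic.
    """
    pending = list(machines)
    while pending:
        for machine in pending:
            machine.advance()
        pending = [m for m in pending if m.state is State.SCANNING]
    return {m.node: m.state for m in machines}


def make_probe(ready_after: Dict[str, int]) -> Callable[[str], bool]:
    """Simulated probe: a node answers once it has been polled enough times."""
    calls: Dict[str, int] = {}

    def probe(node: str) -> bool:
        calls[node] = calls.get(node, 0) + 1
        return calls[node] >= ready_after[node]

    return probe


if __name__ == "__main__":
    # Twelve slow nodes, one fast node, and one that never answers.
    ready_after = {f"slow-{i}": 5 for i in range(12)}
    ready_after.update({"fast": 1, "dead": 100})
    probe = make_probe(ready_after)
    machines = [KeyscanStateMachine(n, probe, max_attempts=10) for n in ready_after]
    results = run_all(machines)
    print(results["fast"], results["slow-0"], results["dead"])
```

Because each pass only takes one step per node, the fast node finishes on the first pass even though twelve slow nodes are in flight, which is the situation a ten-worker pool of blocking scans cannot handle.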
clarkbsounds like that may be everything. I'm going to hit +A on 900220 then go find lunch19:38
clarkbthank you for your time today everyone and for all the help running these services19:38
fungithanks!19:38
clarkb#endmeeting19:38
opendevmeetMeeting ended Tue Nov  7 19:38:48 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:38
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2023/infra.2023-11-07-19.01.html19:38
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2023/infra.2023-11-07-19.01.txt19:38
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2023/infra.2023-11-07-19.01.log.html19:38
tonybThanks everyone19:39
