Tuesday, 2023-05-09

*** diablo_rojo is now known as Guest92409:16
clarkbjust about meeting time18:59
fungiso it is18:59
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue May  9 19:01:02 2023 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/BQ5T6VULIAYPCN6LPWSEMA4XITIXTSZB/ Our Agenda19:01
clarkbI didn't hvae any announcements. We can dive right into our topics19:01
clarkb#topic Migrating to Quay19:01
clarkbA bunch of progress has been made on this since ~Friday19:01
clarkb#link ttps://etherpad.opendev.org/p/opendev-quay-migration-2023 Plan/TODO list19:01
clarkbI wrote this plan / todo list document to keep everything straight19:01
clarkbsince then I've pushed like 25 changes and a good number of them have landed. At this point I think about 12/34 imgaes are being pushed and pulled from quay.io19:02
clarkbThat said late yesterday I realized we were pushing change tagsto quay.io which we didn't want to do19:02
clarkbThere are two potential fixes for this19:02
clarkb#link https://review.opendev.org/c/opendev/system-config/+/882628 Don't run the upload role at all19:02
clarkbIn this one we modify our jobs to not run the upload role in the gate at all (the upload role in the gate is what is pushing those change tags)19:03
clarkb#link https://review.opendev.org/c/zuul/zuul-jobs/+/882724 Upload role skip the push19:03
clarkbIn this option we modify the upload role to check the flag for promotion behavior and only push if the one that needs the push is selected19:03
clarkbcorvus: likes 882724 more. I do as well as it simplifies reorganizing all of the jobs as we go through this process. The changes to swap image publication over to quay.io are pretty minimal with this appraoch which is nice19:04
clarkbIf you have time to review both of these changes and weigh in that would be great. If we get one of them landed I can continue with my progress in migrating things (depending on which choice we choose I will need to do different things so don't want to move ahead right now)19:04
ianwthe only thing about that is all the documentation now says "use build in check and gate" and then promote for the IR path19:04
ianwexcept then the main example of it doesn't do that :)19:05
clarkbianw: ya and I think we could possibly switch it over to that after the migration too. I'm mostly concerned that the amount of stuff to change to swap the images increases quite a bit if we go that route now. But both appraoches should work and I'm only preferring one for the amount of work it creates :)19:05
clarkbbut ya if we prefer the more explicit approach thats fine. I will just need to modify some changes and land the fixp change that ianw started19:06
corvusi think things changed once we accepted the idea of being able to switch the promote mechanism with a variable19:06
corvusi'm not sure it makes as much sense to switch that with a variable and then have to change the job structure too...19:07
clarkbya though it may be confusing to have two different jobs that behave identically if the flag is set19:08
clarkbI can really go either way on this.19:08
clarkbbut making a decisions between one approach and the other is the next step in migrating so that we don't create more of a cleanup backlog. I can also do the cleanup once this is sorted out. Its only a few extra tags that can be removed by hand19:09
corvusarguably, we should probably not have the role switch but instead have the job switch which role it includes... but this is sprawling and complex enough that putting the logic in the role seems like a reasonable concession.19:09
clarkbmaybe ianw and fungi can weigh in on the two changes and we can take it from there?19:10
fungisounds good19:10
clarkbI'll use grafyaml's image as the confirmation whatever choice we make is functional19:10
clarkbit is next in my list19:10
clarkbanything else related to the container images moving to quay?19:11
clarkb#topic Bridge Updates19:13
clarkb#link https://review.opendev.org/q/topic:bridge-backups19:13
clarkbThis topic could still use an additional infra-root to sanity check it19:13
clarkbOther than that I'm not aware of any changes to bridge. It seems to be happy19:13
clarkb#topic Mailman 319:14
clarkbfungi: looks like you have some updates on this one19:14
funginothing exciting yet. new held node at 23.253.160.97 and the list filtering per-domain is working correctly by default, but there is a singular list domain name which gets displayed in the corner of the hyperkitty archive pages19:15
fungithink i've figured out how that's piped in through the settings, though it also impacts the domain name used for admin communications the server sends out19:16
clarkbyes I don't think that is going to be configurable unfortunately19:16
fungithe current stack of changes tries to turn that default domain name into one which isn't one of the list domains, but i missed that setting i just found so it needs another revision19:16
clarkbor at least not with current mailman 3. It has all of the info it needs to properly configure that though it would require updates to mailman to do so19:16
fungiwell, or it needs more advanced django site partitioning, below the mailman services layer19:17
clarkbI think that template is in mailman itself19:17
fungipostorius and hyperkitty are just delegating to django for that stuff, and this is bubbling up from django site info in settings and database19:18
fungiso creating the sites in django and having per-site settings files would probably work around it19:18
fungibut yes, using a singular settings.py is part of the problem, i think19:19
clarkbmailman/hyperkitty/hyperkitty/templates/hyperkitty/*.html fwiw19:19
fungiwe could possibly just hide that string displayed in the webui19:19
fungito reduce confusion. though i still need to check whether it does the right thing with headers in messages19:19
fungiwhich, with the held node, means fishing deferred copies out of the exim queue19:20
fungi(and injecting them locally to start with)19:20
fungihopefully i'll get to that shortly. i'm coming out of a dark tunnel of other work finally19:21
fungianyway, i didn't have any real updates on it this week19:21
clarkblet us know if/when we can help or review things19:21
clarkb#topic Gerrit Updates19:21
fungiwill do19:21
clarkb#link https://review.opendev.org/c/openstack/project-config/+/882075 last little bit of ACL cleanup work19:21
clarkbThis change (and its child) are unmerged due to documentation/ease of use concerns. I like the suggestion of having a tox target for it19:22
clarkbsince we expect people to be able to do that generally to run local tooling19:22
clarkbNot urgent but also something we can probably clear out of the backlog quickly if that works for others19:23
fungiwould be easy to add a tox testenv to run that command in a followup change19:24
fungii'll try to knock that out really quick19:24
clarkbthanks!19:24
clarkbfor replication tasks leaking I haven't seen any movement on the issues I filed. I'm || that close to creating a proper discord account to join the server to try and have a conversation about it (I think the matrix bridge may be losing messages and they are discussing using discord for the community meeting next month after May's was effectively cancelled due to no googler19:25
clarkbbeing present to let people into the google meet...)19:25
clarkbAssuming I can get through some higher priority itmes I can also spin up a gerrit dev env again and try to fix it myself.19:25
clarkbThe good news is the growth isn't insane. Currently just less than 16k files19:26
clarkb#link https://review.opendev.org/c/opendev/system-config/+/880672 Dealing with leaked replication tasks on disk19:26
clarkbhappy for feedback on my poor attempt at working around this locally too19:26
clarkbAnd finally for Gerrit we should plan to restart the server on the new image once we move things to quay. That will also pick up the theming changes that were done for 3.8 upgrade prep19:26
clarkb#topic Upgrading older servers19:27
clarkbWe've upgraded all of the servers we had been working on and need to pick up some new ones19:28
clarkb#link https://etherpad.opendev.org/p/opendev-bionic-server-upgrades Notes19:28
clarkbmirror nodes, registry, meetpad, etc could all be done.19:28
clarkbI'll be looking at this once quay is off my plate. Help very much welcome if you have time to do some as well19:28
clarkbThis also ties into the docker-compose stuff we hit last week19:29
clarkbbasicall if we can get all the container based services running on focal or newer it ooks like we can pretty safely switch from pip docker-compose to distro docker compose19:30
* tonyb has some time to look at migrating some of the older servers19:30
clarkbthereis also a newer docker compose tool written in go that we can switch to but it changes container names and other things so we need to be more careful with this one19:30
clarkbtonyb: thanks! we can sync up on that and go through it. But one of the first thigns that can be done is updating the CI for a service to run against focal or jammy (jammy would be preferred) and ensure everything is working there. Then an infra root can deploy a new server and add it to inventory19:31
clarkbtonyb: you should be able to do everything up to the point of deploying the replacement server and adding it to inventory19:31
tonybclarkb: sounds good19:32
clarkbtonyb: system-config/zuul.d/system-config-run.yaml is the interesting file to start on that as it defines the nodesets for each service under testing19:32
clarkb#topic OpenAFS disk utilization19:32
clarkbin unexpected news utilization is down slightly from last week19:33
clarkbthis is a good thing. It isn't drastic but it is noticeable on the grafana dashboard19:33
clarkbI also started the discussion about winding down fedora. Either just the mirrors for the disro or the test images entirely19:33
tonybIs there a timeline we'd like to hit to begin winding things down?19:34
clarkbSo far I haven't seen any objections to removing the mirrors. Some libvirt and fedora folks are interested in keeping the images to help ensure openstack works with fedora and new virtualiation stuff. But new virtualization stuff is being built for centos stream as well so less important I think19:34
clarkbtonyb: I think I'll give the thread a bump later today and maybe give it anothe rweek for feedback just to be usre anyone with an important use case and/or willingness to help hasn't missed it. But then I suspect we can start next week if nothing on the feedback changes19:35
clarkbMy main concern is moving too quickly and someone missing the discussion. 2 weeks seems like plenty to avoid that problem19:36
tonybOkay.  I have it on my TODO for today to raise awareness inside RH, as well19:37
clarkbI think even if we just drop the mirrors that would be a big win on the opendev side19:37
fungialso if libvirt/fedora folks find it useful, then they clearly haven't been taking advantage of it for a while given it's not even the latest fedora any longer19:37
clarkbrocky seems to do well enough without "local" mirrors since we don't run a ton of jobs on it and I think fedora is in a similar situation19:37
clarkbbut ya given the current situation I think we can probably go as far as removal. But we'll see where the feedback takes us19:38
tonybIt can be a staged thing right19:38
clarkbtonyb: yes we could start with mirror removal first as well. That won't solve fedora being 2 releases behind though19:38
tonybwe can cleanup the mirrors and then work on winding up Fedora and/or focusing on stream19:38
clarkbstep one would be configuring jobs to not use the mirrors, then delete the mirrors, then $somethingelse19:39
clarkbyup19:39
tonybokay, got it19:39
fungiif the objections to removing fedora images are really because they provide newer $whatever then i don't mind keeping it around, but that implies actually having newest fedora which we don't, and nobody but us seems to have even noticed that19:39
clarkbwe will free up 400 GB of disk doing that or about 10%19:40
clarkbwill have a big impact19:40
tonybfungi: Yup I agree, seems like a theoretical objection19:41
clarkb#topic Quo vadis Storyboard19:41
clarkbThere continues to be a steady trickle of projects moving off of storyboard19:41
clarkbI haven't seen evidence of collaboration around tooling for that. I think the bulk of moves are just creating a line in the sand and switching19:42
clarkbwhich is fine I guess19:42
funginothing new to report on my end, but i have been making a point of switching projects to inactive and updating their descriptions to link to their new bug trackers. if anyone spots a "move off sb" change in project-config please make sure it's come to my attention19:42
clarkbcan do!19:42
fungii comment in them once i've done any relevant post-deploy cleanup19:43
fungijust for tracking purposes19:43
clarkb#topic Open Discussion19:44
clarkbThe wiki cert will need to be renewed. Historically I've done that with about 7 days remaining on it.19:44
clarkbApologies for the email it generates until then19:44
fungithanks for handling that19:45
clarkbit is our last remaining non le cert :/19:46
clarkbbut one cert a year isn't so bad19:46
clarkbLast call for anything else19:47
clarkbThank you everyone. genekuo tonyb feel free to ping me directly if I'm not reviewing things or if you have questions about where ou can help. I'm definitely feeling like I've got too many things to think about at once right now and I appreciate the help you have offered so don't want you to feel ignored19:49
clarkband with that I think we can end the meeting a little early19:49
clarkbthanks again!19:49
clarkb#endmeeting19:49
opendevmeetMeeting ended Tue May  9 19:49:53 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:49
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2023/infra.2023-05-09-19.01.html19:49
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2023/infra.2023-05-09-19.01.txt19:49
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2023/infra.2023-05-09-19.01.log.html19:49
fungithanks clarkb!19:50
tonybThanks everyone19:50

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!