Tuesday, 2023-01-31

clarkbmeeting time19:00
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue Jan 31 19:01:06 2023 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
ianwo/19:01
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/R7TJT7TMQICXIIWNGACKHQN4HB3ORQWF/ Our Agenda19:01
clarkb#topic Announcements19:01
clarkbThe service coordinator nomination period opened up today19:01
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/32BIEDDOWDUITX26NSNUSUB6GJYFHWWP/19:01
clarkbIt runs for ~2 weeks. I will send a followup email to ^ to remind everyone that today is the day19:01
clarkbThat was all I had for announcements. Let's dive in19:03
clarkb#topic Bastion Host Updates19:03
clarkbI've been terrible and not reviewed the backup script change :(19:03
clarkbI believe that is still outstanding19:03
clarkbIn better news the old bridge did get shutdown I believe. Thank you ianw for contining to move this along19:04
ianwyep, maybe give it another week and we can "server delete" it 19:04
clarkb++19:04
clarkbanything else bridge related? as far as I can tell we've moved onto the new bridge at this point and are happy with it19:05
clarkb#topic Mailman 319:07
ianwyep, i should get back to some of the parallel running work, remerge and re-evaluate19:07
ianw(sorry, that's it)19:07
clarkbsounds good19:07
clarkbfungi: I've seen you pushing on mailman 3 items and seem to have made progress? Want to catch us up?19:07
fungii managed to work out the configuration fiddling dance necessary to bootstrap the server into a steady configuration state that supports correct per-domain filtering of lists19:08
fungihowever, there are outstanding steps we need to perform to complete addition of the sites and domain mappings19:09
clarkbThis is the stuff we were just discussing in #opendev that may involve db migrations?19:09
fungithe greatest hurdle is that each list domain needs to be associated with a distinct django "site" and creating new django sites with automation is... nontrivial19:09
clarkbjust talking out loud here: we add domains infrequently enough that maybe "root approves new domain then uses django web ui to create the site" is fine?19:10
fungiapparently it's assumed you'll "just" visit the django admin webui and click the "add site" button which runs database migrations in the background to create and populate new tables and create new configs19:10
clarkbbut maybe its too soon to give up19:10
fungiin order to do that from the django-admin cli you need to make new migrations for each site and run them19:11
fungii'm sure for someone well-versed in django-based applications this is easy, but i'm struggling to wrap my head around it still19:11
fungipart of the problem with doing that part manually is order of operations. if we want a new mailman domain to be automatically associated with a site, we need to create the site before creating the domain before adding mailing lists...19:12
clarkbright considering that the value we get from mm3 is pretty high I'm ok punting on automating this specific bit if it gets us moving. But I'll let you decide when you've spent enough time trying to automate it first19:13
clarkboh :(19:13
fungiwe have a lot of automation that would need to be blocked to prevent it running before we do the manual step19:13
fungior else we do additional manual steps to move the domain to the new site after creating it19:13
clarkbya considering that I guess we should try to figure out the migrations process if possible19:14
clarkbotherwise we'd have to break up new domains into two changes and manually create sites between?19:14
clarkbdoable, but annoying/complicated too19:14
fungiit's probably quite similar to some of the commands ansible is already running in playbooks/roles/mailman3/tasks/main.yaml19:15
fungidjango does have a cli to "makemigrations" and so on19:15
fungiso it seems like a lot of this is automatable if i can wrap my head around the terminology19:15
fungi#link https://docs.djangoproject.com/en/4.1/topics/migrations/19:16
ianwis https://docs.djangoproject.com/en/4.1/ref/contrib/sites/#enabling-the-sites-framework the same thing?19:17
clarkbhaving the held nodes definitely helped me when I did the initial bootstrapping19:17
clarkbianw: yes19:17
clarkbfungi: the other thing I wanted to mention is that the services did get restarted which should've fixed our site owner email addr. But we still have a broken root alias for receiging mail at at that address? I guess fixing one thing at a time19:19
fungioh, right, i think i tracked that down to a missing bit in the host vars19:20
fungiwe failed to include the alias stuff that we normally put on our other servers19:21
clarkbthat would do it19:21
clarkbfungi: is there a change for that yet?19:21
fungino, it sounded like frickler was going to push a fix, i thought, but i can do it i just need to remind myself what specifically needs adding19:22
clarkbmostly wanted to ensure I wasn't holding things up by missing a change to review19:22
clarkbthanks! anything else on this topic?19:22
fungimmm, actually it looks like the necessary bit is already there in lists01.opendev.org.yaml19:24
funginope, nothing else for now19:24
clarkb#topic Git updates on docker images19:24
clarkbUpstream debian has patched git which means we don't need our manually patched packages anymore. In fact due to version differences apt complains about this in some situations too.19:25
clarkb#link https://review.opendev.org/q/topic:flip-to-distro-git+status:open19:25
clarkbI pushed changes to our images to unwind that and flip back to distro packges.19:25
clarkbAt this point gitea and gerrit are outstanding. ianw took care of zuul and nodepool too so by the weekend hopefully we'll have put all of this in the rear view mirror19:26
clarkbMostly a heads up that those changes need reviews. The gitea update should auto deploy and the gerritone will need a manual gerrit restart19:26
ianwthanks for the work making the packages etc!19:26
clarkbI did check if the base python images have updated git yet and they have not (did this a few hours ago)19:26
clarkbbut I think once the base python images do update we should update our opendev base python images19:27
clarkbI'll try to check on that periodically and push a base image update change once that is in place19:27
clarkb#topic Gerrit updates19:27
clarkbI have a handful of changes out there for various Gerrit updates19:28
clarkb#link https://review.opendev.org/c/opendev/system-config/+/872238 Fixup Gerrit Submit Requirements usage19:28
clarkbThis one is straightfowrard update to our gerrit testing and use of submit requirements as part of the 3.6 -> 3.7 transition prep19:28
clarkbMy emails to the repo discuss list didn't get any responders and I ended up digging into the source code and think I decided that pushing MaxWithBlock explicitly is not allowed, but if you don't set a function value then it is still used via the default :/19:29
clarkbAnyway that addresses this by explicitly setting Noop to ensure we're exercising submit requirements (and the screenshots seem to show this is working too)19:29
clarkb#link https://review.opendev.org/c/opendev/system-config/+/870114 Add Gerrit 3.6 -> 3.7 Upgrade test job19:30
clarkbthis adds the upgrade job. ianw I did respond to your comments and basically punted on trying to fully automate the upgrade process. I think doing that isn't a bad idea but maybe out of scope for this change (basically good improvement for later)19:30
ianw++ i'll go back over those two19:31
clarkb#link https://review.opendev.org/c/openstack/project-config/+/867931 Cleaning up deprecated copy conditions in project ACLs19:31
clarkbthis is ianw's which i asked we not merge until after we announced it19:31
clarkbjust beacuse potential is there for behavior changes. But I think it is ready to go when we are ready to announce it and do it19:32
clarkbThat leaves the two "scary" changes19:32
clarkb#link https://review.opendev.org/c/opendev/system-config/+/870874 Convert Gerrit to run on our base python image19:32
clarkbThis swaps out the docker openjdk image for debian bullseye openjdk packages on top of our python base image19:32
clarkbalso I've just realized this change needs a new ps to explicitly install git to ensure that it is up to date19:33
clarkbIn theory this is largely a noop for us other than that the python version goes from debian package to version specific compile for the image and the openjdk is switched to the distro openjdk19:34
clarkbI'll push a new ps to this after the meeting to address the git install19:34
clarkbCareful review is always appreciated (our CI helps a lot too!)19:34
clarkb#link https://review.opendev.org/c/opendev/system-config/+/870877 Run Gerrit under Java 1719:34
clarkband this is the last one. I think this one deserves the most careful consideration since java version changes can be a big deal. In particular I've had to implement a workaround for a bug despite gerrit saying java 17 is fully compatible (it isn't)19:35
clarkbwe should think about whether or not we can live with that workaround (both because it is clunky as we have to execute java a specific way anytime we invoke gerrit and because there may be security/runtime implications?)19:35
clarkbThis email hasn't gotten any movement upstream either unfortunately.19:36
clarkbreviews much appreciated.19:36
clarkb#topic Linaro Cloud Migration19:36
clarkbWe are running jobs on the new linaro cloud (thank you ianw and kevinz!)19:36
clarkbwe should be off of the old cloud with our nodepool configs and we told Ed at equinix the underlying systems could go away at the end of this month (today)19:37
clarkbone thing I noticed is that we're still checking the ssl cert for the old cloud's api endpoint. ianw  should we update that to be the new cloud's endpoint?19:37
ianwummm, yes.  i have a change out that might fix that 19:38
ianwi think it's in 19:38
ianw#link https://review.opendev.org/c/opendev/system-config/+/87167219:38
ianwwhich removes all of the cloud bits19:38
clarkbexcellent19:40
clarkbThe other bit you wanted to discuss was how involved we wanted to be in hte management of this cloud19:40
clarkbI selfishly think the less the better :) but I have no objections to us having that access if it is useful (and it already has been)19:41
ianwyeah i'm not sure we want to run base.yaml against it ... but we could19:42
ianwif we're going to have access, we should probably at least have bridge deploying our keys?19:42
fungiwe did that for some cloud previously (not infracloud, more recent). which one was it? inmotion? older?19:44
clarkbinmotion I did manually19:45
clarkbbut a good idea for both I guess19:46
ianwjust not sure if we add to inventory if the base setup will take it over in weird ways, it might do something that kolla doesn't like19:46
ianwbut i can check it out if we're ok with basically including it19:47
clarkbianw: maybe we can add a new group that is basically a subset of base19:47
clarkbI would worry about the firewall rule changes in particular. THose are likely to break openstack19:47
clarkbso ya we should limit it to users if we can19:48
ianwyeah, ok i'll take a look19:48
clarkbsound sgood thanks19:48
clarkb#topic SqlAlchemy 2.019:49
clarkblast week zzzeek released sqlalachemy 2.0. Unfortunately this version of sqlalchemy is not backward compatible with 1.x19:50
clarkbthe good news is that there are good docs about it and zzzeek has been responsive to questions I've asked about this.19:50
clarkbI know that Zuul and lodgeit are affected. Zuul I've been working on fixing. Lodgeit I don't know that anything has been done yet but we should probably push a sqlalchemy pin change for lodgeit19:50
clarkbI wanted to bring this up so people are aware of the issues and also to double check there aren't any other services relying on sqlalchemy19:51
clarkbI guess thats it for sqlalchemy using projects?19:53
clarkb#topic Server Upgrades19:53
ianwoh wow, yes lodgeit would be a pin19:53
clarkbMaybe no surprise but I haven't made progress here. Too many things19:53
clarkb#topic Quo vadis Storyboard19:53
clarkbsame story here.19:54
clarkbSorry for rushing along but we are almost at time and I wanted to open things up to ensure we aren't overlooking anything19:54
clarkb#topic Open Discussion19:54
ianwlodgeit is so far behind on everything, this is another thing :/19:54
ianwi guess there's an element of "if it ain't broke" ... it takes some input and saves it, so maybe it can just live in it's container forever more19:55
fungii fixed the missing root alias for lists01, no gerrit change needed. apparently it's set as a private hostvar on bridge, and we missed copying it from the old listservers. that's done now, so the next periodic run should update /etc/aliases accordingly19:55
clarkbfungi: aha19:55
clarkbfungi: what is the var name? I'm curious to grep for it and see where it is used19:55
fungilistadmins19:55
fungithe listservs define their own custom aliases lists and have a generator in there to create a root: alias, but was defaulting to empty on the new server and so the /etc/aliases template was skipping it since there was no value for the "root" key19:57
clarkbgotcha19:57
fungialso i just pushed a new revision of the donor logos addition addressing your layout comment19:57
fungi#link https://review.opendev.org/869091 Feature our cloud donors on opendev.org19:57
clarkbcool I'll try to review the linaro cloud cleanup change and the donors change after lunch.19:58
ianwfungi: speaking of both, i don't think i saw linaro on that list?19:58
fungiwe should add them. they're not currently on the sets of logos on openstack.org or openinfra.dev either19:59
ianwa linaro logo anyway.  although i'm not sure between equinix/linaro who is actually in charge :)19:59
fungii copied the current sets but left iweb out19:59
fungi(it needs to be cleaned up in the old locations too)19:59
clarkbI think we've not done that because the total numebr of nodes is small. ut consdering the effort to make this happen I think that is fine20:00
clarkband we are at time.20:00
clarkbTHank you everyone20:00
fungithanks clarkb!20:00
clarkbWe'll be back here same time and place next week20:00
clarkb#endmeeting20:00
opendevmeetMeeting ended Tue Jan 31 20:00:37 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)20:00
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2023/infra.2023-01-31-19.01.html20:00
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2023/infra.2023-01-31-19.01.txt20:00
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2023/infra.2023-01-31-19.01.log.html20:00

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!