Tuesday, 2021-08-31

clarkbAnyone else here for the meeting? Apologies for getting the agenda out late today19:00
fungiyup19:00
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue Aug 31 19:01:25 2021 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
corvuso/19:01
clarkb#link http://lists.opendev.org/pipermail/service-discuss/2021-August/000279.html Our Agenda19:01
clarkb#topic Announcements19:02
clarkbThe openstack feature freeze has begun. Keep that in mind as we make changes. In the past we've described it as a good time to be "slushy" eg we dno't have to freeze but keep in mind potential impacts and try to avoid impacting the release process if possible19:02
ianwo/19:03
clarkbLooking at zuul queues they aren't quite as deep as I expected. but I expect that to grow through the week as last minute changes get merged19:03
clarkb#topic Actions from last meeting19:04
clarkb#link http://eavesdrop.openstack.org/meetings/infra/2021/infra.2021-08-24-19.01.txt minutes from last meeting19:04
clarkbianw had an action to look into logo hosting. That has happened and I've added a separate agenda topic just for that to catch up19:04
clarkb#topic Specs19:04
clarkb#link https://review.opendev.org/c/opendev/infra-specs/+/804122 Prometheus Cacti replacement19:04
clarkbfrickler and tristanC have left some thoughts on ^ if others can take a look and let me know what you think that would be helpful19:05
clarkbadding a service like prometheus seems like the sort of safe slushy activity we could do over the next few weeks while openstack puts its release together19:05
clarkb#topic Topics19:06
clarkbAt this point I'm not sure that topic will ever go away :)19:06
clarkb#topic Mailman server upgrades19:06
clarkbFungi ran through the lists.katacontainers.io upgrade to Focal last week. It appears to be working as expected19:07
clarkbI've cleaned up some of the testing nodes we had held/created for that process today.19:07
fungiyeah, upgrade went smoothly for the most part19:07
clarkbfungi: you had talked about measuring resource consumption on the upgraded server to try and infer if it would be a problem with the vhosts on lists.o.o19:07
fungii did not find time to restart services on the servers over the weekend19:08
fungibut maybe some night this week when things are slow19:08
clarkbok, I think that may be the last thing we are waiting on before we do an upgrade of lists.o.o? Do we want to schedule that upgrade assuming the metrics gathering happens and doesn't show any problems and postpone otherwise?19:09
fungiyeah, i think that would be fine19:10
fungithe sooner we can get it off xenial and ua the better19:10
clarkbfungi: you mentioned doing it on a weekend which works for me as well. Maybe September 12?19:10
clarkbThat is the Sunday before openstack's RCs go out19:11
fungiyeah, the 11th or 12th would work fine for me19:11
clarkbdo you think that would be enough time for prep? Also how concerned are you about impacts post upgrade? You did the first server so probably have the best grasp of that19:12
clarkbfungi: ^19:13
funginot especially concerned, the sysvinit scripts are the only real difference19:13
fungiand associated disk layout19:13
clarkbok in that case lets say start at 1500UTC Sunday?19:13
fungialso the age of the server, size of the rootfs, et cetera19:13
clarkber 1500UTC Sunday September 1219:14
fungisure, that seems good19:14
clarkbfungi: do you want to draft an announcement of that or should I?19:14
fungistill early enough it probably won't impact apac monday19:14
fungii can write up an announcement later today19:14
clarkbthanks!19:14
clarkbAnything else on the subject of list servers?19:15
funginot this week i don't think19:15
clarkb#topic Improve OpenDev's CD Throughput19:15
clarkbThis should be quick as I didn't have time last week to do the mapping. Sorry about that. I ended up being very distracted with some family stuff that poppped up. Things appear far more normal this week thankfully19:16
clarkbBasically everything after tuesday last week is getting transplanted to after tuesday this week. I plan to get started on that in the next day or two19:16
clarkbThat said I wanted to ask if the small easy improvements we have already made are helping19:17
clarkbFrom what I can see it seems to have helped quite a bit, but also wasn't paying too much attention last week after the inventory file matchers got updated19:17
fungiyeah, stuff is deploying faster i think, anecdotally observed anyway19:18
clarkbgreat and no concerns of jobs not running when we expected them?19:19
funginone that i've noticed, but that could easily go unspotted for weeks/months19:19
clarkbya something to watch out for, but not an immediate issue it seems19:20
clarkb#topic Gerrit Account Cleanups19:21
clarkbLast Tuesday I said I'd do the external id deletion for the last set of retired accounts tomorrow but that iddn't happen becuase I wasn't able to sit at a computer for reliably long enough periods of time19:21
clarkbI intend on actually doing it tomorrow and really hope nothing comes up this week :)19:21
clarkbJust a heads up19:21
clarkb#topic OpenDev Logo Hosting19:22
clarkbianw dug into this after the meeting last week and came up with a few different ideas. Ultimately it seems like maybe we were going to use docker images like a packaging format?19:22
ianwyes, i think this works19:22
fungiyeah, that seemed like a great solution19:22
clarkbianw: are there changes we should be reviewing for that?19:23
fungiwe basically build a docker image of our logos and then that can be used in building or deploying our containerized services19:23
ianw#link https://review.opendev.org/c/opendev/system-config/+/80593219:23
ianwthis builds a small container that has nothing but images/logos/whatever at a defined location19:23
ianw#link https://review.opendev.org/c/opendev/system-config/+/80593319:23
ianwthen tries to use it19:24
ianwalthough 805933 failed because we didn't build the container, i'm not sure how to get around that19:24
clarkbcool I'll review those after lunch. Are there related changes to update the paste and gerrit themes to pull from gitea hosting ? or maybe copy them similarly to what you have there for gitea?19:24
ianwit doesn't touch anything that triggers the container build in the gate, and there isn't one available for download19:25
clarkbianw: you need the requires and provides in the jobs?19:25
clarkbI think if you do that it may work?19:25
ianwhrm, will that *trigger* the job, or just order it correct?19:26
clarkbI think its meant to say that job must run (or if it is a soft requirement then it will only run if otherwise triggered)19:26
clarkbbut I could be wrong about how that interacts with a brand new image.19:26
ianwanyway we can discuss in #opendev, the only other thing was it uses buildkit to achieve this19:27
clarkbcorvus: do you know if we've written this down anywhere for opendev specifically? That might be another good output of the job mapping exercise and reorg we can be more consistent about those thinsg and write down how to use them properly for our jobs19:27
ianwiirc mordred switched zuul to this anyway, and it seems like it will one day be the docker default19:27
clarkbya there was some buildkit feature around mounts that I think zuul wanted to use? seems fine to use buildkey for richer image creation19:28
ianwyes, it's the same feature; being able to copy things in from another container19:29
fungiahh, as opposed to having to use the container image as a "layer" i guess19:30
ianw#link https://review.opendev.org/c/zuul/zuul/+/71271719:30
ianwis the zuul one, it's not merged19:30
corvusprovides/requires won't cause a job to run, just allow artifacts to pass between queue items19:31
corvuswhich includes docker images19:32
clarkbgot it so the first change could provide the image to the second change19:32
clarkband that would work even without an image published to docker hub because we have the zuul registry19:32
corvusso if change A runs a job that provides docker image foo, and change B runs a job that requires docker image foo, then change B's job will run with foo built for change A19:33
corvusclarkb: exactly19:33
clarkbianw: thank you for looking into this.19:34
corvusafter a really quick look, i think that since 805933 has a "RUN" command that references the assets image, then the gitea building jobs should require the assets image19:34
ianwright, but in this case, system-config-build-image-assets has file matchers that mean it only runs if one of the asset files are updated19:34
ianwmaybe the matchers for that job need to include the Dockerfile of images that use it19:35
corvusianw: but 805932 builds the image, so 933 would get it19:35
ianw(that job being ...build-image-assets)19:35
clarkbianw: no, the chagne that adds the assets image would build and create the image and put it in the zuul registry. Then requiring that image would allow the gitea job to get it from zuul registry19:35
clarkbyou don't need the assets job to run when gitea updates19:35
corvusprovides/requires is specifically for bridging across changes (it doesn't do anything for jobs within a single change).  to make both work, you need provides/requires and job dependencies19:36
fungiand later updates to the gitea image would just pull it from dockerhub if not provided by a change queued ahead of it?19:37
corvusso in this case we do want both things.19:37
ianwoh right, ok the jobs ran in parallel and the image wasn't pushed into the bulidset registry19:37
corvusfungi: correct19:37
ianwzuul didn't wait to start the gitea job because it didn't have the requires/provides.  ok, can fix that19:37
fungimakes sense19:38
corvus3 cases: 1) no change to an image: pull from hub.  2) change to assets image in same change as gitea image change: job dependencies cause assets image build to run before gitea image build. 3) change to assets image in one change, change to gitea in second: provides/requires causes assets build to run before gitea image build.19:38
corvusand provides/requires works whether or not the items are in the queue at the same time.19:39
corvus(if change B runs long after change A, it uses the database)19:39
clarkbThanks19:41
clarkbThat was all I had on the agenda I sent out.19:41
clarkb#topic Open Discussion19:41
clarkbIs there anything else to bring up?19:41
ianwcorvus: speaking of all this, https://review.opendev.org/c/zuul/zuul-jobs/+/798969 could probably do with your review.  that adds some more info on container builds19:42
fungioh, openstackid is in the process of moving to foundation webdev management19:42
fungii updated dns for it during the meeting and am working on the system-config cleanup change now19:42
fungibut expect some failures for, e.g., cert updates until we get things cleared out of system-config19:43
ianwi've been working on updating nodepool images to bullseye19:43
ianw#link https://review.opendev.org/c/openstack/diskimage-builder/+/80631819:44
ianwseems to be the major thing required in a dib release, if someone wants to double-check19:44
clarkbianw: how are you handling arm64 wheels when bumpingto bullseye? Do we have bullseye wheels built for openstack?19:44
ianwyes we should, the buildx phase has been ok19:45
ianw#link https://review.opendev.org/c/zuul/nodepool/+/80631219:45
clarkbcool, that would be my only concern since that can be slow without wheels pre built19:46
ianwis the change that updates thing.  i'll put some more documentation in.  the release job there will fail until we release a fixed dib19:46
clarkbI'll add those to the list of reviews to do this afternoon19:46
ianware we now in a process of updating the other images on a case-by-case basis?19:47
clarkbI think so, but I'm not sure if anyone has started yet. Doing the matrix eavesdrop bot should be easy since that one was done previously?19:47
ianw(updating them to bullseye)19:47
ianwok, i can look at a few too.  while there's no rush, it also seems worth getting the pain over when there isn't a pressing need19:48
clarkbthanks!19:48
clarkbSounds like this may be it. Thank you everyone. We'll see you here next week. Feel free to reach out in #opendev or on the service-discuss mailing list19:49
fungithanks clarkb!19:50
clarkb#endmeeting19:50
opendevmeetMeeting ended Tue Aug 31 19:50:05 2021 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:50
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2021/infra.2021-08-31-19.01.html19:50
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2021/infra.2021-08-31-19.01.txt19:50
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2021/infra.2021-08-31-19.01.log.html19:50

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!