Thursday, 2025-04-03

tonybhuzzah01:27
tonybWhen y'all are awake any chance I can get some reviews on: https://review.opendev.org/q/project:zuul/nodepool+status:open+owner:tonyb06:04
fricklertonyb: I can't ensure being awake, but in my current state I only have a -1 for you06:13
tonybfrickler: I'll take a -106:15
fricklerzuul-lb02 says: Packages with upgradable origin but kept back: Ubuntu noble-updates: rsyslog                                                                                    06:32
frickleroops where did these trailing spaces come from? anyway, I'll check what apt says about this when invoked manually06:33
noonedeadpunkI guess this can be landed now?:) https://review.opendev.org/c/opendev/system-config/+/94606807:41
noonedeadpunkas seems that ubuntu has updated packages after the release: https://ubuntu-cloud.archive.canonical.com/ubuntu/dists/noble-updates/epoxy/07:42
tonybI think there are concerns about the AFS quota07:42
noonedeadpunkthere was plenty for uca last time I checked?07:43
noonedeadpunk(and UCA is quite small)07:43
tonybI can look at it in a bit, just explaining one potential holdup07:43
noonedeadpunkIf I'm not mistaken - that's current usage? https://grafana.opendev.org/d/9871b26303/afs?orgId=1&from=now-6h&to=now&timezone=utc&viewPanel=panel-3407:44
noonedeadpunk++07:44
noonedeadpunkI think when we discussed this a couple of days ago folks just wanted to wait for the release, not to fetch packages that will be updated in a day or so07:44
tonybnoonedeadpunk: Well if you're correct (and you probably are), there's no space issue there.07:47
tonybfrickler: reason I, or you, shouldn't +A https://review.opendev.org/c/opendev/system-config/+/946068 ?07:48
fricklertonyb: noonedeadpunk: ah, right, I wanted to do that yesterday already, but somehow I forgot about it with all the release things happening. approved and I'll check the sync log later07:58
tonybfrickler: Thank you08:02
noonedeadpunkthanks!08:02
noonedeadpunkwell, packages got updated today morning, so it's good it wasn't done yesterday :)08:02
fricklertonyb: I guess once noble is in, we can also finally proceed with kicking xenial out? ;)  https://review.opendev.org/c/opendev/system-config/+/88346808:12
fricklerbtw. this is the output from zuul-lb02, didn't we disable phased-upgrades everywhere? or only in our mirrors? I assume that we do not use our mirrors on these servers in order to be able to get security updates as fast as possible? https://paste.opendev.org/show/bRW1B90Hvu5pO9PMXH3K/08:13
tonybfrickler: Maybe?  I'd want to double check we don't have any old servers running xenial08:20
tonybfrickler: I have no idea about phased updates, I'd like to think that updates coming from ubuntu-security wouldn't be phased?08:31
opendevreviewMerged opendev/system-config master: Add Epoxy UCA to mirrors  https://review.opendev.org/c/opendev/system-config/+/94606808:37
frickleroh my, we're really still building xenial images. then I agree it isn't as trivial as I assumed. I'd still vote to proceed with the cleanup, but will wait for more feedback first08:58
frickleruca update deployed successfully, next run should be around 10 UTC09:00
tonybfrickler: I doubt I'll be of any help but I'll be around at 10 UTC09:22
noonedeadpunkok, nice (about uca)09:41
noonedeadpunkseems epoxy is there (ᵔᴥᵔ)10:55
fungiclarkb: yeah, the tomllib used in python throws a parsing error if you try breaking an inline table into multiple lines12:01
fungiapparently this would need toml 1.1.0 which still doesn't seem to have emerged12:04
fungifrickler: note that's coming from noble-updates not noble-security12:09
fungiwere you expecting a security update for it?12:10
fricklerfungi: no, I only mentioned security updates because that's the main reason I see why we'd use upstream repos instead of our mirrors, see the paste I posted. using our mirror would avoid delaying phased upgrades iirc. but there's also no real issue, I only checked the automated mail because we had long-stuck upgrades earlier 12:31
Clark[m]the concern with afs quotas is for Ubuntu and centos stream volumes. Not the uca volume. We need to increase quota for Ubuntu and uca13:34
Clark[m]Xenial sticks around for a number of reasons. On the infra side I'm happy to drop our testing and muddle through. However Openstack relies on it for translations? Or is that bionic? The main thing is we have to clean up what we can before removing the test images then dropping it from the mirror13:35
Clark[m]We have never used our CI mirrors for production nodes. I don't think we should either. We can however disable the phased updates on newer nodes whose apt supports it. It's just a config flag iirc?13:36
frickleryes to the latter13:37
fricklerand I think translation updates run on bionic, but I need to check again13:37
fricklernoonedeadpunk: noble epoxy update seems to have worked fine, do you have a job already where you can check it?13:39
fricklerthe other thing that came to my mind when looking at dib-image-list: do we want to clean up the 3y old gentoo images? and possibly also > 120d old openeuler?13:42
Clark[m]I think we should clean out Gentoo. And openeuler was in the process of getting updated (it broke because the mirror updated and then they didn't also update the images) but only updated half of what needed updating13:43
Clark[m]I think we were hoping that there would be more of a push for the second half but I guess that fizzled out13:44
fricklertranslation is ubuntu-bionic, see e.g. https://zuul.openstack.org/build/f330dfc184be480187a10f59c5f6d435 . there isn't a way to filter builds by nodeset, or is there?13:44
fricklerthe images are named openEuler-22-03-LTS*, new ones would be *-24-* I think? but anyway let me look at cleaning gentoo and hope JayF doesn't get too sad ;)13:46
fungiworth keeping an eye out for breakage today related to https://discuss.python.org/t/upcoming-changes-in-the-pypa-wheel-project/8596713:46
fungii checked back when the plan was first announced and couldn't find anywhere we're doing stuff they're removing, but just be aware as i could have missed something13:48
fricklernice, very considerate of them to wait until after our release ;)13:51
Clark[m]Er I realize I said we need to increase quota for Ubuntu and uca earlier. I meant Ubuntu and centos stream13:52
noonedeadpunkfrickler: well, partially. https://review.opendev.org/c/openstack/openstack-ansible-openstack_hosts/+/945569 is passing now, I could push some with depends-on, but I'd rather wait for this one to merge and then any other job will see the result13:56
noonedeadpunkbut at least apt is fine13:57
noonedeadpunkhttps://zuul.opendev.org/t/openstack/build/fe899483e4274a83b5de4b5837092cb8 was failing before fwiw13:57
JayFfrickler: I didn't even know we had published Gentoo images13:59
fungithey've been held and broken for a long time14:06
opendevreviewDr. Jens Harbott proposed opendev/base-jobs master: Drop gentoo-17-0-systemd nodeset  https://review.opendev.org/c/opendev/base-jobs/+/94626414:47
clarkbtonyb: I know you expressed interest in doing afs quotas14:58
clarkbtonyb: I'll hold off on doing them myself so that you can potentially do that today. Let me know if you have questions (afs has its quirks)14:59
dan_withHi, I work for Rackspace Flex. I would like to see if you can reboot a VM in your account? VM instance ID: 902d04c1-e5b5-45dc-8317-ce90e3114ebc, name: mirror01.dfw3.raxflex.opendev.org? It has a volume attached that only has one multipath connection. 14:59
dan_withEven better if you can power it off for 15 minutes to let me migrate the VM and volume15:00
fungidan_with: how urgent is it? if you can wait a couple hours, that would give us an opportunity to let any running jobs in that region complete so we don't cause any failures15:04
fungithen we can take the server offline as long as you need15:04
fungiif it's urgent, we can of course absorb the hit, just not as gracefully15:05
dan_withIt can wait a few hours. It's not in immediate danger. I just don't want it to be more than a few hours. Thanks15:05
fungidan_with: you've got it, i'll get things underway for that asap and let you know once we've got the server powered off15:05
dan_withok thanks15:06
clarkbanother option (and I'm not advocating for this now just noting it for the future) is that nodepool is quota aware. So dan_with could set our instance quota to 0 then wait for graphs on https://grafana.opendev.org/d/6d29645669/nodepool3a-rackspace-flex?orgId=1&from=now-6h&to=now&timezone=utc&var-region=$__all to show the region as unused and do the reboot on the cloud side15:06
clarkbI appreciate the coordination with us. Just wanted to make note of ^15:07
opendevreviewJeremy Stanley proposed openstack/project-config master: Temporarily turn down raxflex-dfw3 use  https://review.opendev.org/c/openstack/project-config/+/94626515:08
opendevreviewJeremy Stanley proposed openstack/project-config master: Revert "Temporarily turn down raxflex-dfw3 use"  https://review.opendev.org/c/openstack/project-config/+/94626615:08
dan_withYeah, ideally we would want to actually poweroff the VM from Skyline or OpenStack, then do migration to a new host. 15:09
fungii've self-approved 946265 but will also apply it locally on nl05 so it takes effect a little sooner15:10
fungiclarkb: nodepool-launcher notices config changes straight away and doesn't need a restart, right?15:11
clarkbfungi: changes to its own config yes. I think its less reliable (or doesn't happen at all) for clouds.yaml15:14
clarkbso ya if you manually edit the file (be careful to get the region correct the ansible yaml rewriting changes the order and makes things harder to read) it should take effect immediately then the ansible run should noop and enforce the new value15:14
fungicool, that's what i did15:15
clarkbwe will also want to put the server in the emergency file while it is offline15:15
clarkbseveral ansible playbooks will be sad if it is down15:15
fungiaha, yeah i'll go ahead and add it now15:16
clarkbthe grafana dashboard I linked previously reflects your update15:18
clarkband we are down to 2 in use nodes15:19
clarkbhrm max just went back to 3215:19
clarkbdid you race the hourly jobs maybe?15:19
clarkbI think you must've15:19
fungiclarkb: no, i accidentally set the wrong region to 0 initially because of the way the associative arrays are randomly arranged in the generated nodepool.yaml file15:20
fungibut it's on the right one now15:20
fungiand it's fallen to 23 in-use according to `nodepool-launcher list`15:21
fungiwell, 24 now, i guess one was transitioning from booting to in-use15:22
clarkboh I see I mixed them up on the graphs15:23
opendevreviewMerged openstack/project-config master: Temporarily turn down raxflex-dfw3 use  https://review.opendev.org/c/openstack/project-config/+/94626515:25
fungiwe're down to 5 servers in dfw3 now15:45
corvusfungi: should probably do niz as well15:49
fungioh, right i forgot it was in use there too15:49
corvusi'll whip up a change15:49
fungithanks! saves me remembering where that is off the top of my head15:49
opendevreviewJames E. Blair proposed opendev/zuul-providers master: Disable rax-flex-dfw3  https://review.opendev.org/c/opendev/zuul-providers/+/94627315:52
corvusnow we'll see if the schedulers are running with the patch that implements that....15:53
corvussurvey says yes15:53
fungidown to 2 in-use servers in dfw3 now15:55
opendevreviewJames E. Blair proposed opendev/zuul-providers master: Revert "Disable rax-flex-dfw3"  https://review.opendev.org/c/opendev/zuul-providers/+/94627415:55
opendevreviewJames E. Blair proposed opendev/zuul-providers master: Use built-in noop job  https://review.opendev.org/c/opendev/zuul-providers/+/94627515:57
opendevreviewMerged opendev/zuul-providers master: Disable rax-flex-dfw3  https://review.opendev.org/c/opendev/zuul-providers/+/94627315:58
opendevreviewMerged opendev/zuul-providers master: Use built-in noop job  https://review.opendev.org/c/opendev/zuul-providers/+/94627515:59
fungiokay, dfw3 is empty now, shutting down the mirror there temporarily16:01
dan_withokay let me know when you are ready for me to do migrations16:02
fungistill reporting ACTIVE state in nova, so may take a little longer16:03
fungier, i was looking at the wrong region/mirror16:04
fungidan_with: mirror01.dfw3.raxflex.opendev.org is reporting POWEROFF state in nova now, so should be all set for you. take as long as you need, but please let us know when it's done16:04
dan_withRoger that. I will let you know when done16:04
fungis/POWEROFF/SHUTOFF/16:05
fungithanks!16:05
fungiclarkb: this is redundant, right? https://opendev.org/opendev/yaml2ical/src/branch/master/setup.cfg#L21-L2316:08
fungilooks like it's been in there since that file was first created16:10
fungiso i'm guessing it's cargo culted16:10
clarkbfungi: that is a bit brain melty16:15
fungiyeah16:15
clarkbfungi: I think it may not be redundant because pbr is saying its own setup hook is that one but others may override?16:15
clarkbit depends on how we look that up16:15
clarkboh but this is in yaml2ical16:16
fungiright16:16
fungiit's not clear why a pbr-using project would have to specifically indicate pbr's setup hook, since i thought pbr injected that16:17
clarkbfungi: pbr/util.py looks for [global].setup_hooks not setup-hooks. So ya its not doing anything. But then pbr always runs its setup hook after the project specific setup hooks so it is also redundant16:17
clarkbI agree it is redundant (and also buggy so it does nothing and then uses pbr defaults)16:17
fungithanks, that's what i thought. will clean it up as part of the overhaul i'm in the middle of16:18
opendevreviewJeremy Stanley proposed openstack/project-config master: Move yaml2ical to the opendev tenant  https://review.opendev.org/c/openstack/project-config/+/94628016:36
dan_withIf you start the VM now without putting it back in rotation/pool, can you do that so I can check status of multipath connections?16:39
clarkbwe can16:40
fungilooks like it's already started16:40
clarkbhttps://mirror.dfw3.raxflex.opendev.org isn't returning data for me yet16:41
fungistatus ACTIVE16:41
fungiupdated 2025-04-03T16:23:27Z16:41
fungiso it was presumably booted about 18 minutes ago16:41
fungithough i'm currently unable to ssh into it16:42
clarkbdoesn't seem to ping either. Might need to check the console log?16:42
fungi`openstack console log show ...` returns an error16:42
fungiInstance 902d04c1-e5b5-45dc-8317-ce90e3114ebc could not be found.16:43
dan_withhold on just a second16:43
clarkbinfra-root can I get reviews on https://review.opendev.org/c/opendev/system-config/+/946050 to update our gerrit image to the latest bugfix release? If we think we're sufficiently past the openstack release I'd be happy to do a gerrit restart for that early tomorrow morning? That should be a good time for it17:03
opendevreviewJeremy Stanley proposed opendev/yaml2ical master: Update Python versions and boilerplate  https://review.opendev.org/c/opendev/yaml2ical/+/94628417:04
opendevreviewJeremy Stanley proposed opendev/yaml2ical master: Address W504 linting rule  https://review.opendev.org/c/opendev/yaml2ical/+/94628517:04
clarkbfungi: I left a question on 946284 before doing a more in depth review (wanted to see test results too)17:22
clarkbmnaser: I'm starting to look into replacing our gerrit server (review02 currently hosted in vexxhost ca-ymq-1) with a new server in the same location to do two things: 1) update the base operating system and 2) switch flavors to vexxhost's v3 flavors so that we can avoid boot from volume (in particular this will simplify server rescuing should we ever need that). Currently our quota17:28
clarkblimit for memory is 204800, we're using 156672 of which ~128gb is the existing server. Would it be possible to bump that limit up so that we can have a second 128gb server up for a few weeks (my goal is to have the new server in production by the 18th so maybe we would shutdown the old server by the 25th preserving only its old root disk and volume?)17:28
clarkbmnaser: happy to chat more if it helps or answer questions.17:28
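The quota arithmetic behind that request can be sketched as follows. The limit and usage figures are the ones quoted above; treating "~128gb" as exactly 128 * 1024 MB is an assumption made only for this illustration, and the function names are hypothetical.

```python
# Figures quoted in the chat (memory quota, in MB).
QUOTA_LIMIT_MB = 204800
IN_USE_MB = 156672

# Assumption: "~128gb" is taken to mean exactly 128 * 1024 MB.
NEW_SERVER_MB = 128 * 1024


def headroom(limit_mb: int, used_mb: int) -> int:
    """Memory still available under the quota."""
    return limit_mb - used_mb


def minimum_limit(used_mb: int, request_mb: int) -> int:
    """Smallest limit that lets the request fit alongside current usage."""
    return used_mb + request_mb


if __name__ == "__main__":
    free = headroom(QUOTA_LIMIT_MB, IN_USE_MB)
    print("free under current limit (MB):", free)
    print("second server fits:", NEW_SERVER_MB <= free)
    print("minimum limit needed (MB):", minimum_limit(IN_USE_MB, NEW_SERVER_MB))
```

Under these assumptions the current limit leaves too little room for a second server of the same size, which is why a temporary bump is being requested.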
mnaserclarkb: if you drop boot from volume it will make it harder for us to live migrate the server though :<17:30
mnaserbut we can totally do it that's fine17:30
clarkbthat is good to know about live migration. I wonder if the gerrit server is very live migrateable anyway just due to its size and activity? We can probably take cold migrations periodically if they aren't too long.17:31
fungii didn't realize bfv made live migration easier, something to do with avoiding the nova image cache?17:31
clarkbthe problem is server rescuing is an almost completely undocumented magical dark art with boot from volume17:32
mnaserit very much is lol17:32
mnaserclarkb: i'm curious if you check instance actions for the gerrit instance if we've already done live migrations17:32
clarkbmnaser: would that show up under openstack server show output?17:32
mnaseropenstack server action list i think17:33
clarkb`server event list` seems to be it. Double checking now17:34
clarkbyes there are several live migrations. Based on timestamps we have two in july 2021, four a few minutes apart in september 2021, one in october 2022, and three a few minutes apart in april 202317:37
fungiso it's been about 2 years since the last one17:37
fungii guess we're overdue ;)17:37
clarkband at that frequency I feel like we could take a cold migration once in a while if necessary17:38
clarkbI guess it's a tradeoff between relying on live migration to hopefully make things painless most of the time and incurring higher pain if something goes really wrong vs making the really wrong case easier to work with and having more periodic lesser pain17:38
clarkbI personally want to avoid major pain in the major problem cases (where you'd rescue) but I'm open to being convinced otherwise particularly since we have never had to rescue this instance17:39
opendevreviewJames E. Blair proposed zuul/zuul-jobs master: Add eatmydata support to ensure-zookeeper  https://review.opendev.org/c/zuul/zuul-jobs/+/94628917:40
mnaserclarkb: also the other thing is i have no data guarantee if you dont use bfv17:42
mnaserit is local storage 100%, so if the local raid array blows up17:42
mnaserit's time to pull out backups17:42
mnaser(i dont wanna nag, i just wanna manage expectations :))17:42
clarkbmnaser: the important data does live on a data volume that we mount17:42
mnaserahhh okay got it17:42
clarkbmnaser: and its definitely not nagging this is important useful info. I appreciate it17:42
mnaserso /var/gerrit or whatnot goes into data, but the actual os in the root17:42
clarkbyup17:43
mnaserok so you're good it's almost like you've used this cloud for a while =)17:43
clarkbmnaser: with v3 flavors you can choose to bfv or use local data disks?17:43
clarkbor is bfv v2 flavors only?17:43
mnaseryes, if you give it a boot volume, it will boot from it, otherwise, whatever the root_gb is will be on local disk17:43
clarkbgot it. So maybe the thing to do is sleep on it with the info we have and decide if we stick to bfv for its upsides given the downside has never arisen yet17:44
mnaseryeah v3 you can have both :)17:44
clarkband the downside is solveable its just not as easy as it would be without bfv17:44
clarkbmnaser: have you ever successfully rescued an instance that was booted from volume in that region? I'm just wondering if anyone else might be able to point us in the right direction should it come up17:45
clarkbI could boot a test node and figure it out myself. Maybe that is step 0 and document it17:45
corvusi wonder if lack of storage guarantees on / would increase the chances we would need to perform a rescue17:46
clarkbcorvus: ya that very well could be17:46
clarkband more generally maybe this is feedback we should try to get back to the nova team. Boot from volume is valuable to users for data integrity but it is difficult to work with which can scare people away. Addressing the difficulty would help users take advantage of important features17:48
clarkbok I think I've convinced myself to test bfv rescuing first on a new dummy node (no quota updates needed for that)17:49
jrosserI have done bfv rescue on my clouds, I can dig some notes out later if that’s helpful17:49
clarkbjrosser: ya I seem to recall you helped the last time this came up17:49
clarkbbut then we sorted it out otherwise or something and never did the last step of actually testing it17:49
jrosseriirc there was an image property involved and needing to specify a micro version on the cli17:51
jrosserand an unobvious hazard of not being able to rescue bfv instance with the same image17:51
clarkbthat sounds familiar. You basically need a special rescue image with a special property set. And you have to use a microversion that enables rescuing bfv instances at all and due to the special image you have to supply an image and can't rely on the default which is to use the existing image17:52
clarkbjrosser: I guess if you have those details that would be helpful. But no rush I probably won't get to this until tomorrow at the earliest17:54
clarkbfungi: yaml2ical's pyproject.toml has an invalid license field but it isn't apparent to me why that is based on the error and what is in the file17:56
dan_withI'm still working this issue17:58
clarkbdan_with: ack let us know if there is anything we can do to help17:59
fungidan_with: no worries, and best of luck! we're fine however long you need18:01
dan_withThank you. Still getting kinks worked out with a new cloud18:01
fungii feel at least partly at fault for whatever trauma openstack is causing you right now ;)18:02
dan_withlol ;)18:06
mnaserfungi: "A previous PEP had specified license to be a table with a file or a text key, this format is now deprecated. Most build backends now support the new format as shown in the following table."18:13
mnaseri wonder if too old setuptools that is doing that validation there18:13
mnasersince it does seem to want a key with file or text18:13
*** dhill is now known as Guest1285718:14
fungimnaser: that's a too-old setuptools, yes18:14
fungii think it needs at least setuptools 7718:14
mnaseror deprecated format :P18:15
mnaserdepends how long you want your afternoon to be =P18:15
clarkbmnaser: if it were you it sounds like you'd use bfv ya?18:16
mnaserclarkb: i feel like there's likeliness that the os would get so borked it would require a rescue, even less if it's all sitting on fairly reliable ceph18:16
mnasersorry, little likeliness18:17
fungimnaser: looks like the fix is that test-release-openstack needs to run on a newer python version18:17
clarkbmnaser: ya I think that is what corvus is saying too. Basically by being bfv we reduce the likelihood and then you don't have to worry so much so maybe its best to take advantage of that18:17
fungiright now it's running on ubuntu-focal with python 3.8 and the necessary setuptools version to support that syntax only works with python 3.9 or newer18:17
mnaserfungi: that'll do it :)18:17
clarkbfungi: we can drop the job particularly if moving into the opendev tenant and just have your nox targets build an sdist first or something18:18
mnaserclarkb: yeah, that's my thoughts to be honest, also less likely to scramble for a rebuild if a hypervisor blows up too18:18
fungiclarkb: yes, that will likely be my "fix"18:18
clarkbmnaser: ack I think that perspective is useful too18:18
clarkbsince you've got a much broader set of experience dealing with the cloud and openstack and when things fail than we do18:18
mnaserif a hypervisor dies, i start your instance up on another node and we go back to regular programming18:19
mnaservs you having to rebuild the whole thing at 2am18:19
clarkbI'm coming around to that and thinking bfv is a good idea even with the separate data volume18:19
mnaseri say bfv + separate data for sure, to make migration/updates easier18:20
clarkb++18:20
clarkbjust need the quota bump and I should be able to start spinning something up that we can migrate onto18:20
mnasersorry i forgot if i asked the project id18:20
mnaseri should be able to add, just let me know what you want the new values to be18:20
clarkb204800 is the current value. Maybe we double it? that would be 409600 (happy to do less if you prefer).18:21
clarkbLet me find the project id18:21
mnaseropenstack token issue -- or if you have the project name18:22
mnaseropenstackci maybe it was?18:22
clarkbya just confirmed that appears to be it18:22
clarkbopenstackci specifically18:22
mnaserdone18:22
clarkbI see it reflected in the limits show --absolute output. Many thanks!18:23
clarkband for the record the other limitation is max_total_volume_gigabytes but by my math we have sufficient headroom there18:24
clarkbso no need to update it18:24
opendevreviewJeremy Stanley proposed openstack/project-config master: Move yaml2ical to the opendev tenant  https://review.opendev.org/c/openstack/project-config/+/94628018:25
opendevreviewJeremy Stanley proposed opendev/yaml2ical master: Update Python versions and boilerplate  https://review.opendev.org/c/opendev/yaml2ical/+/94628418:25
opendevreviewJeremy Stanley proposed opendev/yaml2ical master: Address W504 linting rule  https://review.opendev.org/c/opendev/yaml2ical/+/94628518:25
fungiclarkb: as far as the not wrapping lines after operators rule, that was apparently something pep 8 was edited years back to reverse the earlier recommendation that you should18:26
clarkbI can't help but feel like that is a very pythonic approach. Tell everyone you should do things one arbitrary way and get them used to it. Then as soon as everyone is comfortable change to do the exact opposite18:27
fungihah18:27
fungiso true18:27
clarkbmnaser: oh I also meant to ask if you've seen any gitea slowness issues? I've tried to sort of keep an eye on it and haven't seen evidence of that since we did the memcache update and further blocked some ai web crawlers18:28
fungihttps://peps.python.org/pep-0008/#should-a-line-break-before-or-after-a-binary-operator has some of the discussion18:29
mnaserclarkb: it's actually been pretty good lately, haven't seen any issues18:29
clarkbmnaser: awesome thanks for confirming. I think our improvements to web traffic handling for crawlers, memcached, and a bugfix to prevent OOMs in gitea have made gitea much more stable in the last few weeks than it was a couple months ago18:30
clarkbfungi: those arguments are convincing but once you've spent decades doing it the other way you don't get to ban hammer everyone for doing it that way18:30
clarkbjust accept both18:30
fungiindeed, we could just ignore both W503 and W50418:31
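For reference, the two rules are mirror images of each other: W503 flags a line break before a binary operator and W504 flags one after. A small sketch with hypothetical function and argument names shows both layouts, which behave identically at runtime.

```python
def total_break_before(gross: int, tax: int, fees: int) -> int:
    # Operators lead each continuation line: the layout PEP 8 now
    # suggests for new code, and the one W503 complains about.
    return (gross
            + tax
            - fees)


def total_break_after(gross: int, tax: int, fees: int) -> int:
    # Operators trail each line: the older layout, and the one
    # W504 complains about.
    return (gross +
            tax -
            fees)


if __name__ == "__main__":
    print(total_break_before(100, 20, 5), total_break_after(100, 20, 5))
```

Enabling both checks at once leaves no acceptable way to wrap such an expression, so a project either picks one or ignores both.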
clarkbI think it was noonedeadpunk who also had gitea problems fetching constraints? I suspect that problem has faded away too18:36
fungihere's hoping18:36
opendevreviewJames E. Blair proposed zuul/zuul-jobs master: Add eatmydata support to ensure-zookeeper  https://review.opendev.org/c/zuul/zuul-jobs/+/94628919:05
dan_withhey @fungi, you okay with me starting that VM now to make sure everything is good?20:20
opendevreviewJames E. Blair proposed zuul/zuul-jobs master: Add eatmydata support to ensure-zookeeper  https://review.opendev.org/c/zuul/zuul-jobs/+/94628920:21
opendevreviewJames E. Blair proposed zuul/zuul-jobs master: Remove Red Hat support from ensure-zookeeper  https://review.opendev.org/c/zuul/zuul-jobs/+/94630420:21
fungidan_with: you bet, go for it!20:22
dan_withok20:22
corvusdan_with: yes, it's safe to restart it at any time20:22
clarkbI'm going to pop out for a bike ride while the weather is nice20:58
clarkbtonyb: I can help with afs quota stuff when I get back20:58
fungii'm also around to help with afs adjustment questions21:00
dan_withI'm going to need to keep the server down--there is an error with glance/swift and the people who have access are away for the day. Are you all okay with it being down over night or does it need to come back online today?21:52
fungidan_with: yeah, that's perfectly fine. sorry it's giving you so much trouble21:53
dan_withThanks for understanding. It uncovered an interesting issue that will make the cloud better. I'll keep you updated. It will be my priority until it is back online. 21:54
fungimuch appreciated. we're notorious for exposing bugs in our own software somehow21:55
dan_withlol. That's just good work21:55
tonybclarkb, fungi: Yup I'd like to do the AFS quota updates, but I'm in and out this morning so if we don't overlap maybe it'd be quicker if y'all do it this time22:22
tonybI assume I do account setup etc on afsdb01?22:35
fungitonyb: you mean addition of a superuser?22:37
fungitonyb: you can just use localauth on one of the fileservers if you prefer22:38
tonybYeah (I think)  I have created krb5 accounts tonyb{,/admin}@OPENSTACK.ORG, now I'm doing the `pts createuser` step (which I assume I'd need to do to issue quota change commands)22:38
fungitonyb: for example, recently i did this on afs01.dfw.openstack.org: sudo fs setquota -localauth /afs/.openstack.org/mirror/ubuntu-ports 85000000022:39
fungibecause i was lazy and didn't feel like fiddling with authenticating my admin account22:40
tonybOh, that's much easier22:40
fungi(or not really lazy so much as i think it was a time period where openafs in debian/sid wasn't building its lkm on the available version of the kernel)22:41
tonybmore efficient than lazy22:42
fungibut yeah, if you add an admin account for yourself, you can do those operations from the comfort of your own workstation too22:42
tonybOh okay, that's also good to know.22:42
fungiif you want help with the kerberos user principal and admin user creation steps, i'm happy to assist though22:43
fungiit eventually comes in handy, not so much for making quick quota adjustments but in other more involved work22:44
tonybI think I've done those already22:44
tonybWell the krb principal part22:44
fungiso not the pts createuser and pts adduser steps yet22:45
fungiand anyway, yeah i'd say do those similarly to the fs setquota example i gave, e.g. on afs01.dfw.o.o it'll be something like `sudo pts createuser -localauth $USERNAME -id UID`22:46
tonybYup, but the docs are good, I was just unclear where to run them.22:47
fungior it might be `pts createuser $USERNAME -id UID -localauth`22:47
funginot sure if the -localauth can go between createuser and your username22:48
fungiwould need to check the manpage/context help22:48
tonybOkay22:48
fungibut yeah, you'll either need to be root or sudo the command to use -localauth22:48
tonybFWIW: ```tonyb@afs01:~$ sudo pts createuser tony.admin -id 9 -localauth22:50
tonybUser tony.admin has id 922:50
tonyb```22:50
funginot quite as auspicious as 8, but still perfectly cromulent22:51
tonybfrickler: has that honor 22:51
fungihis id embiggens us all22:51
tonyb:)22:55
fungitonyb: unrelated, i replied to your comments on 946219, let me know if it's still confusing23:12
tonybOkay so I have accounts (I need to update UserList) I'm learning about AFS and verifying the capacity we have but from grafana it looks like we have 3 servers each with 5TB and we're using roughly 7.5TB, so we have "ooodles" of room left.  I haven't checked if the sum of all the quotas is < 15TB but I don't think that's needed .... So how much additional quota am I adding to which volumes (I guess mirror.ubuntu and 23:16
tonybmirror.centos-stream)23:16
Clark[m]Another thing to keep in mind is that you set quotas on the rw volume and the ro volume sort of catches up23:17
Clark[m]I would bump them both by say 50gb?23:17
Clark[m]The afs dashboard in grafana is a good resource for understanding usage and available room23:18
tonybfungi: LGTM Happy for me to +A those changes?23:18
fungisure23:18
tonybclarkb: Thanks.  I'll go ahead and do that23:20
funginote that it does change the representation in the package metadata slightly, and though pypi is able to figure out the previous arrangement and combine the separate "Author" (name) and Author-email metadata fields, the new arrangement is more correct and results in a name within the Author-email metadata field instead23:21
tonybfungi: noted23:21
opendevreviewMerged opendev/engagement master: Fix authors/maintainers format in pyproject.toml  https://review.opendev.org/c/opendev/engagement/+/94621923:22
fungitonyb: also the official spec is https://packaging.python.org/en/latest/specifications/pyproject-toml/#authors-maintainers23:23
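As a rough illustration of the metadata change fungi mentions: when an `authors` entry carries both a name and an email, the pair is rendered as a single `Name <email>` value in the Author-email field. The sketch below uses only the standard library; the author entry is a hypothetical placeholder and `author_email_field` is an illustrative helper, not part of any packaging tool.

```python
from email.utils import formataddr

# Hypothetical [project].authors entries, as they would appear after
# parsing pyproject.toml.
AUTHORS = [{"name": "Example Author", "email": "author@example.org"}]


def author_email_field(authors: list[dict[str, str]]) -> str:
    """Render entries that have both name and email as 'Name <email>'."""
    return ", ".join(
        formataddr((entry["name"], entry["email"]))
        for entry in authors
        if "name" in entry and "email" in entry
    )


if __name__ == "__main__":
    print("Author-email:", author_email_field(AUTHORS))
```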
opendevreviewJeremy Stanley proposed opendev/yaml2ical master: Update Python versions and boilerplate  https://review.opendev.org/c/opendev/yaml2ical/+/94628423:26
opendevreviewJeremy Stanley proposed opendev/yaml2ical master: Address W504 linting rule  https://review.opendev.org/c/opendev/yaml2ical/+/94628523:26
fungiClark[m]: 946280 got updated to remove the previously failing openstack-specific job too23:27
tonybSo mirror.centos-stream is currently 350GB (350000000) so adding 50GB == 40000000023:29
fungithat looks right to me, yep23:30
* fungi counts the zeroes again23:30
tonyband mirror.ubuntu is 1.2TB (1200000000) adding 50GB == 125000000023:30
fungiyes, correct number of nulls there too23:31
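The zero counting above can be double-checked with a few lines of arithmetic. This sketch assumes, as the chat does, that quota values are in kilobytes and that 1GB is treated as a round 1,000,000 KB; the helper name is hypothetical.

```python
# Decimal approximation used in the chat: 1 GB == 1,000,000 KB.
KB_PER_GB = 1_000_000

# Current quotas quoted in the chat, in KB.
CURRENT_QUOTAS_KB = {
    "mirror.centos-stream": 350_000_000,
    "mirror.ubuntu": 1_200_000_000,
}


def bumped_quota(current_kb: int, extra_gb: int) -> int:
    """Return the new quota in KB after adding extra_gb gigabytes."""
    return current_kb + extra_gb * KB_PER_GB


if __name__ == "__main__":
    for volume, current in CURRENT_QUOTAS_KB.items():
        print(volume, bumped_quota(current, 50))
```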
tonybThanks23:32
tonyb`sudo fs setquota -localauth ....`  is saying that -localauth isn't a valid flag and the man page indicates I need to login/auth to my tonyb.admin account.  but that is contradicted by what fungi said earlier23:35
fungihuh, i thought i had done that, but maybe it was an erroneous example from my shell history23:36
tonybOkay, I need to pop out for a bit.  I'll figure it out when I get back23:36
tonyb#Learning23:37
fungitonyb: i concur, command errors, manpage for fs_setquota also doesn't mention the ability to use localauth, i guess that's limited to pts, vos, et cetera23:39
fungiso the example authentication in the next section of our docs is what you'll want with your admin account (pagsh ..., export KRB5CCNAME=..., kinit ..., aklog)23:40
fungiyou can do that locally if you already have the tools installed, or on the fileserver if you prefer23:41
clarkbfungi: re 946280 I guess we'll figure out pypi later if we need it?23:42
fungiclarkb: no, the change in yaml2ical adds the non-openstack-specific jobs23:43
fungisame ones bindep uses23:44
clarkbah23:44
fungiso should "just work"23:44
opendevreviewJeremy Stanley proposed opendev/bindep master: Fix authors/maintainers format in pyproject.toml  https://review.opendev.org/c/opendev/bindep/+/94621823:45
opendevreviewJeremy Stanley proposed opendev/engagement master: Drop maintainers field from pyproject.toml  https://review.opendev.org/c/opendev/engagement/+/94631423:47
opendevreviewJeremy Stanley proposed opendev/yaml2ical master: Update Python versions and boilerplate  https://review.opendev.org/c/opendev/yaml2ical/+/94628423:48
opendevreviewJeremy Stanley proposed opendev/yaml2ical master: Address W504 linting rule  https://review.opendev.org/c/opendev/yaml2ical/+/94628523:48
fungireading https://packaging.python.org/en/latest/specifications/core-metadata/#core-metadata-maintainer i just noticed this disclaimer: "Note that this field is intended for use when a project is being maintained by someone other than the original author: it should be omitted if it is identical to Author."23:49
fungiso technically a mistake we've been making for years in setup.cfg as well23:54
