Tuesday, 2025-04-08

mnasererror: RPC failed; curl 56 GnuTLS recv error (-54): Error in the pull function.\nerror: 20 bytes of body are still expected\nfatal: expected flush after ref listing\n02:04
mnaserWe're seeing failed hits on opendev.org for our builds :x02:04
mnaserNoticing some timeouts02:05
tonybmnaser: checking ...02:05
tonybmnaser: Definately something going on02:09
mnaserWe haven't had any changes on our side. 02:13
tonybThe load-average on a few of thew servers is "high" and there seesm to be more than one user-agent crawling 02:19
mnaserah, the classic. 02:23
tonybI'm able to really pinpoint any errors, the load is a little high but not problematically so03:04
tonybmnaser: If see more timeouts can you capture the client ip/timestamp for me to do more detailed digging?03:04
mnaserSure. 03:08
opendevreviewDr. Jens Harbott proposed openstack/project-config master: Only pause update_constraints.sh when needed  https://review.opendev.org/c/openstack/project-config/+/94654105:36
zigoNot sure who to ask, but the single-use code from OpenInfraID are valid for only 10 minutes. That's the time it takes for mail to pass greylisting. Could this be increased to at least 30 minutes ?09:29
tonybI understand your pain.   That's one for fungi or clarkb to raise with tipit09:31
zigobtw, what's the schedule URL for the PTG ? :)09:31
zigoFound it ...09:32
*** ykarel_ is now known as ykarel12:52
*** mrunge_ is now known as mrunge13:59
opendevreviewyatin proposed opendev/irc-meetings master: Move Neutron CI Weekly meeting 1 hour earlier  https://review.opendev.org/c/opendev/irc-meetings/+/94663114:00
fungizigo: thanks! i brought the 600-second openinfraid otp timeout up with the developers who maintain it, and they've indicated it's easily configurable so we'll see what they feel comfortable increasing it to14:51
opendevreviewClark Boylan proposed opendev/system-config master: WIP test gerrit on noble  https://review.opendev.org/c/opendev/system-config/+/94663715:03
clarkbinfra-root ^ I realized I should take a step back on replacing gerrit and ensure testing is happy with it but then also write out a plan and do more ansible behavior checking when adding a new server etc and dump that all into an etherpad we can look over before making potentially disruptive changes15:04
opendevreviewMerged opendev/irc-meetings master: Move Neutron meeting to 1300 UTC on Tuesdays  https://review.opendev.org/c/opendev/irc-meetings/+/94606315:06
opendevreviewMerged opendev/irc-meetings master: Move Neutron CI Weekly meeting 1 hour earlier  https://review.opendev.org/c/opendev/irc-meetings/+/94663115:07
clarkbfollowing up on sad gitea from last night all of the current backends seem to be operating nominally after a quick check. Let us know if that changes and I'm happy to dig in more15:09
fungiclarkb: i think i've addressed your comments on https://review.opendev.org/946284 if you get a chance for another look15:11
clarkbwill do15:13
opendevreviewClark Boylan proposed opendev/system-config master: Update gitea to 1.23.7  https://review.opendev.org/c/opendev/system-config/+/94664015:15
opendevreviewMerged opendev/yaml2ical master: Update Python versions and boilerplate  https://review.opendev.org/c/opendev/yaml2ical/+/94628415:28
opendevreviewMerged opendev/yaml2ical master: Address W504 linting rule  https://review.opendev.org/c/opendev/yaml2ical/+/94628515:28
clarkbhttps://etherpad.opendev.org/p/i_vt63v18c3RKX2VyCs3 gerrit planning document is coming together there. Bit of an outline for now as I'm paying attention to the foundation board meeting15:30
clarkbas a reminder we will not have a team meeting today16:01
clarkbinfra-root https://gerrit-review.googlesource.com/c/homepage/+/464287 gerrit is discussing adding native support for jujutsu change-id headers16:30
clarkbI left a couple comments there with a tl;dr of basically adding that support sounds great but not if it breaks existing users workflows (which it will as proposed so I suggested an alternative)16:30
*** diablo_rojo_phone is now known as Guest1317516:32
clarkbjitsi meet just published a new release16:32
clarkbmeetpad02 and jvb02 are still in the emergency file so I think we're fine and won't update16:32
clarkbok https://etherpad.opendev.org/p/i_vt63v18c3RKX2VyCs3 is starting to have some more interesting content like how do we want to handle manage-projects which will fail on the new server for a bit16:46
corvusclarkb: i think https://gerrit-review.googlesource.com/c/homepage/+/464287/7/pages/design-docs/support-jujutsu/use-cases.md#45 may be the answer to the question in your comment?16:55
corvus(also the next one)16:55
corvusnot sure if you're aiming for a solution that meets those criteria, or are advocating the criteria be changed (or maybe i misunderstand the criteria)16:55
clarkbcorvus: I'm saying that criteria is wrong/broken because it breaks Git users16:57
clarkbI'm trying to argue that adding jujutsu support is fine as long as they do not break Git users who have used Gerrit for more than a decade16:57
corvusgot it16:57
clarkband I think that is possible if you have the commit hook respect the jujutsu change-id and add it to the commit message16:58
clarkbI think this change is particularly problematic for OPenDev because I don't want to spend my time doing git and jujutsu compatibility support16:58
clarkbwhich will be the case if gerrit allows jujutsu to push without a change-id in the commit message due to rebasing losing the header field16:58
corvusyeah, that sounds like a reasonable adaptation. i do wonder if some of the pressure here is coming from the kernel and this is all really just an end-run around kernel devs opposition to Change-Id footers, which would make the current acceptance criteria non-negotiable17:00
fungiclarkb: it looks like a primary reason for their proposal is to stop requiring the commit hook17:01
clarkbya I don't think a clear motiviation is expressed beyond supporting jujutsu change ids17:02
corvusit seems like the real technical issue is that jj change ids are not preserved on cherry-pick/rebase.  if it weren't for that, then everything would work fine with gerrit and i could see us easily getting to a point where there's no footer at all anymore.  they mention that as a future stretch goal in the doc.17:02
clarkbcorvus: yup17:02
corvusprobably no one wants to say :)17:02
fungiin the "background" section it talks about the commit message hook being a constant source of confusion for new gerrit users17:02
corvusif things happened in the other order: git was updated to add and not delete change-id headers, then everything would be a lot easier i think.17:02
corvusgit is a constant source of confusion for new git users17:03
fungiyes17:03
corvusalso old ones17:03
clarkbcorvus: ++ though we'd need a minimum git version but at least then you could tell people there is a light at the end of the tunnel17:03
*** Guest13175 is now known as help17:03
clarkbI just think that someone deciding to use a different client breaking everyone using the old "official" (for lack of a better term) client is a terrible decision17:03
*** help is now known as Guest1317917:04
fungiclarkb: first i've heard of it, but it looks like they consider jujutsu to be a completely separate version control system (somewhat compatible with and implemented on top of git), not merely a special git client17:05
clarkbfedora, debian, and ubuntu lack jujutsu packages but tumbleweed ahs them17:05
*** Guest13179 is now known as diablo_rojo_phone17:06
clarkbfungi: yes jujutsu is its own DVCS system but in the case of Gerrit it will be speaking git 100% of the time aiui17:06
clarkbthe interop layer is jujutsu's support of git fetch and push protocols17:06
clarkbanyway what that means is I as a tumbleweed user can trivially decide to use jujutsu to push changes to gerrit that people using debian, ubuntu, or fedora would either need to be "experts" to update by manually discovering the change id and adding it to commitm essages or they would have to install jujutsu from somwhere else and switch tools17:08
clarkbI'm all for supporting a different set of client tools but not at the expense of existing users is all17:08
clarkband I think that is achievable with a simple hook update17:09
fungithough it would require jj users to still install and use gerrit's hook17:11
corvusyeah, so all gerrit users are equal until git is updated to support jj change-ids17:12
clarkbright17:12
corvusmaybe it's enough to still have the option to require the change-id footer in a particular gerrit install?  so opendev has that option enabled but, like, companies doing kernel dev with gerrit don't?17:16
fungithat is, of course, assuming jj even supports traditional git hook scripts (i suppose it must?)17:16
clarkbya looking further maybe that is the unstated issue17:17
clarkbthe jujutsu does point at the pre-commit tool (ugh)17:17
clarkb*docs17:17
clarkbcorvus: ya maybe that is a workaround17:18
clarkbthinking about this more its also possible that we could update git review -d and -x etc to rewrite things when our users fetch stuff if the data ins't in the commit message17:20
clarkbthen at least our common tooling would mitigate things17:20
corvusyeah, there's an old resolved comment about doing something similar with the download plugin17:20
clarkbin https://etherpad.opendev.org/p/i_vt63v18c3RKX2VyCs3 I've bolded the major open question I have at this point. Feedback on that would be appreciated. Otherwise I think I've collected enough info to at least boot the new server (but problaby not add it to the inventory yet). I'll look into doing the server boot after lunch most likely17:27
opendevreviewMerged zuul/zuul-jobs master: Add role: `ensure-python-command`, refactor similar roles  https://review.opendev.org/c/zuul/zuul-jobs/+/94149017:37
clarkbthe gitea 1.23.7 change looks good https://review.opendev.org/c/opendev/system-config/+/946640 (I checked the system-config screenshot and the job passed)18:19
clarkbthats another one I'm happy to babysit this afternoon18:19
clarkbin prep for booting a new gerrit server I have created a new data volume for gerrit. I kept the same size (1024GB) and type (nvme) as the existing data volume. nvme appears to be the default anyway18:32
clarkbI will boot with a 64gb root disk boot from volume to also match the old server. I think the argument that we're far less likely to have problems with bfv has won me over. I'm willing to deal with more apinful recoveries if the chance of needing to do so in the first place is significantly less18:33
fungimakes sense18:40
*** jamesdenton_ is now known as jamesdenton18:52
clarkblaunching review03 failed when trying to set up the data volume mount path at /home/gerrit219:44
clarkblooks like we leak the bfv volume when that happens19:45
clarkbhttps://opendev.org/opendev/system-config/src/branch/master/launch/src/opendev_launch/mount_volume.sh#L26 this check failed and we exited 1. It was checking /dev/vdb. I notice that review02 has mounted /dev/vdc and wonder if it identified the wrong device name (that comes from the openstack apis)19:46
clarkbI think the issue is that we run make swap before we run mount_volume.sh and make_swap uses /dev/vdb19:48
clarkbone way to solve this would be to flip the order of the two scripts. Another is for me to attach the volume after launch node runs and more manually do things19:48
clarkbI'm going to try swapping the order since I think that is better for us long term. But first I'm going to delete these volumes and start over (the scripts don't work fi there is already content on the volumes)19:50
fungiyeah, i've still never had the separate volume creation feature in launch-node match what i needed/expected19:50
fungiand simply creating and attaching a volume after booting the server is simple enough anyway19:50
clarkbok both new leaked bfv and new data volume have been deleted19:53
clarkbstarting over now19:53
fungicpython 3.14.0a7, 3.13.3, 3.12.10, 3.11.12, 3.10.17 and 3.9.22 were just tagged19:55
fungi3.14.0a7 is the final alpha, next prerelease will be 3.14.0b119:55
opendevreviewClark Boylan proposed opendev/system-config master: Create swap after dealing with data volumes  https://review.opendev.org/c/opendev/system-config/+/94670919:59
clarkbthat change got me past the previous error. I don't know why the bfv volume wasn't automatically claened up19:59
clarkbwe do delete the server then attempt to delete the volume but we got this error: "Invalid volume: Volume status must be available or error or error_restoring or error_extending or error_managing and must not be migrating, attached, belong to a group, have snapshots or be disassociated from snapshots after volume transfer."20:02
clarkbI suspect that we're not waiting long enough for the volume to detach and become available after the server is deleted. We could add a busy wait20:02
clarkbit looks like it tries to delete all volumes attached to the server though which may not be what we want either20:03
clarkbhrm20:03
fungii wonder if we should be explicitly enabling the delete-on-termination feature for bfv instances20:07
clarkbmaybe. That gives us less flexibility for having the image hang aroudn after we delete a server20:08
clarkbwe would have to snapshot I think (which isn't the end of the world)20:08
fungiyeah, i mean, a normal non-bfv server deletes its rootfs when the server instance is deleted, so it's no worse20:10
opendevreviewClark Boylan proposed opendev/zone-opendev.org master: Add review03 to forward DNS  https://review.opendev.org/c/opendev/zone-opendev.org/+/94671120:13
clarkbmaybe only approve ^ when you are happy with the state of the server. Then I can ask for reverse ptr records at that point. I'm going to work on the inventory addition change which I will wip as well since that needs careful consideration20:14
clarkbthen I'll look at improving launch node further20:14
fungilogging into it now20:14
fungiserver (ubuntu version, filesystems, ram/swap) all lgtm20:15
fungiapproved the dns addition20:16
fungiboth ip addresses were correct and working for me to ssh as well20:16
fungimake sure to check talos/senderbase or something similar for the addresses being on relevant e-mail blocklists20:17
fungisince we do send a fair volume of notifications from gerrit20:17
fungii'll check as well20:17
clarkbfungi: yes I checked the two links that launch node emits and they were both clear20:18
fungicool20:18
clarkbthey are spamhaus queries by ip20:18
fungicisco talos reputation lookups are clean for both the v4 and v6 addresses20:20
opendevreviewMerged opendev/zone-opendev.org master: Add review03 to forward DNS  https://review.opendev.org/c/opendev/zone-opendev.org/+/94671120:22
opendevreviewClark Boylan proposed opendev/system-config master: Add new Noble review03 to the inventory  https://review.opendev.org/c/opendev/system-config/+/94663720:22
clarkbI'm going to WIP ^ for now20:23
clarkbthat change needs much more careful review20:23
fungisure20:23
fungias for the manage-projects question in the etherpad, i had some followup questions20:23
fungiyeah, so given your confirmation of my suspicion, i'm in agreement on your preferred solution20:24
clarkbfungi: yup just responded on the etherpad. Since I wrote that original set of questions and ideas down I discovered the review-staging group which 946637 puts review03 into20:25
clarkbits there to solve this exact problem20:25
clarkbmnaser: frickler fyi https://github.com/openstack/kolla-ansible/pull/5220:26
mnaseroh, didnt we have something that autoclosed it?20:26
clarkbmnaser: oh sorry that was meant for mnasiadka 20:26
clarkbmnaser: we do, but the autocloser doesn't bring it to the attention of project memebrs unless they are subscribed to github events. I strongly encourage project membership subscribe to github events if they replicate to github20:27
clarkbmnaser: I did have a question for you though. What is the best way to request reverse PTR records for the new review03 gerrit server20:27
clarkbmnaser: you can see the new A and AAAA forward records in https://review.opendev.org/c/opendev/zone-opendev.org/+/946711/1/zones/opendev.org/zone.db20:28
fungior https://opendev.org/opendev/zone-opendev.org/src/branch/master/zones/opendev.org/zone.db#L690-L691 now as well20:29
clarkbfungi: did you want to review https://review.opendev.org/c/opendev/system-config/+/946709 that is what is currently in place on bridge due to my manual edit and it seemed to work21:05
fungilgtm, thanks21:08
opendevreviewClark Boylan proposed opendev/system-config master: Have launch_node delete volumes more properly  https://review.opendev.org/c/opendev/system-config/+/94671521:10
clarkbI'm going to wip ^ because that is completely untested and potentially destructive21:10
clarkbinfra-root no meeting today but if I can ask everyone to look at https://review.opendev.org/c/opendev/system-config/+/946637 and check https://etherpad.opendev.org/p/i_vt63v18c3RKX2VyCs3 and either give the go ahead to land that or suggestions to make it safer that would be great. Then hopefully we land that in the next few days and start with initial data sync21:15
clarkbfungi: should we upgrade gitea? Looks like you +2'd https://review.opendev.org/c/opendev/system-config/+/94664021:16
fungiyeah, i'm around for a while still if you're ready21:18
opendevreviewAurelio Jargas proposed zuul/zuul-jobs master: ensure-python-command: Install venv in Zuul-scoped path  https://review.opendev.org/c/zuul/zuul-jobs/+/94527721:20
opendevreviewAurelio Jargas proposed zuul/zuul-jobs master: ensure-python-command: Optimize pip install  https://review.opendev.org/c/zuul/zuul-jobs/+/94527821:26
opendevreviewAurelio Jargas proposed zuul/zuul-jobs master: ensure-nox: Add support for global symlink  https://review.opendev.org/c/zuul/zuul-jobs/+/94527621:26
clarkbfungi: yup I'm ready21:27
opendevreviewMerged opendev/system-config master: Create swap after dealing with data volumes  https://review.opendev.org/c/opendev/system-config/+/94670921:27
fungiokay, approved it just now21:28
clarkbthanks21:28
mnasiadkaclarkb: Any guidance what should we do there? Usually there was a bot (I think) that autoclosed such PRs and direct the user to push it via Gerrit, right?21:42
clarkbmnasiadka: there is still a bot and it shoudl get auto closed (I think it runs when you merge something and maybe daily?)21:47
opendevreviewAurelio Jargas proposed zuul/zuul-jobs master: Fix deprecated cleanup-run  https://review.opendev.org/c/zuul/zuul-jobs/+/94671821:47
clarkbbut I strongly encourage anyone whose projects are mirrored to github subscribe ti github notifications so that you can help steer people in the right direction and/or pull in fixes that would otherwise die on the vine in github21:47
fungi946640 hit a post_failure on system-config-upload-image-gitea21:49
fungihttps://zuul.opendev.org/t/openstack/build/dcb3518e599a43bf8affc9d1717a573a21:49
fungiproblem with docker.io/opendevorg/assets?21:50
fungii guess we haven't moved that to quay yet21:51
clarkbwe haven't. This problem looks like the one where we expect a 404 from the buildset registry because that image isn't part of the buildset. This is supposed to cause docker to look at docker hub next. We theorize that if the docker lookup hits rate limits the docker client reports the first error it hit which was the 404 and not the last error or all errors21:52
clarkbI think it should be safe to recheck though that probably means it won't merge until after 5 pm local21:52
clarkbbasically it tried to get the assets image from the buildset registry and didn't find it there (expected) it then tried to get it from docker hub and failed then reported the expected error rather than the unexpected one21:53
fungii directly reenqueued it to the gate to save some time21:57
fungii'll try to be around still as it deploys21:57
clarkbwfm I should be around since it went straight to the gate22:03
clarkbjamesdenton: any update on that low bw over private network instance? No rush just checking in to see if possibly we can delete the node at this point22:24
fungiugh, it failed again22:32
fungihttps://zuul.opendev.org/t/openstack/build/4f898b80134f4d2996bae236e0ae7bf122:32
fungithis time on docker.io/library/debian:bookworm-slim22:33
fungimaybe we should try again tomorrow and see if the dockerhub gods smile more favorably on us?22:33
clarkbworks for me22:44
clarkbit is now on my todo list to upgrade gitea tomorrow22:44
fungithanks22:45
opendevreviewJames E. Blair proposed opendev/system-config master: Add setuptools to python-builder assemble  https://review.opendev.org/c/opendev/system-config/+/94672223:16

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!