Tuesday, 2021-07-06

clarkbAnyone else here for the infra meeting?19:00
diablo_rojoo/19:00
ianwo/19:00
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue Jul  6 19:01:12 2021 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
clarkb#link http://lists.opendev.org/pipermail/service-discuss/2021-July/000264.html Our Agenda19:01
clarkbThe agenda went out a bit late due to yesterday's holiday observance but we do have an agenda :)19:01
clarkb#topic Announcements19:01
clarkbJuly 18 the Gerrit server will be upgraded. Update your firewall rules now if you need to do that (details are on the service-discuss mailing list)19:02
clarkbI've also got this info going out on the foundation newsletter this week to try and spread the word19:02
clarkb#topic Actions from last meeting19:02
clarkb#link http://eavesdrop.openstack.org/meetings/infra/2021/infra.2021-06-29-19.01.txt minutes from last meeting19:02
clarkbI had an action to talk to the openstack tc about the next steps for the ELK stack. I have done this and been asked to bring it up at the TC meeting on Thursday. I updated their agenda and plan to be there to discuss the subject19:03
clarkbIt sounds like the board has asked them for timelines and a few details which I think we can help with too19:03
clarkb#action someone write spec to replace Cacti with Prometheus19:03
clarkbI don't think this has happened yet. But it's possible I may have time for that this week looking at my current todo list. We'll see19:04
fungisomeone is falling down on the job19:04
fungiit's times like this i'm glad to be noone19:04
clarkbcareful we can make you a somebody19:04
fungioof19:05
clarkb#topic Topics19:05
clarkb#topic Gerrit Account Cleanup19:05
clarkbThis morning I retired 176 accounts that we identified as unused or unlikely to be used19:05
fungiyes!19:06
* fungi throws a quick party19:06
clarkbThis was based on account activity, age, and the situation the conflicting account is in. In many cases we could see the accounts haven't been used in almost a decade or one account was used then another took over19:06
clarkbThat should leave us with about 80 accounts where the situation is more complicated and we'll try to reach out to users for those.19:06
clarkbThe next step for these 176 is to wait 2 or 3 weeks then once we've given it time for people to raise any alarms we can remove the conflicting external ids from the retired accounts19:07
fungias usual, if folks complain their account has broken, we should probably start by grepping the id from the logs you saved, yeah?19:07
clarkbI'd like to start reaching out to individuals in the ~80 remaining while we wait on that too19:07
clarkbfungi: yes exactly. Then you should be able to revert the commit I pushed to refs/users/xy/abxy to set them back to the way they were19:07
fungiawesome. thanks for confirming19:08
clarkbI'm 99% sure I tested this and it is only the external ids refs where gerrit will reject the changes. reverts to the refs/users/* refs are fine19:08
clarkbI did have a small network hiccup when retiring users so I had to rerun my retirement script for one user. That is the only oddity in the logs19:08
clarkbbut the revert process doesn't change for that user19:09
clarkb#topic Review Upgrade19:10
clarkbThis has been announced for the end of day July 18 UTC time19:10
clarkbas mentioned before I've got this info going out on the foundation newsletter to help advertise it.19:10
clarkbianw: are there changes we should be reviewing now (like the SSHFP cleanups?)19:10
fungithis is the point in the meeting where i realize i'm actually away from home visiting extended family when the upgrade is scheduled to happen19:10
fungibut i'll try to be online anyway, their internet connectivity willing19:11
ianwyeah, if i could get eyes on 19:11
clarkbfungi: I expect we'll be fine. ianw and I can be around19:11
clarkbbut the help is appreciated19:11
ianw#link https://etherpad.opendev.org/p/gerrit-upgrade-202119:11
ianwand particularly the two pre-merge things19:11
fungii've left a few notes on there, but will go over it more closely and check out the linked changes19:12
ianwahh, thank you, i will go through comments again today19:12
clarkbya I've gone over it too, but should go over it again and make sure I've reviewed the changes and followed up on any comments to the process19:12
clarkbOne thought I had was that we should maybe land https://review.opendev.org/c/opendev/system-config/+/799225 to update our gerrit image and fix the lp blueprints integration then when we do the upgrade we'll know we have the same happy image19:13
clarkber I mean land that then do a quick restart on prod before the upgrade (like this week?)19:13
fungi#link https://review.opendev.org/799124 Good riddance to track-upstream and its cronjob19:14
fungithat's related, since the new server seems to be opaquely failing to actually run its cronjob19:14
clarkbianw: ^ if you don't think those two changes are a problem for pre merge activities maybe go ahead and add them to the etherpad? Or let me know and I can add them19:14
ianwok, will look19:15
ianwi think maybe the switch of zuul to review01.opendev.org will require a zuul restart?19:15
ianwso it might be a good time to pull in the updated image too19:16
clarkbyes updates to that portion of the zuul config will require a zuul restart19:16
ianwi can do that on my monday when it's quiet19:16
clarkbok19:16
ianwthat gives it a week to bake in19:17
clarkbmight need to coordinate with corvus on zuul restarts as there is a lot of churn on the zuul side and we may have to do a full restart to safely update depending on the changes that land19:17
clarkb(zuulv5 development is full steam ahead)19:17
ianwindeed it is!19:17
clarkbAnything else on this topic?19:18
ianwnot from me19:19
clarkb#topic Draft matrix spec19:19
clarkbA few of us had a call with Element Matrix Services (EMS) last week to discuss the possibility of using their hosted EMS platform19:19
clarkbFrom what I understood they didn't have any problems with us doing the slightly hacky setup to only maintain admins and bot users on our server then have users use matrix.org or their own homeservers19:20
fungii have a feeling they're happy to see open source communities making use of it19:21
clarkbTheir suggestion to us for next steps and getting started is to spin up a trial instance on the lowest tier of their service. Get things set up and start interacting with it. Then if we want to use the slightly more expensive silver version (they think this will be our best choice) we can upgrade to that painlessly19:21
clarkbfungi: yup they were super helpful in talking us through this19:21
clarkbI then talked to jbryce at the foundation about this and he didn't think this would be a problem. I need to coordinate with him to set up the account and configure the payment details. Hoping to do that this week19:22
fungiwhat does silver get us over the entry level?19:22
fungior is silver the lowest tier paid option?19:22
clarkbfungi: Nickel is the lowest option. Silver is second lowest19:22
fungii see you said "trial" so i suppose that's time-limited?19:22
clarkbhttps://element.io/pricing19:22
fungiaww, missed opportunity there was no "dime" to go with "nickel"19:23
clarkbya nickel is free for the first month as a trial then you start paying for it19:23
clarkbmy understanding over why silver would probably be best was simply a matter of scale19:23
clarkbwe would probably be ok with nickel for zuul but as we grow that can change19:23
clarkbit is something to figure out as we go and interact with it19:24
clarkbThey also noted that if we don't want our instance hosted in sweden you have to pay for Gold or higher but we don't think that is a problem19:24
corvusoh hai19:24
clarkbOn our end the spec got updated to reflect the plan to try EMS19:24
clarkb#link https://review.opendev.org/79615619:24
fungii like sweden19:24
clarkbcorvus: hello, I was just recounting what we learned from our call with EMS19:24
clarkband noted that I talked to jbryce about it and he seemed happy with it. Now I need to coordinate with him to get an account created with appropriate payment details19:25
ianwwhen it says "active user" that means people with a @user:opendev.org address?19:25
corvuslgtm19:25
corvusianw: yes, so basically, our bots/admin accounts19:25
clarkbianw: yes, and you can actually have a bunch of inactive versions of that in the system. They differentiate between actually active and you can log in and cold storage19:25
clarkb(not sure the active vs inactive designation will end up being useful for us but it means you can create accounts pretty safely and then just be careful about what you activate)19:26
fungii guess if we needed occasional-use admin accounts that could come up19:26
corvusclarkb: i think i have 2 questions: 1) how to proceed with spec approval?  2) do you want to wait for that before setting up the account/server, or go ahead and get started on that and we can start working on bots, etc?19:27
clarkbcorvus: I'd like to go ahead and get started with the account/server creation since we may learn something important doing that.19:27
corvusfungi: yeah, i sort of feel like a single admin account that's either used by a bot or one of us manually when required will probably be fine for things like setting up rooms, etc.19:27
fungiagreed, from what little i know so far19:28
clarkbcorvus: but then once the server is up and spec updated to accommodate any new info and no major issues pop up I think we can land the spec?19:28
corvusclarkb: cool; everything about this can be reversed easily right up until we ask people in #zuul to move, so we have a lot of leeway if we're okay being casual about it.19:28
clarkbcorvus: maybe give the trial a couple of weeks and then land the spec?19:28
clarkbcorvus: good to know19:28
clarkbalso mordred moved homeservers recently and that seems to have gone reasonably well19:28
corvusyeah, mordred also did a test of moving a room from his old homeserver to the new; even that worked without a hitch19:29
mordredI moved a channel I'd created ... yeah that ^^19:29
mordrednow - moving homeservers was a bit more involved and didn't really transition state19:29
clarkbI think I'm mostly worried about finding something about EMS that is a deal breaker for us and deciding we need to run it ourselves which will have a big impact on the spec19:29
clarkbbut once we are past that step I think we should land the spec19:29
clarkbs/step/concern/19:29
fungii keep meaning to set up one since i have a private inspircd with some semi-used channels i'd like to add a bridge for eventually, and could have a vanity username that way too19:29
mordredfrom a user account pov - basically I had @mordred:waterwanders.com and I created @mordred:inaugust.com then invited @mordred:inaugust.com to anything that @mordred:waterwanders.com was in19:30
mordredbut moving the room from waterwanders homeserver to inaugust homeserver went amazing19:30
clarkbIt does seem like the plan is congealing which is nice. I'll have to review the spec properly to indicate that19:30
mordredI believe if we have a homeserver with EMS and we decide at a point in the future we need to run it ourselves they can work with us to export the data and do an actual move19:31
corvusi was literally like "what's mordred talking about, this room is on inaugust" oh yeah, it didn't use to be.  ;)19:31
fungii guess the irc bridges work by emulating an irc server and networking with existing servers on the same irc network?19:31
mordredthat seemed like a lot to ask them for just my little homeserver, so I didn't do it :)19:31
corvusfungi: i'm not 100% sure; but https://github.com/matrix-org/matrix-appservice-irc is apparently the software19:32
clarkbfungi: I suspect that the integration is less coupled than that. They probably get connection limit exceptions for their bridge in the network then just emulate being a bunch of clients19:32
fungiahh, thanks! i'll give that a thorough look19:32
clarkbAlright anything else to talk about on this subject?19:33
corvusoh 1 thing19:33
corvusfeel free to sign up to make a replacement irc bot (see the spec)19:33
corvuseavesdrop/statusbot are available (tristanC wrote a gerritbot)19:34
corvusmeetbot too, but is not needed for zuul19:34
corvusclarkb: otherwise, next steps seem clear to me19:34
clarkband for eavesdrop we may not even need channel logging if we can just grab those directly from the matrix server19:34
fungiis there still any benefit to merging the bots into a single codebase?19:34
clarkbfungi: I'm not sure I know enough to say at this point :)19:35
fungiat a minimum we ought to at least shoot for significant code reuse19:35
corvusclarkb: probably the easiest way to do that though is to have a bot account join a room19:35
clarkbcorvus: ah makes sense since that is the "api"19:35
corvusultimately, i think having something writing an html file to disk is still useful for search engine indexing19:35
corvus(so even if an individual user can use the in-client search feature, if we value the indexing, we should have an eavesdrop bot)19:36
corvus(if we don't value search engine indexing, then, er, maybe we drop that :)19:36
mordredyeah. history is great - but having browsable html archives has been nice so far19:36
ianwit's always handy to be able to link to a prior conversation19:36
mordredyeah. I mean - you can do web links to matrix history - but they take you to a matrix client19:37
clarkboh in that case ++ to having separate html19:37
corvusso yeah, that's a neat feature, but separately the plain html archive is still nice19:37
mordredand that's maybe a bit heavy for wanting to reference in some other context19:37
mordred++19:37
clarkbWe have a few more topics to get to so let's move on19:38
clarkb#topic Gitea01 Backups19:38
corvushttps://matrix.to/#/!eitSLAJcQKeehLruKf:matrix.org/$AB4j0_Z_HOn_pOAWfXdHSI-eHvDE_MMlJD2za1-kspU?via=matrix.org19:38
fungiseems the network connectivity between vexxhost regions is still broken19:39
clarkbThese continue to not be urgent to fix because we haven't done any recent project renames. However, ianw was looking at the issue and we plan to do renames in a few weeks19:39
clarkbfungi: ianw: want to fill us in on what you found?19:39
fungishort story is that some ipv6 addresses in sjc1 can't communicate with some ip addresses in ca-ymq-1 over some protocols19:39
ianwyeah, there was that19:40
clarkbipv4 is fine though?19:40
ianwbut the backup does seem to be daily running.  i guess it's falling back to ipv4?19:40
fungias if flows are being load balanced between routers at layer 4 and at least one router has a stray route with a greedy/short prefix19:40
clarkbianw: well it is emailing us about the failures as recently as yesterday19:40
ianwthe file-system backup.  but then the db part fails19:40
clarkboh got it19:40
clarkbI think well behaved applications are expected to fall back to ipv4 if v6 doesn't work19:41
ianwi found some dump options that looked promising, but i have to admit i got sidetracked on the ipv619:41
clarkbopenssh is probably well behaved in this way19:41
ianwfungi: you double checked the ipv6 thing right?19:42
fungiyeah, if memory serves, the !h is being returned by the first hop in ca-ymq-119:42
fungiwhich leads me to suspect the core routing in that region, but it's all a black box to me. mnaser was looking into it19:43
ianwi'll ping again, i mentioned something at the end of last week19:43
clarkbsounds good.19:43
clarkb#topic Gerrit project renames19:44
clarkb#link https://review.opendev.org/797990 rename playbook updates19:44
clarkbThat is a change we'll need to have in before we do renames. We're still a few weeks away from our week after the gerrit server upgrade so not in a rush but wanted to point it out19:44
clarkbIf we can get that reviewed that would be great19:44
clarkbAnd if you know of any additional renames that should be considered now is the time to get them on the list19:45
fungiall of osf/* should probably be renamed to openinfra/* but that's not urgent and i still need to talk to folks at the foundation about it19:45
clarkbfungi: ok19:45
clarkbI'll mention the rename scheduling to the TC on Thursday too19:45
fungigood call19:46
clarkb#topic Should we change our meeting time?19:46
clarkbIt occurred to me that this meeting time is not very good for ianw (at least I don't think it is) and ianw is one of the primary participants.19:47
clarkbfrickler mentions that they lurk the meetings to follow along even though they don't actively participate19:47
clarkbI wanted to put it out there that I'm ok with trying to find a time that works better for others if that would be helpful19:47
fungii'm happy to do whatever time others would prefer19:47
* fungi has no life19:47
fungithough also i expect to miss the next meeting, possibly the next two19:48
clarkbfrickler will miss the next three as well and mentioned that may be a good time to experiment with non EU timezone friendly meetings19:49
ianwi don't mind, in (my) summer the meeting moves 6-7am which is perfectly fine.  7-8am usually involves family, and at the tail end of 8-9 i have to do school run19:49
ianwso from my POV earlier is better19:50
clarkbok I wanted to double check. Since we have had changes to who participates and could move things around if it helps19:50
clarkbSounds like the current time slot is fine though and we can keep it as is19:51
clarkbThat was all I had19:51
clarkb#topic Open Discussion19:51
clarkbAnything else before our hour is up?19:51
ianwif i could get a couple of eyes on 19:52
ianw#link https://review.opendev.org/c/opendev/system-config/+/79840019:52
ianwthat adds paste to ansible19:52
clarkb++ I'll add that to my afternoon todo list.19:52
ianwwhat i'm really interested in is the mariadb container behind that, same bits as for gerrit19:52
ianwwould be nice to have a bit more experience with it19:53
clarkbLast call :) Otherwise I'll let everyone go find breakfast/lunch/dinner19:54
clarkb#endmeeting19:55
opendevmeetMeeting ended Tue Jul  6 19:55:12 2021 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:55
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2021/infra.2021-07-06-19.01.html19:55
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2021/infra.2021-07-06-19.01.txt19:55
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2021/infra.2021-07-06-19.01.log.html19:55
clarkbThank you everyone!19:55
clarkbwe'll see you here next week19:55
clarkbAnd as always feel free to reach out in #opendev or service-discuss@lists.opendev.org anytime to bring discussion up. We aren't beholden to this hour19:55
fungithanks clarkb!19:57
