Tuesday, 2020-06-16

*** hamalq has quit IRC00:58
*** hamalq has joined #opendev-meeting01:14
*** hamalq has quit IRC01:31
*** hamalq has joined #opendev-meeting01:37
*** hamalq has quit IRC02:32
*** hamalq has joined #opendev-meeting02:57
*** hamalq has quit IRC06:10
*** hamalq has joined #opendev-meeting06:12
*** hamalq has quit IRC06:59
*** diablo_rojo has quit IRC09:54
jentoio#startmeeting OPNFV TSC12:59
openstackMeeting started Tue Jun 16 12:59:52 2020 UTC and is due to finish in 60 minutes.  The chair is jentoio. Information about MeetBot at http://wiki.debian.org/MeetBot.12:59
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.12:59
*** openstack changes topic to " (Meeting topic: OPNFV TSC)"12:59
openstackThe meeting name has been set to 'opnfv_tsc'12:59
AJaegerjentoio: you're in the wrong channel!13:00
jentoioyup, tsorry ;(13:00
AJaegerplease issue an #endmeeting here13:00
jentoio#endmeeting13:01
*** openstack changes topic to "Incident management and meetings for the OpenDev sysadmins; normal discussions are in #opendev"13:01
openstackMeeting ended Tue Jun 16 13:01:01 2020 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)13:01
openstackMinutes:        http://eavesdrop.openstack.org/meetings/opnfv_tsc/2020/opnfv_tsc.2020-06-16-12.59.html13:01
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/opnfv_tsc/2020/opnfv_tsc.2020-06-16-12.59.txt13:01
openstackLog:            http://eavesdrop.openstack.org/meetings/opnfv_tsc/2020/opnfv_tsc.2020-06-16-12.59.log.html13:01
*** mnaser has quit IRC14:14
*** zbr_ has joined #opendev-meeting14:15
*** zbr has quit IRC14:16
*** zbr_ is now known as zbr14:16
*** mnaser has joined #opendev-meeting14:17
*** hamalq has joined #opendev-meeting16:06
*** diablo_rojo has joined #opendev-meeting16:27
-openstackstatus- NOTICE: Zuul is being restarted for an urgent configuration change and may be offline for 15-30 minutes. Patches uploaded or approved during that time will need to be rechecked.16:37
*** hamalq_ has joined #opendev-meeting16:56
*** hamalq has quit IRC16:59
-openstackstatus- NOTICE: Zuul is back online; changes uploaded or approved between 16:40 and 18:00 will need to be rechecked.18:06
*** hamalq_ has quit IRC18:17
*** hamalq has joined #opendev-meeting18:18
clarkbAnyone else here for the meeting? we'll get started in a couple minuts18:59
diablo_rojoo/18:59
ianwo/19:00
clarkb#startmeeting infra19:01
openstackMeeting started Tue Jun 16 19:01:06 2020 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
*** openstack changes topic to " (Meeting topic: infra)"19:01
openstackThe meeting name has been set to 'infra'19:01
clarkb#link http://lists.opendev.org/pipermail/service-discuss/2020-June/000039.html Our Agenda19:01
clarkb#topic Announcements19:01
*** openstack changes topic to "Announcements (Meeting topic: infra)"19:01
clarkbI didn't have any announcements19:01
clarkb#topic Actions from last meeting19:02
*** openstack changes topic to "Actions from last meeting (Meeting topic: infra)"19:02
clarkb#link http://eavesdrop.openstack.org/meetings/infra/2020/infra.2020-06-09-19.01.txt minutes from last meeting19:02
clarkbno actions recorded, but it is feeling like things are returning to normal after the PTG. Oddly ti seemed like we still had the quiet week last week even though people didn't need to travel19:02
clarkb(maybe that was just my perception)19:02
clarkb#topic Specs approval19:02
*** openstack changes topic to "Specs approval (Meeting topic: infra)"19:02
clarkb#link https://review.opendev.org/#/c/731838/ Authentication broker service19:02
clarkbThis isn't ready for approval yet, but wanted to keep pointing eyeballs towards it19:03
mordredo/19:03
clarkbfungi: ^ anything else to say about this spec?19:03
fungii haven't updated it yet, had a "quiet week" ;)19:03
fungimore comments appreciated though19:03
corvuso/19:03
* mordred looks forward to fungi's updates19:03
clarkb#topic Priority Efforts19:04
*** openstack changes topic to "Priority Efforts (Meeting topic: infra)"19:04
clarkb#topic Update Config Management19:05
*** openstack changes topic to "Update Config Management (Meeting topic: infra)"19:05
clarkbI'm not aware of a ton of changes here recently. Anyone have topics to bring up?19:05
mordredcorvus improved the disable-ansible script19:06
clarkb(I'm moving somewhat quickly beacuse our general topics list is pretty long this week and want to be sure we get there but feel free to bring stuff up under priority topics if relevant)19:06
mordrednope. /me shuts up19:06
fungii've got a half-baked change to move the rest of our repo mirrornig configuration from puppet to ansible19:06
clarkbmordred: thats a good call out. We should get into the habit of providing detailed reasons for disabling ansible there19:06
mordred\o/19:06
clarkbfungi: is that ready for review yet?19:07
fungiclarkb: it's ready for suggestions, but no it's not ready to merge yet19:07
fungiit's a lot of me learning some ansible and jinja concepts for the first time19:07
clarkb#link https://review.opendev.org/#/c/735406/ Ansiblify reprepro configs. Is WIP comments welcome19:07
fungiit will be a pretty massive diff19:08
fungi(once complete)19:08
clarkbit should be a pretty safe transition too as we can avoid releasing volumes until we are happy with the end results?19:09
clarkbthanks for working on that19:09
fungiyep, once i rework it following your earlier suggestion19:09
clarkb#topic OpenDev19:10
*** openstack changes topic to "OpenDev (Meeting topic: infra)"19:10
clarkbI don't have much to add here. I've completely failed at sending reminder emails about the advisory board but mnaser has responded. Thank you!19:10
clarkbI'll really try to get to that this week19:10
mnaser\o/19:10
* mordred hands mnaser an orange19:12
clarkbAnything else to add re OpenDev?19:12
mordredoh -19:13
mordredthis is really minor - but I snagged the opendev freenode nick yesterday and put it in our secrets file (thanks for the suggestion mnaser)19:13
mordredin case we want to use it for opendev-branded bots19:14
clarkbthanks19:14
clarkb#topic General Topics19:15
*** openstack changes topic to "General Topics (Meeting topic: infra)"19:15
clarkb#topic pip-and-virtualenv19:15
*** openstack changes topic to "pip-and-virtualenv (Meeting topic: infra)"19:15
clarkbThis change has landed and we're starting to see more and more fallout from it. Nothing unexpected yet I don't think19:16
clarkbpossibly even a new case of it in #zuul right now  too :)19:16
clarkbKeep an eye out for problems and thank you to AJaeger and mordred for jumping on fixes19:16
clarkbianw: where are we with considering the spec completed? and cleanup in the nodepool configs? can we start on that or should we wait a bit logner?19:17
ianwi have pushed changes to do cleanup of the old jobs and images19:17
ianwi guess it's not too high a priority right now, the changes are there and i'll keep on them as appropriate19:18
clarkbthanks. Should I push up a change to mark the spec compelted? or wait a bit more on cleanup for that?19:19
ianwi guess we can mark it complete, if we consider dealing with the fallout complete :)19:20
ianwi thought it was too quiet yesterday, so i'll try to catch up on anything i've missed19:20
clarkbianw: I think a lot of people may have had friday and monday off or something19:21
clarkbbecause ya definitely seems to be picking up now19:21
corvus40% of vacation days are fridays or mondays19:21
clarkb#topic Zookeeper TLS19:23
*** openstack changes topic to "Zookeeper TLS (Meeting topic: infra)"19:23
clarkbThis is the thing that led to the ansible limbo19:23
clarkbcorvus: want to walk us through this?19:23
corvusthis topic was *supposed* to be about scheduling some downtime for friday to switch to zk tls19:23
corvusbut as it turns out, this morning we switched to tls and switched back already19:23
corvusthe short version is that yesterday our self-signed gearman certs expired19:24
corvuswell, technically just the ca cert19:24
corvuswhich means that no zuul component could connect to gearman.  so we lost the use of the zuul cli, and if any component were restarted for any reason, it would be unable to connect, so the system would decay19:25
corvuscorrecting that required a full restart, as did the zk tls work, so we decided to combine them19:25
corvusunfortunately, shortly after starting the nodepool launchers, we ran into a bug19:25
corvus#link kazoo bug https://github.com/python-zk/kazoo/issues/58719:25
corvusso we manually reverted the tls change (leaving the new gear certs in place)19:26
corvusand everything is running again.19:26
corvusnext steps: make sure this is merged:19:26
corvus#link revert zk tls https://review.opendev.org/73599019:26
corvusthen when it is, we can clear the disable-ansible file and resume speed19:27
corvusafter that, i'm going to look into running zk in a mode where it can accept tls and plain connections19:27
clarkband once disable ansible is cleared we'll get updates to a few docker images, apply dns zone backups, and change rsync flags for centos/fedora mirrors19:27
corvusif that's possible, i'd like to restart the zk cluster with that, and then try to repro the bug against production19:27
clarkbcalling that out so toehrs are aware there will be a few changes landing once ansible is reenabled19:27
clarkbcorvus: ++ I like that plan19:28
mordred++19:28
corvusbased on info from tobiash, we suspect it may have to do with response size, so it may help to get a repro case out of production data19:28
clarkbcorvus: we should be able to easily switch over a single builder or launcher without major impact to production to help sort out what is going on19:28
corvusclarkb: agreed19:28
corvuseot19:29
clarkb#topic DNS Cleanup19:29
*** openstack changes topic to "DNS Cleanup (Meeting topic: infra)"19:29
clarkbThe change to implement the recording of zone data has landed and should apply to bridge when ansibel starts rerunning19:30
clarkbianw: ^ fyi19:30
clarkbprobably want to make sure that is working properly once it goes in?19:30
ianwyeah, i added the credentials so should be ok19:30
clarkbI annotated the etherpad with notes on things I thinkwe can cleanup19:31
clarkbwhat are we thinking about for cleanup? wait for backups to run with existing records first so we've got that info recorded then do cleanup?19:31
clarkb(that is sort of what I thought would be a good process)19:31
mordredyeah19:31
clarkbk I can help with the button clicking to clean up records once we're at that point19:32
clarkbianw: anything else worth mentioning on this topic?19:32
ianwyeah, we can iterate on it a bit then too, as the list gets shorter it's easier to see what can go :)19:32
ianwnope, thanks19:32
clarkbsounds good, thanks for putting this together19:32
clarkb#topic Etherpad Upgrade to 1.8.4 or 1.8.519:32
*** openstack changes topic to "Etherpad Upgrade to 1.8.4 or 1.8.5 (Meeting topic: infra)"19:32
clarkbFungi did some work to get our etherpad server upgraded to 1.8.4 (from 1.8.0)19:33
clarkbwe then noticed that there was a UI rendering bug when testing that19:33
clarkb#link https://review.opendev.org/#/c/729029/ Upgrade Etherpad19:33
funginot dissimilar from some of the weirdness we noticed with author colors when going through meetpad19:33
clarkbthis change now addresses that with a simple css hack that I came up with. Upstream they've fixed this differently with a fairly large css refactor and we should see that in the next release (1.8.5?(19:33
fungii wonder if that will also resolve it for meetpad uses19:34
clarkbthe question I've got is do we think we should upgrade with the workaround as 1.8.4 includes a bug fix around db writes? or wait for 1.8.5 to avoid potential UI weirdness19:34
fungii'm in no huge hurry. i'm excited for the potential fix for perpetually "loading..." pads, but other than that there's no urgency19:35
mordredthere's no urgency, but the workaround isn't super onerous either19:35
mordredso I'm ok rolling it out in that form - or with waiting19:35
mordredwe can remove the sed from our dockerfile when we bump the version19:35
mordred(it's not like one of those "in a few months we're not going to be paying attention and our local hack is going to bork us)19:36
corvusclarkb: what's the workaround?19:36
corvusoh i see it now19:36
corvussorry, buried in the run cmd19:36
clarkbcorvus: changing the padding between the spans that contain lines19:36
clarkbthe way the padding was set up before caused the lines to overlap so their colors successively covered each other19:37
corvusi agree it seems safe to move forward with 02919:37
fungiyou tracked that back to a particular change between 1.8.1 and 1.8.3 yeah?19:37
clarkbthe bug is also purely cosmetic so shouldn't affect content directly, just how we see it19:38
clarkbfungi: ya its in 1.8.3 (there was no 1.8.1 or 1.8.2 iirc)19:38
fungiyeah, there was at least no 1.8.2, for sure19:39
clarkbI think what I'm taking away from this is if everything else calms down (uwsgi, pip/virtualenv, dns, zk tls, etc) then we can go ahead with this and watch it19:39
fungisounds fine to me19:39
clarkbthanks for the feedback19:39
clarkband if 1.8.5 happens before then we can drop the workaround19:40
clarkb#topic Getting more stuff off of python219:40
*** openstack changes topic to "Getting more stuff off of python2 (Meeting topic: infra)"19:40
clarkbOne of the things that came out of the PTG was it would be useful for those a bit more familiar with our systems to do an audit of where we stand with python2 usage. This way others can dive in and port or switch runtimes19:41
clarkb#link https://etherpad.opendev.org/p/opendev-tools-still-running-python219:41
clarkbI've started this audit in that etherpad. It is nowhere near complete. One thing that I have discovered is that a lot of our software is python3 capable but running under python2. We'll want to keep that in mind as we update configuration management that a good next step is to switch the runtime too19:42
clarkbI have also found a couple cases that are definitely python2 only right now. Our use of supybot for meetbot and the logstash data pipeline. For meetbot we have a spec to replace it already which is noted on the etherpad19:42
clarkbif you can think of other tools we need to be checking on feel free to add them to the list and I can dig in further as I have time19:42
clarkbThe goal here isn't really to fix everything as much as to be aware of what needs fixing19:43
fungias we move services and automation to platforms without python 2.7, we can fix things where needed19:45
fungiif they don't become urgent before that19:45
clarkbyup and it gives people a list of things they can pick off over time if they want to help out19:45
mordredyeah - most of the things are easy enough to work on - but are pretty opaque that they need to be worked on19:45
mordredwhat clarkb said19:45
fungiuseful to know where we expect the pain points for such moves to be though19:45
clarkb#topic Trusty Upgrades19:46
*** openstack changes topic to "Trusty Upgrades (Meeting topic: infra)"19:46
clarkbI don't have much to add on this topic but did want to point out it seems that osf's interop working group is picking up some steam. I'm hoping that may translate into some better interest/support for refstack19:47
clarkband we can maybe channel that into a refstack server upgrade. The docker work I did is actually really close. Its mostly a matter of having someone test it now (which I'm hoping the interop wg can do)19:47
clarkb#topic Open Discussion19:48
*** openstack changes topic to "Open Discussion (Meeting topic: infra)"19:48
clarkbAnything else to bring up before we end the meeting?19:49
fungiit's just come to light in #opendev that the zuul restart switched our ansible default from 2.8 to 2.9, so expect possible additional behavior changes19:51
clarkbfungi: have we confirmed that (I theorized it and wouldn't be surprised)19:51
fungiianw: compared logs which showed passing results used 2.8 and the failure observed used 2.919:51
corvusyes, latest zuul master uses 2.9 by default19:53
corvuswe can pin the openstack tenant to 2.8 if we need to19:53
clarkbso far its only popped up as a single issue which has a fix19:53
clarkbI guess if it gets worse we can pin19:53
fungii don't know yet if that's warranted, may be the issues are small enough we can just fix them19:53
corvusthough istr we did some testing around this and didn't see a lot of issues19:53
fungiapparently match as a filter is one19:54
clarkbcorvus: ya we tested with a lot of general playbooks in zuul-jobs19:54
corvusso yeah, i think it's probably best to roll forward19:54
frickleralso devstack is broken by a new uwsgi release and still needs fixes19:54
fricklerand neutron-grenade-multinode seems to suffer from the venv removal19:54
clarkbfrickler: ya I meant to look into why multinode was sad about that19:55
clarkbthe last day and a half have been distracting :) After lunch today I'll try to be useful for all the things19:56
clarkbsounds like that may be it for our meeting19:58
clarkbthank you everyone!19:58
clarkb#endmeeting19:58
*** openstack changes topic to "Incident management and meetings for the OpenDev sysadmins; normal discussions are in #opendev"19:58
openstackMeeting ended Tue Jun 16 19:58:21 2020 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:58
openstackMinutes:        http://eavesdrop.openstack.org/meetings/infra/2020/infra.2020-06-16-19.01.html19:58
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/infra/2020/infra.2020-06-16-19.01.txt19:58
openstackLog:            http://eavesdrop.openstack.org/meetings/infra/2020/infra.2020-06-16-19.01.log.html19:58
fungithanks clarkb!19:58
*** yoctozepto has quit IRC22:33
*** corvus has quit IRC22:33
*** tristanC has quit IRC22:33
*** SotK has quit IRC22:33
*** tobiash has quit IRC22:33
*** mnaser has quit IRC22:33
*** hamalq has quit IRC22:33
*** AJaeger has quit IRC22:33
*** ChanServ has quit IRC22:33
*** ianw has quit IRC22:33
*** frickler has quit IRC22:33
*** zbr has quit IRC22:33
*** mordred has quit IRC22:33
*** fungi has quit IRC22:33
*** diablo_rojo has quit IRC22:33
*** clarkb has quit IRC22:33
*** tristanC has joined #opendev-meeting22:37
*** AJaeger has joined #opendev-meeting22:37
*** fungi has joined #opendev-meeting22:37
*** mordred has joined #opendev-meeting22:37
*** zbr has joined #opendev-meeting22:37
*** hamalq has joined #opendev-meeting22:37
*** tobiash has joined #opendev-meeting22:37
*** SotK has joined #opendev-meeting22:37
*** ChanServ has joined #opendev-meeting22:37
*** corvus has joined #opendev-meeting22:37
*** tepper.freenode.net sets mode: +o ChanServ22:37
*** yoctozepto has joined #opendev-meeting22:37
*** ianw has joined #opendev-meeting22:38
*** frickler has joined #opendev-meeting22:38
*** mnaser has joined #opendev-meeting22:39
*** clarkb has joined #opendev-meeting22:39
*** mnaser has quit IRC22:41
*** mnaser has joined #opendev-meeting22:48
*** diablo_rojo has joined #opendev-meeting22:52

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!