Tuesday, 2019-11-26

*** jamesmcarthur has joined #openstack-infra00:03
*** jamesmcarthur has quit IRC00:08
*** mattw4 has quit IRC00:12
*** goldyfruit_ has quit IRC00:20
ianwhrmm, i'd sort of glossed over the openstacksdk part of generating the nodepool images00:20
ianwwe want to maintain testing against master of that too00:21
ianwwhich means multiple inheritance (nodepool depends on dib & openstacksdk)00:21
ianwthat turns things into a different problem than just names00:21
ianwfungi / clarkb: on the release, i thinkhttps://review.opendev.org/#/c/695981/ can work to do all rsync releases via ssh00:22
ianwwe could have a system-config dockerfile that builds nodepool+dib+openstacksdk all zuul checkouts.  however, i can't see how to make it so that for production it only installs from releases00:25
clarkbwe might need two different dockerfiles00:25
clarkbI think logic like this is why projects like kolla generate dockerfiles with a templating language?00:26
clarkbI don't know that we want to go that far though00:26
ianwit just feels very annoying to have different dockerfiles creating testing images and production images00:27
ianwas we know, any slight chance of something being different will be, and result in broken production00:27
clarkbyeah00:29
clarkbmgoddard may have input?00:29
ianwi guess i should get something up to build the "speculative" image and we can work from there00:30
clarkbit is possible that if we are testing the speculative images we just don't care about releases in production anymore00:30
clarkbsince the images are known to work via testing00:30
ianwyeah, maybe i'm too old fashioned.  i guess the status quo is that master nodepool changes are put on the hosts (but don't apply unless manual reload) and dib & openstacksdk are pushed by puppet when new releases come00:33
ianwwe would not want to restart all nodepool-launchers on new containers00:34
ianwi guess that means k8s or something managing them00:34
ianw(restart all at once, and kill the world i mean)00:35
*** jamesmcarthur has joined #openstack-infra00:37
clarkbianw: currently we use docker-compose and ansible to manage that for gitea00:40
clarkbwe serialize the gitea updates so that we only ever stop one at a time allowing the other 7 to carry load. We may want to do similar with nodepool though it is less of a concern there00:41
*** jamesmcarthur has quit IRC00:42
*** sshnaidm has quit IRC00:45
openstackgerritIan Wienand proposed opendev/system-config master: [dnm] nodepool-builder image with nodepool/openstacksdk/dib from master  https://review.opendev.org/69600000:55
ianwoooh, 696000, nice round number00:56
*** sshnaidm has joined #openstack-infra01:00
*** Goneri has quit IRC01:00
openstackgerritKendall Nelson proposed openstack/cookiecutter master: Update CONTRIBUTING.rst template  https://review.opendev.org/69600101:04
*** goldyfruit_ has joined #openstack-infra01:05
openstackgerritClark Boylan proposed zuul/zuul master: Improve functionality and docs around ansible installation  https://review.opendev.org/67540301:06
openstackgerritIan Wienand proposed opendev/system-config master: [dnm] nodepool-builder image with nodepool/openstacksdk/dib from master  https://review.opendev.org/69600001:09
*** ociuhandu has joined #openstack-infra01:12
*** igordc has quit IRC01:13
*** michael-beaver has quit IRC01:18
johnsomianw Any chance we can get some review on https://review.opendev.org/#/c/695823/ ?01:22
johnsomWe are trying to get some work around patches merged in Octavia, but the best option is to just fix that element.01:22
fungijohnsom: there is also a #openstack-dib channel, in case that helps01:24
johnsomha, yeah, that discussion seems to move back and forth between here and the dib channel.01:24
*** ociuhandu has quit IRC01:28
ianwjohnsom: oh, thanks, was just waiting for ci which was blocked on opensuse01:30
ianwspeaking of, mirror update hsould be done01:31
johnsomYeah, I poked it with a recheck, thanks.01:31
johnsomIt's done enough that the CI passed, lol01:31
*** rlandy|bbl is now known as rlandy01:36
*** jamesmcarthur has joined #openstack-infra01:39
*** ociuhandu has joined #openstack-infra01:40
openstackgerritIan Wienand proposed opendev/system-config master: [dnm] nodepool-builder image with nodepool/openstacksdk/dib from master  https://review.opendev.org/69600001:42
*** jamesmcarthur has quit IRC01:43
*** ociuhandu has quit IRC01:44
*** ricolin has joined #openstack-infra01:46
*** eharney has quit IRC01:49
openstackgerritIan Wienand proposed opendev/system-config master: [dnm] nodepool-builder image with nodepool/openstacksdk/dib from master  https://review.opendev.org/69600001:52
*** armax has quit IRC01:57
openstackgerritMerged opendev/storyboard master: Add support for creating attachments  https://review.opendev.org/63342102:10
openstackgerritMerged opendev/storyboard master: Add attachments API availability to /v1/systeminfo  https://review.opendev.org/64356002:10
*** jamesmcarthur has joined #openstack-infra02:10
openstackgerritMerged openstack/diskimage-builder master: Stop installing pydistutils.cfg  https://review.opendev.org/69582302:12
*** jamesmcarthur has quit IRC02:15
ianwso, you can't copy files outside of the docker context .... but we can't really get the zuul source trees in there without some ugly hacks ... it doesn't feel like it's working out02:22
*** rlandy has quit IRC02:22
*** armax has joined #openstack-infra02:31
clarkbdoes it have to be jn the current dir tree?02:33
ianwclarkb: yes basically in or underneath the "Dockerfile" you're building02:36
*** ociuhandu has joined #openstack-infra02:38
*** EmilienM|PTO is now known as EmilienM02:42
*** ociuhandu has quit IRC02:48
*** jamesmcarthur has joined #openstack-infra02:52
*** jamesmcarthur has quit IRC02:57
*** tonyb has joined #openstack-infra03:00
*** diablo_rojo has quit IRC03:02
*** ricolin has quit IRC03:03
*** apetrich has quit IRC03:09
openstackgerritPaul Belanger proposed zuul/zuul-jobs master: WIP: Print extra debug info  https://review.opendev.org/69601303:21
*** jamesmcarthur has joined #openstack-infra03:29
*** jamesmcarthur has quit IRC03:34
*** ociuhandu has joined #openstack-infra03:43
*** ociuhandu has quit IRC03:47
*** sshnaidm_ has joined #openstack-infra03:53
*** ociuhandu has joined #openstack-infra03:53
*** sshnaidm has quit IRC03:57
*** hongbin has joined #openstack-infra04:00
*** ociuhandu has quit IRC04:01
*** ociuhandu has joined #openstack-infra04:02
*** ociuhandu has quit IRC04:07
*** rh-jelabarre has quit IRC04:07
*** ricolin has joined #openstack-infra04:08
*** udesale has joined #openstack-infra04:14
*** ykarel has joined #openstack-infra04:19
*** sshnaidm__ has joined #openstack-infra04:25
*** sshnaidm_ has quit IRC04:26
*** ricolin has quit IRC04:29
*** jamesmcarthur has joined #openstack-infra04:31
*** Lucas_Gray has joined #openstack-infra04:33
*** jamesmcarthur has quit IRC04:35
*** Lucas_Gray has quit IRC04:39
*** Wryhder has joined #openstack-infra04:39
*** Wryhder is now known as Lucas_Gray04:40
*** Lucas_Gray has quit IRC05:21
*** ociuhandu has joined #openstack-infra05:30
*** ociuhandu has quit IRC05:35
*** tkajinam has quit IRC05:37
*** tkajinam has joined #openstack-infra05:38
openstackgerritMerged zuul/zuul-jobs master: Use RDO trunk repos work for openvswitch on centos8  https://review.opendev.org/69583305:40
openstackgerritMerged zuul/zuul-jobs master: update-test-platforms.py : handle non-voting jobs  https://review.opendev.org/69583005:40
openstackgerritMerged zuul/zuul-jobs master: Make opensuse-15 job voting again  https://review.opendev.org/69583105:41
*** raukadah is now known as chkumar|rover05:41
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Add a Dockerfile and related jobs  https://review.opendev.org/69397105:50
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Add a Dockerfile and related jobs  https://review.opendev.org/69397105:52
*** adam_g has quit IRC05:55
*** adam_g has joined #openstack-infra05:55
*** markmcclain has quit IRC05:56
*** markmcclain has joined #openstack-infra05:57
*** jamesmcarthur has joined #openstack-infra06:07
*** jamesmcarthur has quit IRC06:12
*** hongbin has quit IRC06:18
*** dchen has quit IRC06:18
*** dchen has joined #openstack-infra06:19
ianwhttp://lists.openstack.org/pipermail/openstack-infra/2019-November/006529.html is the write-up I promised for the agenda point in the meeting06:31
*** pcaruana has joined #openstack-infra06:32
*** lmiccini has joined #openstack-infra06:34
*** udesale has quit IRC06:40
*** udesale has joined #openstack-infra06:41
*** jtomasek has joined #openstack-infra06:45
*** slaweq has joined #openstack-infra07:23
*** slaweq has quit IRC07:28
*** apetrich has joined #openstack-infra07:29
*** apetrich has quit IRC07:34
*** udesale has quit IRC07:34
*** udesale has joined #openstack-infra07:35
*** xinranwang has joined #openstack-infra07:55
*** pgaxatte has joined #openstack-infra07:55
*** slaweq has joined #openstack-infra07:57
*** pkopec has joined #openstack-infra08:08
*** tesseract has joined #openstack-infra08:16
*** tkajinam has quit IRC08:19
*** dchen has quit IRC08:24
*** tosky has joined #openstack-infra08:25
AJaegeryeah, openSUSE tumbleweed and 15 are working again. ianw, thanks for the mirror run.08:31
*** jtomasek has quit IRC08:34
*** jtomasek has joined #openstack-infra08:34
*** jpena|off is now known as jpena08:44
*** ralonsoh has joined #openstack-infra08:53
*** lucasagomes has joined #openstack-infra08:57
*** priteau has joined #openstack-infra08:58
*** jtomasek has quit IRC09:04
*** ociuhandu has joined #openstack-infra09:04
*** apetrich has joined #openstack-infra09:04
*** ociuhandu has quit IRC09:08
*** jtomasek has joined #openstack-infra09:09
mgoddardclarkb, ianw: missing a lot of context on what you were discussing, but we use templates for differentiation between distros, source/binary image types, to pass through configuration (e.g. base OS image) and customisation (e.g. install these extra packages)09:12
mgoddardloci has a different approach, they use docker build args, which can be passed via CLI args and accessed within the Dockerfile. That can be good for package customisation etc09:13
mgoddardanother option is to use scripts within the image (ADD/COPY) which are agnostic to the thing that changes (distro, release) and either detect it or have it passed in via environment09:14
ianwmgoddard: thanks; yeah at least for me the problem was that we were building images pulling across 3 different namespaces (zuul, opendev and openstack) and it meant nothing had a clear home09:17
ianws/was/is/09:17
ianwthat's why i think removing our idea that there's one Dockerfile in each project that builds the canoncial image might work ...09:17
ianwit doesn't matter that nodepool exports a zuul/nodepool and a opendev/nodepool and a <blah>/nodepool ... as long as we're fairly clear about what the purpose of the tools in a namespace is09:18
mgoddardianw: yeah, sounds like it needs to be in one place - either the 'top level' project among those or a separate one09:18
ianwyeah, to take advantage of zuul's magic i think it's easier to have multiple dockerfiles in each project (because that way you easily speculatively build images based on zuul's checkout) ... then use the excellent work with the intermediate registries to chain those speulative builds together ...09:20
ianwas they say, may you live in interesting times :)09:21
*** hashar has joined #openstack-infra09:27
openstackgerritSimon Westphahl proposed zuul/zuul master: Add optional support for circular dependencies  https://review.opendev.org/68535409:31
*** ociuhandu has joined #openstack-infra09:37
*** SotK has quit IRC09:37
*** ykarel is now known as ykarel|lunch09:42
*** ociuhandu has quit IRC09:43
*** ociuhandu has joined #openstack-infra09:44
*** SotK has joined #openstack-infra09:46
*** ociuhandu has quit IRC09:48
*** derekh has joined #openstack-infra09:49
*** roman_g has joined #openstack-infra09:53
*** pgaxatte has quit IRC09:57
*** pgaxatte has joined #openstack-infra09:59
*** ociuhandu has joined #openstack-infra10:00
*** electrofelix has joined #openstack-infra10:09
openstackgerritSimon Westphahl proposed zuul/zuul master: Keep task stdout/stderr separate in result object  https://review.opendev.org/65027610:11
*** xinranwang has quit IRC10:15
openstackgerritSimon Westphahl proposed zuul/zuul master: Align template formating for reporters  https://review.opendev.org/64330610:18
*** iurygregory has joined #openstack-infra10:19
*** ykarel|lunch is now known as ykarel10:28
*** apetrich has quit IRC10:49
*** lpetrut has joined #openstack-infra10:52
*** takamatsu has quit IRC10:55
*** udesale has quit IRC10:58
*** dpawlik has quit IRC11:00
*** ociuhandu has quit IRC11:03
*** pgaxatte has quit IRC11:10
*** jaosorior has joined #openstack-infra11:13
openstackgerritMatthieu Huin proposed zuul/zuul master: authentication config: add optional token_expiry  https://review.opendev.org/64240811:15
*** sshnaidm__ is now known as sshnaidm11:15
*** priteau has quit IRC11:19
*** apetrich has joined #openstack-infra11:27
*** apetrich has quit IRC11:27
*** dpawlik has joined #openstack-infra11:27
*** apetrich has joined #openstack-infra11:29
*** dpawlik has quit IRC11:31
*** rcernin has quit IRC11:40
*** ociuhandu has joined #openstack-infra11:41
*** electrofelix has quit IRC11:43
*** electrofelix has joined #openstack-infra11:43
*** ociuhandu has quit IRC11:46
*** dpawlik has joined #openstack-infra11:53
*** dpawlik has quit IRC11:58
*** jklare has quit IRC11:59
*** surpatil has joined #openstack-infra11:59
*** derekh has quit IRC12:01
*** derekh has joined #openstack-infra12:01
*** jklare has joined #openstack-infra12:03
openstackgerritMatthieu Huin proposed zuul/zuul master: Authorization rules: support YAML nested dictionaries  https://review.opendev.org/68479012:05
*** rfolco has joined #openstack-infra12:05
*** lmiccini has quit IRC12:12
*** dtantsur|afk is now known as dtantsur12:20
*** udesale has joined #openstack-infra12:21
*** apetrich has quit IRC12:23
*** kjackal has joined #openstack-infra12:24
*** lmiccini has joined #openstack-infra12:25
*** Lucas_Gray has joined #openstack-infra12:26
*** ccamacho has quit IRC12:40
fricklerinfra-root: I'm seeing a couple at gate failures due to inap mirror failures, no time to dig into it myself currently. sample https://634701e5b6b6ac718321-331251c5023ba17307c332949286c53b.ssl.cf1.rackcdn.com/695695/3/gate/openstack-tox-py35/42b263f/job-output.txt12:45
*** rh-jelabarre has joined #openstack-infra12:49
*** dpawlik has joined #openstack-infra12:51
*** Goneri has joined #openstack-infra12:53
*** dpawlik has quit IRC12:56
*** weshay|ruck is now known as weshay13:00
*** pgaxatte has joined #openstack-infra13:00
*** ccamacho has joined #openstack-infra13:02
openstackgerritTristan Cacqueray proposed zuul/zuul master: zuul_stream: handle module that emit msg as a list  https://review.opendev.org/69608113:05
*** goldyfruit_ has quit IRC13:08
*** rlandy has joined #openstack-infra13:11
*** apetrich has joined #openstack-infra13:14
*** surpatil has quit IRC13:14
*** ociuhandu has joined #openstack-infra13:20
*** liuyulong has joined #openstack-infra13:23
*** ociuhandu has quit IRC13:26
*** jpena is now known as jpena|lunch13:26
openstackgerritTristan Cacqueray proposed zuul/zuul master: zuul_stream: handle module that emit non str msg  https://review.opendev.org/69608113:28
openstackgerritSimon Westphahl proposed zuul/zuul master: Keep task stdout/stderr separate in result object  https://review.opendev.org/65027613:30
*** mriedem has joined #openstack-infra13:35
*** kjackal has quit IRC13:36
*** kjackal_v2 has joined #openstack-infra13:37
*** jaosorior has quit IRC13:38
openstackgerritJens Harbott (frickler) proposed opendev/system-config master: rsync mirrors: use localauth vos release  https://review.opendev.org/69598113:40
*** rfolco has quit IRC13:47
*** rfolco has joined #openstack-infra13:49
*** rfolco has quit IRC13:50
rm_workaugh, this pyyaml thing is driving me mad13:51
rm_workit's still happening even without novnc https://dc24826bd4516a6de44b-5bffd7009ada1667875c6ae3efa923db.ssl.cf5.rackcdn.com/695947/5/check/octavia-grenade/77b9004/logs/grenade.sh.txt.gz13:51
rm_worki assume something ELSE pulled it in too13:51
rm_workwhat I don't get is, I see something like 9 times it was requested to be installed, and it just worked13:52
rm_workand then this one time, it explodes? O_o13:52
rm_worki'm not sure where you found it being installed originally13:54
rm_workhmmm nevermind, seems like the changes we made didn't actually successfully disable novnc, it's still getting installed T_T13:57
rm_workhttps://review.opendev.org/#/c/695947/6/playbooks/legacy/grenade-devstack-octavia/run.yaml13:57
*** diga_ has joined #openstack-infra13:58
rm_workahhh nm, it is, it's just done really stangely13:58
rm_workhttps://zuul.opendev.org/t/openstack/build/8d124c9dba8049d1a8a541419031c778/log/logs/old/local_conf.txt.gz#21-5413:58
rm_workenabled and then disabled >_>13:59
rm_workbut that should work13:59
*** tkajinam has joined #openstack-infra14:00
fricklerrm_work: I think we are seeing the same issue in designate, the reason for the issue, afaict, seems to be that capped pip is installed only for py2, not for py3, and then with the newer pip, some things fail14:11
rm_workhmm14:12
rm_workyes14:12
*** dpawlik has joined #openstack-infra14:12
openstackgerritMatthieu Huin proposed zuul/zuul master: enqueue: make trigger optional  https://review.opendev.org/69544614:14
*** jpena|lunch is now known as jpena14:18
*** ociuhandu has joined #openstack-infra14:22
zbr_frickler: *lots* of errors on inap, can't we disable it?14:24
zbr_>340 failures in 24h, http://logstash.openstack.org/#/dashboard/file/logstash.json?query=message:%5C%22Connection%20broken:%20IncompleteRead(0%20bytes%20read)%5C%2214:24
fricklerrm_work: so in fact the old stack is being set up with use_python3=false, the new one with true, this breaks. IIUC gmann did make patches such that the old stack would also run with py3, not sure why this doesn't work here14:24
rm_workwe have some custom grenade stuff14:25
rm_workso maybe it isn't using his14:25
rm_workneed to figure out what he did14:25
fricklerzbr_: I'd like some other infra-root to double-check, someone should be awake soon14:26
*** ociuhandu has quit IRC14:27
*** rfolco has joined #openstack-infra14:28
rm_workcan i just add `export DEVSTACK_GATE_USE_PYTHON3=True` to our grenade script?14:28
rm_worklooks like it14:28
rm_workbased on https://review.opendev.org/#/c/695097/14:31
rm_workand a couple other patches he proposed for the same issue I'd guess14:31
openstackgerritTristan Cacqueray proposed zuul/zuul master: wip: add cleanup-timeout job attribute  https://review.opendev.org/69609814:34
openstackgerritSimon Westphahl proposed zuul/zuul master: Spec for allowing circular dependencies  https://review.opendev.org/64330914:35
*** goldyfruit has joined #openstack-infra14:36
*** goldyfruit_ has joined #openstack-infra14:38
openstackgerritMerged opendev/system-config master: rsync mirrors: use localauth vos release  https://review.opendev.org/69598114:40
*** Goneri has quit IRC14:41
*** goldyfruit has quit IRC14:41
*** dpawlik has quit IRC14:44
*** Lucas_Gray has quit IRC14:45
rm_workyeah, inap mirrors borked maybe?14:46
rm_work2019-11-26 14:38:44.957553 | controller | Err:1 http://mirror.mtl01.inap.opendev.org/ubuntu bionic/main amd64 libharfbuzz0b amd64 1.7.2-1ubuntu114:46
rm_work2019-11-26 14:38:44.957657 | controller |   404  Not Found [IP: 198.72.125.4 80]14:46
openstackgerritPaul Belanger proposed zuul/zuul-jobs master: Update download-artifact to use zuul.artifacts  https://review.opendev.org/69601314:47
rm_workour jobs on inap are dieing too14:47
rm_workzbr_ / frickler ^^14:47
*** ociuhandu has joined #openstack-infra14:49
*** Goneri has joined #openstack-infra14:53
openstackgerritFelix Schmidt proposed zuul/zuul master: Make reporting asynchronous  https://review.opendev.org/69125314:54
openstackgerritFelix Schmidt proposed zuul/zuul master: Make direct-push configurable on project-level  https://review.opendev.org/67710914:54
openstackgerritFelix Schmidt proposed zuul/zuul master: Implement push job in merger  https://review.opendev.org/67711014:54
openstackgerritFelix Schmidt proposed zuul/zuul master: Push changes in GerritReporter if direct-push is enabled  https://review.opendev.org/67711114:54
*** ociuhandu has quit IRC14:54
*** dpawlik has joined #openstack-infra14:56
fungii suspect the problem with the inap mirror (if the failures you're seeing are for proxied http calls?) is that the apache isn't getting its proxy cache culled fast enough or aggressively enough, so the dedicated /var/cache/apache2 volume is filling up from time to time: http://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=67962&rra_id=all14:58
rm_workok so ... we wait for that to get fixed while a bunch of our jobs randomly die? :D14:59
rm_workcan we set up elastic-recheck for this?14:59
rm_work(may have made a meaningless sentence, not sure if that's how that works)15:00
fungiright now htcacheclean is running hourly with -l 70200M which may just not be aggressive enough if jobs are caching very large files15:00
fungithe problem is "fixed" for the moment because there's room on that fs again15:00
rm_workwell right now, this means that using inap in zuul is worse than NOT using it, because having one job run on inap and fail means a recheck and like 10+ jobs need to run again15:01
rm_workhmm k, so ... i guess I recheck and hope?15:01
fungibut presumably still better than not testing anything at all? ;)15:02
fungii can clean up more cached content there, but i'm curious to know why we're not hitting the same problem in other providers which have similar configuration15:02
rm_workis not testing one of the options? :D15:02
fungiif you don't develop any software then there's no need to test anything, so i suppose?15:03
fungicould be the volume of jobs being run there, or i/o performance slowing the htcacheclean run15:03
fungiyeah, looks like it could be i/o performance, the 14:00 utc htcacheclean run is still going an hour later15:04
rm_workyeah i'm just saying, the result of having it enabled when it's broken isn't just "ah, we don't get additional capacity from it", the result is "we have less capacity available than if it wasn't enabled"15:04
*** jaosorior has joined #openstack-infra15:04
openstackgerritPaul Belanger proposed zuul/zuul-jobs master: Update download-artifact to use zuul.artifacts  https://review.opendev.org/69601315:04
fungiright, and i'm saying the condition i suspect caused those failures has cleared, at least temporarily, and i'm looking at what we need to do to reduce the risk that it returns15:04
rm_workk15:05
fungii think we're simply caching too much with apache for slower filesystems, in the past we tried to keep these down closer to 50gb but they've been increased in recent months i think15:05
fungithere's a breaking point where you cache files faster than htcacheclean can remove them, and then you get a runaway train effect (runaway ussuri?)15:06
fungikeeping the cache smaller reduces the context that htcacheclean must traverse, to hopefully keep it ahead of the game for subsequent runs15:07
sshnaidmmordred, hi, please ping me if you're available in your time, wrt openstack ansible modules15:11
fungiutilization is slowly falling there, so once htcacheclean hopefully catches up we can reduce it further. if it starts climbing again i'll disable inap temporarily in nodepool so we can safely wipe the cache before reducing the size we want htcacheclean to keep it at15:12
rm_workreally really wish it was possible to have zuul restart ONE job15:13
openstackgerritElod Illes proposed openstack/openstack-zuul-jobs master: Add tox-py37 to periodic-stable-jobs template  https://review.opendev.org/69610515:13
*** ykarel is now known as ykarel|pto15:14
rm_workand do it while the others are still running so i don't have to wait the entire time and then re-run a bunch of jobs that passed fine already15:14
fungiinterestingly, some of our other mirrors (rax and ovh) have 128gb volumes for /var/cache/apache2 instead of 100gb15:16
fungithat might be why the htcacheclean -l parameter got increased15:17
*** chkumar|rover is now known as raukadah15:17
*** ociuhandu has joined #openstack-infra15:19
*** tkajinam has quit IRC15:19
fungii've temporarily switched out the running htcacheclean process there with a manual one with -l set to 50gb15:20
fungiso hopefully we'll get some more breathing room while i continue digging15:20
rm_workthanks15:20
fungilooks like https://review.openstack.org/575520 raised it from 50gb to 70gb in july of last year15:23
*** dpawlik has quit IRC15:23
fungithough i think inap has been offline for most of that time, until a few weeks ago, so it's possible we simply didn't have data showing it would be a problem there15:24
*** yoctozepto has quit IRC15:24
*** yoctozepto has joined #openstack-infra15:25
*** ociuhandu has quit IRC15:28
clarkbfungi: we could make a 300gb cinder volume and swap in 200gb fs pretty easily?15:33
clarkb(extra 100gb for afs)15:33
fungiyeah, i'm unconvinced giving it more space will solve the problem though, if it can't manage to delete files down to 70gb15:33
fungiwe run htcacheclean hourly under flock, but if it takes 3 hours for htcacheclean to traverse 70gb+ data then that's two additional pulses behind it's getting15:34
clarkbdoes it get near 70gb after that 3 hours though?15:35
fungiincreasing the breathing room might relieve that, i guess, to allow it to skip more htcacheclean pulses?15:35
clarkbif so its only the growth in that windoe we have to accomodate right?15:35
fungilooking at the graph, it was around 70gb as of 0800z but then started climbing15:37
fungifilling the filesystem by around 1100z15:37
clarkbwe are probably very near that boundary then15:38
fungiand it remained full until it suddenly began to drop around 1330z15:38
*** derekh has quit IRC15:38
clarkband extra disk may be sufficient to get us away from it15:38
*** derekh has joined #openstack-infra15:38
fungiso basically htcacheclean could not keep up for around 5.5 hours15:38
clarkbthe cleaning is sudden aiuibecause it does a scan first to determine what to delete then deletes15:38
clarkbas it applies LRU rules to objects if bot already expired15:39
clarkb*not15:39
fungiby the time it did manage to delete content, it had lost 10gb ground and so only deleted down to around 80gb15:39
fungipresumably because it was working of 6-hour-old calculations for what should be expired15:39
fungibut yeah, we could attach another cinder volume, add it to the current vg, extend the volume into it and see if things are able to keep up with the extra headroom15:42
fungibut if it's basically only capable of doing an htcacheclean pass every 6 hours under load, i'm concerned that may still end up ina bad place15:42
*** ociuhandu has joined #openstack-infra15:42
fungiwondering if there are faster storage options there15:42
fungisomething where we might get better read performance (assuming that's the delay)15:43
clarkbya though raid 0 may be the answer15:43
clarkb(which second volume approximates)15:44
fungithat's not a terrible idea, though it does double the risk for catastrpohic failure of the service from a volume outage (so does attaching a second volume to the vg really)15:44
clarkbyup15:45
clarkband with cinder volumes it is really hard to tell if weactually get double the throughput15:45
fungii'm pretty sure lvm2 can stripe blocks across multiple pvs, need to revisit the manpage. if so we could do that without resorting to mdraid15:45
clarkbbecause networking and wedont know what the backend looks like15:45
clarkblooks like lvm can do it but we need anew lv15:48
*** diablo_rojo has joined #openstack-infra15:53
*** lmiccini has quit IRC15:54
*** michael-beaver has joined #openstack-infra15:54
openstackgerritYannick Thomas proposed openstack/project-config master: Create neutron-interconnection repo under x/ namespace  https://review.opendev.org/69611615:55
*** mriedem has quit IRC15:55
*** mriedem has joined #openstack-infra15:59
*** dpawlik has joined #openstack-infra16:00
*** dtantsur is now known as dtantsur|afk16:03
*** dpawlik has quit IRC16:04
*** kjackal_v2 has quit IRC16:12
*** ijw has joined #openstack-infra16:14
clarkbfungi: alsonpossible that the local root disk performs better than cinder volumes there? we could use apache cache on local disk if so16:15
gmannrm_work: frickler yeah that worked for other projects. octavia was facing the broken pipe issue also in that grenade job fix patch..16:16
fungiclarkb: maybe, on the other hand if it doesn't then we fill the rootfs again and *boom*16:17
clarkbya16:18
*** hashar has quit IRC16:19
rm_workgmann: yeah and ALSO some issue with the installation of osc-placement in nova16:19
*** jamesmcarthur has joined #openstack-infra16:19
rm_workhttps://review.opendev.org/#/c/695466/16:19
rm_workso many issues16:19
rm_worki think we're going to temporarily switch our grenade job to non-voting, because this is ridiculous, and wait for some of this stuff to work itself out16:20
rm_workour gates have been down since the middle of last week16:20
gmannrm_work: ok, or I will suggest to make it py2->py2 as it was previously so that you keep running the coverage. later while moving to everything py3 then we fix those issue16:21
rm_workhmm, i guess that's a possibility -- but i think we are ALSO about to have py2 issues16:22
*** tesseract has quit IRC16:23
*** pgaxatte has quit IRC16:25
*** ociuhandu has quit IRC16:33
*** ociuhandu has joined #openstack-infra16:34
AJaegerconfig-core, could you review https://review.opendev.org/694478, please?16:42
*** jpena is now known as jpena|brb16:45
*** lucasagomes has quit IRC16:46
clarkbas a sanity check the inap volume is mounted noatime,errors=remount-ro,barrier=0 which is in line with our other volume mount options16:47
clarkbAJaeger: do you know if the neutron team is aware of the apparent new interest? and if so does that change their plans at all?16:50
clarkb(I want to avoid a situation where we fork the project then 6 months later want to replace the openstack/ project with the fork16:51
slaweqclarkb: hi, yes I know about it16:51
slaweqbut as there was really lack of development in this project for last couple of cycles, we decided to not keep it as stadium project anymore16:52
slaweqand I don't think it will change anytime soon16:52
clarkbslaweq: right I get all that. My question is now that you've said remove it people are claiming that they will invest time into it. Does that change the calculation at all? Mostly because I really want to avoid a potential unfork in the future16:53
*** dpawlik has joined #openstack-infra16:54
slaweqclarkb: I don't think so, there is no reason why it couldn't be developed in x/ namespace if people will really want16:54
slaweqbut based on the past experience with this project, I'm really still not sure how much time they will be able to invest in this project now16:55
slaweqso as for now I can say that we will not want to unfork it in e.g. 6 months16:55
slaweqclarkb: is that good answer for You? :)16:55
clarkbyes16:56
slaweqthx :)16:57
*** dpawlik has quit IRC16:58
*** iurygregory has quit IRC16:58
johnsomAh, ok, so you are already working on the broken inap mirror.17:01
johnsomE: Failed to fetch http://mirror.mtl01.inap.opendev.org/ubuntu/pool/main/h/harfbuzz/libharfbuzz0b_1.7.2-1ubuntu1_amd64.deb  404  Not Found [IP: 198.72.125.4 80]17:01
clarkbjohnsom: well I think we've identified the cause (htcacheclean not cleaning quickly enough) and now trying to sort out if simply adding some headroom is sufficient to avoid the problem or if we need to find faster disk io17:03
johnsomJust minutes ago in https://aa07927f3550f63afd7c-beccc8c74927db18ecc6d28abe62d057.ssl.cf1.rackcdn.com/695947/8/check/octavia-v2-dsvm-scenario/d746404/job-output.txt17:03
clarkbhrm it has plenty of disk right now ~10GB17:03
johnsomCool, thanks. Hopefully not one of those HPE SSDs... lol17:03
clarkbalso that is afs not apache cache17:03
clarkb(that particular url is I mean)17:03
clarkbpossible that something else is going on too17:04
*** sshnaidm is now known as sshnaidm|afk17:05
clarkb-????????? ? ?     ?          ?            ? libharfbuzz0b_1.7.2-1ubuntu1_amd64.deb17:06
clarkbthat is what ls -l shows for that file on afs "disk"17:06
clarkbabout 5 hours ago there were a bunch of afs io errors17:07
clarkbnothing current, but those errors could still affect that particular file I suppose17:07
openstackgerritFabien Boucher proposed zuul/zuul master: WIP pagure: remove connectors burden and simplify code  https://review.opendev.org/69613417:07
clarkbfungi: ^ maybe we should reboot?17:08
*** udesale has quit IRC17:09
openstackgerritFabien Boucher proposed zuul/zuul master: WIP pagure: remove connectors burden and simplify code  https://review.opendev.org/69613417:10
clarkbfungi: I think we should disable inap, increase apache cache fs size (possibly with raid0), reboot, then turn it back on again17:10
clarkbI expect that we are close enough to the limit here that simply adding a bit more disk will make things happy again17:10
clarkbbasically we need to get to where htcacheclean is able to stat its contents before the disk fills up17:11
clarkbthoughts?17:11
fungiclarkb: yeah, i also wonder if whatever impacted afs could be similarly responsible for htcacheclean's poor performance17:13
openstackgerritClark Boylan proposed openstack/project-config master: Disable inap  https://review.opendev.org/69613717:13
rm_workyeah i think it's always been that package for us -- so possible just that one or a couple of them got corrupted at some point17:13
fungiclarkb: also two apache segfaults a few minutes after that burst of afs disk cache read errors17:15
*** ociuhandu has quit IRC17:15
fungiit's possible the cinder volume these are on is sometimes not responding or responding unreasonably slowly17:15
clarkbya and that could affect afs when it goes to read from the cache17:16
clarkb(or contention between the two caches against the volume)17:17
clarkbwe have a single volume type in that region: solidfire017:18
openstackgerritMerged openstack/project-config master: Retire neutron-interconnection project  https://review.opendev.org/69447817:19
fungiclarkb: looks like we're also not collecting disk i/o counters from snmp17:19
*** igordc has joined #openstack-infra17:20
fungitop shows iowait spiking up badly for brief periods though17:20
fungiwatched it go from around 1% wa to almost 80% a minute ago17:21
*** jpena|brb is now known as jpena17:21
clarkbbenj_: ^ any idea how we can more efficiently make use of cinder there?17:21
*** jamesmcarthur has quit IRC17:22
clarkbfungi: another option is to use a bigger flavor then hope the root disk performs better17:23
fungiyeah, maybe. are there other cinder volume types available, did you happen to notice?17:24
clarkbno that is the only type available17:24
fungik, then yeah maybe the rootfs is hypervisor local storage and has less network congestion to worry about17:24
fungibut if we're also seeing problems for afs then we'd presumably want both on the rootfs, so need a larger rootfs17:25
clarkbyes, we can rebuild the server wti ha bigger flavor17:25
fungido the larger flavors come with larger rootfs?17:27
fungii know that's not a given in other providers17:27
AJaegerconfig-core, two small reviews, please: https://review.opendev.org/695661 https://review.opendev.org/69540117:27
clarkbfungi: yes, in this case we can double the rootfs17:28
clarkb320GB from 160GB17:28
clarkb(doubles a bunch of other stuff too but that may just be what we have to accept)_17:29
fungianybody remember how far we got (or where we got stuck) adding a builder for arm64/aarch64 wheels? was it challenges with afs support?17:29
clarkbwe build wheels in zuul jobs now. I don't think anything on the zuul side has problems with that currently. I'm guessing that it is afs that posed a problem (since we have to write and not just read)17:30
fungido the current wheel builds for amd64 get temporary write access to afs, or is the executor retrieving those and writing them into afs?17:31
clarkbfungi: I think a secret is shared on the build node and they write directly17:31
fungik. in that case, yeah, we do still need to solve afs access from arm64 systems17:32
AJaegerianw should know, AFAIK he looked at that...17:32
*** jaosorior has quit IRC17:32
fungiyeah, i was just going to follow up on the multi-arch sig thread on openstack-discuss where jrosser mentioned needing a prebuilt arm64 wheel cache for osa jobs17:33
fungibut can certainly wait until ianw is awake17:33
openstackgerritMerged openstack/project-config master: Disable inap  https://review.opendev.org/69613717:34
*** ykarel|pto has quit IRC17:34
clarkbfungi: fwiw I'm somewhat inclined to build a new inap mirror on the 320GB root disk flavor since that is straightforward and doesn't involve trying to tweak too many settings17:34
fungii wonder if a refactor of the job to have the executor write those into afs instead would make things easier (but i think we'd want to make sure we only transfer the outstanding delta between what was already built and what changed?)17:34
clarkbwe can spend all day tuning cinder or an hour setting up new server :)17:35
fungiclarkb: yeah, i'm with you there. best we focus our limited available time elsewhere17:35
mgagneclarkb: what's the issue?17:35
fungimgagne: we saw read errors from a cinder volume, and are also getting very slow i/o reading from it17:36
clarkbmgagne: the thought is that using a vm local root disk (that is bigger) may provide better throughput17:36
mgagneis the issue intermittent or is it still going on?17:36
mgagnecan't disagree on that one (local disk > cinder volume)17:37
clarkbmgagne: it is a little bit of both :) intermittent iowait slows down htcacheclean to the point where we run out of disk (because it can't clean quickly enough) and we think that the intermittent issue may cause problems in afs that then persists17:37
fungimgagne: it was apparently particularly bad around 12:00 and 12:45 utc17:37
clarkbmy thought is go to the 320GB root disk flavor and not use a cinder volume17:38
mgagneI'm looking at performance graphs on the backend and we didn't see any issue here. Our cinder volume has a minimum of 400 iops guarantied and maximum of 4000 if available (which should always be the case here)17:39
clarkbmgagne: could it be that network is the bottleneck (if shared with the test nodes?)17:39
mgagnewe unfortunately don't have any other volume types but I could manually increase the maximum IOPS to double it.17:39
clarkbif we double the iops we should be able to see if htcacheclean runs quicker17:40
fungimgagne: can you see whether that instance is exceeding the cinder iops guarantee?17:40
mgagnecompute nodes have 10g with lacp. But VMs themselves have limited bandwidth depending on the flavor. The storage network isn't limited though.17:40
clarkbok, likely not networking then17:41
mgagneI'll check if that information is available17:41
mgagneperformance graph isn't real time for volumes. I saw some spike to 100% utilization but IOPS reported is way below the maximum. So I'm a bit confused about how the % is computed. Could be based on the minimum.17:47
*** tobiash_ is now known as tobiash17:47
openstackgerritMerged openstack/project-config master: Manage pyghmi jobs at project level  https://review.opendev.org/69566117:49
openstackgerritMerged openstack/project-config master: Add gerritbot trigger for microstack  https://review.opendev.org/69540117:49
*** openstackgerrit has quit IRC17:49
mgagneI think iops are based on 4k. But your iops are way above 4k. Average ~16k with 64k peaks in your case. I suppose increasing maximum would be a good thing17:50
clarkbmgagne: fwiw this is the only volume we'll be running there (at least we don't have plans for additional volumes). I'm not sure if that system is shared but if not then raising those iops would probably be ok?17:51
mgagneit's shared but it's under utilized afaik.17:51
mgagneI doubled the maximum iops.17:52
clarkbbut also we can probably switch to root disk if that is better for you all (then use the bigger flavor)17:52
mgagneyeah, local disk will always be faster17:53
mgagneIIRC RAID10 with SSD is used on the compute nodes.17:54
clarkboh ya in that case maybe I should just go ahead and build a new server on the bigger flavor17:54
clarkbfungi: ^17:54
fungii concur17:56
*** jklare has quit IRC17:56
fungisounds like the best way forward at this point17:56
clarkbok new mirror is building. I'll push up inventory and dns changes once that info is known17:58
fungithanks!17:59
*** jklare has joined #openstack-infra18:00
*** derekh has quit IRC18:00
*** openstackgerrit has joined #openstack-infra18:09
openstackgerritClark Boylan proposed opendev/system-config master: Replace inap mirror with bigger instance  https://review.opendev.org/69614818:09
clarkbI think we want to get in the dns update first18:09
clarkbso that acme.sh will work18:10
openstackgerritClark Boylan proposed opendev/zone-opendev.org master: Replace inap mirror  https://review.opendev.org/69615018:13
clarkbfungi: ^ As soon as inap in-use count falls to 0 I think we can merge the dns update then the system-config update18:13
clarkbin use count is 80 and falling18:14
*** armax has quit IRC18:14
*** pkopec has quit IRC18:22
*** ijw has quit IRC18:26
*** jaosorior has joined #openstack-infra18:26
clarkbgonna step out for a bit before the meeting18:26
*** jpena is now known as jpena|off18:31
*** rosmaita has quit IRC18:32
*** openstackgerrit has quit IRC18:35
*** ijw has joined #openstack-infra18:37
fungihttp://grafana.openstack.org/d/ykvSNcImk/nodepool-inap?orgId=1 says 48 in use18:38
*** ijw has quit IRC18:40
*** ijw has joined #openstack-infra18:40
*** goldyfruit___ has joined #openstack-infra18:41
*** goldyfruit_ has quit IRC18:43
*** rosmaita has joined #openstack-infra18:46
*** electrofelix has quit IRC18:50
*** ralonsoh has quit IRC18:51
*** goldyfruit_ has joined #openstack-infra18:51
*** goldyfruit___ has quit IRC18:54
*** dpawlik has joined #openstack-infra18:55
*** aedc has joined #openstack-infra18:57
*** dpawlik has quit IRC18:59
*** ijw_ has joined #openstack-infra19:03
*** ijw has quit IRC19:06
*** goldyfruit_ has quit IRC19:07
*** lpetrut has quit IRC19:08
*** goldyfruit has joined #openstack-infra19:09
*** goldyfruit_ has joined #openstack-infra19:19
*** eharney has joined #openstack-infra19:20
*** goldyfruit has quit IRC19:22
*** jamesmcarthur has joined #openstack-infra19:36
*** aedc has quit IRC19:38
*** aedc has joined #openstack-infra19:39
*** eharney has quit IRC19:46
diablo_rojoSorry I missed the meeting. thanks fungi for covering the attachment stuff!19:51
funginp!19:51
*** rlandy is now known as rlandy|brb19:56
*** diga_ has quit IRC19:57
*** openstackgerrit has joined #openstack-infra20:01
openstackgerritMerged opendev/zone-opendev.org master: Replace inap mirror  https://review.opendev.org/69615020:01
funginow that the meeting's over, i'm going to go get some very late lunch and run pre-holiday errands, back laterish20:06
*** aedc has quit IRC20:10
*** rlandy|brb is now known as rlandy20:10
*** goldyfruit___ has joined #openstack-infra20:17
*** goldyfruit_ has quit IRC20:20
*** goldyfruit_ has joined #openstack-infra20:21
*** goldyfruit___ has quit IRC20:24
*** nhicher has quit IRC20:33
*** nhicher has joined #openstack-infra20:33
clarkbI've approved https://review.opendev.org/#/c/696148/ google dns reports the new ip for the mirror now so all the acme stuff should be ready to go20:34
*** nhicher has quit IRC20:35
*** nhicher has joined #openstack-infra20:39
*** ccamacho has quit IRC20:46
*** tosky has quit IRC20:46
*** dpawlik has joined #openstack-infra20:55
openstackgerritMerged opendev/system-config master: Replace inap mirror with bigger instance  https://review.opendev.org/69614820:56
*** dpawlik has quit IRC21:00
*** jtomasek has quit IRC21:00
*** armax has joined #openstack-infra21:03
*** rfolco has quit IRC21:14
clarkbinap mirror is ansibling now21:20
openstackgerritClark Boylan proposed openstack/project-config master: Revert "Disable inap"  https://review.opendev.org/69619321:25
clarkbwe aren't ready for ^ yet, but wanted it to get through check so that it is ready when we are ready21:25
*** threestrands has joined #openstack-infra21:27
openstackgerritIan Wienand proposed openstack/project-config master: zuul layout: include openstacksdk in zuul tenant for jobs  https://review.opendev.org/69619421:29
ianwfungi /clarkb: ^ this would be helpful so i can flesh out the images stuff into working changes21:30
clarkbianw: we might want to sort out the zuul tenant's config errors first?21:30
clarkbAJaeger pointed them out and has fixes by way of includnig more projects in the zuul tenant (which is technically fine but the problems are related to the dns test jobs and similar which I wouldn't expect us to care about in the zuul tenant)21:31
clarkbI've not had time to look at it beyond that though21:31
ianwhrm, ok ,will look, we did similar for dib and it didn't make things worse, at least21:32
*** igordc has quit IRC21:33
ianw Job dib-functests-base not defined21:34
ianwthat's weird, that seems like it's all in the one repo21:35
*** ijw_ has quit IRC21:36
*** cloudnull has quit IRC21:37
*** d34dh0r53 has quit IRC21:39
*** igordc has joined #openstack-infra21:39
*** cloudnull has joined #openstack-infra21:39
*** d34dh0r53 has joined #openstack-infra21:39
ianwi agree with the other missing projects, though21:48
openstackgerritClark Boylan proposed opendev/system-config master: Add necessary ansible vars for inap mirror LE  https://review.opendev.org/69619521:53
clarkbianw: fungi ^ I forgot to add that (this is why we don't have an apache or certs on the new host yet)21:53
ianwoh yeah, the number change21:53
*** diablo_rojo has quit IRC21:56
*** jaosorior has quit IRC21:59
openstackgerritVitaliy Lotorev proposed zuul/zuul master: doc: Clarify that some regexp has restricted syntax  https://review.opendev.org/69599122:01
*** diablo_rojo has joined #openstack-infra22:02
*** ociuhandu has joined #openstack-infra22:02
*** rcernin has joined #openstack-infra22:04
openstackgerritVitaliy Lotorev proposed zuul/zuul master: doc: Document regexp usage  https://review.opendev.org/69599122:05
*** ociuhandu has quit IRC22:08
*** pcaruana has quit IRC22:16
*** slaweq has quit IRC22:37
*** pkopec has joined #openstack-infra22:40
tonybclarkb, ianw: FWIW, I tried running gitea locally to see if I can get more data / debuggin info but so far I haven't made a lot of progress.22:40
tonybclarkb, ianw: My nova repo get's 'wedged' every 2nd day but I have a work around and as I'm the only one seeing it I don't think it's anyhting like a high priority22:42
*** armax has quit IRC22:42
fungiyou're not the only one seeing it, since we can all replicate the issue with a copy of your clone22:43
fungiyou're just apparently the only one inconvenienced enough by it to give us a heads-up22:43
*** gfidente has quit IRC22:46
tonybfungi: fair.  I'd really like for someone to reproduce it *without* my repo.  Not that've done anyting funky with my repo but still.22:47
fungigit fsck doesn't think you've adulterated that repo at the least22:49
tonybfungi: Yeah, I mean it's a repo I've been using for 5ish years but I've only done things you'd expect in the repo22:50
tonybI really want to reproduce it with a local gitea, I think that's the only way we're going to make progress22:51
fungiyep, a local master branch tip deployment even22:54
fungii keep reminding myself they're all volunteers with day jobs too22:55
fungithe easier we can make it to confirm this bug, the better22:55
*** dpawlik has joined #openstack-infra22:56
*** dpawlik has quit IRC23:01
*** rh-jelabarre has quit IRC23:06
*** tkajinam has joined #openstack-infra23:08
*** slaweq has joined #openstack-infra23:11
*** goldyfruit___ has joined #openstack-infra23:13
*** slaweq has quit IRC23:15
*** goldyfruit_ has quit IRC23:16
*** goldyfruit_ has joined #openstack-infra23:17
*** goldyfruit___ has quit IRC23:20
*** dchen has joined #openstack-infra23:23
*** pkopec has quit IRC23:28
*** ociuhandu has joined #openstack-infra23:30
clarkbarg the mirror test failed on the fix for inap le23:32
clarkbI've rechecked it23:32
ianwhrm what was the failure?23:34
clarkbstill looking23:34
*** ociuhandu has quit IRC23:35
clarkbhttps://zuul.opendev.org/t/openstack/build/3681d241ffc84924ac20107828374437/log/job-output.txt#275723:35
ianwhrm, we don't capture the journal that would tell us why apache was unhappy23:35
clarkbara doesn't ahve any more than that job-output file either23:36
clarkbhas the "look in journalctl -xe" message23:37
ianwit suggests invalid certs somehow, but the letsencrypt bits seemed to work.  it should have self-signed certs deployed23:37
*** armax has joined #openstack-infra23:38
ianwhttps://zuul.opendev.org/t/openstack/build/3681d241ffc84924ac20107828374437/log/mirror01.openafs.provider.opendev.org/syslog.txt.gz#146023:39
ianwactually we do capture it23:39
clarkboh!23:39
ianw... so ... why did that not generate a self-signed cert23:39
clarkbhttps://storage.bhs1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_368/696195/1/gate/system-config-run-mirror/3681d24/bridge.openstack.org/ara-report/result/7c5e2ae6-30cf-4a12-9365-93801c1bb0a4/ that task was skipped23:40
ianwhttps://zuul.opendev.org/t/openstack/build/3681d241ffc84924ac20107828374437/log/job-output.txt#2507 ... it looks like it ran acme.sh, but didn't get a txt record23:41
ianwi doesn't seem to have captured *any* output from that23:41
clarkbya acme_txt_required is an empty list23:41
clarkbI wonder if this is the same thing we saw on gitea06?23:41
clarkbwhere it seems to do everything but fails anyway?23:42
ianwhttps://opendev.org/opendev/system-config/src/branch/master/playbooks/roles/letsencrypt-acme-sh-install/files/driver.sh23:43
ianwhrm, we should capture the log file, that would tell us what went wrong23:43
*** eernst has joined #openstack-infra23:44
ianwwe do for the letsencrypt tests, but no the mirror jobs23:44
*** rfolco has joined #openstack-infra23:47
openstackgerritIan Wienand proposed opendev/system-config master: mirror jobs: copy acme.sh output  https://review.opendev.org/69620823:47
*** eernst has quit IRC23:48
ianwbut, it still should have failed i would have thought, at least it would stop things earlier23:48
clarkbI think the list being empty means nothign to do so it succeeds anyway23:49
*** goldyfruit_ has quit IRC23:49
clarkbI wonder if we can check that (have something to see if a record is expected? then fail if list is empty anyway?)23:49
ianwlooking at this with fresh eyes ... https://opendev.org/opendev/system-config/src/branch/master/playbooks/roles/letsencrypt-acme-sh-install/files/driver.sh#L39 would do a terrible job when there's a failure23:50
clarkbnot sure if we know that or if we rely on acm.sh to figure it out23:50
ianwit will swallow it23:50
clarkbianw: the log file would have it though right?23:50
ianwwe'll see it in the log files (from the tee) but it won't fail the script23:51
*** rfolco has quit IRC23:51
*** rfolco has joined #openstack-infra23:51
ianwPIPESTATUS is probably the best way to handle this, do a post-check23:52
clarkbshould we set pipefail and errexit?23:52
ianwi think not because that will bail it before it gets to the tee to stash it in the logfile23:53
clarkbgood point23:53
openstackgerritMerged openstack/diskimage-builder master: Add IPv6 support in dhcp-all-interfaces  https://review.opendev.org/69211023:54
*** rfolco has quit IRC23:55
*** rfolco has joined #openstack-infra23:55
*** rlandy has quit IRC23:58
ianwhave to double check the exit code when renewal isn't required23:58
*** rlandy has joined #openstack-infra23:58
*** diablo_rojo has quit IRC23:58

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!