Wednesday, 2019-08-21

*** bobh has joined #openstack-infra00:02
*** sgw has quit IRC00:02
*** aaronsheffield has quit IRC00:05
*** jamesmcarthur has joined #openstack-infra00:06
*** jamesmcarthur has quit IRC00:11
*** markvoelker has joined #openstack-infra00:11
ianwclarkb: if you're in a base-jobs sort of mood, https://review.opendev.org/#/c/676120/ would help for arm64 functional test on dib00:11
*** markvoelker has quit IRC00:15
*** bnemec has joined #openstack-infra00:16
*** pkopec has quit IRC00:20
*** markvoelker has joined #openstack-infra00:26
*** zhurong has quit IRC00:27
*** bobh has quit IRC00:35
*** bnemec has quit IRC00:35
guilhermespthanks fungi ! I will let you know when I'm done with it00:41
*** gyee has quit IRC00:41
openstackgerritMerged opendev/base-jobs master: Fix stable branch path  https://review.opendev.org/67755500:46
corvusianw, clarkb: 677584 lgtm; i'm not around -- we can approve it if ianw can keep an eye on it, or i can do so tomorrow00:46
ianwcorvus: sure, i can watch it for the rest of the day and make sure nothing obviously explodes00:47
corvusianw: thanks!  +w00:48
*** bobh has joined #openstack-infra00:52
*** exsdev has quit IRC00:54
openstackgerritMerged zuul/zuul-jobs master: upload-logs-swift: fix keystoneauth1 exceptions  https://review.opendev.org/67758400:57
*** ianychoi has quit IRC00:57
*** ianychoi has joined #openstack-infra00:59
*** spsurya has joined #openstack-infra00:59
*** hongbin has joined #openstack-infra01:04
*** ricolin has joined #openstack-infra01:05
*** rlandy|ruck has quit IRC01:06
*** gregoryo has joined #openstack-infra01:07
*** exsdev has joined #openstack-infra01:14
*** zzehring has quit IRC01:24
*** zzehring has joined #openstack-infra01:24
*** sgw has joined #openstack-infra01:29
*** igordc has quit IRC01:35
openstackgerritMerged openstack/diskimage-builder master: dracut-regenerate: catch failures and exit code  https://review.opendev.org/67603201:48
*** liuyulong has quit IRC01:51
*** bobh has quit IRC01:54
*** igordc has joined #openstack-infra01:58
*** jamesmcarthur has joined #openstack-infra02:20
*** michael-beaver has quit IRC02:21
openstackgerritMerged openstack/diskimage-builder master: block-device-efi : expand disk size calculation  https://review.opendev.org/67635402:30
*** rh-jelabarre has quit IRC02:37
*** bhavikdbavishi has joined #openstack-infra02:40
*** jamesmcarthur has quit IRC02:41
*** bhavikdbavishi1 has joined #openstack-infra02:42
*** jamesmcarthur has joined #openstack-infra02:44
*** bhavikdbavishi has quit IRC02:44
*** bhavikdbavishi1 is now known as bhavikdbavishi02:44
*** igordc has quit IRC02:49
*** bobh has joined #openstack-infra02:53
*** bobh has quit IRC02:56
*** bhavikdbavishi has quit IRC03:03
guilhermespappreciate more votes here https://review.opendev.org/#/c/677538/ :)03:08
*** larainema has joined #openstack-infra03:09
fungiconfig-core: ^03:14
*** bhavikdbavishi has joined #openstack-infra03:20
openstackgerritMerged openstack/project-config master: Add os_murano project  https://review.opendev.org/67753803:25
*** jamesmcarthur has quit IRC03:26
fungiguilhermesp: it could be up to an hour from when that merged until the repository exists in gerrit/gitea but after that it should be safe to recheck the corresponding governance change03:27
guilhermespthanks for the time perspective fungi I was going to wait a bit, but is late here hahaha tomorrow morning I will do it. But if you would be around, feel free to recheck it too!03:30
fungisure, happy to03:31
guilhermespnice thanks!03:31
*** igordc has joined #openstack-infra03:33
*** psachin has joined #openstack-infra03:33
*** janki has joined #openstack-infra03:41
*** diga has joined #openstack-infra03:57
*** hongbin has quit IRC04:06
AJaegerinfra-root, Zuul shows a config error with "04:09
AJaeger  Gerrit error executing git-upload-pack openstack/openstack-ansible-os_murano"04:09
fungiit just got created04:11
fungimay be a race in zuul configuration getting applied sooner than manage-projects runs04:11
AJaegerah, thanks!04:13
AJaegerinfra-root, could you delete the extra directory https://docs.openstack.org/python-tripleoclient/stable/stein/ from our servers, please? I just checked and all looks fine, so https://docs.openstack.org/python-tripleoclient/stein/ was just updated04:27
openstackgerritMerged opendev/base-jobs master: mirror-info: add ubuntu-ports  https://review.opendev.org/67612004:30
ianwAJaeger: np, removed04:31
ianwspeaking of afs volumes, yum-puppetlabs i've unlocked and is currently running a release; it was the same problem as the others04:32
*** jaosorior has quit IRC04:34
openstackgerritIan Wienand proposed openstack/project-config master: AFS grafana : add yum-puppetlabs  https://review.opendev.org/67760104:38
*** ramishra has joined #openstack-infra04:41
*** soniya29 has joined #openstack-infra04:44
AJaegerthanks, ianw04:47
*** ykarel|away has joined #openstack-infra04:48
AJaegeryeah, config error disappeared - thanks, fungi04:53
AJaegerconfig-core, we now can use promote for the infra jobs, please review https://review.opendev.org/67754004:57
*** kopecmartin|off is now known as kopecmartin05:08
*** udesale has joined #openstack-infra05:10
*** udesale has quit IRC05:14
*** raukadah is now known as chkumar|rover05:17
*** odicha has joined #openstack-infra05:19
*** dave-mccowan has quit IRC05:21
*** odicha has quit IRC05:23
*** soniya29 has quit IRC05:24
*** odicha has joined #openstack-infra05:34
*** ykarel|away is now known as ykarel05:35
*** jaosorior has joined #openstack-infra05:49
*** dpawlik has joined #openstack-infra06:20
*** jaosorior has quit IRC06:21
*** igordc has quit IRC06:30
*** dciabrin has joined #openstack-infra06:30
*** ianychoi has quit IRC06:34
*** ianychoi has joined #openstack-infra06:34
openstackgerritAndreas Jaeger proposed openstack/project-config master: Add starlingx promote jobs  https://review.opendev.org/67764706:41
openstackgerritAndreas Jaeger proposed openstack/project-config master: Add starlingx promote jobs  https://review.opendev.org/67764706:44
openstackgerritIan Wienand proposed openstack/project-config master: AFS grafana : add yum-puppetlabs  https://review.opendev.org/67760106:45
AJaegerianw: one more pasto ^06:49
AJaegerI marked initially the wrong line - got confused, sorry06:49
openstackgerritIan Wienand proposed openstack/project-config master: AFS grafana : add yum-puppetlabs  https://review.opendev.org/67760106:50
ianwAJaeger: ah, makes sense :)  thanks for checking06:50
*** kjackal has joined #openstack-infra06:53
openstackgerritJan Kubovy proposed zuul/zuul master: Overriding max. starting builds.  https://review.opendev.org/67046107:00
*** ianychoi has quit IRC07:00
*** ianychoi has joined #openstack-infra07:01
*** jaosorior has joined #openstack-infra07:04
*** trident has quit IRC07:10
*** rcernin has quit IRC07:14
openstackgerritMerged openstack/openstack-zuul-jobs master: Use promote for publish-tox-docs-infra  https://review.opendev.org/67754007:15
openstackgerritMerged openstack/project-config master: AFS grafana : add yum-puppetlabs  https://review.opendev.org/67760107:17
*** trident has joined #openstack-infra07:17
*** udesale has joined #openstack-infra07:24
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: Revert "Use promote for publish-tox-docs-infra"  https://review.opendev.org/67765607:32
AJaegerianw, frickler, tox-docs needs updating first - sorry. ^07:33
yoctozeptoanother case of no private ipv4 address: https://7ee223d0d079934adc99-de5ae15935168409da4576fce7897429.ssl.cf5.rackcdn.com/677228/2/check/kolla-ansible-centos-source/982f9e3/ara-report/ :-(07:36
yoctozeptosad it hits the retry limit07:37
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: Use promote job for infra  https://review.opendev.org/67765707:38
AJaegerso, this is the proper way ^07:38
* AJaeger will self approve the revert 677656 - and then you can review 677657 at leisure...07:39
*** rpittau|afk is now known as rpittau07:40
*** jpena|off is now known as jpena07:47
openstackgerritAndreas Jaeger proposed opendev/base-jobs master: Make names in promote job unique  https://review.opendev.org/67766207:49
*** jtomasek has joined #openstack-infra07:51
openstackgerritAndreas Jaeger proposed opendev/base-jobs master: Make names in promote job unique  https://review.opendev.org/67766207:51
*** ralonsoh has joined #openstack-infra07:52
openstackgerritMerged openstack/openstack-zuul-jobs master: Revert "Use promote for publish-tox-docs-infra"  https://review.opendev.org/67765607:55
*** e0ne has joined #openstack-infra07:56
*** zbr is now known as zbr|ooo07:56
openstackgerritFatih Degirmenci proposed opendev/glean master: Test glean gates  https://review.opendev.org/67766507:56
openstackgerritFelix Schmidt proposed zuul/zuul master: Make direct-push configurable on project-level  https://review.opendev.org/67710907:58
openstackgerritFelix Schmidt proposed zuul/zuul master: Implement push job in merger  https://review.opendev.org/67711007:58
openstackgerritFelix Schmidt proposed zuul/zuul master: Push changes in GerritReporter if direct-push is enabled  https://review.opendev.org/67711107:58
*** jaosorior has quit IRC08:00
*** lucasagomes has joined #openstack-infra08:01
openstackgerritFelix Schmidt proposed zuul/zuul master: Implement push job in merger  https://review.opendev.org/67711008:08
openstackgerritFelix Schmidt proposed zuul/zuul master: Push changes in GerritReporter if direct-push is enabled  https://review.opendev.org/67711108:08
*** jtomasek has quit IRC08:09
*** dtantsur|afk is now known as dtantsur08:11
*** pkopec has joined #openstack-infra08:12
*** jtomasek has joined #openstack-infra08:15
*** gfidente|afk is now known as gfidente08:15
*** ykarel is now known as ykarel|lunch08:15
cgoncalveshey Infra folks! this patch https://review.opendev.org/#/c/676120/ broke Octavia gate08:18
cgoncalveshttps://object-storage-ca-ymq-1.vexxhost.net/v1/86bbbcfa8ad043109d2d7af530225c72/logs_55/639155/8/check/octavia-v2-dsvm-scenario/175ad61/controller/logs/dib-build/amphora-x64-haproxy.qcow2_log.txt.gz08:18
cgoncalves /etc/ci/mirror_info.sh: line 54: NODEPOOLMIRROR_HOST: unbound variable08:18
rm_work:(08:18
rm_workyeah, missing underscore08:18
cgoncalvesright. unless someone is pushing a patch right now I'll do it08:19
rm_worki assume you should just do it08:19
rm_workthen more of them can +2 :)08:19
AJaegerrm_work: can you send a patch?08:20
AJaegercgoncalves: ah, please go ahead...08:20
openstackgerritChandan Kumar (raukadah) proposed opendev/base-jobs master: Fixed NODEPOOL_MIRROR_HOST typo  https://review.opendev.org/67766908:25
chkumar|roverianw: AJaeger: https://review.opendev.org/#/c/677669/ can we get this merged?08:25
*** gregoryo has quit IRC08:25
chkumar|roverit broke the tripleo ci08:25
openstackgerritCarlos Goncalves proposed opendev/base-jobs master: Fix NODEPOOL_UBUNTU_PORTS_MIRROR default value  https://review.opendev.org/67767008:26
AJaegercgoncalves: chkumar|rover was faster ^08:26
AJaegercgoncalves, rm_work, chkumar|rover: I approved 67766908:26
AJaegerThanks - and sorry for the breakage!08:27
chkumar|roverAJaeger: thanks :-)08:27
cgoncalvesabandoned08:27
AJaegercgoncalves: thanks!08:29
*** tkajinam has quit IRC08:29
*** piotrowskim has joined #openstack-infra08:30
openstackgerritMerged opendev/base-jobs master: Fixed NODEPOOL_MIRROR_HOST typo  https://review.opendev.org/67766908:32
*** ianychoi has quit IRC08:32
*** ianychoi has joined #openstack-infra08:33
AJaegercgoncalves, chkumar|rover, rm_work, change is merged ^08:35
*** ociuhandu has joined #openstack-infra08:35
cgoncalvescool, thanks08:35
rm_work\o/08:45
*** elod_off is now known as elod08:47
*** Lucas_Gray has joined #openstack-infra08:51
*** yamamoto has joined #openstack-infra08:53
*** Lucas_Gray has quit IRC09:10
*** jtomasek has quit IRC09:11
*** Lucas_Gray has joined #openstack-infra09:12
*** jaosorior has joined #openstack-infra09:19
*** ykarel|lunch is now known as ykarel09:25
*** derekh has joined #openstack-infra09:26
*** yamamoto has quit IRC09:28
ykarelHi which job publishes deploy-guide to docs.openstack.org/project-deploy-guide09:51
ykarelis it publish-openstack-docs-pti or publish-deploy-guide, background:- https://review.opendev.org/#/c/67766109:52
AJaegerykarel: publish-deploy-guide09:52
ykarelramishra, ^^09:52
ykarelAJaeger, Thanks09:52
AJaegeruse this: https://docs.openstack.org/infra/openstack-zuul-jobs/project-templates.html#project_template-deploy-guide-jobs09:53
AJaegerykarel, ramishra ^09:53
ykarelAJaeger, yes proposed this only https://review.opendev.org/#/c/677661/2/.zuul.yaml09:54
ykarelbut had some doubts so confirming here09:54
*** rpittau is now known as rpittau|bbl10:14
*** sshnaidm|afk is now known as sshnaidm10:23
*** gfidente has quit IRC10:35
*** yamamoto has joined #openstack-infra10:36
*** gfidente has joined #openstack-infra10:41
*** kjackal has quit IRC10:43
*** jchhatbar has joined #openstack-infra10:47
openstackgerritRoman Gorshunov proposed openstack/openstack-zuul-jobs master: Fix propose-translation-update job failure on wrong po format  https://review.opendev.org/67769610:47
*** janki has quit IRC10:49
*** ociuhandu has quit IRC10:49
jrosserdoes anyone know what this means? https://zuul.opendev.org/t/openstack/build/4867c6cab168436e92db7753836c3a5d/log/job-output.txt#294710:53
jrosser /etc/ci/mirror_info.sh: line 54: NODEPOOLMIRROR_HOST: unbound variable10:54
chkumar|roverjrosser: it got fixed10:54
jrosserooooh - recheck then10:54
chkumar|roverjrosser: https://review.opendev.org/#/c/677669/10:54
*** yamamoto has quit IRC10:55
pabelangerI guess we didn't test that in base-test first?10:56
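
The `NODEPOOLMIRROR_HOST: unbound variable` failure discussed above is the kind of typo a pre-merge check can sometimes catch. The following is a rough sketch in Python, assuming a hypothetical helper `possibly_unbound` and a made-up two-line sample script (not the real contents of `/etc/ci/mirror_info.sh`). It flags variable references that are never assigned in the script and carry no `${VAR:-default}` fallback. Real shell parsing is far more involved than these regular expressions, so treat it as an illustration only.

```python
import re

# Matches "NAME=" or "export NAME=" at the start of a line.
ASSIGN_RE = re.compile(r"^\s*(?:export\s+)?([A-Za-z_][A-Za-z0-9_]*)=", re.MULTILINE)
# Matches "$NAME" or "${NAME", plus an optional default operator (:- - := =).
REF_RE = re.compile(r"\$(\{)?([A-Za-z_][A-Za-z0-9_]*)(:?[-=])?")


def possibly_unbound(script_text, known=()):
    """Return names referenced without an assignment or a default value.

    'known' lists variables assumed to come from the environment.
    """
    assigned = set(ASSIGN_RE.findall(script_text)) | set(known)
    suspects = set()
    for match in REF_RE.finditer(script_text):
        brace, name, operator = match.groups()
        if brace and operator:
            # ${NAME:-default} style references are safe under "set -u".
            continue
        if name not in assigned:
            suspects.add(name)
    return suspects


# Hypothetical sample reproducing the missing-underscore typo.
SAMPLE = r"""
NODEPOOL_MIRROR_HOST=${NODEPOOL_MIRROR_HOST:-mirror.example.org}
export NODEPOOL_UBUNTU_PORTS_MIRROR=${NODEPOOL_UBUNTU_PORTS_MIRROR:-http://$NODEPOOLMIRROR_HOST/ubuntu-ports}
"""

if __name__ == "__main__":
    print(sorted(possibly_unbound(SAMPLE)))
```

Run against the sample, the helper reports only the misspelled name, because the correctly spelled `NODEPOOL_MIRROR_HOST` is assigned on the first line.
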
*** owalsh is now known as owalsh|away11:08
*** ociuhandu has joined #openstack-infra11:20
*** kjackal has joined #openstack-infra11:23
roman_gAJaeger: can't see change in i18n :)11:23
*** ociuhandu has quit IRC11:25
*** udesale has quit IRC11:26
*** udesale has joined #openstack-infra11:27
AJaegerroman_g: nobody will be able to find the warning you add. And your implementation is broken, it REMOVES files.11:27
*** dayou has quit IRC11:28
*** yikun has joined #openstack-infra11:28
AJaegerianychoi: any feedback on https://review.opendev.org/677696 ? I'm completely opposed since I think it makes the situation worse (if the implementation would do what it should do)11:28
roman_gAJaeger: yeah, I saw that I actually remove whole translation. Will update PS.11:29
AJaegerroman_g: I still don't like the approach since it is impossible to find those WARNINGS.11:30
roman_g> Let's fix the i18n repo to warn about "Leading/trailing newline (\n)" - so that translators see that specific error. I just did that ;)11:35
roman_gAJaeger: where?11:35
roman_gcould i have a look?11:35
*** rh-jelabarre has joined #openstack-infra11:37
*** jpena is now known as jpena|lunch11:39
*** dayou has joined #openstack-infra11:39
AJaegerroman_g: I just updated translate.openstack.org - that's an option in the project and it was not enabled. It's there now..11:47
AJaegerI guess that's something for admins only ( ianychoi and myself are admins)11:47
*** jamesmcarthur has joined #openstack-infra11:49
*** rlandy has joined #openstack-infra11:50
*** rlandy is now known as rlandy|ruck11:50
roman_gAJaeger: I actually saw that error in old UI. But I was working in new UI, and it was not showing there, thus I did this mistype with additional newlines11:52
AJaegerroman_g: you got a warning - now it's an ERROR ;)11:53
AJaegerand AFAIK zanata will not export ERROR strings ;)11:53
roman_gAJaeger: cool. so it shouldn't let me save translation if format in translation differs?11:53
AJaegernot sure - but AFAIK the import would ignore that entry11:55
AJaegeryou would need to experiment to be sure ;)11:55
* AJaeger will be offline for some time now11:56
*** rlandy|ruck is now known as rlandy|ruck|mtg11:58
roman_gAJaeger: works very good. Thanks! Need to have the same ERROR for other possible errors, e.g. "Unexpected variable: %s" and similar11:58
*** udesale has quit IRC12:02
*** jamesmcarthur has quit IRC12:07
*** jamesmcarthur has joined #openstack-infra12:08
*** rpittau|bbl is now known as rpittau12:10
fdegirinfra-core: are you aware of issues with glean gates?12:13
fdegirinfra-core: opensuse job for this change https://review.opendev.org/#/c/652238/ fails but the failure is not related to what changed as I tried a dummy change with no code impact and it failed for it too?12:14
*** Lucas_Gray has quit IRC12:20
*** jamesmcarthur has quit IRC12:28
*** larainema has quit IRC12:29
*** udesale has joined #openstack-infra12:32
*** rfolco has quit IRC12:33
*** sgw has quit IRC12:33
*** jchhatba_ has joined #openstack-infra12:39
*** rfolco has joined #openstack-infra12:39
*** jpena|lunch is now known as jpena12:40
*** jchhatbar has quit IRC12:42
*** jchhatba_ has quit IRC12:43
*** larainema has joined #openstack-infra12:43
*** rlandy|ruck|mtg is now known as rlandy|ruck12:43
*** ociuhandu has joined #openstack-infra12:46
*** ociuhandu has quit IRC12:47
*** ociuhandu has joined #openstack-infra12:47
AJaegerroman_g: there are some more options - best discuss with ianychoi...12:49
AJaegerdirk, evrardjp , can you help fdegir , please?12:50
*** jamesmcarthur has joined #openstack-infra12:50
guilhermespbtw fungi is this the correct hold ssh root@162.242.235.61 ? Seems that the uptime of the instance is 22 days12:53
*** sgw has joined #openstack-infra12:53
guilhermespand it is a xenial server12:53
guilhermespjust as a reminder12:53
guilhermespthat's the autohold12:53
guilhermespzuul autohold --tenant openstack --project openstack/openstack-ansible-os_heat --job openstack-ansible-deploy-aio_distro_metal-ubuntu-bionic --change 672948 --reason "guilhermesp debugging mysterious failure which won't reproduce locally" --count 112:53
*** aaronsheffield has joined #openstack-infra12:54
guilhermespbut seems that this is not the correct hold maybe12:54
guilhermespis there any kind of caching of old instances and maybe it is pointing to an old instance that is shown as deleted but actually still exists?12:54
guilhermespalso, I can't see something related to the openstack-ansible deployment in there so... i think that this might be a wrong instance12:55
*** eharney has joined #openstack-infra12:55
*** ociuhandu has quit IRC13:00
*** sthussey has joined #openstack-infra13:10
fricklerguilhermesp: yes that looks like the wrong instance, 104.130.127.203 seems to be yours13:10
mordredinfra-root: I'm going to be basically out again today - sorry for the lack of warning. we're in the transition between locations for the first time and it's turned out to need much more time than we budgeted.13:10
fricklerguilhermesp: can you point me to your ssh key then I can set it up for you13:11
guilhermespsure frickler a sec13:11
guilhermespthere it is https://github.com/guilhermesteinmuller.keys13:11
fricklerguilhermesp: ok you should have access now13:14
guilhermespcool frickler Im in13:14
guilhermesp:)13:14
mnaserwhen is the next rename scheduled?13:16
mnaserhttps://review.opendev.org/#/c/669298/ -- this has been sitting around for 1.5 months now, it's only because it's also blocking a governance change that's been up as long13:17
*** tesseract has joined #openstack-infra13:22
*** tesseract has quit IRC13:22
AJaegerconfig-core, could you review https://review.opendev.org/#/c/677547/ and https://review.opendev.org/677657 , please?13:24
*** Lucas_Gray has joined #openstack-infra13:29
fungiguilhermesp: sorry about that! i must have mis-copied from another hold someone hadn't cleaned up yet13:37
*** mriedem has joined #openstack-infra13:39
*** eharney has quit IRC13:40
*** Goneri has joined #openstack-infra13:42
*** zbr|ooo is now known as zbr13:47
*** udesale has quit IRC13:47
fungifdegir: i don't know much specifically about opensuse, but i think the error may be related to a missing etc/resolv.conf (or missing etc entirely?) in the chroot: https://zuul.opendev.org/t/openstack/build/707eff03f5b2484b958a1837a5b08264/log/nodepool/builds/test-image-0000000001.log#357913:48
mnaserquestion - has anyone ever wanted a ci node *without* unbound?13:48
mnaseri'm having issues deploying k8s in ci because coredns seems to have some loop protection (it grabs whatever is in /etc/resolv.conf as forwarders)13:48
fungifdegir: this may be a recent regression, all our tumbleweed images in production are over 1.5 days old now too so should have been rebuilt more recently13:48
mnaserand given that it takes 127.0.0.1 -- it configures itself to forward to that address and then turns off to avoid a loop13:49
fungimnaser: you could probably uninstall unbound, but then you'll have no local caching (unless kubernetes provides that functionality for you)13:49
mnaserim just trying to think from a software pov on how to best workaround this13:52
fungifdegir: hrm, though our production tumbleweed image builds seem to be failing on a missing package in our mirror instead:13:52
fungi2019-08-21 13:36:10.291 | File './x86_64/openSUSE-release-ftp-20190815-227.1.x86_64.rpm' not found on medium 'http://mirror.dfw.rax.openstack.org/opensuse/tumbleweed/repo/oss/'13:52
*** ykarel is now known as ykarel|afk13:53
fungimnaser: i think we must have a workaround somewhere because there are other jobs installing kubernetes on test nodes, and i recall this challenge coming up13:53
*** bhavikdbavishi has quit IRC13:54
mnaseri think if we actually just set the /etc/resolv.conf servers to $host_ip it might work around it13:54
guilhermespfungi: no worries!13:55
mnaserbecause then when it slurps it on start up, it won't be looping13:55
fungimnaser: if unbound is listening on that interface i guess, and if iptables isn't preventing stuff on the node from reaching it13:55
*** bnemec has joined #openstack-infra13:55
* mnaser is just trying to take into consideration people that might do the same thing inside the software13:56
mnaserso not just solve it for zuul but working around for users who might be doing the same thing13:56
fungiright, i haven't personally been very involved in the way any of the kubernetes testing jobs are designed, so off the top of my head i don't know what workaround(s) got implemented13:57
*** ykarel|afk has quit IRC14:02
Shrewswe use minikube, iirc, in our k8s test jobs in nodepool (so not a production k8s install). not sure if that makes any difference14:04
clarkbmnaser: the workaround others have used is to read unbound's config and configure other resolvers to use that same config or in the case of docker just always use google dns14:05
clarkbI dont think you should disable unbound as the host still needs dns14:05
clarkblxc is the best when it comes to this imo14:06
clarkbthey set up a dnsmasq to forward to host resolver if it is there14:06
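
The failure mode mnaser describes (a resolver inheriting `127.0.0.1` from `/etc/resolv.conf` as its forwarder and then refusing to forward to itself) can be checked for before deploying. This is a minimal Python sketch, assuming hypothetical helper names `nameservers` and `loopback_forwarders`. It is not CoreDNS's actual loop detection, only a pre-flight check on the text of a resolv.conf-style file.

```python
import ipaddress


def nameservers(resolv_conf_text):
    """Return the addresses on 'nameserver' lines, skipping comments."""
    servers = []
    for line in resolv_conf_text.splitlines():
        line = line.strip()
        if not line or line[0] in "#;":
            continue
        fields = line.split()
        if len(fields) >= 2 and fields[0] == "nameserver":
            servers.append(fields[1])
    return servers


def loopback_forwarders(resolv_conf_text):
    """Return the nameservers that point back at the local host."""
    loopbacks = []
    for server in nameservers(resolv_conf_text):
        try:
            if ipaddress.ip_address(server).is_loopback:
                loopbacks.append(server)
        except ValueError:
            # Not a plain IP address; ignore it in this sketch.
            continue
    return loopbacks


if __name__ == "__main__":
    # A node running a local caching resolver such as unbound.
    print(loopback_forwarders("nameserver 127.0.0.1\n"))
```

A non-empty result suggests that anything copying this file's forwarders into a resolver listening on the same host may end up forwarding to itself.
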
fungidirk: evrardjp: i'm not quite sure what to make of the tumbleweed image build failures we're seeing in production... seems zypper is looking for opensuse/tumbleweed/repo/oss/x86_64/openSUSE-release-ftp-20190815-227.1.x86_64.rpm on our mirrors, but we mirror from http://mirror.us.leaseweb.net/opensuse/tumbleweed/repo/oss/x86_64/ and i don't see it listed there either14:07
fungilooks like we've been unable to build tumbleweed images for at least 12 hours14:07
dirkfungi: we're currently renaming the opensuse products, some fallout is expected I guess14:07
dirkI'll take a closer look after meeting madness ( ~ 3 hours)14:08
fungiokay, should we disable our ci jobs which test building tumbleweed too?14:08
dirkno14:08
dirkwe're just going to lose the opensuse prefix (so distros will be called leap and tumbleweed)14:08
fungiseeing a different failure for those very recently and can't merge glean changes because the job that tests we can build a tumbleweed image with dib breaks14:08
Shrewsdid opensuse-423 get fixed?14:09
dirkcan you point me to the review that you need?14:09
clarkbShrews: I think 42 is being removed14:09
clarkbfungi what glean change?14:09
fungidirk: https://review.opendev.org/652238 is failing dib-nodepool-functional-openstack-opensuse-tumbleweed-src because it can't build a tumbleweed image14:10
Shrewsclarkb: i put up a change to remove it, but AJaeger pointed me to a change to fix it14:10
dirkShrews: the 423 thing should be fixed by https://review.opendev.org/#/c/677188/ but it looks like I need to reiterate this14:10
Shrewsdirk: *nod*14:11
fungidirk: though the failure there seems to be about a missing etc/resolv.conf in the chroot dib is building14:11
fungidirk: https://zuul.opendev.org/t/openstack/build/707eff03f5b2484b958a1837a5b08264/log/nodepool/builds/test-image-0000000001.log#357914:11
dirkcertainly anything using opensuse-423 should be replaced by opensuse-1514:11
fungidirk: the closer i look at that error, the more i suspect etc is entirely missing in the chroot even14:12
fungiso there may be something breaking earlier in the image build still14:12
clarkbfungi: fwiw I feel like we've tested that sync behavior and its been fine?14:13
clarkbI think when python exits it flushes? maybe the early close affects that?14:13
*** rfolco is now known as not_rlandy14:13
clarkbthat was a suspected issue when we had the fn ipv6 issues14:14
clarkbbut ruled out14:14
*** not_rlandy is now known as folco14:14
*** folco is now known as rfolco14:14
fungiclarkb: yeah, i'm not sure under what circumstances fdegir is seeing it14:14
openstackgerritDmitriy Rabotyagov (noonedeadpunk) proposed openstack/project-config master: Add uwsgi role to gerritbot  https://review.opendev.org/67773814:14
*** portdirect has quit IRC14:14
*** sdoran has quit IRC14:14
*** coreycb has quit IRC14:14
*** eharney has joined #openstack-infra14:15
*** aprice has quit IRC14:15
*** dougwig has quit IRC14:15
*** kmalloc has quit IRC14:15
*** evgenyl has quit IRC14:15
*** petevg has quit IRC14:15
*** tiffanie has quit IRC14:15
*** sthussey has quit IRC14:16
*** mordred has quit IRC14:16
*** _Cyclone_ has quit IRC14:16
*** zxiiro has quit IRC14:16
*** cmurphy has quit IRC14:16
*** sparkycollier has quit IRC14:16
*** jamespage has quit IRC14:16
*** philroche has quit IRC14:16
*** jbryce has quit IRC14:16
*** mgagne has quit IRC14:16
*** csatari has quit IRC14:16
*** aaronsheffield has quit IRC14:16
*** logan- has quit IRC14:16
*** crodriguez has quit IRC14:16
*** irclogbot_2 has quit IRC14:17
*** adriancz has joined #openstack-infra14:17
*** _Cyclone_ has joined #openstack-infra14:17
*** logan_ has joined #openstack-infra14:17
*** mgagne has joined #openstack-infra14:17
*** irclogbot_2 has joined #openstack-infra14:17
*** dougwig has joined #openstack-infra14:18
*** logan_ is now known as logan-14:18
*** tiffanie has joined #openstack-infra14:18
*** portdirect has joined #openstack-infra14:18
*** coreycb has joined #openstack-infra14:18
*** sthussey has joined #openstack-infra14:19
*** crodriguez has joined #openstack-infra14:19
*** sparkycollier has joined #openstack-infra14:19
*** philroche has joined #openstack-infra14:19
*** aaronsheffield has joined #openstack-infra14:19
*** kmalloc has joined #openstack-infra14:19
*** aprice has joined #openstack-infra14:19
*** csatari has joined #openstack-infra14:19
*** sdoran has joined #openstack-infra14:19
*** rosmaita has joined #openstack-infra14:19
*** evgenyl has joined #openstack-infra14:19
*** jbryce has joined #openstack-infra14:19
*** Jeffrey4l_ has quit IRC14:19
*** jamespage has joined #openstack-infra14:19
*** zxiiro has joined #openstack-infra14:20
*** Jeffrey4l has joined #openstack-infra14:21
*** mordred has joined #openstack-infra14:26
*** petevg has joined #openstack-infra14:26
*** ykarel|afk has joined #openstack-infra14:27
*** ykarel|afk is now known as ykarel14:28
*** rlandy|ruck is now known as rlandy|ruck|mtg14:29
mnaserhmm14:35
mnaserwe have the systemd-resolved service running in our ci images14:36
mnaseryet we kinda override /etc/resolv.conf directly14:36
*** _Cyclone_ has quit IRC14:36
*** jeliu_ has joined #openstack-infra14:37
*** _Cyclone_ has joined #openstack-infra14:39
*** ociuhandu has joined #openstack-infra14:39
*** chkumar|rover is now known as raukadah14:40
*** mattw4 has joined #openstack-infra14:40
*** jaosorior has quit IRC14:41
mnaserso i think we should configure systemd-resolved to point towards it instead?14:42
sshnaidmclarkb, is it ok that in some logs storage logs files are not gzipped? https://storage.gra1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/logs_70/664170/13/check/tripleo-ci-centos-7-scenario012-multinode-oooq-container/c59c8dc/14:43
sshnaidmclarkb, like job-output.txt instead of job-output.txt.gz14:43
clarkbmnaser: I believe systemd-resolvd already does the correct thing in this case (it will ask unbound if it gets a request over dbus iirc)14:44
mnaserclarkb: afaik usually what happens is /etc/resolv.conf should point to 127.0.0.53 and then systemd-resolved will do all the magic it needs14:44
mnaserand then /run/systemd/resolve/resolv.conf is the 'content' -- right now it seems like its overridden14:44
clarkbsshnaidm: they are compressed in transit and storage by openstacksdk and swift aiui14:44
clarkbsshnaidm: if you wget the file you get it back in deflate encoding14:45
sshnaidmclarkb, ok14:45
*** rlandy|ruck|mtg is now known as rlandy|ruck14:45
clarkbmnaser: well we dont want systemd resolvd to be the primary resolver14:46
mnaserright, in that case i believe the service is doing nothing?14:46
clarkbperfect :)14:46
mnaserso we should probably shut it down then to avoid confusion in ci (or configure it, and point it towards unbound as a forwarder)14:46
openstackgerritJeff Liu proposed zuul/zuul-operator master: Add PerconaXDB Cluster to Zuul-Operator  https://review.opendev.org/67731514:46
mnaser(as someone building deployment tooling against nodepool vms, its a "broken scenario" if systemd-resolved is running but /etc/resolv.conf isn't pointing to it)14:47
*** cmurphy has joined #openstack-infra14:48
clarkbwhy? isnt resolv.conf canonical?14:49
clarkbthat is what libc will look at14:49
clarkbI guess if you want dns over dbus it might be confusing if systemd resolvd doesnt look at resolv.conf (I thought it did)14:49
*** raissa has joined #openstack-infra14:49
mgoddardHi there infra, is there a gerrit restart planned? I'm waiting on a project rename (x/kayobe -> openstack/kayobe)14:50
clarkbmgoddard: I had been expecting more rename requests after the opendev reorg so was waiting. But we continue to not have those so we should probably go ahead and schedule something with the list we do have14:52
mgoddardclarkb: would be appreciated :)14:52
pabelangermnaser: clarkb: I want to say we did stop systemd-resolve before, but reverted it14:53
mnaserclarkb: resolv.conf should be a symlink to /run/systemd/resolve/resolv.conf when using systemd-resolved14:54
mnaserwhich will include all the forwarders configured in systemd-resolved there14:54
mnaserhttps://www.irccloud.com/pastebin/WhfQ4ScW/14:55
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Add caching of autohold requests  https://review.opendev.org/66341214:55
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Add autohold-info CLI command  https://review.opendev.org/66248714:55
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Record held node IDs with autohold request  https://review.opendev.org/66249814:55
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Auto-delete expired autohold requests  https://review.opendev.org/66376214:55
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Mark nodes as USED when deleting autohold  https://review.opendev.org/66406014:55
mnaserif we make it a symlink, then the behaviour won't change, but at least systemd-resolved will be doing the right thing(tm)14:55
clarkbyour paste looks exactly as I expected14:55
mnaserimho /etc/resolv.conf should be a symlink to /run/systemd/resolve/resolv.conf14:56
clarkblibc will resolve against unbound anything that goes to systemd resolvd will too (dbus)14:56
pabelangermnaser: https://opendev.org/openstack/project-config/src/branch/master/nodepool/elements/nodepool-base/finalise.d/89-unbound#L20314:56
clarkbmnaser: why? systemd-resolved is not the primary resolver14:56
clarkbit is largely there for dbus I think14:56
mnaserright, but if you see the 'forwarder' for systemd-resolved to be 1.1.1.1 (or whatever), then /etc/resolv.conf will update to that value14:57
clarkbbut we dont want that14:57
mnaserso if you set the systemd-resolved forwarder to 127.0.0.1 (aka unbound) then the resolv.conf will update automatically to point to it14:57
clarkbyou should make that change the other way around14:57
mnaserbut the behaviour will be correct14:57
AJaegerconfig-core, could you review https://review.opendev.org/#/c/677547/ and https://review.opendev.org/677657 , please? Two more changes for promote...14:57
clarkbyou update unbound's config and have systemd-resolved resolve against unbound14:58
mnaseri agree with you14:58
mnaserif we symlink /etc/resolv.conf to /run/systemd/resolve/resolv.conf ... nothing will change, but at least if we change forwarders in systemd-resolved, it will update resolv.conf correctly14:58
clarkbI dont understand that14:58
clarkbwhy would we change forwarders in systemd-resolved?14:59
clarkbwe use unbound as a forwarding caching resolver14:59
mnaserthe /run/systemd/resolve/resolv.conf file is a dynamically generated file that contains a list of all the 'forwarders' or 'dns' servers configured in systemd-resolved14:59
fdegirfungi: clarkb: fyi - glean opensuse job fails for a change that has no code impact: https://review.opendev.org/#/c/677665/14:59
mnaserif you tell systemd-resolved to use 127.0.0.1, it will update /run/systemd/resolve/resolv.conf with 'nameserver 127.0.0.1'14:59
mnaserif you tell it to use 1.1.1.1, it will update /run/systemd/resolve/resolv.conf with 'nameserver 1.1.1.1'14:59
mnaserso, if we do the symlink, and systemd-resolved is configured to use 127.0.0.1 -- nothing will change, the resolv.conf file will simply be dynamic instead of static15:00
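The static-versus-dynamic distinction mnaser describes here can be sketched in Python. This is only an illustration under assumptions: the helper names `read_nameservers` and `describe_resolv_conf` are made up for this example, and the demo works on throwaway files in a temporary directory rather than the real /etc/resolv.conf or /run/systemd/resolve/resolv.conf.

```python
import os
import tempfile


def read_nameservers(path):
    """Return the addresses on 'nameserver' lines of a resolv.conf-style file."""
    servers = []
    with open(path) as handle:
        for line in handle:
            fields = line.split()
            if len(fields) >= 2 and fields[0] == "nameserver":
                servers.append(fields[1])
    return servers


def describe_resolv_conf(path):
    """Report whether path is a symlink or a regular file, plus its nameservers."""
    kind = "symlink" if os.path.islink(path) else "static file"
    return kind, read_nameservers(path)


def demo():
    """Show that a symlinked file tracks its target while a static copy does not."""
    with tempfile.TemporaryDirectory() as tmp:
        # Hypothetical stand-ins for the generated file, a static copy, and a symlink.
        generated = os.path.join(tmp, "generated-resolv.conf")
        static = os.path.join(tmp, "static-resolv.conf")
        linked = os.path.join(tmp, "linked-resolv.conf")

        for target in (generated, static):
            with open(target, "w") as handle:
                handle.write("nameserver 127.0.0.1\n")
        os.symlink(generated, linked)
        before = (describe_resolv_conf(static), describe_resolv_conf(linked))

        # Simulate a resolver change: only the generated file is rewritten.
        with open(generated, "w") as handle:
            handle.write("nameserver 1.1.1.1\n")
        after = (describe_resolv_conf(static), describe_resolv_conf(linked))
        return before, after


if __name__ == "__main__":
    print(demo())
```

While both files hold the same address the two setups are indistinguishable to a reader, which is mnaser's point; they only diverge once the generated file is rewritten, which is clarkb's concern.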
clarkbbut we want it static15:00
clarkbits doing exactly what we want right now15:00
mnaserit will continue to be static. until someone like me needs to change it because it doesn't work for me15:01
mnaserthen i need to update both /etc/resolv.conf and systemd-resolved.15:01
clarkbwhy do you need to change it?15:01
mnaserbecause kubeadm deployments pull in /etc/resolv.conf into the pods15:01
mnaserand so the coredns pod goes like 'oh no i am looping'15:01
mnaserbecause /etc/resolv.conf contains 127.0.0.1 and then it tries to do lookups on itself15:01
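A playbook or pre-flight check could spot the condition behind the loop before handing a resolv.conf to pods. The sketch below is an assumption-laden illustration, not anything kubeadm or coredns ships: `loopback_nameservers` is a hypothetical helper that takes the file's text and returns the nameserver entries that point at a loopback address.

```python
import ipaddress


def loopback_nameservers(resolv_conf_text):
    """Return nameserver entries that are loopback addresses (127.0.0.0/8 or ::1)."""
    found = []
    for line in resolv_conf_text.splitlines():
        fields = line.split()
        if len(fields) < 2 or fields[0] != "nameserver":
            continue  # skips comments, search/options lines, and blank lines
        # Drop any IPv6 zone suffix such as "%eth0" before parsing.
        address = fields[1].split("%", 1)[0]
        try:
            if ipaddress.ip_address(address).is_loopback:
                found.append(fields[1])
        except ValueError:
            continue  # not a valid IP address; ignore the entry
    return found


if __name__ == "__main__":
    sample = "# local cache\nnameserver 127.0.0.1\nnameserver 1.1.1.1\n"
    print(loopback_nameservers(sample))
```

A non-empty result means a process in a fresh network namespace that copies this file would end up querying its own localhost rather than the host's resolver.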
clarkbaiui you configure that on the k8s side15:01
mnaserand im trying to build ansible playbooks that operate under the assumption that a system is _properly_ configured15:01
clarkbyou say dns is at $server15:02
clarkbit is properly configured15:02
mnaserin this case, a running systemd-resolved service + static /etc/resolv.conf is a combo that doesnt make sense15:02
mnaser_literally_ nothing changes if we setup a symlink15:02
mnaseri dont see why its an issue15:02
mnaserits how distros ship by default15:02
mnaserthe only difference will be the file will be dynamically created by systemd-resolved instead of us manually plopping it down15:02
mnaserthe same exact content15:02
mnasersee https://www.irccloud.com/pastebin/To0qKIVM/15:03
clarkblets back up15:03
clarkbcan we accept it is perfectly normal to use a local resolver for caching purposes? in which case a resolv.conf containing localhost is fine?15:03
mnaseryes, that's the behaviour i want to keep15:03
*** jamesmcarthur has quit IRC15:04
clarkbgreat. Then it is up to anything using namespacing to deal with the fact that a new network namespace's localhost is not necessarily going to have access to that local resolver?15:04
mnasercorrect. in my case, i'd want to change the host DNS so that it propagates properly right off the bat15:04
*** bobh has joined #openstack-infra15:05
clarkbmnaser: well you want to bypass host dns and configure remote resolvers directly15:05
clarkbeg 1.1.1.115:05
*** jtomasek has joined #openstack-infra15:05
mnaserin my playbook, i'd have two possible paths: a) systemd-resolved is not running, /etc/resolv.conf is configured statically -- so i will override /etc/resolv.conf15:05
mnaseror b) systemd-resolved *is* being used and running, /etc/resolv.conf is configured dynamically, so i reconfigure systemd-resolved15:05
mnaserbut the case we have in nodepool vms is systemd-resolved *is* running, but /etc/resolv.conf is configured statically15:06
clarkbok that helps. I think you are neglecting that b) has two subcases15:06
clarkb1) systemd-resolved is primary resolver and 2) systemd-resolved is simply part of chain to primary resolver15:06
clarkbyou want everyone to run systemd-resolved in the 1) case15:07
clarkbbut we run it in the 2) case15:07
clarkbbecause unbound is a better resolver15:07
*** priteau has joined #openstack-infra15:07
*** Lucas_Gray has quit IRC15:07
mnaseri think you can tell systemd-resolved to do nothing more than 'point' the system to the resolvers it's given (which is where i was thinking at)15:07
mnaserso the systemd-resolved would be configured with 127.0.0.1 and it will dynamically generate a resolv.conf file with 'nameserver 127.0.0.1'15:08
mnaserand in that case, it is _not_ acting as a dns server15:08
mnaserjust a /etc/resolv.conf-igurator15:08
clarkbmnaser: that is basically what we have right now15:08
clarkbmnaser: what you are suggesting is to bypass unbound15:08
clarkbif I'm understanding how this will make the containers work15:09
mnaserok so15:09
clarkbbasically you want to tell systemd-resolved that 1.1.1.1 is now the resolver which will write that to /etc/resolv.conf15:09
clarkbthen containers will use 1.1.1.115:09
mnaser"systemd-resolved maintains the /run/systemd/resolve/stub-resolv.conf file for compatibility with traditional Linux programs. This file may be symlinked from /etc/resolv.conf. This file lists the 127.0.0.53 DNS stub (see above) as the only DNS server."15:09
mnaser"systemd-resolved maintains the /run/systemd/resolve/resolv.conf file for compatibility with traditional Linux programs. This file may be symlinked from /etc/resolv.conf and is always kept up-to-date, containing information about all known DNS servers."15:09
mnaseri dont want us to use stub-resolv.conf15:10
mnaserbut if we use the second method, then yes, we can configure systemd-resolved to use 127.0.0.1 and the behaviour won't change15:11
mnaser"Note that /run/systemd/resolve/resolv.conf should not be used directly by applications, but only through a symlink from /etc/resolv.conf. If this mode of operation is used local clients that bypass any local DNS API will also bypass systemd-resolved and will talk directly to the known DNS servers."15:11
*** Lucas_Gray has joined #openstack-infra15:11
clarkbhow does that help your container case? it will be equivalent to today with 127.0.0.115:11
mnaseri can update systemd-resolved to point to $some_other_resolver and everything will work properly15:12
clarkbmnaser: and /etc/resolv.conf will contain for example 1.1.1.1 right?15:12
clarkb(because that is where containers are reading the info)15:12
mnaserin this case kubeadm is smart enough to actually read /run/systemd/resolve/resolv.conf and pull resolvers from there15:13
mnaserif it sees systemd-resolved is running15:13
clarkband that is the file you want to symlink /etc/resolv.conf to15:14
clarkbso /etc/resolv.conf will also have 1.1.1.1 in it15:14
clarkbthis is what I would like to avoid15:14
mnaserright, but im not asking to change default behaviour, by default /etc/resolv.conf _will_ have 127.0.0.115:14
clarkbmnaser: sure but it's still a problem for your running jobs for a couple reasons15:14
mnaserits only if i change systemd-resolved settings, i can assume /etc/resolv.conf will have the right value (as it would be in a _real_ deployment i guess)15:15
clarkbbasically in your jobs the entire host will stop using the dns resolver we've setup to add caching and ensure we are resolving against the correct IPs for the current cloud region15:15
fungireminder: i'm disappearing for a few hours to run errands on another island, but should be back in time for the storyboard meeting15:16
clarkbwe've had to be very careful particularly where NAT is involved to do this otherwise your dns requests start failing a lot15:16
mnaserso what would be a good solution in my case if i want to continue to use the on-host unbound15:16
clarkbon ipv6 only clouds we very specifically use ipv6 resolvers to ensure we don't get NAT'd to avoid problems in NAT and being blacklisted for too many requests on the remote side15:17
mnaseri dont think unbound listens on the primary ip, does it?15:17
clarkbmnaser: it does not because we don't want to be used to ddos15:17
mnaseri figured as much15:17
* mnaser thinks15:17
clarkbmnaser: what I've seen other people do are tell k8s to use the resolvers that we've configured unbound to use15:17
clarkbbecause that ensures we are avoiding NAT if we have to. You don't get caching though15:18
clarkbthe LXC solution is the best one but I'm not sure how hard it would be to implement for k8s15:18
clarkbLXC sets up a localhost to localhost port 53 bridge15:18
clarkb(its actually container network IP to host localhost not localhost to localhost)15:19
AJaegerdtroyer: could you review https://review.opendev.org/677647 for starlingx as well, please?15:19
clarkbdocker does the worst thing and tells everything to use 8.8.8.815:19
clarkbwhich is still better than forcing the entire host to use 8.8.8.815:19
mnaserclarkb: yeah.. thing is i dont want to start overriding too much of the default behaviour of kubeadm15:20
mnaserso im trying to figure out a way to make it work without it being too nodepool-opinionated15:20
mnaser(my solution to k8s in gate is to build a library that deploys against nodepool vms, then use said library in magnum but deploy against heat vms)15:21
mnaserit won't be tested with integration in magnum but we'll at least know the library works15:21
mnaserit's.. an interesting workaround for the nested virt issue that i'm toying with15:21
*** gyee has joined #openstack-infra15:22
clarkbkubeadm docs imply this is configurable15:22
clarkbbut don't directly say how15:22
mnaseri looked everywhere here but no avail https://godoc.org/k8s.io/kubernetes/cmd/kubeadm/app/apis/kubeadm/v1beta215:22
mnaserhttps://github.com/kubernetes/kubernetes/blob/efcb62abff0ee6511cb5f82b9f25d28b17d81912/cmd/kubeadm/app/phases/addons/dns/manifests.go#L305-L33015:24
* mnaser hmms15:24
clarkblooks like you supply a different corefile to coredns service15:24
clarkbproxy . /etc/resolv.conf is what it does by default but I think you change that?15:25
clarkbya looks like you use that forward directive15:25
mnaserclarkb: ok one final thing to bother you on, whats the best way of getting the best dns servers for the nodepool vm15:26
clarkbmnaser: right now reading the unbound config is probably best15:26
clarkbwe might be able to expose that in a more general way then have the unbound role consume that?15:26
mnaserso i cant use any of the zuul ansible vars? :<15:26
clarkbhttps://opendev.org/opendev/base-jobs/src/branch/master/roles/configure-unbound is what we currently use to configure unbound15:27
clarkbhttps://opendev.org/opendev/base-jobs/src/branch/master/roles/configure-unbound/tasks/main.yaml#L12-L41 and it uses that logic15:27
mnaserso i guess i can borrow that15:28
clarkbthe major thing is using ipv6 if possible though that reminds me k8s can't ipv615:28
clarkbso that may be a complete waste of effort here :/15:28
mnaserpoop15:29
clarkbin that case I would tell coredns to use 8.8.8.8 and 1.1.1.1 (and if you can only pick one do so randomly?)15:30
clarkbthen we at least distribute the potential for NAT blacklisting across a couple services15:30
clarkbhttps://opendev.org/opendev/base-jobs/src/branch/master/roles/configure-unbound/defaults/main.yaml google and cloudflare are the two services we'll distribute across by default15:31
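clarkb's suggestion (use both public resolvers, or pick one at random if only one is allowed) could be sketched as follows. This is a hedged illustration only: `forward_directive` and its `limit` parameter are hypothetical names, and the output is assumed to be pasted into the coredns Corefile in place of the default line that points at /etc/resolv.conf.

```python
import random

# The two public services named in the discussion above.
PUBLIC_RESOLVERS = ["8.8.8.8", "1.1.1.1"]


def forward_directive(resolvers, limit=None, rng=random):
    """Build a 'forward . <upstreams>' line, optionally keeping a random subset.

    limit: maximum number of upstreams to keep; None keeps all of them.
    rng:   object with a sample() method, so callers can pass a seeded
           random.Random for repeatable results.
    """
    chosen = list(resolvers)
    if not chosen:
        raise ValueError("at least one resolver is required")
    if limit is not None and limit < len(chosen):
        chosen = rng.sample(chosen, limit)
    return "forward . " + " ".join(chosen)


if __name__ == "__main__":
    print(forward_directive(PUBLIC_RESOLVERS))
    print(forward_directive(PUBLIC_RESOLVERS, limit=1))
```

Picking at random per job, rather than hard-coding one service, is what spreads the NAT'd request load across providers.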
*** mattw4 has quit IRC15:33
*** efried is now known as efried_afk15:33
clarkbyou'd think that a google backed service would support ipv6 out of the box since google is all ipv615:34
logan-i learned this week that gcp compute instances don't support ipv6 yet... also surprising15:34
clarkblogan-: I'm also told all the gcp ipv4 traffic is tunneled over ipv615:35
pabelangerlogan-: docker hub also doesn't15:35
clarkbbecause google only knows how to ipv615:35
clarkbits too bad that we don't have a more localhost version of localhost that is namespace wide15:38
clarkbthat would likely require updates to ip rfcs so no hope of that any time soon :)15:39
*** kjackal has quit IRC15:40
clarkboh coredns will do caching at least15:40
clarkbthat is good15:40
clarkbso ya the big gap there then is ipv6 only clouds and forcing all requests through NAT which is probably ok if we avoid it as much as possible, have caching, and distribute across providers15:40
*** jamesmcarthur has joined #openstack-infra15:41
mnasernow to figure out how to feed this config to kubeadm..15:43
*** tdasilva has joined #openstack-infra15:44
clarkbmnaser: https://kubernetes.io/docs/setup/production-environment/tools/kubeadm/control-plane-flags/ something like that maybe?15:45
mnaserclarkb: coredns runs as a 'deployment' on top of kubernetes so it would have to edit the config it uses by default15:46
mnaserim going through the k8s code and it seems to try and read the kube-dns config to grab that info from..15:46
mnaserhttps://github.com/kubernetes/kubernetes/blob/efcb62abff0ee6511cb5f82b9f25d28b17d81912/cmd/kubeadm/app/phases/addons/dns/dns.go#L175-L20215:47
*** mattw4 has joined #openstack-infra15:48
clarkbheh the open issue for this in k8s was closed because there was a crashloop issue in kubeadm that they decided to track instead except the fix for the crashloop issue in kubeadm was to update docs15:49
openstackgerritJeff Liu proposed zuul/zuul-operator master: Add PerconaXDB Cluster to Zuul-Operator  https://review.opendev.org/67731515:50
clarkbhttps://github.com/kubernetes/kubernetes/issues/71705#issuecomment-444128684 discussion almost gets to the point of addressing this problem then trickles off15:51
clarkbmnaser: reading more it sounds like you can get the config map of the running coredns, modify it, then apply those updates. I suppose that only works if it has working dns already?15:53
clarkbthough coredns should bootstrap without that given its place in the system15:53
mnaserclarkb: yeah and it starts feeling a little hacky at that point, plus maybe a kubeadm upgrade would override it again15:53
mnaserim thinking if i override resolv-conf though15:53
mnaserthat might just fix it for me15:53
mnaser(and it would also ensure that i don't have to worry about different hosts with different resolv.conf)15:54
clarkbto me that seems super heavy handed because now all services on the host too must use the wrong resolver15:54
*** ykarel is now known as ykarel|afk15:54
clarkb(which is the one saving grace of what docker does, it doesn't modify the host)15:54
*** mgagne has quit IRC15:54
*** mgagne has joined #openstack-infra15:55
clarkbthis makes me wonder how the nodepool jobs work15:56
*** igordc has joined #openstack-infra15:56
*** jtomasek has quit IRC15:56
*** raissa has quit IRC15:57
*** tdasilva has quit IRC15:57
*** diablo_rojo has joined #openstack-infra15:58
*** larainema has quit IRC15:59
clarkbthose jobs use minikube which should run coreDNS too16:00
clarkbseems like they are going to use localhost too if I'm reading minikube correctly. How does that work?16:03
clarkbmaybe the jobs don't resolve anything external from within the container16:05
*** rpittau is now known as rpittau|afk16:05
*** ricolin has quit IRC16:05
clarkbShrews: ^ do you know if that is the case in the nodepool functional jobs?16:07
*** markvoelker has quit IRC16:08
clarkb--extra-config=kubelet.resolv-conf= is a flag we can set on minikube to specify the addrs though16:08
Shrewsclarkb: i don't know. minikube is entirely self contained and the k8s commands "just work". nothing outside of the vm is used16:08
clarkbwhich kubeadm seems to lack16:09
clarkbShrews: ya I think that must be it then. If we ran workload that wanted to talk to say github or opendev or whatever that would fail16:09
mnaserwe can override kubelet resolv-conf apparently16:09
Shrewsclarkb: i suspect so16:09
mnaseri mean my idea was to have resolvers configured _for_ kubernetes so that my roles/playbooks don't touch the host system16:09
mnaserso something like /etc/resolv-k8s.conf or something16:10
mnaserbut seems hacky :<16:10
clarkbmnaser: I think that is how I would do it16:10
clarkbmnaser: which makes sense since k8s runs its own dns resolver16:11
clarkbnothing says it has to use the host setup16:11
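The dedicated-file idea agreed on here could look roughly like the sketch below: write a separate resolv.conf-style file for kubernetes and leave the host's /etc/resolv.conf alone. The function names are hypothetical, the path in the demo is a temporary stand-in for something like the /etc/resolv-k8s.conf mnaser mentions, and pointing the kubelet at the file (the resolv-conf override discussed above) is left out.

```python
import os
import tempfile


def render_resolv_conf(nameservers):
    """Return resolv.conf text with one 'nameserver' line per address."""
    if not nameservers:
        raise ValueError("at least one nameserver is required")
    return "".join("nameserver {}\n".format(addr) for addr in nameservers)


def write_resolv_conf(path, nameservers):
    """Write the rendered file via a temporary file and rename.

    Readers see either the old file or the complete new one, never a
    partially written file.
    """
    content = render_resolv_conf(nameservers)
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "w") as handle:
            handle.write(content)
        os.chmod(tmp_path, 0o644)  # world-readable, like a normal resolv.conf
        os.replace(tmp_path, path)
    except Exception:
        os.unlink(tmp_path)  # clean up the temporary file on any failure
        raise
    return content


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        target = os.path.join(tmp, "resolv-k8s.conf")  # hypothetical file name
        print(write_resolv_conf(target, ["8.8.8.8", "1.1.1.1"]), end="")
```

Keeping the file separate means host services continue to use the local unbound cache while only the cluster sees the remote resolvers.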
clarkbI'm going to write that change for zuul-jobs/install-kubernetes now while I'm thinking about it16:13
*** mattw4 has quit IRC16:19
*** ykarel|afk is now known as ykarel16:20
*** lucasagomes has quit IRC16:21
*** markvoelker has joined #openstack-infra16:22
*** tdasilva has joined #openstack-infra16:22
*** Lucas_Gray has quit IRC16:25
*** tdasilva has quit IRC16:27
*** armax has joined #openstack-infra16:29
*** e0ne has quit IRC16:30
*** jpena is now known as jpena|off16:30
*** dtantsur is now known as dtantsur|afk16:37
openstackgerritClark Boylan proposed zuul/zuul-jobs master: Allow for overriding dns resolvers in install-kubernetes  https://review.opendev.org/67778716:37
clarkbmnaser: Shrews ^ something like that for the minikube case16:37
*** diablo_rojo has quit IRC16:40
*** igordc has quit IRC16:41
AJaegerconfig-core, could you review https://review.opendev.org/#/c/677547/ and https://review.opendev.org/677657 and a cosmetic https://review.opendev.org/677662, please? more changes for promote...16:42
clarkbAJaeger: on it16:42
AJaegerthanks16:42
*** spsurya has quit IRC16:43
*** Garyx has quit IRC16:45
clarkbyoctozepto: to follow up on the ipv4 problems in limestone I believe the next round of centos image builds should address that16:46
clarkblet me see if I should manually trigger builds16:46
clarkbhrm we haven't updated diskimage-builder on the builders yet /me checks on that first16:47
clarkbwe install diskimage-builder when we update nodepool16:48
clarkbI think in this case I'll manually update diskimage-builder then trigger centos 7 rebuild16:49
*** Garyx has joined #openstack-infra16:49
pabelangerhas that been the case the whole time? I thought we managed that independently16:50
clarkbI think it must've changed at some point16:51
clarkb#status log updated diskimage-builder to 2.26.0 on nodepool builders to pick up centos network manager ipv4 fix16:52
openstackstatusclarkb: finished logging16:52
*** noorul has joined #openstack-infra16:52
clarkbI've also kindly asked nodepool to build centos 7 but it is already building some suse stuff so might be a little while16:52
*** derekh has quit IRC17:00
*** bobh has quit IRC17:01
clarkbcentos7 is now building on nb0117:02
*** psachin has quit IRC17:04
clarkbfdegir: ya fungi was prodding people to see what is necessary to fix that17:04
clarkbfdegir: sounds like tumbleweed is having a repo reorg so we may just have to wait17:05
clarkbfdegir: that said I recently tested if fsyncing was necessary and couldn't find a case where it was17:05
clarkbfdegir: once python exits it should flush and sync all the things to disk17:05
clarkbfdegir: if you need those files on disk before glean exits then maybe that is the problem?17:05
*** noorul has quit IRC17:08
*** noorul has joined #openstack-infra17:13
*** mattw4 has joined #openstack-infra17:13
guilhermespinfra-root:  the following hold can be deleted 104.130.127.20317:15
clarkbout of curiosity what was the problem (and any idea why it wasn't reproducible locally?)17:17
*** diga has quit IRC17:18
clarkbguilhermesp: ^17:19
*** mattw4 has quit IRC17:19
clarkbdelete has been submitted to nodepool17:19
*** noorul has quit IRC17:20
*** priteau has quit IRC17:21
clarkbfedora mirror is now in same size range as ubuntu \o/17:22
clarkbhopefully that helps make releases for it more reliable17:22
guilhermespclarkb: seems that we are having issues with mixed setup (source/distro) installs with debian based OS and having duplication of tempest plugins for heat. We are rebasing the broken patch to see if recent changes regarding the issue fixed the job :)17:23
*** noorul has joined #openstack-infra17:25
melwittcan anyone point me to docs or otherwise give me a pointer as to whether a non-zero return value from a post test hook script is expected to result in a POST_FAILURE rather than a FAILED job status?17:26
melwittexample: https://zuul.opendev.org/t/openstack/build/707d40fa5ad54efd8e8ea4ea9d10812e17:27
clarkbmelwitt: https://zuul-ci.org/docs/zuul/user/jobs.html#build-status should be POST_FAILURE17:28
AJaegermelwitt: if the run playbook does not generate the content that is required by the post-run, it fails with post-failure.17:29
clarkbmelwitt: https://storage.gra1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/logs_40/672840/9/check/nova-next/707d40f/ara-report/reports/53598c9a-c84c-4374-934c-f14acf081790.html that is why you get that return result17:29
AJaegerSo, might be that the post-run is not handling missing content correctly17:29
fdegirclarkb: here is the background for that glean fsync17:29
fdegirclarkb: as you know, network configuration is done by glean via a linux service (one registration for each interface)17:30
fdegirclarkb: the pre part is to run glean to generate the needed files, and the main part is to run “ifup”.17:30
fdegirclarkb: however, glean does not attempt to do os.fsync after writing files, which is not safe in python,17:30
*** noorul has quit IRC17:30
fdegirclarkb: as python will not guarantee the file being written even if it is closed in the python level.17:30
fdegirclarkb: if we are lucky, ifup may complain unknown interface as the file is still in cache.17:30
AJaegerclarkb, melwitt run-post-test-hook runs in post-run, so gets POST_FAILURE. So, question is: Should it run there? ;)17:31
clarkbfdegir: it is my understanding that when the python process exits it flushes though?17:31
clarkbI mean your change won't hurt anything but I spent some time trying to reproduce that recently and wasn't able to17:32
*** ociuhandu has quit IRC17:32
fdegirclarkb: that is the luck part17:32
clarkbbut also even if the file is still in cache other readers should see it via the cache17:32
fdegirclarkb: it happens randomly and when we patched glean locally and rebuilt the images with the patched version, we never experienced this issue17:32
clarkbit would only be a problem if you hard rebooted under it I think17:32
clarkbfdegir: ya could also be a difference in filesystem behavior. I was testing on ext417:33
melwittclarkb, AJaeger: thanks! looks like POST_FAILURE is always expected in this case then (good). I think I just hadn't seen a failure before and was surprised when it didn't say FAILED.  I shall crunch on the info you linked17:35
*** noorul has joined #openstack-infra17:35
fdegirclarkb: regarding tumbleweed issue - is there any place where i can check to see if it is resolved and then try recheck?17:35
fdegirrather than doing random rechecks unnecessarily?17:35
clarkbfdegir: it sounded like fungi was going to try and shepherd your change in? we can probably ping you here when we believe tumbleweed has stabilized17:36
clarkbfdegir: reading more on python stuff. closing the file is supposed to be sufficient17:36
fdegirclarkb: that would be great17:36
clarkbmakes me wonder if there is some funny bug in either python or the system that this is hitting a corner case of17:37
*** noorul has quit IRC17:38
fdegirif i remember correctly, we faced this both on ubuntu1604 and centos717:38
*** noorul has joined #openstack-infra17:38
fdegirbut since it's been a while the change was submitted and we continued using the image we built with locally patched glean, i've forgotten the details17:39
clarkbya thats fine. As mentioned your change won't hurt anything and is just more explicit (which is why it makes me wonder of a corner case)17:39
clarkbthe close and process exit (if clean) should handle it for us but maybe in some cases on some filesystems or whatever it doesn't17:39
clarkband in those cases being explicit is fine17:40
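The explicit approach being discussed, flushing and fsyncing before the file is closed, can be sketched like this. It is a minimal illustration and not glean's actual code: `write_synced` and the file name in the demo are hypothetical.

```python
import os
import tempfile


def write_synced(path, content):
    """Write content to path and ask the OS to commit it to storage."""
    with open(path, "w") as handle:
        handle.write(content)
        handle.flush()              # push Python's userspace buffer to the OS
        os.fsync(handle.fileno())   # ask the OS to write its cached data to disk


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        target = os.path.join(tmp, "iface.cfg")  # hypothetical config file name
        write_synced(target, "BOOTPROTO=dhcp\n")
        with open(target) as handle:
            print(handle.read(), end="")
```

As clarkb notes, other readers on the same running system should already see closed files through the page cache; the extra `os.fsync` mainly matters if the machine loses power or is hard-reset before the kernel writes the data out.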
fdegirin any case, i'll talk to my colleagues to see if they remember more and bring the info back for the people17:41
*** odicha has quit IRC17:44
*** efried_afk is now known as dansmith117:46
*** dansmith1 is now known as efried17:46
*** e0ne has joined #openstack-infra17:49
aspiersclarkb, fungi, corvus: in case you missed it, this got uploaded yesterday (one year later!) https://www.youtube.com/watch?v=kM7dxm1O1jg17:51
*** bobh has joined #openstack-infra17:52
openstackgerritDirk Mueller proposed openstack/diskimage-builder master: zypper-minimal: Don't get confused by etc/resolv.conf symlink  https://review.opendev.org/67779617:57
openstackgerritDirk Mueller proposed opendev/glean master: Sync when writing the file  https://review.opendev.org/65223817:58
*** e0ne has quit IRC18:01
*** igordc has joined #openstack-infra18:03
*** dave-mccowan has joined #openstack-infra18:06
*** gfidente is now known as gfidente|afk18:06
donnydHow much divergence is there between software-factory and the openstack infra aside from the obvious parts18:10
*** bobh has quit IRC18:10
*** dave-mccowan has quit IRC18:11
clarkbdonnyd: they often run patches to zuul and nodepool for unmerged changes to add in features (not sure if that has slowed down as more code has merged to zuul)18:11
clarkbother than that and the obvious bits like bug tracker etc I think it is pretty close18:12
AJaegerinfra-root, a couple more delete requests for wrong content from ages ago: http://files.openstack.org/docs/project-deploy-guide/OpenStack-Ansible/ http://files.openstack.org/docs/project-deploy-guide/kolla-ansible/draft/ http://files.openstack.org/docs/project-deploy-guide/kolla-ansible/html/ http://files.openstack.org/docs/project-deploy-guide/openstack-ansible/draft/18:12
AJaegerhttp://files.openstack.org/docs/project-deploy-guide/openstack-ansible/html/ - thanks18:12
*** diablo_rojo has joined #openstack-infra18:13
donnydIt looked like it was pretty close. I only ask for the purpose of 3P CI wanting to be as close as possible... and short of rolling their own it looks to be a great place to start18:13
*** pkopec has quit IRC18:14
clarkbdonnyd: there are also differences in the test node images18:14
clarkbI want to say they run a lot more distro packages than we do18:15
clarkbfor things like tox18:15
donnydYou mean the images nodepool builds? Can't that be customized in the SF deployment? or is it baked in18:16
*** markvoelker has quit IRC18:16
clarkbya the images nodepool builds. I don't think SF lets users customize those18:17
clarkbbut I may be wrong18:17
donnydThanks for filling in the gaps for me clarkb18:17
clarkbyou could do runtime changes though18:17
tristanCclarkb: donnyd: most of our patches are now merged, only three left, see: https://softwarefactory-project.io/cgit/scl/zuul-distgit/tree/ and https://softwarefactory-project.io/cgit/scl/nodepool-distgit/tree/18:18
tristanCdonnyd: SF lets you build the same image as upstream, using diskimage-builder and custom elements, see: https://softwarefactory-project.io/cgit/config/tree/nodepool/rdo-cloud.yaml18:20
*** markvoelker has joined #openstack-infra18:22
tristanCdonnyd: and SF also supports container based system and virt-customize image too18:23
tristanCbut for opendev 3P CI, it's easier to use the same dib elements as openstack-infra18:24
*** ramishra has quit IRC18:24
donnydRight, but I could potentially just plug in the same as infra and it should work the same18:24
tristanCdonnyd: yes, that's what we do for review.rdoproject.org 3P CI18:25
openstackgerritAkihiro Motoki proposed openstack/openstack-zuul-jobs master: Add support for building PDFs  https://review.opendev.org/66455518:28
*** markvoelker has quit IRC18:30
openstackgerritAndreas Jaeger proposed openstack/project-config master: Add more promote jobs  https://review.opendev.org/67779918:31
*** jtomasek has joined #openstack-infra18:32
clarkbtristanC: what is a virt-customize image in the context of nodepool? Is that run container to execute virt-customize then execute it with libvirt?18:32
openstackgerritAndreas Jaeger proposed openstack/project-config master: Add more promote jobs  https://review.opendev.org/67779918:33
*** kopecmartin is now known as kopecmartin|off18:33
clarkbor maybe running virt-customize within dib?18:33
donnydI would think it would be virt-customize in lieu of dib18:34
clarkbdonnyd: ya but nodepool doesn't support that18:34
clarkbmaybe that is a change they are carrying? I didn't look at the patch list18:34
*** ralonsoh has quit IRC18:35
tristanCclarkb: it's using a fake disk-image-create script injected with the PATH env...18:35
clarkb(fwiw in the past it's been suggested that tools like that and libguestfs could be used as a dib build mechanism and keep all the existing elements and other driver compat, but I don't think anyone added that to dib either. Could be there are patches there)18:35
fungifdegir: clarkb: yeah, it looks like dirk pushed up some dib fixes while i was running errands so hopefully we can get those confirmed and added almost as easily as temporarily setting the job non-voting18:35
*** ralonsoh has joined #openstack-infra18:35
pabelangerclarkb: http://libguestfs.org/virt-dib.1.html18:36
pabelangernever tried it18:36
tristanCe.g.: https://softwarefactory-project.io/cgit/config/tree/nodepool/elements/virt-customize/disk-image-create18:36
clarkbpabelanger: ah ok so they did it the other way around18:36
clarkbpabelanger: the suggestion we made was that running diskimage-create --guestfs or whatever would build image in libguestfs VM and run elements there18:36
pabelangerbut for ansible network, we have some vendors doing virt-customize (vyos for example). So plan on trying it out someday.18:36
clarkbpabelanger: basically something to replace the chroot builder that exists today if you don't mind nested virt or want other features that libguestfs provides18:37
clarkbtristanC: seems like we should be able to support that better than subverting a testing mechanism that could be changed in nodepool?18:37
clarkbthat definitely isn't part of any supported user contract18:37
pabelangeryah, big thing we need, is boot appliance and interact with some console port first18:38
pabelangerto setup SSH18:38
pabelangerfor now, we qemu-img directly and pepect18:38
pabelangerpexpect*18:38
pabelangerI think virt-customize has a cleaning interface there18:38
tristanCclarkb: oh yes, that would be easier, nodepool could directly use a non dib create process18:38
pabelangerI'm waiting for https://review.opendev.org/672196/ before trying it, just drop down different .sh files18:39
tristanCclarkb: but in the meantime, that has been working quite well, the main reason is to be able to use the same cloud image as published by the distrib18:40
clarkbpabelanger: do you have nested virt for that or is it just expected to be slow?18:40
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: Switch deploy-guides to promote publishing  https://review.opendev.org/67780318:40
clarkbtristanC: just so you know dib supports that already18:40
clarkbtristanC: the centos element for example works that way18:40
clarkb(centos-minimal builds the image from scratch instead)18:40
pabelangerclarkb: yah, we need nested virt for a few appliances. Cisco specifically, like 12GB / 24GB, not fun18:40
clarkbsame is true with ubuntu vs ubuntu-minimal elements18:40
pabelangersmaller ones, we don't18:40
openstackgerritAndreas Jaeger proposed openstack/project-config master: Remove now obsolete publish jobs  https://review.opendev.org/67780418:42
clarkbAJaeger: for your afs deletes what makes the content wrong?18:43
AJaegerclarkb: those were errors in publishing years ago18:43
clarkboh I see the timestamps are 201618:43
fungimis-published to wrong path?18:43
AJaegernothing should link to them18:43
AJaegeryes, mispublished18:43
fungiand so never cleaned up. got it18:43
AJaegerSo, instead of http://files.openstack.org/docs/project-deploy-guide/OpenStack-Ansible/ we have http://files.openstack.org/docs/project-deploy-guide/openstack-ansible/18:44
AJaegerand the draft and html directories are wrong18:44
clarkbAJaeger: the correct one is the lower case version?18:44
AJaegeryes18:44
*** e0ne has joined #openstack-infra18:44
AJaegerAnd it's also /draft and /html in those only - last change in 2016, 2017, or 201818:45
AJaeger/OpenStack-Ansible/draft is wrong18:45
AJaegerand nothing else in /OpenStack-Ansible18:45
AJaegerfungi, if you have time later, could you review https://review.opendev.org/#/c/677547/ and https://review.opendev.org/677657, https://review.opendev.org/677799 and a cosmetic https://review.opendev.org/677662, please? more changes for promote18:47
*** kjackal has joined #openstack-infra18:47
rlandy|ruckhi - we used to have a footer in our (tripleo) upstream logs and we lost that with the new logging change (https://bugs.launchpad.net/tripleo/+bug/1840818). Is there any expectation that the footer will be re-enabled or is this something we should place in our own log docs now?18:51
openstackLaunchpad bug 1840818 in tripleo "Headers and footers no longer appear in upstream logss" [Medium,Triaged] - Assigned to Ronelle Landy (rlandy)18:51
clarkbrlandy|ruck: that might be a better question for #zuul as we'd need to update the zuul dashboard to support that I think18:52
*** factor has quit IRC18:52
clarkbrlandy|ruck: I expect zuul would be open to that. Could have a special file in a dir that zuul dashboard renders as part of the dashboard?18:52
rlandy|ruckclarkb: thanks - will ask there18:53
rlandy|ruckwe could even add a new log file18:53
rlandy|ruckthe footer was just a convenient place to point users at18:53
clarkbrlandy|ruck: that would be doable in the existing system18:54
clarkba README file per dir or something18:54
rlandy|ruckso we have that in places18:54
clarkband then maybe later we update zuul to render that in the dashboard automatically18:54
rlandy|rucksome people don't look there though :)18:54
*** mriedem has quit IRC18:58
*** mriedem has joined #openstack-infra18:59
clarkbcentos-7 image in limestone updated just under an hour ago19:00
clarkbyoctozepto: sshnaidm ^ I think that means we should have more reliable ipv4 now. However this is a race between network manager and the kernel. Please do let us know if you see it persist19:00
*** ralonsoh has quit IRC19:05
openstackgerritAkihiro Motoki proposed openstack/openstack-zuul-jobs master: Add support for building PDFs  https://review.opendev.org/66455519:12
*** jtomasek has quit IRC19:15
*** jtomasek has joined #openstack-infra19:17
openstackgerritSorin Sbarnea proposed opendev/bindep master: Expose base python version as an atom  https://review.opendev.org/63995119:19
fdegirthanks to dirk, opensuse job works now and glean change https://review.opendev.org/#/c/652238/ passed the check19:26
yoctozeptoclarkb: thanks! :D19:27
fungiawesome, thanks for the heads up fdegir!19:28
fungiand thanks for fixing that, dirk!19:28
fungidirk: i think the rm is fine since it's conditional on a test that it is actually a link anyway19:31
fungiso unless we're running that on a tumbleweed server and the main /etc/resolv.conf is a symlink already and $TARGET_ROOT is for some reason empty, i don't think it'll end up causing any collateral damage19:32
*** e0ne has quit IRC19:32
*** e0ne has joined #openstack-infra19:33
clarkbfungi: it's the tee not the rm that is a problem aiui19:35
clarkbthe rm is added so the tee writes in the chroot and not out of it19:36
fungiright, i just meant as a response to his question about doing that vs a sudo chroot19:37
openstackgerritSorin Sbarnea proposed opendev/bindep master: Expose base python version as an atom  https://review.opendev.org/63995119:39
clarkbheading out on the bike now. back in a bit19:40
*** markvoelker has joined #openstack-infra19:44
*** markvoelker has quit IRC19:49
*** eernst has joined #openstack-infra19:52
*** factor has joined #openstack-infra19:55
*** eernst has quit IRC19:57
*** eernst_ has joined #openstack-infra19:57
*** noorul has quit IRC19:57
*** noorul has joined #openstack-infra19:58
sshnaidmclarkb, thanks, will keep eye19:58
openstackgerritAkihiro Motoki proposed openstack/openstack-zuul-jobs master: Add support for building PDFs  https://review.opendev.org/66455520:00
*** noorul has quit IRC20:06
*** e0ne has quit IRC20:10
*** e0ne has joined #openstack-infra20:12
*** ykarel has quit IRC20:13
openstackgerritAkihiro Motoki proposed openstack/openstack-zuul-jobs master: Add support for building PDFs  https://review.opendev.org/66455520:14
*** kjackal has quit IRC20:15
*** ykarel has joined #openstack-infra20:16
*** eernst_ has quit IRC20:21
*** noorul has joined #openstack-infra20:21
*** jtomasek has quit IRC20:21
*** jamesmcarthur has quit IRC20:23
*** noorul has quit IRC20:26
*** noorul has joined #openstack-infra20:41
openstackgerritMerged openstack/diskimage-builder master: zypper-minimal: Don't get confused by etc/resolv.conf symlink  https://review.opendev.org/67779620:43
openstackgerritAkihiro Motoki proposed openstack/openstack-zuul-jobs master: Add support for building PDFs  https://review.opendev.org/66455520:43
*** jamesmcarthur has joined #openstack-infra20:44
*** e0ne has quit IRC20:44
*** noorul has quit IRC20:47
*** ykarel has quit IRC20:48
*** noorul has joined #openstack-infra20:52
*** bobh has joined #openstack-infra20:55
mnaseris the zuul console a lil broken?20:55
mnaseri got two builds and i just have "--- END OF STREAM ---" listed there20:56
clarkbif the jobs just started that usually indicates the job isn't far enough along to be running the port 19885 console streaming service on the remote node20:56
*** noorul has quit IRC20:56
clarkbif it happens well after job has started I think that means something prevents that network connection?20:57
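The first thing to rule out in the situation described above is whether anything accepts TCP connections on the console streaming port at all. A minimal sketch of such a check follows. It only tests reachability and does not speak the streamer's protocol; the function name `port_is_open` and the default timeout are assumptions made for illustration, and only the port number 19885 comes from the discussion.

```python
import socket


def port_is_open(host: str, port: int = 19885, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    This only proves that something is listening and reachable; it says
    nothing about whether the service behind the port is healthy.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False
```

A False result from the executor's side with a True result locally on the node would point at a firewall or routing problem rather than a dead daemon.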
*** bobh has quit IRC21:00
*** eernst has joined #openstack-infra21:00
*** jamesmcarthur has quit IRC21:01
*** noorul has joined #openstack-infra21:02
openstackgerritSean McGinnis proposed opendev/irc-meetings master: Switch release team to 1600 UTC  https://review.opendev.org/67783121:02
*** noorul has quit IRC21:07
mnaserclarkb: in my case, it seems like something is off, i had a job eventually fail21:08
mnaser(so it was running ok)21:08
mnaserin my case http://zuul.opendev.org/t/opendev/stream/4842112d931c4836ab78645823b744f3?logfile=console.log has been running for a while but still giving end of stream21:08
fungiyeah, if something kills the streaming daemon (oom, reboot) or blocks access to its socket (firewall or networking changes) then that can cause it21:11
fungilogs should still be collected and reported at the end of the build, as long as log collection via rsync isn't also broken21:12
*** noorul has joined #openstack-infra21:12
*** Goneri has quit IRC21:12
fungimaybe there will be something in there which explains the log streamer not working21:12
clarkbthat job is running on ze1021:14
clarkbI cannot ssh onto the test node as root21:15
*** jamesmcarthur has joined #openstack-infra21:16
clarkbthat implies to me that the host is unhappy?21:16
*** altlogbot_3 has quit IRC21:16
*** noorul has quit IRC21:17
clarkboh maybe the job is done?21:17
clarkbit doesn't seem to have logged its completion21:17
clarkbbut those test nodes are unknown by nodepool right now21:17
clarkboh nevermind I can't grep by hostname anymore in nodepool output21:18
*** mriedem has quit IRC21:18
clarkball three nodes have a process listening at 1988521:20
corvuszuul     21479  0.0  0.0      0     0 ?        Z    Aug09   2:19 [zuul-executor] <defunct>21:22
*** noorul has joined #openstack-infra21:22
corvusclarkb: ^ the proxy on the executor died21:22
clarkbah21:22
clarkbfwiw if I nc to the hosts port 19885 I get nothing either (do I need to use finger protocol to that service too?)21:23
clarkblooks like there was an OOM on the 17th but zuul-executor was last restarted on the 9th21:23
clarkbthat may be where the proxy was killed I guess21:23
clarkblooks like 09 and 10 are in that same situation. Should we restart the zuul-executor service  on those hosts?21:27
clarkbalso should zuul consider waitpid'ing its child processes and restarting them if necessary?21:27
*** noorul has quit IRC21:27
*** rcernin has joined #openstack-infra21:27
clarkb(I can restart the zuul-executor service on 09 and 10 if that is what we want to do there)21:28
*** mattw4 has joined #openstack-infra21:30
*** bobh has joined #openstack-infra21:32
clarkbinfra-root ^ any objection to that? I doubt there is much we can do to debug in its current state. I've confirmed in 09's dmesg output that OOMKiller killed the child zuul-executor process21:32
*** noorul has joined #openstack-infra21:32
ianw++21:33
clarkbianw: did you see the dib fix for tumbleweed and the followup to make fdegir glean usage happy?21:33
clarkb(we might want to do releases of those two to close that out if possible)21:33
corvusclarkb: ++ also waitpid++21:34
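The waitpid idea can be sketched as follows: the parent polls its child with a non-blocking `os.waitpid`, which also reaps it so it does not linger as a `<defunct>` entry, and starts a replacement when the child has exited. This is a generic POSIX illustration, not Zuul's code; the helper names `spawn` and `supervise` and the `max_restarts` limit are assumptions made for the example.

```python
import os
import time


def spawn(argv):
    """Fork and exec argv in the child; return the child's pid to the parent."""
    pid = os.fork()
    if pid == 0:
        try:
            os.execvp(argv[0], argv)
        finally:
            # Only reached if exec failed; never fall back into parent code.
            os._exit(127)
    return pid


def supervise(argv, max_restarts=3, poll_interval=0.1):
    """Run argv, reap it when it exits, and restart it up to max_restarts times.

    Returns the number of restarts performed.
    """
    restarts = 0
    pid = spawn(argv)
    while True:
        # WNOHANG makes waitpid return (0, 0) while the child is still running.
        reaped, status = os.waitpid(pid, os.WNOHANG)
        if reaped == 0:
            time.sleep(poll_interval)
            continue
        if os.WIFSIGNALED(status):
            # An OOM kill would show up here as signal 9 (SIGKILL).
            print("child %d killed by signal %d" % (reaped, os.WTERMSIG(status)))
        if restarts >= max_restarts:
            return restarts
        restarts += 1
        pid = spawn(argv)
```

Capping the restart count is a design choice for the sketch so that a child which dies immediately cannot cause a tight respawn loop.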
clarkbI'll do 09 and 10 sequentially to reduce any potential load on the other executors that might trigger the same thing there21:34
ianwclarkb: will check on it21:35
*** altlogbot_2 has joined #openstack-infra21:37
*** noorul has quit IRC21:37
*** altlogbot_2 has quit IRC21:38
*** sshnaidm is now known as sshnaidm|afk21:40
*** altlogbot_2 has joined #openstack-infra21:41
ianwclarkb / fungi : ahh, so did 677796 (deleting outside chroot) break our builders?21:41
fungiclarkb: sounds fine to me21:41
clarkbianw: you know I didn't check but other builds seemed to be working?21:42
clarkbianw: possible that puppet is fixing it for us?21:42
*** altlogbot_2 has quit IRC21:42
fungiianw: tumbleweed changed to having /etc/resolv.conf symlink into /run so when editing it outside of a chroot the tee was unable to write into that symlink as it was a dead link21:42
*** noorul has joined #openstack-infra21:42
clarkb09 zuul-executor is done21:42
clarkbdoing 10 now21:42
fungiianw: by deleting the symlink from the filesystem subtree, tee now creates a normal file at that location as intended21:43
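The symlink behaviour described above can be reproduced in a few lines. The sketch below is an illustration in Python rather than the actual diskimage-builder shell change: writing through a symlink whose target does not exist outside the chroot fails, while removing the link first (only if it really is a link) lets a regular file be created in its place. The function name `write_resolv_conf` and the `target_root` parameter are assumptions made for the example.

```python
import os


def write_resolv_conf(target_root: str, contents: str) -> str:
    """Write etc/resolv.conf under target_root as a regular file.

    If the path is a symlink (for example into /run, which is empty when
    viewed from outside the chroot), remove the link first so the write
    lands inside target_root instead of following the link.
    """
    path = os.path.join(target_root, "etc", "resolv.conf")
    # islink is True even for dangling links, unlike os.path.exists.
    if os.path.islink(path):
        os.remove(path)
    with open(path, "w") as f:
        f.write(contents)
    return path
```

Guarding the removal with the link test matches the point made earlier in the discussion: a regular file at that path is simply overwritten and nothing else is deleted.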
ianwcool; i just released 2.26.1 with just that fix21:45
ianwfdegir: ^21:46
*** jamesmcarthur has quit IRC21:46
fdegirianw: thx - i'm waiting for this to go in as well - https://review.opendev.org/#/c/652238/21:46
*** jamesmcarthur has joined #openstack-infra21:47
*** noorul has quit IRC21:47
ianwfdegir: np, just having a look now ... ISTR very similar issues, i wonder if it was an old change or we just missed this21:48
clarkband now 10 is restarted21:51
*** jamesmcarthur has quit IRC21:51
*** jamesmcarthur has joined #openstack-infra21:51
ianwi'm thinking of the issues we had @ ~ http://eavesdrop.openstack.org/irclogs/%23openstack-infra/%23openstack-infra.2018-07-25.log.html#t2018-07-25T00:37:51 with the centos ssh key being written but not available21:51
*** noorul has joined #openstack-infra21:52
*** rlandy|ruck is now known as rlandy|ruck|bbl21:57
ianwi'm not sure why we didn't sync at that time too, anyway, LGTM21:57
*** noorul has quit IRC21:58
ianwlooks like that will basically be it from 1.14.1 in glean -- fdegir do you need a release?21:58
*** markvoelker has joined #openstack-infra21:59
*** bnemec has quit IRC21:59
fdegirianw: yes if it is not too much to ask21:59
*** jeliu_ has quit IRC22:01
*** jeliu_ has joined #openstack-infra22:01
*** noorul has joined #openstack-infra22:03
*** diablo_rojo has quit IRC22:04
*** noorul has quit IRC22:08
*** jeliu_ has quit IRC22:09
openstackgerritClark Boylan proposed zuul/zuul master: Restart log streamer if it dies  https://review.opendev.org/67784622:09
*** noorul has joined #openstack-infra22:13
*** jamesmcarthur has quit IRC22:15
*** noorul has quit IRC22:18
*** rh-jelabarre has quit IRC22:21
*** bobh has quit IRC22:22
*** bobh has joined #openstack-infra22:22
*** dchen has joined #openstack-infra22:23
*** noorul has joined #openstack-infra22:23
*** dchen has quit IRC22:23
*** markvoelker has quit IRC22:24
*** dchen has joined #openstack-infra22:25
*** bobh has quit IRC22:27
*** noorul has quit IRC22:27
openstackgerritPaul Belanger proposed zuul/zuul master: Switch ansible_default to 2.8  https://review.opendev.org/67669522:29
openstackgerritPaul Belanger proposed zuul/zuul master: WIP: Support Ansible 2.9  https://review.opendev.org/67485422:29
*** noorul has joined #openstack-infra22:33
*** threestrands has joined #openstack-infra22:34
openstackgerritMerged zuul/zuul master: Add Tristan to Zuul Maintainers  https://review.opendev.org/67730822:35
*** markvoelker has joined #openstack-infra22:35
*** noorul has quit IRC22:37
*** rcernin has quit IRC22:40
*** rosmaita has quit IRC22:40
*** markvoelker has quit IRC22:40
*** mattw4 has quit IRC22:41
*** eharney has quit IRC22:41
*** noorul has joined #openstack-infra22:43
*** rcernin has joined #openstack-infra22:43
*** eernst has quit IRC22:44
*** eernst has joined #openstack-infra22:45
*** noorul has quit IRC22:48
openstackgerritMerged opendev/glean master: Sync when writing the file  https://review.opendev.org/65223822:51
*** eernst has quit IRC22:56
*** tkajinam has joined #openstack-infra22:56
*** eernst has joined #openstack-infra22:56
*** jamesmcarthur has joined #openstack-infra22:57
*** aaronsheffield has quit IRC23:13
donnydwho owns windmill?23:15
ianwdonnyd: pabelanger i would say23:16
ianwfdegir: glean 1.15.0 should have that change, thanks!23:16
donnydI wasn't sure which channel messages from windmill get posted to23:16
ianwthere is an #openstack-windmill23:17
donnydI just looked and I don't see anything... but i may have been a second too late23:17
pabelangeryup, still kicking23:20
pabelangerwe use it to deploy zuul.ansible.com23:20
donnydhrm, can't resolve that23:25
corvuswe really should see if we can do something about that23:25
corvusdashboard.zuul.ansible.com23:25
donnydI added windmill.ansible to the requirements because it was not there23:26
donnydhttps://review.opendev.org/#/c/677850/23:26
pabelangeryah, noticed that too. gets pulled in via windmill.ops23:26
pabelangerwas trying to convert it to a collection23:26
donnydI am trying to get it running, but i am not sure if I am doing it right23:27
*** bobh has joined #openstack-infra23:28
pabelangeryah, it is poorly documented. I really need to fix that23:28
pabelangerbasically, you need to also setup windmill-config23:28
pabelangerhttps://github.com/ansible-network/windmill-config/ is an example for ansible zuul23:29
donnydWell all the playbooks ran successfully23:29
donnydI just ran it as is with windmill-config23:31
donnydbut I am not sure i am doing it right23:31
pabelangerwe can move to openstack-windmill if you want23:31
*** sgw has quit IRC23:34
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Add arm64 based functional test  https://review.opendev.org/67611123:39
ianw^ not sure if anyone has any thoughts on adding that ... it would have caught the recent "efi blows out disk usage calculations and build runs out of space" issues23:40
ianwrsync: link_stat "/repositories/Virtualization:/containers/openSUSE_Leap_42.3/." (in opensuse) failed: Input/output error (5)23:41
ianwis this our end (afs) or the remote end i wonder?23:41
donnydianw: do the providers need to support efi boot for VM's ?23:45
ianwdonnyd: with arm64 they only support efi.  i have had an x86-64 efi based vm that dib created booting in qemu, but we've never tried it in a provider afaik23:46
donnydok23:47
donnyd:)23:47
*** sthussey has quit IRC23:50
ianwok, so that opensuse mirror error, i think the afs part is a red herring and it's the remote end23:52
ianwhttp://paste.openstack.org/show/761422/23:52
ianwsame thing happens if i just try it to /tmp23:52
ianwso it looks like rsync://provo-mirror.opensuse.org is having issues23:53
ianwfdegir AJaeger dirk: ^23:53
ianwlogs at http://files.openstack.org/mirror/logs/rsync-mirrors/opensuse.log23:53
clarkbya I think dirk said they are reorging repo structure23:54
ianwinfra-root: can we merge https://review.opendev.org/#/c/671963/ so those logs come up in the browser for people looking23:54
ianwclarkb: cool, when i see i/o errors + afs + new mirror-update server i just get a little worried :)23:54
*** jamesmcarthur has quit IRC23:56
