Friday, 2019-10-04

openstackgerritIan Wienand proposed openstack/diskimage-builder master: [dnm] testing nested virt labels  https://review.opendev.org/68651000:00
openstackgerritPaul Belanger proposed zuul/zuul master: Remove support for ansible 2.5  https://review.opendev.org/65043100:01
openstackgerritPaul Belanger proposed zuul/zuul master: Switch ansible_default to 2.8  https://review.opendev.org/67669500:01
openstackgerritPaul Belanger proposed zuul/zuul master: WIP: Support Ansible 2.9  https://review.opendev.org/67485400:01
*** dchen has joined #openstack-infra00:04
ianw  File "/usr/local/lib/python3.5/dist-packages/nodepool/driver/openstack/config.py", line 219, in load00:05
ianw    top_label = full_config.labels[pl.name]00:05
ianwKeyError: 'nested-virt-ubuntu-bionic'00:05
mnasersomeone made a nodepool oopsie?00:05
mnaserprobably top level labels missing00:05
openstackgerritTristan Cacqueray proposed zuul/zuul-registry master: Add OCI containers/storage support  https://review.opendev.org/68651200:05
mnaseryep00:05
mnaserianw: patch incoming00:06
ianwheh,just pushed one too00:07
openstackgerritIan Wienand proposed openstack/project-config master: Add top-level nested virt labels  https://review.opendev.org/68651300:07
mnaseryours was better tha mine00:08
ianwwhy didn't linting catch this though?00:08
mnaserhow would it i dont think we lint nodepool config00:08
ianwwe run config-validate?00:08
mnaseri ran into this too at some point i think00:09
mnaserbut not with config-validate i think00:09
ianwi don't think we're running it00:10
ianwi tell a lie, it did run, and it passed00:20
ianwhttps://zuul.opendev.org/t/openstack/build/e7aeb2257151439c907cd2e53a6c8cc7/log/job-output.txt#64200:21
*** gyee has quit IRC00:21
*** yamamoto has joined #openstack-infra00:23
ianwinfra-root: if we can double-check https://review.opendev.org/#/c/686513/ should allow some nested-virt testing00:23
mnaserianw: do you have the full stack trace00:24
mnaserianw: all validate seems to do is just validate the schema from what it seems like00:24
ianwmnaser: http://paste.openstack.org/show/781064/00:24
mnaseryeah it doesnt actually load it00:25
mnasercause the failure happens in the driver level00:25
*** yamamoto has quit IRC00:27
*** jtomasek has quit IRC00:30
*** markvoelker has joined #openstack-infra00:31
*** jbadiapa has joined #openstack-infra00:33
*** markvoelker has quit IRC00:35
openstackgerritIan Wienand proposed zuul/nodepool master: [wip] validate diskimages have top-level labels  https://review.opendev.org/68651400:39
ianwmnaser: ^ something like that?  can add a test case if it might work00:39
mnaseri was actaully working on something similar, i have a failing test case :p00:39
mnaseralso notice for provider_dict loop exists above00:39
mnaserso you're looping one extra time00:40
mnaserand i think the normal schema validation throws an exception00:40
*** jtomasek has joined #openstack-infra00:40
mnaserianw: at this point im actually wondering why the top level labels even exist00:41
ianwhaving it separate was sort of by design, keep each step a bit separate.  but let's take yours if you've got something better00:41
mnaserif all we're going to do is just .. make them match00:42
*** yamamoto has joined #openstack-infra00:47
*** armax has joined #openstack-infra00:48
openstackgerritMohammed Naser proposed zuul/nodepool master: Added failing configuration check  https://review.opendev.org/68651501:00
*** CrayZee has joined #openstack-infra01:10
*** panda|off has quit IRC01:12
*** panda has joined #openstack-infra01:12
*** shachar has quit IRC01:13
*** rfolco has quit IRC01:19
*** efried has quit IRC01:26
*** markvoelker has joined #openstack-infra01:26
*** efried has joined #openstack-infra01:26
*** signed8b_ has joined #openstack-infra01:27
*** dklyle has quit IRC01:27
*** rfolco has joined #openstack-infra01:27
*** dklyle has joined #openstack-infra01:28
*** harlowja has quit IRC01:30
*** signed8bit has quit IRC01:30
*** d34dh0r53 has quit IRC01:30
*** bradm has quit IRC01:30
*** bstinson has quit IRC01:30
*** markvoelker has quit IRC01:30
*** harlowja has joined #openstack-infra01:32
*** d34dh0r53 has joined #openstack-infra01:32
*** bstinson has joined #openstack-infra01:40
*** yamamoto has quit IRC01:42
*** markvoelker has joined #openstack-infra01:54
*** markvoelker has quit IRC01:58
*** markvoelker has joined #openstack-infra02:05
*** whoami-rajat has joined #openstack-infra02:09
*** jamesmcarthur has joined #openstack-infra02:13
*** yamamoto has joined #openstack-infra02:13
openstackgerritMerged openstack/diskimage-builder master: Remove "failovermethod=priority" for Fedora (dnf)  https://review.opendev.org/68587602:17
*** jonher has quit IRC02:19
*** yamamoto has quit IRC02:19
*** yamamoto has joined #openstack-infra02:20
*** ricolin has joined #openstack-infra02:27
*** roman_g has quit IRC02:34
*** markvoelker has quit IRC02:35
*** ykarel|away has joined #openstack-infra02:36
*** jonher has joined #openstack-infra02:41
*** exsdev has quit IRC02:44
*** rfolco has quit IRC02:45
*** exsdev has joined #openstack-infra02:49
openstackgerritTristan Cacqueray proposed zuul/zuul-registry master: Add OCI containers/storage support  https://review.opendev.org/68651202:52
*** markvoelker has joined #openstack-infra03:06
openstackgerritMerged openstack/diskimage-builder master: yum-minimal : update mirrors for Centos 8  https://review.opendev.org/68443903:22
openstackgerritMerged openstack/diskimage-builder master: dib-python : handle centos 8  https://review.opendev.org/68444103:22
openstackgerritIan Wienand proposed zuul/nodepool master: Validate openstack provider pool labels have top-level labels  https://review.opendev.org/68651403:30
ianwdonnyd: ^ changes are making it through.  the centos8 stack was very long, i think that with the rebasing etc and all the jobs running together probably caused undue contention03:33
*** jamesmcarthur has quit IRC03:34
*** jamesmcarthur has joined #openstack-infra03:35
*** kjackal has joined #openstack-infra03:35
*** markvoelker has quit IRC03:36
*** markvoelker has joined #openstack-infra03:36
*** jamesmcarthur has quit IRC03:39
openstackgerritIan Wienand proposed zuul/nodepool master: Validate openstack provider pool labels have top-level labels  https://review.opendev.org/68651403:53
openstackgerritIan Wienand proposed zuul/nodepool master: Validate openstack provider pool labels have top-level labels  https://review.opendev.org/68651404:01
*** ykarel|away has quit IRC04:05
*** jamesmcarthur has joined #openstack-infra04:05
*** markvoelker has quit IRC04:07
*** jamesmcarthur has quit IRC04:12
*** gouthamr has quit IRC04:15
*** gouthamr has joined #openstack-infra04:16
openstackgerritMerged openstack/project-config master: Add top-level nested virt labels  https://review.opendev.org/68651304:24
*** ykarel|away has joined #openstack-infra04:25
*** ykarel|away is now known as ykarel04:31
openstackgerritMerged openstack/diskimage-builder master: Update locales for Centos 8  https://review.opendev.org/68444204:34
*** markvoelker has joined #openstack-infra04:35
*** markvoelker has quit IRC04:39
*** surpatil has joined #openstack-infra04:44
*** ykarel is now known as ykarel|afk04:50
*** jbadiapa has quit IRC04:52
*** soniya29 has joined #openstack-infra04:57
*** kjackal has quit IRC04:59
*** dchen has quit IRC05:00
*** dchen has joined #openstack-infra05:02
*** jamesmcarthur has joined #openstack-infra05:08
*** jamesmcarthur has quit IRC05:13
*** raukadah is now known as chandankumar05:13
*** odicha has joined #openstack-infra05:13
*** setuid is now known as zzZZzzetuid05:15
*** markvoelker has joined #openstack-infra05:19
*** iokiwi has quit IRC05:23
*** markvoelker has quit IRC05:24
*** iokiwi has joined #openstack-infra05:24
*** goldyfruit_ has joined #openstack-infra05:26
*** markvoelker has joined #openstack-infra05:28
*** markvoelker has quit IRC05:33
*** markvoelker has joined #openstack-infra05:37
*** markvoelker has quit IRC05:42
*** rlandy|bbl is now known as rlandy05:43
*** markvoelker has joined #openstack-infra05:47
*** ramishra has joined #openstack-infra05:47
*** markvoelker has quit IRC05:51
*** markvoelker has joined #openstack-infra05:56
*** ykarel|afk is now known as ykarel06:00
*** markvoelker has quit IRC06:01
*** markvoelker has joined #openstack-infra06:05
*** jamesmcarthur has joined #openstack-infra06:09
*** markvoelker has quit IRC06:10
openstackgerritIan Wienand proposed openstack/project-config master: nodepool/elements : use abstracted commands  https://review.opendev.org/68652406:11
*** markvoelker has joined #openstack-infra06:14
ianwdonnyd: so, first results on nested-vm nodes are in @ https://review.opendev.org/#/c/686510/ .  it's hard to say ... not suddenly faster.  however, it's a bit hard because we dont' capture the devstack/nova logs to see 100% that it chose kvm for the nested vm.  feel free to update job and grab that info :)06:14
*** jamesmcarthur has quit IRC06:15
*** markvoelker has quit IRC06:19
*** markvoelker has joined #openstack-infra06:23
AJaegerianw: https://review.opendev.org/#/c/686524/1 will break without nodepool release, correct? So, let's WIP it, please06:25
*** markvoelker has quit IRC06:28
*** pgaxatte has joined #openstack-infra06:32
*** markvoelker has joined #openstack-infra06:33
*** spsurya has joined #openstack-infra06:33
*** dchen has quit IRC06:36
*** slaweq has joined #openstack-infra06:37
*** dchen has joined #openstack-infra06:38
*** markvoelker has quit IRC06:38
*** markvoelker has joined #openstack-infra06:42
*** udesale has joined #openstack-infra06:46
*** markvoelker has quit IRC06:46
*** markvoelker has joined #openstack-infra06:52
*** iurygregory has joined #openstack-infra06:54
*** markvoelker has quit IRC06:56
*** hwoarang has quit IRC06:58
*** markvoelker has joined #openstack-infra07:01
*** ramishra has quit IRC07:06
*** markvoelker has quit IRC07:06
*** jonher has quit IRC07:07
*** jonher has joined #openstack-infra07:07
*** rpittau|afk is now known as rpittau07:10
*** markvoelker has joined #openstack-infra07:11
*** hwoarang has joined #openstack-infra07:11
*** jamesmcarthur has joined #openstack-infra07:11
*** pcaruana has joined #openstack-infra07:15
*** markvoelker has quit IRC07:15
*** jamesmcarthur has quit IRC07:16
*** tosky has joined #openstack-infra07:18
*** markvoelker has joined #openstack-infra07:20
*** xek has joined #openstack-infra07:22
*** e0ne has joined #openstack-infra07:22
*** e0ne has quit IRC07:22
*** markvoelker has quit IRC07:24
*** xenos76 has joined #openstack-infra07:29
*** kjackal has joined #openstack-infra07:31
*** jpena|off is now known as jpena07:38
ianwCannot initiate the connection to mirror.ord.rax.opendev.org:80 (2001:4801:7824:103:be76:4eff:fe10:9c2). - connect (101: Network is unreachable) [IP: 23.253.216.84 80]07:43
ianwthat's odd07:43
*** markvoelker has joined #openstack-infra07:47
*** markvoelker has quit IRC07:52
openstackgerritMatthieu Huin proposed zuul/zuul master: Add OpenAPI description for enqueue, dequeue, autohold  https://review.opendev.org/67425707:53
*** markvoelker has joined #openstack-infra07:57
*** pkopec has joined #openstack-infra08:01
*** markvoelker has quit IRC08:01
*** jamesmcarthur has joined #openstack-infra08:13
*** rcernin has quit IRC08:15
*** markvoelker has joined #openstack-infra08:15
*** gfidente has joined #openstack-infra08:16
*** jamesmcarthur has quit IRC08:18
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - Support for branch creation/deletion  https://review.opendev.org/68511608:20
*** markvoelker has quit IRC08:20
*** xek_ has joined #openstack-infra08:21
*** zbr is now known as zbr|ruck08:23
*** soniya29 has quit IRC08:24
*** xek has quit IRC08:24
*** lpetrut has joined #openstack-infra08:25
*** tkajinam has quit IRC08:27
*** yamamoto has quit IRC08:35
*** ociuhandu has joined #openstack-infra08:37
*** yamamoto has joined #openstack-infra08:37
*** derekh has joined #openstack-infra08:37
*** soniya29 has joined #openstack-infra08:41
*** markvoelker has joined #openstack-infra08:43
openstackgerritMerged openstack/diskimage-builder master: Add security mirror override for debian-minimal  https://review.opendev.org/68632808:46
*** markvoelker has quit IRC08:48
*** ociuhandu has quit IRC08:54
*** ociuhandu has joined #openstack-infra08:54
*** ociuhandu has quit IRC08:59
*** ociuhandu has joined #openstack-infra08:59
*** e0ne has joined #openstack-infra08:59
*** markvoelker has joined #openstack-infra09:02
*** markvoelker has quit IRC09:07
*** jamesmcarthur has joined #openstack-infra09:14
*** hrw has joined #openstack-infra09:14
hrwmorning09:14
yoctozeptomorning infra09:15
*** ramishra has joined #openstack-infra09:17
hrwWe have a problem in Kolla. Out periodic job builds and then push ~250 images in 5 jobs (centos/source, centos/binary, ubuntu/source, ubuntu/binary, debian/source). And then it timeouts in 2.5h09:18
hrwhow much we can expand timeout?09:19
*** ccamacho has joined #openstack-infra09:19
*** jamesmcarthur has quit IRC09:19
*** markvoelker has joined #openstack-infra09:21
*** kjackal has quit IRC09:22
openstackgerritMatthieu Huin proposed zuul/zuul master: Zuul Web: add /api/user/authorizations endpoint  https://review.opendev.org/64109909:23
*** rfolco has joined #openstack-infra09:23
*** panda is now known as panda|bbl09:25
*** markvoelker has quit IRC09:25
*** yamamoto has quit IRC09:26
*** yamamoto has joined #openstack-infra09:30
*** yamamoto has quit IRC09:35
*** markvoelker has joined #openstack-infra09:39
*** yamamoto has joined #openstack-infra09:41
*** jbadiapa has joined #openstack-infra09:42
*** SergeyLukjanov has quit IRC09:44
*** markvoelker has quit IRC09:45
AJaegerWhy do you need to build those in a periodic job?09:45
*** exsdev0 has joined #openstack-infra09:45
*** exsdev has quit IRC09:46
*** exsdev0 is now known as exsdev09:46
AJaegerhrw: ^09:47
AJaegerhrw: maximal allowed timeout is AFAIK 10800s09:47
hrwok09:48
AJaegerand you can set that as post-timeout.09:48
hrwAJaeger: done in https://review.opendev.org/68656109:48
hrwbuilding is ~20 minutes. then push goes09:48
AJaegerthen don't increase timeout, just post-timeout ;)09:49
hrwnormal timeout was 2:50 anyway09:49
AJaegeryou don't want it running 6hours ;)09:49
AJaegermmh, so you're at the limit - I think we have some time to collect log files...09:49
hrwI can bear with 6h if it does the job ;(09:50
AJaegerhrw: so, what is the purpose of this?09:50
hrwAJaeger: our job is to provide images for people to deploy openstack09:50
hrwand with more and more images pushing time gets longer and longer09:51
*** SergeyLukjanov has joined #openstack-infra09:51
AJaegerand why -source and -binary? Isn't one enough?09:51
mordredwow - that's taking 2 hours just in the pushing to dockerhub09:52
AJaegerand you need those for all branches, correct?09:52
hrwAJaeger: yep. 5 branches now09:52
AJaegerso, 5 branches and 5 jobs pushing at the same time?09:52
hrwstable runs at other time iirc09:53
AJaegerhrw: no, all at basically same time09:53
hrwok09:53
hrwin stable/* there are 4 jobs. we just added 5th one for master09:53
AJaegerhrw: stable starts at 6:01 UTC, others at 6:00 UTC09:53
hrwk09:54
AJaegermordred: would it make sense to make them dependent on each other? So, serializing them?09:54
AJaeger(at least per branch)09:54
AJaegerOr use a semaphore so that only a few of them run at once? Or is parallel execution not a problem?09:54
hrwnot a problem as long as dockerhub responds in time09:55
AJaegerhrw: So, let me ask differently (see my ramblings above): Do you think a single job run on its own would be better than 24 jobs running in parallel and uploading?09:55
hrwAJaeger: definitelly better09:56
mordredyeah - I think the issue here is simply pure volume of data uploaded in the job09:56
*** kjackal has joined #openstack-infra09:56
AJaegerhrw: consider using a semaphore for the jobs, see what we do with translations at https://opendev.org/openstack/project-config/src/branch/master/zuul.d/jobs.yaml#L112309:57
hrwAJaeger: thanks. will look09:57
AJaegerhrw: and how large are your images?09:57
*** markvoelker has joined #openstack-infra09:58
hrwlet me check do I have just a set09:58
hrwwill take a moment - removing extra ones09:59
mordredyeah - looking through the logs - it just seems like 2 hours is a really long time to just be stuck in pushing - so either the images are really big and it's just going to take a while, or dockerhub is throttling us09:59
mordredin which case potentially the semaphore would help, as serializing the uploads would allow more bandwidth per push10:00
yoctozeptoAJaeger, mordred: yeah we are pushing a lot in parallel10:00
openstackgerritFabien Boucher proposed zuul/zuul master: Gitlab - Basic handling of merge_requests event  https://review.opendev.org/68599010:00
mordredso I think the semaphore is definitely worth a try - just to see if it helps with bandwidth between us and dockerhub10:00
yoctozeptobest if we could semaphore post only10:00
yoctozeptobut probably not achievable10:00
AJaegeryoctozepto: sure, you can add the semaphore only to the periodic jobs10:01
AJaegeryoctozepto: post or periodic? Is kolla pushing in post as well?10:01
*** markvoelker has quit IRC10:02
*** mugsie has quit IRC10:03
yoctozeptoAJaeger: yeah the publishing/pushing in periodics is done inside post10:03
yoctozeptothat's why hrw is increasing the post timeout10:03
AJaegeryoctozepto: Ah, post of a job run in periodic pipeline ;)10:04
yoctozeptoindeed :-)10:04
AJaegeryoctozepto: I confused post-job with post pipeline ;(10:04
yoctozeptothat ambiguity is confusing10:04
yoctozeptoyeah, figured10:05
* hrw rebuilds one set of images to check space need.10:05
yoctozeptono problem10:05
AJaegeryoctozepto: so, no, you can only semaphore whole job10:05
yoctozepto"Semaphores, like most configuration items, are unique within a tenant"10:05
hrwas cleaning of not needed ones was wrong approach10:05
yoctozeptoso we should prefix it with kolla-10:05
AJaegeryes, please10:05
yoctozeptoAJaeger: yeah, read the docs10:05
yoctozeptothanks for the hints10:05
yoctozeptohrw: xD10:05
*** mugsie has joined #openstack-infra10:05
yoctozeptoI will prepare the semaphore change in case we want to use it there10:06
yoctozeptoAJaeger: any idea whether other projects are also pushing something to dockerhub?10:06
AJaegerso, I see images are around 350 MB - are you pushing 250 images per job - or 24 jobs, each pushes 10 images?10:07
hrwyoctozepto: thanks10:07
hrwAJaeger: each 25010:07
*** markvoelker has joined #openstack-infra10:07
hrw~25010:07
yoctozeptoif we could ask them to be nice to us10:07
yoctozeptoAJaeger: but layers are pushed in an optimized way10:07
AJaegerThat sounds insane - 24*250*350 MB. Guess it's more that kolla needs to be nice ;)10:07
yoctozeptoand they share lots of layers10:07
hrwthey can also answer "please run registry.openstack.org and point users there"10:08
*** yamamoto has quit IRC10:08
yoctozeptoyeah, we would be happy to get local registry at some point10:08
AJaeger24*250*350 gives me 2 TB ;(10:08
yoctozeptoAJaeger: I already told you they share layers ;p10:08
yoctozeptothis computation is wrong in the 250*350 part10:09
yoctozeptohrw is working on estimation10:09
AJaegerok10:09
hrwgive me ~30 minutes to build10:09
yoctozeptoalso, binary have less images10:09
hrwthe good part is: we do not even try to publish arm64 images ;D10:10
yoctozeptoAJaeger: re other projects - are you aware of any doing pushing to dockerhub?10:10
hrwyoctozepto: loci may. but they have small set10:10
yoctozeptohrw: it would need much more thought10:10
hrwyoctozepto: arm64 would need lot of work outside of openstack10:11
hrwwheel uploads etc10:11
donnydianw: so before we had 100% failure rates  on FN10:11
donnydThose look like passes to me10:11
*** markvoelker has quit IRC10:12
*** yamamoto has joined #openstack-infra10:12
AJaegeryoctozepto: you might want to check via codesearch.openstack.org10:12
AJaegeryoctozepto, hrw, are you building kubernetes as well as part of this? Aren't there images you can use?10:13
*** yamamoto has quit IRC10:13
AJaegerfound on master "found 293 Dockerfiles" - wow10:13
*** jamesmcarthur has joined #openstack-infra10:15
hrwAJaeger: good point is that we are slowly working on deprecation or defining level of support for them. but still would build all probably10:16
openstackgerritMerged openstack/diskimage-builder master: Update redhat-common pkg-map for centos 8  https://review.opendev.org/68444910:16
AJaegerhrw: everything both as source and binary?10:17
* AJaeger will be back later10:17
hrwand for 3 distros as base10:17
AJaegerbut why build those support things both as source and binary? Do you really need everything both? Anything you can optimize?10:18
hrwI think that this is something our project have to reconsider one day10:19
*** jamesmcarthur has quit IRC10:19
donnydThe only job that ran on FN was the centos build, but it was still failing before10:21
*** prometheanfire has quit IRC10:22
*** ociuhandu has quit IRC10:22
hrwfsck. my devmachine hang10:22
*** ociuhandu has joined #openstack-infra10:22
*** prometheanfire has joined #openstack-infra10:23
*** factor has joined #openstack-infra10:26
* hrw out - bbl10:31
*** markvoelker has joined #openstack-infra10:35
*** markvoelker has quit IRC10:39
*** panda|bbl is now known as panda10:39
*** yamamoto has joined #openstack-infra10:41
*** soniya29 has quit IRC10:42
*** yamamoto has quit IRC10:43
*** kmarc has quit IRC10:46
*** kmarc has joined #openstack-infra10:46
*** jamesmcarthur has joined #openstack-infra10:47
*** ociuhandu has quit IRC10:47
*** ociuhandu has joined #openstack-infra10:48
*** jamesmcarthur has quit IRC10:51
*** ociuhandu has quit IRC10:52
*** dchen has quit IRC10:52
*** ociuhandu has joined #openstack-infra10:52
*** markvoelker has joined #openstack-infra10:53
*** yamamoto has joined #openstack-infra10:54
*** jaosorior has joined #openstack-infra10:54
hrwset of centos/binary images (223 images + centos:7 + some dangling ones) is 16GB10:57
*** markvoelker has quit IRC10:57
*** jpena is now known as jpena|lunch11:00
*** yamamoto has quit IRC11:00
*** kmarc has quit IRC11:01
*** markvoelker has joined #openstack-infra11:02
*** kmarc has joined #openstack-infra11:03
*** yamamoto has joined #openstack-infra11:03
*** markvoelker has quit IRC11:06
*** ociuhandu has quit IRC11:07
*** soniya29 has joined #openstack-infra11:08
*** ociuhandu has joined #openstack-infra11:08
*** jbadiapa has quit IRC11:09
yoctozeptoAJaeger: https://review.opendev.org/68656911:11
*** markvoelker has joined #openstack-infra11:11
*** ociuhandu has quit IRC11:12
openstackgerritMonty Taylor proposed zuul/zuul-registry master: WIP Consume typing from openstacksdk and keystoneauth  https://review.opendev.org/68640311:12
*** ociuhandu has joined #openstack-infra11:14
*** zbr|ruck is now known as zbr|lunch11:15
*** markvoelker has quit IRC11:16
yoctozeptoAJaeger: about that semaphore - if I then want to change the number of concurrent jobs11:17
yoctozeptoand have this semaphore in many branches11:17
yoctozeptohow would that work11:18
yoctozeptowould it prevent me from updating it?11:18
*** ociuhandu has quit IRC11:18
donnydhow much would it be to ask zuul if we could get the node provider where the build ran on the summary screen? Not sure if I am asking for a lot or a little11:19
*** markvoelker has joined #openstack-infra11:21
*** ociuhandu has joined #openstack-infra11:25
*** markvoelker has quit IRC11:25
*** yamamoto has quit IRC11:36
*** markvoelker has joined #openstack-infra11:39
*** panda is now known as panda|eat11:43
*** markvoelker has quit IRC11:44
*** ccamacho has quit IRC11:45
*** jamesmcarthur has joined #openstack-infra11:48
*** pgaxatte has quit IRC11:48
*** markvoelker has joined #openstack-infra11:48
*** markvoelker has quit IRC11:49
*** markvoelker has joined #openstack-infra11:49
*** yamamoto has joined #openstack-infra11:51
*** lucasagomes has joined #openstack-infra11:52
*** EmilienM has quit IRC11:53
*** jamesmcarthur has quit IRC11:53
*** EmilienM has joined #openstack-infra11:54
*** ociuhandu has quit IRC11:55
AJaegeryoctozepto: I think it's best define the semaphore in a repo that is not branched - let's see what others say. Fallback is project-config...11:56
*** ociuhandu has joined #openstack-infra11:57
*** ociuhandu has quit IRC11:59
*** ociuhandu has joined #openstack-infra11:59
*** yamamoto has quit IRC12:00
*** jpena|lunch is now known as jpena12:00
*** ociuhandu has quit IRC12:03
*** ociuhandu has joined #openstack-infra12:04
yoctozeptoAJaeger: ok, replied12:06
*** kukacz has joined #openstack-infra12:09
*** ociuhandu has quit IRC12:09
*** spsurya has quit IRC12:10
*** zzZZzzetuid is now known as setuid12:10
*** pgaxatte has joined #openstack-infra12:14
*** jamesmcarthur has joined #openstack-infra12:15
*** udesale has quit IRC12:16
*** udesale has joined #openstack-infra12:16
*** goldyfruit_ has quit IRC12:22
*** rh-jelabarre has joined #openstack-infra12:23
*** signed8b_ has quit IRC12:24
*** rfolco has quit IRC12:24
*** rfolco has joined #openstack-infra12:25
openstackgerritMonty Taylor proposed openstack/openstack-zuul-jobs master: Remove a reference to cgit from comment  https://review.opendev.org/68658612:26
*** yamamoto has joined #openstack-infra12:30
yoctozeptoAJaeger: it could be nice to get a branchless repo for kolla for sure12:31
yoctozeptomgoddard, hrw, mnasiadka: ^12:31
mnasiadkayou mean a separate repo for some of the CI stuff?12:32
AJaegeryoctozepto: it's not worth doing it for this yet...12:32
mgoddardyoctozepto: a repo just for this semaphore?12:32
mgoddardyoctozepto: before we go down this path, are we sure it's the right solution?12:34
mgoddardwhat is the problem we are trying to solve?12:34
yoctozeptohmm, yeah, for now just semaphore but the same collision exists for nodesets, though we don't really need to change them hmm12:34
*** rlandy has joined #openstack-infra12:34
*** ociuhandu has joined #openstack-infra12:34
yoctozeptoAJaeger: how can we estimate the effect of semaphore without waiting 24h for each test?12:35
*** nhicher has quit IRC12:35
*** nhicher has joined #openstack-infra12:36
yoctozeptomgoddard: we are trying to solve 'oh why are 2 hours not enough to push those images'12:37
AJaegeryoctozepto: you have to wait...12:37
yoctozeptoso we must backport this to all the branches and then wait12:37
mgoddardyoctozepto: what is the difference between a normal build job vs. publish?12:37
yoctozeptoand then get issues12:37
yoctozepto:D12:37
*** panda|eat is now known as panda12:37
mgoddard(in time)12:37
yoctozeptomgoddard: 2 hours of pushing not enough12:38
yoctozeptowe have 2 hour limit on the push12:38
yoctozeptoand this timeouts12:38
mgoddardyoctozepto: we are definitely not building during that time?12:38
yoctozeptohrw knows the details as well, I have to leave12:38
yoctozeptomgoddard: definitely12:38
yoctozeptothis is POST12:38
yoctozeptotill later12:38
*** whoami-rajat has quit IRC12:38
hrwmgoddard: building is 20 minutes, then pushing12:39
*** xek_ has quit IRC12:39
mgoddardhrw: it definitely takes longer than 20 minutes to build kolla images12:39
*** xek_ has joined #openstack-infra12:40
*** ociuhandu has quit IRC12:40
*** markvoelker has quit IRC12:41
hrwmgoddard: https://cb91710b8557a2f6220d-98f49207ca8d4e649788d064c2e22814.ssl.cf2.rackcdn.com/periodic/opendev.org/openstack/kolla/master/kolla-publish-debian-source/747d878/job-output.txt12:41
mgoddardhrw: yeah, looking at one now12:41
*** xek_ has quit IRC12:42
mgoddardhttps://openstack.fortnebula.com:13808/v1/AUTH_e8fd161dc34c421a979a9e6421f823e9/zuul_opendev_logs_b5e/periodic-stable/opendev.org/openstack/kolla/stable/queens/kolla-publish-centos-source/b5e20fb/job-output.txt12:42
*** xek_ has joined #openstack-infra12:42
mgoddardthat one takes 1h20 to push12:42
*** whoami-rajat has joined #openstack-infra12:42
hrwmgoddard: some layers were already present at hub probably so they were reused12:43
mgoddardok12:43
*** xek_ has quit IRC12:43
hrwI am swowly thinking of building those images locally, pushing to hub, waiting for periodic to push own, remove my ones12:44
*** xek_ has joined #openstack-infra12:44
mgoddardI imagine it depends a lot on which cloud provider12:44
hrwso it would reuse layers present on hub probably12:44
mgoddardhrw: I don't think you have the key for kolla docker namespace12:44
*** kukacz has quit IRC12:44
fungiyes, network bandwidth availability is one of our biggest variables between providers12:44
hrwmgoddard: right. no shared layers between users12:44
*** kukacz has joined #openstack-infra12:45
AJaegerfungi: and they are pushing 297 images (on master) for each job - and have 5 jobs run in parallel on master and 4 jobs for each stable branch...12:47
AJaegerfungi: that's why I proposed to experiment some rate limit - using a semaphore12:47
*** jamesmcarthur has quit IRC12:47
*** jamesmcarthur has joined #openstack-infra12:47
AJaegermgoddard: as I said earlier: I suggest you reduce those image pushes as well, I see redis, kubernetes etc. build and pushed - are those really needed in both -source and -binary - or at all?12:48
mgoddardAJaeger: I was just looking at our image list12:48
mgoddardwe have deprecated some, but maybe we should just be bold and remove12:49
mordredmgoddard: boldness is rewarded12:49
* mgoddard deprecates openstack12:49
openstackgerritMerged openstack/openstack-zuul-jobs master: Remove a reference to cgit from comment  https://review.opendev.org/68658612:49
mgoddardtoo bold?12:50
hrwmgoddard: we should remove 'base' one - no one is using us.12:50
*** yamamoto has quit IRC12:50
hrwmgoddard: we should remove 'base' one - no one is using it12:50
mgoddardtrue12:50
hrwsorry, but I have to go handle some out of computer things12:50
hrwbbl12:50
*** soniya29 has quit IRC12:51
fungialso have you evaluated whether too much is being included in some of the images? 16gb seems a bit much for a minimal operating system, openstack python projects and their dependencies. but i've really never tried to measure it12:52
fungiwhich layers are the largest?12:53
*** vesper11 has quit IRC12:53
mgoddardwe have done a few iterations of slimming down images, never with a huge amount of success12:53
mgoddardwe could try again in Ussuri12:53
mgoddardI looked at multi-stage builds for source images, shaved a bit off12:54
mnaseris 16gb uploaded .. on every job?12:54
mgoddardmnaser: per-day12:54
AJaegermnaser: that's my understanding.12:54
mnaseroh ack12:54
mgoddardpublisher is periodic12:54
*** vesper11 has joined #openstack-infra12:54
mnaserthat might be expensive over time :)12:54
mgoddardcould be12:55
AJaegerSo, every periodic job - and they run 5 on master, 4 on stable branches which gives 24 runs, so 384 GB per day12:55
mgoddardthere may be shared layers12:55
mgoddardwe've started a support matrix: https://storage.bhs1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_606/677500/18/check/openstack-tox-docs/606bd6a/docs/support_matrix.html12:56
AJaegermgoddard, my understanding is that the 16 GB is without shared layers - otherwise you have much more (~300 images times 350 MB gives 1 TB)12:56
*** udesale has quit IRC12:56
mgoddardplan is to use it to help trim down our image list12:56
mgoddardand also categorise images12:56
mgoddardwe could look at a less frequent publishing cycle for our non-core images12:57
AJaegermgoddard: that is wrong, you build almanach (first line) in both binary and source!12:57
mgoddardAJaeger: I don't think so: https://opendev.org/openstack/kolla/src/branch/master/docker/almanach/almanach-base/Dockerfile.j2#L1012:58
*** ramishra has quit IRC12:58
*** markvoelker has joined #openstack-infra12:58
*** vesper11 has quit IRC12:59
AJaegermgoddard: http://zuul.opendev.org/t/openstack/build/78864ef8b2514e359e5e5fca858d9189/log/job-output.txt#61267 is ubuntu-binary13:00
AJaegermgoddard: http://zuul.opendev.org/t/openstack/build/3a30769c75cb4c8c83845831f5125e97/log/job-output.txt#70310 is ubuntu-source13:00
AJaegerAren't both building?13:00
AJaegermgoddard: oh, I see the skipping now, so you start building and stop directly. confusing...13:01
mgoddardyes13:01
mgoddardfixed in master13:02
*** rlandy is now known as rlandy|mtg13:02
mgoddardwe could EOL pike...13:03
*** ociuhandu has joined #openstack-infra13:03
mgoddardor at least stop publishing images13:03
cgoncalveshey folks. octavia tempest jobs are not being triggered in a stable/train patch: https://review.opendev.org/#/c/686541/13:04
cgoncalveswhat am I missing?13:04
mnasiadkamgoddard: +113:04
AJaegermgoddard: you have images in both binary and source like cron, is that correct?13:05
mgoddardAJaeger: correct13:05
AJaegermgoddard: could you explain me why, please?13:05
AJaegercgoncalves: the release team needs to branch devstack first to be able to do this13:05
mgoddardAJaeger: because it means I can just set the build type to binary/source and build all images I want13:05
cgoncalvesAJaeger, understood. thank you13:06
AJaegercgoncalves: that happens at later time, so for now we depend on good testing on master13:06
AJaegermgoddard: my suggestion: Build them only once in the periodic job and not twice.13:06
AJaegeralmanach for example is only binary as you said - why not go through with it?13:07
mgoddardAJaeger: sorry, I don't follow13:07
AJaegerlooking at your documentation, it even says "The binary means that OpenStack will be installed from apt/yum. And the source means that OpenStack will be installed from source code. "13:07
AJaegerbut for cron you have no source code, so why build such a container in the source periodic job?13:07
mgoddardfair point13:08
*** roman_g has joined #openstack-infra13:08
mgoddardhaving a cron image labelled as source makes for a better user experience - they can build all images with one type, and set one variable for the type to deploy13:09
mgoddardwhat might work is if we made those images essentially the same underneath, then tag twice in one job13:09
*** ociuhandu has quit IRC13:09
mgoddardi.e. a base layer that is not dependent on source vs binary13:09
*** psachin has joined #openstack-infra13:10
AJaegermgoddard: but that's not what's in the docs - and not what you do, there are other examples where you only build in binary but not in source. I don't see a consistency here13:10
fungiwould calling those undifferetiated images something like "independent" instead of either binary or source make for a reasonable user experience?13:10
mnaserinfra-root: been doing some clean up on things in sjc1 for infra and looks like we're peaking ~50vms13:10
mnasersorry took a while13:10
fungithanks mnaser!!!13:10
mgoddardAJaeger: perhaps that's true13:11
AJaegerfungi, so, having three sets: binary OpenStack, source OpenStack, binary Support (your independent)13:11
mgoddardit's not a bad idea, and ties into our thinking around categorising images13:12
mgoddardI will add this as a topic of conversation for our next meeting, and we'll aim to improve the situation in Ussuri13:14
AJaegerSo, my suggestion: Figure out how you can reduce those ~ 16 GB per job to be able to push the container images to dockerhub Bandwidth is not unlimited (neither with our providers nor with dockerhub)13:16
AJaegermgoddard: thanks!13:16
*** mriedem has joined #openstack-infra13:18
*** xek__ has joined #openstack-infra13:19
*** xek_ has quit IRC13:21
openstackgerritMerged openstack/project-config master: Override security mirror for Debian image builds  https://review.opendev.org/68620413:25
*** dpawlik has quit IRC13:25
*** zbr|lunch is now known as zbr|ruck13:25
*** yamamoto has joined #openstack-infra13:26
*** vesper11 has joined #openstack-infra13:27
*** vesper11 has quit IRC13:31
*** vesper11 has joined #openstack-infra13:32
*** SurajPatil has joined #openstack-infra13:34
*** yamamoto has quit IRC13:36
*** surpatil has quit IRC13:36
*** goldyfruit_ has joined #openstack-infra13:40
*** yamamoto has joined #openstack-infra13:57
*** gmann_afk is now known as gmann13:58
*** ociuhandu has joined #openstack-infra14:02
*** woojay has joined #openstack-infra14:15
*** goldyfruit___ has joined #openstack-infra14:16
yoctozeptoAJaeger, fungi, mgoddard: yeah, we had those ideas already but they tend to come and go as their issues do14:17
yoctozeptowe are getting more mature each release :-)14:18
*** goldyfruit_ has quit IRC14:19
*** surpatil has joined #openstack-infra14:24
*** ricolin_ has joined #openstack-infra14:25
*** SurajPatil has quit IRC14:26
*** ricolin has quit IRC14:26
*** rlandy|mtg is now known as rlandy14:32
*** jbadiapa has joined #openstack-infra14:35
*** goldyfruit_ has joined #openstack-infra14:36
*** odicha has quit IRC14:38
openstackgerritMerged openstack/project-config master: python-dracclient zuulv3 changes  https://review.opendev.org/68640914:39
*** goldyfruit___ has quit IRC14:39
openstackgerritFabien Boucher proposed zuul/zuul master: Gitlab - Basic handling of merge_requests event  https://review.opendev.org/68599014:39
openstackgerritTristan Cacqueray proposed zuul/nodepool master: Switch to fedora-30 for the openshift integration job  https://review.opendev.org/68673714:40
clarkbwhen this kolla stuff has come up in the past it wasmy understanding that for the source images all of the build objects remain in the images as do the build deps14:40
clarkbthis is one reason for their large size14:41
clarkbbuilding in a separate image andcopying to the image you want to publish is how we deal with that on infra and zuul images14:41
paladoxcorvus fungi https://gerrit-review.googlesource.com/c/gerrit/+/239436 is the deadlock workaround!14:42
*** ricolin_ has quit IRC14:49
*** ociuhandu has quit IRC14:51
*** ociuhandu has joined #openstack-infra14:52
*** bnemec is now known as beekneemech14:54
*** ociuhandu has quit IRC14:56
*** chandankumar is now known as raukadah14:57
*** xenos76 has quit IRC14:58
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Fix merge error in streaming support  https://review.opendev.org/68650515:00
*** ociuhandu has joined #openstack-infra15:04
*** arxcruz|ruck is now known as arxcruz|rover15:08
*** jamesmcarthur has quit IRC15:09
*** ociuhandu has quit IRC15:09
openstackgerritMerged zuul/zuul-registry master: Fix merge error in streaming support  https://review.opendev.org/68650515:11
*** jamesmcarthur has joined #openstack-infra15:11
*** iurygregory has quit IRC15:12
corvusinfra-root: i have 2 production changes that require babysitting which i would like to do today: https://review.opendev.org/686049  https://review.opendev.org/686298  they both could use more +2s15:14
*** ykarel is now known as ykarel|afk15:15
* prometheanfire is trying to find where the redirects for releases.openstack.org/constraints/upper/BRANCH_NAME are set15:15
* prometheanfire wants to add ussuri since reqs is branching15:15
clarkbprometheanfire: I think it is an htaccess file in the releases repo15:16
mordredcorvus: those are both awesome changes15:18
corvusi'm excited about them15:18
*** donnyd is now known as donnyd_afk15:19
prometheanfirethat could be15:19
*** gyee has joined #openstack-infra15:22
*** mattw4 has joined #openstack-infra15:22
*** ociuhandu has joined #openstack-infra15:23
fungiprometheanfire: https://opendev.org/openstack/releases/src/branch/master/doc/source/_templates/htaccess15:25
prometheanfirefungi: yep15:26
prometheanfirecontroled by openstack_releases/_redirections.py15:26
*** pkopec has quit IRC15:28
*** kopecmartin is now known as kopecmartin|off15:30
openstackgerritTristan Cacqueray proposed opendev/glean master: Import ipv6 configuration done by cloud-init  https://review.opendev.org/68674915:38
*** jaosorior has quit IRC15:41
fungimordred: https://docs.python.org/3.9/whatsnew/3.8.html#new-modules looks interesting from a pbr perspective15:42
clarkbfungi: I tested that with osc15:42
fungioh?15:43
clarkbit works but is slower on python2 and only slightly quicker with python315:43
fungiahh15:43
clarkbthere is a pypi backport for the other python versions15:43
clarkbI think that means we should definitely switch to it from pkg_resources once python2 is no longer supported15:43
clarkbbut it won't be a major improvement performance wise15:43
fungii wonder if the one in stdlib performs any better or if it's just the same routines15:43
clarkbaiui its the same15:44
fungiregardless, it does reduce the required dependencies for 3.8 onward15:44
clarkbthe major underlying problem is the disk scan and version sort15:44
clarkbwhihc the new thing keeps from pkg_resources15:44
clarkbit just does it slightly more efficiently15:44
*** rpittau is now known as rpittau|afk15:45
openstackgerritTristan Cacqueray proposed opendev/glean master: Import ipv6 configuration done by cloud-init  https://review.opendev.org/68674915:46
*** kjackal has quit IRC15:47
*** ociuhandu has quit IRC15:47
*** ociuhandu has joined #openstack-infra15:48
*** rlandy is now known as rlandy|brb15:49
*** piotrowskim has quit IRC15:55
*** ociuhandu has quit IRC15:56
corvusis it known and expected that the arm64 images haven't updated in 37+ days?15:57
clarkbif it is the buster image yes, if it is the other arm64 images then no15:58
corvusit's buster, stretch, bionic and xenial15:58
*** SurajPatil has joined #openstack-infra15:58
corvushttp://paste.openstack.org/show/781093/15:59
*** xarses has joined #openstack-infra15:59
*** adriant has quit IRC15:59
*** iokiwi has quit IRC15:59
*** xarses has quit IRC15:59
*** andreww has quit IRC15:59
*** xarses has joined #openstack-infra16:00
clarkblooks like the builder has filled its disk16:00
clarkb(buster is broken because its mirror is broken iirc)16:00
*** pgaxatte has quit IRC16:01
*** donnyd_afk is now known as donnyd16:01
*** surpatil has quit IRC16:01
*** cmurphy is now known as cmorpheus16:01
corvusdoes that cause uploads to fail?16:01
corvusi see connection timeouts to arm64ci-nrt116:02
*** iokiwi has joined #openstack-infra16:02
clarkbthat cloud doesn't exist anymore and I guess the cleanups for it never merged16:02
*** xarses_ has joined #openstack-infra16:02
corvusokay, do linaro-cn1 and linaro-london exist?16:03
clarkblondon does. cn1 was in a state of flux at one point I'm not fully up to date on its status16:03
corvuswho would know?16:04
*** xarses has quit IRC16:04
clarkbianw is typically most up to date on that.16:04
clarkbhrw: may also know16:04
*** tosky has quit IRC16:05
corvusi'll clean up the disk16:06
*** jamesmcarthur has quit IRC16:06
*** ykarel|afk is now known as ykarel16:07
corvusrm: cannot remove 'dib_build.QolQCiqi/mnt/proc/18/net/wireless': Operation not permitted16:09
corvuswhat's that about ^?16:10
AJaegercorvus: /proc mounted in chroot?16:10
clarkbAJaeger: ya  Ithink this is related to dib leaking mounts when builds fail in some cases16:10
AJaegercorvus: you better umount what in the chroot16:10
openstackgerritMerged opendev/system-config master: Switch the insecure-ci-registry to zuul-registry  https://review.opendev.org/68629816:10
openstackgerritMerged opendev/system-config master: Use the gerrit driver for the googlesource connection  https://review.opendev.org/68604916:10
clarkbwhat I've done in the past is stop and disable the builder servers, reboot, rm those files, enable the service, reboot16:10
corvusoh yep, i just missed it in the mount output16:11
clarkb(rebooting is just easy mode clear out stale mounts)16:11
*** jamesmcarthur has joined #openstack-infra16:11
*** SurajPatil has quit IRC16:12
openstackgerritJames E. Blair proposed openstack/project-config master: Remove arm64ci-nrt1  (1/2)  https://review.opendev.org/68676016:13
openstackgerritJames E. Blair proposed openstack/project-config master: Remove arm64ci-nrt1 (2/2)  https://review.opendev.org/68676116:13
*** mriedem is now known as mriedem_lunch16:14
*** yamamoto has quit IRC16:15
openstackgerritJames E. Blair proposed opendev/system-config master: Remove arm64ci-nrt1 (3/2)  https://review.opendev.org/68676316:15
* corvus is bad at fractions16:15
clarkb3/2 is a valid fraction16:16
openstackgerritJames E. Blair proposed opendev/system-config master: Remove arm64ci-nrt1 (3/2)  https://review.opendev.org/68676316:16
AJaegercorvus: I think you missed one line, could you double check https://opendev.org/opendev/system-config/src/branch/master/inventory/groups.yaml#L176 , please?16:18
corvusAJaeger: ah yeah, i was grepping for nrt116:19
AJaegercorvus: I checked codesearch for arm64ci16:19
openstackgerritJames E. Blair proposed opendev/system-config master: Remove arm64ci (3/2)  https://review.opendev.org/68676316:20
*** jbadiapa has quit IRC16:21
corvus#status log nb03 dib_tmp partition was full; cleaned and restarted16:22
openstackstatuscorvus: finished logging16:22
corvusAJaeger: it looks like we should wait a little longer to merge the git cache directory change since these images are out of date16:23
*** ykarel has quit IRC16:24
hrwclarkb, corvus: linaro-cn1 is no more16:24
hrwlinaro-london is the only linaro cloud now16:24
*** michael-beaver has joined #openstack-infra16:24
openstackgerritTristan Cacqueray proposed opendev/glean master: Import ipv6 configuration done by cloud-init for Red Hat distro  https://review.opendev.org/68674916:25
AJaegercorvus: fun ;( Thanks for checking16:26
AJaegercorvus: I don't find linaro-cn1 in codesearch, so that one seems removed cleanly - or did you find anything?16:28
AJaegerWe still have : https://opendev.org/opendev/system-config/src/branch/master/playbooks/templates/clouds/bridge_all_clouds.yaml.j2#L15716:28
AJaegercorvus: want to put that on top of your 3/2 change? ;)16:29
hrwI can send patch to gerrit if you want16:30
mordredclarkb: 3/0 isn't16:30
corvushrw: that's okay, i'll go ahead and take care of it, thanks!16:31
hrwok16:31
mordredcorvus: maybe number that patch 4/216:31
*** rlandy|brb is now known as rlandy16:31
*** goldyfruit_ has quit IRC16:32
corvusmordred: i was really tempted16:32
openstackgerritJames E. Blair proposed opendev/system-config master: Remove linaro-cn1  https://review.opendev.org/68677016:32
*** e0ne has quit IRC16:32
corvusbut it doesn't relate to the others so it'd be confusing16:33
hrwcorvus: so is there any other arm64 one than linaro-london now?16:34
corvushrw: no, that looks to be the last16:34
hrwok16:35
openstackgerritMerged openstack/project-config master: Remove arm64ci-nrt1  (1/2)  https://review.opendev.org/68676016:35
hrwthanks for info16:35
hrwI hope that situation will improve16:36
corvushrw: ++16:36
corvusclarkb, AJaeger, mordred: unfortunately it looks like cn1 was removed from the nodepool config without letting nodepool clean up the images first16:36
corvusthat means there is no nodepool server that can delete them from zk16:37
hrwcorvus: sorry for that16:37
mordredcorvus: bother16:37
hrwcn1 was pain in the ass anyway16:37
*** roman_g has quit IRC16:37
corvushrw: it's not your fault -- it's something the infra-root folks should have caught in review16:37
mordredyeah16:37
mordredthat's our bad - not yours16:38
corvusso we actually have to add it back16:38
hrw;)16:38
clarkbcorvus: will that work for cleaning up zk even if the cloud does not exist?16:38
corvusclarkb: it's supposed to.  let's verify that works for arm64 before we go to the trouble.16:38
clarkb++16:39
fungilooks like 649773 is where we retired it back in april16:39
openstackgerritTristan Cacqueray proposed opendev/glean master: Set Type=Ethernet for RedHat system  https://review.opendev.org/68647416:39
openstackgerritTristan Cacqueray proposed opendev/glean master: Import ipv6 configuration done by cloud-init for Red Hat distro  https://review.opendev.org/68674916:39
*** goldyfruit_ has joined #openstack-infra16:39
*** zbr|ruck has quit IRC16:39
Shrewsoops. yeah, supposed to set max-servers to 0 first before removing the provider, as documented at: https://zuul-ci.org/docs/nodepool/operation.html#removing-from-the-launcher16:41
Shrewsbut easy to forget16:41
corvusShrews: i think the nodes are okay, but we still have image records16:41
*** lpetrut has quit IRC16:41
corvusbut same thing16:41
openstackgerritTristan Cacqueray proposed opendev/glean master: Import ipv6 configuration done by cloud-init for Red Hat distro  https://review.opendev.org/68674916:41
corvusset images to []16:41
clarkbtristanC: re the type ethernet, that is specific to static configurations. Did you confirm that static configurations don't use the type setting?16:42
Shrewscorvus: ah yeah. that's covered in that doc too.  i wonder if we could programmatically check for those changes and prevent them?16:43
*** dpawlik has joined #openstack-infra16:44
*** xenos76 has joined #openstack-infra16:44
openstackgerritMerged zuul/zuul master: web: render log manifest consistently  https://review.opendev.org/68630716:44
*** goldyfruit_ has quit IRC16:46
*** gfidente has quit IRC16:49
fungiShrews: we'd need a job which analyzed the diff within the change itself i expect. i guess you could load the configuration from both HEAD and HEAD^1 and make sure that any provider missing from HEAD only had max-servers:0 and images:[] in HEAD^116:49
Shrewsfungi: i was thinking more within nodepool itself16:50
fungioh, got it16:50
fungiduring configuration reload?16:50
*** zbr has joined #openstack-infra16:50
Shrewsyeah, but not sure how useful that would be, tbh. unless someone is actively monitoring logs to see the warning we would issue, it might go unnoticed16:50
fungiso if you see a provider disappear then first treat it as if you went to max-servers:0 and images:[] on the provider and then... wait for that cleanup to complete?16:51
Shrewsfungi: that's an option i hadn't thought of16:51
fungiand defer replacing the configuration until it's all empty16:51
*** yamamoto has joined #openstack-infra16:51
corvusit would be good to have a warning, but we also don't want to make it impossible16:51
corvusfungi: hrm... that could work, but it could also surprise folks who just want nodepool to ignore a provider for a bit16:52
fungiyep16:52
corvus(that's not how we do it, but i could see someone doing that)16:52
Shrewsyeah16:52
fungimaybe a provider toggle for disable:true or something?16:52
fungiwhich might be more convenient for people who want to temporarily disable a provider than removing it from configuration16:53
corvusfungi: we have paused16:53
fungioh, right, seems like that already would meet their use case then16:53
*** ykarel has joined #openstack-infra16:53
fungianyway, more of a discussion for #zuul16:54
corvustdasilva: fyi i'm about to switch out the opendev intermediate image registry and there could be a brief outage that could cause image build jobs to fail -- i mention it because swift is a pretty heavy user.16:59
clarkbcorvus: re your changes you listd earlier today, one of them involves a zuul scheduler restart right?17:00
mordredcorvus: swift is a pretty heavy user of the image registry - and the image registry is a pretty heavy user of swift17:00
corvusmordred: it's a virtuous cycle17:00
clarkbthat should pick up the url linking bug fix in the dashboard17:00
corvusclarkb: yep17:01
clarkbk, let me know if I can help (I think you got the reviews you needed already)17:01
*** yamamoto has quit IRC17:02
clarkbhttps://review.opendev.org/#/c/686237/ is anothe rchange to zuul managementwhich we might want to get in17:05
clarkbone thought is that could fix issues like the include role problem in persistent-firewall if we keep ansible up to date17:05
*** derekh has quit IRC17:06
corvuswe may see more job failures when point releases happen.  but we can merge it and see17:06
clarkbdue to files moving? thats a good point. Hrm17:06
clarkbmaybe we can add manage-ansible -u to our zuul-executor restart playbook?17:06
clarkbwe do tend to update zuul often enough and we shouldn't introduce failures (that wouldn't happen otherwise) at that point17:07
corvusclarkb: yeah, that's a good idea17:07
*** jpena is now known as jpena|off17:08
openstackgerritGhanshyam Mann proposed openstack/devstack-gate master: Update grenade settings for stable/train  https://review.opendev.org/68677417:08
*** ociuhandu has joined #openstack-infra17:09
tdasilvacorvus: thanks for the heads up (cc: timburke)17:11
openstackgerritClark Boylan proposed opendev/system-config master: Update zuul ansible installs when restarting zuul  https://review.opendev.org/68677517:13
clarkbcorvus: ^ something like that17:13
corvusproblem with the registry17:13
corvus/usr/local/bin/python: error while loading shared libraries: libpython3.7m.so.1.0: cannot open shared object file: No such file or directory17:13
corvusthat's what i see in docker logs17:13
corvusi'm very confused and am starting to think i don't understand how containers work17:13
corvusit's in a restart loop because of that17:14
corvusclarkb, fungi, mordred: ^ any ideas?17:14
clarkbis /usr/local/bin/python a python2 maybe?17:14
clarkband we've somehow mixed up package built for python3 (links against it) with python2 container?17:14
mordredshouldn't be17:14
corvus(this is a container image that ran a test in ci)17:15
*** lpetrut has joined #openstack-infra17:15
mordredcorvus: can you boot a bash shell with the same image?17:15
corvusmordred: yes17:16
openstackgerritGhanshyam Mann proposed openstack/devstack-gate master: Add gate job for stable/train  https://review.opendev.org/68677617:16
corvus'python' gives me a repl and is 3.7.317:16
mordreduhm17:16
corvus"zuul-registry --help" works17:16
corvusi'm going to try some voodo docker-compose down/up17:17
mordredkk17:18
mordredwe're not doing anything with docker-compose that should be doing anything17:18
corvusno joy, same error17:18
mordredcorvus: is the command docker-compose is running to launch the docker logged anywhere?17:20
corvusmordred: i don't know, but 'docker inspect registrydocker_registry_1' may answer similar questions17:21
corvusmordred: if we want i can down/up with --verbose17:21
corvusi'll do that17:21
mordredcorvus: le's try that17:21
clarkbother idea: we built the package on old version of python so then the imgae we copy pcakge into updated with newer so and the file hsa moved?17:22
corvusmordred: /tmp/log17:22
corvusclarkb: why does 'zuul-registry --help' not show the problem then?17:22
clarkbit might not call into anything that needs the linking (though ldd should check al lthat upfront iirc)17:23
corvusa difference between the ci test and this is the use of swift17:24
corvuswhy don't i try setting up the actual configuration and running it from my container17:24
mordredlibpython3.7m.so.1.0 => /usr/local/lib/libpython3.7m.so.1.0 (0x00007f7036e69000)17:24
mordredthat's the ldd of /usr/local/bin/python in the container17:25
clarkbmordred: and does that file exist?17:25
mordredit's almost like something is blanking an env setting or something and /usr/local/lib isn't in the LD_LIBRARY_PATH17:25
mordredyes17:25
corvuswait /var/registry/etc:/etc17:25
mordredand I agree with corvus - when I boot a shell in the container ....17:25
corvusthat would do it17:25
mordredoh17:25
mordredyeah17:26
mordredld.so.conf would get blown away17:26
clarkboh clearing out the ldd config17:26
mordredyeah17:26
corvusokay, let me manually patch docker-compose.yaml and fix17:26
mordredkk17:26
mordredPHEW17:26
corvuswe can bind-mount in files, yeah?17:27
mordredyes17:27
corvusthat's just for clouds.yaml17:27
mordredah - which is probably why this didn't show in CI17:27
clarkbah yup since swift isn't used in ci17:28
corvusokay it's up17:29
mordredyay!17:29
fungistepped away at the wrong time, and i caught up just in time to see it solved17:29
corvusfungi: likely story :)17:29
mordredfungi: tl;dr - don't bind mount a mostly empty dir over top of /etc17:30
fungii got nerdsniped trying to find a good fudge recipe in my collection of ancient cookbooks17:30
clarkbfungi: did you have to test a bunch to determine which are good?17:31
funginot yet, that's this evening ;)17:31
*** jtomasek has quit IRC17:31
corvusi've totally done A/B testing on brownies17:31
mordredcorvus: there are tracebacks in the log for the registry container - are they expected?17:31
corvusmordred: yes, i'm hitting wrong urls17:32
fungireally most of the time waste was spent comparing my 1938 boston cooking school cookbook to the "modern" 1984 fanny farmer edition17:32
mordredcorvus: awesome17:32
fungiin the end though i think i've settled on the 1963 good housekeeping, so can get back to python17:33
*** psachin has quit IRC17:36
corvuslooking for changes to recheck17:37
corvusapparently all the registry changes are in merge conflict now17:40
openstackgerritJames E. Blair proposed zuul/zuul-registry master: DNM null commit for testing  https://review.opendev.org/68679117:40
corvusokay zuul-registry-build-image should upload to the intermediate registry17:41
openstackgerritJames E. Blair proposed zuul/zuul-registry master: DNM: second null commit for more testing  https://review.opendev.org/68679317:42
corvusand that (because the role is naive) should download from it17:42
*** lpetrut has quit IRC17:49
*** mriedem_lunch is now known as mriedem17:51
* paladox just ate dominos17:54
Shrewscorvus: could your changes be causing the POST_FAILURE on the nodepool-build-image job, by chance?17:55
fungiShrews: could it have tried to upload while stuff was down?17:55
Shrewsfungi: possible. i just returned from lunch. happened about 35min ago17:56
fungi~17:13-17:29z17:56
*** efried has quit IRC17:56
fungias best i can piece together from the discussion timeline17:56
corvusi may have also restarted it once after that....17:56
*** ykarel is now known as ykarel|away17:56
Shrewsseems likely then17:56
corvusso maybe throw a couple more minutes onto the end of that window.  sorry.17:57
fungimostly wanting to be sure any related failures we see aren't from after it is thought to have stabilized17:57
Shrewsno worries. just didn't want to get bogged down in chasing yet another random fail  :)17:57
corvusjust got a post failure on the z-r job17:57
corvusgah17:58
corvusansible overwrote my fix17:59
corvusrefixing and updating emergency file17:59
fungiso let's call it ~17:13-18:00z18:00
openstackgerritTristan Cacqueray proposed zuul/zuul-registry master: Add type annotations  https://review.opendev.org/68624918:02
openstackgerritTristan Cacqueray proposed zuul/zuul-registry master: Add support for skopeo copy  https://review.opendev.org/68680318:02
*** ociuhandu has quit IRC18:08
AJaegerthe bug for review.o.o links in zuul.opendev.org is not fixed - is that to be expected? I thought I read in backscroll it would be part of the deployment?18:08
*** ociuhandu has joined #openstack-infra18:08
clarkbAJaeger: it requires a zuul-scheduler restart which has not yet been done. Only the intermediate registry was restarted aiui18:09
AJaegerunderstood - thanks18:10
*** rfolco is now known as rfolco|bbl18:12
openstackgerritJames E. Blair proposed zuul/zuul-registry master: DNM null commit for testing  https://review.opendev.org/68679118:12
openstackgerritJames E. Blair proposed zuul/zuul-registry master: DNM: second null commit for more testing  https://review.opendev.org/68679318:13
corvusINFO:registry.api:Put manifest zuul/zuul-registry aa146c4af83743eeb805df6c5892ae6a_latest digest sha256:e6c35cd710660f41ba5c14a913d4beb7ecc608a055a169b1c9fc3ab709f1b14a18:22
corvusthat looks promising18:22
corvuscool, a simple image build job that pushes to the intermediate registry works18:23
corvusthat's the typical case -- the next thing is pulling from it, which we should see in a minute.18:23
*** jamesmcarthur has quit IRC18:25
corvus2 successful pushes18:27
corvusw00t http://zuul.opendev.org/t/zuul/stream/c7e9ca7514ab44dea61333831071deb6?logfile=console.log  just pulled an image18:31
corvusi think we're all set!18:32
AJaegercongrats!18:32
corvustimburke, tdasilva: the intermediate registry should be stable now.  we did have some hiccups rolling it out; image job errors from ~17:13-18:00z are likely due to that and can be rechecked.18:33
*** jamesmcarthur has joined #openstack-infra18:40
openstackgerritJames E. Blair proposed opendev/system-config master: Correct registry docker mount  https://review.opendev.org/68682318:44
corvusinfra-root: ^ matches what's running in prod and will let us lift the emergency entry18:45
fungicorvus: that does look like it'll work, but i'm curious about the other files present in /var/registry/etc/18:48
fungi(config.yml, hostname, hosts, resolv.conf)18:48
fungiwere we not relying on those to end up in /etc?18:49
openstackgerritTristan Cacqueray proposed zuul/zuul-registry master: Add support for skopeo copy  https://review.opendev.org/68680318:49
openstackgerritTristan Cacqueray proposed zuul/zuul-registry master: Add type annotations  https://review.opendev.org/68624918:49
corvusfungi: nope, those are leftover from the previous registry18:49
corvusfungi: actually, the first file is leftover. i  think the next 3 were written by docker when we ran in the bad config18:50
corvusie, they were *exported* from the container18:50
corvuswe can just delete those manually now18:50
fungigot it, that all makes perfect sense18:51
fungithanks!18:51
*** pcaruana has quit IRC18:51
*** ociuhandu has quit IRC18:51
*** ociuhandu has joined #openstack-infra18:52
*** petevg has joined #openstack-infra18:54
*** ociuhandu has quit IRC18:56
openstackgerritTim Burke proposed zuul/zuul-registry master: Rework the stream_blob/stream_object API  https://review.opendev.org/68682719:00
clarkbcorvus: on that registry mounting change is /var/registry/etc just a clouds.yaml file?19:06
clarkbah yup that is related to what fungi was asking re hosts and resolv.conf and friends19:08
fungiagreed, that's what i was expecting to find, but was confused by the other (stale) files in there19:10
*** jamesmcarthur has quit IRC19:14
clarkbcorvus: re restarting zuul scheduler do we want to do a full restart and if so should we get https://review.opendev.org/#/c/686775/ in first?19:18
*** kjackal has joined #openstack-infra19:24
corvusclarkb: does that mean we have the list of extra packages in 2 places?19:25
clarkbcorvus: yes, because puppet does the normall installation. I suppose we might be able to get away with not having it in the playbook because puppet will install the packages otherwise?19:26
corvuslooks like we didn't plumb that through to puppet-zuul19:27
corvusclarkb: it doesn't change that often, we can probably muddle through until we switch to images19:27
clarkbits in the puppet-zuul exec for zuul-manage-ansible19:27
corvusyeah, my plumb comment was meant to convey that we didn't make it an argument to puppet-zuul so that we could just have one list in system-config19:27
clarkboh ya19:28
corvusclarkb: 775 lgtm; we can add that in manually if it doesn't get in place before we're ready19:29
clarkbcorvus: sgtm, thanks19:29
*** ykarel|away has quit IRC19:31
corvusi think i'm ready to do the zuul restart now19:36
clarkbI've eaten lunch and can assist if necessary19:36
*** panda is now known as panda|off19:38
corvusclarkb: i've cherry-picked your change into a local repo on bridge19:39
corvusi think i'll wait ~5m for merges19:42
corvusthere's a bunch of 5m changes followed by 52m; that seems like a good spot19:42
openstackgerritMerged opendev/system-config master: Correct registry docker mount  https://review.opendev.org/68682319:46
*** zbr has quit IRC19:46
*** e0ne has joined #openstack-infra19:50
*** markvoelker has quit IRC19:55
corvusyay a bunch of stuff merged!19:57
corvusi'll get ready to restart now19:58
corvusrestarting20:01
corvus2019-10-04 20:01:18,706 INFO zuul.GerritConnection: Remote version is: 3.0.2-3196-g589fe6669e8e (parsed as (3, 0, 2))20:02
corvusthat's promising -- it's talking to googlesource20:02
corvus(we should probably add connection names into those log messages20:02
clarkbstatus dashboard is loading now20:08
clarkband urls work again20:08
corvusreenqueing20:08
*** e0ne has quit IRC20:09
*** lucasagomes has quit IRC20:13
corvusthat's done20:13
clarkbseems to be working according to the dashboard20:14
corvusthe executors are still stopping20:14
corvusso no jobs are actually running yet, and we haven't seen the ansible upgrade20:14
*** EmilienM is now known as EvilienM20:18
corvusclarkb: ansible update and executor start have happened20:18
*** EvilienM is now known as containerizes_hi20:19
*** containerizes_hi is now known as containerized20:19
clarkband jobs have actually started running?20:19
*** containerized is now known as EvilienM20:19
corvusclarkb: yes20:20
clarkbcorvus: double checking ansible versions they all appear up to date on ze01 now too20:26
corvus\o/20:26
openstackgerritTristan Cacqueray proposed zuul/zuul-registry master: Add support for skopeo copy  https://review.opendev.org/68680320:29
*** jamesden_ has joined #openstack-infra20:30
*** jamesdenton has quit IRC20:31
*** xenos76 has quit IRC20:32
*** jamesmcarthur has joined #openstack-infra20:33
fungiindeed, /usr/lib/zuul/ansible/2.7/bin/ansible was last modified a few minutes ago20:35
fungiso i guess the updates for those are working now20:35
fungi2.5 is still left behind there, but i suppose that's reasonable20:36
clarkbfungi: left behind?20:36
fungiwe can always manually clean up removed versions afterward if we need20:36
clarkboh did we remove support for it?20:37
fungi-rwxr-xr-x 1 root root 5863 Mar 18  2019 /usr/lib/zuul/ansible/2.5/bin/ansible20:37
fungior maybe we just didn't need to update it?20:37
clarkbthat is the current version20:37
fungii don't know if the 2.5 removal patch has merged yet20:37
fungi2.6 through 2.8 were all upgraded though, so i guess that means we're finally up to date on those20:38
corvushttps://review.opendev.org/65043120:39
fungiyeah, i guess https://review.opendev.org/650431 hasn't merged yet20:39
corvusmordred: do you have a gerrit docker image change handy we can recheck to verify the googlesource projects work?20:40
*** jamesden_ has quit IRC20:41
*** jamesdenton has joined #openstack-infra20:41
corvusmordred: maybe https://review.opendev.org/68260120:42
corvuswhich i will also review :)20:42
*** jamesmcarthur has quit IRC20:44
corvusmordred: +3 with comments20:45
*** jamesmcarthur has joined #openstack-infra20:46
corvusi'm curious about whether we want dependencies in promote... it kindof makes sense, but i also don't think it would be 100% effective in preventing errors.20:47
clarkbthe persistent firewall issue persists after the updates20:53
clarkbat least we can rule that out20:53
*** dpawlik has quit IRC20:54
*** kjackal has quit IRC20:55
*** slaweq has quit IRC20:59
corvusze03 is slowly chewing through the setup for the 2.15 job21:01
corvus  stderr: 'ssh: connect to host gerrit-review.googlesource.com port 29418: Connection timed out21:05
corvusthat's a problem21:05
corvushuh, there's no password in the zuul.conf file on the executors21:06
clarkbI don't recall if it was there before or not21:08
corvusit wouldn't have been (for the git driver)21:08
clarkbah21:08
corvusit's there on the scheduler, so it's correctly using http, but the execs are using ssh because the password is what causes it to switch21:09
openstackgerritJames E. Blair proposed opendev/system-config master: Add zuul_connection_secrets to mergers and executors  https://review.opendev.org/68684021:12
corvusclarkb, fungi, mordred: ^ i think that should fix it?21:12
corvusin the mean time, jobs with googlesource repos will fail21:13
corvusslowly21:13
*** xek_ has joined #openstack-infra21:13
fungiahh, yep, separate node entries in the global manifest21:13
fungibut same class, so all the plumbing is already there21:14
clarkbLooks like that key exists in hiera for the mergers and executors too (just double checked)21:14
corvusyeah, i added it previously assuming it would work like on the scheduler21:14
corvusi will dequeue the test change from gate since it'll block this21:15
*** xek__ has quit IRC21:16
*** prometheanfire has quit IRC21:16
corvusand maybe enqueue this one so hopefully we can fix this today21:16
*** prometheanfire has joined #openstack-infra21:17
corvus#status log removed insecure-ci-registry01 from emergency21:18
openstackstatuscorvus: finished logging21:18
corvus#status log restarted all of zuul at commit e6496faf406529b4003ce7ebaa22eb1f2fa7892921:18
openstackstatuscorvus: finished logging21:18
*** whoami-rajat has quit IRC21:18
*** derekh has joined #openstack-infra21:29
*** derekh has quit IRC21:29
*** tosky has joined #openstack-infra21:31
*** jamesmcarthur has quit IRC21:34
*** goldyfruit_ has joined #openstack-infra21:38
*** rlandy has quit IRC21:40
openstackgerritMerged opendev/system-config master: Add zuul_connection_secrets to mergers and executors  https://review.opendev.org/68684021:40
*** mriedem has quit IRC21:47
*** jbadiapa has joined #openstack-infra21:50
*** EvilienM is now known as EmilienM21:52
*** goldyfruit_ has quit IRC22:04
*** goldyfruit_ has joined #openstack-infra22:07
corvusfungi: do you have the fanny farmer baking book?22:09
corvuser fannie farmer22:11
*** slaweq has joined #openstack-infra22:11
fungii have "the boston cooking school cook book" (fannie merritt farmer, 1938 ed.)22:14
corvusit's a collection by marion cunningham, who edited the revised fannie farmer cookbook; it has both old and newer recipies in it.22:15
fungiit's seen a lot of use, but still quite intact, you just have to look past the oil stains and whatnot22:15
fungioh, *baking* book!22:15
fungino, i do not22:15
fungii should obtain, yes22:15
corvusthere's some good stuff in there :)22:15
fungimy usual go-to is the "the king arthur flour baker's companion" but it's a modern volume22:16
*** slaweq has quit IRC22:16
*** xek_ has quit IRC22:17
fungistill i love that it seems like nearly a quarter of the content in it is treatises on baking science rather than just a pile of recipes22:17
fungioh, and of course that they systematically include weights for all the ingredient quantities22:19
corvusthey have a bakery and store in vermont.  they're pretty good.  :)22:20
fungiyeah, i keep meaning to plan a visit, vt is the only state in new england i've never even passed through22:22
corvusmake a day of it and stop there on the way to ben and jerry's :)  then, i guess, hike the green mountains to burn off the calories22:24
fungisounds like an excellent itinerary22:26
corvusi'm going to run some errands while ansible does its thing and then restart the executors later22:32
fungii'm around, just waiting on this fudge to reach 115c22:34
fungi(and then to reach 40c)22:35
*** mattw4 has quit IRC22:53
*** rfolco|bbl has quit IRC22:54
fungimakes me want to calculate the specific heat and thermal inertia of fudge22:59
*** yamamoto has joined #openstack-infra23:00
*** yamamoto has quit IRC23:05
*** slaweq has joined #openstack-infra23:11
*** slaweq has quit IRC23:16
*** tosky has quit IRC23:39

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!