Thursday, 2019-06-06

*** lseki has quit IRC00:00
*** roman_g has quit IRC00:01
*** weifan has quit IRC00:03
*** markvoelker has quit IRC00:07
*** markvoelker has joined #openstack-infra00:07
*** michael-beaver has quit IRC00:08
*** diablo_rojo has quit IRC00:11
*** aaronsheffield has quit IRC00:11
*** rfarr has joined #openstack-infra00:11
*** markvoelker has quit IRC00:12
*** mattw4 has quit IRC00:15
*** mattw4 has joined #openstack-infra00:15
*** mattw4 has quit IRC00:24
openstackgerritIan Wienand proposed opendev/system-config master: [wip] backups  https://review.opendev.org/66265700:31
*** auristor has quit IRC00:33
*** rkukura has quit IRC00:38
*** auristor has joined #openstack-infra00:40
*** rfarr has quit IRC00:44
*** rcernin has joined #openstack-infra00:45
*** rcernin has quit IRC00:45
*** rcernin has joined #openstack-infra00:45
openstackgerritIan Wienand proposed opendev/system-config master: [wip] backups  https://review.opendev.org/66265700:50
openstackgerritMerged opendev/system-config master: centos-mirror-update: handle non-x86 architectures  https://review.opendev.org/66272700:57
*** bobh has quit IRC00:59
*** spsurya has joined #openstack-infra01:01
*** zhurong has quit IRC01:04
*** slaweq has quit IRC01:07
tkajinamHi. Can anybody from infra check my patch for opendev/sandbox? https://review.opendev.org/#/c/66346301:16
tkajinamI know that I can merge this by myself, but as it is related to infra configuration, I want to get some confirmation01:16
*** rcernin has quit IRC01:21
*** rcernin has joined #openstack-infra01:21
*** rkukura has joined #openstack-infra01:24
*** bobh has joined #openstack-infra01:26
ianwtkajinam: it looks fine :)  fungi i think was responsible for https://opendev.org/opendev/sandbox/commit/ab26fe63cb025b2de30b49f46fe8e8cf3c0b49e4 ; not exactly sure why that one didn't rename01:27
tkajinamianw, thanks for your check. Can I land it or is it better to wait for fungi?01:29
ianwi'd say merge away!01:30
*** dave-mccowan has joined #openstack-infra01:31
tkajinamianw, thanks for the quick response :-)01:36
*** mriedem has quit IRC01:38
*** rkukura has quit IRC01:41
fungiianw: to answer your question, it wasn't renamed during the migration, we renamed it last friday with https://review.opendev.org/65727501:43
fungibut nobody pushed a .gitreview update after that maintenance01:44
ianwahh, well that explains that :)01:44
fungiso it's been broken less than a week, at least ;)01:44
*** bobh has quit IRC01:48
*** hongbin has joined #openstack-infra01:55
*** ykarel|away has joined #openstack-infra01:57
ianwdo we know who runs mirror.lstn.net/centos/ ... it seems it has a AAAA record, but does not actually like ipv601:59
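The symptom described here (an AAAA record is published, but the host does not answer over IPv6) can be checked with a short script. The following is a minimal sketch, not something taken from the channel: the function name `ipv6_status`, its return strings, and the injectable `resolve`/`connect` parameters are assumptions made for illustration. Only standard-library `socket` calls are used.

```python
import socket


def _tcp_connect(sockaddr, timeout):
    """Open and immediately close a TCP connection over IPv6."""
    with socket.socket(socket.AF_INET6, socket.SOCK_STREAM) as sock:
        sock.settimeout(timeout)
        sock.connect(sockaddr)


def ipv6_status(host, port=80, timeout=5.0,
                resolve=socket.getaddrinfo, connect=_tcp_connect):
    """Classify a host as 'no-aaaa', 'reachable', or 'unreachable' over IPv6.

    The labels are hypothetical. Note that a resolver error can also mean
    a general DNS failure, not only a missing AAAA record.
    """
    try:
        # Restricting the family to AF_INET6 asks only for IPv6 addresses.
        infos = resolve(host, port, socket.AF_INET6, socket.SOCK_STREAM)
    except socket.gaierror:
        return "no-aaaa"
    if not infos:
        return "no-aaaa"
    for info in infos:
        sockaddr = info[4]  # (address, port, flowinfo, scope_id)
        try:
            connect(sockaddr, timeout)
        except OSError:
            continue  # try the next published address, if any
        return "reachable"
    # An address was published, but nothing answered on it.
    return "unreachable"
```

Under these assumptions, a mirror in the state described above would be expected to come back as `"unreachable"` rather than `"no-aaaa"`, which is the distinction that matters when a published address is not actually bound.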
*** weifan has joined #openstack-infra01:59
*** weifan has quit IRC02:03
*** whoami-rajat has joined #openstack-infra02:07
*** markvoelker has joined #openstack-infra02:08
*** ykarel|away has quit IRC02:09
clarkbianw: logan- iirc02:17
logan-hmm02:24
*** Goneri has joined #openstack-infra02:24
*** gfidente is now known as gfidente|afk02:26
logan-ianw clarkb: should be fixed now02:29
logan-thanks for pointing it out. i'm not sure why keepalived didn't have the vip bound, but i'll keep an eye on it for a while and make sure it doesn't disappear again.02:33
ianwlogan-: cool; i'm just finishing the manual sync https://review.opendev.org/#/c/662727/ ... once it's done with the alt-arch stuff i'll double back to be sure, but i just connected ok, thanks02:37
*** jamesmcarthur has joined #openstack-infra02:41
*** markvoelker has quit IRC02:42
*** rlandy|ruck|bbl has quit IRC02:45
*** yamamoto has joined #openstack-infra02:50
*** jamesmcarthur has quit IRC03:08
*** eernst has quit IRC03:09
openstackgerritIan Wienand proposed opendev/system-config master: Add centos/ to centos mirror path  https://review.opendev.org/66349103:24
openstackgerritIan Wienand proposed opendev/system-config master: Add centos/ to centos mirror path  https://review.opendev.org/66349103:37
ianwhrw / fungi: ^ still syncing (screen on mirror-update)03:40
*** dikonoor has joined #openstack-infra03:41
*** dave-mccowan has quit IRC03:50
fungithat'll take a while, and so will the vos release03:57
*** udesale has joined #openstack-infra04:00
*** yamamoto has quit IRC04:00
ianwyep it's into the "j" of ppc64 atm04:03
*** yamamoto has joined #openstack-infra04:06
*** ramishra has joined #openstack-infra04:11
*** hongbin has quit IRC04:11
*** ykarel|away has joined #openstack-infra04:14
openstackgerritIan Wienand proposed opendev/system-config master: Ansible roles for backup  https://review.opendev.org/66265704:27
*** ykarel|away has quit IRC04:39
*** pcaruana has joined #openstack-infra04:50
*** ykarel|away has joined #openstack-infra04:57
*** dikonoor has quit IRC04:59
*** tkajinam has quit IRC05:00
*** ykarel|away is now known as ykarel05:03
*** gregoryo has joined #openstack-infra05:05
*** dikonoor has joined #openstack-infra05:12
*** janki has joined #openstack-infra05:30
ianwhrw: look ok? http://mirror.dfw.rax.opendev.org/centos/altarch/7/05:34
openstackgerritIan Wienand proposed opendev/system-config master: bridge.opendev.org: use Ansible 2.8.0 stable  https://review.opendev.org/66266205:38
openstackgerritIan Wienand proposed opendev/system-config master: Ansible roles for backup  https://review.opendev.org/66265705:38
openstackgerritIan Wienand proposed opendev/system-config master: Update to ansible-lint 4.1.0  https://review.opendev.org/66350105:38
ianwyak shaved ...05:39
*** jtomasek has joined #openstack-infra05:45
*** janki has quit IRC05:49
*** dtantsur|afk is now known as dtantsur05:49
*** janki has joined #openstack-infra05:49
*** janki has quit IRC05:50
*** janki has joined #openstack-infra05:51
openstackgerritMerged opendev/system-config master: Add centos/ to centos mirror path  https://review.opendev.org/66349105:55
*** tkajinam has joined #openstack-infra06:00
*** lpetrut has joined #openstack-infra06:03
*** slaweq has joined #openstack-infra06:14
*** gregoryo has quit IRC06:16
ianwdmsimard: http://logs.openstack.org/57/662657/24/check/system-config-run-base-ansible-devel/5dfe1c5/job-output.txt.gz#_2019-06-06_05_44_55_61477006:16
ianwERROR! Unexpected Exception, this is probably a bug: Unable to import Django: the server dependencies can be installed with 'pip install ara[server]'06:16
ianwdunno if that is expected on master branch or not (devel job)06:16
*** xek has joined #openstack-infra06:16
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Return javascript content artifact records to Zuul  https://review.opendev.org/66305606:17
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Return python artifact records to Zuul  https://review.opendev.org/66305306:19
*** e0ne has joined #openstack-infra06:19
*** udesale has quit IRC06:21
*** udesale has joined #openstack-infra06:21
*** dpawlik has joined #openstack-infra06:22
*** raukadah is now known as chandankumar06:24
*** kopecmartin|off is now known as kopecmartin06:25
*** pgaxatte has joined #openstack-infra06:29
openstackgerritMerged zuul/zuul-jobs master: Allow download-artifact to download multiple files  https://review.opendev.org/66287606:33
*** dklyle has quit IRC06:38
*** dklyle has joined #openstack-infra06:38
openstackgerritMerged zuul/zuul-jobs master: Return javascript content artifact records to Zuul  https://review.opendev.org/66305606:39
*** jtomasek has quit IRC06:42
*** udesale has quit IRC06:49
*** udesale has joined #openstack-infra06:50
openstackgerritMerged zuul/zuul-jobs master: Return python artifact records to Zuul  https://review.opendev.org/66305306:57
*** gyee has quit IRC07:04
*** e0ne has quit IRC07:05
*** e0ne has joined #openstack-infra07:06
*** tesseract has joined #openstack-infra07:12
*** rcernin has quit IRC07:14
lpetrutmordred: Hi, we were talking yesterday about transferring the cloudbase-init repo to github.com/cloudbase. We've just deleted the existing repo and should be good to go with the transfer.07:17
*** rpittau|afk is now known as rpittau07:19
*** haleyb has quit IRC07:22
*** otherwiseguy has quit IRC07:22
*** weshay has quit IRC07:22
*** mhu has quit IRC07:22
*** mhu has joined #openstack-infra07:22
*** weshay has joined #openstack-infra07:23
*** otherwiseguy has joined #openstack-infra07:26
*** e0ne has quit IRC07:26
*** janki has quit IRC07:27
*** janki has joined #openstack-infra07:27
*** janki has quit IRC07:28
*** jpena|off is now known as jpena07:38
*** roman_g has joined #openstack-infra07:52
*** dchen has quit IRC07:56
*** ralonsoh has joined #openstack-infra07:58
*** markvoelker has joined #openstack-infra07:58
*** pkopec has joined #openstack-infra07:59
*** udesale has quit IRC08:01
*** udesale has joined #openstack-infra08:03
ianwsigh, looks like there's something up with the handlers for letsencrypt and ansible 2.8 ... so ... yeah08:03
ianwhttp://logs.openstack.org/57/662657/24/check/system-config-run-mirror/8db2d14/ if anyone feels like ansible deep dive before i get to it08:03
*** ociuhandu has joined #openstack-infra08:07
*** ociuhandu has quit IRC08:07
*** ociuhandu has joined #openstack-infra08:08
*** ykarel_ has joined #openstack-infra08:10
*** ykarel has quit IRC08:12
*** ykarel_ is now known as ykarel08:14
*** happyhemant has joined #openstack-infra08:19
*** udesale has quit IRC08:24
*** markvoelker has quit IRC08:32
*** derekh has joined #openstack-infra08:41
*** ginopc has joined #openstack-infra08:43
*** ociuhandu has quit IRC08:44
openstackgerritCarlos Goncalves proposed openstack/diskimage-builder master: WIP: Move FS_TYPE set to after elements are sourced  https://review.opendev.org/66354308:46
*** e0ne has joined #openstack-infra08:49
*** udesale has joined #openstack-infra08:51
*** lucasagomes has joined #openstack-infra08:51
*** lucasagomes has quit IRC08:52
*** lucasagomes has joined #openstack-infra08:53
openstackgerritCarlos Goncalves proposed openstack/diskimage-builder master: Move FS_TYPE set post-element source  https://review.opendev.org/66354308:56
*** gfidente|afk has quit IRC08:56
*** ykarel is now known as ykarel|lunch09:00
*** tkajinam has quit IRC09:01
*** jtomasek has joined #openstack-infra09:03
*** sshnaidm|afk is now known as sshnaidm09:05
*** priteau has joined #openstack-infra09:12
*** iokiwi has quit IRC09:17
*** jistr is now known as jistr|lnl09:28
*** markvoelker has joined #openstack-infra09:29
*** janki has joined #openstack-infra09:29
*** ramishra has quit IRC09:30
*** pkopec has quit IRC09:32
*** ykarel|lunch is now known as ykarel09:46
*** Lucas_Gray has joined #openstack-infra09:48
*** gfidente has joined #openstack-infra09:57
*** aedc has joined #openstack-infra10:00
*** ociuhandu has joined #openstack-infra10:01
*** markvoelker has quit IRC10:03
*** ramishra has joined #openstack-infra10:11
*** pkopec has joined #openstack-infra10:17
*** dikonoor has quit IRC10:17
*** priteau has quit IRC10:35
*** jpena is now known as jpena|away10:36
*** yamamoto has quit IRC10:37
*** happyhemant has quit IRC10:38
*** ginopc has quit IRC10:47
*** ginopc has joined #openstack-infra10:48
*** markvoelker has joined #openstack-infra10:59
*** yamamoto has joined #openstack-infra11:01
*** takamatsu has quit IRC11:01
*** takamatsu has joined #openstack-infra11:02
openstackgerritSorin Sbarnea proposed openstack/openstack-zuul-jobs master: Adds openstack-tox-mol job  https://review.opendev.org/66359911:02
openstackgerritSorin Sbarnea proposed openstack/openstack-zuul-jobs master: Adds openstack-tox-mol job  https://review.opendev.org/66359911:03
*** yamamoto has quit IRC11:06
slaweqinfra-root: hi, can You do me a favor and hold the nodes for the networking-ovn-tempest-slow job run on my patch https://review.opendev.org/#/c/658326/ - I'm trying to understand what is missing there, since there is no connectivity to floating IPs, while on the single node job all worked fine11:07
slaweqinfra-root: just let me know when I should recheck this job :)11:08
slaweqthx in advance11:08
*** pkopec has quit IRC11:08
*** pkopec has joined #openstack-infra11:11
*** pkopec has quit IRC11:11
*** Lucas_Gray has quit IRC11:11
*** Lucas_Gray has joined #openstack-infra11:12
*** yamamoto has joined #openstack-infra11:13
*** pkopec has joined #openstack-infra11:14
fricklerslaweq: give me a couple of minutes11:17
*** jistr|lnl is now known as jistr11:18
*** udesale has quit IRC11:23
*** pkopec has quit IRC11:26
zbrianw: clarkb fungi : please let me know if you have anything against adding this new tox job as we need it for multiple projects: https://review.opendev.org/#/c/663599/11:30
*** pkopec has joined #openstack-infra11:31
slaweqfrickler: thx a lot11:31
*** markvoelker has quit IRC11:32
openstackgerritMerged opendev/base-jobs master: Add jobs to publish artifacts to tarballs.opendev.org  https://review.opendev.org/66287711:32
openstackgerritCarlos Goncalves proposed openstack/diskimage-builder master: Move FS_TYPE set post-element source  https://review.opendev.org/66354311:34
fricklerslaweq: always welcome, can you paste me your public key again, please?11:46
fricklernodes will be 149.202.191.161 and 149.202.161.23911:47
*** aedc has quit IRC11:48
*** dikonoor has joined #openstack-infra11:59
*** rh-jelabarre has joined #openstack-infra11:59
*** rlandy has joined #openstack-infra11:59
*** lpetrut has quit IRC12:00
*** lpetrut has joined #openstack-infra12:00
*** rlandy is now known as rlandy|ruck12:00
*** ramishra_ has joined #openstack-infra12:03
slaweqfrickler: my key http://paste.openstack.org/show/752589/12:04
*** ramishra has quit IRC12:05
fricklerslaweq: o.k., added to both nodes. the nodes will only be held if the job actually fails, but you may want to take a look at it while the job is still running12:06
slaweqfrickler: yes, I will look in few minutes, thx a lot12:06
*** eharney has quit IRC12:14
*** aedc has joined #openstack-infra12:20
*** derekh has quit IRC12:21
*** trident has quit IRC12:21
*** trident has joined #openstack-infra12:26
*** markvoelker has joined #openstack-infra12:29
*** dave-mccowan has joined #openstack-infra12:29
*** Emine has joined #openstack-infra12:32
*** jchhatbar has joined #openstack-infra12:36
*** janki has quit IRC12:38
*** yamamoto has quit IRC12:44
*** jchhatbar has quit IRC12:49
*** yamamoto has joined #openstack-infra12:51
*** tdasilva has joined #openstack-infra12:51
*** aaronsheffield has joined #openstack-infra12:51
*** pcaruana has quit IRC12:52
*** yamamoto has quit IRC13:00
*** dikonoor has quit IRC13:01
AJaegerzbr: you can add that tox job to any repo - and use it from there everywhere. We share one namespace, everything is global.13:02
*** markvoelker has quit IRC13:03
*** haleyb has joined #openstack-infra13:03
*** rfarr has joined #openstack-infra13:04
*** Goneri has quit IRC13:04
*** ekultails has joined #openstack-infra13:05
*** rfarr_ has joined #openstack-infra13:05
zbrAJaeger: i am not sure it would make sense that this would work. i am concerned that if I add it to, let's say, tripleo-common, others from outside tripleo would try to use it. 2nd is that once it is added to a specific repo, if someone decides to change it, it may break unrelated projects. With a centralized location I am sure more attention would be paid if someone wants to modify it. What do you think?13:05
*** derekh has joined #openstack-infra13:06
zbradding openstack-tox-mol does not block me from adding this job in multiple repos, but wanted to avoid duplication.13:06
zbrthe trick is that this requires docker installation as you probably observed with the pre-run.yaml13:06
AJaegerzbr: I read your request - and commit message as: It will not work if it does not live in that repo. And I was replying to that technical argument. I agree that there are process issues...13:07
*** rfarr has quit IRC13:08
*** aakarsh has quit IRC13:10
zbrAJaeger: i didn't create a depends-on change as I was not sure if people would appreciate it or not, but I do have at least 4 different repositories that could use it, all of them happening to be tripleo*, but there is nothing specific to tripleo about it, could apply to any repo with some ansible roles inside.13:10
AJaegerzbr: on the other hand having it in a central location, means that people review the change that have no clue about molecule while having it in tripleo-common you have tight coupling ;)13:10
AJaegerzbr: I asked for a test  in just *one* repo to show that the job is working as expected.13:11
zbrAJaeger: sure, doing it right away.13:11
*** bnemec has joined #openstack-infra13:11
AJaegerzbr: note I'm not objecting to add the job - I just need more information ;)13:13
*** ykarel is now known as ykarel|afk13:18
*** Goneri has joined #openstack-infra13:22
*** mriedem has joined #openstack-infra13:23
*** hwoarang has quit IRC13:24
*** hwoarang has joined #openstack-infra13:25
*** tdasilva has quit IRC13:28
*** mloza has joined #openstack-infra13:29
*** udesale has joined #openstack-infra13:29
openstackgerritSorin Sbarnea proposed openstack/openstack-zuul-jobs master: Adds openstack-tox-mol job and template  https://review.opendev.org/66359913:30
slaweqfrickler: I hope I found what was missing there so I pushed new PS and probably those nodes which You hold are already destroyed but if not, You can destroy them13:35
slaweqthx a lot for help13:35
*** tdasilva has joined #openstack-infra13:38
fricklerslaweq: oh, you pushed the new ps before the job failed. yeah, that deleted those nodes, but also causes the hold request still to be in place, so you will get the next set of nodes held, too ;)13:39
*** aakarsh has joined #openstack-infra13:39
*** yamamoto has joined #openstack-infra13:39
slaweqfrickler: ok, good to know13:42
slaweqif that job is still failing, I will be able to check again13:43
slaweqif not, I will say here to unhold those nodes then13:43
fricklerslaweq: well, if the job passes, the nodes won't be held, either. so we'll have a stale hold request until the next zuul reboot, which has happened before, so it isn't a real issue. also Shrews has some patches up that will hopefully allow us to also delete no-longer-needed hold requests again ;)13:45
*** whoami-rajat has quit IRC13:47
*** jaosorior has joined #openstack-infra13:47
*** ramishra_ is now known as ramishra13:47
*** jaosorior has quit IRC13:51
*** jpena|away is now known as jpena13:53
*** whoami-rajat has joined #openstack-infra13:57
*** ricolin has joined #openstack-infra13:57
*** markvoelker has joined #openstack-infra13:59
*** Lucas_Gray has quit IRC14:01
*** kjackal_v2 has joined #openstack-infra14:04
*** kjackal has quit IRC14:05
*** jamesmcarthur has joined #openstack-infra14:05
*** yamamoto has quit IRC14:10
*** dikonoor has joined #openstack-infra14:14
*** jcoufal has joined #openstack-infra14:15
openstackgerritBen Nemec proposed openstack/reviewstats master: Load subproject data from governance  https://review.opendev.org/65302414:15
openstackgerritBen Nemec proposed openstack/reviewstats master: Switch to stestr  https://review.opendev.org/65550614:15
*** lpetrut has quit IRC14:16
*** jcoufal_ has joined #openstack-infra14:20
*** jcoufal has quit IRC14:22
*** Lucas_Gray has joined #openstack-infra14:22
mordredfungi, corvus: +A'd the opendev-tarballs patches outstanding. they made me think - should we tee up making an opendev.org AFS cell at some point?14:24
corvusmordred: theoretically, but it's both difficult and hidden enough that i think we can defer that for a while14:25
mordredcorvus: yeah - I think "at some point" can be a decent ways out into the future14:25
*** ykarel|afk is now known as ykarel14:25
clarkbya I pondered that when upgrading servers and agree on hidden and difficult14:26
corvusi'm going to wip the final system-config change so we don't add the vhost until we can confirm the LE certs are in place14:27
mordredcorvus: ah - good point14:28
openstackgerritBen Nemec proposed openstack/reviewstats master: Load subproject data from governance  https://review.opendev.org/65302414:30
openstackgerritBen Nemec proposed openstack/reviewstats master: Switch to stestr  https://review.opendev.org/65550614:30
*** jcoufal_ has quit IRC14:31
*** jcoufal has joined #openstack-infra14:31
*** markvoelker has quit IRC14:33
*** priteau has joined #openstack-infra14:36
*** jaosorior has joined #openstack-infra14:37
*** iurygregory has quit IRC14:38
*** jaosorior has quit IRC14:39
clarkbre LE ianw found an issue with our le stuff and ansible 2.8 in scrollback14:41
*** jaosorior has joined #openstack-infra14:41
clarkbthe upgrade hasnt merged yet so doesnt affect this but testing of the upgrade has a sad14:42
*** xek_ has joined #openstack-infra14:42
*** xek has quit IRC14:43
*** rpittau is now known as rpittau|brb14:45
*** yamamoto has joined #openstack-infra14:47
openstackgerritLon Hohberger proposed openstack/diskimage-builder master: Use architecture-specific grub2 RPMs on RHEL8  https://review.opendev.org/66369314:50
*** lseki has joined #openstack-infra14:53
openstackgerritMerged opendev/system-config master: Get an LE cert for tarballs.opendev.org  https://review.opendev.org/66342814:53
Shrewsfrickler: you can delete no longer needed hold requests now. just set count to 014:54
*** jaosorior has quit IRC14:55
*** yamamoto has quit IRC14:55
*** jaosorior has joined #openstack-infra14:57
fricklerShrews: oh, cool, seems I missed that update14:59
clarkbit seems that http://logs.openstack.org/57/662657/24/check/system-config-run-mirror/8db2d14/job-output.txt.gz#_2019-06-06_06_42_03_692428 shows ansible 2.8 crashing after running the restart apache handler?15:00
clarkbpabelanger: dmsimard do you know of any known issues in 2.8 with handlers?15:01
dmsimardhave not personally come across a particular issue yet15:01
asettleoh clarkb I lost my logs - did you see my ping about the rackspace server?15:03
dmsimardclarkb: "ERROR! The requested handler 'letsencrypt updated mirror01-region-provider-opendev-org-main' was not found in either the main handlers list nor in the listening handlers list"15:04
clarkbdmsimard: why does it report no failures at the play recap then?15:04
dmsimardclarkb: nested playbooks ? I found it in http://logs.openstack.org/57/662657/24/check/system-config-run-mirror/8db2d14/ara-report/result/4776b797-5bfb-4166-acdf-0daf01c23624/ which runs a few of them15:05
clarkbasettle: much of our control plane still runs on rax, though we are becoming more and more multicloud for the control plane15:06
asettleGotcha. Is that of concern since RAX isn't that integrated with stack anymore?15:06
dmsimardplaybooks/service-mirror.yaml looks fine, the error is for playbooks/service-letsencrypt.yaml15:06
clarkbasettle: moving the web server is straightforward as all the data lives on afs (which can be accessed globally). Moving the filesystems is more difficult15:06
asettleAh okay15:06
fricklerclarkb: the play recap you pointed to was for bridge. the complete recap is here http://logs.openstack.org/57/662657/24/check/system-config-run-mirror/8db2d14/job-output.txt.gz#_2019-06-06_06_42_03_76792915:07
clarkbasettle: It is of some concern, but they haven't indicated we need to evacuate and we are trying to distribute the load across other servers. However we don't have volunteers to host everything nor the manpower to make such a switch15:07
asettleOh15:08
asettleThat... that's the real problem15:08
clarkbbasically what we've been doing is trying to make it a rax + others control plane rather than a non-rax control plane. Distribute the load and have redundancy etc15:08
*** pkopec_ has joined #openstack-infra15:10
jamesmcarthurfungi, rpioso: We are now reading directly from driverlog: https://www.openstack.org/marketplace/drivers/15:10
*** pkopec has quit IRC15:10
jamesmcarthurI think you'll see many previously missing drivers there. This list will update nightly.15:10
*** chandankumar is now known as raukadah15:10
jamesmcarthurI've confirmed the missing Ironic drivers are there, plus the ones from this review: https://review.opendev.org/#/c/661407/15:11
evrardjppabelanger: I am whipping votes on https://review.opendev.org/#/c/662817/ could you give me your opinion?15:13
pabelangerevrardjp: seems fine to me, +2, but cannot watch it merge sadly15:13
pabelangermaybe corvus or clarkb can +A15:13
evrardjppabelanger: thanks :)15:14
evrardjpI guess I will see with them! I am glad the approach doesn't sound too bad15:14
pabelangerclarkb: speaking of 2.8, we should consider doing a round of executor restarts to pick up new 2.8 support in zuul. I don't believe we have done that yet15:14
*** gyee has joined #openstack-infra15:16
clarkbdmsimard: ok I've confirmed that handler is in that role's handlers main.yaml https://opendev.org/opendev/system-config/src/branch/master/playbooks/roles/letsencrypt-create-certs/handlers/main.yaml#L40-L4115:17
clarkbhttps://opendev.org/opendev/system-config/src/branch/master/playbooks/roles/letsencrypt-create-certs/tasks/acme.yaml#L5 is that task in the same role15:18
mnaserinfra-root: there is a lot of old glance images that have not been deleted by nodepool in sjc1.  can i clean those up?15:18
mnasertc15:19
clarkbmnaser: yes or we can. Shrews ^ I guess we are still leaking images. I think in the past we thought that may be due to sdk saying the deletion was a success without it actually being one?15:19
*** jeremyfreudberg has joined #openstack-infra15:19
Shrewsi dunno. i don't have time to look at that now15:19
jeremyfreudberghi infra, I did a patch a while ago to update the description of some Sahara repos. https://review.opendev.org/#/c/655527/ . i just noticed that the change took effect only on github, and not on opendev.org15:20
mnaserclarkb: if ya can do them now, that'd be great, if not, i can do some basic introspection on whats being used right now (or the 2 newest ones)15:20
jeremyfreudbergcompare https://opendev.org/openstack/sahara and https://github.com/openstack/sahara15:20
clarkbit's been a chronic issue over time: we thought it may be due to gearman based rpc, then it persisted when we switched to zk, then we thought it could be related to shade, and I thought we had fixed it, but I guess it still persists15:20
clarkbmnaser: ya I can take a look in a few minutes15:20
clarkbjeremyfreudberg: we don't update opendev.org unless it is repo creation or a manual trigger as it takes many hours to do a full reconciliation15:21
clarkbthis is due to ansible task spinup cost so we are going to rewrite those ansible tasks into a python script (which is how github updates)15:22
jeremyfreudbergclarkb: got it, thanks!15:23
clarkbmordred: we have control plane image in vexxhost sjc1 now btw15:23
clarkbour test node images don't seem to be updating as expected though15:24
clarkbthis means I/we can spin up a new gitea0615:24
clarkboh should double check the ext4 journal size first15:25
*** rpittau|brb is now known as rpittau15:26
*** jeremyfreudberg has quit IRC15:26
fricklerianw: from https://docs.ansible.com/ansible/latest/porting_guides/porting_guide_2.8.html#imports-as-handlers I'd say you need to move from import_tasks to include_tasks here https://opendev.org/opendev/system-config/src/branch/master/playbooks/roles/letsencrypt-create-certs/handlers/main.yaml15:27
fricklerdmsimard: clarkb: ^^15:27
clarkbfrickler: thanks!15:29
openstackgerritMerged zuul/zuul-jobs master: Explicitly store date facts for promote  https://review.opendev.org/66281715:29
clarkbmnaser: they are failing to delete15:29
mnaserclarkb: all of them or some of them?15:29
mnaserusually if it fails to delete, that's because there is a volume booted using it15:29
clarkball of the ones I have attempted so far (just our leaked centos-7 images)15:29
clarkbI'll get a paste for you15:29
*** markvoelker has joined #openstack-infra15:29
clarkboh wait15:30
clarkbI didn't specify a region15:30
clarkbwhy is it working at all then15:30
clarkbtrying again with explicit region set15:30
*** iurygregory has joined #openstack-infra15:31
fricklerclarkb: mnaser: there were centos-7 nodes stuck in ERROR yesterday or so, maybe that's blocking the deletion15:31
clarkbok that makes it happier. mordred ^ any idea why some clouds yell at you when you don't set a region and others don't? seems like that should be consistent15:31
mnaseri'd bet some of those are because of the quota issue :\15:31
fricklerand with that, /me goes home15:32
clarkbfrickler: good night and thanks for the find on that ansible item15:32
*** ykarel is now known as ykarel|away15:36
*** AJaeger has quit IRC15:38
*** pgaxatte has quit IRC15:41
openstackgerritFabien Boucher proposed zuul/zuul master: A reporter for Elasticsearch  https://review.opendev.org/64492715:44
*** e0ne has quit IRC15:45
openstackgerritFabien Boucher proposed zuul/zuul master: A reporter for Elasticsearch  https://review.opendev.org/64492715:46
*** pkopec__ has joined #openstack-infra15:46
*** rpittau is now known as rpittau|afk15:46
*** pkopec__ has quit IRC15:47
*** pkopec_ has quit IRC15:49
*** jaosorior has quit IRC15:50
clarkbthe reason our image builds have stopped functioning is we ran out of disk15:51
clarkbas soon as my oneliner is done removing sjc1 images I'll see if I can't delete some images on nb01 to get some disk back15:53
*** diablo_rojo has joined #openstack-infra15:54
clarkbmnaser: can you check your side of things? I think I got all images but those nodepool has records for and those glance says are used15:58
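The selection described here (remove everything except the images nodepool has records for and the images glance reports as in use) is a plain set difference. The sketch below illustrates only that logic. The function name `find_leaked_images` and the example image names are hypothetical, and it does not call any real nodepool or glance API. How the three input lists are gathered is left out on purpose.

```python
def find_leaked_images(cloud_images, known_to_nodepool, in_use):
    """Return the images that are safe candidates for cleanup.

    cloud_images:      every image name the cloud currently lists
    known_to_nodepool: image names nodepool still has records for
    in_use:            image names the cloud reports as still being used
    """
    leaked = set(cloud_images) - set(known_to_nodepool) - set(in_use)
    # Sorted so the result is stable and easy to review before deleting.
    return sorted(leaked)


# Hypothetical example data, not real image names from any cloud.
candidates = find_leaked_images(
    cloud_images=["centos-7-0001", "centos-7-0002", "centos-7-0003"],
    known_to_nodepool=["centos-7-0003"],
    in_use=["centos-7-0002"],
)
print(candidates)
```

Keeping the computation separate from the deletion step makes it possible to review the candidate list by hand first, which matters because an image backing a booted volume will refuse deletion anyway.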
*** kjackal_v2 has quit IRC16:00
mnaserclarkb: thanks, it looks much better.. we can look at the other extras later16:00
*** rkukura has joined #openstack-infra16:01
*** piotrowskim has quit IRC16:01
*** anteaya has joined #openstack-infra16:02
*** markvoelker has quit IRC16:02
*** ykarel|away has quit IRC16:02
clarkbmnaser: I'm asking nodepool to delete the extras it knows about (these aren't leaked but appear to have been overlapped uploads)16:03
clarkbonce that is done we should be left with only the images glance says are in use16:03
*** tesseract has quit IRC16:03
clarkbok those are queued up, just a matter of deletions happening now16:05
clarkband now to see if there is disk that can be freed on nb01 and nb0216:05
*** whoami-rajat has quit IRC16:07
*** lucasagomes has quit IRC16:07
*** Lucas_Gray has quit IRC16:12
*** whoami-rajat has joined #openstack-infra16:12
fungisorry i've been on silent running today, it's been meetings most of my morning and i need to go run errands now but will hopefully finish catching up and pitch in as soon as i get back16:13
clarkbok I only got about 20GB cleared on nb01 by cleaning up /opt/nodepool_dib16:18
clarkbI'm running a du -hs * in /opt to see where else I can clear things out. If the cache is quite large I think I'll stop the builder, delete the cache, reboot to clear mounts, then let it rebuild its cache16:19
*** rkukura has quit IRC16:19
clarkbit's possible our cache is leaky as we change what we need to cache over time so that may clear out a bit of space16:19
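A per-directory survey like the `du -hs *` run mentioned above can be approximated in a few lines. This is a rough sketch only: the function name `dir_sizes` is made up for illustration, and it sums apparent file sizes (`st_size`), whereas `du` reports allocated disk blocks, so the numbers will not match `du` exactly.

```python
import os


def dir_sizes(root):
    """Return {entry_name: total_bytes} for each top-level entry in root."""
    sizes = {}
    with os.scandir(root) as entries:
        for entry in entries:
            if entry.is_dir(follow_symlinks=False):
                total = 0
                # os.walk does not follow symlinked directories by default.
                for dirpath, _dirnames, filenames in os.walk(entry.path):
                    for name in filenames:
                        try:
                            total += os.lstat(os.path.join(dirpath, name)).st_size
                        except OSError:
                            pass  # file vanished or is unreadable; skip it
            else:
                total = entry.stat(follow_symlinks=False).st_size
            sizes[entry.name] = total
    return sizes
```

Sorting the result by value, for example `sorted(dir_sizes("/opt").items(), key=lambda kv: kv[1], reverse=True)`, puts the largest consumers first.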
clarkboh and clear out dib_tmp16:21
clarkbmordred: you have a mordred dir on nb01:/opt/ can we delete that? looks like you used it to build and stash your control plane images from before16:22
clarkbmordred: or maybe I can move it into your homedir which is on a different device?16:22
clarkbya I'm gonna move your /opt/mordred dir into /home/mordred/opt/mordred16:25
*** tjgresha has joined #openstack-infra16:25
*** dikonoor has quit IRC16:27
*** gfidente has quit IRC16:28
clarkbsimilarly I'm going to move /opt/ianw to /home/ianw/opt/ianw16:28
*** rkukura has joined #openstack-infra16:30
clarkboh wow dib_tmp is half our disk? we must not actually be deleting things because they are mounted?16:30
clarkbso ya I think the cleanup here is disable builder in systemctl, reboot (clears out mounts), delete dib_tmp, enable builder, reboot16:30
*** hwoarang has quit IRC16:31
*** hwoarang has joined #openstack-infra16:32
clarkbI am in the delete dib_tmp contents step16:35
clarkbianw: any thoughts on how we can make dib more resilient to leaking this stuff?16:35
clarkbwe must have many many really small files as this rm is not fast16:40
clarkbone idea is we could have nodepool-builder clear whatever files it owns in the specified tempdir on startup16:43
clarkbmight make for the occasional slow startup but should help to keep things in check16:43
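The startup cleanup idea floated above could look roughly like the following. This is a minimal sketch, not nodepool-builder code: the function name `clear_owned_entries` and its arguments are hypothetical, and it only removes top-level entries owned by the given user. It would not unmount anything still mounted beneath the directory, which appears to have been the underlying problem here.

```python
import os
import shutil


def clear_owned_entries(tmpdir, uid=None):
    """Remove top-level entries in tmpdir owned by uid (default: current user).

    Hypothetical helper for illustration; returns the sorted names it removed.
    """
    if uid is None:
        uid = os.getuid()
    # Snapshot the listing first so we do not delete while iterating.
    with os.scandir(tmpdir) as it:
        entries = list(it)
    removed = []
    for entry in entries:
        # Do not follow symlinks, so ownership is checked on the link itself.
        if entry.stat(follow_symlinks=False).st_uid != uid:
            continue
        if entry.is_dir(follow_symlinks=False):
            shutil.rmtree(entry.path, ignore_errors=True)
        else:
            os.unlink(entry.path)
        removed.append(entry.name)
    return sorted(removed)
```

As noted in the discussion, a directory holding many small files would make this slow, so a builder calling something like it at startup would occasionally take a while to come up.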
*** xek_ has quit IRC16:45
*** mgoddard has quit IRC16:48
*** mgoddard has joined #openstack-infra16:50
clarkbnb01 is done and has a lot of disk now. Doing the same on nb0216:50
*** Emine has quit IRC16:50
mordredclarkb: sorry - was on the phone - I do not need that /opt/mordred anymore16:54
*** ricolin has quit IRC16:54
clarkbmordred: it is in your homedir so feel free to clean up as you wish :)16:55
*** hwoarang has quit IRC16:56
clarkbnb02 is now done too16:56
clarkbboth builders have a ton of disk and are each building an image now. Note I did not end up deleting the cache as it was only 35GB or so16:56
clarkbclearing dib_tmp was the ticket16:56
*** weifan has joined #openstack-infra16:57
*** hwoarang has joined #openstack-infra16:57
openstackgerritDavid Shrewsbury proposed zuul/zuul master: WIP: Add caching of autohold requests  https://review.opendev.org/66341216:58
*** markvoelker has joined #openstack-infra16:59
*** derekh has quit IRC17:00
openstackgerritClark Boylan proposed opendev/system-config master: bridge.opendev.org: use Ansible 2.8.0 stable  https://review.opendev.org/66266217:01
clarkbianw: ^ went ahead and updated that for you to reduce rtt on test results17:01
*** mrhillsman is now known as openlab17:02
clarkbinfra-root https://review.opendev.org/#/c/663146/ which adds db backups to gitea servers is +1 from zuul now17:03
clarkbI'm gonna pop out for a bike ride but will be back to check that the cleaned up builders are successfully building now and I want to do scan/cleanup of leaked images in clouds that are not vexxhost17:03
*** dpawlik has quit IRC17:04
*** dpawlik has joined #openstack-infra17:04
*** openlab is now known as codebauss17:05
*** dtantsur is now known as dtantsur|afk17:06
openstackgerritClark Boylan proposed openstack/project-config master: Increase the size of ext4 journal on control plane images  https://review.opendev.org/66372617:06
clarkbinfra-root I think we want ^ before we rebuild gitea0617:06
clarkband now I'm really popping out. Back in a bit17:07
mordredclarkb: yay control plane images!17:07
*** dpawlik has quit IRC17:10
*** ginopc has quit IRC17:10
*** codebauss is now known as openlab17:13
*** openlab is now known as codebauss17:14
*** codebauss is now known as openlab17:15
*** bobh has joined #openstack-infra17:15
*** openlab is now known as codebauss17:16
*** kopecmartin is now known as kopecmartin|off17:17
*** jtomasek has quit IRC17:19
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Store autohold requests in zookeeper  https://review.opendev.org/66111417:19
openstackgerritDavid Shrewsbury proposed zuul/zuul master: WIP: Add caching of autohold requests  https://review.opendev.org/66341217:19
*** yolanda__ has joined #openstack-infra17:20
*** mattw4 has joined #openstack-infra17:22
*** bobh has quit IRC17:22
*** yolanda has quit IRC17:23
*** jpena is now known as jpena|off17:23
*** codebauss is now known as mrhillsman17:24
*** dpawlik has joined #openstack-infra17:26
*** dpawlik has quit IRC17:31
*** markvoelker has quit IRC17:32
*** aedc has quit IRC17:32
*** aedc has joined #openstack-infra17:33
*** jamesmcarthur has quit IRC17:33
*** aedc has quit IRC17:37
openstackgerritMerged openstack/project-config master: Increase the size of ext4 journal on control plane images  https://review.opendev.org/66372617:39
*** ociuhandu_ has joined #openstack-infra17:39
*** spsurya has quit IRC17:40
*** udesale has quit IRC17:42
*** dpawlik has joined #openstack-infra17:42
*** ociuhandu has quit IRC17:42
*** ociuhandu_ has quit IRC17:43
*** dpawlik has quit IRC17:46
*** tdasilva has quit IRC17:48
corvusclarkb, mordred, ianw, fungi: there's a LE cert on files02 for tarballs.opendev.org :)17:49
corvusi will re-enqueue my deployment change once my schedule clears17:50
*** Vadmacs has joined #openstack-infra17:53
mnaserclarkb: thanks, it looks much better.. we can look at the other extras later17:56
mnaseroops, sorry, accidental arrow up and enter.17:56
*** priteau has quit IRC17:56
fungicorvus: excellent!17:57
fungido we know yet why the apache restart handler crashes ansible 2.8?17:58
*** panda has quit IRC17:59
rpiosojamesmcarthur: Well done! The change you made is a nice improvement. And it's good to see all of those previously missing in action drivers. Thank you very much :-)17:59
*** lennyb has quit IRC18:00
*** panda has joined #openstack-infra18:01
*** michael-beaver has joined #openstack-infra18:02
*** mriedem is now known as mriedem_lunch18:02
*** mgoddard has quit IRC18:03
*** mgoddard has joined #openstack-infra18:03
*** ralonsoh has quit IRC18:04
*** roman_g has quit IRC18:04
*** Goneri has quit IRC18:05
*** sshnaidm is now known as sshnaidm|off18:06
clarkbfungi: yes https://review.opendev.org/662662 updated to fix it18:09
clarkbhas a link to the docs too18:09
fungiawesome18:11
fungiseems i skimmed scrollback too quickly18:11
clarkbnb02 successfully built a bionic image ~6 minutes ago18:11
clarkbnb01 is in the middle of a centos-7 build18:12
fungiand looks like 662662 is failing all the run jobs besides run-base18:13
clarkbpossible that include_tasks there isn't the drop in replacement I expected18:14
fungi"Could not find or access '/home/zuul/src/opendev.org/opendev/system-config/playbooks/touch_file.yaml' on the Ansible Controller."18:15
openstackgerritPaul Belanger proposed zuul/nodepool master: Toggle host-key-checking for openstack provider.labels  https://review.opendev.org/66337818:16
fungii *think* that's the error, but hard to pick out of the nested ara report18:16
fungioh, right, we have a second layer ara in there18:16
clarkbya that's one of the handlers that got renamed from import_tasks to include_tasks18:16
clarkbthe path should be local to the roles/foo/handlers dir18:16
fungibut yep http://logs.openstack.org/62/662662/8/check/system-config-run-letsencrypt/ac4c99a/bridge.openstack.org/ara-report/result/dd3ee162-9245-469b-adbc-3b69ffb4adcc/18:17
*** Goneri has joined #openstack-infra18:17
clarkbthe docs say nothing about pathing for that :/18:17
clarkbhttps://github.com/ansible/ansible/issues/33784 says this should've been fixed in 2.518:20
clarkbnew regression possibly?18:20
clarkbdmsimard: pabelanger ^ is there a canonical way to do this? I guess provide the entire relative path relative to the playbook dir?18:24
* clarkb updates to see if that works18:24
*** aedc has joined #openstack-infra18:24
openstackgerritClark Boylan proposed opendev/system-config master: bridge.opendev.org: use Ansible 2.8.0 stable  https://review.opendev.org/66266218:25
*** e0ne has joined #openstack-infra18:26
*** mattw4 has quit IRC18:26
*** mattw4 has joined #openstack-infra18:27
*** markvoelker has joined #openstack-infra18:29
clarkbnb01 did successfully build that centos-7 image18:30
clarkbI've queued up a control plane image rebuild as the ext4 journal update seems to be in place18:31
*** e0ne has quit IRC18:31
*** e0ne has joined #openstack-infra18:32
pabelangerclarkb: I've never really used include_tasks in a handler, would have to debug more18:36
pabelangerpossible you need to add something like: include_tasks: ../tasks/foo.yaml18:36
pabelangeras it might be trying to load in the handlers directory18:37
*** dpawlik has joined #openstack-infra18:37
*** e0ne has quit IRC18:38
openstackgerritDavid Shrewsbury proposed zuul/zuul master: WIP: Add caching of autohold requests  https://review.opendev.org/66341218:39
*** dpawlik has quit IRC18:42
openstackgerritBen Nemec proposed openstack/pbr master: DNM: Debug wsgi ci failures  https://review.opendev.org/66374518:42
clarkbcan I tempt anyone with reviews for gitea db backups https://review.opendev.org/66314618:46
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Add autohold-info CLI command  https://review.opendev.org/66248718:48
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Record held node IDs with autohold request  https://review.opendev.org/66249818:48
fungiclarkb: yup! you certainly can18:49
fungiand then i have some changes i need to update18:49
fungii should get some dnm changes up to use base-test so we can exercise the fetch-zuul-cloner removal today too18:50
clarkb++18:51
*** mattw4 has quit IRC18:53
*** kjackal has joined #openstack-infra18:53
*** mattw4 has joined #openstack-infra18:53
*** dpawlik has joined #openstack-infra18:53
*** yamamoto has joined #openstack-infra18:53
*** jamesmcarthur has joined #openstack-infra18:56
*** mattw4 has quit IRC18:57
*** yamamoto has quit IRC18:58
*** dpawlik has quit IRC18:58
*** markvoelker has quit IRC19:02
openstackgerritBen Nemec proposed openstack/pbr master: DNM: Debug wsgi ci failures  https://review.opendev.org/66374519:03
*** Vadmacs has quit IRC19:03
openstackgerritClark Boylan proposed zuul/zuul master: Update axios version and yarn.lock  https://review.opendev.org/66231619:06
*** rkukura has quit IRC19:07
fungiclarkb: small question on 663146 but otherwise lgtm19:09
clarkbfungi: responded19:12
*** jamesmcarthur has quit IRC19:12
corvusi also responded with very similar words19:13
*** mriedem_lunch is now known as mriedem19:13
*** jamesmcarthur has joined #openstack-infra19:14
fungithanks!19:14
clarkbhopefully the way I folded the string makes it easier to read that too19:15
clarkbhttps://review.opendev.org/#/c/662662/ passes testing after I updated the paths to be relative to the playbooks dir19:16
clarkbthat is the ansible 2.8 change19:16
*** e0ne has joined #openstack-infra19:17
*** owalsh has quit IRC19:18
*** bnemec has quit IRC19:23
*** d34dh0r53 has quit IRC19:25
*** zbr has quit IRC19:25
*** cloudnull has quit IRC19:25
*** Goneri has quit IRC19:25
*** bnemec has joined #openstack-infra19:25
*** cloudnull has joined #openstack-infra19:30
*** AJaeger has joined #openstack-infra19:31
*** bnemec has quit IRC19:31
AJaegermordred, fungi, do we need a variant of https://review.opendev.org/#/c/535836 for bindep removal? Or can it be abandoned?19:31
*** d34dh0r53 has joined #openstack-infra19:31
fungiAJaeger: i think clarkb has sufficiently convinced me that it already works that way19:34
*** owalsh has joined #openstack-infra19:34
*** bnemec has joined #openstack-infra19:34
mordredwant me to abandon?19:34
fungiAJaeger: oh, my mistake, it already avoids *running* bindep, but 535836 avoids installing it when it's not going to be used19:34
fungii don't think that one is urgent19:35
fungiit just improves performance in jobs slightly19:35
mordredah - yeah19:35
fungiit does still seem useful19:36
AJaegermordred: it's ancient - want to update?19:36
openstackgerritMerged zuul/nodepool master: Toggle host-key-checking for openstack provider.labels  https://review.opendev.org/66337819:36
AJaegermordred: you have quite a few open changes that are stale - want to cleanup that list one time? ;)19:37
openstackgerritMerged opendev/system-config master: Add db backups to gitea  https://review.opendev.org/66314619:38
AJaegerfungi: want to abandon https://review.opendev.org/461878  - flatten namespace infra-specs change...19:38
fungioh, yep, that has recently become redundant. thanks!19:39
*** Goneri has joined #openstack-infra19:39
*** ekultails has quit IRC19:40
AJaeger;)19:41
*** bnemec has quit IRC19:41
*** AJaeger has quit IRC19:42
*** bnemec has joined #openstack-infra19:42
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Add caching of autohold requests  https://review.opendev.org/66341219:43
*** ekultails has joined #openstack-infra19:44
*** Goneri has quit IRC19:47
*** rkukura has joined #openstack-infra19:50
openstackgerritGeorge Dimitrov proposed opendev/git-review master: Fix parsing of review over http  https://review.opendev.org/64148419:52
*** imacdonn has quit IRC19:53
*** imacdonn has joined #openstack-infra19:54
*** jamesmcarthur has quit IRC19:55
*** jamesmcarthur has joined #openstack-infra19:56
*** markvoelker has joined #openstack-infra19:59
*** Goneri has joined #openstack-infra20:00
*** rlandy|ruck is now known as rlandy|ruck|brb20:02
openstackgerritMerged opendev/system-config master: Add tarballs.opendev.org vhost  https://review.opendev.org/66343120:03
*** eernst has joined #openstack-infra20:04
*** panda has quit IRC20:04
*** e0ne has quit IRC20:05
*** eernst has quit IRC20:05
clarkboh also if anyone is wondering the gitea backups are fairly small < 10MB once compressed20:05
*** panda has joined #openstack-infra20:05
openstackgerritBen Nemec proposed openstack/pbr master: DNM: Debug wsgi ci failures  https://review.opendev.org/66374520:06
mordredclarkb: yay20:06
*** mattw4 has joined #openstack-infra20:07
*** ccamacho has quit IRC20:08
openstackgerritBen Nemec proposed openstack/pbr master: Make WSGI tests listen on localhost  https://review.opendev.org/66375820:11
corvuslooks like the merge just missed the ansipuppet cycle20:13
*** bnemec has quit IRC20:16
*** rlandy|ruck|brb is now known as rlandy|ruck20:17
fungiclarkb: makes sense, it's almost all metadata i'm sure20:17
*** bnemec has joined #openstack-infra20:17
openstackgerritDavid Shrewsbury proposed zuul/zuul master: WIP: Auto-delete expired autohold requests  https://review.opendev.org/66376220:20
*** tjgresha has quit IRC20:21
openstackgerritBen Nemec proposed openstack/pbr master: DNM: Debug wsgi ci failures  https://review.opendev.org/66374520:23
*** jamesmcarthur has quit IRC20:24
fungiinfra-root: from fanatical support, "This message is to inform you that our monitoring systems have detected a problem with the server which hosts your Cloud Block Storage device 'afs02.dfw.openstack.org/main02' at 19:50 UTC. We are currently investigating the issue and will update you as soon as we have additional information regarding the alert. Please do not access or modify20:26
fungi'afs02.dfw.openstack.org/main02' during this process."20:26
*** jamesmcarthur has joined #openstack-infra20:27
*** jamesmcarthur_ has joined #openstack-infra20:29
fungilooks like afs started panicking on the server as of 17:01:35z20:29
fungicurrently complaining about i/o errors on xvdc20:29
*** jamesmcarthur has quit IRC20:31
*** mattw4 has quit IRC20:31
fungiper that message, i guess we should avoid rebooting it just yet20:31
*** mattw4 has joined #openstack-infra20:31
*** markvoelker has quit IRC20:32
clarkbiirc RW is on afs01.dfw across the board anyway20:33
corvusyeah, i think 02 is all RO replicas20:47
fungiokay, so after we get the all-clear from fanatical support we can just reboot it20:50
fungiand presumably make sure it resyncs correctly (however long that takes)20:50
*** jcoufal has quit IRC20:50
fungi"This message is to inform you that your Cloud Block Storage device afs02.dfw.openstack.org/main02 has been returned to service."20:53
fungiguess i'll reboot it now and see what happens20:53
corvusi'm working on fixing files.o.o20:54
fungiwhat happened to files.o.o?20:54
corvuschange merged20:54
fungioh, that20:54
*** dpawlik has joined #openstack-infra20:54
corvusugh, the vhost files are completely wrong20:54
fungi#status log rebooted afs02.dfw.openstack.org following a cinder volume outage for xvdc20:56
openstackstatusfungi: finished logging20:56
fungiit's not come up yet, likely fsck underway20:57
*** dpawlik has quit IRC20:59
fungiif you need help with files.o.o, lmk20:59
corvusback up now20:59
corvusso what it did is it wrote blank certificate path names for tarballs.opendev.org, docs.opendev.org, docs.starlingx.io, and zuul-ci.org21:00
corvushttps://review.opendev.org/66343121:00
fungihrm, undefined variables in the erb?21:01
corvusi don't understand what's missing there, but i admit, i don't understand variables in puppet21:01
*** ekultails has quit IRC21:02
fungii suspect it's a puppet scoping thing, and @ is referring to variables in the vhost resource scope not the containing manifest? checking for another place we've done this21:02
*** jamesmcarthur_ has quit IRC21:03
clarkbthe problem is scoping21:03
clarkb@ssl_cert_file and friends are only in the website.pp scope not the vhost scope21:03
fungiyeah, other places where we're doing this we use:21:03
fungiSSLCertificateFile <%= scope['openstack_project::static::cert_file'] %>21:03
clarkbso you have to look up the other location's value21:03
fungior similar21:03
fungibecause those are classvars for the files class, not attributes of the vhost resource21:04
fungier, website class21:04
corvusdo either of you know which keyboard buttons to press to fix this?21:05
fungiyeah, i can copy what we're using on static.o.o21:05
TheJuliaQuestion, is there any way to configure node selection such that a job will always land on an ipv6 enabled cloud?21:05
fungijust a moment21:05
*** jamesmcarthur has joined #openstack-infra21:05
clarkbTheJulia: no21:05
TheJuliathat is going to be ouchy for ipv6 testing :(21:06
clarkbTheJulia: what do you need an ipv6 enabled cloud for?21:06
clarkbright so I've explained this to the TC a couple times now21:06
fungiTheJulia: nope, but all our nodes are ipv6-enabled, they may just lack global v6 routes. it's best not to have job behaviors depend on the routing available to the nodes21:06
clarkbwhat probably should be done to test the ipv6 only is supported assertion is do packet captures21:06
clarkbthen allow external connectivity to external resources (zuul, dns, mirrors, etc) via a whitelist21:06
fungii did mention that as a suggestion in my goal rewrite too21:06
clarkbthen if you have any other ipv4 traffic fail the test21:06
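The capture-and-allowlist check described above might be sketched as follows. This is an illustration under stated assumptions, not an existing job or tool: the function name `unexpected_ipv4_peers` is hypothetical, the capture is assumed to have already been reduced to a list of destination addresses, and the allowlist entries stand in for external resources such as zuul, dns and mirrors.

```python
import ipaddress


def unexpected_ipv4_peers(flows, allowed):
    """Return sorted IPv4 destinations seen in flows that are not allowlisted.

    flows: iterable of destination address strings taken from a capture.
    allowed: iterable of address or network strings the job may reach over IPv4.
    """
    allowed_nets = [ipaddress.ip_network(a, strict=False) for a in allowed]
    bad = set()
    for dst in flows:
        addr = ipaddress.ip_address(dst)
        if addr.version != 4:
            continue  # IPv6 traffic is what the test wants to see
        if not any(addr in net for net in allowed_nets if net.version == 4):
            bad.add(str(addr))
    return sorted(bad)
```

A job would then fail whenever the returned list is non-empty, regardless of which routes the underlying cloud happens to provide.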
TheJuliaclarkb: well, the devstack provided host_ipv6 var can't be trusted unless devstack changes :\21:07
clarkbTheJulia: not sure I understand. The trust comes from the external packet capture check21:07
TheJuliaclarkb: so not about validation, more about job mechanics of actually having things work reliably because I've seen it set and not set on job runs.21:07
clarkbif you see traffic that isn't whitelisted you fail21:07
clarkbTheJulia: any traffic that ends up on v4 would show up in your capture though21:08
TheJuliaclarkb: I'm talking about something different than goal validation, I'm talking about actually being able to trust the variables in the jobs21:08
clarkbthat should cover your concern?21:08
clarkbTheJulia: and I'm saying if the variable isn't working properly the validation should catch that21:08
fungithe job ought not depend on what routing is available to the node21:08
TheJuliaclarkb: that would be nice :)21:09
fungias an analog, swift wants to test that their software works with xfs, but our nodes are provisioned with ext4 filesystems. the solution is to create a virtual filesystem and use that, not expect special nodes which are booted on xfs instead21:09
TheJuliafungi: If that is what causes HOST_IPV6 to be set or not :\21:09
TheJuliafungi: doing that already but yeah, it creates a fun code path21:10
*** dpawlik has joined #openstack-infra21:10
clarkbok I think I understand better you are saying devstack auto detects that for you and won't on ipv4 only clouds. For that we should just set the value21:10
clarkbdon't rely on devstack to autodetect21:10
clarkbthat could be via virtual network that is the same on all test runs21:11
fungiyep, we should be asking devstack for consistent behavior in our jobs, not context-dependent behavior guessed from the system21:11
TheJulia++++++21:11
clarkb(or ansible in zuul that checks host facts)21:11
TheJuliafungi: that21:11
fungithe context-dependent guessing is likely useful for local use, but doesn't lead to consistent results21:12
clarkbthe gitea backup cron is in place now21:13
*** dpawlik has quit IRC21:14
clarkbwe did similar with nova net once upon a time and always set up networking for it as if it were multinode (whether or not it was actually multinode)21:15
clarkbthen the testing consistently used the virtual network across the board21:15
clarkbcould do similar here21:16
*** jamesmcarthur_ has joined #openstack-infra21:16
clarkbor use the scope local addr (though I bet routing gets weird with that)21:16
TheJuliaWe do the same actually for multinode. I'll ask the QA team because I'd like a simpler code path. :)21:16
TheJuliaThe downside is for baremetal, we need to fake out the end virtual machines to think they have global for $reasons21:17
clarkbfungi: in your fix for corvus remember the scoping is slightly weird because the scope you need to lookup is for a define which is instantiated multiple times21:17
clarkbfungi: I think you have to stick the name of the define in the lookup path somewhere21:18
*** jamesmcarthur has quit IRC21:18
fungii'm about to push it as soon as i finish the commit message, and then you can tell me it's wrong ;)21:18
openstackgerritJeremy Stanley proposed opendev/system-config master: Correctly scope SSL path lookup for website vhosts  https://review.opendev.org/66377421:20
fungicorvus: clarkb: ^ that's what i *think* we want21:20
clarkbfungi: ya, my only concern is that there are many website instantiations each with different values for those vars21:21
clarkbso I think we may have to lookup the specific instances of the defined type?21:22
clarkbcmurphy: ^ do you know21:22
corvus(restarted apache again)21:23
fungioh, i see, as in need some way to ask the vhost resource what its parent instance of the webserver class is?21:23
* cmurphy looks21:23
clarkbfungi: ya21:23
clarkbor maybe that is automagic within the context of each website definition21:23
clarkb(and nothing extra is needed)21:23
fungii guess this works for our "static" class because it only refers to one server21:23
clarkbhttps://puppet.com/docs/puppet/4.8/lang_defined_types.html#containment may be the answer21:25
clarkb(in that it is automagic)21:25
*** mriedem is now known as mriedem_afk21:26
*** whoami-rajat has quit IRC21:27
*** weifan has quit IRC21:27
fungithere are many things about puppet which seem magic to me, so i'll just not be surprised that it's not a problem21:28
cmurphyi'm not sure if puppet 4 can handle this better than puppet 3, what i think we should do is use this https://review.opendev.org/57082421:28
cmurphyis it really not a problem?21:29
*** markvoelker has joined #openstack-infra21:29
clarkbcmurphy: we don't know yet. But also that change you linked won't fix this because those vars will still be outside of scope? or do you mean evaluate template first then pass to vhost content?21:29
clarkbya that must be it, so basically pass a really large string in that has already been interpolated21:30
*** weifan has joined #openstack-infra21:30
clarkbI guess we can try fungi's change and see if it is automagic21:30
clarkb(our testing may even do this for us already?)21:30
cmurphyclarkb: yeah evaluate the template first which would have the variables set properly at that scope and then pass the result to httpd21:31
corvusi trust our testing on this as far as i can throw the cloud it runs on21:31
cmurphyi don't think we trust the ssl paths very well21:31
cmurphyi sort of doubt that there is automagic that works for this21:31
fungiwe do it this way for the vhosts in openstack_project::static but we only instantiate that class once21:33
fungiand it provides multiple vhost resources21:33
cmurphyfungi: that is because it's a class, you can only instantiate it once21:33
cmurphythe openstack_project::website is a defined type21:33
cmurphyis the problem21:33
cmurphyit doesn't have the same place in the hierarchy as a class does21:33
fungioh, wow, i completely missed that it's a define not a class21:34
fungican we define these at the node level?21:35
clarkbfwiw I have an alternate change in the works21:35
*** iokiwi has joined #openstack-infra21:36
*** jamesmcarthur_ has quit IRC21:36
*** jamesmcarthur has joined #openstack-infra21:37
fungithough i guess it would be a lot of repeated conditional logic in the node definition if we did it that way21:37
openstackgerritClark Boylan proposed opendev/system-config master: Evaluate files website vhosts in context of website not vhost  https://review.opendev.org/66377721:37
clarkbsomething like that (I think I got the mapping of the var names correct too but double check me on that)21:37
clarkbcmurphy: ^21:38
cmurphylgtm21:40
corvuswhoah.21:42
corvuswould have taken me the rest of the day to get to that point.  :)21:42
*** kjackal has quit IRC21:42
clarkbI guess I can double check the @name value == @srvname value via the logs path21:43
fungithat's... deceptively simple21:43
* clarkb does this21:43
clarkbya they seem to line up21:43
*** jamesmcarthur has quit IRC21:46
corvusi'll direct-enqueue that21:47
*** aakarsh has quit IRC21:47
corvuscmurphy, clarkb, fungi: thank you!21:48
fungii'll abandon mine21:49
clarkbthis is one of those really weird corner cases in puppet21:51
clarkbI assume the role path thing we ran into with ansible 2.8 is similar21:51
clarkbin theory the tool knows the data you want is right there21:51
corvus(restarted apache)21:51
clarkbyou just can't get it21:51
*** cloudnull has quit IRC22:00
*** d34dh0r53 has quit IRC22:01
ianwfrickler: thanks for pointing out https://docs.ansible.com/ansible/latest/porting_guides/porting_guide_2.8.html#imports-as-handlers ... i'm certain that has something to do with it ...22:01
ianwi tried "listen:" which seemed like the logical thing to do https://opendev.org/opendev/system-config/src/branch/master/playbooks/roles/letsencrypt-create-certs/handlers/main.yaml#L622:02
ianwi'll go back to that, maybe it's fixed22:02
*** markvoelker has quit IRC22:03
clarkbianw: yes should be fixed (though the child changes may need a rebase)22:03
fungiafs02.dfw did come back up and mount /vicepa successfully, btw22:04
fungii just checked in on it22:04
corvusfungi: yay!22:06
fungivos listvldb is reporting "Old release" on a bunch of its volume copies22:06
fungii assume as each of those gets a fresh vos release it'll do a full release for afs02.dfw?22:07
corvusyeah it should; we can probably leave it to its own devices and just check back tomorrow22:07
corvus(unless we're really antsy, in which case we should take the cron job out of service and run releases manually)22:08
fungii am not in the least bit antsy22:08
fungithough my yard is increasingly so22:08
fungiants are on a rampage this year22:09
* corvus suddenly feels itchy22:09
clarkbhttp://logs.openstack.org/77/663777/1/check/infra-puppet-apply-4-ubuntu-xenial/aa31350/applytest/puppetapplytest24.final.out.FAILED is alias a special word in ruby?22:10
ianwclarkb: oh, awesome, sorry missed your updates :)22:11
cmurphyclarkb: til yes https://blog.bigbinary.com/2012/01/08/alias-vs-alias-method.html22:11
corvusthe GOOD news is the test caught it! :)22:12
fungialias_ ftw22:12
openstackgerritIan Wienand proposed opendev/system-config master: Update to ansible-lint 4.1.0  https://review.opendev.org/66350122:12
openstackgerritClark Boylan proposed opendev/system-config master: Evaluate files website vhosts in context of website not vhost  https://review.opendev.org/66377722:12
clarkbcmurphy: cool I just changed it to alias_name22:12
openstackgerritIan Wienand proposed opendev/system-config master: Update to ansible-lint 4.1.0  https://review.opendev.org/66350122:13
openstackgerritIan Wienand proposed opendev/system-config master: Ansible roles for backup  https://review.opendev.org/66265722:13
corvusre-enqueued22:14
ianwassuming CI is happy with the new stuff, ^ should be ready for review too22:14
clarkbianw: no worries the path thing on the include_tasks is weird22:14
clarkbbut seems happy with using the longer path so good enough probably22:14
ianwi can retry the listen: thing too ... it's the more logical thing to register a handler but i have no desire to dive deep into how listen & include_tasks etc interact in the ansible code base :)22:16
*** aakarsh has joined #openstack-infra22:19
*** weifan has quit IRC22:22
corvus(restarted apache)22:22
*** rcernin has joined #openstack-infra22:23
*** weifan has joined #openstack-infra22:23
*** rcernin has quit IRC22:23
*** rcernin has joined #openstack-infra22:24
clarkbzuul says 13 minutes to merging the fix22:24
*** jcoufal has joined #openstack-infra22:25
*** rcernin has quit IRC22:26
*** rfarr_ has quit IRC22:27
fungiold release copies on afs02 have fallen from 14 to 10 since i last checked22:27
fungiso it does appear to be catching up22:27
*** slaweq has quit IRC22:27
*** cloudnull has joined #openstack-infra22:27
*** rcernin has joined #openstack-infra22:28
*** d34dh0r53 has joined #openstack-infra22:28
clarkbnodepool builders are continuing to successfully build images too22:29
openstackgerritMerged opendev/system-config master: Evaluate files website vhosts in context of website not vhost  https://review.opendev.org/66377722:36
corvus\o/22:38
*** yamamoto has joined #openstack-infra22:47
ianwfungi: do the fanatical support messages line up at all with any of the afs errors we saw on the opendev.org mirror?22:51
funginope22:52
fungithey were about a volume for afs02 which the mirrors aren't using22:52
corvusi restarted apache again.  this is why i think there's something weird about the ansipuppet loop that sometimes requires 2 passes for a change to go into effect.22:53
*** tkajinam has joined #openstack-infra22:53
*** slaweq has joined #openstack-infra22:55
*** jcoufal has quit IRC22:55
clarkbcorvus: we only update it at the very start of the cron22:56
clarkbso if you merge after that then you have that pass then the next before it applies22:56
corvusclarkb: ah, looks like we're on half-hour pulses (2230-2253 was most recent)22:58
corvusso that jives22:59
*** markvoelker has joined #openstack-infra22:59
corvusso this should finally go onto the fs around 23:2223:00
fungiso we have a pulse starting now which should hopefully apply it23:00
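The timing worked out above can be sketched as a small calculation. This assumes, as the discussion suggests, that pulses start on fixed half-hour boundaries and that the repository is only refreshed at the very start of each pulse; the function name `pulse_that_applies` is hypothetical.

```python
from datetime import datetime, timedelta


def pulse_that_applies(merged_at, interval_minutes=30):
    """Return the start of the first pulse that begins at or after merged_at.

    Assumes pulses start on fixed boundaries counted from midnight and that a
    change is only picked up at the very start of a pulse.
    """
    interval = timedelta(minutes=interval_minutes)
    midnight = merged_at.replace(hour=0, minute=0, second=0, microsecond=0)
    elapsed = merged_at - midnight
    pulses_done = -(-elapsed // interval)  # ceiling division on timedeltas
    return midnight + pulses_done * interval
```

For the fix that merged at 22:36, this gives a 23:00 pulse start, which matches the expectation that the change lands on disk some time after 23:00 rather than during the 22:30 pulse.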
*** slaweq has quit IRC23:08
*** slaweq has joined #openstack-infra23:11
*** dpawlik has joined #openstack-infra23:11
*** jamesmcarthur has joined #openstack-infra23:13
*** dpawlik has quit IRC23:15
*** jamesmcarthur has quit IRC23:16
*** jamesmcarthur has joined #openstack-infra23:17
corvusbroken; fixing23:22
corvusokay, so i guess that didn't work?23:23
corvusresult was the same -- same sites had nothing in the ssl cert path sections23:23
clarkbhrm23:23
clarkbthe three variable names seem correct23:24
*** slaweq has quit IRC23:24
corvus#status log added files02.openstack.org to emergency file due to recent system-config changes breaking apache config23:24
clarkbcmurphy: ^ any other ideas?23:25
openstackstatuscorvus: finished logging23:25
corvusi can undertake to set up a more complete test and fix tomorrow23:26
*** dpawlik has joined #openstack-infra23:27
clarkboh I see it ugh23:27
clarkbmaybe23:27
clarkbit's an order of operations thing I think23:27
clarkbchange in a sec23:27
*** dchen has joined #openstack-infra23:29
openstackgerritClark Boylan proposed opendev/system-config master: Evaluate files vhosts after we determine ssl file paths  https://review.opendev.org/66379623:29
clarkbthat23:29
corvusdoes that really change the order?23:30
corvusi have no idea when order does or does not matter in puppet23:31
clarkbin modern puppet it evaluates files top to bottom because people found the other behavior too confusing23:31
corvusbut i guess that's about a sequence of steps to create a graph, and the graph itself may be unordered?23:31
clarkbso I'm fairly certain now that we are in puppet 4 the order matters there23:31
corvusoh, ok, so it's just ordered :)23:31
*** markvoelker has quit IRC23:31
*** dpawlik has quit IRC23:31
corvusif that lands, i can mind it tomorrow23:38
cmurphyclarkb: ah yeah the ordering of variables matters in puppet 3 and puppet 423:39
*** yamamoto has quit IRC23:41
*** yamamoto has joined #openstack-infra23:41
*** jamesmcarthur has quit IRC23:42
*** lseki has quit IRC23:42
*** jamesmcarthur has joined #openstack-infra23:45
clarkbianw: https://review.opendev.org/663796 care to get that one in so it is ready for corvus tomorrow?23:46
ianwlgtm ... yeah ansible got it right just being (mostly) a glorified shell script23:47
clarkbianw: I rechecked the ansible lint change as it failed on beaker23:48
ianwbtw on that ansible-lint change, there was a response to https://github.com/ansible/ansible-lint/issues/53423:48
clarkband well those two things shouldn't be related23:48
*** weifan has quit IRC23:48
*** hwoarang has quit IRC23:48
ianwhaving tried doing similar things with bashate, i can somewhat feel the pain23:48
*** weifan has joined #openstack-infra23:49
*** hwoarang has joined #openstack-infra23:50
clarkbstill don't have new control plane images with the ext4 journal fix. Hopefully we'll have one tomorrow and I can start poking at a replacement gitea0623:50
*** weifan has quit IRC23:50
*** weifan has joined #openstack-infra23:51
*** weifan has quit IRC23:51
*** aaronsheffield has quit IRC23:51
clarkbbasically every image has to be rebuilt now that builders are happier23:51
clarkband the control plane image is stuck in the queue23:51
*** weifan has joined #openstack-infra23:53
*** weifan has quit IRC23:53
*** weifan has joined #openstack-infra23:54
*** mriedem_afk has quit IRC23:54
*** rcernin has quit IRC23:55
*** rcernin has joined #openstack-infra23:56
*** jamesmcarthur has quit IRC23:57
*** weifan has quit IRC23:58
