Monday, 2017-10-16

openstackgerritDavid Moreau Simard proposed openstack-infra/openstack-zuul-jobs master: Aggregate OpenStack-Ansible required projects in a single base job  https://review.openstack.org/51212600:00
pabelangermnaser: wait00:00
pabelangerwhy are you using base-test?00:00
dmsimardodyssey4me, cloudnull, evrardjp, logan- ^ this probably resolves broken OSA gate jobs but I'd like OSA folks to chime in00:01
*** felipemonteiro has quit IRC00:02
mnaserpabelanger that was an oopsie which i later fixed in a follow-up change on master (which i was going to backport to stable branches afterwards)00:02
*** bobh has joined #openstack-infra00:02
mnasermaybe that could be related00:03
mnaserlet me fix that in the backport as well00:03
mnaserand see what gives00:03
pabelangeryah, I'm going to stop looking until jeblair returns. I'm just guessing at things now00:04
pabelangermnaser: if you haven't done so, please update zuulv3-issues on etherpad00:05
pabelangerso we can properly track this00:05
jeblairback00:05
pabelangeryay00:05
mnaserpabelanger im going to push them back up with base-test removed00:07
sc`chef jobs seem to work. it just. won't. die.00:08
pabelangersc`: that's good right?00:08
sc`apparently!00:08
mnaserwelp, still happened00:10
mnaserill add it to zuul issues00:10
jeblairmnaser: link to change without base-test?00:10
mnaserhttp://zuulv3.openstack.org/static/stream.html?uuid=e6ee304be4974c3da305370b5ee988cd&logfile=console.log00:10
mnaserif you can click super quick00:11
mnaserotherwise gonna have to wait for it to hit retry_limit00:11
jeblairmnaser: well, i meant the change00:11
mnaseroh, sure one sec00:11
mnaserjeblair: https://review.openstack.org/#/c/509202/ https://review.openstack.org/#/c/509204/ https://review.openstack.org/#/c/509205/00:11
*** baoli has joined #openstack-infra00:11
mnaser(all same issue)00:11
*** felipemonteiro_ has quit IRC00:14
*** baoli has quit IRC00:16
pabelangerhttp://logs.openstack.org/27/512027/1/gate/legacy-grenade-dsvm-neutron/c131ea1/ just failed with POST_FAILURE looking00:16
jeblairi'm going to dig into the multi-pre thing00:17
pabelangerremove-build-sshkey : Remove the build SSH key from all nodes - unreachable00:17
pabelangerthat was citycloud00:17
pabelangerwhich is odd00:19
pabelangerbecause post-logs playbook worked00:19
pabelangerI wonder if we should consider adding retries to ansible.cfg00:20
pabelangerhttp://docs.ansible.com/ansible/latest/intro_configuration.html#retries00:20
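For context on the retries setting linked above, here is a minimal sketch in Python that renders an ansible.cfg fragment enabling SSH connection retries. The `retries` key under `[ssh_connection]` is Ansible's documented option. The value 3 and the helper name `render_ansible_cfg` are assumptions for illustration and are not taken from any change discussed in this log.

```python
import configparser
import io


def render_ansible_cfg(retries=3):
    """Return ansible.cfg text that enables SSH connection retries.

    The retry count is a hypothetical value chosen for illustration.
    """
    cfg = configparser.ConfigParser()
    # [ssh_connection] retries controls how often Ansible re-attempts
    # an SSH connection that failed before a task could run.
    cfg["ssh_connection"] = {"retries": str(retries)}
    buf = io.StringIO()
    cfg.write(buf)
    return buf.getvalue()


if __name__ == "__main__":
    print(render_ansible_cfg())
```

Retries of this kind only help with transient connection failures. They would not address a host that has really gone away.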
ianwi'm going to move the hardlink issues from needs triage to in progress00:21
ianwjust to keep that clear-ish00:23
pabelangermordred: odyssey4me: cloudnull: do you remember where the SSH tweaks we added for ansible went?00:23
pabelangermordred: odyssey4me: cloudnull: I think there were some recommendations based on cloud100:23
*** slaweq has quit IRC00:23
*** slaweq has joined #openstack-infra00:27
openstackgerritPaul Belanger proposed openstack-infra/zuul feature/zuulv3: Enabled ssh retries for ansible  https://review.openstack.org/51213000:30
*** baoli has joined #openstack-infra00:32
*** gildub has joined #openstack-infra00:32
*** psachin has joined #openstack-infra00:34
*** Goneri has joined #openstack-infra00:38
jeblairmnaser: i've reproduced the issue in a test case; i'm assuming it's a zuul bug; will continue to dig00:38
mnaserjeblair cool, i guess that's good in some ways :)00:39
jeblairyeah, i'm pretty happy as long as i can make a test case :)00:40
*** huanxie has joined #openstack-infra00:40
openstackgerritPaul Belanger proposed openstack-infra/system-config master: Add statsd_host to zuulv3.o.o, zuul-mergers and zuul-executors  https://review.openstack.org/51213100:41
pabelangerjeblair: not to distract, but I believe we also need ^ for statsd on zuulv3 things00:41
pabelangerI just checked zuul.conf on zuulv3.o.o and statsd wasn't configured yet00:42
jeblairpabelanger: oh i thought that was set, thanks00:45
pabelangerI think I see an issue with neutron releasenotes jobs, looking at that now00:46
*** ijw has joined #openstack-infra00:48
*** sdague has quit IRC00:50
fungiokay, i'm done deleting the ~60 verify -2 votes from "jenkins" on open changes now00:50
fungigonna knock off for the evening and prepare for an exciting day of zuul v3 tomorrow00:51
funginight all! great work on the rollout00:52
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Fix root-marker for releasenotes publish job  https://review.openstack.org/51213200:52
* clarkb had to put dinner in the oven00:53
*** ijw_ has joined #openstack-infra00:54
pabelangerokay, ^ should fix releasenotes publishing jobs00:54
pabelangeradded to zuulv3-issues00:54
*** ijw__ has joined #openstack-infra00:55
*** baoli has quit IRC00:56
*** ijw has quit IRC00:56
*** bobh has quit IRC00:56
*** armax has joined #openstack-infra00:57
*** ijw_ has quit IRC00:58
*** kiennt26 has joined #openstack-infra00:59
*** slaweq has quit IRC01:00
pabelangerHmm01:02
pabelangerSSH Error: data could not be sent to remote host "146.20.110.99". Make sure this host can be reached over ssh01:02
pabelangerthat is another job01:02
pabelangerhttp://logs.openstack.org/37/509337/2/check/legacy-interop-tox-doc8/dbe1920/01:02
*** baoli has joined #openstack-infra01:02
pabelangerI'm not sure why we are seeing an increase01:02
openstackgerritMerged openstack-infra/project-config master: Fix root-marker for releasenotes publish job  https://review.openstack.org/51213201:04
*** slaweq has joined #openstack-infra01:04
*** LindaWang has joined #openstack-infra01:04
mnaserpabelanger i have an idea about the possibility of a cause01:05
pabelangercould be networking coming out of rackspace01:05
mnaserrelated to unbound issue, timing out ssh-ing because DNS resolution?01:05
clarkbpabelanger: the host key changed in that log and we have strict host key checking01:05
pabelangerclarkb: oh, really?01:05
clarkbpabelanger: ya its in the log you linked01:05
pabelangerOh, wow01:06
clarkbhttp://logs.openstack.org/37/509337/2/check/legacy-interop-tox-doc8/dbe1920/job-output.txt.gz#_2017-10-16_00_59_22_00357201:06
pabelangerI see that now01:06
mnaser"Resolving apt.puppetlabs.com (apt.puppetlabs.com)... failed: Temporary failure in name resolution." boo :<01:06
pabelangerclarkb: but we did copy logs off in a later task01:06
pabelangermnaser: log?01:06
mnaserhttp://logs.openstack.org/10/511910/2/check/puppet-openstack-integration-4-scenario002-tempest-ubuntu-xenial/d97bdb2/01:07
clarkbthe task before it did package updates01:07
pabelangerclarkb: maybe this is the issue of a node coming back online for a moment, then disappearing01:07
mnaserlooks like it ran on our cloud, but still strange01:07
clarkbpabelanger: or multiple nodes with the same IP?01:07
clarkbor package updates restarting services?01:07
pabelangerclarkb: it is rax-iad01:07
clarkbso maybe the ssh service wasn't there and it got a "null" host key?01:07
pabelangerclarkb: no, because http://logs.openstack.org/37/509337/2/check/legacy-interop-tox-doc8/dbe1920/job-output.txt.gz#_2017-10-16_00_57_06_235134 would fail01:08
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Ignore parents on job variants  https://review.openstack.org/51213401:08
*** baoli has quit IRC01:08
jeblairmnaser: ^01:08
clarkbAdding debian:TÜBİTAK_UEKAE_Kök_Sertifika_Hizmet_Sağlayıcısı_-_Sürüm_3.pem that is an awesome cert name btw01:08
jeblairwow that pretty much hosed my irc client01:09
pabelangermnaser: so that failure is in vexxhost, and I did see a DNS issue today from them.01:09
clarkbpabelanger: the package updates happen afterwards though01:09
clarkbjeblair: sorry (but that makes it even more awesome I think)01:09
*** bobh has joined #openstack-infra01:09
*** thorst has quit IRC01:09
mnaserpabelanger weird.  i can resolve that with no problems on another vm here (granted it doesn't run unbound to do the recursive resolve)01:10
jeblairmnaser, pabelanger: ^ there's the fix for the multiple pre thing01:10
pabelangerclarkb: yah, but we do run another pre playbook after the updates01:10
* mnaser looks01:10
clarkbpabelanger: that is the one that failed01:10
jeblairmnaser: you can get the correct behavior now by dropping the 'parent' lines from the jobs you're adding to the stable branch01:10
pabelangermnaser: it is possibly related to unbound, ianw and dmsimard have been working on that01:10
pabelangerclarkb: no, run playbook fails I think01:11
jeblairmnaser: (when that fix lands, they'll be ignored anyway, and eventually, we'll figure out how to get zuul to emit warnings suggesting you remove them)01:11
clarkbpabelanger: http://logs.openstack.org/37/509337/2/check/legacy-interop-tox-doc8/dbe1920/job-output.txt.gz#_2017-10-16_00_59_19_546353 its bindep then error01:11
*** cuongnv has joined #openstack-infra01:11
ianwpabelanger: we can *probably* put that setup back in ... depends if it's more risky than the problem at hand :)01:12
pabelangerclarkb: wow, I think that log is truncated01:13
mnaserjeblair does that mean that if we wanted a base per branch, we'd need to have a different name of parent in each branch? i was hoping each branch would maintain its own fully self-contained zuul.yaml so when we branch out a release, it keeps its own jobs without us possibly breaking them inadvertently by making a change to a base01:13
pabelangerclarkb: see: http://logs.openstack.org/37/509337/2/check/legacy-interop-tox-doc8/dbe1920/ara/01:13
pabelangerclarkb: oh wait01:13
pabelangerclarkb: Now I understand01:13
jeblairmnaser: no, you can still have the same config -- the branch01:13
jeblairblah01:13
clarkbpabelanger: ya its run.yaml's bindep install that breaks01:14
jeblairmnaser: you can still have the same jobs on all the branches -- the ones on the branches just become "variants"01:14
clarkbor rather the next one after that?01:14
pabelangerclarkb: very odd01:14
clarkbI wouldn't be too surprised if it is part of package updates racing somehow01:14
jeblairmnaser: it is intended to work the way you expect; i just didn't have a complex enough test case to catch the issue with the variants having parents01:15
clarkbthough openssh-server is not in the package list01:15
jeblairmnaser: what you won't be able to do is change the inheritance structure across branches.  if you want to do that, you'll need to make new jobs.01:15
pabelangerclarkb: it's like we lose the node during bindep, like you said01:15
pabelangerclarkb: I do see openssl01:16
pabelangerbut, shouldn't be an issue01:16
jeblairpabelanger, clarkb: some kinds of package updates restart sshd01:17
jeblairie glibc01:17
clarkbjeblair: ya I'm guessing it is something like that, maybe openssl (though I thought openssh was completely its own implementation)01:17
ianwdo we have any best practices around base-test, or just submit a [DNM] something01:17
jeblairianw: that's the best practice01:17
pabelangerjeblair: clarkb: yah, so possible we might be seeing more until new DIBs are uploaded01:18
clarkbexisting connections should remain though, I thought we used the connection manager pool thing?01:18
pabelangertime will tell I think01:18
clarkbeg why is ansible making a new connection at all?01:18
pabelangerclarkb: right, that is what I am confused about too01:18
jeblairclarkb: yes, we use control persist01:18
mnaserjeblair makes sense, that's fine by me in that case, makes complete sense, thank you!01:18
pabelangerbut even if sshd restarted, why would we get new hostkey for 1 task, then work again for another01:20
openstackgerritIan Wienand proposed openstack-infra/openstack-zuul-jobs master: [dnm] parent integration test to base-test  https://review.openstack.org/51213501:21
clarkbrereading it it says data could not be sent to the remote01:21
*** yamamoto has joined #openstack-infra01:21
clarkbthen the next thing is the host key thing01:21
clarkbas if the connection actually is closed by the remote side, then we retry to connect and that fails01:21
pabelangeryah01:21
*** ijw__ has quit IRC01:23
clarkbit does Running hooks in /etc/ca-certificates/update.d... then the bindep listing for test01:23
clarkbI wonder if it is a package in test breaking it? and we don't see it because the connection borks before we can read that data01:24
clarkbprobably the thing to do is run that command on a booted rax-iad instance and see what happens to ssh01:24
openstackgerritPaul Belanger proposed openstack-infra/system-config master: Set SETUP_PIP=false for tools/prep-apply.sh  https://review.openstack.org/51213701:25
openstackgerritJames E. Blair proposed openstack-infra/project-config master: Document base job change procedure  https://review.openstack.org/51213801:25
jeblairianw: ^01:25
*** wewe0901 has joined #openstack-infra01:25
*** baoli has joined #openstack-infra01:26
* jeblair -> dinner01:26
openstackgerritIan Wienand proposed openstack-infra/openstack-zuul-jobs master: [dnm] parent integration test to base-test  https://review.openstack.org/51213501:27
ianwjeblair: cool ^ is sort of testing a roll back, so we could mention the integration tests/match as being good targets, but probably not necessary for new commits01:30
clarkbpabelanger: maybe you know the answer, which job has a variant in the test for the multi pre fix change?01:31
*** Goneri has quit IRC01:31
openstackgerritPaul Belanger proposed openstack-infra/system-config master: Set SETUP_PIP=false for tools/prep-apply.sh  https://review.openstack.org/51213701:31
clarkboh is it implicit because of the branch?01:31
clarkbthat is quite magical01:32
clarkbianw: how goes es07?01:32
*** daidv has joined #openstack-infra01:33
ianwvolume is 69.1% moved01:33
ianwas discussed, when done i'll remove the ssd, reboot for good measure and then bring it up01:33
clarkbcool thanks01:34
pabelangerclarkb: in puppet-integrations, I believe name: puppet-module-base is the job that tests the problem01:34
ianwpabelanger: ^^^ should you check for zuul vars?01:34
pabelangerianw: zuul vars?01:35
ianwpabelanger: sorry, before setting it to not install pip, just in case it's used externally?01:35
pabelangerianw: oh, yah. Maybe we should01:35
ianwi dunno, google seems to suggest that's an overblown concern01:36
pabelangerSo far, zuulv3 holding up well01:37
pabelangerjobs generally appear to be green too01:37
pabelangerHmm01:37
pabelangermnaser: puppet-cinder and puppet-tripleo should be in same change pipeline in gate right?01:37
mnaserpabelanger yeah, i was looking at that going "hmm"01:38
mnaserpretty much most of puppet-* gates with puppet-tripleo and tripleo-heat-templates and networking-bagpipe and a few others01:38
*** slaweq has quit IRC01:38
pabelangeryah, I think we are missing queue: tripleo some place01:39
pabelangerlet me see if I can find a good job01:39
pabelangerlikely tripleo-ci is one01:40
*** baoli has quit IRC01:41
pabelangermnaser: where did puppet-openstack-integration project-templates go?01:43
mnaserpabelanger moved to .zuul.yaml in-repo01:43
*** sbezverk has quit IRC01:43
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Create tripleo change queue for gate pipeline  https://review.openstack.org/51214001:44
*** sbezverk has joined #openstack-infra01:44
pabelangerclarkb: mnaser: jeblair: okay, so that should be the first step to create tripleo change queue for gate.01:44
*** dhinesh has joined #openstack-infra01:45
pabelangermnaser: I think the next step would be to add queue: tripleo to your openstack-puppet-integration template for gate pipeline too01:45
mnaserpabelanger ack, will do that now01:45
*** slaweq has joined #openstack-infra01:45
pabelangerI say, I think because I'd like jeblair or clarkb to comment01:45
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Ignore parents on job variants  https://review.openstack.org/51213401:46
mnaserpabelanger but the thing is i didnt migrate any of the puppet-cinder jobs for example01:46
mnaserwell, some of them did get migrated but not all01:46
pabelangermnaser: right, but they should be using the project-template?01:46
*** fanzhang has left #openstack-infra01:49
mnaserpabelanger remote:   https://review.openstack.org/512141 move jobs to tripleo gate queue01:50
mnaserand ...01:50
openstackgerritMohammed Naser proposed openstack-infra/openstack-zuul-jobs master: move puppet jobs to tripleo queue  https://review.openstack.org/51214201:50
openstackgerritMerged openstack-infra/system-config master: Add statsd_host to zuulv3.o.o, zuul-mergers and zuul-executors  https://review.openstack.org/51213101:55
pabelangermnaser: I don't think you need to add linters, but shouldn't hurt anything01:55
pabelangerlets see what others say01:55
pabelangermnaser: do you mind updating zuulv3-issues etherpad01:55
mnaserpabelanger sure np one moment!01:55
mnaserpabelanger triaged or?01:56
mnaseri'll go for issues with jobs01:56
mnaserpabelanger added01:58
*** baoli has joined #openstack-infra02:01
pabelangerand releasenotes publishing fixed02:02
pabelangerhttps://docs.openstack.org/releasenotes/puppet-tripleo/.root-marker02:02
*** slaweq has quit IRC02:17
*** boden has quit IRC02:20
*** namnh has joined #openstack-infra02:26
*** gouthamr has quit IRC02:27
*** slaweq has joined #openstack-infra02:28
*** dave-mccowan has quit IRC02:35
*** yamamoto_ has joined #openstack-infra02:36
*** baoli has quit IRC02:37
*** adreznec has quit IRC02:37
*** felipemonteiro_ has joined #openstack-infra02:38
*** adreznec has joined #openstack-infra02:39
*** yamamoto has quit IRC02:39
pabelangeranother SSH Error02:40
pabelangerhttp://logs.openstack.org/15/512115/4/check/openstack-tox-linters/0d6e0a6/job-output.txt.gz#_2017-10-16_02_32_38_90919102:40
pabelangerthis time ssh host key didn't change02:40
pabelangeris there a way we can confirm our controlpersist is still working?02:41
openstackgerritNam Nguyen Hoai proposed openstack-infra/project-config master: Remove legacy jobs from Trove  https://review.openstack.org/51114902:42
*** esberglu has quit IRC02:43
clarkbpabelanger: ps -elf and look for control persist option on ssh commands?02:44
clarkbthat will at least show if the option is set02:45
pabelangeryah, I do see02:46
pabelangerssh -o ControlMaster=auto -o ControlPersist=60s02:46
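The check described above (scan `ps -elf` output for the ControlPersist option on ssh commands) can be sketched in Python as follows. The sample process lines and the function name are hypothetical. As noted in the conversation, this only shows whether the option is set, not whether the persistent connection is actually being reused.

```python
import re

# Matches "-o ControlPersist=<value>" as it appears on an ssh command line.
CONTROL_PERSIST = re.compile(r"-o\s*ControlPersist=(\S+)")
# Matches an ssh client invocation ("ssh " or "/usr/bin/ssh "), not "sshd".
SSH_COMMAND = re.compile(r"(?:^|[\s/])ssh\s")


def controlpersist_settings(ps_lines):
    """Return the ControlPersist values found on ssh command lines."""
    found = []
    for line in ps_lines:
        if not SSH_COMMAND.search(line):
            continue
        match = CONTROL_PERSIST.search(line)
        if match:
            found.append(match.group(1))
    return found


if __name__ == "__main__":
    # Hypothetical lines in the shape of `ps -elf` output.
    sample = [
        "0 S zuul 4242 1 0 80 0 - 1111 - 02:40 ? 00:00:00 "
        "ssh -o ControlMaster=auto -o ControlPersist=60s 203.0.113.7",
        "4 S root 901 1 0 80 0 - 2222 - 01:00 ? 00:00:00 sshd: zuul",
    ]
    print(controlpersist_settings(sample))
```

On a live executor the input could come from `subprocess.run(["ps", "-elf"], capture_output=True, text=True).stdout.splitlines()`.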
clarkbI think the sshd must be crashing/stopping in a way that existing connections notice02:47
clarkbthen if it reconnects fast enough we may see the host key issue but I bet that's mostly a race and not 100% based on how fast it reconnects02:47
pabelangerif that is the case, maybe https://review.openstack.org/512130/ will help02:49
pabelangerbut surprising we are seeing it happen more with zuulv302:49
pabelangermaybe because we are running more ansible-playbook invokes per job02:49
clarkbya I think that may be it02:50
pabelangerthere are also some SSH settings that cloud1 ran for ansible, would be interesting to find them again and maybe test02:50
*** gcb has joined #openstack-infra02:51
jeblaircontrolpersist doesn't span playbook invocations02:52
jeblairare we seeing errors when crossing a playbook boundary, or within the same playbook?02:52
pabelangerno, playbook boundary is more02:53
pabelangerI've seen about 5 since going live02:53
pabelangerI've added a few links to zuulv3-issues02:53
jeblairpabelanger: i do not understand the sentence "no, playbook boundary is more"02:53
pabelangerand just posted one above^02:53
pabelangerI am seeing the error more when we cross playbook boundary02:54
pabelangerthan in same playbook02:54
openstackgerritMerged openstack-infra/devstack-gate master: Remove v3 devstack job  https://review.openstack.org/51202702:54
jeblairpabelanger: any within the same playbook?02:54
pabelangerjeblair: just 1 so far02:54
pabelangerlet me get link02:54
pabelangerhttp://logs.openstack.org/37/509337/2/check/legacy-interop-tox-doc8/dbe1920/job-output.txt.gz#_2017-10-16_00_59_19_81772902:55
pabelangerthat was the ssh host key changed issue02:55
*** bobh has quit IRC02:57
pabelangerjeblair: what would we need to change to have controlpersist span playbooks?02:57
jeblairpabelanger: https://storyboard.openstack.org/#!/story/200107202:57
pabelangerjeblair: Ah, thanks. Now I remember02:58
*** slaweq has quit IRC02:58
*** gongysh has joined #openstack-infra02:58
jeblairpabelanger: are you still suspecting that the error you linked above might be due to a package install, or have you eliminated that possibility?02:59
*** slaweq has joined #openstack-infra02:59
pabelangerjeblair: I'm not sure myself, I haven't really dug into it yet.02:59
*** udesale has joined #openstack-infra03:00
jeblairpabelanger: okay, so it's possible there are two problems03:00
*** mrhillsman has quit IRC03:00
jeblairand that spending time on making controlpersist span playbooks may help with one of them03:01
*** mrhillsman has joined #openstack-infra03:01
pabelangeryes, I think clarkb is suggesting legacy-interop-tox-doc8 issue was connection was closed on host side for some reason, which could be package update, but I am not sure how host key could change03:02
clarkbI'm not sure the host key actually changed; could just be it wasn't read as expected due to the service restart03:02
jeblairclarkb: sounds plausible03:03
*** gildub has quit IRC03:03
jeblairi'll poke at persistent controlpersist for a few mins03:03
pabelangergreat03:05
ianw#status log elasticsearch07.o.o rebooted & elasticsearch started.  data was migrated from SSD storage and "main" vg contains only one block device now03:05
openstackstatusianw: finished logging03:05
ianwclarkb: ^03:05
ianwi've detached & removed the other volume03:06
clarkbtyty03:06
clarkbdid it come up quickly after reboot?03:06
ianwstatus is still "red" but it's pretty busy03:06
clarkbnot slow with timeouts like last one?03:06
ianwyep, came up normally03:06
clarkbya I expect it will be red until it notices the old indexes it had have since been deleted03:06
clarkbso that isnt entirely surprising03:07
*** thorst has joined #openstack-infra03:10
*** thorst has quit IRC03:15
*** masber has joined #openstack-infra03:20
*** felipemonteiro_ has quit IRC03:22
dmsimardclarkb, ianw: fyi I successfully tested a middleware for loading sqlite databases on the fly https://review.openstack.org/#/c/511992/03:26
*** esberglu has joined #openstack-infra03:26
dmsimardI'm just waiting on an OSA gate job to finish to retrieve its database (which is a good benchmark) to put numbers and perspective03:26
*** chlong has joined #openstack-infra03:29
*** esberglu has quit IRC03:30
*** bobh has joined #openstack-infra03:34
*** wewe0901 has quit IRC03:35
ianwi do get the heebie-jeebies when i see os.path.join on a provided URI :)03:36
dmsimardianw: it's not like I took that from your downloader patch or anything :p03:38
dmsimardhappy to run it through some security improvements, this is just a proof of concept to show -- hey, we can do this and it works, let's do it properly though03:39
dmsimardianw: it's heavily inspired by https://review.openstack.org/#/c/122615/ :D03:40
openstackgerritIan Wienand proposed openstack-infra/project-config master: Revert "Revert "Add configure-unbound to base job""  https://review.openstack.org/51215303:40
dmsimardyou get all the credit :)03:41
ianw2014 seems to be both so long ago, and actually not that long ago, both at the same time!03:42
dmsimardianw: ah yeah you have https://review.openstack.org/#/c/122615/2/os_loganalyze/downloader.py@83 to avoid people poking out03:43
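The concern about `os.path.join` on a caller-supplied path is that an absolute component or a run of `..` can escape the intended directory. A minimal sketch of one common guard is below. It is not the code from either review linked above, and the root directory and function name are hypothetical.

```python
import os


def safe_join(root, untrusted):
    """Join untrusted onto root, refusing results that escape root."""
    root = os.path.realpath(root)
    # os.path.join discards root if the second argument is absolute,
    # so strip leading slashes before joining.
    candidate = os.path.realpath(os.path.join(root, untrusted.lstrip("/")))
    # After resolving ".." and symlinks, the result must still sit under root.
    if os.path.commonpath([root, candidate]) != root:
        raise ValueError("path escapes root: %r" % untrusted)
    return candidate


if __name__ == "__main__":
    print(safe_join("/srv/logs", "37/509337/job-output.txt"))
    try:
        safe_join("/srv/logs", "../../etc/passwd")
    except ValueError as exc:
        print("rejected:", exc)
```

Resolving with `realpath` before comparing means symlinks that point outside the root are rejected as well, at the cost of touching the filesystem on every request.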
*** bobh has quit IRC03:44
*** udesale has quit IRC03:46
ianwpabelanger: ^^^ that's the unbound restore, i'll add some notes in there, but I'll also mark it WIP to avoid merge.  just thought we should have it primed in case these ssh issues turn out to be related03:46
*** bobh has joined #openstack-infra03:48
*** ykarel has joined #openstack-infra03:52
ianwfor something like -- http://logs.openstack.org/70/504870/5/infra-check/project-config-irc-access/f3625f9/ -- we haven't really logged anywhere what node it ran on, or what provider?  i'd really like that as a debug statement right at the top03:52
*** bobh has quit IRC03:53
*** edmondsw has joined #openstack-infra03:53
*** felipemonteiro_ has joined #openstack-infra03:55
*** coolsvap has joined #openstack-infra03:57
dmsimardianw: WIP03:57
*** edmondsw has quit IRC03:57
dmsimardianw: see https://review.openstack.org/#/c/511948/ and https://review.openstack.org/#/c/511821/03:57
dmsimard(and the child of https://review.openstack.org/#/c/511821/)03:58
ianw++ that's exactly what i wanted :)03:59
dmsimardianw: it doesn't give the cloud provider, maybe we can add that as follow up.03:59
*** udesale has joined #openstack-infra03:59
ianwhostname is enough to figure that out04:00
*** udesale has quit IRC04:00
*** udesale has joined #openstack-infra04:00
dmsimardianw: not in v3 anymore ? nodes can be given any name04:03
*** dhinesh has quit IRC04:04
*** slaweq has quit IRC04:05
clarkbthats an inventory thing but  hostname should be the nodepool name04:09
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Start ssh persistent connections outside bwrap  https://review.openstack.org/51215804:09
jeblairoy04:09
jeblairpabelanger, clarkb, ianw: ^ i *think* that will work.04:10
jeblairjlk, SpamapS, mordred: ^04:10
*** claudiub has joined #openstack-infra04:10
*** dhinesh has joined #openstack-infra04:10
jeblairthat has depleted my brain for the evening.  good night!04:11
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Start ssh persistent connections outside bwrap  https://review.openstack.org/51215804:12
*** felipemonteiro__ has joined #openstack-infra04:15
*** dhinesh has quit IRC04:15
openstackgerritJoshua Hesketh proposed openstack-infra/zuul feature/zuulv3: Disable sql_reporter alembic downgrades  https://review.openstack.org/51215904:16
openstackgerritJoshua Hesketh proposed openstack-infra/zuul feature/zuulv3: Clarify how the sql database is managed  https://review.openstack.org/51216004:16
openstackgerritJoshua Hesketh proposed openstack-infra/zuul feature/zuulv3: Move alembic_reporter back to alembic  https://review.openstack.org/51216104:16
openstackgerritJoshua Hesketh proposed openstack-infra/zuul feature/zuulv3: Allow the pool_recycle to be configured  https://review.openstack.org/51216204:16
dmsimardianw: I added some numbers for perspective on sqlite middleware: https://review.openstack.org/#/c/511992/04:16
dmsimardbed time for me, catch ya later04:17
*** slaweq has joined #openstack-infra04:17
*** felipemonteiro_ has quit IRC04:18
*** huanxie has quit IRC04:18
openstackgerritDuong Ha-Quang proposed openstack-infra/openstack-zuul-jobs master: Remove legacy jobs in Freezer  https://review.openstack.org/51115104:18
openstackgerritDuong Ha-Quang proposed openstack-infra/project-config master: Remove legacy jobs in Freezer  https://review.openstack.org/51115204:18
*** huanxie has joined #openstack-infra04:24
*** hongbin_ has quit IRC04:24
*** felipemonteiro__ has quit IRC04:27
SamYapleis there any way to get the post logs from a commit after its merged (not in real time)04:28
openstackgerritJoshua Hesketh proposed openstack-infra/zuul feature/zuulv3: Clarify how the sql database is managed  https://review.openstack.org/51216004:32
openstackgerritJoshua Hesketh proposed openstack-infra/zuul feature/zuulv3: Move alembic_reporter back to alembic  https://review.openstack.org/51216104:32
openstackgerritJoshua Hesketh proposed openstack-infra/zuul feature/zuulv3: Allow the pool_recycle to be configured  https://review.openstack.org/51216204:32
clarkbSamYaple: http://logs.openstack.org/firsttwoofsha1/sha104:33
SamYapleclarkb: and the sha1 is the git commit id as seen by the "commit" field on the review?04:35
clarkbI think it may be the merge commit if one was generated04:37
clarkbbut otherwise yes04:38
*** jascott1 has quit IRC04:41
SamYapleperfect. i can figure it out from one of those. thanks!04:42
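The URL layout described above (the first two characters of the sha1, then the full sha1) can be written as a small helper. This is a sketch under that description only; the function name is hypothetical, and as noted the sha1 may need to be the merge commit rather than the one shown on the review.

```python
LOGS_ROOT = "http://logs.openstack.org"


def post_log_url(sha1):
    """Build the post-pipeline log directory URL for a commit sha1."""
    sha1 = sha1.strip().lower()
    if len(sha1) != 40 or any(c not in "0123456789abcdef" for c in sha1):
        raise ValueError("expected a full 40-character sha1: %r" % sha1)
    # Layout: <root>/<first two characters of sha1>/<full sha1>
    return "%s/%s/%s" % (LOGS_ROOT, sha1[:2], sha1)


if __name__ == "__main__":
    print(post_log_url("1d7787eff61c898051833a14a4f59dd88a1c3ab9"))
```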
*** Dinesh_Bhor has joined #openstack-infra04:42
SamYaplemy first dockerhub-pushing POST job is running. wish me luck04:45
cloudnullpabelanger: IDK if you had already found this but we use the following SSH tweaks within OSA https://github.com/openstack/openstack-ansible/blob/master/group_vars/all/all.yml#L107-L11704:47
*** slaweq has quit IRC04:48
*** sree has joined #openstack-infra04:48
*** armaan has joined #openstack-infra04:50
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Import legacy playbooks  https://review.openstack.org/51216404:52
ykarelHi, is there a way to reuse the values from an argument of the base job in zuulv3? For example, i want to reuse irrelevant-files from the base job and add additional ones in child jobs.04:53
*** kiennt26 has quit IRC04:54
openstackgerritIan Wienand proposed openstack-infra/openstack-zuul-jobs master: Remove dib legacy playbooks  https://review.openstack.org/51216604:54
openstackgerritJoshua Hesketh proposed openstack-infra/zuul feature/zuulv3: Allow the pool_recycle to be configured  https://review.openstack.org/51216204:55
openstackgerritIan Wienand proposed openstack-infra/project-config master: Remove diskimage-builder functional test jobs  https://review.openstack.org/51216904:57
*** links has joined #openstack-infra04:57
*** markvoelker has quit IRC04:58
*** markvoelker has joined #openstack-infra04:58
openstackgerritIan Wienand proposed openstack-infra/openstack-zuul-jobs master: Remove dib legacy playbooks  https://review.openstack.org/51216604:58
*** jascott1 has joined #openstack-infra04:59
*** slaweq has joined #openstack-infra04:59
openstackgerritIan Wienand proposed openstack-infra/project-config master: Move bashate jobs in-tree  https://review.openstack.org/51211805:00
ykarelHi Team, one more: why didn't the check and gate jobs defined in .zuul.yml in the magnum repo run on the https://review.openstack.org/#/c/488581/ (stable/newton) patch? can someone guide where to look05:00
ykarelwill jobs defined in the master branch not work for newton?05:01
SamYaplehmmm. it seems my secret is not accessible via the post job http://logs.openstack.org/1d/1d7787eff61c898051833a14a4f59dd88a1c3ab9/post/publish-loci-keystone/c5d4128/job-output.txt.gz#_2017-10-16_04_52_26_26286105:01
SamYapleit is declared here https://github.com/openstack/loci/blob/master/.zuul.yaml#L17405:01
SamYapleand used here https://github.com/openstack/loci/blob/master/playbooks/post.yaml#L3805:01
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Remove legacy-windmill jobs  https://review.openstack.org/51211005:02
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: move puppet jobs to tripleo queue  https://review.openstack.org/51214205:03
SamYapleis the secret not available in the post-run part of the job?05:03
*** armaan has quit IRC05:05
*** armax has quit IRC05:07
*** armax has joined #openstack-infra05:07
*** armax has quit IRC05:07
*** armax has joined #openstack-infra05:08
*** armax has quit IRC05:08
*** gongysh has quit IRC05:09
*** armax has joined #openstack-infra05:09
*** armax has quit IRC05:09
*** armax has joined #openstack-infra05:10
*** armax has quit IRC05:10
*** dhajare has joined #openstack-infra05:10
openstackgerritIan Wienand proposed openstack-infra/project-config master: Remove diskimage-builder functional test jobs  https://review.openstack.org/51216905:10
*** armax has joined #openstack-infra05:10
*** armax has quit IRC05:11
*** dhinesh has joined #openstack-infra05:11
*** armax has joined #openstack-infra05:11
*** armax has quit IRC05:11
*** thorst has joined #openstack-infra05:12
*** armax has joined #openstack-infra05:12
*** armax has quit IRC05:12
*** armax has joined #openstack-infra05:13
*** armax has quit IRC05:13
*** armax has joined #openstack-infra05:14
*** armax has quit IRC05:14
*** armax has joined #openstack-infra05:14
*** armax has quit IRC05:15
*** dhinesh has quit IRC05:15
*** thorst has quit IRC05:16
snapiriHi Infra, I have an issue with different builds running on zuul. When the build runs on a machine with a specific filesystem, the build fails with zuul-cloner issues (e.g. http://logs.openstack.org/96/512096/1/check/legacy-dragonflow-dsvm-fullstack/5670008/job-output.txt.gz#_2017-10-15_20_51_17_069037). When the filesystem is different (I believe it is a different hypervisor) it succeeds and passes the zuul-cloner stage with no issue.05:24
snapiriWhen the filesystem is as follows (http://logs.openstack.org/96/512096/1/check/legacy-dragonflow-dsvm-rally/fe8384c/logs/df.txt.gz) it seems to pass the zuul-cloner stage with no problem05:25
ianwsnapiri: i think this is more likely related to http://logs.openstack.org/96/512096/1/check/legacy-dragonflow-dsvm-fullstack/5670008/job-output.txt.gz#_2017-10-15_20_51_12_60463105:27
*** gildub has joined #openstack-infra05:27
ianwyou're running that as "stack" but those repos will have been cloned by zuul user05:27
ianwdoes it really need to be stack?05:28
ianwas a first step, you can probably try replacing that with $USER05:28
snapiriianw: thanks, will try05:28
*** huanxie has quit IRC05:32
*** slaweq has quit IRC05:32
*** slaweq has joined #openstack-infra05:34
*** ramishra has joined #openstack-infra05:35
*** kaisers1 has joined #openstack-infra05:36
*** gongysh has joined #openstack-infra05:37
openstackgerritzhangfei gao proposed openstack/diskimage-builder master: Fix grub2 dependency on arm64  https://review.openstack.org/51217405:40
openstackgerritzhangfei gao proposed openstack/diskimage-builder master: Add debian minimal requirement for arm64  https://review.openstack.org/51217505:40
*** edmondsw has joined #openstack-infra05:41
*** edmondsw has quit IRC05:45
*** markvoelker has quit IRC05:46
*** armaan has joined #openstack-infra05:47
snapiriianw: will that not be a problem as the /opt/stack is owned by stack... see http://logs.openstack.org/96/512096/1/check/legacy-dragonflow-dsvm-fullstack/5670008/job-output.txt.gz#_2017-10-15_20_22_56_26003505:53
snapirishouldn't everything run as the user stack?05:54
*** huanxie has joined #openstack-infra06:01
*** slaweq has quit IRC06:07
*** Guest5873 has quit IRC06:08
*** pgadiya has joined #openstack-infra06:09
*** pgadiya has quit IRC06:09
*** slaweq has joined #openstack-infra06:10
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Make statsd import failure raise exception  https://review.openstack.org/51197806:11
*** dhinesh has joined #openstack-infra06:12
*** dhinesh has quit IRC06:16
*** yamahata has joined #openstack-infra06:22
*** martinkopec has joined #openstack-infra06:23
*** larainema has quit IRC06:24
*** electrical has quit IRC06:24
*** simonmcc has quit IRC06:24
*** berendt has quit IRC06:24
*** serverascode has quit IRC06:24
*** ameade has quit IRC06:24
*** cargonza has quit IRC06:24
*** tomhambleton_ has quit IRC06:24
*** NobodyCam has quit IRC06:24
*** zhongjun has quit IRC06:24
*** zul has quit IRC06:24
*** betherly has quit IRC06:24
*** clayg has quit IRC06:24
*** hrybacki has quit IRC06:24
*** sweston has quit IRC06:24
*** aarefiev has quit IRC06:24
*** csatari has quit IRC06:24
*** kmalloc has quit IRC06:24
*** hogepodge has quit IRC06:24
*** gcb has quit IRC06:24
*** ggherdov- has quit IRC06:24
*** DuncanT has quit IRC06:25
*** TheJulia has quit IRC06:25
*** zxiiro has quit IRC06:25
*** samueldmq has quit IRC06:25
*** fenar has quit IRC06:25
*** vdrok has quit IRC06:25
*** gmann has quit IRC06:25
*** jamespage has quit IRC06:25
*** fmccrthy has quit IRC06:25
*** srwilkers has quit IRC06:25
*** evgenyl has quit IRC06:25
*** mrmartin has quit IRC06:25
*** fmccrthy has joined #openstack-infra06:26
*** srwilkers has joined #openstack-infra06:26
*** clayg has joined #openstack-infra06:26
*** NobodyCam has joined #openstack-infra06:26
*** evgenyl has joined #openstack-infra06:26
*** kmalloc has joined #openstack-infra06:26
*** vdrok has joined #openstack-infra06:26
*** berendt has joined #openstack-infra06:26
*** jamespage has joined #openstack-infra06:26
*** samueldmq has joined #openstack-infra06:26
*** sweston has joined #openstack-infra06:26
*** hogepodge has joined #openstack-infra06:26
*** ameade has joined #openstack-infra06:26
*** cargonza has joined #openstack-infra06:26
*** csatari has joined #openstack-infra06:26
*** ryotagami has quit IRC06:26
*** hrybacki has joined #openstack-infra06:26
*** serverascode has joined #openstack-infra06:26
*** calbers has quit IRC06:26
*** ying_zuo has quit IRC06:26
*** electrical has joined #openstack-infra06:26
*** fenar has joined #openstack-infra06:27
*** DuncanT has joined #openstack-infra06:27
*** simonmcc has joined #openstack-infra06:27
*** aarefiev has joined #openstack-infra06:27
*** TheJulia has joined #openstack-infra06:27
*** betherly has joined #openstack-infra06:27
*** zxiiro has joined #openstack-infra06:27
*** openstackgerrit has quit IRC06:28
*** zoli has quit IRC06:28
*** Jeffrey4l has quit IRC06:28
*** gcb has joined #openstack-infra06:28
*** calbers has joined #openstack-infra06:29
*** ryotagami has joined #openstack-infra06:29
*** Jeffrey4l has joined #openstack-infra06:29
*** psachin has quit IRC06:30
*** zoli has joined #openstack-infra06:31
*** kiennt26 has joined #openstack-infra06:36
*** openstackgerrit has joined #openstack-infra06:36
openstackgerritNam Nguyen Hoai proposed openstack-infra/project-config master: Remove legacy jobs from Rally  https://review.openstack.org/51218206:36
*** esberglu has joined #openstack-infra06:38
*** priya_ has joined #openstack-infra06:38
*** ccamacho has joined #openstack-infra06:40
*** slaweq has quit IRC06:42
*** kjackal_ has joined #openstack-infra06:43
*** esberglu has quit IRC06:43
*** slaweq has joined #openstack-infra06:45
*** psachin has joined #openstack-infra06:46
*** psachin has joined #openstack-infra06:46
*** liusheng has quit IRC06:51
*** liusheng has joined #openstack-infra06:52
ianwsnapiri: going to have to think about it, let me add something to the issues page06:52
*** gongysh has quit IRC06:54
*** gongysh has joined #openstack-infra06:56
ianwsee https://etherpad.openstack.org/p/zuulv3-issues06:56
*** gridinv has quit IRC06:56
*** ykarel_ has joined #openstack-infra06:57
*** mrmartin has joined #openstack-infra06:57
*** s-shiono has joined #openstack-infra06:57
priya_hi, could somebody review the commit https://review.openstack.org/#/c/508533/06:58
*** pgadiya has joined #openstack-infra06:58
*** liusheng has quit IRC06:59
*** liusheng has joined #openstack-infra06:59
*** ykarel has quit IRC06:59
openstackgerritNam Nguyen Hoai proposed openstack-infra/openstack-zuul-jobs master: Remove Rally legacy jobs  https://review.openstack.org/51218607:00
ianwpriya_: reviewers are unlikely to see things here, try #openstack-nova07:01
*** claudiub has quit IRC07:01
*** markvoelker has joined #openstack-infra07:02
openstackgerritIan Wienand proposed openstack-infra/project-config master: Remove diskimage-builder functional test jobs  https://review.openstack.org/51216907:03
*** priya_ has quit IRC07:03
*** florianf has joined #openstack-infra07:07
snapiriianw: thanks07:07
*** jtomasek has joined #openstack-infra07:10
*** jtomasek has quit IRC07:11
*** jtomasek has joined #openstack-infra07:11
*** gildub has quit IRC07:14
*** claudiub has joined #openstack-infra07:14
*** Hal has joined #openstack-infra07:17
*** Hal is now known as Guest7120107:18
*** tesseract has joined #openstack-infra07:18
*** markvoelker has quit IRC07:18
*** shardy has joined #openstack-infra07:18
*** slaweq has quit IRC07:19
*** markvoelker has joined #openstack-infra07:20
*** markvoelker has quit IRC07:20
*** markvoelker has joined #openstack-infra07:20
*** gongysh has quit IRC07:21
*** gongysh has joined #openstack-infra07:23
openstackgerritIan Wienand proposed openstack-infra/zuul-jobs master: zuul-cloner-shim: don't link if owners different  https://review.openstack.org/51219407:23
ianwsnapiri: i think ^ is the solution, maybe.  I don't have time to test it fully right now, but watch out and others can post their ideas on the review/etherpad07:23
*** slaweq has joined #openstack-infra07:23
*** edmondsw has joined #openstack-infra07:29
*** yamahata has quit IRC07:30
snapiriianw: looks like this should work :)07:30
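Note on the hard-link failure discussed above: on Linux, the `fs.protected_hardlinks` setting generally stops a user from hard-linking files they do not own unless they can both read and write them, which would explain why a clone run as `stack` fails against repos created by the `zuul` user. The "don't link if owners different" idea can be sketched in Python as below. This is only an illustration under that assumption, not the actual zuul-cloner-shim code from review 512194; `link_or_copy` is a hypothetical helper name.

```python
import os
import shutil


def link_or_copy(src, dst, uid=None):
    """Hard-link src to dst when src is owned by uid, otherwise copy it.

    uid defaults to the effective user of the current process.
    Returns "link" or "copy" so callers can see which path was taken.
    (Hypothetical helper for illustration only.)
    """
    if uid is None:
        uid = os.geteuid()
    if os.stat(src).st_uid == uid:
        # Same owner: a hard link is cheap and allowed.
        os.link(src, dst)
        return "link"
    # Different owner: linking may be refused, so fall back to a real copy.
    shutil.copy2(src, dst)
    return "copy"
```

The trade-off is the one mordred mentions later in the log: falling back to a copy is safe but costs more I/O than a link.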
*** toabctl has joined #openstack-infra07:32
*** efried has quit IRC07:32
*** edmondsw has quit IRC07:33
openstackgerritMerged openstack-infra/project-config master: Move bashate jobs in-tree  https://review.openstack.org/51211807:35
*** yamamoto_ has quit IRC07:35
*** xinliang has joined #openstack-infra07:37
*** xinliang has quit IRC07:37
*** xinliang has joined #openstack-infra07:37
openstackgerritNam Nguyen Hoai proposed openstack-infra/project-config master: Remove legacy jobs from Networking-cisco  https://review.openstack.org/51219707:38
openstackgerritNuman Siddique proposed openstack-infra/openstack-zuul-jobs master: Add files section in the legact tripleo ci scenario007 jobs  https://review.openstack.org/51200707:38
*** jpich has joined #openstack-infra07:38
openstackgerritMerged openstack-infra/project-config master: Create tripleo change queue for gate pipeline  https://review.openstack.org/51214007:39
snapiriianw: so I am a bit confused, should I continue to use the zuul-cloner or not? In the migration guide it says to stop using it. I understand that the required projects are cloned by zuul now, but I should install them, so if zuul brings them, will I still have the same issue when I try to install them as user stack?07:40
openstackgerritMerged openstack-infra/project-config master: Revert "neutron: Make grenade-neutron-dvr-multinode job non-voting"  https://review.openstack.org/50531807:41
openstackgerritMerged openstack-infra/project-config master: Adding check/publish jobs for contributor guide  https://review.openstack.org/51194607:41
*** slaweq_ has joined #openstack-infra07:44
*** efried has joined #openstack-infra07:44
*** egonzalez has joined #openstack-infra07:47
*** jascott1 has quit IRC07:47
*** jpena|off is now known as jpena07:48
*** jascott1 has joined #openstack-infra07:48
*** jascott1 has quit IRC07:52
*** bobh has joined #openstack-infra07:54
*** slaweq has quit IRC07:57
*** bobh has quit IRC07:58
*** slaweq has joined #openstack-infra08:01
*** d0ugal_ has quit IRC08:01
*** d0ugal has joined #openstack-infra08:01
*** d0ugal has quit IRC08:01
*** d0ugal has joined #openstack-infra08:01
*** cuongnv_ has joined #openstack-infra08:07
*** namnh_ has joined #openstack-infra08:07
*** cuongnv has quit IRC08:07
*** namnh has quit IRC08:08
*** armaan has quit IRC08:08
*** armaan has joined #openstack-infra08:08
*** panda|rover|off has quit IRC08:10
SamYaplehmmm. im not able to access my secret. i defined the secret in .zuul.yaml and try to access it in a playbook and it is not defined. any help here?08:12
SamYapledoes the secret need to be defined in project-config? and the job? or what? im confused08:12
*** derekh has joined #openstack-infra08:13
SamYaplecan secrets be used in untrusted playbooks?08:19
SamYapleit appears not08:19
*** dbecker has joined #openstack-infra08:19
*** lucas-afk is now known as lucasagomes08:21
*** AJaeger has joined #openstack-infra08:21
*** tomhambleton_ has joined #openstack-infra08:22
*** gmann has joined #openstack-infra08:22
*** larainema has joined #openstack-infra08:22
*** zhongjun has joined #openstack-infra08:22
AJaegerSamYaple: project-config is a trusted repository and secrets are only available there. Best discuss with rest of the team later, my understanding is that your post job needs to get defined in project-config.08:22
*** ggherdov- has joined #openstack-infra08:22
*** zul has joined #openstack-infra08:22
SamYapleAJaeger: thats contrary to the zuul docs which say secrets can be defined and used in untrusted jobs08:25
SamYaple(if the secrets are defined in the untrusted repository)08:25
SamYaplei think i found the missing bit that i needed though https://docs.openstack.org/infra/zuul/feature/zuulv3/user/config.html#attr-pipeline.post-review08:26
*** esberglu has joined #openstack-infra08:26
*** ying_zuo has joined #openstack-infra08:26
SamYaplemaybe not. it is really late for me08:27
*** hashar has joined #openstack-infra08:27
*** s-shiono has quit IRC08:27
*** armaan has quit IRC08:28
*** armaan has joined #openstack-infra08:28
SamYapleohhh i think i see it "It is possible to use secrets for jobs defined in config projects as well as untrusted projects,"08:28
SamYaplei just need to define the secret in project-config08:28
SamYaplei can consume it in my untrusted job *if* the pipeline has post-review=true08:29
SamYaplesuch as the post pipeline08:29
SamYapleif that is true, that's weird. i can see no security benefit from defining a secret in project-config vs the untrusted repository08:30
*** esberglu has quit IRC08:30
*** markvoelker has quit IRC08:31
*** slaweq has quit IRC08:34
*** yamamoto has joined #openstack-infra08:36
*** slaweq has joined #openstack-infra08:36
AJaegerSamYaple: I'm not the expert yet on this, best to ask jeblair later today08:36
*** mpeterson has joined #openstack-infra08:36
SamYapleindeed. still, thanks for the help08:37
*** pgadiya has quit IRC08:40
*** yamamoto has quit IRC08:42
*** ccamacho1 has joined #openstack-infra08:45
*** ccamacho has quit IRC08:45
*** markvoelker has joined #openstack-infra08:45
*** ykarel_ is now known as ykarel08:46
*** ccamacho1 has quit IRC08:48
*** ccamacho has joined #openstack-infra08:48
kazshAJaeger: thanks for your +2 on https://review.openstack.org/#/c/509119/  Should I ask someone to review this or wait for a while ?08:49
*** ralonsoh has joined #openstack-infra08:52
*** bobh has joined #openstack-infra08:55
*** bobh has quit IRC08:59
*** tosky has joined #openstack-infra08:59
*** ykarel is now known as ykarel|lunch09:00
*** yamamoto has joined #openstack-infra09:04
*** kashyap has joined #openstack-infra09:04
*** e0ne has joined #openstack-infra09:08
*** slaweq has quit IRC09:08
*** yamamoto has quit IRC09:09
kashyapHow can one do a 'recheck' on a Zuul v3 job?09:09
toskykashyap: I don't think that it changed09:10
kashyaptosky: So a 'recheck' should Just Work?09:10
toskykashyap: yes09:10
kashyapThanks, asking for a colleague09:11
*** slaweq has joined #openstack-infra09:13
*** yamamoto has joined #openstack-infra09:13
AJaegerkazsh: just wait - we're all busy with Zuul v3 migration, it will get reviewed...09:19
kazshAJarger: Noted with thanks!09:20
AJaegeryolanda: do you want to put 509119 on your review queue?09:20
*** esberglu has joined #openstack-infra09:20
*** esberglu has quit IRC09:20
*** esberglu has joined #openstack-infra09:20
*** esberglu has quit IRC09:21
*** askb has quit IRC09:21
kazshOops sorry s/AJarger/AJaeger09:21
openstackgerritTetsuro Nakamura proposed openstack-infra/project-config master: Add new project networking-spp  https://review.openstack.org/51222309:22
*** sambetts|afk is now known as sambetts09:27
sambettsAJaeger: thank you for adding the networking-cisco projects :D09:27
andreafclarkb, jeblair: ok thanks for the follow up devstack change09:29
*** kjackal_ has quit IRC09:30
andreafclarkb, jeblair: since now the job definition is spread across a mix of branchless and branched repos, we may have to start backporting jobs parts every now and then09:30
andreafclarkb, jeblair: for the devstack job/roles we may want to wait until it's ~complete and then backport the whole ansible bits + .zuul.yaml at once?09:32
*** dhajare has quit IRC09:37
openstackgerritNam Nguyen Hoai proposed openstack-infra/openstack-zuul-jobs master: Remove Networking-cisco legacy jobs  https://review.openstack.org/51223509:38
openstackgerritMichel Peterson proposed openstack-infra/project-config master: networking-odl: Add required-projects for tox jobs  https://review.openstack.org/51223809:39
openstackgerritMichel Peterson proposed openstack-infra/project-config master: networking-odl: Add required-projects for functional tests  https://review.openstack.org/51223809:42
*** e0ne has quit IRC09:45
*** askb has joined #openstack-infra09:45
*** slaweq has quit IRC09:46
*** kota_ has joined #openstack-infra09:47
kota_hello folks. Could I confirm what's happening with the zuul v3 migration on gerrit?09:48
kota_it looks like the gate jobs were renamed with a 'legacy-' prefix in the commit https://github.com/openstack-infra/project-config/commit/e87b20f719f691f61e35be1054ccb4fbcf65e43309:49
kota_but it seems to cause gate failure in our project09:49
kota_e.g. https://review.openstack.org/#/c/483798/09:49
*** jascott1 has joined #openstack-infra09:50
*** kiennt26 has quit IRC09:50
kota_I'd like to know how I can resolve the failure and fix the gate jobs.09:50
kota_perhaps, just fix the gate job name to be same with jenkins jobs?09:51
*** ykarel|lunch is now known as ykarel09:52
*** Shrews has quit IRC09:53
*** slaweq has joined #openstack-infra09:54
*** Shrews has joined #openstack-infra09:55
toskykota_: there were few emails about this topic, the main suggestion is read this: https://docs.openstack.org/infra/manual/zuulv3.html09:55
kota_tosky: thx09:55
*** bobh has joined #openstack-infra09:55
openstackgerritBernard Cafarelli proposed openstack-infra/project-config master: Fix build-openstack-sphinx-docs for networking-sfc  https://review.openstack.org/51224409:55
toskywell, not "few" emails, but many emails09:55
openstackgerritSam Betts proposed openstack-infra/devstack-gate master: Allow EOL branches to use the test-matrix.py script  https://review.openstack.org/51094609:56
openstackgerritSam Betts proposed openstack-infra/devstack-gate master: Add ability to override project branch to a tag  https://review.openstack.org/51224509:56
AJaegerandreaf: you can backport .zuul.yaml if you want different config on different branches. But you don't have to. Your call - best to discuss later...09:56
andreafAJaeger: uhm ok - so zuul will always take it from master?09:57
andreafAJaeger: what about roles and playbooks? Are those taken from master as well? Even for jobs that run on stable branches?09:57
AJaegerandreaf: If you have .zuul.yaml on a branch, it uses that one. IF there's none, it takes the one from master09:57
AJaegerandreaf: best discuss with jeblair and mordred later... Just wanted to point out that you have options ;)09:57
andreafAJaeger: thanks :) the fallback on master is handy indeed09:58
*** derekh has quit IRC09:58
openstackgerritMichel Peterson proposed openstack-infra/project-config master: networking-odl: Add required-projects for functional tests  https://review.openstack.org/51223809:58
*** namnh_ has quit IRC09:58
kota_ah, ok. It looks like the legacy one is starting to work well; it just timed out because the job ran too long.09:58
*** Shrews has quit IRC09:59
*** derekh has joined #openstack-infra09:59
*** bobh has quit IRC10:00
*** kjackal_ has joined #openstack-infra10:00
*** Shrews has joined #openstack-infra10:01
*** LindaWang has quit IRC10:03
yolandaAJaeger, looking10:08
openstackgerritStephen Finucane proposed openstack-dev/pbr master: Rework features doc  https://review.openstack.org/51224810:08
openstackgerritStephen Finucane proposed openstack-dev/pbr master: doc: Minor rework of usage doc  https://review.openstack.org/51224910:08
openstackgerritYAMAMOTO Takashi proposed openstack-infra/project-config master: networking-midonet: Use publish-openstack-sphinx-docs-neutron  https://review.openstack.org/51225010:09
*** cuongnv_ has quit IRC10:10
*** esberglu has joined #openstack-infra10:14
*** derekh has quit IRC10:15
openstackgerritBernard Cafarelli proposed openstack-infra/project-config master: Use -neutron template variants for networking-sfc jobs  https://review.openstack.org/51224410:16
openstackgerritMerged openstack-infra/project-config master: Add heat-dashboard project  https://review.openstack.org/50911910:16
*** sdague has joined #openstack-infra10:17
*** esberglu has quit IRC10:18
*** derekh has joined #openstack-infra10:20
*** fanzhang has joined #openstack-infra10:21
*** pbourke has quit IRC10:23
openstackgerritMichel Peterson proposed openstack-infra/openstack-zuul-jobs master: networking-odl: Add required-projects for functional tests  https://review.openstack.org/51225410:24
*** pbourke has joined #openstack-infra10:25
openstackgerritMichel Peterson proposed openstack-infra/project-config master: networking-odl: Add required-projects for functional tests  https://review.openstack.org/51223810:25
*** psachin has quit IRC10:31
*** LindaWang has joined #openstack-infra10:31
*** openstackgerrit has quit IRC10:33
*** ldnunes has joined #openstack-infra10:37
*** daidv has quit IRC10:38
*** rbergeron has joined #openstack-infra10:38
*** openstackgerrit has joined #openstack-infra10:43
openstackgerritAntoine Musso proposed openstack/diskimage-builder master: dib-lint: test elements have README.rst file  https://review.openstack.org/17783210:43
*** armaan has quit IRC10:43
*** armaan has joined #openstack-infra10:44
openstackgerritYAMAMOTO Takashi proposed openstack-infra/project-config master: networking-midonet: Use release-notes-jobs-neutron  https://review.openstack.org/51226010:54
openstackgerritYAMAMOTO Takashi proposed openstack-infra/project-config master: networking-midonet: publish-to-pypi-neutron  https://review.openstack.org/51226110:54
*** udesale has quit IRC10:54
*** udesale has joined #openstack-infra10:54
openstackgerritYAMAMOTO Takashi proposed openstack-infra/project-config master: networking-midonet: Use publish-to-pypi-neutron  https://review.openstack.org/51226110:55
*** bobh has joined #openstack-infra10:56
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Document base job change procedure  https://review.openstack.org/51213810:58
openstackgerritYAMAMOTO Takashi proposed openstack-infra/project-config master: networking-midonet: Use publish-to-pypi-neutron  https://review.openstack.org/51226110:59
*** bobh has quit IRC11:01
ykarel<AJaeger> andreaf: If you have .zuul.yaml on a branch, it uses that one. IF there's none, it takes the one from master. AJaeger, any idea why jobs defined in master are not executed in stable branches? For example: https://review.openstack.org/#/c/512172/11:03
*** dtantsur|afk is now known as dtantsur11:04
AJaegerykarel: I don't understand your problem on that one - the jobs run, didn't they?11:05
*** edmondsw has joined #openstack-infra11:05
ykarelAJaeger, no, jobs defined in .zuul.yaml in master are not running in stable branches11:05
AJaegerykarel: what did you expect - and didn't get exactly?11:05
AJaegerykarel: example, please11:05
ykarelAJaeger, https://review.openstack.org/#/c/512172/ stable pike patch11:06
ykareljobs defined at https://github.com/openstack/magnum/blob/master/.zuul.yaml#L218-L225 didn't run11:06
pabelangermorning11:07
*** udesale has quit IRC11:07
dmelladoAJaeger: pabelanger I've a question, what to do with a patch that still has both zuul and jenkins11:07
AJaegermorning, pabelanger11:07
dmelladoand when jenkins is blocking zuul11:07
dmelladohttps://review.openstack.org/#/c/510157/11:07
dmellado^^11:07
dmelladoI submitted a patch for removing legacy jenkins jobs11:07
*** rook-afk is now known as rook11:07
dmelladobut landed after the first jenkins -111:07
dmelladoshould I abandon+recover?11:07
AJaegerdmellado: you should be able to merge this without problems11:08
dmelladoAJaeger: so jenkins wouldn't matter?11:08
AJaegerdmellado: yes, it wouldn't matter11:08
dmelladoack, thanks andreas11:08
AJaegerykarel: this needs some debugging, let's add to the etherpad. Will you do?11:08
pabelangerdmellado: yah, what AJaeger said11:08
ykarelAJaeger, Ok i can11:09
dmelladothanks too, pabelanger, btw, hope you're done with the moving by now!11:09
*** edmondsw has quit IRC11:09
*** dave-mccowan has joined #openstack-infra11:09
pabelangerdmellado: yah, we are in a cottage on Lake Huron (great lakes) for the next 2 months. We are still looking for rental, but we've moved to the correct town11:10
*** Shrews has quit IRC11:10
pabelangerdmellado: we had a massive wind storm roll through yesterday, it was awesome watching the waves on the Lake11:11
AJaegerykarel: put it under "to triage", please11:11
*** armaan has quit IRC11:11
dmelladopabelanger: that looks awesome, take some pics! :)11:11
*** armaan has joined #openstack-infra11:11
ykarelAJaeger, ok updating https://etherpad.openstack.org/p/zuulv3-issues11:11
pabelangerAJaeger: just looking at backscroll, we appear to be in good shape this morning11:12
*** Shrews has joined #openstack-infra11:12
*** armaan has quit IRC11:12
*** armaan has joined #openstack-infra11:13
openstackgerritSam Betts proposed openstack-infra/devstack-gate master: Add ability to override project branch to a tag  https://review.openstack.org/51224511:14
*** smatzek has joined #openstack-infra11:14
AJaegerpabelanger: we do basically. All proposal jobs are broken (see line 90 in etherpad), publish-api-ref is broken (line 78) and the post upstream-translation job just gives an error.11:15
AJaegermordred: ^11:15
AJaegerbut regular ones look okish - and there are a couple of changes to add required-repos if you want to +2A a few...11:15
*** jkilpatr_ has joined #openstack-infra11:16
dmelladoI was checking some issues that showed up on some lbaas/octavia gates11:16
dmelladobut that might be just related to octavia itself and not zuul11:16
*** masber has quit IRC11:16
dmelladoit seems that octavia's no longer being installed11:16
dmelladohttp://logs.openstack.org/57/510157/2/check/tempest-kuryr-kubernetes-octavia/2705827/11:16
dmelladoin case you'd like to have a look11:17
pabelangerdmellado: if you think there is an issue, best to add it to https://etherpad.openstack.org/p/zuulv3-issues so we can properly triage it11:18
dmelladopabelanger: I'm checking locally and I'll add it there if it works from my side, thanks11:18
openstackgerritMichel Peterson proposed openstack-infra/openstack-zuul-jobs master: networking-odl: Add required-projects for functional tests  https://review.openstack.org/51225411:20
openstackgerritMichel Peterson proposed openstack-infra/project-config master: networking-odl: Add required-projects missing  https://review.openstack.org/51223811:20
AJaegerdmellado: I see "sudo stack" - there's something for that already on the etherpad11:20
*** shardy is now known as shardy_lunch11:20
dmelladoAJaeger: oh, I see11:21
dmelladothanks!11:21
*** ociuhandu has quit IRC11:22
ykarelAJaeger, updated the etherpad11:23
AJaegerthanks, ykarel11:23
cmurphyseeing a POST_FAILURE here https://review.openstack.org/#/c/508411/, is that something i should add to the zuulv3-issues etherpad? or just ignore and try again?11:23
*** pblaho has joined #openstack-infra11:25
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: Fix syntax errors with set-service-type-data-fact role  https://review.openstack.org/51226711:26
pabelangerrsync: recv_generator: mkdir "/var/lib/zuul/builds/5a11ee7f05e64391806dcafb1afdfe3f/work/logs/logs" failed: No space left on device (28)11:26
pabelangercmurphy: ^ yes, please add to etherpad11:26
pabelangerlooks like executor has no more disk11:26
*** Shrews has quit IRC11:27
AJaegerargh ;(11:27
cmurphypabelanger: kk11:27
pabelangerAJaeger: ^will also fix api-ref jobs11:27
pabelangerlooking at HDD issues on ze03.o.o now11:28
AJaegerpabelanger: thanks!11:28
*** Shrews has joined #openstack-infra11:29
pabelanger/dev/xvde2       72G   34G   34G  51% /var/lib/zuul11:29
*** slaweq has quit IRC11:29
pabelangerthat is ze03.o.o currently11:29
AJaegerinodes?11:29
pabelanger/dev/xvde2       4.6M  3.8M  799K   83% /var/lib/zuul11:29
pabelangeryah, that is more likely11:29
*** Swimingly has quit IRC11:30
pabelangerAJaeger: it is likely number of concurrent jobs on the executor11:30
pabelangertrying to see if I can calculate that11:30
*** huanxie has quit IRC11:30
pabelangerYah, I can see in logs it has happened a few times on ze03.o.o11:32
pabelanger2017-10-16 09:01:39,787 ERROR zuul.AnsibleJob: [build: fa8098c3ef55417884306eeeb5649806] Unable to initialize repo for ssh://zuul@review.openstack.org:29418/openstack/horizon11:32
pabelangerthat is because we ran out of disk space11:32
pabelangeror inodes11:33
*** tpsilva has joined #openstack-infra11:33
pabelangerthis is likely the cause of the spike in merge_failure we are seeing too11:34
AJaegercan we unmount and add more inodes?11:35
*** nicolasbock has joined #openstack-infra11:36
AJaegerpabelanger: could you check whether this is specific to ze03 or happens on others as well, please?11:36
AJaegerpabelanger: I suggest to wait until others are around and discuss options - and collect more information for now.11:37
pabelangerAJaeger: we'd have to stop the executor first, but yah, it should be possible11:37
pabelangerAJaeger: Yah, collecting info now and adding to zuulv3-issues11:37
*** slaweq has joined #openstack-infra11:38
*** yamamoto has quit IRC11:38
*** Swimingly has joined #openstack-infra11:39
*** Swimingly has quit IRC11:39
*** Swimingly has joined #openstack-infra11:39
*** rhallisey has joined #openstack-infra11:39
*** lucasagomes is now known as lucas-hungry11:39
AJaegerpabelanger: do we monitor inode and disk usage? What about a simple loop that watches df -i and df on that filesystem once a minute? That way we can correlate back to the next failure where space was needed11:41
pabelangerAJaeger: we should have disk space in cacti11:41
pabelangerAJaeger: and I think dmsimard is working on a patch for inodes in cacti too11:42
pabelangerAJaeger: okay, so far only ze03 and ze09 have run out of disk space this morning11:42
AJaegerpabelanger: indeed, disk space is there - but quite static. So, might be inodes11:43
AJaegerthanks, pabelanger.11:43
* AJaeger will be back online later11:43
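The once-a-minute `df` / `df -i` loop AJaeger suggests could be sketched roughly as follows in Python, using `os.statvfs` to read block and inode counts for a path. This is an illustrative sketch, not something the team ran: the function names are hypothetical, and the percentages are computed against total capacity, so they can differ slightly from the `Use%` column `df` prints.

```python
import os
import time


def percent_used(total, free):
    """Percentage of a resource in use; 0.0 when the total is unknown (0)."""
    if total == 0:
        return 0.0
    return 100.0 * (total - free) / total


def usage(path):
    """Return block and inode usage percentages for the filesystem at path."""
    st = os.statvfs(path)
    return {
        "blocks": percent_used(st.f_blocks, st.f_bfree),
        "inodes": percent_used(st.f_files, st.f_ffree),
    }


def watch(path, interval=60, iterations=None):
    """Print one timestamped usage line per interval (forever if iterations is None)."""
    count = 0
    while iterations is None or count < iterations:
        u = usage(path)
        stamp = time.strftime("%Y-%m-%d %H:%M:%S")
        print(f"{stamp} {path} blocks={u['blocks']:.1f}% inodes={u['inodes']:.1f}%")
        count += 1
        if iterations is None or count < iterations:
            time.sleep(interval)
```

Calling `watch("/var/lib/zuul")` on an executor would leave a timestamped trail that can be lined up against the next "No space left on device" error in the build logs.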
*** tmorin has joined #openstack-infra11:46
tmorinhi infra-root : we've legacy-networking-bgpvpn-dsvm-functional failing on "cannot create hard link" ... is there a known solution ? should I track that in storyboard ?11:47
pabelangertmorin: known issue, see https://etherpad.openstack.org/p/zuulv3-issues11:47
pabelangerthat is where we are tracking this currently11:47
*** udesale has joined #openstack-infra11:48
tmorinpabelanger: ok11:48
pabelangertmorin: which review are you getting the error on?11:49
tmorinpabelanger: 49421311:49
*** ociuhandu has joined #openstack-infra11:49
tmorinpabelanger: "sudo -H -u stack tox -e dsvm-functional" ...11:49
tmorinpabelanger: matches the description of what https://review.openstack.org/512194 tries to solve11:50
pabelangeryah11:50
pabelangerI'm going to try seeing if that fixes it11:50
*** udesale__ has joined #openstack-infra11:50
pabelangerremote:   https://review.openstack.org/512270 DNM - testing 51219411:51
*** udesale has quit IRC11:51
pabelangertmorin: ^lets see what happens11:51
*** thorst has joined #openstack-infra11:54
*** thorst has quit IRC11:54
*** thorst has joined #openstack-infra11:56
*** udesale__ has quit IRC11:56
pabelangerfungi: clarkb: unrelated, but what do you think about deleting devstack_cache (git repos) on nodepool-builders? We haven't done that in a while, and I see us still copying stable/newton things when setting up repos11:56
pabelangerwe talked about that a few weeks ago11:57
*** bobh has joined #openstack-infra11:57
*** yamamoto has joined #openstack-infra11:58
*** bobh has quit IRC11:58
*** bobh has joined #openstack-infra11:59
*** gcb has quit IRC12:01
*** jpena is now known as jpena|lunch12:04
*** mat128 has joined #openstack-infra12:07
*** huanxie has joined #openstack-infra12:08
*** trown|outtypewww is now known as trown12:09
*** edmondsw has joined #openstack-infra12:09
*** chlong has quit IRC12:10
*** stakeda has quit IRC12:10
*** mat128 has quit IRC12:10
*** mat128 has joined #openstack-infra12:11
*** slaweq has quit IRC12:11
*** Guest71201 has quit IRC12:14
*** sree has quit IRC12:16
*** sree has joined #openstack-infra12:16
*** gongysh has quit IRC12:17
*** tmorin has quit IRC12:18
*** slaweq has joined #openstack-infra12:18
*** tmorin has joined #openstack-infra12:18
*** sree has quit IRC12:20
*** yamamoto has quit IRC12:21
*** lucas-hungry is now known as lucasagomes12:23
*** dave-mccowan has quit IRC12:23
*** florianf has quit IRC12:23
*** florianf has joined #openstack-infra12:23
*** shardy_lunch is now known as shardy12:25
*** coolsvap has quit IRC12:26
*** rlandy has joined #openstack-infra12:27
*** bobh has quit IRC12:30
*** Hal has joined #openstack-infra12:31
*** Hal is now known as Guest7825512:32
*** bobh has joined #openstack-infra12:34
*** wolverineav has joined #openstack-infra12:35
mordredpabelanger: morning - purging cache sounds great12:35
*** dprince has joined #openstack-infra12:37
*** esberglu has joined #openstack-infra12:38
*** huanxie has quit IRC12:39
*** felipemonteiro has joined #openstack-infra12:39
*** felipemonteiro_ has joined #openstack-infra12:41
pabelangerkk, I'll do that now12:42
tmorinpabelanger: https://review.openstack.org/#/c/512270 => didn't work, zuul-cloner still trying to create hard links12:43
*** wolverineav has quit IRC12:43
pabelangertmorin: yah, i think we need to first land it12:43
tmorinpabelanger: yes, the Depends-On magic must have some limits I guess :)12:43
mordredpabelanger: that should work fine with depends-on12:43
pabelangercool, then in that case, something didn't work12:44
pabelangerneed to see why12:44
mordrednevermind12:44
mordrednope12:44
mordredI'm totally wrong12:44
mordredyou were right12:44
pabelanger:)12:44
mordredmust land first :)12:44
dmsimardpabelanger: I'm prioritizing decreasing the disk space and inode consumption of ara over adding them into cacti right now12:45
mordredpabelanger: also - I had an idea yesterday of just chmodding the repos to be world writable12:45
*** dave-mccowan has joined #openstack-infra12:45
*** felipemonteiro has quit IRC12:45
dmsimardif someone else wants to take a stab at inodes in cacti, feel free12:45
pabelangermordred: yah, maybe that is the easiest path forward12:45
*** nhicher has joined #openstack-infra12:46
mordredpabelanger: well - the ianw patch doesn't look bad - it'll just have more copying for folks ... gimme a sec though, lemme see about a quick patch12:46
*** ykarel is now known as ykarel|away12:47
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Make git repos writable so that hard links work  https://review.openstack.org/51228512:50
mordredpabelanger: ^^ something like that12:50
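The chmod idea above (and the patch title "Make git repos writable so that hard links work") can be illustrated with a minimal sketch. This is not the content of change 512285; the function name and the choice to add write bits for user, group and others are assumptions made for illustration. One plausible reason it helps is that Linux can refuse to hard-link files the calling user neither owns nor can read and write.

```python
import os
import stat


def make_tree_world_writable(root):
    """Add user/group/other write bits to every file and directory under root.

    Hypothetical helper for illustration only; returns how many paths changed.
    """
    write_bits = stat.S_IWUSR | stat.S_IWGRP | stat.S_IWOTH
    changed = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        # os.walk yields each directory as dirpath, so directories are covered too.
        paths = [dirpath] + [os.path.join(dirpath, name) for name in filenames]
        for path in paths:
            if os.path.islink(path):
                continue  # never chmod through a symlink
            mode = stat.S_IMODE(os.lstat(path).st_mode)
            if (mode & write_bits) != write_bits:
                os.chmod(path, mode | write_bits)
                changed += 1
    return changed
```

Running it a second time on the same tree should change nothing, which makes it safe to repeat in a pre-run step.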
dmsimardianw: replied to your comments on https://review.openstack.org/#/c/504789/12:50
flaper87hey folks, sorry if this is an already-answered question but, has the process for adding new repos changed in any way after the roll-out of zuul v3?12:51
*** slaweq has quit IRC12:51
mwhahahahey so i've noticed that zuul is overwriting the results in gerrit for tripleo (example https://review.openstack.org/#/c/512219/) the ovb results cleared out the normal check results12:51
dmsimardflaper87: gerrit, gerritbot and such are the same, what's different is the job configuration and layout12:51
*** ykarel|away has quit IRC12:51
flaper87dmsimard: gotcha, thank you12:51
mordredflaper87: the manual has been updated12:51
flaper87mordred: that's what I was hoping for, thanks12:52
flaper87I'll read the manual then12:52
* flaper87 hasn't done this in a while12:52
*** yamamoto has joined #openstack-infra12:52
dmsimardmwhahaha: hm, because you have jobs in check and check-tripleo pipeline I guess ?12:52
mwhahahayea12:52
dmsimardmordred: ^ that's likely a legit bug12:52
mordredflaper87: https://docs.openstack.org/infra/manual/creators.html - let us know if any of it sucks12:52
dmsimardadding to zuulv3 issues12:52
flaper87mordred: will do by proposing new patches if needed :)12:53
pabelangermordred: do we plan on using mirror-workspace-git-repos on non-legacy jobs? If so, maybe we should move it some other place. I can think of some places where we might not want that on non zuul-cloner jobs12:53
mordredpabelanger: ah. good call - we could do it in the copy shim role instead12:54
mordredflaper87: \o/12:54
*** armaan has quit IRC12:54
pabelangermordred: yah, shim role might be better12:55
*** armaan has joined #openstack-infra12:55
mordreddmsimard, mwhahaha: hrm. ok. so check-tripleo is just supposed to comment, not to vote, right?12:56
*** Shrews has quit IRC12:56
dmsimardmwhahaha: not sure what to say for the time being other than to be careful, it wasn't a known issue but I just added it to our tracker.12:56
mwhahahamordred: correct12:56
*** dbecker has quit IRC12:56
dmsimardmordred: It votes, I put two examples on the etherpad, in v2 both pipelines had distinct results12:56
dmsimardmwhahaha: it doesn't vote ??12:56
* dmsimard confused12:56
mwhahahathe ovb jobs didn't vote12:56
mordredit's not supposed to vote12:56
mordredit's supposed to just leave a comment12:57
mordred    success:12:57
mordred      gerrit: {}12:57
dmsimardmwhahaha: I don't see any 'non-voting' messages in the results of https://review.openstack.org/#/c/512219/12:57
dmsimardmwhahaha: other than some individual jobs12:57
mwhahahadmsimard: the check-tripleo ci doesn't Verified+112:58
mwhahahain the past that just ends up being a different block in the gerrit UI12:58
mordreddmsimard, mwhahaha: it looks like that's doing the right thing - the javascript to show the results in the table up top might not be doing the right thing12:58
mwhahahayea12:58
mordredyah - I think the bug is in presentation - not in the votes being wrong12:58
mwhahahathat's the problem you have to view the toggle CI to see the results12:58
dmsimardoh I see what you mean12:58
mordred(which is still a bug - but incorrect ui is slightly less disaster than incorrect voting)12:58
mwhahahacorrect it's a presentation issue so i don't know if that's in the way it's posted or whatever12:58
*** huanxie has joined #openstack-infra12:59
dmsimardI got mistaken because the -V vote had been moved but it's because it was enqueued to gate jobs12:59
mordredmwhahaha: it's ... the way that table is created is some amazing hacky javascript :)12:59
mwhahahammmm hacky javascript my favorite :D12:59
*** apetrich has quit IRC12:59
*** sree has joined #openstack-infra12:59
*** Shrews has joined #openstack-infra12:59
mordredmwhahaha: when we upgrade gerrit to 2.14 there's an actual API that's been added to gerrit that will allow us to register build results directly - which will be GREAT (thanks zaro)13:00
*** apetrich has joined #openstack-infra13:00
mordredmwhahaha: but for now it's the hacky javascript - so we'll need to figure out what is confusing it13:00
dmsimardhey if the 2.14 upgrade goes as smooth as the 2.13 one, it'll be awesome13:00
mwhahahamordred: the name13:00
mwhahahamordred: see https://review.openstack.org/#/c/506186/413:00
mwhahahamordred: it used to be 'jenkins' and 'jenkins check'13:00
mwhahahawe need a zuul and zuul check13:00
mordredmwhahaha: so - that's all actually driven by scraping the html in the comments ...13:01
*** slaweq has joined #openstack-infra13:02
mordredso looking to see what's different about the two jenkins comments vs. the two zuul comments13:02
*** jcoufal has joined #openstack-infra13:02
*** mriedem has joined #openstack-infra13:02
cmurphyis MERGER_FAILURE another incarnation of out of inodes? https://review.openstack.org/#/c/500005/13:02
*** jpena|lunch is now known as jpena13:03
jkilpatr_any idea why I'm getting permission denied pubkey when doing this13:04
*** Shrews has quit IRC13:04
jkilpatr_13:03:06 + git remote add gerrit https://review.openstack.org/openstack/browbeat13:04
jkilpatr_13:03:06 + git fetch --all13:04
*** Guest78255 has quit IRC13:05
jkilpatr_wondering if it's an upstream issue or if I should start debugging the downstream jenkins.13:05
dmsimardcmurphy: merger failure would not occur on the server we've been having inode issues on13:05
cmurphydmsimard: okay, is it related to something else you know about or should i add it to the etherpad?13:05
*** dave-mccowan has quit IRC13:06
*** tmorin has quit IRC13:06
*** Shrews has joined #openstack-infra13:06
cmurphylooks like you're adding it, ty13:06
dmsimardcmurphy: nothing I know about but I already added it to the etherpad after skimming to see if it wasn't already there.. iirc jeblair did some fixes but maybe there is something else13:07
dmsimardor maybe it hasn't merged yet, let me look13:07
AJaegermordred: morning! All proposal jobs are broken (see line 90 in etherpad), publish-api-ref is broken (line 78 - fixed with https://review.openstack.org/#/c/512267/) and the post upstream-translation job just gives an error - in case you want to review/dig into either of these...13:08
mordredAJaeger: yay!13:08
*** bobh has quit IRC13:08
*** tmorin has joined #openstack-infra13:08
dmsimardcmurphy: yeah, the change merged ( https://review.openstack.org/#/c/511957/ ) which jeblair mentioned was a fix for mergers but it's merged and very likely loaded at this point. It is maybe a new one, feel free to recheck for the time being.13:09
cmurphydmsimard: the failure only happened an hour ago so if the fix was merged three days ago i think it would be something different13:10
dmsimardyup.13:10
*** links has quit IRC13:12
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Make git repos writable so that hard links work  https://review.openstack.org/51228513:12
*** bauwser is now known as bauzas13:12
mordredpabelanger: ^^ how about that?13:12
openstackgerritMonty Taylor proposed openstack-infra/openstack-zuul-jobs master: Fix set-service-type-data-fact role  https://review.openstack.org/51229213:14
mordredoh - whoops - pabelanger had that one already13:14
AJaegermordred: yeah - just +2A https://review.openstack.org/512267 ;)13:15
pabelangerlooking13:15
*** dave-mccowan has joined #openstack-infra13:16
mordredyup. done13:16
pabelangerjkilpatr_: where are you seeing the permission denied pubkey? Have a log?13:16
pabelangermordred: +213:17
*** dbecker has joined #openstack-infra13:18
mordredtmorin: you should be able to depends-on with https://review.openstack.org/51228513:18
*** armaan has quit IRC13:19
*** Shrews has quit IRC13:19
*** armaan has joined #openstack-infra13:20
*** mat128_ has joined #openstack-infra13:20
*** rosmaita has joined #openstack-infra13:20
jkilpatr_pabelanger, a downstream jenkins, playing around with the issue more it seems to be a local settings issue. I'll ping the right people.13:20
pabelangerAJaeger: mind adding https://review.openstack.org/512114 to your review pipeline13:20
*** mat128_ has quit IRC13:20
*** wolverineav has joined #openstack-infra13:20
*** Shrews has joined #openstack-infra13:21
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Fix syntax errors with set-service-type-data-fact role  https://review.openstack.org/51226713:21
tmorinmordred: thanks, trying this right now (https://review.openstack.org/512294)13:22
AJaegerpabelanger: +2A13:22
*** mat128 has quit IRC13:22
openstackgerritMonty Taylor proposed openstack-infra/project-config master: It is pre_tasks, not pre-tasks  https://review.openstack.org/51229513:22
pabelangerAJaeger: now we wait for post-api-ref job^13:23
*** dbecker has quit IRC13:23
*** smatzek has quit IRC13:23
mordredAJaeger: ^^ that should fix the proposal job issue13:23
pabelanger+313:23
*** kiennt26 has joined #openstack-infra13:23
AJaegermordred: that's all? ansible should be intelligent enough to self-fix ;) Thanks!13:24
pabelangeryah, once we get linters job working, should be able to catch a lot of that13:24
pabelangerI'm going to try hacking on that this week13:24
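The pre_tasks versus pre-tasks mistake fixed in 512295 is the kind of thing a small lint could flag. The following is only a sketch, assuming the playbook has already been parsed into a list of dicts; the keyword set is a small hand-picked sample of Ansible play keywords and the function name is hypothetical. It is not the linters job discussed here.

```python
# A few Ansible play-level keywords that contain underscores (sample only).
PLAY_KEYWORDS = {
    "pre_tasks",
    "post_tasks",
    "gather_facts",
    "any_errors_fatal",
    "max_fail_percentage",
}


def find_hyphenated_keywords(plays):
    """Return (play_index, bad_key, suggested_key) for keys spelled with '-'.

    `plays` is a list of dicts, one per play, as a YAML loader would produce.
    """
    problems = []
    for index, play in enumerate(plays):
        for key in play:
            fixed = key.replace("-", "_")
            if key != fixed and fixed in PLAY_KEYWORDS:
                problems.append((index, key, fixed))
    return problems
```

For a play such as `{"hosts": "all", "pre-tasks": []}` the function reports the key and suggests `pre_tasks`.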
*** gcb has joined #openstack-infra13:25
AJaegerpabelanger: jlk has a linters job up as well that includes a few fixes13:25
AJaegerpabelanger: https://review.openstack.org/50375113:26
*** Shrews has quit IRC13:27
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Remove windmill from zuul-legacy-project-templates  https://review.openstack.org/51211413:29
*** kashyap has left #openstack-infra13:29
*** huanxie has quit IRC13:29
*** ihrachys has joined #openstack-infra13:30
*** erlon has joined #openstack-infra13:30
mordredpabelanger, tmorin: https://review.openstack.org/512270 DNM - testing 512285 ... I re-submitted that one with a depends-on the chmod patch13:32
pabelangerack13:32
mordredsince that happens in the zuul-cloner copy role, I'm pretty sure it's depends-on-able, which'll be nice if it fixes it13:33
*** rkukura has joined #openstack-infra13:33
pabelangeryah, watching stream now13:33
*** slaweq has quit IRC13:34
*** rkukura has quit IRC13:34
*** dbecker has joined #openstack-infra13:35
*** baoli has joined #openstack-infra13:36
mpetersonAJaeger: hey! I was wondering if we can merge these? https://review.openstack.org/#/c/512238/ and https://review.openstack.org/#/c/512254/13:36
*** gongysh has joined #openstack-infra13:36
*** gongysh has quit IRC13:37
*** kiennt26 has quit IRC13:37
*** slaweq has joined #openstack-infra13:37
*** lbragstad has joined #openstack-infra13:37
AJaegermpeterson: all changes need a second +2... and the second needs a recheck once the first is in.13:38
mordredmpeterson, AJaeger: first one approved13:40
dmsimardmordred, pabelanger: https://review.openstack.org/#/c/512295 would very likely have been caught by an 'ansible-playbook --syntax-check' or by ansible-lint, does it not run ?13:40
mpetersonAJaeger: gotcha.13:40
mpetersonmordred: thanks13:40
*** hongbin has joined #openstack-infra13:41
*** kgiusti has joined #openstack-infra13:41
AJaegerdmsimard: no, we need one - see my discussion with pabelanger earlier and check https://review.openstack.org/503751  and https://review.openstack.org/49546313:42
*** ramishra has quit IRC13:42
dmsimardAJaeger: ack13:43
openstackgerritMerged openstack-infra/project-config master: It is pre_tasks, not pre-tasks  https://review.openstack.org/51229513:43
*** mriedem1 has joined #openstack-infra13:43
*** Hal has joined #openstack-infra13:44
*** Hal is now known as Guest6884413:44
pabelangerAJaeger: dmsimard: Yah, at PTG I talked with mordred about creating a linters job to pull in all playbooks and roles, to be shared with project-config, zuul-jobs, openstack-zuul-jobs. Then we could get good coverage going across all three repos.13:44
dmsimardI added it to zuulv3-issues in response to the pre-tasks vs pre_tasks13:45
pabelangerI'm flying back up to Ottawa on Wednesday, will have some free time to hack on it13:45
dmsimardpabelanger: oh that's right, OpenStack Canada thursday.. need to leave from Montreal around like 6AM T_T13:45
AJaegerdmsimard: don't add it to the "Bug fixes section" - we clean that up regularly, it's solely for tracking...13:46
*** dtantsur_ has joined #openstack-infra13:46
dmsimardsure13:46
mordredpabelanger: looks like https://review.openstack.org/512270 worked - let's land 51228513:46
pabelangermordred: 512285 worked: http://logs.openstack.org/70/512270/2/check/legacy-networking-bgpvpn-dsvm-functional/3897d47/13:47
pabelangerhaha13:47
*** _bmjen has joined #openstack-infra13:47
mordred:)13:47
mordredAJaeger: wanna +3 https://review.openstack.org/#/c/512285/ ?13:47
openstackgerritMerged openstack-infra/project-config master: networking-odl: Add required-projects missing  https://review.openstack.org/51223813:47
*** mriedem has quit IRC13:48
*** liusheng has quit IRC13:48
*** bmjen has quit IRC13:48
*** dtantsur has quit IRC13:48
*** dtantsur_ is now known as dtantsur13:48
*** liusheng has joined #openstack-infra13:48
AJaegermordred: done13:48
*** bobh has joined #openstack-infra13:49
*** mriedem1 is now known as mriedem13:49
*** ryotagami has quit IRC13:51
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Collapse job header lines into a single statement  https://review.openstack.org/51182113:51
*** psachin has joined #openstack-infra13:53
mpetersonI have a question regarding migrating. Once everything is migrated then the project no longer needs to exist on `project-config/zuul.d/projects.yaml`, right? As it will be on the project repository13:54
AJaegermpeterson: Wrong ;)13:54
mordreddmsimard, pabelanger: ^^ I have fixed dmsimard's review comments on https://review.openstack.org/511821 - pabelanger, are you ok with the followup patch version?13:54
AJaegermpeterson: did you read the zuulv3 document?13:54
*** gouthamr has joined #openstack-infra13:54
pabelangerAJaeger: did you want to make project-config-frozen-files voting now in project-config?13:54
AJaegermpeterson: https://docs.openstack.org/infra/manual/zuulv3.html#moving-legacy-jobs-to-projects13:54
*** chlong has joined #openstack-infra13:54
pabelangermordred: looking13:54
AJaegerpabelanger: we could - right now I'm fine taking in changes that touch those files in addition to v3 files. But we can change13:55
mpetersonAJaeger: yes, and that bit wasn't clear to me :) because 3.1 of those steps says to remove them from the pipeline definition13:55
mordredpabelanger: (mostly want to either land those or abandon them to get them off the list)13:55
pabelangermordred: I'd like to see ansible_hostname if possible, that is helpful information to quickly tell the type of node in nodepool (eg: centos in vexxhost)13:55
pabelangermordred: but that could be a follow up13:56
AJaegermpeterson: did you see "what not convert"?13:56
*** armax has joined #openstack-infra13:56
AJaegermpeterson: you remove the legacy jobs, leave the standard ones in13:56
mordredmpeterson, AJaeger ah - I think we could add the word "legacy" to that sentence in 3.1 ...13:56
openstackgerritMerged openstack-infra/puppet-subunit2sql master: Ensure that build_names are unique per project  https://review.openstack.org/50825813:57
openstackgerritMerged openstack-infra/puppet-subunit2sql master: Strip legacy prefix if it's present  https://review.openstack.org/50827213:57
pabelangerAJaeger: okay, your call13:57
*** ryotagami has joined #openstack-infra13:57
*** kiennt26 has joined #openstack-infra13:57
mpetersonAJaeger: wow, I had actually read that.. Just didn't stick to my mind...13:58
mtreinishinfra-root: ^^^ we might need to manually restart the subunit workers to take those13:58
pabelangermtreinish: can you add that to zuulv3-issues etherpad13:59
pabelangerso we don't lose them13:59
openstackgerritMonty Taylor proposed openstack-infra/infra-manual master: Clarify job removal section  https://review.openstack.org/51230413:59
mordredAJaeger: ^^^ like that13:59
mordredmpeterson: also, would that have helped?14:00
*** dbecker has quit IRC14:00
mordredpabelanger: kk. I can add followup with that14:00
AJaegermpeterson: if you have ideas for better wording, feel free to send changes14:00
mtreinishpabelanger: sure14:00
AJaegerbbl14:01
openstackgerritMerged openstack-infra/zuul-jobs master: Make git repos writable so that hard links work  https://review.openstack.org/51228514:01
*** jcoufal_ has joined #openstack-infra14:01
mpetersonmordred: I think it would have raised a flag on my mind, yes. I would probably add a (See "What Not to Convert?")14:01
mpetersonAJaeger: will keep that in mind14:01
openstackgerritStephen Finucane proposed openstack-dev/pbr master: deprecations: Deprecate support for '-py{N}' requirements  https://review.openstack.org/51230614:02
jeblairandreaf: i think for devstack, and likely tempest too, we should have the job definition in multiple branches.  when you do that, they become variants, and override settings on the main job.  I think that will be a good way to manage configuration changes across branches as devstack evolves.14:03
mpetersonso I have another doubt now... it says for example to leave openstack-python-jobs in project-config... but in our specific case we need a lot more requirements than for example the one given by openstack-python-jobs-neutron, so I defined a project-template that includes those requirements. In that case, in project-config openstack-python-jobs-neutron should remain and in the project repo we add14:04
mpetersonthe project-template defined in the repo?14:04
*** yamahata has joined #openstack-infra14:04
andreafjeblair: well tempest is branchless14:04
*** jcoufal has quit IRC14:04
jeblairandreaf: however, it's also possible to do all of that on the master branch if we want.  i just think that putting it on different branches may be more intuitive.14:04
jeblairandreaf: ah right, let's not do that for tempest then.   :)  we can still define all the branches on master.14:04
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Add tripleo-common to tripleo change queue for gate  https://review.openstack.org/51230814:05
mordredpabelanger: actually -we can't emit hostname -that job header runs on localhost (so that we don't print job info more than one time)14:05
andreafjeblair: is it possible to do that at role granularity? i.e. if override a single role in a branch for instance?14:05
*** dbecker has joined #openstack-infra14:05
pabelangermordred: boo14:05
pabelangermordred: but makes sense14:06
pabelangermordred: wonder if we should write it into inventory along with other nodepool variables. I can look into that in a bit14:06
* fungi is attempting to skim the ~700 lines of scrollback which have accumulated since he went to sleep last night. may take a while14:06
mordredpabelanger: well, it's in the host-info file: http://logs.openstack.org/85/512285/2/check/tox-linters/318980d/zuul-info/host-info.ubuntu-xenial.yaml14:06
*** mat128 has joined #openstack-infra14:07
mordredpabelanger: and, for that matter, the zuul-info file too: http://logs.openstack.org/85/512285/2/check/tox-linters/318980d/zuul-info/zuul-info.ubuntu-xenial.txt14:07
*** yamamoto has quit IRC14:07
mordredjeblair: there is an issue related to the discussion above with andreaf ...14:07
pabelangerAJaeger: with 512308, it might be time for tripleo to create something along the lines of 'integrated-gate' project-template, but 'tripleo-integrated-gate'. Otherwise, I see a lot of places where we are adding the queue keyword into jobs14:07
jeblairandreaf: not for a single role, though you can specify a branch for the project the role is in14:07
mordredjeblair: on the https://etherpad.openstack.org/p/zuulv3-issues line 6214:08
pabelangermordred: yah, just helpful to print it in realtime too. But, that's what I end up looking at14:08
jeblairmordred: ack, thanks14:08
mordredpabelanger: agree. maybe we should add a emit-host-header that's similar but per-host14:09
dtantsurfolks, any known problems with our SIP conferencing? we're trying to have a virtual meetup, but apparently only one person can connect of all14:09
mordredpabelanger: it's mostly the most useful when I'm watching a live stream and the other files don't yet exist14:09
dtantsurwell, 214:09
pabelangerdtantsur: I can look14:10
*** slaweq has quit IRC14:10
dtantsurpabelanger: thanks! it's strange. Registrar-less accounts work for half of people. People using Red Hat SIP accounts can connect but do not hear anyone14:10
pabelangerdtantsur: what conference room14:11
dtantsurpabelanger: 600514:11
pabelangerdtantsur: yah, I see 9 users. Let me call in real quick14:11
pabelangerdtantsur: you can try testing with sip:echo@pbx.openstack.org14:12
*** wolverineav has quit IRC14:12
dtantsursure14:12
*** hashar is now known as hasharAway14:12
pabelangerdtantsur: that should allow users to test audio14:12
*** masber has joined #openstack-infra14:13
*** dhinesh has joined #openstack-infra14:13
*** iyamahat has joined #openstack-infra14:13
dtantsurpabelanger: the same result14:14
pabelangerdtantsur: no audio? or 1 way audio14:14
dtantsurpabelanger: I hear nothing. the mic level in jitsi seems to move when I speak14:14
*** david-lyle has joined #openstack-infra14:14
pabelangerdtantsur: can you try again14:15
dtantsurthis is with RH SIP, with registrar-less I still cannot connect14:15
dtantsurstill nothing14:15
pabelangerdtantsur: well, all traffic in to pbx.o.o will be not register.14:15
pabelangerdtantsur: again please14:16
pabelangercollecting sip trace14:16
dtantsurpabelanger: which account to use?14:16
*** rkukura has joined #openstack-infra14:16
*** rbrndt has joined #openstack-infra14:16
dtantsurtrying with a registrar-less one now14:17
*** slaweq has joined #openstack-infra14:17
pabelangerdtantsur: there is no account, we just allow anonymous14:17
pabelangerdtantsur: do you have ekiga installed?14:17
dtantsurwell, for me the difference is big14:17
dtantsurpabelanger: no, but I can install it14:17
*** dhinesh has quit IRC14:17
*** masber has quit IRC14:17
*** dbecker has quit IRC14:17
pabelangerIt's likely something with NAT on redhat side, SIP trace looks to be correct14:18
pabelangerI should be able to test that in a bit with my corp account, but not right now14:18
pabelangerlets see how ekiga works14:18
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Start ssh persistent connections outside bwrap  https://review.openstack.org/51215814:18
*** dizquierdo has joined #openstack-infra14:18
*** wolverineav has joined #openstack-infra14:19
*** milan_ has joined #openstack-infra14:19
dtantsurpabelanger: trying ekiga now with echo14:19
dtantsurpabelanger: "remove host is offline"14:20
*** smarcet has joined #openstack-infra14:20
pabelangerdtantsur: I didn't see attempt14:20
dtantsurI suspect you only see the attempts from my RH SIP14:21
dtantsurI wonder if my provider just blocks SIP in the end..14:21
dmsimardmordred, pabelanger: btw ianw said he'd like to see the nodepool provider in the job header but he says it was also available in the inventory if need be. I don't have a strong opinion, maybe you have one :)14:21
jeblairdmsimard: yeah, though by adding it to the job log output, a user can see it before the job completes14:22
dmsimardoh, there's a nice thought14:22
dmsimardindeed14:22
dmsimardDo you want to add it on https://review.openstack.org/#/c/511821/ mordred ?14:23
mordreddmsimard: I'll do a followup ...14:23
dmsimardwfm14:23
*** slaweq_ has quit IRC14:24
*** slaweq_ has joined #openstack-infra14:24
pabelangerdtantsur: best I can see, we are getting Retransmission timeout from RH account14:25
dtantsurmeh..14:25
pabelangerwhich likely is a routing issue14:26
dtantsurthanks pabelanger! we'll probably have to try bluejeans or something like that14:26
AJaegermordred, jeblair, jlk, upstream-translation-update fails without a log file - it just gives "error". I put this on the etherpad14:26
*** smarcet has quit IRC14:26
pabelangerdtantsur: okay, sorry we couldn't get it working14:27
dtantsurnp14:28
*** slaweq_ has quit IRC14:29
jeblairmordred: is the issue described under "Still seeing the hardlink copy failures between /home/zuul/... and /tmp in OSC functional jobs  (dtroyer/ianw)" the same as the one described under "Running tox as alternative users" ?14:29
*** david-lyle has quit IRC14:29
mordredjeblair: yes14:29
* jeblair rejiggers etherpad14:30
*** david-lyle has joined #openstack-infra14:30
mordredjeblair: at least, that is my current belief14:30
*** camunoz has joined #openstack-infra14:30
jeblair512285 merged, i'll move the combined entry to fixed14:31
jeblairdtroyer: can you retry the osc jobs that failed with perms issues?14:32
pabelangerokay, I see build-openstack-releasenotes failing in gate. Possible issue with tox-siblings http://logs.openstack.org/30/511630/1/gate/build-openstack-releasenotes/06eeaac/ara/result/eb67d5f9-3800-4565-adc0-e3ce284da7eb/14:33
pabelangergoing to check if known issue14:33
dtroyerjeblair: https://review.openstack.org/512125 is ianw's fix, passed check and gate, https://review.openstack.org/509015 is the original, passed check, gate failure appears to be one of our occasional racy failures in the functional tests, unrelated to zuulv314:34
dtroyerthanks14:35
jeblairdtroyer: cool, thx14:35
pabelangerokay, I've added build-openstack-releasenotes MODULE_FAILURE to zuulv3-issue14:35
jeblairpabelanger: can you move that to issues with jobs?14:35
pabelangersomething not correct in tox-siblings14:35
jeblairpabelanger: you have triaged it -- it's a job bug not a zuul bug14:36
pabelangersure, and done14:36
*** sree has quit IRC14:37
*** camunoz has quit IRC14:38
dmsimardjeblair: btw there was a MERGER_FAILURE earlier, I posted it in triage14:38
jeblairdmsimard: ack, thx14:39
*** dbecker has joined #openstack-infra14:39
* clarkb attempts to catch up on scrollback like fungi14:39
clarkbpabelanger: so we think inodes for /var/lib/zuul filling?14:40
mordredpabelanger: didn't we land a patch forever ago to make sure MODULE FAILURE was printed to the console log?14:40
clarkbwe should be able to stop executors one at a time and reformat that partition right?14:40
pabelangerclarkb: I am not sure, it was something AJaeger suggested we look at14:41
pabelangermordred: cannot remember, but I'll look on executor now14:41
pabelangermordred: I guess not14:42
*** esberglu has quit IRC14:46
jeblairmordred: regarding the magnum stable jobs not running issue -- i think there is perhaps a subtlety to https://review.openstack.org/508658 that we missed.  if a project adds an in-repo project stanza, they either need to add it to all branches, or specify all the branches on the jobs if they only add it to master.14:46
jeblairmordred: i think we need an infra-manual zuulv3 section update for that14:46
*** d0ugal has quit IRC14:46
*** yamahata has quit IRC14:49
*** yamahata has joined #openstack-infra14:50
jeblair(we could consider not doing implied branch matchers on project stanzas in the master branch, but then if you add a job to master, it will also run on stable, so you'd have to do explicit branch exclusions, which partly defeats the purpose of having implied branch matchers on non-master branch project stanzas)14:50
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Fix wrong paths for api-guide/api-ref  https://review.openstack.org/51231914:52
AJaegerthis fixes publishing of guides, reviews welcome ^14:52
*** camunoz has joined #openstack-infra14:53
*** gcb has quit IRC14:54
Jeffrey4ldoes /var/lib/zuul/builds folder really exist? why i can not find it on the ci node?14:54
Jeffrey4l in zuulv314:54
*** xarses has joined #openstack-infra14:55
clarkbJeffrey4l: that is on the executor not the test node(s)14:55
pabelangerJeffrey4l: that exists on our zuul-executors, so host: localhost14:55
odyssey4meis it normal to not see the console log - for example all I see is 'end of stream' here: http://zuulv3.openstack.org/static/stream.html?uuid=4d46810442244f549a25c1cf70b4c4fe&logfile=console.log14:55
*** wolverineav has quit IRC14:56
Jeffrey4lso where is the executor node? is it a static node?14:56
pabelangerodyssey4me: which patchset is that?14:56
*** wolverineav has joined #openstack-infra14:56
*** slaweq_ has joined #openstack-infra14:56
*** sree has joined #openstack-infra14:56
pabelangerJeffrey4l: Yah, they exist in our control plane, ze01.o.o to ze10.o.o14:56
odyssey4mepabelanger https://review.openstack.org/50850914:56
Jeffrey4lroger. thanks14:57
odyssey4mepabelanger actually, sorry it's https://review.openstack.org/51191214:57
odyssey4mepabelanger another issue is that the legacy-ansible-func job is non-voting for some reason - where do I correct it?14:57
pabelangerodyssey4me: do you know which job the console stream was? could it be possible the job was actually finished?14:58
odyssey4mepabelanger I tried at the beginning of the job, and during it - no difference14:59
*** slaweq has quit IRC14:59
*** slaweq has joined #openstack-infra14:59
odyssey4mepabelanger it's still running now - openstack-ansible-tests14:59
clarkbodyssey4me: openstack-infra/openstack-zuul-jobs/zuul.d/zuul-legacy-project-templates.yaml to fix the non voting thing14:59
pabelangerodyssey4me: you can look in project-config/zuul.d/project.yaml14:59
*** beekneemech is now known as bnemec15:00
odyssey4meclarkb pabelanger thanks - patch incoming15:00
*** d0ugal has joined #openstack-infra15:00
*** wolverineav has quit IRC15:00
*** slaweq_ has quit IRC15:00
*** slaweq has quit IRC15:00
pabelangerodyssey4me: both streams for openstack-ansible-tests are working for me currently15:01
clarkbpabelanger: looks like it is inodes /dev/xvde2        4743168 4743168          0  100% /var/lib/zuul <- just now from ze0315:01
pabelangerclarkb: kk15:01
*** dbecker has quit IRC15:01
pabelangernice work15:01
*** Shrews has joined #openstack-infra15:01
odyssey4mepabelanger hmm, maybe it's a firewall thing - but it's never seemed to work for me, whether at the office or home15:01
pabelangerodyssey4me: but, not streaming should be working15:01
clarkbso we probably do want to shut off each executor one by one and reformat that partition with significantly more inodes15:01
clarkbwe have 4 million now, maybe go up to 100 million?15:01
odyssey4meI'll try again from home tomorrow and look more closely at browser things.15:01
odyssey4meThanks for checking pabelanger .15:01
pabelangerodyssey4me: k, we do have finger protocol too15:02
fungiclarkb: well, you set it as a ratio of inodes to filesystem size15:02
pabelangerbut still need to add links to status page15:02
clarkb(I've got a meeting right now but can annotate the etherpad shortly then help with that work)15:02
clarkbfungi: mkfs.ext4 takes a -N flag to just set it directly15:03
clarkbbut we can also bump the ratio instead, either way as long as we end up with more :)15:03
pabelangerodyssey4me: if you want to add your issue to etherpad.openstack.org/p/zuulv3-issues we can try and debug once you are at home15:03
clarkbI also wonder if ze03 has leaked inodes?15:04
fungiclarkb: ahh, right, i guess you can do it either way (it gets stored as a ratio on the fs though)15:04
clarkbeg do we delete all the things on shutdown?15:04
clarkbI'll have to dig in more after meeting but that gives us a good place to start15:04
jeblairclarkb: what's the disk usage that goes along with those inodes?15:06
clarkbjeblair: /dev/xvde2       72G   40G   29G  59% /var/lib/zuul was taken at roughly the same time as the inode listing15:07
*** ijw has joined #openstack-infra15:07
*** sdague has quit IRC15:07
clarkb60%15:07
mordredpabelanger: ok - I see the bug with ironic-inspector and build-openstack-releasenotes15:07
*** sdague has joined #openstack-infra15:07
*** yamamoto has joined #openstack-infra15:08
pabelangermordred: great15:08
mordred[testenv:releasenotes]15:08
mordredenvdir = {toxworkdir}/venv15:08
mordredthey redefine envdir for that venv in their tox.ini15:08
clarkbmordred: maybe in that case we stat if the dir exists where we want it then ignore copying those logs otherwise?15:08
openstackgerritFlavio Percoco proposed openstack-infra/project-config master: Add ansible-role-k8s-cookiecutter  https://review.openstack.org/51232315:10
mordredclarkb: yah - I think for today we can skip if it doesn't exist - and I think we can make it better for tomorrow15:10
*** Shrews has quit IRC15:12
clarkbwe can likely just copy anything out of .tox/**/log15:12
clarkbor is it .tox/**/logs15:12
clarkb(to use the jenkins globbing method)15:12
pabelangerit was like that originally but think we changed it15:12
*** dave-mccowan has quit IRC15:13
AJaegerteam, please put https://review.openstack.org/512319 on your review queue - to fix a wrong path for api-ref/api-guide publishing15:13
*** yamamoto has quit IRC15:14
*** armax has quit IRC15:14
pabelanger+215:14
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Work around non-standard envdir locations  https://review.openstack.org/51232515:14
AJaegerthanks15:14
*** kjackal_ has quit IRC15:15
clarkbpabelanger: I think the old method was grab anything under .tox and assume it had a log dir which is why it failed. There is at least one dir that doesn't collect logs that tox uses15:15
clarkbpabelanger: so we just have to glob it properly I think15:15
pabelangerclarkb: ah, possible15:16
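The log-collection idea discussed above (take a tox log directory only where it actually exists, since a project can redefine envdir) can be sketched in Python. This is only an illustration under stated assumptions: the function name find_tox_log_dirs is hypothetical, the layout assumed is .tox/log plus .tox/<env>/log, and it is not the implementation in either of the zuul-jobs changes linked above.

```python
import glob
import os


def find_tox_log_dirs(project_root):
    """Return the tox log directories that exist under project_root.

    Hypothetical helper. Assumes tox writes logs to .tox/log and to
    .tox/<envdir>/log, and silently skips environments without a log dir.
    """
    tox_dir = os.path.join(project_root, ".tox")
    patterns = [
        os.path.join(tox_dir, "log"),       # tox's own top-level log dir
        os.path.join(tox_dir, "*", "log"),  # one log dir per environment dir
    ]
    found = []
    for pattern in patterns:
        for path in glob.glob(pattern):
            # Only keep real directories; a missing log dir is not an error.
            if os.path.isdir(path):
                found.append(path)
    return sorted(found)
```

Because the pattern matches any directory name under .tox, a redefined envdir such as {toxworkdir}/venv is picked up without the job needing to know the environment name.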
mordredpabelanger, clarkb, AJaeger: ^^ https://review.openstack.org/512325 should work around the releasenotes job for ironic-inspector15:16
mordredand I mention a follow up we can write later15:16
mordredwe also need to do a similar thing for fetching logs15:16
clarkbdu --inodes says half a million inodes in executor-git then 3.9 million in builds15:17
*** e0ne has joined #openstack-infra15:17
dmsimardDo we have/use python3 on logs.o.o (for os-loganalyze) ?15:18
clarkbdmsimard: I think it is python2 currently but it should be able to run python3.415:18
pabelangeryah, logs.o.o is trusty still15:18
* clarkb is digging in more to see if there are exceptional builds or if this is normal ish, but we copy the git repos in there don't we? guessing its copies of git repos that may be a problem especially if not hardlinked15:19
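The inode counts quoted above came from du --inodes. A rough Python equivalent is sketched below for illustration; count_inodes is a hypothetical name, and the result may differ slightly from what du reports. It counts each (device, inode) pair once, so hardlinked copies of git objects are not double counted, which is the distinction raised above.

```python
import os


def count_inodes(root):
    """Count distinct inodes used by root and everything beneath it.

    Hypothetical helper, roughly comparable to `du --inodes -s root`.
    Hardlinks share an inode, so they are counted only once.
    """
    seen = set()
    st = os.lstat(root)
    seen.add((st.st_dev, st.st_ino))
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            try:
                # lstat so that symlinks count as themselves, not their targets.
                st = os.lstat(os.path.join(dirpath, name))
            except OSError:
                # Entry vanished while walking (builds come and go); skip it.
                continue
            seen.add((st.st_dev, st.st_ino))
    return len(seen)
```

Running it on each subdirectory of /var/lib/zuul would give a per-directory breakdown similar to the executor-git versus builds split mentioned above.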
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Ignore missing tox log directories  https://review.openstack.org/51232715:19
dmsimardclarkb: ok, I'll write something compatible with both just in case.. py3 has something convenient for a problem I have15:19
openstackgerritJames E. Blair proposed openstack-infra/infra-manual master: Zuulv3: Add note about in-repo stable config  https://review.openstack.org/51232815:20
jeblairmordred, AJaeger: ^ does that make sense?15:20
*** trown is now known as trown|brb15:20
odyssey4mepabelanger clarkb - I've added an issue with our jobs into 'issues needing triage' in the etherpad... not too crucial at this time, but when there's a gap we'd like to get it resolved15:21
ilpianista_hi, what's needed to accept this change? https://review.openstack.org/#/c/507531/15:21
mordredjeblair: yes - with one wording nit (expand a pronoun)15:21
ilpianista_I see no updates since 3 weaks...15:21
clarkb#status log elasticsearch cluster is now green after triggering index curator early to clear out old indexes "lost" on es0715:21
openstackstatusclarkb: finished logging15:21
ilpianista_weeks*15:21
openstackgerritFlavio Percoco proposed openstack-infra/project-config master: Add ansible-role-k8s-cookiecutter  https://review.openstack.org/51232315:22
AJaegerjeblair: this backporting is worth an email to openstack-dev IMHO15:22
*** LindaWang has quit IRC15:23
openstackgerritJames E. Blair proposed openstack-infra/infra-manual master: Zuulv3: Add note about in-repo stable config  https://review.openstack.org/51232815:23
jeblairmordred: good call15:23
clarkbilpianista_: JJB is mostly reviewed by electrofelix, zaro, and zxiiro at this point. It is possible that we may need more volunteers around that tool, particularly since we don't consume it anymore15:23
clarkbor maybe go to single core approvals so you aren't waiting for a second reviewer15:23
jeblairthere's also a jjb channel15:23
zxiiroilpianista_: we need another core to review it. I already have so cannot review it again. Also the jjb channel is now #openstack-jjb15:24
*** trown|brb is now known as trown15:24
*** gmann is now known as gmann_afk15:25
jeblairAJaeger: agreed.  what topic flags should we use?15:25
tmorinhi infraroot: I have a template producing both a check and a gate job, I need the check job becoming non-voting, and (consistently) want to plainly disable the gate job ... how can I do that ?15:25
openstackgerritMerged openstack-infra/project-config master: Fix wrong paths for api-guide/api-ref  https://review.openstack.org/51231915:25
AJaegerjeblair: just choose one ;)15:26
* AJaeger needs to go offline now, will be back later15:26
jeblairmordred: do you remember who reported the magnum issue?15:26
jeblairtmorin: can you point me at the template?15:27
*** kiennt26 has quit IRC15:27
tmorinjeblair: http://git.openstack.org/cgit/openstack-infra/openstack-zuul-jobs/tree/zuul.d/zuul-legacy-project-templates.yaml#n120015:28
jeblairtmorin: delete the gate section entirely, then under 'branches' in the check job say "voting: false"15:28
tmorinjeblair: I was hoping that there would be something like "voting: false", but for completely disabling15:28
jeblairtmorin: well, removing it is completely disabling it15:29
tmorinjeblair: I don't/can't change the template definition, I just want the gate job disabled in one place where the template is used15:29
*** Shrews has joined #openstack-infra15:29
*** Shrews has quit IRC15:29
*** Shrews has joined #openstack-infra15:29
openstackgerritFlavio Percoco proposed openstack-infra/project-config master: Add ansible-role-k8s-cookiecutter  https://review.openstack.org/51232315:29
openstackgerritFlavio Percoco proposed openstack-infra/project-config master: ansible-role-k8s-cookiecutter to zuul.d/projects  https://review.openstack.org/51233015:29
pabelangeryah, creating a new project-template for non-voting is how we usually had it15:29
tmorinjeblair: sorry, I realise I was a bit elliptic15:29
mwhahahatmorin: jeblair: we think we spotted it, we're getting git.o.o instead of review.o.o for the changes somewhere15:30
pabelangerhandle*15:30
jeblairtmorin: maybe don't use the template and just manually add the check job?15:30
*** slaweq has joined #openstack-infra15:30
*** sbezverk has quit IRC15:31
tmorinjeblair: ah, yes, sure ( no risk that I might get out-of-sync with whatever the template would tweak in the job later ?)15:31
*** bramleya_ has joined #openstack-infra15:31
openstackgerritFlavio Percoco proposed openstack-infra/project-config master: ansible-role-k8s-cookiecutter to zuul.d/projects  https://review.openstack.org/51233015:31
*** dizquierdo has quit IRC15:32
tmorinmwhahaha: nice if you have a fix, but given past week experience, I'd prefer the job to be non-voting, for some time at least, and then see how this goes15:32
mwhahahatmorin: sure15:32
clarkbmordred: is http://logs.openstack.org/25/512325/1/check/tox-py35-on-zuul/8fc7774/tox/ self testing (this is change that updates how we collect tox logs)?15:33
tmorinmwhahaha: we will notify if we see it breaking too often, if you need to, but I guess you must have a tool for that, don't you ?15:33
jeblairtmorin: oh definitely, but you're already out of sync.  you could define 2 templates, one that's non-voting.  but a template for a single job in a single pipeline seems unnecessary.15:33
mwhahahatmorin: we do but it's beneficial to have some communication from the other projects (ie feedback)15:33
tmorinmwhahaha: understood & agreed15:34
*** dbecker has joined #openstack-infra15:34
*** slaweq has quit IRC15:34
openstackgerritThomas Morin proposed openstack-infra/project-config master: n8g-bgpvpn: make triple-o job check/nv  https://review.openstack.org/51233215:35
*** jogo has quit IRC15:36
clarkbdigging into inode usage http://paste.openstack.org/show/623745/ it does seem to largely be git repos15:36
jeblairodyssey4me: legacy-ansible-func itself is defined as non-voting: https://git.openstack.org/cgit/openstack-infra/openstack-zuul-jobs/tree/zuul.d/zuul-legacy-jobs.yaml#n715:36
*** thorst has quit IRC15:37
odyssey4mejeblair that would explain it - any idea why?15:37
*** gongysh has joined #openstack-infra15:37
*** udesale has joined #openstack-infra15:38
jeblairodyssey4me: perhaps a bug in the migration script?15:38
odyssey4mejeblair pabelanger assuming it's alright with you, I'll push up a review to remove that15:38
jeblairodyssey4me: sounds good15:38
pabelanger++15:39
Shrewspabelanger: there are 2 held nodes from the multinode-integration-fedora-26 job hold. probably hold overs from before we had the bug fix in. can i delete those?15:39
pabelangerShrews: yes15:39
jeblairmordred: +3 512328 ?15:39
Shrewspabelanger: done15:40
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/openstack-zuul-jobs master: Switch legacy-ansible-func to voting  https://review.openstack.org/51233415:41
clarkbI've updated the etherpad with info collected re inodes15:41
pabelangerShrews: danke15:41
*** ifat_afek has joined #openstack-infra15:41
mhaydenhow should zuulv3 jobs be handled if they need sudo? the revoke-sudo role appears to remove it15:42
*** e0ne_ has joined #openstack-infra15:42
jeblairmhayden: don't use that role15:42
mhaydenjeblair: hah! i guess i should have my own 'pre' playbook to resolve that? :)15:43
jeblairmhayden: iirc, we don't do it in all jobs, just unit test jobs15:43
*** slaweq has joined #openstack-infra15:44
jeblairmhayden: what kind of job are you building, and what were you going to use as a parent?15:44
*** e0ne_ has quit IRC15:44
mhaydenjeblair: it's a tox job that runs ansible playbooks, and i'm using openstack-tox as the parent15:44
*** jpena is now known as jpena|brb15:45
pabelangerDid we write an openstack-tox-with-sudo job?15:45
fungiand tox is firing scripts which invoke sudo?15:45
*** e0ne has quit IRC15:45
pabelangeryah, we have gate-{name}-tox-with-sudo-{envlist} today in JJB for that15:45
jeblairthere are a lot of 'functional' test jobs which use sudo15:45
jeblairhowever, openstack-tox -> tox -> unittests.  and unittests uses revoke-sudo15:46
*** caphrim007 has quit IRC15:46
jeblairi think there's a slight mismatch here.  we should probably get mordred in on this too.15:46
jeblairit seems like folks should be able to easily say "this is a functional test job that uses tox"15:47
*** markvoelker has quit IRC15:47
jeblair(which therefore requires sudo)15:47
*** jascott1 has quit IRC15:48
*** slaweq has quit IRC15:48
jeblairmaybe we need an openstack-tox-functional which parents to base, and has a pre-playbook that uses the roles in tox and unittests, but omits revoke-sudo.15:49
SpamapSjeblair: I like that plan.15:50
*** wolverineav has joined #openstack-infra15:51
*** e0ne has joined #openstack-infra15:51
pabelangercould somebody remind me where we are setting up ZUUL_CHANGES on legacy zuulv3 jobs?15:51
ifat_afekHi, can someone please help me? I’m trying to understand why vitrage release notes no longer exist: https://docs.openstack.org/releasenotes/vitrage/pike.html15:52
ifat_afekAs far as I know, nobody has changed anything in Vitrage related to the releasenotes15:52
pabelangerifat_afek: https://docs.openstack.org/releasenotes/vitrage/ is empty for some reason15:53
*** caphrim007 has joined #openstack-infra15:53
pabelangerifat_afek: can you update zuulv3-issue on etherpad.o.o in triage section15:53
pabelangerhttps://etherpad.openstack.org/p/zuulv3-issues15:53
fungipabelanger: https://git.openstack.org/cgit/openstack-infra/zuul/tree/zuul/ansible/filter/zuul_filters.py?h=feature/zuulv3#n4315:53
jeblairpabelanger: is that a zuulv3 issue?15:54
clarkbfungi: digging in we have ~18 million blocks so if we set bytes per inode to fs block size we should get a mostly maximum number of usable inodes without wasting space (and that will be ~18 million inodes)15:54
fungiclarkb: and what's the current ratio?15:54
pabelangerjeblair: unknown would need to look more. I can once I figure out an issue with tripleo jobs15:54
ifat_afekpabelanger: sure, I’ll do that, but do you think this is relevant to the latest zuul changes? this link used to exist before, it was somehow deleted15:54
pabelangerfungi: thank you15:55
jeblairdhellmann: can you look into ifat_afek's question?15:55
ifat_afekpabelanger: when I run ‘tox -e releasenotes’ the release notes are generated fine15:55
clarkbfungi: we have 4 million so about 4x larger?15:55
clarkbfungi: blocks size is 4096, that puts it in the 16k range of bytes per inode I think15:55
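The arithmetic behind these figures can be checked with a short Python sketch. The function names are hypothetical, the 72 GiB size is taken from the rounded df output quoted earlier, and the 256-byte inode size is an assumption (the usual mke2fs default for ext4), so the results are estimates rather than what mkfs.ext4 will report exactly.

```python
GIB = 1024 ** 3


def estimate_inodes(fs_size_bytes, bytes_per_inode):
    """Approximate inode count for a given bytes-per-inode ratio (mkfs -i)."""
    return fs_size_bytes // bytes_per_inode


def inode_table_bytes(inode_count, inode_size=256):
    """Space consumed by the inode tables; inode_size of 256 is an assumption."""
    return inode_count * inode_size


if __name__ == "__main__":
    size = 72 * GIB  # rounded size of /dev/xvde2 as shown by df -h
    for ratio in (16384, 4096):
        count = estimate_inodes(size, ratio)
        overhead = inode_table_bytes(count) / GIB
        print("-i %d: ~%d inodes, ~%.2f GiB of inode tables" % (ratio, count, overhead))
```

With these inputs a ratio of 16384 gives about 4.7 million inodes, close to the 4743168 reported on ze03, and a ratio of 4096 gives about 18.9 million. Under the 256-byte assumption the larger inode table would take roughly 4.5 GiB of the partition, so the extra inodes are not entirely free.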
pabelangerifat_afek: okay, looking now15:55
*** martinkopec has quit IRC15:56
ifat_afekpabelanger: thanks! At the moment it prevents me from releasing a new Pike version, so I’ll be really happy to have it fixed15:57
clarkbalso df's output is in kB not B that was only slightly confusing15:57
openstackgerritJulien Danjou proposed openstack-infra/project-config master: Ceilometer: disable Tempest tests for master and Queens  https://review.openstack.org/51234215:57
*** bandini has quit IRC15:57
fungiclarkb: sounds reasonable15:58
*** bandini has joined #openstack-infra15:58
clarkbthat is interesting, btrfs doesn't list inodes? (locally on a btrfs fs it lists 0 inodes available and free when running df)15:59
fungiclarkb: i thought btrfs had adaptable inode handling (like reiserfs)16:00
*** nikhil has joined #openstack-infra16:00
clarkbfungi: ah maybe that is it. I mostly just have it over here in a semi corner because I was curious and it hasn't exploded yet16:01
jeblairclarkb: i am not using it because it exploded.16:01
clarkbthat and suse seems to really like it and I installed suse16:01
clarkbjeblair: ya I only trust it on the battery backed up static host. Definitely not on my laptop16:01
clarkbfor certain values of trust16:01
fungiyeah, i had btrfs explode on me once and was entirely unrecoverable (had to restore from nightly backups)16:01
mhaydenjeblair: should i open a bug for the sudo + tox issue?16:02
fungibut that was also back before there was a fsck or any recovery tools16:02
jeblairfungi: yes, btrfs was also part of my restore test strategy.16:02
pabelangerifat_afek: okay, I don't see any logs for your most recent post jobs yet16:02
jeblairmhayden: well, i was hoping that we could discuss solutions to that, but mordred isn't around and he was heavily involved in setting that stuff up originally16:03
*** dave-mccowan has joined #openstack-infra16:03
pabelangerifat_afek: however, i did confirm the project should be building releasenotes16:03
jeblairmhayden: i guess why don't you put it on the zuulv3 issues etherpad under the "Issues with Jobs" section16:03
clarkbinfra-root I think we should do something like this: for each executor: gracefully stop zuul executor, unmount /var/lib/zuul, mkfs.ext -i 4096, mount -a, start zuul executor16:03
clarkber mkfs.ext4 -i 4096 /dev/xvde216:04
jeblairmhayden: aside from just doing it, that's probably the best way to avoid losing track of it16:04
pabelangerifat_afek: is there anything ready to merge now?16:04
clarkbjeblair: mhayden fwiw my initial impression is that isn't a bug, instead I think we need something that is tox but also not unittests16:04
ifat_afekpabelanger: yes, should I do workflow+1 so we can check? there is one change that should be approved16:05
clarkbjeblair: so maybe we have tox-unittests and tox-vanilla?16:05
clarkbI guess the bug would be in the organization16:05
*** slaweq has joined #openstack-infra16:05
pabelangerifat_afek: sure, I just need to know the patch you are going to +A16:05
jeblairclarkb: ya16:05
fungijeblair: it was part of mine too, but luckily my strategy involved making a btrfs snapshot and then backing that up to another system over the network for extra insurance16:05
ifat_afekpabelanger: that would be https://review.openstack.org/#/c/512206/16:06
jeblairpabelanger: were you doing something with tox-siblings before?16:06
jeblairah i found it on the etherpad16:07
SamYapleim not able to access my secret. i defined the secret in .zuul.yaml and try to access it in a playbook and it is not defined.16:07
jeblaira new tox-siblings error appeared under triage, i'm moving it to job issues16:07
SamYapledoes the secret need to be defined in project-config?16:07
*** udesale has quit IRC16:07
fungijeblair: thanks, i just spotted the entry mordred had for what looks like probably the same problem16:07
pabelangerjeblair: yah, I just reported the error, think mordred is working on fix16:08
dhellmannpabelanger, ifat_afek : I'm just catching up here. Let me know if I can help at all.16:08
*** d0ugal has quit IRC16:08
jeblairfungi: i think that may have been you; see line 107 now16:08
pabelangerifat_afek: thanks16:08
jeblairfungi: ya16:08
mhaydenjeblair / clarkb: can do, thanks for looking into it16:08
jeblairSamYaple: no, a secret can be defined anywhere16:08
clarkbSamYaple: the consumption of the secret has to be merged before it will work16:08
fungijeblair: yep, i saw you move it--thanks!16:08
pabelangerifat_afek: ready here16:08
pabelangerdhellmann: sure, hope to know more in a few moments16:08
jeblairSamYaple: point me at patches if you got 'em16:09
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Switch legacy-ansible-func to voting  https://review.openstack.org/51233416:09
clarkbjeblair: reading the zuul-executor init script it isn't clear to me that there is a way to gracefully stop that process. Is that something we are able to do?16:09
jeblairclarkb: nope.  implement graceful is a "# TODO"16:09
*** amoralej has joined #openstack-infra16:09
*** tmorin has quit IRC16:09
fungiclarkb: i think the changes merged to make running jobs reenqueue on abort from an executor restart at least?16:09
jeblairfungi, clarkb: yes16:10
*** yamamoto has joined #openstack-infra16:10
jeblairclarkb, pabelanger: take a look at https://review.openstack.org/512158 and let me know if you want to merge that before we do an executor restart16:10
clarkbinfra-root knowing ^ should I just go ahead and stop an executor now (probably 03 since that is where I've been working) and reformat the xvde2 partition?16:10
* clarkb looks16:10
fungiclarkb: that seems prudent16:10
pabelangerclarkb: looking16:11
odyssey4meI've popped another issue - not urgent into the 'needs triage' queue on the etherpad.16:11
SamYaplejeblair: secret declared: https://github.com/openstack/loci/blob/master/.zuul.yaml#L4916:11
jeblairi can also see the inode issue being too urgent to slow down with that patch; i just don't have a relative sense for how often we're seeing inode failures vs ssh failures and how to prioritize16:11
SamYaplejeblair: secret in job: https://github.com/openstack/loci/blob/master/.zuul.yaml#L17416:11
clarkbjeblair: the 60 second hold on the jobdir will consume inodes that we know are currently more precious than we'd like16:11
SamYaplejeblair: secret used: https://github.com/openstack/loci/blob/master/playbooks/loci-builder.yaml#L7316:11
SamYapleplaybook doesn't run that task because secret is not defined16:12
jeblairclarkb: yes, but we won't run that change before your inode reformat16:12
ifat_afekpabelanger: I approved https://review.openstack.org/#/c/512206/16:12
*** trown is now known as trown|lunch16:13
pabelangerjeblair: clarkb: I haven't seen any POST_FAILURES this morning. But haven't been looking too much.16:13
*** mat128 has quit IRC16:13
pabelangerbut, happy to try the patch if we want16:13
jeblairpabelanger, clarkb: if things aren't on fire due to ssh connection issues, let's defer it until the next restart and only do inodes for now.16:13
*** dhinesh has joined #openstack-infra16:14
pabelangersure, I'm okay with that16:14
jeblairSamYaple: what job are you looking at?16:14
clarkbjeblair: ok that works for me16:14
SamYaplejeblair: any of the publish jobs16:15
SamYaplejeblair: see http://logs.openstack.org/74/743fdb397cf9d0e80ba00bd1b4ef85d1bd41e2ec/post/publish-loci-cinder/43fa65d/job-output.txt.gz#_2017-10-16_07_57_51_06233916:15
dansmithis the v2->v3 redirect for the status page not in place?16:15
SamYaplethe when conditional in this case is "is secret defined"16:15
*** yamamoto has quit IRC16:15
clarkbjeblair: before I start reformating things one question about the ssh wrapper. Isn't a single bwrap used per job? which means it could start the control process and persist across all of the playbook invocations?16:15
openstackgerritJulien Danjou proposed openstack-infra/project-config master: Ceilometer: disable Tempest tests for master and Queens  https://review.openstack.org/51234216:15
clarkbdansmith: it is, but I had to run a hard refresh to get it16:15
dansmithclarkb: I don't seem to be getting status.json from it like I did last time16:16
*** armax has joined #openstack-infra16:16
dansmithclarkb: no caching going on there, of course16:16
clarkbdansmith: it redirects you to zuulv316:16
clarkbdansmith: so you should get a 30116:16
dhellmannis the "MERGER_FAILURE" error condition defined somewhere?16:16
SamYapledansmith: i had to update the script to get it to work16:17
dansmithclarkb: hard refresh gives me the redirect in a browser, but not status.json.. this worked in the first v3 switch16:17
dansmithSamYaple: update how? maybe this is related to the https thing?16:17
*** dbecker has quit IRC16:17
SamYaplei pointed it at zuulv3 directly16:17
pabelangerdhellmann: yah, see zuulv3-issues etherpad. TL;DR: we think it is because we ran out of inodes on zuul-executor. clarkb is working on it16:17
dhellmannah16:17
dansmithSamYaple: okay last time the redirect worked so we don't have to change again16:18
*** dhinesh has quit IRC16:18
clarkboh before I stop ze03's executor, do I need to worry about caching the git repos?16:18
clarkbjeblair: ^16:18
SamYapledansmith: yea i know. dont know whats up. it might be related to https?16:19
SamYapledansmith: you can try removing it and see if it follows links after that16:19
SamYapledansmith: its probably the fact that the 301 redirect is only setup on port 8016:19
dansmithah, clarkb^ ?16:19
fungiodyssey4me: looking at the command task, using == for string comparison is not posixly-correct. as to why this seems to be getting executed under dash instead of bash, i'm less sure16:20
jeblairSamYaple: ah, i think i see the problem.  secrets bind tightly to playbooks in job definitions.  so you have to have the playbook that uses the secret in the same job definition that uses the secret.  for example: https://git.openstack.org/cgit/openstack-infra/project-config/tree/zuul.d/jobs.yaml#n88 uses the tarballs secret in its post-run playbook.  then other jobs inherit from that.16:20
dhellmannpabelanger : thanks16:20
SamYapledansmith: clarkb **im guessing. i havent confirmed the 301 is only on port 8016:20
*** markvoelker has joined #openstack-infra16:20
jeblairclarkb: one bwrap per playbook, many per job.16:20
odyssey4mefungi any shell task should ideally specify the environment under which it should run16:21
*** jpena|brb is now known as jpena16:21
*** lucasagomes is now known as lucas-afk16:21
clarkbjeblair: ah that explains my confusion.16:21
fungiodyssey4me: agreed, and i thought we usually set it to bash16:21
*** rloo has joined #openstack-infra16:21
clarkbSamYaple: dansmith fungi put the redirect in place and may know more16:21
odyssey4mefungi one would hope so ;)16:21
pabelangerSamYaple: I'm not sure we have https on status.o.o, I've always used http16:21
SamYaplejeblair: oh. so all i would need to do is redeclare the playbook in that job definition. let me try that16:21
SamYaplepabelanger: you do16:21
SamYaplesigned cert and all16:22
SamYaplewell for zuul.o.o, not zuulv3 yet16:22
fungiSamYaple: dansmith: oh, possible i missed the https vhost block. will check16:22
*** d0ugal has joined #openstack-infra16:22
robcresswello/ I don't suppose anyone has any insight into why https://review.openstack.org/#/c/512333 is incorrect? Thought I'd try my hand at porting some things to v3 config files.16:22
jeblairclarkb: saving /var/lib/zuul would be nice16:22
jeblairclarkb: should have enough space on / to hold it once executor is stopped16:22
*** e0ne has quit IRC16:23
openstackgerritTerry Wilson proposed openstack-infra/project-config master: Fix networking-ovn openstack-tox-cover job  https://review.openstack.org/51234716:23
*** iyamahat has quit IRC16:23
clarkbjeblair: ok16:23
*** egonzalez has quit IRC16:23
jeblairrobcresswell: i'll take a look16:23
* clarkb updates etherpad with an rsync back and forth16:23
jeblairclarkb: or a tar?16:23
robcresswelljeblair: Much obliged :)16:23
pabelangerSamYaple: https://review.openstack.org/511067/ was the revert16:23
fungiodyssey4me: in that playbook, it looks like the first cmd task does specify "executable: /bin/bash" but the other 3 do not16:23
*** e0ne has joined #openstack-infra16:24
odyssey4mefungi which playbook is that? I can fix it up quick16:24
pabelangerSamYaple: should be able to see in system-config if we had redirect on 443 before16:24
*** e0ne has quit IRC16:24
fungiodyssey4me: openstack-infra/openstack-zuul-jobs/playbooks/legacy/ansible-func/run.yaml16:24
SamYaplepabelanger: the issue is a script that dan wrote we updated in this timeframe to use https over http, so the redirect worked last time and not this time16:24
clarkbjeblair: rsync is easy I think? I dunno its what I've written down in the etherpad, can you see if that looks sane to you?16:25
*** dizquierdo has joined #openstack-infra16:25
*** e0ne has joined #openstack-infra16:25
toskyuhm, if I use the devstack job (or a job which inherits it), and I specify vars.devstack_localrc, should I rewrite the entire content of the base job, or will the content be merged with the content from the base job?16:25
jeblairclarkb: i think you would make the rsync authors so happy if you told them it was easier than tar.  :)16:26
*** e0ne has quit IRC16:26
clarkbjeblair: ha16:26
*** yamahata has quit IRC16:26
pabelangerSamYaple: which URL did you say https worked on?16:27
*** e0ne has joined #openstack-infra16:27
fungiSamYaple: dansmith: should be redirecting correctly now16:27
dansmithconfirmed16:27
fungii had indeed only added it for http and not https16:27
SamYaplepabelanger: zuul.o.o or zuulv3.o.o (though only zuul.o.o has a valid cert)16:27
pabelangerOh16:27
SamYaplei have predicted a thing!16:27
*** e0ne has quit IRC16:27
*** slaweq has quit IRC16:27
SamYaplethanks fungi16:27
pabelangeryah, i think fungi just manually changed them16:27
pabelangerI was looking on status.o.o16:27
SamYapleah16:27
fungiyeah, i'm redirecting from https://zuul.openstack.org/ to http://zuulv3.openstack.org/ in the apache config there now16:27
*** slaweq has joined #openstack-infra16:27
*** gongysh has quit IRC16:28
jeblairclarkb: i made some updates16:28
clarkbjeblair: I have bad memories of tar in subshells piped through sockets to do what essentially rsync -a does16:28
clarkbjeblair: I see them thanks16:28
jeblairclarkb: erm i wasn't suggesting anything complicated16:28
SamYapledansmith: we might as well revert the ssl patch if we are just redirecting from https to http16:28
clarkbjeblair: I know, its just why I tend to go for rsync in these cases regardless of complexity16:28
*** e0ne has joined #openstack-infra16:28
*** ociuhandu has quit IRC16:28
jeblairclarkb: just tar cvf /root/zuul.tar /var/lib/zuul16:28
*** e0ne has quit IRC16:29
SamYaplejeblair: https://review.openstack.org/#/c/512348/1/.zuul.yaml i am consuming the secret in the post playbook, so this looks right, correct?16:29
jeblairor since it's going to be a lot of files: tar cf /root/zuul.tar /var/lib/zuul16:29
*** e0ne has joined #openstack-infra16:29
fungiSamYaple: i _could_ redirect from https to https, but sending people to a server with a temporary cert seems messy16:29
*** chlong has quit IRC16:29
*** e0ne has quit IRC16:29
clarkbinfra-root I think I am ready to begin the filesystem work on ze03, I will be stopping zuul executor there nowish unless you have further input (steps are on the etherpad)16:30
SamYaplefungi: reasonable. i just updated the script im running to go to zuulv3 directly, i dont want to have the overhead of a redirect each time personally16:30
*** xarses has quit IRC16:30
SamYaplewill zuulv3 eventually be just zuul.o.o ? or will it always redirect?16:30
fungiSamYaple: sure, the redirect was intended more as a convenience so that people who were consuming the status.json didn't get an unpleasant surprise16:30
*** cdent has joined #openstack-infra16:30
fungiSamYaple: zuulv3 will (likely somewhat soon) be renamed to zuul.openstack.org16:31
jeblairrobcresswell: left comment pointing out the error16:31
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/openstack-zuul-jobs master: Ensure func->functional toe env mapping works  https://review.openstack.org/51235116:31
SamYaplecool. then ill just ride out the storm :)16:31
cdentHello openstack infra stalwarts, I don’t know if this is a thing you want to know about, but just saw a POST_FAILURE on https://review.openstack.org/#/c/510626/16:31
fungiSamYaple: the zuulv3.openstack.org subdomain name is not meant as a long-term stable interface16:31
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/openstack-zuul-jobs master: Ensure func->functional tox env mapping works  https://review.openstack.org/51235116:31
clarkbok not hearing objections so here goes16:31
SamYaplegot it16:31
fungicdent: it absolutely is16:32
*** sbezverk has joined #openstack-infra16:32
fungilooking into it now to see if it's a known issue16:32
cdentfungi: well then, I’m glad I mentioned it16:32
fungiyes, thanks!16:32
pabelangercdent: ++16:32
robcresswelljeblair: Ah dammit, sorry, I should've caught that. Thanks for your time.16:32
pabelangerclarkb: ++16:32
pabelangercdent: sorry16:32
clarkbok process isn't actually stopping, I think jeblair said last night that was because things keep the socket for console logs open?16:33
clarkbjeblair: do I manually kill the child zuul-executor in that case?16:33
*** jogo has joined #openstack-infra16:33
jeblairclarkb: how did you stop it?16:33
clarkbjeblair: service zuul-executor stop16:33
clarkbso via the init script16:33
cdentfungi: I was hoping to get to use finger there (because that would be fun), but sadly no data16:33
jeblairclarkb: on 03?16:33
clarkbjeblair: yes16:33
cdentpabelanger: no worries, clarkb is practically my brother16:33
fungicdent: aha, the executor for that build was struggling with inode capacity in its scratch space16:33
fungicdent: we're presently working to remedy that specific case16:34
* cdent shakes fist at inodes 16:34
cdentcool, glad to hear it is somewhat under control16:34
pabelangeryah, inodes would be my guess also16:34
fungicdent: so basically it didn't have room to retrieve the logs from the job node16:34
* cdent nods16:34
fungipabelanger: i looked in the logs on ze03 to confirm that one16:34
jeblairSamYaple: left comment on 51234816:34
pabelangerfungi: ++16:35
*** slaweq has quit IRC16:35
*** tesseract has quit IRC16:35
jeblairclarkb: the finger streaming issue should only leave us with some simple, idle zuul-executor processes.  but it looks like we still have jobs running16:35
clarkbjeblair: ya I'm noticing disk usage hasn't fallen dramatically16:36
fungipabelanger: cdent: 2017-10-16 14:40:08,645 DEBUG zuul.AnsibleJob: [build: 6875450d24054b4ab5b44ef331a31bde]           msg: 'Unable to write to compressed file: [Errno 28] No space left on device:16:36
clarkbit is falling though16:36
jeblairclarkb: i see 'ABORTED' lines in the log16:36
jeblairclarkb: i think we're just seeing the executor stop16:36
jeblairwhich, under load, is not immediate.  :)16:36
clarkbjeblair: ok I shall practice patience then, thank you for looking16:36
*** ifat_afek has left #openstack-infra16:36
*** ifat_afek has joined #openstack-infra16:36
SamYaplejeblair: ok. otherwise the patchset looks good?16:36
*** jpich has quit IRC16:37
*** xarses has joined #openstack-infra16:37
jeblairSamYaple: i haven't really internalized the whole thing to say whether that's the best approach, but i believe aside from that, it will work.  :)16:37
*** sambetts is now known as sambetts|afk16:37
*** shardy has quit IRC16:39
jeblairclarkb: looks like it stopped16:39
*** dbecker has joined #openstack-infra16:39
SamYaplewill give it a shot16:39
SamYapleits always easier to iterate from something thats working :)16:39
clarkbjeblair: ya but still using 25GB of disk? I'm checking what is using that really quickly16:40
clarkbjeblair: also 2.2 million inodes. If we've leaked that it could potentially contribute to the problem we are seeing16:40
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci master: Update CI links to use zuul user  https://review.openstack.org/51235816:41
*** milan_ has quit IRC16:42
openstackgerritJavier Peña proposed openstack-infra/zuul feature/zuulv3: Fix ref calculation for ZUUL_CHANGES legacy var  https://review.openstack.org/51235916:42
clarkbjeblair: /var/lib/zuul/builds has 16GB left in it16:42
pabelangerjust noticed that too16:42
*** ijw has quit IRC16:42
fungiclarkb: the only processes i see with open files on that fs now are your shell session16:43
clarkbexecutor-git is 8.3GB but I think we expect that as its the cache right?16:43
fungi(su and du)16:43
clarkbwe have builds from september 29th and october 2 and 1516:44
jeblairthere are a lot of builds16:44
openstackgerritDavid Shrewsbury proposed openstack-infra/zuul feature/zuulv3: New client command for printing autohold requests  https://review.openstack.org/51236016:44
jeblairsome of those may be left over from someone setting 'keep'16:44
pabelangeris it possible we had keep-jobs at some point?16:44
clarkbbasically half of our inodes are consumed by these builds16:45
jeblairthe ones from oct 15/16 seem likely to be bugs.  but they only total 4.16:45
jeblairthe rest are probably left over from keep.16:45
jpenapabelanger, dmsimard: I think https://review.openstack.org/512359 should fix the issue with the legacy tripleo jobs in https://review.openstack.org/51227016:45
clarkbjeblair: does that means I can delete them?16:45
pabelangerclarkb: we should consider checking ze09 too16:45
pabelangerthat was the other server to have issues16:46
clarkbpabelanger: plan is to do this for all the executors just one at a time so we don't have an outage16:46
clarkbI can start with 09 next16:46
jeblairclarkb: yep; delete them all16:46
pabelangerjpena: please add it to https://etherpad.openstack.org/p/zuulv3-issues so we can get more eyes on the issue16:46
clarkbjeblair: ok am deleting, then will backup16:46
* clarkb updates etherpad first16:46
*** iyamahat has joined #openstack-infra16:46
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci master: Switch cistatus page to zuul v3  https://review.openstack.org/50854616:47
jeblairjpena: oops, thanks16:47
*** andreww has joined #openstack-infra16:48
jeblairjpena, pabelanger: that's going to require an executor restart16:48
jpenapabelanger: it's there already16:48
jeblairthe good news is that clarkb is doing those right now16:48
*** andreww has quit IRC16:48
clarkbfirst one is slow but I expect subsequent restarts to go much quicker :)16:48
jeblairif we can merge that very quickly, we might even be able to sneak it into the restart of ze0316:48
pabelangerKK, let me +3 then :)16:48
*** xarses has quit IRC16:49
jeblairhow about i directly enqueue that into gate?16:49
clarkbjeblair: the other weird thing I noticed is /var/lib/zuul/executor.socket should that be cleaned up when the executor stops?16:49
pabelangerjeblair: efm16:50
pabelangerwfm*16:50
jeblairclarkb: ideally, it make suffer from the same perm issue that causes it to be unable to delete its pid16:50
*** andreww has joined #openstack-infra16:50
jeblairs/make/may/16:50
jeblair512359 is in gate16:50
clarkbjeblair: should I delete it too for cleanliness?16:51
jeblairclarkb: shouldn't matter16:51
clarkbok16:51
*** links has joined #openstack-infra16:51
*** wolverineav has quit IRC16:52
pabelangergoing to see why cacti.o.o isn't showing 2nd HDD for ze01 (all executors actually)16:52
clarkbbacking up /var/lib/zuul on ze03 now16:54
*** markmcclain has quit IRC16:55
* clarkb grabs tea while waiting16:55
*** markmcclain has joined #openstack-infra16:56
*** slaweq has joined #openstack-infra16:57
*** derekh has quit IRC16:58
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Fix ref calculation for ZUUL_CHANGES legacy var  https://review.openstack.org/51235917:00
openstackgerritMerged openstack-infra/infra-manual master: Zuulv3: Add note about in-repo stable config  https://review.openstack.org/51232817:01
jeblairclarkb: i'm running kick.sh on ze0317:02
dansmithclarkb: fungi: sorry for the curt response, I was on a call.. but, status feed is working via https again, thanks a lot for whatever you did17:03
*** slaweq has quit IRC17:04
jeblairclarkb: it's complete; zuul is upgraded on ze0317:04
clarkbjeblair: thanks will continue with fs there shortly17:05
*** cdent has left #openstack-infra17:05
*** trown|lunch is now known as trown17:06
*** yamahata has joined #openstack-infra17:06
jeblairrunning kick on all the other ze's now17:06
*** sree has quit IRC17:07
*** gordc has joined #openstack-infra17:08
chandankumarclarkb: hello17:08
*** sree has joined #openstack-infra17:08
chandankumarclarkb: i logged out early on friday17:08
clarkbok var lib zuul backup is done17:10
clarkbI am unmounting and reformatting now17:10
clarkbchandankumar: did you see the questions I had? We will need to get answers for those to see what the best way forward is17:10
clarkbchandankumar: basically what account id do you have logging in and what ssh username do you use to push code17:10
chandankumarclarkb: ssh username chkumar246 and Account ID 1239317:11
*** chlong has joined #openstack-infra17:11
*** sree has quit IRC17:11
*** sree has joined #openstack-infra17:12
*** yamamoto has joined #openstack-infra17:12
clarkbmkfs resulted in 17751479 blocks with 18972672 inodes so we have about 1 million too many inodes17:12
clarkbinfra-root ^ do we want to try tuning that or call it good and move on?17:13
clarkbI could do -N 18000000 ?17:13
clarkbchandankumar: ok once a gerrit admin has a moment we will have to see how those two accounts map onto those items and see what we can do to fix the accounts17:14
clarkbchandankumar: currently in the middle of filesystem reformatting for zuul17:14
chandankumarclarkb: no problem thanks, :-) take your time :-)17:14
clarkbit made the fs 65GB too17:15
clarkbdown from 72GB17:15
clarkber I guess 68GB down from 72GB17:16
clarkbfungi: jeblair ^ any thoughts on if we want to tune that better?17:16
andreafmordred: is there a way with os_* ansible modules to take a snapshot of a VM? http://docs.ansible.com/ansible/latest/list_of_cloud_modules.html17:16
jeblairclarkb: what led you to 4096 bytes per inode?17:16
clarkbjeblair: 4096 is the block size so we can only have as many files/inodes as blocks17:17
*** mat128 has joined #openstack-infra17:17
clarkbjeblair: unfortunately this doesn't account for the overhead of the inodes themselves on the filesystem so we end up with more inodes than usable blocks17:17
jeblairclarkb: yeah, but 4096 isn't a typical size for us17:17
gordchi, just curious, is there any way to support OVERRIDE_<projectname>_PROJECT_BRANCH functionality in zuulv3?17:17
*** yamamoto has quit IRC17:18
jeblairgordc: yes: https://docs.openstack.org/infra/zuul/feature/zuulv3/user/config.html#attr-job.override-branch17:18
clarkbjeblair: we had 4.7 million or so inodes previously or about 1/4 the total on the current fs17:18
jeblairgordc: and https://docs.openstack.org/infra/zuul/feature/zuulv3/user/config.html#attr-job.required-projects.override-branch17:18
clarkbjeblair: since we were using about half the disk at full inode I could set it up to double the number of inodes rather than 4x them17:19
jeblairclarkb: and that was with the fs 59% full, so we really only need to double... right that17:19
clarkbI think that means we'd roughly run out of disk and inodes at the same time17:19
clarkbok let me rerun with -i 819217:19
clarkbetherpad has been updated17:19
openstackgerritPaul Belanger proposed openstack-infra/system-config master: Add field name check to add_ds_graph  https://review.openstack.org/51236717:19
*** jpena is now known as jpena|off17:20
*** Apoorva has joined #openstack-infra17:20
clarkb18344375 blocks and 9486336 inodes now17:20
clarkbthat looks better17:20
clarkband down to 70GB from 72GB which looks about right based on how we have been halving things. Ok moving forward with this17:21
jeblairclarkb: ++17:21
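The two mkfs results quoted above are consistent with a simple rule of thumb: one inode per bytes-per-inode of device size. A minimal sketch of that arithmetic follows, assuming the device size can be inferred from the first run (18972672 inodes at 4096 bytes each); the function name is hypothetical and real mke2fs rounding may differ slightly on other devices.

```python
def inode_count(device_bytes, bytes_per_inode):
    """Approximate inode count for a given bytes-per-inode ratio (mkfs -i)."""
    return device_bytes // bytes_per_inode


# Device size inferred from the first mkfs run in the log (assumption):
# 18972672 inodes were created at one inode per 4096 bytes.
DEVICE_BYTES = 18972672 * 4096

inodes_4k = inode_count(DEVICE_BYTES, 4096)
inodes_8k = inode_count(DEVICE_BYTES, 8192)

print(inodes_4k)  # 18972672, matches the first mkfs run
print(inodes_8k)  # 9486336, matches the run with -i 8192
```

Doubling the ratio halves the inode table, which is why the second run reports about half the inodes and slightly more usable blocks.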
clarkbbackup is restoring17:22
*** sree has quit IRC17:23
*** amoralej is now known as amoralej|off17:24
*** camunoz has quit IRC17:24
gordcjeblair: cool cool. i'll take a look.17:24
gordcthanks17:24
toskywhat is a "MERGER_FAILURE" error? ( https://review.openstack.org/#/c/507080/ )17:24
*** sree has joined #openstack-infra17:25
jeblairtosky: at the moment, likely the issue clarkb is fixing -- our executors are running out of inodes17:25
toskyjeblair: oh, ok, I didn't check the backlog17:25
toskyI will wait then17:25
clarkbbackup restore is complete17:26
clarkbjeblair: anything else you want me to do before restarting zuul-executor?17:26
jeblairclarkb: nope17:26
clarkbok starting now17:26
jeblairtosky: feel free to go ahead and recheck, the odds are still pretty good (and increasing!) that you won't hit it.17:26
toskyjeblair: okidoki, rechecking, thanks!17:27
*** slaweq has joined #openstack-infra17:28
clarkb2017-10-16 17:27:56,955 DEBUG zuul.AnsibleJob: [build: b7671b4998b54836abc8fbf872789fd7] Ansible complete, result RESULT_NORMAL code 017:28
clarkbI think that means it is working17:28
jeblairclarkb: ++17:28
clarkbI'm going to move on to 09 now if anyone else wants to monitor 03 for problems17:28
jeblairclarkb: i'll tail logs there for a bit17:28
clarkbthanks17:28
*** ijw has joined #openstack-infra17:29
*** tosky has quit IRC17:29
jeblairwow it's busy :)17:29
*** sree has quit IRC17:29
clarkbzuul-executor is stopping on 09 now17:30
*** slaweq has quit IRC17:32
*** ijw has quit IRC17:33
EmilienMproject-config/zuul/layout.yaml is only used by zuulv3, correct?17:35
EmilienMerr17:35
EmilienMby zuul v2 I mean17:35
clarkbyes17:35
clarkbto v217:35
EmilienMok thx17:35
fungiEmilienM: also known as "unused" at this stage17:36
fungi(zuul v2 is not presently running at all)17:36
EmilienMfungi: wait, aren't we running jenkins? I'm confused17:36
* EmilienM monday jokes17:36
fungiEmilienM: happy monday!17:36
AJaegerjeblair: regarding releasenotes: Did we push out some releasenotes without root-marker? That might explain it ifat_afek' problem. We fixed root-marker today (?). So, why not push up a new change and see whether it appears again?17:37
pabelangerjeblair: when you have time for non zuulv3 things, I'd most welcome a review on https://review.openstack.org/512367 for smaller cacti emails for crontab17:37
jeblairAJaeger: pabelanger was looking into that; may have more info17:38
inc0good morning ... need your help folks https://review.openstack.org/#/c/508661/17:38
inc0ahh17:38
inc0disregard that ^17:38
inc0I was looking at wrong change:(17:38
pabelangerAJaeger: we did update releasenotes job root-marker yesterday, it is possible I did it incorrectly. That was in project-config17:39
pabelangerAJaeger: if you could help find the post jobs logs for https://review.openstack.org/#/c/512206/, that would be awesome17:41
*** links has quit IRC17:42
*** dhinesh has joined #openstack-infra17:42
*** dave-mccowan has quit IRC17:42
odyssey4mepabelanger do you have any objections to me switching up zuul.d/zuul-legacy-jobs.yaml to make all legacy-ansible-func-.* jobs use the same set of playbooks, instead of the current mechanism which has one per distro?17:43
pabelangerodyssey4me: well, you likely want to just copy them inrepo and start modifying them17:44
pabelangerodyssey4me: then we can delete the legacy jobs17:44
odyssey4mepabelanger alright, I guess we can do that too17:44
AJaegerpabelanger: looking17:45
*** esberglu has joined #openstack-infra17:45
odyssey4mepabelanger I was hoping to get into a working state before doing that17:45
odyssey4memuch of the brokenness there now is due to the auto-conversion17:45
*** smatzek has joined #openstack-infra17:45
pabelangerodyssey4me: what is the issue you are having?17:45
AJaegerquestion for the team: Is this known "build-tox-manuals-checkbuild build-tox-manuals-checkbuild : ERROR Unable to find playbook /var/lib/zuul/builds/"17:45
pabelangerAJaeger: where did you see that?17:45
AJaegerpabelanger: http://logs.openstack.org/29/29388b98f9999e4f98aaa4f421c090fb580aab10/post/publish-openstack-releasenotes/045cdd0/ is the post log17:46
AJaegerpabelanger: https://review.openstack.org/512370  is the error17:46
odyssey4mepabelanger none of the functional tests are working quite right - turns out I'll end up having to do https://review.openstack.org/#/c/512351/2/playbooks/legacy/ansible-func/run.yaml across many, many jobs17:46
ifat_afekpabelanger: https://review.openstack.org/#/c/512206/ was merged, and now https://docs.openstack.org/releasenotes/vitrage/pike.html is ok17:46
AJaegerpabelanger: sorry, need to step out quickly...17:46
odyssey4mebut they all work the same, so I can also just reduce them down to a smaller set17:46
pabelangerAJaeger: Yah, that likely means the post-run path in zuul.yaml is not correct17:47
pabelangerodyssey4me: yah, I'd just make the same change across all run playbooks for now. We don't really want to refactor legacy jobs longer than needed17:48
odyssey4mepabelanger okie dokey17:48
*** slaweq has joined #openstack-infra17:48
pabelangerodyssey4me: did you want to update that patch or just do a follow up?17:48
odyssey4mepabelanger I'll do a follow up17:48
pabelangerifat_afek: Yay, but still trying to see why that is17:49
pabelangerodyssey4me: kk, +317:49
odyssey4meta17:49
inc0ok I'll need help after all - NODEPOOL_TARBALLS_MIRROR <- how to get url to local tarballs in zuulv3?17:52
inc0zuul_site_mirror_fqdn17:52
inc0?17:52
*** slaweq has quit IRC17:53
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Ensure func->functional tox env mapping works  https://review.openstack.org/51235117:53
dtantsurso, is the release notes problem known? our gate (not check?) jobs seem to fail with RETRY_LIMIT17:54
dtantsuror was it a check job.. anyway, a lot of these recently17:54
pabelangerokay, I finally figure out why cacti is having a syntax error in our emails17:55
clarkb09 is starting again17:55
jeblairAJaeger: sent email to -dev about stable branch project stanza backports17:55
fungidtantsur: yeah, for projects using nonstandard tox configuration there's an issue with the tox-siblings pre task causing the job to get repeatedly retried. it's listed under "issues with jobs" (currently bullet #6 i think?) at https://etherpad.openstack.org/p/zuulv3-issues17:55
pabelangerwhen we try to add graphs for puppetmaster.o.o, it is also matching on ci-puppetmaster.o.o and old.puppetmaster.o.o.  So, I think we can just delete them17:55
dtantsurack, thanks fungi17:55
jeblairfungi: looks like we should merge those 2 changes17:56
fungijeblair: yeah, i was just pulling them up now17:56
pabelangeror make the matching logic better17:56
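The over-matching described above is the usual substring-versus-exact-field problem. A minimal sketch of stricter matching, assuming the host listing is tab-separated "id, hostname" lines (an assumption about the format; the sample data and the `find_host_id` helper are made up for illustration and are not the actual cacti script):

```python
def find_host_id(listing, hostname):
    """Return the id whose hostname field equals hostname exactly, else None."""
    for line in listing.splitlines():
        fields = line.split("\t")
        # Compare the whole field rather than searching for a substring.
        if len(fields) >= 2 and fields[1] == hostname:
            return fields[0]
    return None


# Hypothetical listing; ids and layout are invented for this sketch.
LISTING = (
    "12\tci-puppetmaster.openstack.org\n"
    "13\told.puppetmaster.openstack.org\n"
    "14\tpuppetmaster.openstack.org\n"
)

target = "puppetmaster.openstack.org"

# A naive substring search matches all three hosts.
substring_matches = [l for l in LISTING.splitlines() if target in l]

exact_id = find_host_id(LISTING, target)
```

Anchoring on the field separator keeps the older hosts in place while still picking out a single id.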
jeblairdtantsur: can you Depends-On: 512325 and 512327 and let us know if it fixes things for you?17:58
fungia depends-on the latter should be sufficient since its parent is the former17:59
clarkb2017-10-16 17:58:54,090 DEBUG zuul.AnsibleJob: [build: 418f826adacf4f9086cde51dbd885aa5] Ansible complete, result RESULT_NORMAL code 017:59
clarkbwith that I'm moving on to 01-10 sequentially17:59
fungiclarkb: looks like good news!17:59
clarkbtracking the completion status on the etherpad17:59
fungiresult code 0 is the best result code17:59
dtantsurjeblair: I think both recent patches have W+1 already, so people may not like me modifying them..17:59
*** dave-mccowan has joined #openstack-infra17:59
*** psachin has quit IRC18:00
inc0anyone? how can I get url to local cached tarballs in zuulv3?:/18:00
fungiinc0: i think you want to add this role: http://git.openstack.org/cgit/openstack-infra/openstack-zuul-jobs/tree/roles/mirror-info18:01
jeblairdtantsur: okay, if you don't want to wait for a merge/recheck cycle, you could push up a new copy of a change.  or you could wait and see.  :)18:01
dtantsurjlvillal, rloo, re gate ^^^18:02
jeblairdtantsur: (just have to remove the change-id so a new one gets generated)18:02
inc0thanks fungi yeah18:02
dtantsurI'll leave it up for these folks, as I have to run now.. thanks jeblair18:02
fungiinc0: or access the mirror_fqdn variable and then create your own18:02
fungiinc0: (look at the script template in that role for an example)18:02
*** dtantsur is now known as dtantsur|afk18:02
*** ldnunes has quit IRC18:03
inc0yeah, I see it, thanks fungi18:03
SamYapleare the variables zuul_site_mirror_fqdn and mirror_fqdn different?18:03
SamYapleive been using zuul_site_mirror_fqdn18:03
inc0SamYaple: zuul_site.. is good one18:04
andreykurilinhi folks!  I got something strange - `ERROR Project openstack/manila does not have the default branch master` . Should I file it at etherpad or just ignore and make recheck?18:04
fungiandreykurilin: a url for the log where you saw that would be great18:04
andreykurilinfungi: https://review.openstack.org/#/c/512246/118:05
andreykurilinthere is no job log18:05
clarkb01 executor is stopping now18:06
jeblairandreykurilin: if you can add that to etherpad please, that would be great18:06
fungiandreykurilin: that's a failure mode i haven't seen before, thanks!18:06
fungilooks like the same job worked on that change in the check pipeline ~1.5 hours prior18:07
andreykurilinjeblair: np, will add :)18:07
andreykurilinfungi: yes, everything was ok at first check18:07
openstackgerritPaul Belanger proposed openstack-infra/system-config master: Match HOST_ID using tab character too  https://review.openstack.org/51237618:07
jeblairunfortunately, zuul did not output a finger url, so we don't have the build uuid18:08
jeblairthat will make it slightly harder to track down18:08
pabelangerfungi: jeblair: clarkb: ^non-zuulv3 related, but will help fix cacti email spam issue for me.  When you have a free moment18:08
openstackgerritMerged openstack-infra/zuul-jobs master: Work around non-standard envdir locations  https://review.openstack.org/51232518:10
*** baoli has quit IRC18:10
*** tosky has joined #openstack-infra18:11
AJaegereven a recheck does not help with https://review.openstack.org/#/c/512371/1 -  "build-tox-manuals-checkbuild build-tox-manuals-checkbuild : ERROR Unable to find playbook /var/lib/zuul/builds/"18:13
AJaegerjeblair: that is a backport of .zuul.yaml to a stable branch18:13
andreykurilinjeblair: I put it into issues with jobs section18:13
AJaegeradding to the etherpad now18:13
clarkbjeblair: do you think zuul would be ok with me taking two executors out of the rotation at the same time? 01 is stopping very slowly...18:13
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Add tripleo-quickstart jobs into tripleo change queue  https://review.openstack.org/51237818:13
pabelangerAJaeger: ^another tripleo change queue patch for gate18:14
*** yamamoto has joined #openstack-infra18:14
pabelangeradded both to zuulv3-issues etherpad18:14
jeblairclarkb: maybe?  probably the load governor will keep us from hitting inode problems more than normal.18:14
pabelangerAJaeger: if you have an idea how to better group jobs, I'd be happy to review too18:15
jeblairandreykurilin, fungi: no space left on device on ze03.  likely the inode issue which has since been corrected on that host.18:16
fungijeblair: yep, that was my conclusion as well18:16
*** markvoelker has quit IRC18:16
andreykurilinbut the error message looks quite strange:)18:16
pabelangermwhahaha: EmilienM: just a heads up, tripleo changes in gate are not in the same change queue right now, so it is possible for broken code to slip through. If you don't mind keeping an eye out on zuulv3.o.o gate pipeline and make sure your specific jobs are in the same shared queue would  be helpful18:16
*** markvoelker has joined #openstack-infra18:17
*** ldnunes has joined #openstack-infra18:17
jeblairandreykurilin: yes indeed.  :)18:17
clarkbon 03 inode usage percentage remains under disk usage18:17
fungithe inode exhaustion problem on the executors seems to have manifested in a number of different reporting behaviors18:18
clarkbya18:18
*** jcoufal has joined #openstack-infra18:18
pabelangerclarkb: did you by chance confirm 512359 was installed when you restarted ze03.o.o?18:18
clarkbpabelanger: jeblair said it was and I trusted him :)18:18
pabelangergreat!18:18
pabelangercleaning up etherpad then18:18
AJaegerpabelanger: no idea right now. Do we really need to group them explicitly now?18:18
clarkbpabelanger: well its not on all executors yet18:18
clarkbpabelanger: just 03 and 0918:18
clarkbworking on 01 now18:18
*** yamamoto has quit IRC18:19
jeblairAJaeger: re 512371 -- is that job defined in that repo?  if so, you may need to backport the playbooks too18:19
fungiclarkb: speaking of inode utilization, i'm still monitoring progress of the logs.o.o cleanup, and we're now using a lower percentage of inodes than blocks on that filesystem18:19
pabelangerAJaeger: yah, we do that with integrated today. We likely need to do something the same with tripleo moving forward18:19
clarkbfungi: woo18:19
jeblairfungi: fascinating18:19
pabelangerfungi: yah, I see that this morning, great news18:20
AJaegerjeblair: yes, there are playbooks with that job in the repo18:20
fungi64% space in use, 56% inode utilization18:20
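The comparison of block and inode utilization made here can be reproduced with `os.statvfs`. A minimal sketch, assuming a Unix host; the helper names are hypothetical, and the percentages may differ a little from `df` output because `df` accounts for reserved blocks.

```python
import os


def usage_percentages(path):
    """Return (block_pct, inode_pct) in use on the filesystem holding path."""
    st = os.statvfs(path)
    # Guard against filesystems that report zero totals.
    block_pct = 100.0 * (st.f_blocks - st.f_bfree) / st.f_blocks if st.f_blocks else 0.0
    inode_pct = 100.0 * (st.f_files - st.f_ffree) / st.f_files if st.f_files else 0.0
    return block_pct, inode_pct


def inodes_are_bottleneck(path):
    """True when inode utilization is ahead of block utilization."""
    block_pct, inode_pct = usage_percentages(path)
    return inode_pct > block_pct


if __name__ == "__main__":
    blocks, inodes = usage_percentages(".")
    print("space in use: %.0f%%, inode utilization: %.0f%%" % (blocks, inodes))
```

When `inodes_are_bottleneck` returns True, the filesystem will report "No space left on device" before the disk is actually full, which is the failure mode seen on the executors.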
AJaegerjeblair: ok, will try...18:20
pabelanger\o/18:20
EmilienMpabelanger: I see tripleo jobs in zuul v3 gate pipeline18:21
*** jcoufal_ has quit IRC18:21
EmilienMpabelanger: but in different queues18:21
pabelangerEmilienM: yes, but not in same change queue. look at quickstart and tripleo-heat-templates18:21
EmilienMyeah, I just saw it18:21
*** markvoelker has quit IRC18:21
pabelangerEmilienM: yah, thats the issue. have 2 patches up see etherpad18:21
pabelangerwe might need more18:21
EmilienMlet me see18:22
EmilienMpabelanger: ok I got it, I'll be careful18:22
pabelangerEmilienM: we likely need to create a project-template called tripleo-change-queue (or something) which has enough common jobs to be added to all tripleo / puppet projects.18:23
pabelangerunless there is something today you can think of18:23
clarkbpabelanger: I've +2'd your cacti script fix but not approved it in case you wanted jeblair to see it. But feel free to approve it if you like18:24
jeblairclarkb: you wrote "    Use hardlinks for the git repos (is this possible? I think we are on the same fs as the source repos)" can you elaborate?18:24
clarkbjeblair: when we copy all the git repos can we use hardlink copies? since the cost seems largely tied up in all the git repos in the builds18:25
jeblairclarkb: you mean when we create the git repos inside the jobdir work/ directories?18:25
clarkbjeblair: I think that if you use git clone with the hardlink option you can treat the local repo as different than the source even with hardlinks (it just won't be a perfect copy as you change it)18:25
clarkbjeblair: yes18:25
jeblairclarkb: we already do that18:25
clarkbah ok so this is already just all the unique bits and changes for each job?18:25
jeblairclarkb: yep18:25
fungiif it's a git clone of a file path git should already hardlink when it sees that is possible18:26
pabelangerclarkb: kk18:26
clarkbin that case increasing the number of inodes is almost definitely the best answer18:26
pabelangerteam, https://review.openstack.org/512308/ and https://review.openstack.org/512378/ could use a +3 to help tripleo jobs form a change queue in gate18:27
jeblairclarkb: some jobs still use a couple hundred MB of data for git repos, even though they hard-linked (for instance, a full devstack legacy repo job)18:27
clarkbjeblair: the paste.o.o link in the etherpad shows what inode usage looks like for a specific build18:28
clarkbbut ya18:28
jeblairclarkb: du will count those once18:28
openstackgerritMerged openstack-infra/zuul-jobs master: Ignore missing tox log directories  https://review.openstack.org/51232718:29
clarkbjeblair: ya I think du --inodes does the right thing?18:29
jeblairclarkb: basically, to figure out the delta, you need to du two copies of the same repo, then subtract the du for one copy18:29
jeblairclarkb: that will give you "incremental usage of things not covered by hard links"18:30
AJaegerjeblair: yes, that was it - backporting playbooks fixes it...18:30
jeblair(hard linked copies, obviously)18:30
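The subtraction described above can be sketched by counting each inode only once, which is how du treats hard links. This is a minimal illustration, not how Zuul measures anything; the function names are hypothetical and only regular file sizes are summed.

```python
import os


def unique_file_bytes(*roots):
    """Sum file sizes under roots, counting each (device, inode) pair once."""
    seen = set()
    total = 0
    for root in roots:
        for dirpath, _dirnames, filenames in os.walk(root):
            for name in filenames:
                st = os.lstat(os.path.join(dirpath, name))
                key = (st.st_dev, st.st_ino)
                if key not in seen:
                    seen.add(key)
                    total += st.st_size
    return total


def incremental_bytes(base, copy):
    """Bytes that the copy adds beyond what it shares with base via hard links."""
    return unique_file_bytes(base, copy) - unique_file_bytes(base)
```

For a hard-linked clone, `incremental_bytes(cache_repo, job_repo)` would report only the files that are not shared with the cache.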
openstackgerritAndreas Jaeger proposed openstack-infra/infra-manual master: mention playbooks for backporting  https://review.openstack.org/51238118:31
clarkbthe other thing that may potentially help here is repacking to reduce the number of small files per repo18:31
jeblairclarkb: oh, erm, we're talking about inodes.  inodes *are* the hard links.  :)18:31
clarkbjeblair: yes18:31
AJaegerjeblair: documented in 51238118:31
clarkbjeblair: I'm pretty sure du --inodes will do the right thing when run at the top level of the builds dir for example like in my paste18:31
jeblairclarkb: so yeah, we're not going to be able to use any fewer of those things.18:32
clarkbjeblair: ya I think repacking is likely the only thing to possibly use fewer18:32
clarkband maybe gc'ing if branches get deleted18:32
jeblairclarkb: well, regardless, the accounting issue only applies to block usage.  you still need just as many inodes for a hard-linked copy of a git repo18:32
openstackgerritMerged openstack-infra/project-config master: Add tripleo-quickstart jobs into tripleo change queue  https://review.openstack.org/51237818:33
clarkboh the way we do branch deletions is to make a tag so likely won't help to gc18:33
clarkb01 is still running :/18:33
openstackgerritAndreas Jaeger proposed openstack-infra/infra-manual master: Clarify job removal section  https://review.openstack.org/51230418:33
jeblairAJaeger: comments on 51238118:34
clarkbps -elf | grep zuul | wc -l is trending down so I guess I just need to keep waiting18:34
pabelangerEmilienM: okay, we have tripleo change queue now. So, should be able to start adding projects to it. We likely need to do that in project-config/zuul.d/projects.yaml for now18:35
openstackgerritAndreas Jaeger proposed openstack-infra/infra-manual master: mention playbooks for backporting  https://review.openstack.org/51238118:35
jeblairclarkb: i wonder if we're killing them one at a time18:35
pabelangerclarkb: yah, on a busy executor it took a while to stop for me too18:35
AJaegerjeblair: fixed, thanks18:35
pabelangerIIRC: pre abort, it was because post playbooks needed to run18:36
*** MasterOfBugs has joined #openstack-infra18:36
*** efried has left #openstack-infra18:39
*** pramodrj07 has joined #openstack-infra18:39
*** ociuhandu has joined #openstack-infra18:40
AJaegerteam, if you later find time: upstream-translation-jobs report "error" without any logs - mentioned on etherpad.18:41
* AJaeger calls it a day now18:41
clarkbwould there be objections if I started to do two executors concurrently? still waiting on 01 and wanting to wait less :)18:42
pabelangerno issue here, we seem to be handling jobs well right now18:43
clarkbok I'm going to start on 02 as well18:44
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Executor: stop jobs in parallel on shutdown  https://review.openstack.org/51238318:45
jeblairclarkb: speedification of your current task ^18:46
jeblairsomething to review while waiting :|18:46
openstackgerritgordon chung proposed openstack-infra/project-config master: install using pip  https://review.openstack.org/51238418:46
openstackgerritJames E. Blair proposed openstack-infra/infra-manual master: mention playbooks for backporting  https://review.openstack.org/51238118:47
openstackgerritMerged openstack-infra/project-config master: Add tripleo-common to tripleo change queue for gate  https://review.openstack.org/51230818:48
clarkbjeblair: thanks18:48
openstackgerritgordon chung proposed openstack-infra/project-config master: install gnocchi using pip  https://review.openstack.org/51238418:48
jeblairi'm afk for an hour or so for lunch18:49
clarkbmaybe I should do that while I wait18:49
*** slaweq has joined #openstack-infra18:50
pabelangergordc: fyi, we've migrated to zuulv3, so changes to JJB are frozen. You should be making the changes in the playbooks for your jobs.18:51
* gordc thinks about how to pretend like i read the zuulv3 emails.18:52
gordcok. i guess i'll go read them :P18:52
*** slaweq has quit IRC18:55
*** sree has joined #openstack-infra18:57
openstackgerritKazunori Shinohara proposed openstack-infra/project-config master: Follow up change for Heat Dashboard creation  https://review.openstack.org/51083518:57
jlvillalmordred: So the RETRY_LIMIT is fixed for the ironic-inspector releasenotes job.18:59
jlvillalmordred: But. Now I see a POST_FAILURE18:59
jlvillalmordred: Looking at: http://zuulv3.openstack.org/18:59
pabelangerhave a log?18:59
jlvillalFor https://review.openstack.org/51163419:00
jlvillalpabelanger: It hasn't finished. And if it has POST_FAILURE, doesn't that mean no log? :(19:00
pabelangerjlvillal: no, there might be a log19:00
jlvillalpabelanger: Ah: http://logs.openstack.org/34/511634/1/check/build-openstack-releasenotes/8c93973/19:00
pabelangerlooking19:01
jlvillalpabelanger: Related patch: https://review.openstack.org/#/c/512327/19:01
fungiif the post failure is for the log uploading task then there often won't be logs, but there are other post tasks which can fail and we'd still get logs19:01
jlvillalpabelanger: I think that got rid of the retry_limit19:01
jlvillalpabelanger: fungi: Thanks19:01
pabelangerrsync: change_dir "/home/zuul//src/git.openstack.org/openstack/ironic-inspector/.tox/releasenotes/log" failed: No such file or directory (2)19:01
pabelangerHmm, I think we might have a patch to fix that19:02
pabelangerchecking19:02
fungijlvillal: we just merged a couple of suspected fixes for the ironic-inspector (and similar) releasenotes jobs19:02
*** sree has quit IRC19:02
jlvillalpabelanger: https://review.openstack.org/#/c/512327/ was kind of like that. It merged.19:02
fungipabelanger: jlvillal: that particular problem should be addressed by https://review.openstack.org/51232719:03
fungiwhich merged at 18:29 utc, a little over half an hour ago19:03
jlvillalfungi: I did the check after that19:03
jlvillalfungi: I think it fixed the RETRY_LIMIT19:03
pabelangerfungi: jlvillal: no, we are running that patch I think19:03
pabelangerhttp://logs.openstack.org/34/511634/1/check/build-openstack-releasenotes/8c93973/ara/file/4c9220a4-5726-44db-86c0-bfcbe79a97c5/#line-3419:03
*** edmondsw has quit IRC19:04
fungithere was a patch just before that one which addressed the tox-siblings issue resulting in the retry_limit failures for ironic-inspector (and similar) release notes jobs19:04
*** edmondsw has joined #openstack-infra19:04
fungihttps://review.openstack.org/512325 which merged at 18:10 utc19:04
fungipabelanger: okay, so telling rsync to include --ignore-missing-args is insufficient i guess19:05
pabelangeryah19:05
pabelangeri think /home/zuul//src/git.openstack.org/openstack/ironic-inspector/.tox/releasenotes/log is missing19:05
pabelangerand ignore-missing-args is for inside the directory19:06
pabelangerI think we need to stat the folder first or use the glob match clarkb suggested this morning19:07
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Follow up change for Heat Dashboard creation  https://review.openstack.org/51083519:07
fungioh, glob instead of iterating sounds safe enough19:07
jlvillalpabelanger: fungi: I'm proposing a change to the tox.ini19:07
pabelangeror that19:08
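The two options raised above (stat the folder first, or use a glob match) can be sketched in Python. This is only an illustration of the idea, not the role that infra actually runs; the function names and the `dest` layout are hypothetical.

```python
import glob
import os


def existing_log_dirs(tox_root):
    """Return the <env>/log directories that actually exist under tox_root.

    A glob only yields paths that are present, so an env without a log
    directory (like releasenotes in the failure above) is skipped.
    """
    pattern = os.path.join(tox_root, "*", "log")
    return sorted(p for p in glob.glob(pattern) if os.path.isdir(p))


def rsync_commands(tox_root, dest):
    """Build one rsync command per existing log directory (hypothetical layout)."""
    commands = []
    for src in existing_log_dirs(tox_root):
        env_name = os.path.basename(os.path.dirname(src))
        commands.append(["rsync", "-a", src + "/", os.path.join(dest, env_name) + "/"])
    return commands
```

With this shape a missing log directory produces no rsync invocation at all, rather than an rsync error in the post playbook.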
inc0another question, how do I publish logs in v3? we had post job in v219:08
*** edmondsw has quit IRC19:09
fungioh, also, why is that job running (and failing) the same post playbook twice?19:09
openstackgerritMerged openstack-infra/project-config master: Set v3 nodepool citycloud timeout to 600  https://review.openstack.org/51165619:09
inc0basically I have all logs in dir, how do I make them visible in http://logs.openstack.org/59/508759/55/check/kolla-ansible-centos-source/161c76b/ ?19:09
pabelangerinc0: you'd sync them back to the executor into the zuul.executor.log_src folder19:11
pabelangerwe should have something in devstack-gate that does this today19:11
inc0rsync module?19:11
SamYapleW00T!!! a successful dockerhub push using secrets in zuulv319:11
SamYaplethanks everyone for your help19:11
inc0cool SamYaple :)19:11
pabelangerinc0: see https://git.openstack.org/cgit/openstack-dev/devstack/tree/roles/fetch-devstack-log-dir for an example19:12
inc0thank you Paul19:12
pabelangerinc0: there is also a proposal from mordred to make this a little easier19:12
pabelangerinc0: see openstack-infra ML for more on the topic19:13
pabelangerSamYaple: cool, url?19:13
inc0I will, although this seems straightforward to me19:13
SamYaplepabelanger: to the images? https://hub.docker.com/r/openstackloci/requirements/tags/ (master-ubuntu and master-centos tags)19:13
pabelangerSamYaple: neat, have a log file handy?19:14
SamYaplepabelanger: indeed i do http://logs.openstack.org/59/59e8266aa0856a5462ec44820401cec52c8731f7/post/publish-loci-requirements/c7e9047/job-output.txt.gz#_2017-10-16_19_05_50_27593919:14
pabelangerI like the warning on the next line :)19:14
*** yamamoto has joined #openstack-infra19:15
SamYaplepabelanger: as long as it doesnt expose the password im fine with it :)19:15
pabelangerSamYaple: where are you keeping your secret / playbook for that?19:17
inc0speaking of which, infra hosted registry, once you guys feel comfortable with zuulv3 I'd love to get back to it19:17
pabelangerI see it now19:17
SamYaplepabelanger: https://github.com/openstack/loci/blob/master/.zuul.yaml19:17
clarkb01 and 02 still running but process count continues to trend down19:17
pabelangerSamYaple: yah, no_log should help with that19:18
SamYaplepabelanger: im running no_log :)19:19
SamYaplepabelanger: https://github.com/openstack/loci/blob/master/playbooks/push.yaml#L819:20
*** yamamoto has quit IRC19:20
pabelangerSamYaple: cool, now to see if they rate limit your uploads19:20
*** hemna_ has quit IRC19:20
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Start ssh persistent connections outside bwrap  https://review.openstack.org/51215819:21
SamYaplepabelanger: our next step is adding a post job for cinder to build loci images. so well see.19:21
SamYaplewe just need to figure out how to do that now19:22
pabelangershouldn't be hard to do that19:23
*** rbrndt has quit IRC19:23
SamYaplewould having cinder post pipeline run a loci job mean that the loci secrets/playbooks/job definitions would need to move to project-config?19:24
pabelangeri don't think so, jobs are currently global19:25
pabelangerso, any project should be able to reference that19:25
*** bramleya_ has quit IRC19:25
pabelangeryah, I think that is right19:26
jlkSo I've been out of it for a bit, where do things stand with zuul v3 and infra production?19:26
SamYaplebut the job would be running as cinder, so i would have to adjust the allowed-projects on the secrets if im recalling correctly19:26
* mordred waves to everyone from the air19:26
pabelangerjlk: https://etherpad.openstack.org/p/zuulv3-issues is the current issues we are working on19:26
pabelangerjlk: but alive and working19:26
pabelangermordred: ^519:27
jlkis it gating for everything now, or still a minimal set?19:27
pabelangerjlk: everything, we rolled out 22:00UTC yesterday19:27
SamYaplejlk: everything19:27
jlknice19:27
pabelangeronly a few bumps so far19:27
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Use weakref for change cache  https://review.openstack.org/51135519:27
pabelangerbut all and all, looking real good!19:28
jlkalrighty. I'll try to get some reviewing done today.19:28
jlvillalpabelanger: fungi: As a note, for ironic-python-agent, we also set the envdir for 'docs', but that job passes. Not sure how the sphinx-docs job differs from the releasenotes job.19:28
SamYapleif all job definitions are global, does that mean that from the cinder repo they could install a job in the nova pipelines?19:28
jeblairmordred: o/  if you have time to think about a tox base job that doesn't use sudo, that'd be great.  see "Need a openstack-tox job that doesn't revoke sudo" in etherpad19:28
jlvillalpabelanger: fungi: Of course we can't look at the logfiles for those jobs unless the job fails.19:29
jlkSamYaple: "install a job in the nova pipelines" what does that mean?19:29
* jeblair continues lunching19:30
SamYaplejlk: define a job in the cinder .zuul.yaml that says "run this job in the nova check pipeline"19:30
jlkhrm.19:30
jlkI know the job definition could live in cinder, I'm not sure if you can make additions to the nova project definition.19:31
*** mordred has quit IRC19:31
toskySamYaple: I don't think you can do that; which syntax would you use for that?19:31
SamYaplejlk: but its all global...19:31
jlkyes, global. But not necessarily /merged/19:32
SamYapletosky: https://review.openstack.org/#/c/512394/19:32
SamYaplejlk: thats not what ive been led to believe19:32
jlkIf you try to define a project with a specific name in two different locations you may get a conflict19:32
SamYaplejlk: thats why we need uniquely named jobs still, because it _is_ merged19:32
jlkjust like if you define a job in two different places you get conflicts19:32
toskySamYaple: and in fact zuul refused it19:33
jlkMaybe we're talking past each other19:33
toskyyou can't use another unprivileged queue19:33
*** hemna_ has joined #openstack-infra19:34
SamYapleright. so my original question was to do this, i must put this in project-config19:34
jlkWhen I say "merged" i mean zuul would interleave the dictionary content of the two definitions, which is ... difficult19:34
jlkBy global, I mean they all exist in the same namespace, but duplications aren't necessarily allowed19:34
fungiSamYaple: project A can use a job defined in project B, but project B cannot force project A to use a job it defines19:34
jlkSamYaple: you could put it in nova's repo19:34
jlkif the project is defined in nova's repo, you would make the addition there19:34
fungi(excepting for the project-config repo)19:35
jlkDefine the job in loci, make use of it in the nova repo, and the cinder repo19:35
SamYapleour end goal is for every merged commit to cinder to trigger a post job publish-loci-cinder. i will accept advice on how to do this19:35
*** slaweq has joined #openstack-infra19:36
SamYapleis the best way to A) put this in project-config or B) put this in cinder19:36
SamYaplesecrets from openstack/loci are involved here19:36
jlkAdd it to the place where the cinder project is defined.19:36
jlkSorry, by "it" I mean the configuration of the job to run in the post pipeline19:36
fungiSamYaple: cinder would need to add that job to run in the post pipeline, but it can use the job defined in loci19:36
jlkSomewhere there is a - project: name: openstack/cinder   block19:37
jlkthat location is where you'd add the new job19:37
SamYaplejlk: you mean i need to copy around all the playbooks in openstack/loci to openstack/cinder to do this?19:37
jlkno19:37
jlkyou reference the job name in openstack/cinder19:37
jlkthe content can all live in openstack/loci19:37
SamYapleok im following you19:37
SamYaplewhat about the secrets?19:37
SamYaplei believe i recall an allowed-projects list for secrets for cross-project secrets? am i mistaken?19:38
SamYapleor will the fact that the job is defined in the repo with the secret allow it to use the secret?19:38
pabelangercinder won't have access to the secret19:38
pabelangeronly your job in loci will19:38
jlk"A Secret may only be used by jobs defined within the same project."19:38
jlkhttps://docs.openstack.org/infra/zuul/feature/zuulv3/user/config.html#secret19:38
pabelangersecrets are tied to playbooks19:38
pabelangeryah, what jlk said19:39
jlkTo run the job, Zuul will check out the job content from the place it's defined19:39
jlkand have access to the secret to run that playbook19:39
SamYapleso calling the job will let cinder run the job, and the job has access to the secret, but at no point cinder had access to the secret, correct?19:39
fungiso the job can run triggered in cinder's post pipeline, and as long as the secret and job are both defined in loci that should work fine19:39
SamYaplesorry, defining the job19:39
pabelangerSamYaple: right19:39
jlkSamYaple: nobody could write a job in cinder's repo to make use of the secret in your repo19:39
SamYaplegot it. im on the same page now19:40
SamYaplethanks for the clarity19:40
jlkonly the jobs in your repo (if that's where the secret is defined) can access the secret19:40
jlkalways good to check! And to bubble up things that aren't clear19:40
pabelangerand if they can, quietly let us know19:40
SamYapleheh will do19:40
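The rule quoted from the docs above ("A Secret may only be used by jobs defined within the same project") can be modelled in a few lines of Python. This is a toy model of the rule as described in this discussion, not Zuul's implementation; the job and secret names are illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Secret:
    name: str
    project: str  # repo where the secret is defined


@dataclass(frozen=True)
class Job:
    name: str
    project: str  # repo where the job and its playbooks are defined


def can_use(job, secret):
    """A job may use a secret only if both are defined in the same repo."""
    return job.project == secret.project


# Illustrative names: the secret and the publishing job both live in loci.
docker_login = Secret("docker_login", "openstack/loci")
loci_job = Job("publish-loci-cinder", "openstack/loci")
cinder_job = Job("hypothetical-cinder-job", "openstack/cinder")
```

Note that the project whose pipeline triggers the job does not appear in the check: cinder can run `loci_job` in its post pipeline, but a job written in cinder's own repo fails `can_use`.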
fungiSamYaple: now, this _does_ point out a safety concern... don't want to write jobs which run arbitrary code which could be used to expose secrets that job uses19:40
SamYaplefungi: only use it on one line, and i set no_log19:41
*** ralonsoh has quit IRC19:41
*** slaweq has quit IRC19:41
toskyswitching topic a bit, just to be sure: if a job defined in repo A is used in repo B, will repo A be cloned as well or should it explicitly be put into required-projects?19:41
openstackgerritgordon chung proposed openstack-infra/openstack-zuul-jobs master: Remove Ceilometer legacy jobs  https://review.openstack.org/51072019:41
SamYapletosky: i second that question19:41
fungiSamYaple: right, just pointing out you want to avoid patterns similar to `setup.py upload` or `npm upload` which run arbitrary code from the project being uploaded19:42
*** mordred has joined #openstack-infra19:42
jlkI thought that Zuul will automatically clone the repo the job exists in, in order to access the playbook19:42
toskythat makes sense; but does it copy it to the workspace too?19:42
jlkat least cloned to the executor, not necessarily exposed in the workspace. that I'm less clear on19:42
SamYaplefungi: good info to have, thanks19:42
fungitosky: repo B will be cloned automatically, but they _will_ need to add project A to the required-projects list for their variant19:42
* mordred waves to everyone from the air again after reconnecting his irc bouncer to freenode19:43
pabelangerthese are exciting questions19:43
toskyfungi: oh, good to know19:43
SamYaplesuch depth. much wow19:43
toskymy use case is: a job derived from the (work-in-progress) devstack-tempest, which will be put in sahara-tests19:43
jlkfungi: "their variant" means what?19:43
toskycalled sahara-tests-tempest19:43
toskyand that will be used in sahara and python-saharaclient (in addition to sahara-tests)19:43
pabelangertosky: yah, you'll want to use required-projects19:44
toskygood to know :)19:44
toskyI would have probably discovered it, but one review less is better19:44
fungijlk: when they put the job in their projects list, they can set parameters on it (including required-projects) and, unless i'm mistaken on the terminology, that technically makes it a variant of the original19:44
pabelangertosky: yah, zuul does a good job telling you if something is missing. It should result in a syntax error19:44
jlkfungi: wouldn't that mean ANY application of a job is a variant?19:45
toskynow the question is: what is the reason for not making that repository available automatically?19:46
fungijlk: oh, maybe i'm still mistaken on exactly what constitutes a job variant in that case19:46
toskyis there a use case where I would need only the playbooks/roles/zuul stuff but not the rest of its content?19:46
* jlk thinks a bit19:47
pabelangerfirst definition of a job is the reference, anything else named the same as the reference job is a variant19:47
jlk"exposure" of content isn't a built-in to Zuul. It's done as a pre-task19:47
jlkcloning happens on the executor, which uses it for running playbooks. What's on the remote node depends on the base job19:47
fungipabelanger: okay, that's what i thought as well19:48
pabelangerhad to double check in the docs :)19:48
openstackgerritSam Yaple proposed openstack-infra/project-config master: Publish cinder loci images to DockerHub  https://review.openstack.org/51239819:49
toskyok, another question19:50
mordredtosky, jlk: yes, there is a difference between repos cloned so that zuul can use their playbooks/roles to execute jobs - and repos needed on the remote node for job content itself19:50
jlktosky: jobs are free form, there's no real standard that the job (playbooks) make any use of the repo contents.19:50
SamYapleok so fungi this also adds openstack/loci as a required-project, did i do that right https://review.openstack.org/#/c/512398/ ?19:50
jlktosky: like the job could just be curling against remote URLs, and won't even need a node from nodepool19:50
*** armax has quit IRC19:50
toskyI see19:50
jlkso really, NOTHING is exposed on the remote node, until a pre-job delivers it to said node19:50
*** markvoelker has joined #openstack-infra19:51
toskythe documentation for a job says " If parent is set to null (which is only valid in a config-project), this is a base job."19:51
mordredfor instance, the base job comes from project-config, so when running the base job pre-playbooks zuul clones project-config at its master state on the executor so that it has the roles/playbooks ... but we don't rsync project-config to the remote nodes because that wouldn't be very helpful19:51
*** markvoelker has quit IRC19:51
jlkand those pre-jobs are up to Zuul instance admins, and in this case, openstack-infra19:51
pabelangerSamYaple: you shouldn't need required-projects, since the job exists in openstack-loci19:51
mordredand, in fact, a job could be testing a change to project-config and would need the speculative state available for rsyncing to the remote build node so that the test of the patch can take place19:51
*** markvoelker has joined #openstack-infra19:51
toskyabout parents: does it mean that if I try to define a job without a parent in an in-tree .zuul.yaml I should get an error, or that there is an implicit parent?19:52
SamYaplepabelanger: "19:42:51 < fungi> tosky: repo B will be cloned automatically, but they _will_ need to add project A to the required-projects list for their variant"19:52
jlkthe take away is to use 'required-projects' if you require the content of the repo available during job execution.19:52
mordredtosky: there is an implicit parent called base19:52
SamYaplepabelanger: doesnt that mean project B (cinder) needs to add project A (loci) to it?19:52
mordredjlk: ++19:52
pabelangerSamYaple: that isn't a job variant however19:52
pabelangerSamYaple: you are just adding a job to a pipeline19:52
toskymordred: ah, ok, that explains why https://review.openstack.org/#/c/511080/ works19:52
SamYapleah i see. i need to research "variant" then19:53
mordredtosky: yes19:53
fungipabelanger: but the project the job is defined in won't be automatically cloned, will it? just the project triggering the job i thought. are you suggesting it's the other way around?19:53
*** edmondsw has joined #openstack-infra19:54
jlkfungi: the repo where the playbooks exist will be cloned to the executor19:54
*** panda has joined #openstack-infra19:54
pabelangerfungi: we'd need to clone the jobs on the executor, because that is where the playbooks live19:54
jlkso that ansible has access to the playbooks19:54
pabelangerYah, which is not needed on the remote node19:54
*** ldnunes has quit IRC19:54
fungijlk: pabelanger: but to have it available on the job node you'd still need to add it to required-projects right?19:54
jlkit's just that the content won't then be pushed to the nodepool node.19:54
pabelangerfungi: yah, that is right19:55
mordredyah. there are two different cloning contexts19:55
jlkfungi: correct.19:55
mordredif you look at:19:55
*** panda is now known as panda|rover19:55
mordredhttp://logs.openstack.org/80/511080/18/check/sahara-image-elements-buildimages-spark/d26ceaf/zuul-info/inventory.yaml19:55
*** ldnunes has joined #openstack-infra19:55
fungii assumed SamYaple wanted the job to also have access to loci19:55
mordredzuul.projects is the list of things that we're going to rsync to the remote nodes19:55
fungibut maybe that's an incorrect assumption19:55
mordredfungi: yes. if SamYaple is defining a loci job that might want to be added to the cinder pipelines19:55
mordredthen that job should required-projects on openstack/loci19:56
fungiaha19:56
mordredit'll not add much to the loci invocations of that job19:56
mordredbut it'll be needed for the cinder invocations19:56
fungiwhereas if the job only ran against loci changes it wouldn't be needed in required-projects19:56
jlkyeah that can be defined with the job itself19:56
jlkfungi: right, it's implicit there19:56
jlkor well, self serving?19:56
dhellmannthe tag-releases job is failing because add-sshkey doesn't have an "ssh_private_key" value. That doesn't feel like something I can fix on my own. Does someone have a few minutes to help out? http://logs.openstack.org/ef/ef00e18f9937e1544d2bedebf26328a7c3d741b8/release-post/tag-releases/e9fc7f2/job-output.txt.gz#_2017-10-16_19_53_58_64569119:56
fungior required-projects could be omitted from the job definition in loci, but then cinder would need to add it as a variant19:57
pabelangerI'll look at that19:57
jlkyou could do it that way, but that's probably more error prone19:57
jlkeasier to define it at the job level, so that consumers can just toss the job on and it'll work as expected19:57
toskyjlk: do you mean the job in A adding required-projects on A?19:57
mordredyes19:57
*** markvoelker_ has joined #openstack-infra19:57
clarkbdhellmann: I'm guessing that is because the job wants to add an ssh key to push tags into gerrit?19:57
*** ifat_afek has quit IRC19:57
mordredsince it's a job designed to be run by other repos and the job requires loci19:57
dhellmannclarkb : yes19:57
jlktosky: correct, the canonical job definition in A can list required-projects of A19:57
toskythat makes sense19:58
toskyI will do it19:58
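The required-projects behaviour worked out above can be sketched as a small Python function: the project that triggers a job is always included, and anything else has to be listed. This is a simplified model of what was described in this discussion, not Zuul code, and the function name is hypothetical.

```python
def projects_on_node(triggering_project, required_projects=()):
    """Return the repos made available to the job on the remote node.

    The triggering project is added automatically; required-projects
    entries are appended, skipping duplicates.
    """
    ordered = [triggering_project]
    for project in required_projects:
        if project not in ordered:
            ordered.append(project)
    return ordered


# A loci job that lists openstack/loci in required-projects:
on_loci_change = projects_on_node("openstack/loci", ["openstack/loci"])
on_cinder_change = projects_on_node("openstack/cinder", ["openstack/loci"])
# The same job without required-projects, triggered by cinder:
missing_loci = projects_on_node("openstack/cinder")
```

This is why listing the job's own repo in its definition adds nothing when loci triggers the job but matters when cinder does.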
clarkbdhellmann: so you need to update the job to use the ssh_private_key of that user. If it is already a secret in project-config I think that is easy ish but if not ya an infra root will have to add it19:58
mordredjlk, fungi: https://review.openstack.org/#/c/500365/29/.zuul.yaml is an example - look at shade-functional-devstack-base19:58
*** rhallisey has quit IRC19:58
dhellmannclarkb : I understand most of those words, but I'm not sure where to start with that. Is that just a setting on the job definition? How do I figure out what value to use?19:58
mordredjlk, fungi: that puts name: openstack-infra/shade into required-projects: so that if I convinced keystoneauth to use that job it would have the complete set of repos19:58
openstackgerritSam Yaple proposed openstack-infra/project-config master: Publish cinder loci images to DockerHub  https://review.openstack.org/51239819:58
clarkbdhellmann: ya let me dig up some links (though I'm in the middle of updating local network hardware so may disappear for short periods of time)19:59
SamYapleok fungi mordred , the job is already functional (it runs successfully from loci) so all the required-projects should be set correctly19:59
jlkwell19:59
dhellmannclarkb : in the tag-releases job definition I see 2 secrets. "lp_creds" and "release_ssh_key"19:59
jlkin the case of running on loci jobs, it's assumed that the loci repo exists19:59
jlkon the remote node20:00
mordredyah. the project triggering the job gets added to the projects list automatically20:00
jlkSamYaple: the question is, does the job require content from the loci repo itself?20:00
jlkif it does, then you should add a required-projects: of loci in that definition20:00
fungiSamYaple: a job running against changes/commits for the repo it's defined in is a special case, because it will get its project required automatically by being the one which triggered the job20:00
dhellmannclarkb : and in secrets.yaml I see a release_ssh_key with a private_key that's a list (?) of encoded data20:00
jlksince the job is designed to (potentially) run in pipelines for other projects20:00
*** markvoelker has quit IRC20:01
mordredSamYaple: I'd add openstack/loci to required-projects in your base job: http://git.openstack.org/cgit/openstack/loci/tree/.zuul.yaml#n7520:01
SamYapleoh jeez. THATS where you all are talking about it20:01
toskymordred: about that shade example: what would have happened if you omitted the override-branch: master for shade? A run-time error because the, say, stable/ocata branch does not exist?20:01
SamYapleyes. i can add loci there20:01
SamYaplecan i do a depends on in project-config to an untrusted repo?20:01
mordredtosky: no - it's just that the stable/ocata branch would be checked out - thing is, I don't actually care about that branch in that case - it would totally work if I hadn't put in that override though20:02
mordredSamYaple: you can - it won't do speculative job config though20:02
toskymordred: and if the branch does not exist?20:02
SamYaplegot it20:02
mordredtosky: it'll fall back to master using normal branch fallback logic20:03
mordredtosky: so if keystoneauth added a shade-functional test to their pipeline and then submitted a patch to stable/ocata of keystoneauth20:03
mordredtosky: without that branch line it would result in stable/ocata of shade being cloned if it existed (which it does) or master if it didn't20:03
toskymordred: but I guess it's better to add it explicitly? (use case: sahara-tests, described above, is branchless, so technically I don't need that override-branch, but...)20:04
mordredtosky: but the thing I want to test with shade is that master of shade works with stable/ocata of keystoneauth20:04
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: fix the configuration for the release tagging key  https://review.openstack.org/51240120:04
dhellmannclarkb : is it possible this is the fix? ^^20:04
mordredtosky: yah - sahara-tests being branchless means you do not need that override thing...20:04
mordredtosky: for branch matching shade's world-view is weird - don't copy that part :)20:04
toskymordred: ok :)20:04
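The branch selection mordred walks through above (use the override if one is set, otherwise match the change's branch, otherwise fall back to master) can be written out as a short Python sketch. It mirrors only the logic stated in this conversation; the function and parameter names are hypothetical, and Zuul's real fallback rules may have more cases.

```python
def pick_branch(repo_branches, change_branch, override_branch=None, default="master"):
    """Choose which branch of a required project gets checked out.

    repo_branches: branches that exist in the required project
    change_branch: branch of the change that triggered the job
    override_branch: value of the job's override-branch setting, if any
    """
    wanted = override_branch or change_branch
    return wanted if wanted in repo_branches else default


shade_branches = {"master", "stable/ocata"}
sahara_tests_branches = {"master"}  # branchless repo

no_override = pick_branch(shade_branches, "stable/ocata")
with_override = pick_branch(shade_branches, "stable/ocata", override_branch="master")
branchless = pick_branch(sahara_tests_branches, "stable/ocata")
```

For a branchless repo the fallback already lands on master, which matches the point that sahara-tests does not need the override.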
openstackgerritSam Yaple proposed openstack-infra/project-config master: Publish cinder loci images to DockerHub  https://review.openstack.org/51239820:05
*** hongbin has quit IRC20:05
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Fix release_ssh_key secret  https://review.openstack.org/51240220:05
pabelangerdhellmann: yah, you beat me to it20:05
dhellmannpabelanger : cool, thanks20:05
dhellmannwe need to put some jsonschema validation on that file :-)20:06
clarkbpabelanger: dhellmann we also need to make sure that it is using that secret.ssh_private_key on the other end right?20:06
SamYaplephew. thanks all for sticking with me! i think i have a handle on it now20:06
pabelangerclarkb: it will once we update the secret20:06
SamYaplemordred: i did successfully push to dockerhub from a post job this morning :)20:06
mordredSamYaple: that patch looks fine. NOW - just for sake of being complete with the discussion - you could also just submit that patch to the cinder repo20:06
dhellmannclarkb : that key is listed in the set of secrets20:06
dhellmannfor the job20:06
dhellmannI'm not sure if you mean go deeper than that in the validation?20:06
mordredSamYaple: project-config vs. cinder in this case is a social question, not a technical one20:06
clarkbdhellmann: in the place where we attempt to set the key20:07
pabelangerdhellmann: which server is that key for?20:07
clarkbpabelanger: its a user key for a gerrit user20:07
openstackgerritKazunori Shinohara proposed openstack-infra/project-config master: Follow up change for Heat Dashboard creation  https://review.openstack.org/51083520:07
dhellmannpabelanger : it's used by the tag-releases job. I don't know what server.20:07
pabelangerah, okay, so we don't SSH into the server20:07
dhellmannno20:07
SamYaplemordred: cinder repo has no .zuul.yaml or zuul.d yet20:07
pabelangerdhellmann: and we don't validate known_hosts?20:08
mordredSamYaple: cool. mostly wanted to just make sure it was understood it was physically possible20:08
mordredpabelanger, dhellmann we should ALWAYS validate known_hosts20:08
fungiSamYaple: it could also be done in project-config in that case, but reviewers would want to see +1 from cinder infra liaison or ptl20:08
SamYaplemordred: and we talked about it a bit. it seems like it makes sense to keep them separate, because this in a post job in no way affects anything in cinder20:08
mordredno?20:08
openstackgerritKazunori Shinohara proposed openstack-infra/project-config master: Follow up change for Heat Dashboard creation  https://review.openstack.org/51083520:08
pabelangermordred: right, we likely should be adding ssh_known_hosts to that secret too20:08
dhellmannpabelanger , mordred : how do I make it do that?20:08
mordredpabelanger, dhellmann: that ssh key is the one used by git review to communicate with gerrit, yeah?20:08
dhellmannit's used to let us push tags20:09
* mordred looks real quick20:09
SamYaplefungi: patch is already up for project-config. have a +1 from the cinder team, if you need it from the PTL/cinder infra liaison ill look into that20:09
mordredyah. it's the ssh key for the account on gerrit IIRC - one sec20:09
dhellmannI see an "ssh_known_hosts" setting on some other keys20:09
pabelangermordred: dhellmann: you might be able to use proposal_ssh_key for info20:09
mordredpabelanger, dhellmann: I'm reading the full job stack real quick ...20:10
pabelangerkk20:10
dhellmannthis stuff isn't new, it just didn't migrate over cleanly I think20:10
dhellmannI'm assuming this is the key we were using before?20:10
dhellmannsome guy named "Monty" put it in the file20:11
*** rbrndt has joined #openstack-infra20:11
clarkbpabelanger: dhellmann ya the add-sshkey role needs a ssh_private_key arg which we are not passing in20:11
fungii'm still trying to piece together how the tag-releases job works... i see the pre playbook calls an add-sshkey role but where is that role defined? http://git.openstack.org/cgit/openstack-infra/project-config/tree/playbooks/release/pre.yaml#n1320:11
fungii don't see it in the roles directory in project-config20:11
clarkbit also takes ssh known hosts and fqdn to set that up20:11
fungiand i thought any role using a secret needed to be defined in the same repo as the secret20:11
clarkbso I think the problem is we aren't passing the secret's details into that role20:11
pabelangerclarkb: yah, 512401 fixes that problem. Just confirming known_hosts know20:11
clarkbfungi: any playbook20:11
pabelangernow*20:11
jlkmordred: from a social aspect, is it expected that we'll slowly move project definitions from project-config into the repos themselves?20:11
mordredpabelanger: ssh_key.ssh_known_hosts and ssh_key.fqdn20:12
fungiclarkb: oh, wow so as long as the playbook is in the same repo as the secret, the role using that secret can go anywhere?20:12
* fungi looks in osj20:12
jlkgiving the project more explicit overview of changes to pipeline configurations that impact them?20:12
mordredfungi: yes. secrets are tied to playbooks20:12
fungimordred: but roles can subvert that?20:12
pabelangermordred: yah, so proposal_ssh_key is likely the format, with different private_key contents20:12
pabelangerdhellmann: ^20:13
dhellmannpabelanger : the user value in the known_hosts entry for the proposal key is not the user I would expect the tag job to use. does that matter?20:13
fungimordred: i guess the idea is you wouldn't intentionally pass a secret to a role you don't trust/control20:13
pabelangerdhellmann: I don't believe that is an issue20:13
openstackgerritKazunori Shinohara proposed openstack-infra/project-config master: Follow up change for Heat Dashboard creation  https://review.openstack.org/51083520:13
dhellmannit has "gerrit-code-review@gerrit.openstack.org" and I don't know if that value is part of the fingerprint or just metadata20:13
ianwo/ ... nice to see no chatter in infra-incident :)20:13
pabelangerI think that is meta, fungi ^?20:13
mordredfungi: yah. and roles don't get passed in speculative versions to playbooks with secrets20:13
fungipabelanger: dhellmann: it's irrelevant (a "comment" field in the key format)20:14
pabelangerty20:14
mordredjlk: I think loci here is an interesting case ...20:14
dhellmannfungi : thanks20:14
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: fix the configuration for the release tagging key  https://review.openstack.org/51240120:15
dhellmannlet's see if ^^ is closer20:15
clarkbjeblair: ok I think 01 is mostly stopped except it is still running zuul-executor processes. I don't see any ansible though20:15
mordredjlk: in this case, rather than wanting cinder to run a job in check/gate - the loci project wants to 'subscribe' to commits from cinder so it can rebuild loci artifacts20:15
clarkbjeblair: is that a situation you want to look at?20:15
mordredjlk: I could see that being a thing to just do in project-config because it's not *actually* a thing that impacts the cinder project directly20:15
pabelanger+220:16
jlkyeah it's different. and post jobs don't really "block" changes from going in, right?20:16
*** salv-orlando has joined #openstack-infra20:16
SamYaplecorrect20:16
mordredjlk: or I could see us deciding socially that even doing that without cinder consent is rude ... it's a new fun question! :)20:16
SamYaplelots of testing there in recent days20:16
jlkmordred: the other question, can part of a project definition live in cinder, and the rest live in project-config?20:16
pabelangerjlk: mostly just consume resources IMO20:16
mordredjlk: yup!20:16
SamYaplein this case, i already have cinder PTL +1 on the patch20:16
SamYaplebut i would prefer *not* to have to get everyone's permission. i'll do what i need to though20:17
*** yamamoto has joined #openstack-infra20:17
jlkmordred: so they are merged, but ... hrm.20:17
mordredjlk: in fact, pretty much every project that has in-repo config has some project definition in project-config and some in their repo20:17
jlkmordred: how much merging is done?20:17
jlkcan both list jobs for a "check" pipeline, and you get the sum total of both listings?20:17
mordredjlk: since we are requiring that projects keep the 'system-required' project-template in their project definition in project-config20:17
mordredjlk: yes20:17
jlkokay.20:17
mordredjlk: so putting things in project-config is like the "the openstack project requires python projects run pep8" .. and then putting things in in-repo is like "also, cinder wants to run cinder-foo-bar"20:18
* mordred waves hands20:18
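The "sum total of both listings" behaviour mordred describes can be sketched roughly as follows. This is an illustration of the idea only, not Zuul's actual implementation; `merge_project_stanzas` is a hypothetical name, and the job names are placeholders apart from the cinder-foo-bar example from the discussion.

```python
from collections import OrderedDict


def merge_project_stanzas(stanzas):
    """Combine per-pipeline job lists from several project definitions."""
    merged = OrderedDict()
    for stanza in stanzas:
        for pipeline, jobs in stanza.items():
            combined = merged.setdefault(pipeline, [])
            for job in jobs:
                # Keep the first occurrence so a job listed twice runs once.
                if job not in combined:
                    combined.append(job)
    return merged


# Central policy (project-config) and in-repo additions for the same project.
project_config = {"check": ["openstack-tox-pep8"], "gate": ["openstack-tox-pep8"]}
in_repo = {"check": ["cinder-foo-bar"]}

result = merge_project_stanzas([project_config, in_repo])
for pipeline, jobs in result.items():
    print(pipeline, jobs)
```

The check pipeline ends up with both jobs, while gate keeps only what project-config listed.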
SamYapledid merge-check evolve into system-required?20:18
jlkmordred: that makes sense20:19
jlksince it does merging in that style that totally makes sense.20:19
mordredSamYaple: yes20:19
SamYaplecool. cause that was new to me20:19
jlkhow that merging is done, should be documented, if it isn't already20:19
mordredjlk: it's like a shared responsibility model20:19
clarkbbut also we aren't running merge-check right now20:19
mordredjlk: totally20:19
clarkbI'm half expecting it to just go away because gerrit20:19
mordredclarkb: yah - but the project-template we have documented everyone should have in their project is now 'system-required'20:19
*** ijw has joined #openstack-infra20:19
clarkbya20:20
mordredwhich would let us put in other things globally if we needed/wanted20:20
clarkbpabelanger: is the situation ze01 is in familiar to you?20:20
pabelangerso, I was thinking about 'system-required': if we require all projects to have it, couldn't we have a global jobs regex somehow, and put it there, rather than having 1500 repos do it?20:20
mordredlike a 'cleanup stale logs' triggered by abandon events that would delete logs for abandoned changesets from the log server (not saying that's a good idea - but as an example of a thing we might choose to put there)20:21
clarkbbasically just a zuul daemon hanging out but no apparent jobs running20:21
pabelangerclarkb: let me look20:21
*** wolverineav has joined #openstack-infra20:21
*** yamamoto has quit IRC20:22
mordredpabelanger: yah - I floated the idea a little while ago of adding the ability to define a list of default project-templates for a tenant - but that got quickly put into the 'let's talk about that post v3 rollout' pile - I honestly have not thought through all of the ramifications ... it might be a TERRIBLE idea20:22
pabelangerclarkb: I see a few ssh-agent processes running, perhaps that is blocking the executor20:22
dmsimardinfra-root: my mid-term solution for ara storage and inodes: https://review.openstack.org/#/c/511992/ let me know what you think20:22
pabelangerclarkb: I think having jeblair would be good too20:22
jeblairback20:22
mordreddmsimard: looking20:22
mordred\o/ it's a jeblair20:22
jeblairclarkb: gimme a sec on ze0120:22
clarkbpabelanger: the last thing logged is about killing an ssh agent (and that agent doesn't appear to exist)20:22
clarkbjeblair: ok20:22
*** jtomasek has quit IRC20:23
*** dprince has quit IRC20:23
openstackgerritMerged openstack-infra/project-config master: Remove legacy jobs in Mixmatch  https://review.openstack.org/51176420:23
*** armax has joined #openstack-infra20:24
pabelangermordred: yah, post zuulv3 for sure :)20:24
mordreddmsimard: on first glance that looks pretty cool20:24
jeblairclarkb: i'm going to sigusr220:24
clarkbjeblair: ok20:25
dmsimardmordred: I've tested it and it works very well. I also suspect this could work well against swift or s3 by abstracting them with something like s3ql but I need to test it.20:25
fungidmsimard: when it comes to test-driving that, we can add it to the logs-dev vhost on static.o.o without impacting logs.o.o20:25
*** baoli has joined #openstack-infra20:25
mordredfungi: ++20:25
jeblairclarkb: tracebacks are crazy wrong :)20:25
*** smatzek has quit IRC20:26
*** smatzek has joined #openstack-infra20:26
*** ldnunes has quit IRC20:26
*** baoli has quit IRC20:26
dmsimardfungi: oh, that's a good idea.20:27
*** baoli has joined #openstack-infra20:27
clarkbjeblair: looks like one of the threads may be waiting to stop and then the others are all lined up behind it?20:27
openstackgerritMerged openstack-infra/project-config master: fix the configuration for the release tagging key  https://review.openstack.org/51240120:27
dmsimardfungi: the only doubt I have is about "hardcoding" /ara/ which might clash with other users who might not be saving their databases (such as openstack-ansible)20:27
fungidhellmann: ^20:27
fungineed anything retriggered?20:28
fungior reenqueued i guess20:28
jeblairclarkb: first three look like they're waiting for the merger lock so they can run a job...20:28
pabelangerdmsimard: after the first load, will the database be used again?20:28
dhellmannfungi : I have a revert in process now, so I'll run it that way. thanks!20:28
jeblair0064 is the main thread20:29
dmsimardpabelanger: the database is used by the web app so long as it gets queries for it, the middleware doesn't delete the database or anything20:29
*** Swami has joined #openstack-infra20:29
pabelangerdmsimard: I guess I am curious, if I constantly hit f5, would I be using CPU each time?20:29
pabelangeror does ARA cache it somehow20:30
dmsimardpabelanger: there is no static generation involved20:30
pabelangerokay20:30
dmsimardpabelanger: it's just reading from the database and serving routes20:30
pabelangerso, something to keep in mind then20:30
*** smatzek has quit IRC20:30
dmsimardpabelanger: it doesn't do 'ara generate' which consumes the CPU and generates all those files and directories20:30
dmsimardpabelanger: it's a native wsgi web app that reads from the database20:30
pabelangerkk20:30
*** epende has quit IRC20:30
pabelangerhttp://logs.openstack.org/76/512376/1/gate/legacy-openstackci-beaker-centos-7/bc2dd8b/ is an interesting POST_FAILURE. I think that is an aborted job from ze02 but we didn't re-run it20:32
pabelangergoing to add to etherpad to triage later20:32
jeblairpabelanger: that would be a zuul issue, go ahead and add it there20:33
*** hasharAway has quit IRC20:33
jeblairlet's leave the triage section for "i have no idea what this issue is or who can help with it"20:33
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Add flag to allow disabling sudo revocation  https://review.openstack.org/51240820:34
jeblairsince we have folks who specialize in job issues and different folks who specialize in zuul issues20:34
clarkb02 is near being down according to the process listing at least20:35
pabelangersure20:35
*** kgiusti has quit IRC20:35
dhellmannthere's a new error with tag-releases: http://logs.openstack.org/6b/6b2b2f10af366ea3c2a7f4c056ce16a7ccde16f1/release-post/tag-releases/7dd1a54/job-output.txt.gz#_2017-10-16_20_35_00_42758920:36
mordredpabelanger, clarkb, jlvillal : did the patches for releasenotes building fix ironic-inspector?20:36
jeblairmordred: can we talk about the sudo thing?20:36
dhellmannthat one is odd because it looks like the task failed but didn't cause the job to fail20:36
mordredjeblair: totally - although Im landing right now so it might need to be in a little bit ...20:37
mordredjeblair: feel free to -2 it to block it until we talk20:37
jeblairmordred: okay i was thinking we should add a new tox-functional job.  cause i don't think we want unittest jobs to not revoke sudo20:37
dhellmannand I'm not sure where to look for the add-launchpad-credentials role20:38
jlk jeblair that sounds better to me as well, don't let consumers opt-out of system protections20:38
fungimordred: the first patch fixed the tox-siblings issue in pre, but the second patch did not solve the rsync issue in post with missing tox logs for some envs20:38
* fungi updates the pad with that status20:39
mordredfungi: oh - yah - I remember this from scrollback now - bleh20:39
jeblairmordred: left comment on change so it's got a pin in it for later20:39
mordredjeblair: kk. that works for me too. cool20:39
mordreddhellmann: it's in zuul-jobs20:39
dhellmannmordred : thanks. I think this is just another key name error in the task definition20:40
pabelanger13384720:41
pabelangeroops20:41
mordreddhellmann: it's also possible that the template module wants src and dest not src and path20:41
clarkbjeblair: do you think we should continue to wait for 01?20:41
mordreddhellmann: yes - that's it20:41
dhellmannmordred : yeah20:41
dhellmannmordred : I wish I had time to help, but maybe we can recruit a small group to work on adding schema validation to all of these files?20:42
dhellmannseems like something that would be useful for ansible anyway, maybe there is already a linter20:42
jeblairclarkb: i think i've got a handle on the deadlock.  why don't you kill and proceed on ze01, i'll write a patch.20:42
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Use dest for template: not path  https://review.openstack.org/51240920:42
openstackgerritDoug Hellmann proposed openstack-infra/zuul-jobs master: fix the template instruction for writing launchpadlib/creds  https://review.openstack.org/51241020:42
openstackgerritgordon chung proposed openstack-infra/project-config master: drop legacy aodh jobs  https://review.openstack.org/51241120:42
mordreddhellmann: ^^ there's that one - and yes, adding more linting is DEFINITELY a thing that would be good20:42
* mordred abandons his patch20:42
clarkbjeblair: will do, thanks20:43
*** tpsilva has quit IRC20:43
clarkbjeblair: will I need to clean up those ssh-agents too?20:43
openstackgerritgordon chung proposed openstack-infra/openstack-zuul-jobs master: remove legacy aodh jobs  https://review.openstack.org/51241220:43
mordreddhellmann: +220:43
jeblairclarkb: not critical, but good to be tidy20:43
clarkbrgr20:43
jeblairmordred, dhellmann, pabelanger: iirc, we had linting there and then discovered a substantial bug in the linter20:44
dhellmannmordred : otoh, this whack-a-mole thing gives that fast feedback loop of "debug, fix, rinse, repeat" :-)20:44
mordreddhellmann: :)20:44
mordredjeblair: yes - this is very true20:44
jlvillalmordred: Not sure if anyone else responded. But no it did not fix ironic-inspector. Have a POST_FAILURE now.20:44
jeblairhopefully we can turn it back on soon20:44
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Harden exception handling around instance deletes  https://review.openstack.org/51241320:44
SamYaplewhat is infracloud-chocolate and infracloud-vanilla? and why are all my jobs timing out on them from being so slow :(20:44
dhellmannjeblair : ah. it seems like it would be pretty complicated to get it all 100%, so maybe it can be done incrementally20:44
mordredhaving a playbook that uses the role and running a syntax check on that playbook gets a fairly decently far distance down the path20:44
jlvillalmordred: That was about an hour ago though.20:44
clarkbfs backup is running now20:44
dhellmannas I said, i wish I could help instead of just complaining :-/20:45
pabelangerYah, I am hoping to start work on linters tomorrow20:45
SamYaplejobs that typically take 3-5m to run are timing out at 20m :/20:45
pabelangerif pressure on zuulv3-issues is down20:45
pabelangerSamYaple: which job?20:45
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Harden exception handling around instance deletes  https://review.openstack.org/51241320:45
SamYaplepabelanger: https://review.openstack.org/#/c/512400/20:46
dhellmannmordred : should I be concerned that the task failed but the job didn't?20:46
dhellmannpabelanger : ++20:46
SamYaplepabelanger: zuul is about to comment, but over half of the jobs hit the time cap of 20m and failed. all of those are on infracloud-vanilla/chocolate20:46
clarkbjeblair: also the fix for slow shutdown seemed to run into job failures20:47
clarkbjeblair: thinking we may want to get both sets of fixes in with the next round of restarts20:48
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Harden exception handling around instance deletes  https://review.openstack.org/51241320:48
openstackgerritBrian Rosmaita proposed openstack-infra/project-config master: Remove workflow +1 on glance_store from swift-core  https://review.openstack.org/51151720:49
pabelangerSamYaple: some ran in infracloud, are you using docker reverse proxy cache20:50
pabelangersounds like networking20:50
*** jcoufal_ has joined #openstack-infra20:50
SamYaplepabelanger: indeed. im using all the infra mirrors possible for everything20:50
*** ihrachys_ has joined #openstack-infra20:51
pabelangerSamYaple: can only suggest collecting docker log file, with debug enabled to see what is going on20:52
SamYaplefair enough20:52
SamYaplethanks!20:53
*** ihrachys has quit IRC20:53
pabelangerSamYaple: also possible the reverse proxy cache is cold, and caching docker things.20:53
dmsimardI feel noob for asking this, but does anyone know why I keep screwing up the etherpad formatting ? Especially around font types and copy/pastes.20:53
*** jcoufal has quit IRC20:53
clarkbjeblair: what is /var/lib/zuul/git?20:53
dmsimardclarkb: zuul merger root ?20:54
clarkbjeblair: it appears to be empty on some executors and not in others.20:54
pabelangerSamYaple: this doesn't look right either: http://logs.openstack.org/00/512400/2/check/loci-keystone/cda54e6/job-output.txt.gz#_2017-10-16_20_31_01_99428520:54
clarkboh ya I guess 01 would've been its own merger at one point20:54
clarkbjeblair: can I delete that to free up space and inodes on 01?20:54
fungii was under the impression all the executors were also running merger daemons20:54
SamYaplepabelanger: ill start collecting more logs and get back to you20:54
dmsimardfungi: that would make sense, actually, if an executor could use a merger instance hosted locally20:55
SamYaplethis might be a problem with async. i hadnt used it before these playbooks20:55
clarkbfungi: not that I have seen unless it's a thread in the main zuul-executor20:55
*** jcoufal has joined #openstack-infra20:55
dmsimardfungi: less traffic and opportunities for failure20:55
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Harden exception handling around instance deletes  https://review.openstack.org/51241320:55
*** edmondsw has quit IRC20:56
pabelangerSamYaple: yah, you're likely the first one to be using async tasks on zuulv3 too :)20:57
pabelanger512376 has been in gate pipeline for  over 2h25mins20:57
*** jcoufal_ has quit IRC20:57
pabelangergoing to see why that is20:58
clarkbjeblair: you have a shell on 01 from the 14th that is in /var/lib/zuul20:58
clarkbjlvillal: so I can't unmount the fs, can you shut that shell down?20:58
clarkber jeblair ^20:58
* jlvillal was confused...20:58
pabelangerI think 512376 had a job running on ze0220:59
clarkbpabelanger: ya I'm worried 02 is in the same boat as 01 and will need more forceful stopping20:59
pabelanger2017-10-16 18:48:10,741 DEBUG zuul.AnsibleJob: [build: 2a418f7682104223b5d41155a00ad0a4] Abort: no process is running20:59
pabelangerthat is the last thing in log20:59
pabelangerfor the build20:59
pabelangerclarkb: kk21:00
pabelangerI'll hold off debugging for now21:00
*** smatzek has joined #openstack-infra21:00
dmsimardpabelanger: hey now that we're back on v3, I'll do a recheck on https://review.openstack.org/#/c/504238/ .. could you fetch executor logs when it inevitably fails ?21:01
jeblairclarkb: that was pre-power outage, i've lost it21:01
jeblairclarkb: feel free to kill as needed21:01
clarkbjeblair: will do21:01
pabelangerdmsimard: will have to be tomorrow, just about to afk for now21:02
pabelangerI have a local meetup to attend this evening21:02
dmsimardpabelanger: ok, np, maybe another infra-root can help.. I've been trying to nail down that bug ever since the ptg :)21:02
*** sree has joined #openstack-infra21:02
*** rockyg has joined #openstack-infra21:02
clarkbjeblair: did you see question about deleting /var/lib/zuul/git ?21:02
clarkbjeblair: it has ~8GB on 01 but seems empty elsewhere21:02
*** smatzek has quit IRC21:04
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Executor: stop jobs in parallel on shutdown  https://review.openstack.org/51238321:04
*** trown is now known as trown|outtypewww21:04
jeblairclarkb: missed that.  it can be deleted; that was probably a puppet install error (installed it as a merger rather than executor perhaps)21:05
jeblairclarkb: updated that change ^ with the deadlock fix (since they touch the same code)21:05
clarkbjeblair: cool, I'll be sure to delete the contents before starting the service21:05
clarkbI'll leave the dir there as I think puppet may be creating it as it is everywhere (just only full of stuff on 01)21:06
*** mat128 has quit IRC21:06
jeblairdmsimard: i can grab logs21:06
dmsimardjeblair: ok, the recheck is fired, I'll let you know when I got the finger url.21:07
*** sree has quit IRC21:07
jeblairdmsimard: kk21:07
openstackgerritMerged openstack-infra/zuul-jobs master: fix the template instruction for writing launchpadlib/creds  https://review.openstack.org/51241021:09
dmsimardjeblair: finger://ze08.openstack.org/e974e5b38df9480ab1385fb45e40449721:09
openstackgerritboden proposed openstack-infra/project-config master: zuul changes for vmware-nsx  https://review.openstack.org/51242621:10
openstackgerritDavid Moreau Simard proposed openstack-infra/openstack-zuul-jobs master: Add integration tests for emit-job-header and set-zuul-log-path-fact  https://review.openstack.org/51194821:11
dmsimardI guess I'll add the include_role post failure (truncated json) issue to the zuulv3-issues pad.21:13
clarkb01 is starting now with fs reinoded21:15
clarkbjeblair: why did you remove the if not self._running from mergerJobDispatch in ps2?21:17
jeblairclarkb: oh forgot to mention that.  it was because merge jobs are fast, and we can afford to let a merge job we just got go ahead and finish while we're running down the real jobs.  that will keep someone from getting a MERGER_ERROR.21:18
*** yamamoto has joined #openstack-infra21:18
clarkbjeblair: and to clarify the git/ dir thing earlier. Executors do run their own mergers in process (as a thread) and they use executor-git/ not git/ ?21:19
jeblairdmsimard: why is the integration job defined in openstack-zuul-jobs instead of zuul-jobs?21:19
jeblairclarkb: correct.  there may be raisins there.21:20
jeblairclarkb: (at the very least, it lets you run a merger process on an executor without collision (or special configuration))21:21
*** bobh has quit IRC21:21
prometheanfireis openstack running checks for bad ssh/gpg keys for the infineon rsa badness?21:23
clarkbjeblair: 01 looks possibly unhappy21:23
clarkbjeblair: exit codes are all 25021:23
jeblairdmsimard: i asked because in my comment on 509436 i really wanted to say 'squash all 3 changes together'21:23
jeblairclarkb: go ahead and stop it21:23
*** florianf has quit IRC21:23
clarkbok stopping now21:23
*** yamamoto has quit IRC21:24
clarkb2017-10-16 21:23:57,140 DEBUG zuul.AnsibleJob: [build: dab61331cba040298f98311cf695adb1] Ansible output: b"ERROR! Unexpected Exception: 'NoneType' object has no attribute 'isatty'"21:24
clarkbthat appears to be the cause21:24
odyssey4meevening all - I'm working on a starting set of in-repo jobs for OSA and need a little guidance... for now I'm doing a like-for-like replacement with https://review.openstack.org/512414 but it's failing because it doesn't have sudo access... where exactly is sudo revoked?21:25
*** florianf has joined #openstack-infra21:25
clarkbodyssey4me: jeblair figured out it is revoked by the unittest job which is parent to tox which is parent to openstack-tox21:25
jeblairodyssey4me: mordred started working on this between takeoffs and landings in https://review.openstack.org/51240821:26
odyssey4meclarkb jeblair ah ok - what's the options in this case? I'm happy to propose something or review.21:26
odyssey4meah, that looks like a start - thanks21:27
jeblairodyssey4me: i think the outstanding question is will the result be that osa just writes a new job completely from scratch, or is there a sort of tox-functional-test paradigm that's repeated enough we should make a new general purpose parent job for that.21:27
jeblairodyssey4me: i suspect the latter21:28
clarkbto see full traceback use -vvv21:28
odyssey4mejeblair fair question - honestly our situation is that for now we need to use tox as it's the quickest way to get like-for-like tests to what we had with jenkins... but in time we will rewrite how the tests are executed to be rid of tox21:28
jeblairclarkb: okay i suspect something about the control persist has gone haywire21:28
jeblairodyssey4me: *nod* i think other projects will continue to use tox even after you stop21:29
jeblairclarkb: i guess we should have switched to verbose before starting :/21:29
odyssey4meoh absolutely, although I'm not sure they'll need sudo though21:29
jeblairodyssey4me: i think a lot of them do (for *functional* tests)21:30
odyssey4methe flag patch is a good one, with the comments already suggested21:30
openstackgerritgordon chung proposed openstack-infra/project-config master: drop legacy panko jobs  https://review.openstack.org/51242921:30
jeblairodyssey4me: i have very strong feelings about the use of sudo in unit tests, so i just don't want us to accidentally make it easy for someone to disable that for a tox-py35 job.21:31
odyssey4meyeah, agreed21:31
clarkbjeblair: thoughts on what we should do next on 01?21:31
jeblairclarkb: i kind of want to start it again, turn on verbose, get an error, then stop21:31
jeblairclarkb: and hope all that happens fast enough all the jobs it grabs get abort/retried21:32
clarkbjeblair: ok I think it is still running since the last stop but close to being done21:32
*** ijw has quit IRC21:32
clarkbjeblair: how do I turn on verbose?21:32
dmsimardjeblair: pretty sure I added them in zuul-jobs first and someone asked to move them in ozj21:32
jeblairclarkb: 'zuul-executor verbose'21:32
jeblairdmsimard: was that someone me? :)21:33
clarkboh I'd run it in the foreground then?21:33
jeblairclarkb: no, that sends a command to the socket21:33
clarkbah21:33
dmsimardjeblair: maybe21:33
smcginnisIf someone has a moment for pointers, looks like we still have failures in the release-post jobs: http://logs.openstack.org/81/81f73d804ccf51f985b902d84bbf88ef13f2c32b/release-post/tag-releases/ce6f50f/job-output.txt.gz#_2017-10-16_21_29_35_21797921:33
clarkbjeblair: should I go ahead and kill the existing processes, start it, turn on verbose then stop it then?21:33
clarkbalso I think 02 is ready to be manually stopped maybe21:34
jeblairclarkb: yeah, though wait for a 250 before stopping21:34
openstackgerritgordon chung proposed openstack-infra/openstack-zuul-jobs master: remove legacy panko jobs  https://review.openstack.org/51243021:34
jeblairdmsimard: is that ze08 job still running?21:34
clarkbjeblair: also comment on https://review.openstack.org/#/c/512383/221:34
* clarkb proceeds on 0121:34
dmsimardjeblair: nop21:35
jeblairdmsimard: ok i'll grab error21:35
*** jcoufal_ has joined #openstack-infra21:35
dmsimardjeblair: ok, making something to eat before meeting, I'll catch up.21:35
jeblairdmsimard: http://paste.openstack.org/show/623764/  do we need to set verbose for that too?21:36
jeblairclarkb: i see errors21:36
clarkbya about to stop it21:36
clarkbok init stop run. Will watch it to see if I have to manually stop it soon21:37
dmsimardjeblair: hmm, I guess so ? Are we able to do that without reloading executors ?21:37
jeblairdmsimard: yes, though we'll have to do it on all the executors since we don't know where it will end up21:37
*** jcoufal has quit IRC21:38
jeblairwell... actually... it should take effect for any playbooks started after we set it...21:38
jeblairdmsimard: so if i'm really fast, i may just be able to do it on one21:38
jeblairdmsimard: anyway, i'll have to shelve that for a minute while i switch to the thing with clarkb21:38
dmsimardjeblair: sure, I can't look at it immediately anyway21:39
jeblairwhich, btw,  is: http://paste.openstack.org/show/623765/21:39
clarkb oh I just made a paste too :)21:39
clarkbyours has slightly more context so we'll go with it21:39
*** claudiub has quit IRC21:40
smcginnisAny idea if this is a known issue? http://logs.openstack.org/81/81f73d804ccf51f985b902d84bbf88ef13f2c32b/release-post/tag-releases/ce6f50f/job-output.txt.gz#_2017-10-16_21_29_35_21797921:40
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Harden exception handling around instance deletes  https://review.openstack.org/51241321:41
clarkbsmcginnis: I know pabelanger and dhellmann were looking at related failures but that one looks new to me (but I've been working other things)21:43
*** ijw has joined #openstack-infra21:43
smcginnisclarkb: Yeah, dhellmann fixed a few things, now this appears to be the next roadblock.21:44
clarkbjeblair: so sys.stdin is None which makes sense for a daemon I think?21:44
SamYaplepabelanger: looks like it was async and me :/ . I was not retrying enough times when polling the async job, so the job wasn't timing out, the "task" was failing because it didn't complete fast enough.21:44
SamYaplea slower host would aggravate that issue21:45
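SamYaple's diagnosis comes down to arithmetic: a poll loop gives up after roughly retries × delay seconds, however long the background task really needs. The Python sketch below simulates that with a fake clock; the durations, retry counts and helper names are hypothetical and only loosely modelled on a retries/delay style of polling.

```python
class FakeClock:
    """Stand-in for time so the simulation runs instantly."""

    def __init__(self):
        self.now = 0

    def sleep(self, seconds):
        self.now += seconds


def poll_until_done(is_finished, retries, delay, sleep):
    """Return the attempt number on which the task was seen finished, else None."""
    for attempt in range(1, retries + 1):
        if is_finished():
            return attempt
        sleep(delay)
    return None


def simulate(job_seconds, retries, delay):
    clock = FakeClock()
    return poll_until_done(lambda: clock.now >= job_seconds, retries, delay, clock.sleep)


# Polling budget is about 30 * 10 = 300 seconds in both cases.
fast_host = simulate(job_seconds=200, retries=30, delay=10)
slow_host = simulate(job_seconds=600, retries=30, delay=10)
print(fast_host, slow_host)
```

On the fast host the task is seen finished within the budget; on the slow host the poller runs out of retries first, so the polling task fails even though the underlying work is still running.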
smcginnisThe playbooks/release/tag.yaml file it's pointing to looks fine to me, so not really sure where to take it from here.21:45
*** chlong has quit IRC21:45
jeblairclarkb: yeah; i switched to using the nullwrap driver for the popen implementation rather than bubblewrap; there must be something about that21:45
odyssey4meif I want to make use of a project template, for example https://git.openstack.org/cgit/openstack-infra/openstack-zuul-jobs/tree/zuul.d/project-templates.yaml#n253 - how do I do that in my pipeline?21:46
dmsimardodyssey4me: you want to create a new template or use an existing one ?21:47
odyssey4medmsimard use an existing one for now21:47
dmsimardodyssey4me: nothing special, it is shared by default21:48
odyssey4meyeah, but how do I configure my project to use it :)21:48
dmsimardOh, one sec21:48
*** MasterOfBugs has quit IRC21:49
*** pramodrj07 has quit IRC21:49
clarkbjeblair: this is the wrapped=False bits of the persistent ssh change?21:49
jeblairclarkb: ya21:49
jeblairclarkb: i'm having trouble replicating this locally (though i haven't gone full-on daemon yet)21:50
clarkbjeblair: ya my guess is it works without being a daemon because when you aren't a daemon sys.stdin is a thing21:50
clarkbjeblair: but part of daemonization is to close that fd21:50
*** aeng has joined #openstack-infra21:50
dmsimardodyssey4me: https://docs.openstack.org/infra/zuul/feature/zuulv3/user/config.html#attr-project.templates21:51
clarkbok ze01 never fully stopped I am going to kill the processes now21:51
dmsimardIt's a list attribute, see beginning of https://raw.githubusercontent.com/openstack-infra/project-config/master/zuul.d/projects.yaml for example21:51
odyssey4meah, thanks dmsimard - I misread that earlier21:51
clarkbbut then I need to step away for a bit (will return to hopefully happier zuul and continue to restart things on new filesystem)21:52
clarkbalright 01 is in that steady state now21:54
clarkbI'll be back in about 30-45 minutes21:54
*** jcook has quit IRC22:00
fungimeeting time?22:01
*** brett has joined #openstack-infra22:04
brettHi, hoping to get a couple of simple gerrit reviews for Jenkins Job Builder changes:22:05
bretthttps://review.openstack.org/#/c/505532/22:05
bretthttps://review.openstack.org/#/c/505560/22:05
*** Shrews has quit IRC22:06
jeblairbrett: you might try in the #openstack-jjb channel22:06
brett@jeblair thanks22:07
*** brett has left #openstack-infra22:07
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Set ansible stdin to /dev/null  https://review.openstack.org/51243422:08
jeblairclarkb, SpamapS, fungi: ^ i *think* that will fix it.  this is very difficult to test though.  so i will manually apply that on ze01 and restart.22:08
*** jcoufal_ has quit IRC22:11
jeblairseems better so far22:11
fungiis it legitimate to use jinja parameter substitution in a shell task?22:11
jlkit should be fine, I think22:12
jlkunless you're having it specifically not run the bits through the templating engine.22:12
fungithe tag-releases job seems to think this is a templating error in project-config:playbooks/release/tag.yaml:22:13
fungiRELEASES_DIR=/home/fungi/src/{{ zuul.project.canonical_name }}22:13
jlkhrm.22:14
fungiERROR! failed at splitting arguments, either an unbalanced jinja2 block or quotes: RELEASES_DIR=$HOME/src/{{ zuul.project.canonical_name }}22:14
*** salv-orlando has quit IRC22:14
fungino quote marks anywhere in there, and the jinja doesn't _seem_ imbalanced22:15
jlkoh, I wonder if it's because of the | thing that it's not properly quoting the bits to template.22:15
jlkI wonder.22:15
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Executor: stop jobs in parallel on shutdown  https://review.openstack.org/51238322:17
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Set ansible stdin to /dev/null  https://review.openstack.org/51243422:17
jeblairfungi: it's the "'" in it's22:18
jeblairansible hates contractions22:18
*** panda|rover is now known as panda|rover|off22:19
fungid/oh!22:19
jeblair*also* that shouldn't be a contraction!22:19
jeblair*This* is why grammar is important!  :)22:19
fungii keep forgetting quotes in comments throw off jinja there22:19
fungismcginnis: ^ *sigh*22:19
jlkyeah that is kinda dumb.22:19
jlkprobably something COULD be done22:19
fungipatch on the way22:20
smcginnisfungi: Thanks22:20
*** dingyichen has quit IRC22:20
*** yamamoto has joined #openstack-infra22:21
smcginnisfungi, jeblair: So it's the "it's" in the comment on line 6 throwing it off?22:21
jeblairsmcginnis: yep22:21
smcginnisjeblair: And you're right, should be its. :D22:21
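The failure mode is easy to reproduce by analogy. Ansible uses its own argument splitter, so the Python sketch below uses the standard library's shlex purely as a stand-in to show how a lone apostrophe reads as an unterminated quote; the comment text and the path are made up, not the real contents of tag.yaml.

```python
import shlex


def splits_cleanly(text):
    """Return True if a quote-aware splitter can tokenize the text."""
    try:
        shlex.split(text)
        return True
    except ValueError:
        # Raised by shlex when a quote is opened but never closed.
        return False


# Made-up script bodies; only the apostrophe differs between them.
with_contraction = "# reset the clone to it's original state\nRELEASES_DIR=$HOME/src/example"
without_contraction = "# reset the clone to its original state\nRELEASES_DIR=$HOME/src/example"

print(splits_cleanly(with_contraction), splits_cleanly(without_contraction))
```

The splitter has no idea the apostrophe sits inside a shell comment, which is why correcting the grammar also fixes the parse.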
smcginnisfungi: I can take care of that if you have better things to do.22:22
*** rloo has left #openstack-infra22:22
smcginnisfungi: Just let me know.22:22
openstackgerritJeremy Stanley proposed openstack-infra/project-config master: Correct grammar in tag-releases comment line  https://review.openstack.org/51243722:22
fungismcginnis: already done22:22
fungitook way longer to write the commit message22:23
smcginnisfungi: If only they could all be that easy. ;)22:23
clarkbI hate that ansible bug22:23
fungismcginnis: i enjoyed that i could masquerade that as a really pedantic grammar patch ;)22:23
clarkbjeblair: I'm reviewing the fix change now, also I see you saw my can of worms comment so reviewing that too22:23
johnsomzuulv3 noob question: I have a horizon plugin that requires horizon installed for the openstack-tox-py27/35 gates.  Should I just create a new job with openstack-tox-py27 as the parent that includes the horizon package?  This used tox_install.sh before.22:23
jeblairfungi: i enjoyed +2ing it.22:24
jeblairjohnsom: i think there's a project-template for that now22:24
*** edmondsw has joined #openstack-infra22:25
*** aviau has quit IRC22:25
ianwmordred: (sorry, catching up) ... are we happy with increased permissions for the "tox as other user" issue.  should i abandon https://review.openstack.org/#/c/512194/ (zuul-cloner-shim: don't link if users different)22:25
johnsomjeblair I looked on the docs pages and nothing jumped out at me, any hints to the name?22:25
*** aviau has joined #openstack-infra22:25
jeblairjohnsom: https://git.openstack.org/cgit/openstack-infra/openstack-zuul-jobs/tree/zuul.d/project-templates.yaml#n33522:25
johnsomjeblair Perfect!  Thanks22:25
*** yamamoto has quit IRC22:26
*** masber has joined #openstack-infra22:26
jeblairclarkb: afaict the fix is working on ze01.22:26
clarkbjeblair: re stdin looks like regardless of wrapper method we'll just set stdin to /dev/null which should be fine since all of ansible et al's input are via config and command line args22:26
jeblairclarkb: yep, that's my thinkin22:27
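[Editor's sketch, not from the log] The idea discussed above, pointing a child process's stdin at /dev/null so it can never block waiting for input, can be illustrated with Python's standard subprocess module. This is only a minimal sketch and is not Zuul's actual code; the helper name run_with_null_stdin and the CHILD command are hypothetical.

```python
import subprocess
import sys


def run_with_null_stdin(argv):
    """Run argv with stdin attached to the null device and return its stdout.

    Hypothetical helper for illustration only; not taken from Zuul.
    """
    result = subprocess.run(
        argv,
        stdin=subprocess.DEVNULL,   # child sees immediate EOF on any read
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,
        check=True,
    )
    return result.stdout


# A child that tries to read all of stdin and prints what it received.
CHILD = [sys.executable, "-c", "import sys; print(repr(sys.stdin.read()))"]

if __name__ == "__main__":
    # The read returns an empty string instead of hanging.
    print(run_with_null_stdin(CHILD))
```

Because the child takes all of its input from configuration and command-line arguments, an always-empty stdin should not change its behaviour.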
johnsomHmm, this may not be getting updated: https://docs.openstack.org/infra/openstack-zuul-jobs/jobs.html as I don't see openstack-python-jobs-horizon there.22:27
SamYapleif someone on my team were to ask infra to decrypt the secrets for them, would infra be able to do that (planning hit-by-a-bus scenarios)22:27
clarkbSamYaple: as long as we don't lose the secret key yes. I'm not sure we have those in the backup rotation yet though22:28
jeblairjohnsom: yes, i think we forgot to add support for project-templates to zuul-sphinx (which is what renders that).  a terrible mistake in retrospect -- those are way more likely to be referenced than jobs.  we'll correct it soon, but it'll probably take a few days to plumb everything through.22:28
clarkbjeblair: I don't think I can do any more restarts until those two changes merge then. Also should I plan to restart 01 again after those merge so that it is running off of HEAD?22:29
johnsomNP22:29
SamYapleclarkb: cool. but if you lose the private key then someone will have to go in and reset the secrets anyway22:29
johnsomI will stick that somewhere in the FAQ etherpad22:29
jeblairclarkb: we should really add that to backups!  :)22:29
*** andreww has quit IRC22:29
clarkbjeblair: we probably should22:29
*** edmondsw has quit IRC22:29
jeblairor, i guess, tell everyone if we aren't so they know what to expect.  :)22:30
* jeblair adds item to infra meeting agenda22:30
openstackgerritMerged openstack-infra/tripleo-ci master: Switch cistatus page to zuul v3  https://review.openstack.org/50854622:32
*** ijw has quit IRC22:33
openstackgerritMerged openstack-infra/project-config master: Correct grammar in tag-releases comment line  https://review.openstack.org/51243722:33
*** ijw has joined #openstack-infra22:34
jeblairSpamapS: are you available to merge https://review.openstack.org/512383 ?22:35
openstackgerritMichael Johnson proposed openstack-infra/project-config master: Fix octavia-dashboard py27/35 jobs  https://review.openstack.org/51243922:35
*** esberglu has quit IRC22:36
*** florianf has quit IRC22:37
*** felipemonteiro_ has quit IRC22:37
*** gouthamr has quit IRC22:37
jeblairclarkb: i'm inclined to just merge that with the 2 of us, and ask SpamapS to retro-review it later22:40
clarkbjeblair: wfm22:40
clarkbjeblair: if you can work on that (and the kick.sh) I am going to start prepping 0222:41
dmsimardInteresting how k8s didn't develop their own "stackalytics": https://devstats.k8s.io/dashboard/db/companies-stats?orgId=1&var-period=y&var-metric=activity&var-repogroup_name=All&var-repogroup=all&var-companies=All22:41
clarkbjeblair: the fs backup takes long enough that you should be done merging things by the time I am ready to start the service again22:41
clarkbdmsimard: they kind of did, it's a bunch of bigtable google stuff on top of the github integration aiui22:42
clarkbok 02 is all stopped now. Moving ahead with the fs work22:44
SpamapSjeblair: I'm OTP for a bit, available in 14 for sure.22:46
*** Shrews has joined #openstack-infra22:47
*** wolverineav has quit IRC22:47
*** rbrndt has quit IRC22:50
*** rockyg has quit IRC22:50
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Import legacy playbooks  https://review.openstack.org/51216422:50
openstackgerritMichael Johnson proposed openstack-infra/project-config master: Fix Octavia jobs to be voting again  https://review.openstack.org/51244422:51
clarkbrestoring fs backup on 02 now (then will be ready to start the service whenever code is installed)22:52
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Executor: stop jobs in parallel on shutdown  https://review.openstack.org/51238322:54
*** sdague has quit IRC22:56
*** lbragstad has quit IRC22:57
*** gouthamr has joined #openstack-infra22:57
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Set ansible stdin to /dev/null  https://review.openstack.org/51243423:01
*** gordc has quit IRC23:01
jeblairclarkb: i'll start kicking things23:01
clarkbthanks23:01
clarkbI'm ready to start zuul on 02 as soon as that is done23:01
*** aeng has quit IRC23:02
*** smarcet has joined #openstack-infra23:03
*** smarcet has quit IRC23:04
*** dhinesh has quit IRC23:05
*** esberglu has joined #openstack-infra23:11
clarkbjeblair: the git repo on 02 looks updated but pip doesn't seem to know that zuul is installed23:13
clarkbjeblair: is it still running?23:13
jeblairclarkb: just finished23:14
jeblairclarkb: make sure you use pip323:14
clarkboh right23:14
clarkbthanks23:14
clarkbok I am starting zuul-executor on 02 now23:14
jeblair++23:14
*** esberglu has quit IRC23:15
clarkbjeblair: did you want to restart 01 too so that it is running the same code as the others?23:15
clarkbI guess I should restart 03 too23:15
clarkband 0923:15
*** salv-orlando has joined #openstack-infra23:15
jeblairclarkb: yeah, we should do those, but maybe after the others?23:16
clarkbok23:16
clarkbI'll do 4-8 then 1023:16
clarkb2017-10-16 23:16:40,264 DEBUG zuul.AnsibleJob: [build: 21bb90abd9394f3bbfe1f7b3561952b5] Ansible exit code: 023:16
clarkb02 lgtm23:16
* clarkb stops on 0423:16
clarkband 0523:17
*** ijw has quit IRC23:17
jeblairclarkb: 02 lgtm too23:17
*** tosky has quit IRC23:17
*** aeng has joined #openstack-infra23:19
*** mat128 has joined #openstack-infra23:19
clarkb16384 bytes per inode is apparently an ext4 default (or a value close to that). What do we think about updating the mkfs in the system-config script for using the ephemeral drive to just hardcode -i 8192?23:20
clarkbseems like we tend to need more inodes than not and the actual cost isn't that large to have extra inodes23:20
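[Editor's sketch, not from the log] The trade-off above can be put into rough numbers. The figures below are only an estimate under stated assumptions: a hypothetical 100 GiB filesystem, a 256-byte on-disk inode size (assumed here, not stated in the log), and no rounding to block-group boundaries, which a real mkfs would apply.

```python
GIB = 1024 ** 3


def inode_count(fs_bytes, bytes_per_inode):
    """Approximate inode count for a given bytes-per-inode ratio (the -i value)."""
    return fs_bytes // bytes_per_inode


def inode_table_bytes(fs_bytes, bytes_per_inode, inode_size=256):
    """Approximate space taken by inode tables; inode_size=256 is an assumption."""
    return inode_count(fs_bytes, bytes_per_inode) * inode_size


if __name__ == "__main__":
    fs_bytes = 100 * GIB  # hypothetical ephemeral drive size
    for ratio in (16384, 8192):
        count = inode_count(fs_bytes, ratio)
        table_gib = inode_table_bytes(fs_bytes, ratio) / GIB
        print("-i %5d: %10d inodes, ~%.4f GiB of inode tables" % (ratio, count, table_gib))
```

Under these assumptions, halving the ratio doubles the inode count while the inode tables grow from roughly 1.6% to roughly 3.1% of the filesystem.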
clarkbok 04 is stopped migrating its fs now23:20
dmsimardWhen we generate ara reports (on the executors), the reports are stored in the bubblewrap env right ? Those are discarded/deleted after every build ?23:21
*** salv-orlando has quit IRC23:21
clarkbthe bubblewrap envs are deleted unless we set keep on the job23:22
*** yamamoto has joined #openstack-infra23:22
clarkb05 has also stopped quickly so proceeding to migrate it too23:24
jeblairkeep is an executor run-time flag, but yes23:25
jeblair(just wanted to clarify you can't change that in a job definition)23:25
clarkbjeblair: is it global to all jobs?23:26
*** gongysh has joined #openstack-infra23:27
jeblairclarkb: yes23:27
*** gongysh has quit IRC23:27
jeblairit's kind of a blunt debugging tool23:27
clarkbit is a big hammer23:28
*** yamamoto has quit IRC23:28
*** gmann_afk is now known as gmann23:30
*** ijw has joined #openstack-infra23:30
clarkb04 is starting back up now23:31
clarkb2017-10-16 23:31:56,554 DEBUG zuul.AnsibleJob: [build: 1c66f0d4b3804a91801b87c4a3c8e232] Ansible complete, result RESULT_NORMAL code 023:32
clarkbstopping 06 now23:32
*** ijw has quit IRC23:33
clarkb05 starting now23:35
clarkb2017-10-16 23:35:49,245 DEBUG zuul.AnsibleJob: [build: 1ca9202922b245e2ab7e20cbedfe8209] Ansible complete, result RESULT_NORMAL code 023:36
*** wolverineav has joined #openstack-infra23:36
*** jascott1 has joined #openstack-infra23:37
*** aeng has quit IRC23:40
*** aeng has joined #openstack-infra23:41
SamYaplei just want to say, great job on zuulv3. even the seemingly obscure scenarios seem already well thought out23:41
SamYapleim more and more impressed the more i use it23:41
clarkb07 has been stopped23:41
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Import legacy playbooks  https://review.openstack.org/51216423:42
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Handle tox environments via tox config  https://review.openstack.org/51244823:45
openstackgerritJames E. Blair proposed openstack-infra/zuul-sphinx master: Add support for project-templates  https://review.openstack.org/51244923:46
jeblairSamYaple: thanks!23:46
*** bobh has joined #openstack-infra23:46
clarkb06 starting now23:47
clarkb2017-10-16 23:47:13,795 DEBUG zuul.AnsibleJob: [build: 98eecfb7a381464088656bb2ed778356] Ansible complete, result RESULT_NORMAL code 023:47
clarkbon to 0823:47
SamYaplejeblair: my dreams are almost fully realized :) new docker image pushed per commit to openstack project23:47
mordredpabelanger, clarkb, jlvillal : https://review.openstack.org/512448 should handle the tox directory thing much more comprehensively23:47
*** jascott1 has quit IRC23:47
*** jascott1 has joined #openstack-infra23:48
* mordred shuts laptop lid to eat airplane dinner23:48
jeblairSamYaple: yay!23:48
SamYaplefun fact: food tastes different on airplanes because our tastebuds get all smushed from decompression leading to bland tastes23:49
clarkb07 is done 2017-10-16 23:51:32,890 DEBUG zuul.AnsibleJob: [build: 4733a27cc2274b00aeab22e5202371e4] Ansible complete, result RESULT_NORMAL code 023:51
clarkbstopping 10 now23:52
*** ijw has joined #openstack-infra23:52
*** gildub has joined #openstack-infra23:55
*** ijw has quit IRC23:56
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [DNM] start at templating out the zuulv3 jobs  https://review.openstack.org/51245023:57