Monday, 2017-09-04

<openstackgerrit> Merged openstack-infra/zuul feature/zuulv3: Support IPv6 with zuul_stream  https://review.openstack.org/500401 [00:02]
<openstackgerrit> Merged openstack-infra/zuul feature/zuulv3: Use afs_publisher_target for afs-docs  https://review.openstack.org/500406 [00:15]
*** xinliang has quit IRC [02:22]
*** xinliang has joined #zuul [02:34]
*** xinliang has quit IRC [02:34]
*** xinliang has joined #zuul [02:34]
<jeblair> mordred: http://zuulv3.openstack.org/static/stream.html?uuid=68504b89c2454917865fbe2dc1d6b5a6&logfile=console.log hit the stop-on-etcd3 problem [02:59]
<jeblair> it appears that the script continued [02:59]
<jeblair> the next several steps are to write a sha256sum file, then untar the download; both of those appear to have happened (i logged into the node and inspected) [02:59]
<jeblair> this is the non-boring part of the process list: http://paste.openstack.org/show/620283/ [03:00]
<jeblair> pstree: http://paste.openstack.org/show/620284/ [03:02]
<jeblair> better pstree: http://paste.openstack.org/show/620285/ [03:02]
<jeblair> it looks like the script has moved on all the way to the pip install [03:03]
<jeblair> which has stopped at write(1, "Successfully built uwsgi\n", 25 [03:03]
<jeblair> so it's probably stuck at output buffering [03:04]
<openstackgerrit> James E. Blair proposed openstack-infra/zuul feature/zuulv3: DNM: test wget  https://review.openstack.org/500426 [03:07]
<openstackgerrit> James E. Blair proposed openstack-infra/zuul feature/zuulv3: DNM: test wget  https://review.openstack.org/500426 [03:22]
<openstackgerrit> Tristan Cacqueray proposed openstack-infra/zuul feature/zuulv3: sql: normalize logger name  https://review.openstack.org/500431 [03:42]
*** persia has quit IRC [04:20]
*** bhavik1 has joined #zuul [04:53]
*** piccobit has joined #zuul [05:03]
<piccobit> hi, we're using submodules in our projects and we're currently trying to figure out how we can trigger a build in a project if something gets changed in an embedded submodule. any hints? [05:06]
<piccobit> ah, forget my question, just saw the discussion regarding submodules some days ago! [05:13]
*** piccobit has quit IRC [05:13]
*** hashar has joined #zuul [06:18]
*** hashar is now known as hasharAway [06:18]
<openstackgerrit> Jamie Lennox proposed openstack-infra/zuul-jobs master: Allow overriding the workspace directory in prepare-workspace  https://review.openstack.org/500466 [07:25]
<openstackgerrit> Merged openstack-infra/zuul-jobs master: upload-logs/bindep: fix indentation  https://review.openstack.org/500339 [07:32]
* tobiash is back from vacation [08:17]
<openstackgerrit> Jamie Lennox proposed openstack-infra/zuul feature/zuulv3: Print SIGTERM logging to debug  https://review.openstack.org/500476 [08:23]
*** hasharAway is now known as hashar [08:42]
*** electrofelix has joined #zuul [08:56]
*** bhavik1 has quit IRC [11:01]
<tobiash> woot, rebased my zuul deployment after more than 3 weeks and nothing broke :) [11:48]
*** jkilpatr has joined #zuul [11:50]
<mordred> tobiash: seriously? are you sure you rebased it properly? I'm sure we MUST have broken something [12:59]
<mordred> jeblair: yes - this is EXACTLY the same as what happened when I looked at it [13:00]
<mordred> jeblair: as in, I saw the sha and the untar and the python was stopped on something related to uwsgi [13:00]
<mordred> tobiash: welcome back! [13:01]
* tobiash thinks he still knows how to use git rebase [13:03]
<tobiash> ;) [13:03]
<openstackgerrit> Merged openstack-infra/zuul feature/zuulv3: sql: normalize logger name  https://review.openstack.org/500431 [13:30]
<pabelanger> morning [13:34]
*** hashar is now known as hasharAway [13:49]
<openstackgerrit> Monty Taylor proposed openstack-infra/zuul feature/zuulv3: Add python backport PPA during stream test  https://review.openstack.org/500549 [13:50]
<openstackgerrit> Monty Taylor proposed openstack-infra/zuul feature/zuulv3: Set ansible_python_interpreter in stream test  https://review.openstack.org/500550 [13:50]
<electrofelix> is there a known issue where a change in the gate can be enqueued into the check pipeline through a 'recheck' command and result in a spurious 'merge failure' due to zuul resetting the merge attribute on the change object? [13:55]
<electrofelix> saw it with a GitHub Enterprise PR [13:56]
<openstackgerrit> Merged openstack-infra/zuul feature/zuulv3: Move ara output generation to post playbook  https://review.openstack.org/500403 [14:16]
<clarkb> electrofelix: are you sure it isn't a valid merge failure? [14:22]
<clarkb> every job run comes with its own constructed git tree [14:22]
<electrofelix> clarkb: positive, I went through the log and I could see when the merger succeeded and the corresponding 'complete, merged: True, updated: False, commit: None' in the scheduler [14:32]
<electrofelix> clarkb: but then the reporter had "Reported change 38,9a54b8475f906256d012e250753870ce25b2cc91 status: all-succeeded: True, merged: False" [14:32]
<electrofelix> which stumped me for quite a while until I spotted 'getItemForChange(self, change)' can return an item that is already queued on any of the other pipelines [14:33]
<electrofelix> and since it got added to the check pipeline without it actually being a new change, it didn't trigger the code that would kick out the existing change in the gate queue [14:34]
<electrofelix> The sequence was: run check, run gate, trigger check due to 'recheck', less than 10 seconds later the gate finished and reported merge failed, subsequently the merger for the triggered check ran [14:35]
<electrofelix> Actually that's not quite right [14:42]
<electrofelix> PR38 queued in check queue & passes, queued in gate queue, merger runs and passes, 'recheck' issued on PR38, merger runs and passes at "17:11:42,209", gate jobs complete report merge failure at "2017-09-01 17:11:46,207", recheck completes, change is requeued into gate and passes [14:42]
<electrofelix> How it went from 'merged: True' to 'merged: False' is stumping me, but I can only assume it was some form of race with the same change going into the check queue at the same time [14:47]
<electrofelix> I would point out that if the change had really failed to merge, I would have expected a message containing 'Unable to merge change ...' to appear in the log based on the scheduler code [14:48]
<openstackgerrit> James E. Blair proposed openstack-infra/zuul feature/zuulv3: DNM: test wget  https://review.openstack.org/500426 [14:52]
<electrofelix> Is it possible that after emitting the log message from merger client, that part of the call to self.sched.onMergeCompleted(..) only completed after part of the reportItem had already retrieved the value False using 'self.sched.onMergeCompleted()'? [14:52]
<electrofelix> Fundamentally it seems like there is a problem where 'recheck' can result in the same change appearing in the check and gate queues at the same time, and reuse of the same item object definitely has the potential for a race to occur [14:54]
<electrofelix> I'm just not quite sure what the fix for this scenario should be, make the comment trigger be ignored if the same change is in the gate (how?), or abort the change in the gate (again how to know that this is correct?) [14:55]
<electrofelix> is there a need to provide some kind of link between pipelines to say a change should be unique between the two? [14:56]
<jeblair> electrofelix: they don't use the same item object.  the 'merged: False' from the gate message isn't related to the zuul merger.  that means that the change was not merged into the upstream repo. [14:58]
<jeblair> electrofelix: aside from this case, is the gate pipeline generally able to merge pull-requests?  is there branch protection that could have prevented the merge?  is there anything in the logs about why the merge failed? [15:01]
<electrofelix> jeblair: no there is nothing to stop it, and the subsequent run in the gate after the recheck comment worked without anyone having done anything to the original repo [15:02]
<openstackgerrit> James E. Blair proposed openstack-infra/zuul feature/zuulv3: DNM: test wget  https://review.openstack.org/500426 [15:02]
<electrofelix> so the branches were not out of sync, and the merger run returned success each time [15:03]
<jeblair> electrofelix: can you paste all the log lines between "Reporting change 38,9a54b8475f906256d012e250753870ce25b2cc91" and "Reported change 38,9a54b8475f906256d012e250753870ce25b2cc91  ..." ? [15:04]
<jeblair> electrofelix: (ideally debug level if you have 'em) [15:05]
<electrofelix> jeblair: don't have debug level but I can put the chunk into a paste service [15:06]
<electrofelix> jeblair: http://paste.openstack.org/show/620349/ [15:09]
<electrofelix> we have two check pipelines, an old one called check-zing-github, which is obsolete and will be removed shortly and a newer one called check-github, just in case you're wondering what's going on [15:10]
<jeblair> electrofelix: do you have any log lines that match the regex "zuul.Github"? [15:16]
<electrofelix> jeblair: last one was "2017-08-08 15:11:58,502 ERROR zuul.GithubWebhookListener: Exception when handling event:", we've turned on debug this morning, so none around when this occurred [15:20]
<jeblair> electrofelix: hrm, i expected a warning log line from zuul.GithubConnection. [15:25]
<jeblair> electrofelix: well, turning on debug logging was my next suggestion; i think we'll need that to proceed further; so let me know if it happens again and we can comb through the logs then [15:26]
<electrofelix> before we even get there, if there is a concept of check & gate pipelines, it does seem like a recipe for surprise if a change in the gate can also simultaneously be added to the check queue through a comment [15:32]
<electrofelix> wondering if there should be something to prevent this case from occurring rather than worrying about debugging how it managed to think the merge failed [15:33]
<pabelanger> so, feature request. if any js wizards could also add finger:// URLs to zuulv3.o.o/status, it would make me so happy :D [15:45]
<pabelanger> I like the webUI, but my CPU and browser don't [15:47]
<openstackgerrit> Monty Taylor proposed openstack-infra/zuul-jobs master: Make validate-host read from site-variables  https://review.openstack.org/500592 [16:05]
<openstackgerrit> Monty Taylor proposed openstack-infra/zuul-jobs master: Make validate-host read from site-variables  https://review.openstack.org/500592 [16:06]
<openstackgerrit> James E. Blair proposed openstack-infra/zuul feature/zuulv3: DNM: test wget  https://review.openstack.org/500426 [16:11]
<fungi> pabelanger: maybe as an icon next to the websockets url? [16:24]
<pabelanger> finger emoji wfm :D [16:26]
<openstackgerrit> James E. Blair proposed openstack-infra/zuul feature/zuulv3: DNM: test wget  https://review.openstack.org/500426 [16:28]
<electrofelix> jeblair: I think I know what caused the behaviour we saw, the github project has required checks enabled for the check & gate queues [16:42]
<electrofelix> jeblair: in triggering a recheck, the status of the check was reset so zuul's request to merge would fail, so this looks like it's really just a problem of a poor error message [16:42]
<electrofelix> or a failure in the connection code to detect that the merge request was rejected [16:45]
<jeblair> electrofelix: ah, i thought you had said there was no github branch protection.  so yes, that makes sense. [16:45]
<jeblair> electrofelix: yeah, there's supposed to be a warning level log message if the merge fails [16:45]
<electrofelix> jeblair: sorry, meant there was nothing preventing the zuul user from merging, should have been more clear [16:45]
<electrofelix> jeblair: I think there is something missing with the version we're running, we might be missing a commit or two [16:46]
<electrofelix> jeblair: or because the log is only at debug for when that occurs [16:47]
<electrofelix> or rather on the branch protections, they were supposed to be set up to allow the zuul user to do merges, but the setting to require the status checks (to prevent users from forgetting and clicking the merge button themselves) was enabled [16:49]
<electrofelix> anyway, I know far now about zuul internals than I did yesterday [16:49]
<electrofelix> s/far/far more/ [16:49]
<electrofelix> jeblair: is there room for another failure message in zuul pipelines? it seems that zuul merger failure should be considered different to the failure to submit a Gerrit change/merge a PR [16:53]
<jeblair> electrofelix: yes, though first thing is to find out why there was no warning message from zuul.GithubConnection, since that would be the source of what we'd report back there. [16:54]
<electrofelix> jeblair: our tree is out of date, the code at https://github.com/openstack-infra/zuul/blob/feature/zuulv3/zuul/driver/github/githubreporter.py#L142-L144 is using debug level in our codebase [16:57]
<jeblair> ah ok [16:57]
*** jkilpatr has quit IRC [17:17]
*** electrofelix has quit IRC [17:42]
<openstackgerrit> Monty Taylor proposed openstack-infra/zuul-jobs master: Make validate-host read from site-variables  https://review.openstack.org/500592 [18:25]
<openstackgerrit> Monty Taylor proposed openstack-infra/zuul-jobs master: Rework upload-logs to enable running on localhost  https://review.openstack.org/500611 [18:25]
<openstackgerrit> Monty Taylor proposed openstack-infra/zuul-jobs master: Add zuul base roles to ease sharing base job content  https://review.openstack.org/500612 [18:25]
<openstackgerrit> Monty Taylor proposed openstack-infra/zuul-jobs master: Add zuul base roles to ease sharing base job content  https://review.openstack.org/500612 [18:27]
<openstackgerrit> Monty Taylor proposed openstack-infra/zuul-jobs master: Contract back to a single upload-logs tasks file  https://review.openstack.org/500618 [18:27]
*** kmalloc_ has joined #zuul [18:30]
*** eventingmonkey_ has joined #zuul [18:36]
*** leifmadsen_ has joined #zuul [18:36]
*** eventingmonkey has quit IRC [18:37]
*** leifmadsen has quit IRC [18:37]
*** kmalloc has quit IRC [18:37]
*** kmalloc_ is now known as kmalloc [18:37]
<openstackgerrit> Paul Belanger proposed openstack-infra/zuul feature/zuulv3: Reduce debug output for repo  https://review.openstack.org/500622 [18:49]
<pabelanger> jeblair: mordred: ^ reduce some logging to prevent secrets from showing in debug logs [18:49]
<pabelanger> heh, I should test that first [18:51]
*** hasharAway is now known as hashar [18:51]
<openstackgerrit> James E. Blair proposed openstack-infra/zuul feature/zuulv3: DNM: test wget  https://review.openstack.org/500426 [18:57]
<openstackgerrit> Paul Belanger proposed openstack-infra/zuul feature/zuulv3: Reduce debug output for repo  https://review.openstack.org/500622 [18:57]
<mordred> jeblair: WHY DOES wget WORK IN YOUR TEST BUT NOT IN DEVSTACK?????? [19:15]
*** pabelanger has quit IRC [19:17]
*** pabelanger has joined #zuul [19:17]
<jeblair> mordred: hrm; i may still be missing weird things that devstack does with output or file handles, etc.... i was trying to start simple and add them one by one [19:56]
<jeblair> i'm turning on keep to find out why the post playbook doesn't copy logs when the devstack job times out [20:06]
<pabelanger> ack [20:08]
<pabelanger> jeblair: mordred: https://review.openstack.org/500622/ is green now. Would be great to land and reset executor for cleaner debug logs [20:09]
<jeblair> lgtm [20:10]
<pabelanger> right now, logs are not pastebin friendly [20:10]
*** hashar has quit IRC [20:44]
<mordred> pabelanger: +3 [21:03]
<mordred> jeblair: there's a weird thing going on with zuulv3 and shade that I thought might be worth pointing you at, in case it's a thing you're worried about [21:06]
<mordred> jeblair: if you look at https://review.openstack.org/#/c/499357 [21:06]
<mordred> jeblair: you'll see that zuul has voted -2 on the change [21:07]
<mordred> even though there are no jobs for shade in the gate pipeline [21:07]
<mordred> oh - actually - https://review.openstack.org/#/c/499357/ is the base change, and it just got the same thing [21:08]
<mordred> jeblair: I'm going to go look through scheduler logs to see if I can figure out why zuulv3 thinks it should leave -2 there [21:08]
<openstackgerrit> Merged openstack-infra/zuul feature/zuulv3: Reduce debug output for repo  https://review.openstack.org/500622 [21:12]
<mordred> jeblair: 2017-09-04 19:28:14,600 INFO zuul.DependentPipelineManager: Resetting builds for change <Change 0x7fb05ff854e0 499357,7> because the item ahead, <QueueItem 0x7fb066205d68 for <Change 0x7fb06446bda0 499345,4> in gate>, failed to merge [21:20]
<mordred> jeblair: "499345" was the previous top of the queue, and zuul v2 actually merged it - I'm wondering if there's just a race-condition between the 2 zuuls, with v3 trying to do a merge check or something, and v2 merges it so v3 attempting to merge the patch on top of remote master errors [21:22]
<mordred> jeblair: other than not being sure why it would be considering the change during gate queue processing [21:22]
<mordred> jeblair: so - it's the gate queue processing that makes me wonder if it's a real bug we should care about - if it's just v2 and v3 fighting, I'm not worried about that [21:23]
<fungi> mordred: any guess why 499843 is still resulting in that "extra keys not allowed" validation error from zuulv3 even after its parent merged? [21:59]
<openstackgerrit> Jamie Lennox proposed openstack-infra/zuul feature/zuulv3: Print SIGTERM logging to debug  https://review.openstack.org/500476 [22:05]
<Shrews> lots of working for a holiday [22:34]
* rcarrillocruz waves [22:36]
<rcarrillocruz> so, looking at zuulv3 github docs [22:36]
<rcarrillocruz> it says there are two options, either a github app or a webhook [22:36]
<rcarrillocruz> are there docs or a blogpost, anything, about setting the two up in the zuul context [22:36]
<rcarrillocruz> perms, whatnot [22:36]
* rcarrillocruz throws a squirrel to mordred [22:37]
<pabelanger> rcarrillocruz: see https://docs.openstack.org/infra/zuul/feature/zuulv3/admin/drivers/github.html [22:45]
<pabelanger> we're using github app right now [22:45]
<pabelanger> https://github.com/apps/openstack-zuul [22:45]
<rcarrillocruz> yeah, but that depicts from zuul side, i wonder if there are docs or samples or a blogpost about 'hey, i set up a webhook on this repo so it could consume events on my zuul' kind of thing [22:46]
<rcarrillocruz> alternatively, a github app that sends things to a given zuul, not openstack zuul [22:47]
<rcarrillocruz> i assume the github app is a hardcoded convenient thing you install in a repo [22:47]
<rcarrillocruz> that sets it up to send events from your repo to openstack zuul [22:47]
<pabelanger> I still think we are working on that part, but mordred did set up openstack-infra. So best person to ask [22:47]
<rcarrillocruz> i wonder how to create 'ricky dummy zuul gh app' [22:47]
<jeblair> rcarrillocruz: if you do it, please write it down and add to the docs [22:53]
<rcarrillocruz> if i get pointers on how openstack zuul was done, i [22:54]
<rcarrillocruz> will contribute that back for sure [22:54]
<jeblair> mordred, dmsimard: this file has some weird encoding: http://git.openstack.org/cgit/openstack-infra/zuul-jobs/tree/roles/emit-ara-html/tasks/main.yaml [23:05]
<jeblair> line 18, the space before bool is actually "\xc2\xa0" [23:05]
<jeblair> dmsimard: i hereby revoke your emacs derogation privileges.  :) [23:07]
<openstackgerrit> James E. Blair proposed openstack-infra/zuul-jobs master: Ignore errors from ara generate  https://review.openstack.org/500645 [23:09]
<jeblair> that's why the devstack timeout isn't uploading logs.  we'll probably want to snarf the ara db from that to find out *why* ara generate isn't working as well. [23:10]
<jeblair> i turned off keep [23:12]
<jeblair> dmsimard: here's the ara sqlite file that caused ara generate to fail: http://files.openstack.org/user/corvus/ara-failed-ansible.sqlite [23:15]
<jamielennox> rcarrillocruz: a github app is an instance of a thing, it's basically a wrapper around a webhook with the permissions inbuilt [23:36]
<jamielennox> you can do the same thing without the app, you just need to point the webhook at the correct zuul, but then you also need a user for the zuul to post as [23:37]
<jamielennox> so webhook_key (or whatever) is always common, but you need a github oauth token if you don't have an app_id [23:38]
<dmsimard> jeblair: that's ok I don't use emacs [23:57]
<dmsimard> Also, yes, it's entirely possible there is a junk whitespace in there [23:58]
<dmsimard> It's something my keyboard layout sometimes produces and it's a real PITA [23:58]
<dmsimard> So much that I have a shell alias just to find them [23:58]
