Wednesday, 2017-05-24

openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Add support for requiring github pr head status  https://review.openstack.org/44939000:01
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Adds github triggering from status updates  https://review.openstack.org/45384400:01
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Implement pipeline requirement on github reviews  https://review.openstack.org/45384500:01
jlkjeblair: mordred: ^^ Three changes on top of the refactor. You've already seen one of them, but now it's rebased. More to come, but I wanted to get these out there today.00:02
mordredjlk: \o/00:02
mordredjlk: tis open in my browser00:03
* jlk shakes fist at tox-py3500:15
*** adam_g has quit IRC02:00
*** adam_g has joined #zuul02:04
*** adam_g has quit IRC02:25
*** adam_g has joined #zuul02:26
*** adam_g has quit IRC02:33
*** adam_g has joined #zuul02:34
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Ensure PRs arent rejected for stale negative reviews  https://review.openstack.org/46070003:07
jlkThat one was a slog :/03:07
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Represent github change ID in status page by PR number  https://review.openstack.org/46071603:09
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Comment on PRs if a remote call to merge a change failed  https://review.openstack.org/46076203:10
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Include exc_info in reporter failure  https://review.openstack.org/46076503:11
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Add cachecontrol to requests to github  https://review.openstack.org/46158704:48
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Include exc_info in reporter failure  https://review.openstack.org/46076504:48
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Adds github triggering from status updates  https://review.openstack.org/45384404:48
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Implement github trigger requirement status  https://review.openstack.org/46338604:48
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Implement pipeline requirement on github reviews  https://review.openstack.org/45384504:48
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Ensure PRs arent rejected for stale negative reviews  https://review.openstack.org/46070004:48
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Represent github change ID in status page by PR number  https://review.openstack.org/46071604:48
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Comment on PRs if a remote call to merge a change failed  https://review.openstack.org/46076204:48
jlkThat's enough for tonight.04:49
*** isaacb has joined #zuul05:33
*** isaacb has quit IRC06:05
*** adam_g has quit IRC06:52
*** adam_g has joined #zuul06:53
*** DangerousDaren has joined #zuul06:56
*** jroll has quit IRC07:12
*** adam_g has quit IRC07:33
*** adam_g has joined #zuul07:34
*** isaacb has joined #zuul08:10
*** isaacb has quit IRC08:27
*** Cibo_ has quit IRC09:00
*** hashar has joined #zuul09:47
*** jroll has joined #zuul10:25
*** jkilpatr has quit IRC10:38
*** adam_g has quit IRC10:47
*** adam_g has joined #zuul10:49
*** jkilpatr has joined #zuul10:56
*** hashar has quit IRC10:59
*** adam_g has quit IRC11:41
*** hashar has joined #zuul12:01
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Display command/shell results in a banner  https://review.openstack.org/46760312:05
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Add support for defining groups in nodesets  https://review.openstack.org/46761112:47
*** dkranz has joined #zuul12:51
*** Cibo_ has joined #zuul12:51
*** Cibo_ has quit IRC13:07
*** Cibo_ has joined #zuul13:25
SpamapSjeblair: in my SSH agent patch I'm getting leaked git.Repo's in py27 btw13:25
SpamapS2017-05-24 06:24:13,582 zuul.test                        DEBUG    Leaked git repo object: <git.Repo "/home/clint/tmp/tmpFUnTtu/zuul-test/executor-git/github.com/org/project/.git">13:25
SpamapSit adds a small delay as it waits for the ssh-agent to start.. like, 2-3ms delay.. but maybe that's enough to change the race that causes leaky git.Repo's13:27
* SpamapS will ponder whilst shuttling children13:30
*** Cibo_ has quit IRC13:35
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Add support for defining groups in nodesets  https://review.openstack.org/46761114:18
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Write inventory as yaml not ini  https://review.openstack.org/46763414:18
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Put variables into the inventory  https://review.openstack.org/46763514:18
jeblairSpamapS: is that with a successful test or a failing one?  (leaked git repos on failing tests is fairly normal as parts of zuul are still running)14:21
openstackgerritPaul Belanger proposed openstack-infra/nodepool feature/zuulv3: Support both python2 and python3 virtualenv  https://review.openstack.org/46764014:37
*** pabelanger has quit IRC14:52
*** pabelanger has joined #zuul14:52
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Add support for defining groups in nodesets  https://review.openstack.org/46761115:30
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Put variables into the inventory  https://review.openstack.org/46763515:30
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Write inventory as yaml not ini  https://review.openstack.org/46763415:30
*** DangerousDaren has quit IRC15:34
openstackgerritMonty Taylor proposed openstack-infra/nodepool master: Don't spam logs with tracebacks on missing image  https://review.openstack.org/46766415:55
openstackgerritMonty Taylor proposed openstack-infra/nodepool master: Don't spam logs with tracebacks on missing image  https://review.openstack.org/46766416:01
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Add support for defining groups in nodesets  https://review.openstack.org/46761116:05
openstackgerritPaul Belanger proposed openstack-infra/nodepool feature/zuulv3: Support both python2 and python3 virtualenv  https://review.openstack.org/46764016:26
*** hashar has quit IRC16:26
openstackgerritMonty Taylor proposed openstack-infra/nodepool master: Don't log tracebacks for server-side missing image  https://review.openstack.org/46766416:28
SpamapSjeblair: oh failing, ok16:31
pabelangerSpamapS: when you have time: https://review.openstack.org/#/q/status:open+topic:py3-nodepool :)16:42
SpamapSpabelanger: ACK, I'll hit that soon.16:45
*** rcarrillocruz has quit IRC17:12
openstackgerritMonty Taylor proposed openstack-infra/nodepool master: Don't log tracebacks for server-side missing image  https://review.openstack.org/46766417:25
*** Cibo_ has joined #zuul17:46
*** rcarrillocruz has joined #zuul17:47
jeblairwoohoo!  i have the first zuul-cloner test applied to the executor working locally17:48
jeblairi should be able to clean that up and push it this afternoon17:49
openstackgerritMerged openstack-infra/nodepool feature/zuulv3: Python3: RuntimeError: dictionary changed size during iteration  https://review.openstack.org/46604918:01
SpamapSpabelanger: reviewed18:03
pabelangerthanks, will look shortly18:04
openstackgerritPaul Belanger proposed openstack-infra/nodepool feature/zuulv3: Support both python2 and python3 virtualenv  https://review.openstack.org/46764018:17
openstackgerritPaul Belanger proposed openstack-infra/nodepool feature/zuulv3: Wrap map() in list() for python3  https://review.openstack.org/46606918:17
openstackgerritPaul Belanger proposed openstack-infra/nodepool feature/zuulv3: Python3: encode / decode data as utf8  https://review.openstack.org/46606518:17
pabelangerSpamapS: thanks, I missed that time18:18
*** rcarrillocruz has quit IRC18:27
SpamapSpabelanger: np, will re-review soon18:34
*** rcarrillocruz has joined #zuul18:36
openstackgerritMerged openstack-infra/nodepool feature/zuulv3: Python3: encode / decode data as utf8  https://review.openstack.org/46606519:06
* SpamapS is going in circles on this ssh agent stuff :-P19:10
SpamapSsomehow starting an SSH agent breaks everything :-P19:10
*** hashar has joined #zuul19:16
*** hashar has quit IRC19:21
mordredSpamapS: you know you're going to find it and it's going to be a comma19:24
openstackgerritMerged openstack-infra/nodepool feature/zuulv3: Wrap map() in list() for python3  https://review.openstack.org/46606919:24
*** hashar has joined #zuul19:25
openstackgerritMerged openstack-infra/nodepool feature/zuulv3: Support both python2 and python3 virtualenv  https://review.openstack.org/46764019:26
*** openstackgerrit has quit IRC19:48
SpamapSmordred: >:|20:00
SpamapSmy bet is on ()20:01
mordredSpamapS: or some spaces20:01
SpamapSthe annoying thing is there doesn't seem to be any error. Things just stop working right. :-P20:05
SpamapSand when tests are run one-off, they don't fail20:06
SpamapS:-P20:06
SpamapSoh looks like I did find one bug that was making alarm clock fails look weird.. but now it's just that all the tests are timing out :-P20:16
SpamapSdeadlocking in threading stuff20:24
SpamapSfutex(0x2a3fe60, FUTEX_WAIT_BITSET_PRIVATE|FUTEX_CLOCK_REALTIME, 0, NULL, ffffffff^Cstrace: Process 26671 detached20:24
*** rcarrillocruz has quit IRC20:27
SpamapSaha20:29
SpamapSI think it's the __del__ method20:29
*** rcarrillocruz has joined #zuul20:35
SpamapSoh also I think we have some tests that call zuul.cmd code that does basicConfig20:39
*** openstackgerrit has joined #zuul20:44
openstackgerritClint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Add SSH Agent Primitives and usage  https://review.openstack.org/46271220:44
openstackgerritClint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Add support for bwrap  https://review.openstack.org/45385120:44
*** Cibo_ has quit IRC20:47
openstackgerritClint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Add SSH Agent Primitives and usage  https://review.openstack.org/46271220:57
openstackgerritClint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Add support for bwrap  https://review.openstack.org/45385120:57
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Add override-branch property to job repos  https://review.openstack.org/46737521:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Wait for merger to complete before item is ready  https://review.openstack.org/46305421:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Add default-branch property to projects  https://review.openstack.org/46733421:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Store initial repo state in the merger  https://review.openstack.org/46117621:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Use previously stored repo state on executor  https://review.openstack.org/46117721:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Rename 'repos' job attribute to 'required-projects'  https://review.openstack.org/46737621:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Add override-branch property to jobs  https://review.openstack.org/46777521:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Return resulting commits from merger  https://review.openstack.org/46777621:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Use the executor cached repos more often  https://review.openstack.org/46777721:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Check out the appropriate branch in executor  https://review.openstack.org/46777821:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Enable test_one_branch cloner test  https://review.openstack.org/46777921:25
jeblairmy eyes are bleeding from the 'git commit -p' i just did21:25
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Disable leaked repo check  https://review.openstack.org/46681021:29
jeblairSpamapS, Shrews, clarkb, jlk: ^ i think we're at that point.21:29
Shrews:(21:30
openstackgerritClark Boylan proposed openstack-infra/nodepool feature/zuulv3: Cleanup failed upload records  https://review.openstack.org/46778321:39
clarkbthere is the patch I said I would write earler21:39
clarkbif it looks good in v3 branch I will likely port to master21:39
SpamapSjeblair: oh my!21:45
jeblairSpamapS: nice george takei impression21:46
SpamapSjeblair: impulse power to 50% captain.21:48
SpamapSso .. there's an evil thing happening because of the ssh-agent21:49
SpamapSwhen tests timeout hard and die.. the ssh-agent isn't getting killed21:50
SpamapSI have about 20 errant ssh-agents sitting around on my laptop :-P21:50
SpamapS(and they're foreground)21:50
mordredSpamapS: ewwww21:51
SpamapSI think process grouping is the answer21:51
SpamapSIndeed, they're all part of the process group of the desktop session manager which is upstart in Ubuntu 16.0421:53
clarkbsystemd21:54
SpamapSyeah that's the answer right?21:54
SpamapSjust systemd it up21:54
SpamapSAll systemd for justice21:54
* SpamapS shakes deadlocked python process violently22:02
SpamapSTELL ME YOUR SECRETS22:02
*** jkilpatr has quit IRC22:05
jeblairSpamapS: the executor sets up each ansible run in its own process group; i'm surprised ssh-agent isn't automatically part of that22:12
*** hashar has quit IRC22:13
jeblairSpamapS: oh, right, because we want it to span multiple playbooks22:13
SpamapSjeblair: right22:13
SpamapSwe're up one level from that22:13
jeblairSpamapS: so yeah, you can probably do something similar22:14
SpamapSjeblair: indeed that's what I think I'll chase22:14
jeblairSpamapS: the os.setsid() call is the magic i believe22:14
jeblairSpamapS: then os.getpgid()/os.killpg() on the kill side22:14
jeblair(both of those are used in the executor for ansible)22:15
SpamapSjeblair: thanks, was looking for os.setpgid22:15
SpamapSjeblair: I'm not sure that will work the same. By putting it in a different pgrp I think it will get adopted by init (though it will get reaped if it exits). Testing now.22:19
SpamapSindeed..22:21
SpamapSjeblair: what I'm suggesting is that when running tests we need the test runner processes to detach from the controlling terminal, and then ssh-agent to be in the same process group as the executor, so if executor dies, they get HUP'd22:22
SpamapSwell not detach22:23
SpamapSthey need to be in their own pgrp22:23
SpamapSbut then pwn the terminal, except that's going to be.. ugly22:23
* SpamapS isn't really sure what to do now22:23
jeblairSpamapS: oh, it seems that you *don't* want them in their own group then22:23
SpamapSthis may only be a concern while testing22:24
SpamapSand there may be better ways to figure this out.22:24
jeblairSpamapS: i wonder what ssh-agent is doing so that it doesn't get killed when the test process dies22:24
SpamapSit doesn't ignore HUP or anything like that22:25
SpamapSjust tested that22:25
jeblairSpamapS: i'll also throw this out: it may not be worth engineering clean shutdown for hard test timeouts; it may be better to fix whatever is causing the hard timeout so you're back to zuul shutting down the test cleanly on its own22:25
*** jkilpatr has joined #zuul22:26
SpamapSjeblair: I agree.22:26
SpamapShttp://paste.ubuntu.com/24648427/ <-- demonstrates the problem22:27
SpamapSthe ssh-agent will live forever22:27
SpamapSor maybe, as long as the desktop session manager lives ;)22:27
SpamapSjeblair: this may just be a minor yak shave since I'm losing my mind trying to figure out why we're hard timing out just because I started a process.22:27
* SpamapS tries with --analyze-isolation22:29
* SpamapS has now determined it wasn't his __del__ method.. I think... probably.22:33
*** adam_g has joined #zuul22:39
* SpamapS is now manually killing off subprocesses when they get deadlocked in hopes that testr will consider that a fail and find the bad interaction23:09
SpamapSdamnit23:09
SpamapStests.unit.test_scheduler.TestScheduler.test_failed_change_at_head_with_queue  unknown - no conflicts23:09
mordredSpamapS: I would like to say that you have an excellent skill at uncovering the fascinating race conditions and deadlocks23:23
SpamapSmordred: it's my super-anti-power23:26
SpamapSI'm at the phase of commenting all the code out and uncommenting one logical block at a time. :-P23:31
SpamapSwhich honestly doesn't help.. it pretty much only breaks if I run ssh-agent :-P23:32

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!