Thursday, 2018-06-14

openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add /{tenant}/job/{job_name} route  https://review.openstack.org/55097800:07
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add /{tenant}/projects and /{tenant}/project/{project} routes  https://review.openstack.org/55097900:08
*** rlandy is now known as rlandy|afk00:13
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add initial GraphQL controller  https://review.openstack.org/57462500:57
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: sql: use a declarative base model  https://review.openstack.org/57527500:57
*** rlandy|afk is now known as rlandy01:10
*** rlandy has quit IRC01:35
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: sql: use a declarative base model  https://review.openstack.org/57527503:26
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add initial GraphQL controller  https://review.openstack.org/57462503:26
tobiashcorvus, mordred: I just found a simpler way solving this issue without forking the linear strategy04:15
tobiashcorvus, mordred: using the task uuid itself works without forking the linear plugin (just tested that locally)04:18
tobiashcorvus, mordred: do we really need that host part? Right now the callback is called exactly once for a multi-host task04:20
tobiashcorvus, mordred: so that's already now a problem if we use delegate04:21
tobiashcorvus, mordred: so thinking about this this was always wrong but I think that should be fixed04:22
*** threestrands has quit IRC04:47
*** pcaruana has quit IRC05:09
*** leifmadsen has quit IRC05:27
*** leifmadsen has joined #zuul05:28
*** pcaruana has joined #zuul06:17
*** AJaeger has quit IRC06:24
*** threestrands has joined #zuul06:27
*** hashar has joined #zuul06:33
*** AJaeger has joined #zuul06:57
*** Rohaan has joined #zuul06:57
*** gtema has joined #zuul07:01
*** jpena|off is now known as jpena07:30
*** nguyenhai has quit IRC07:37
*** nguyenhai has joined #zuul07:37
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Move zuul_log_id injection to command action plugin  https://review.openstack.org/57535107:38
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Fix log streaming for delegated hosts  https://review.openstack.org/57535207:38
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Revert "Temporarily override Ansible linear strategy"  https://review.openstack.org/57535307:38
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Remove extra argument when logging logger timeout  https://review.openstack.org/57535407:38
*** electrofelix has joined #zuul08:33
*** ianychoi has quit IRC08:39
*** flawi has joined #zuul09:35
flawihola, is anyone aware of any effort being put into supporting bitbucket hosted repositories in zuul? just wondering if there's something we could help with, or if we should just assume that we'll be writing that integration from scratch if we plan on using zuul09:42
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Move zuul_log_id injection to command action plugin  https://review.openstack.org/57535109:44
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Fix log streaming for delegated hosts  https://review.openstack.org/57535209:44
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Revert "Temporarily override Ansible linear strategy"  https://review.openstack.org/57535309:44
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Remove extra argument when logging logger timeout  https://review.openstack.org/57535409:44
*** threestrands has quit IRC09:54
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Move zuul_log_id injection to command action plugin  https://review.openstack.org/57535110:28
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Fix log streaming for delegated hosts  https://review.openstack.org/57535210:28
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Revert "Temporarily override Ansible linear strategy"  https://review.openstack.org/57535310:28
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Remove extra argument when logging logger timeout  https://review.openstack.org/57535410:28
tobiashflawi: I don't know of any but if you're interested you could start with the github driver as template as I think github works at least a bit similar like bitbucket10:30
flawiyeah, should be entirely doable, just making sure we don't duplicate efforts there if someone is already working on it10:31
tobiashflawi: I think I heard the same question with gitlab but at least I didn't notice any serious effort right now10:31
flawiall right, thanks. I'll just mark it down as something we'll need to do using the GitHub event handling as a starting point, the workflows should be similar10:35
tobiashcorvus, mordred: that stack above is green now ^10:50
tobiashcorvus, mordred: that implements the task uuid idea and doesn't need to fork the linear plugin10:51
tobiashcorvus, mordred: further I have the strong impression that delegated logging of multiple hosts was broken since the beginning whithout us noticing it so that also includes a fix for that10:52
*** jpena is now known as jpena|lunch10:57
*** elyezer has joined #zuul11:25
*** rlandy has joined #zuul12:01
*** jpena|lunch is now known as jpena12:03
mordredtobiash: oh wow - looking forward to reading!12:15
pabelangerare test_job_aborted and test_abandoned_not_timer know flapping tests? http://logs.openstack.org/73/575173/6/check/tox-py36/d0df7d8/testr_results.html.gz12:17
mordredtobiash: it seems like you could remove the command.py action plugin in the other action dir now12:26
tobiashmordred: Do I have both now?12:27
* tobiash is afk atm12:28
mordredtobiash: yah. but no big deal - we can take care of it in a followup12:29
mordredtobiash: nevermind. I had a stale copy of the tree locally12:30
tobiash:)12:32
*** myoung|off is now known as myoung12:36
*** Rohaan has quit IRC12:39
pabelangertristanC: replied to ML thread12:41
*** ianychoi has joined #zuul12:46
tristanCpabelanger: awesome, thanks!12:48
openstackgerritMerged openstack-infra/zuul master: Allow zuul_return in untrusted jobs  https://review.openstack.org/57517312:51
pabelangermordred: tobiash: clarkb: corvus: It seems tristanC has already created an builtset artifact role in SF.io, but believe would be great to land in zuul-jobs, could trouble you for reviews: https://review.openstack.org/530679/12:52
mordredpabelanger: lgtm12:53
*** elyezer has quit IRC12:56
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Remove extra argument when logging logger timeout  https://review.openstack.org/57535413:07
tobiashmordred: fixed commit message ^13:08
*** elyezer has joined #zuul14:15
openstackgerritMerged openstack-infra/zuul-jobs master: Collect the coverage report for npm test jobs  https://review.openstack.org/57026014:54
openstackgerritPaul Belanger proposed openstack-infra/zuul-jobs master: Add buildset-artifacts-location  https://review.openstack.org/53067915:14
pabelangermordred: tristanC: corvus: ^fixes ansible-lint failure gate pipeline found15:15
corvusmordred, clarkb: did we want to have that call about logging?15:20
mordredcorvus: I'm not sure we need to anymore with the new stack from tobiash ?15:21
corvusmordred: oh, do you want to shelve the unix socket idea?15:21
mordredcorvus: I don't want to shelve it completely - but I believe the new patches mean we don't necessarily need to jump straight on it (although we certainly can if we want to)15:22
*** pcaruana has quit IRC15:24
corvusmordred: okay, i guess we can leave it for when things settle down and you (or someone else) want(s) to pick it up again?  i'll go review tobiash's patches now15:26
corvusmordred: your comment on 575351 -- is that the one you decided was erroneous in irc?15:27
mordredcorvus: yes15:28
tobiashthere was no command.py action override before that change15:28
corvusclarkb: 575351 and children have +2s from mordred and I if you want to review / +W them15:34
clarkbok I'm trying ti get my windoe manager back then will look15:35
mordredclarkb: tl;dr - I was wrong, you were right15:36
clarkbok I switched back to desktop and will just not update for a bit here15:39
*** hashar is now known as hasharAway15:40
*** sshnaidm has quit IRC15:55
*** myoung is now known as myoung|lunch16:04
clarkbmordred: corvus tobiash in https://review.openstack.org/#/c/575352/3/zuul/ansible/callback/zuul_stream.py we set host to the delegate to value when delegating but then use the second half of the new tuple return (inventory_hostname) to find the log_id. Does that all work out still with delegated tasks?16:07
clarkbon the action module command side it is still going to use the inventory hostname of the host in the task not the one it was delegated to?16:07
corvusclarkb: can you give a line number for "set host to the delegate to value" ?16:10
clarkb57216:10
tobiashclarkb: host is where we need to connect to for log streaming16:11
tobiashclarkb: there is also a test case which is exercising this16:12
*** elyezer has quit IRC16:12
clarkbtobiash: ya, I think my question boils down to is the inventory hostname the delegated to value in the command module side16:12
clarkbsince we need the two to match up. Sounds like it isn't if you delegate to the inventory hostname is still whatever node that is "running" on16:13
tobiashclarkb: in the name of the logfile we actually want the inventory hostname16:13
tobiashusing the delegated hostname for the logfile would be useless as it's not unique16:13
tobiashclarkb: so we connect to the delegate but request the logfile with the inventory hostname16:14
corvusclarkb: in 575351 we have host = paths._sanitize_filename(task_vars.get('inventory_hostname'))16:14
corvusso yes, the remote side is always going to have "inventory_hostname" in its log path16:14
clarkboh right, thanks16:15
clarkbok I've +2'd things but not approved. My allergies are bad enough that I think I may try to take it easy to day16:15
clarkb(so likely won't be around for executor restarts)16:15
corvusi'll approve and do a full restart today so that hopefully we can release today or tomorrow16:16
ssbarneai receive lots of emails from storyboard about gerrit comments and I am unable to find why I get those as I am not CC neither on the CR nor on the story.16:16
ssbarneaexample: https://storyboard.openstack.org/#!/story/200252816:16
corvusssbarnea: did you subscribe to the project?16:17
ssbarneacorvus: yep, this was my "mistake" to press the star on zuul.16:17
ssbarneathanks, resolved it. too much noise16:18
corvusssbarnea: https://storyboard.openstack.org/#!/profile/preferences  and https://storyboard.openstack.org/#!/project/67916:18
*** sshnaidm has joined #zuul16:40
pabelangercorvus: do you mind refreshing your +3 on https://review.openstack.org/530679/ new patch addressed ansible-lint failure17:09
*** myoung|lunch is now known as myoung17:12
*** jpena is now known as jpena|off17:14
corvuspabelanger: where's the failure?17:17
pabelangercorvus: it never got reported back to gerrit as I pushed up new patch, but http://logs.openstack.org/79/530679/8/gate/openstack-zuul-jobs-linters/7444209/job-output.txt.gz is the log17:19
corvuspabelanger: gotcha, i +3d17:19
pabelangerthanks17:19
Shrewspabelanger: corvus: i believe i found a nodepool bug that may cause the node leak. working up a test case17:24
pabelangerShrews: yay17:26
pabelangerspeaking of nodepool, should region-name be fetched from clouds.yaml over nodepool.yaml? Have a case in rdo nodepool, we don't set region-name, but nodepool.region in inventory ends up null: https://logs.rdoproject.org/01/14201/2/check/legacy-DLRN-rpmbuild/8435496/zuul-info/inventory.yaml17:30
pabelangeractually, I think it does work like that. seems we don't have it set in our clouds.yaml right now17:32
pabelangertesting17:32
corvuslooks like 575351 hit a timeout on test_plugins.  i'll look at that locally but re-enqueue the stack (which otherwise passed all tests)17:38
corvuswow, test_plugins takes 31 seconds on my machine.  we might want to bump that timeout to 12017:43
*** gtema has quit IRC17:43
*** pcaruana has joined #zuul17:43
corvusoh, it already is... :/17:44
corvuslet's just keep an eye out on it17:44
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool master: Fix paused handler exception handling  https://review.openstack.org/57551517:57
ShrewsI actually have a fix for ^^ but waiting to show that failure before I add it in PS218:00
*** elyezer has joined #zuul18:00
*** pcaruana has quit IRC18:01
openstackgerritMerged openstack-infra/zuul-jobs master: Add buildset-artifacts-location  https://review.openstack.org/53067918:06
*** pcaruana has joined #zuul18:06
Shrewspabelanger: corvus: looks to me like zuul has most of those locks, so i suspect the executor restarts18:06
Shrews6 of them are part of 2 different requests (3 nodes each request)18:08
Shrewsat least 2 of the other 4 are likely due to the bug i found. going to assume the other 2 fall into one of those 2 categories18:13
pabelangerokay, region_name doesn't seem to get pulled from clouds.yaml, if missing from nodepool.yaml18:15
*** electrofelix has quit IRC18:19
clarkbpabelanger: for the single region case?18:23
pabelangerclarkb: yah, in this case we only have regionOne for the cloud, and configs opted not to set the region-name value, but nodepool.region in ansible inventory is null18:24
pabelangerso, proposing patches to set it in nodepool.yaml now, but feel we could also load it from clouds.yaml if set18:25
pabelangeror just remove nodepool.yaml region-name all together and fall back to clouds.yaml?18:25
*** sshnaidm is now known as sshnaidm|off18:26
clarkbyou can't remove it from nodepool.yaml because clouds can have more than one region18:29
clarkbthe setting there is to select the region from the valid list of regions in clouds.yaml. osc does the same thing18:30
clarkbwe could do like osc and error if more than one region is in the clouds.yaml18:30
clarkbnot sure if that is more or less friendly than erroring if you don't set it18:30
pabelangeryah, that is true18:32
pabelangerokay, we've proposed a patch to just set region-name in nodepool.yaml18:33
clarkber error if more than one region is in clouds.yaml and you didn't set on in nodepool config18:33
clarkbits the tradeoff between telling everyone to just set it and that being simple beacuse its one rule vs being smart and having conditionals in your rules but being easier for the simple case18:34
openstackgerritMerged openstack-infra/zuul master: Move zuul_log_id injection to command action plugin  https://review.openstack.org/57535118:36
openstackgerritMerged openstack-infra/zuul master: Fix log streaming for delegated hosts  https://review.openstack.org/57535218:36
openstackgerritMerged openstack-infra/zuul master: Revert "Temporarily override Ansible linear strategy"  https://review.openstack.org/57535318:36
openstackgerritMerged openstack-infra/zuul master: Remove extra argument when logging logger timeout  https://review.openstack.org/57535418:36
*** GonZo2000_ has quit IRC18:52
*** dkranz has quit IRC19:20
*** dkranz has joined #zuul19:21
pabelangerquestion, could min-ready in nodepool be scoped to provider too vs just label? I can see a case where a nodepool provider might not actually want to idle ready nodes before a job actually needs it, due to cost. However another provider it might be okay20:55
pabelangerI guess that also gets into the idea of provider precedences20:58
*** hasharAway has quit IRC21:37
*** myoung is now known as myoung|off21:45
corvuspabelanger: my guess is it'd be best to try to solve that with a general preference mechanism (because the same reason you wouldn't want one to idle is probably the same reason you wouldn't want to use it in general)22:22
fungiahh, like having some specified "burst providers" who only provide nodes as a last resort when you're otherwise out of capacity22:38
corvusyep22:39
fungii could see that if, say, i had my own private cloud but also an account with a public provider. i'd want to utilize my sunk cost resources first and only pay to burst into the public cloud when i'm out of local capacity22:39
corvusenough people have described that use case that i think if we were to focus only on the min-ready nodes that we still wouldn't have solved the actual problem.  min-ready nodes are almost exactly the same as any other nodes in that respect (other than they might be expected to run a bit longer)22:40
corvusbut if we solve it generally, we've solved it for min-ready too.22:40
fungicompletely agree22:41
clarkbone thing that complicates it is distributed launchers like the ones we have22:42
*** elyezer has quit IRC22:42
clarkb(so may need provider state in zk)22:42
corvusyes, it may require more communication about their state.  we can totally add that, i just avoided specifying it in the initial algorithm so we could have something simple and (hopefully) robust to start with.22:43
corvuswe knew almost nothing about zk at the time :)22:43
clarkbya I think it was the right decision :)22:43
fungiprudent22:43
clarkbjust point it out for anyone looking to implement it22:43
pabelangercorvus: well, we do want to use the resources, just maybe 12hrs of the day. The rest of the time, we'd might want the provider to be drop back to 0 to avoid billing. But, another cloud maybe donated in this context, which we are happy to keep min-ready nodes22:53
pabelangerbut yes, would be happy to discuss a use case more, maybe ptg22:54
*** threestrands has joined #zuul22:56
*** threestrands has quit IRC22:56
*** threestrands has joined #zuul22:56
*** threestrands has quit IRC22:57
*** threestrands has joined #zuul22:57
*** threestrands has quit IRC22:57
*** threestrands has joined #zuul22:57
*** GonZo2000_ has joined #zuul23:28
openstackgerritJames E. Blair proposed openstack-infra/zuul master: Add supercedent pipeline manager  https://review.openstack.org/57193223:32
corvusclarkb, fungi, tobiash: ^ comments (mostly) addressed23:32
*** GonZo2000_ has quit IRC23:47
*** GonZo2000_ has joined #zuul23:55
*** GonZo2000_ has quit IRC23:56
*** GonZo2000_ has joined #zuul23:56

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!