Tuesday, 2017-12-05

*** jesusaur has quit IRC00:04
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Remove implied playbook extensions  https://review.openstack.org/52523901:02
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Remove file extension when building SimpleLayout  https://review.openstack.org/52535601:02
*** jesusaur has joined #zuul01:23
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Remove implied playbook extensions  https://review.openstack.org/52523901:24
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Update playbook paths with extension  https://review.openstack.org/52535901:24
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Remove file extension when building SimpleLayout  https://review.openstack.org/52535602:03
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Update playbook paths with extension  https://review.openstack.org/52535902:03
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Remove implied playbook extensions  https://review.openstack.org/52523902:03
openstackgerritPaul Belanger proposed openstack-infra/zuul feature/zuulv3: Move send_command() into ZuulApp  https://review.openstack.org/52321102:05
*** jesusaur has quit IRC02:30
*** jesusaur has joined #zuul02:32
dmsimardpabelanger: you used a set of playbooks for standing up zuul/nodepool in Sydney right ? was that windmill ?03:03
pabelangerdmsimard: yes, it will use ansible-role-nodepool in git.o.o03:08
pabelangerstill working on coverting all the jobs to native zuulv303:08
dmsimardok, I might re-use that to test a nested zuul+nodepool for ara integration jobs03:08
dmsimardunless it's a horrible idea but I thought might as well test like.. everything03:09
pabelangersure, should work. I still don't have devstack integration just yet, but plan on spending some time on it in the next week for so03:10
dmsimardI don't want to refactor ARA's unit/functional/integration tests ever again, this is mind numbing03:17
dmsimarddefinitely one of the worst part of making maintainable software, the tests :(03:19
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Remove file extension when building SimpleLayout  https://review.openstack.org/52535603:20
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Update playbook paths with extension  https://review.openstack.org/52535903:20
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Remove implied playbook extensions  https://review.openstack.org/52523903:20
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Remove file extension when building SimpleLayout  https://review.openstack.org/52535603:26
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Update playbook paths with extension  https://review.openstack.org/52535903:26
openstackgerritIan Wienand proposed openstack-infra/zuul feature/zuulv3: Remove implied playbook extensions  https://review.openstack.org/52523903:26
*** threestrands has joined #zuul03:27
*** bhavik1 has joined #zuul04:25
*** threestrands has quit IRC05:25
*** jaianshu has joined #zuul07:06
*** hashar has joined #zuul08:21
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: revoke-sudo: only revoke when zuul is sudoer  https://review.openstack.org/52226108:30
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: revoke-sudo: only revoke when zuul is sudoer  https://review.openstack.org/52226108:52
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: bindep: fix install bindep_command fact name  https://review.openstack.org/52550308:52
*** jpena|off is now known as jpena08:55
*** sshnaidm|off is now known as sshnaidm|rover08:58
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: bindep: skip package install when not needed  https://review.openstack.org/52550609:09
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: tox: make install_sibling_packages pip import optional  https://review.openstack.org/52550709:13
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: fetch-tox-output: remove --ignore-missing-args  https://review.openstack.org/52551009:20
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: DNM: negative test  https://review.openstack.org/52243809:22
*** sshnaidm|rover is now known as sshnaidm|rover|a10:18
*** sshnaidm|rover|a is now known as sshnaidm|afk10:18
*** bhavik1 has quit IRC10:26
*** sshnaidm|afk is now known as sshnaidm|rover10:43
*** jkilpatr has quit IRC11:36
jaianshuHi...can anyone look at my config file of nodepool once, i'm getting " return self.config.provider_managers[provider_name]11:41
jaianshuTypeError: unhashable type: 'Provider'11:41
jaianshu " error for `nodepool alien-list`.11:41
jaianshufiles - nodepool.yaml-> http://paste.openstack.org/show/628150/ ,  clouds.yaml -> http://paste.openstack.org/show/628151/11:41
*** jkilpatr has joined #zuul11:55
tobiashjaianshu: what are you expecting from alien-list?12:16
tobiashjaianshu: the targets section looks bogus to me12:18
tobiashjaianshu: alien-list is about to be removed: https://review.openstack.org/#/c/525177/12:19
jaianshutobiash: i have a node provisioned in my provider..i can get the info from  OS_CLOUD=example openstack server list but not from nodepool command. What would be the alternative to get node info then if alien command is going to be removed?12:23
tobiashjaianshu: for listing nodepool managed nodes you should use 'nodepool list'12:24
tobiashalien list is something different12:24
jaianshutobiash: ok..i have removed the targets section..still its not able to get nodes info.12:34
jaianshuanything else i'm missing out here12:34
Shrewsjaianshu: there's a bug in alien-list keeping it from working, but we are removing it entirely soon12:35
jaianshuahh..ok12:38
*** jpena is now known as jpena|lunch12:48
rcarrillocruzShrews: you run your nodepool dev in Docker don't you? do you run in xenial or centos?13:04
Shrewsrcarrillocruz: i do not run a nodepool. i only use docker for running zookeeper for unit tests13:06
rcarrillocruzah ok13:06
*** jaianshu has quit IRC13:30
*** jkilpatr has quit IRC13:47
*** jkilpatr has joined #zuul13:48
*** dkranz has joined #zuul13:50
*** jpena|lunch is now known as jpena13:57
openstackgerritFabien Boucher proposed openstack-infra/zuul feature/zuulv3: WIP: Git driver  https://review.openstack.org/52561414:00
openstackgerritMerged openstack-infra/zuul-jobs master: Fix variable name in npm-publish job  https://review.openstack.org/52532614:05
fbo_jeblair: pabelanger hi, I proposed a wip for the git driver. When you have time, could you let me know if I'm on the right track :)14:20
_ari__msg NickServ identify r3dh@t14:26
pabelangerfbo_: great!14:29
Shrews_ari__: oopsie  :)14:44
ShrewstristanC: about https://review.openstack.org/523640 ... maybe that would be a good thing to make configurable via the [zookeeper] config section? i'm not entirely sure one behavior has a clear advantage over the other, so making it configurable would allow both?15:26
tristanCShrews: sounds good to me, i'll add the option tomorrow15:27
ShrewstristanC: i'd like to hear what jeblair thinks about it, too, since we'd probably do the same with zuul15:27
jeblairShrews, tristanC: hrm -- if we can't tell which one is better, how will our users know which option to set?15:28
Shrewsjeblair: that's why i'm having trouble just accepting it, i guess15:28
tristanCjeblair: ideally, nodepool would wait/retry zk for at least a long time15:29
jeblairwe've seen a lot of zk errors, and i don't remember it just getting stuck....15:29
tristanCjeblair: i put a short timeout because it was enough for the connectionlost issue and because it doesn't change how nodepool currently works (it raises exception if zk isn't available)15:30
tristanCjeblair: the issue with having no retry limit, is that kazoo doesn't seems to log what's going on, and if zk is not started, then nodepool may looks stall while it is in fact waiting for zk15:31
Shrewswell, it does change how it works, doesn't it? if a command would have waitied until ZK returns (may be slower, but would eventually work), now it fails quickly15:32
Shrewsthat's where i'm having my doubts15:32
jeblairi don't know which thing we should do yet, i'll probably need to go read some more code before i have an opinion on that.  but i'm pretty sure i don't want to add an option -- i think it's too much to ask users to know which thing should work better, and it sounds like it would fundamentally change how nodepool works in poor conditions.  that will make it really hard for us to think about problems when debugging if we have to think about two ...15:32
jeblair... completely different failure paths.15:32
Shrewsmore heads thinking about this is very good, IMO  :)15:34
Shrewsi suggested the config option because, as jeblair says, it's been working very well for us as-is and I didn't want to change that out from underneath us. But yeah, I can see having that as an option to be very confusing15:37
jeblairone thought i have: why is retrying forever bad?  certainly in some programs you don't want to do that, but nodepool can't do anything at all if it can't contact zookeeper, so is it bad for it to essentially pause until the connection comes back?15:39
tristanCjeblair: Shrews: i don't mind eitherway, as long as we can silence those connectionlost exception that seems to happen randomly15:39
ShrewstristanC: oh, this is just to clean up logging? i thought there was a concrete example you had on your system15:40
jeblairtristanC: if connection lost exceptions are happening, that's pretty bad -- we should fix those15:40
tristanCjeblair: perhaps there is a way to make kazoo log that it's stuck retrying15:40
jeblairwhenever a connection is lost, we lose ephemeral nodes, and locks, etc....15:40
jeblair(if zuul loses its connection to zk, *all* of the jobs running abort)15:40
tristanCShrews: well clean-up the logging, and I guess improve service quality, I guess node request fail when storenode fails15:40
jeblairwe talked about making a mini-howto for tuning zookeeper, maybe we should now...15:41
Shrewswe can't fix connection lost exceptions, just make sure we are handling them correctly.  :)  but if we're overlogging, would be nice to fix that15:41
jeblairShrews: you and I may not be able to, but tristanC may be able to :)15:42
tristanCheh, for what it worth, we cherry-pick that 1.7 second retry fix, and nodepool isn't raising connectionlost anymore15:43
tristanCi got to go now, i can work on the review tomorrow15:44
jeblairtristanC: do you mean with that change it isn't losing the connection in the first place?15:44
tristanCjeblair: it's probably still loosing the connection, but at least it retry and proceed as if it didn't happen15:44
ShrewstristanC: kazoo.exceptions.ConnectionLoss can be silenced by changing the kazoo logging level. I don't think we can silence that from nodepool15:46
tristanCthis is the kind of exception we were getting every few hours or so: http://paste.openstack.org/show/628181/15:46
*** openstackgerrit has quit IRC15:48
jeblairtristanC: when you have time, let's focus on why you were getting those -- your users are going to be really unhappy if all their jobs reset every few hours15:48
tristanCjeblair: interestingly enough, these errors only happen with the nodepool services15:53
tristanCjeblair: i'll investigate if this is related to how instances are deployed on this cloud, perhaps the qrouter path is different15:53
jeblairtristanC: yeah, that's one thing to look at.  also, whether the host(s) is/are cpu bound (that can cause starvation and timeouts)15:54
pabelangermordred: do you have some time this week to comment on https://review.openstack.org/521324/ ? Would like to see if we could continue forward on it, to help optimize CI resources.  I have some jobs I'd like to update for native v3 and this is a good optimaztion to have if possible16:11
mordredpabelanger: sorry - I  keep opening it and then not actually reviewing it - so yes16:11
pabelangermordred: np, and thanks16:13
*** openstackgerrit has joined #zuul16:53
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Add use_python3 variable to sphinx role and job  https://review.openstack.org/52568816:53
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: DNM testing new build-sphinx job works for python3 projects  https://review.openstack.org/52569016:55
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Add use_python3 variable to sphinx role and job  https://review.openstack.org/52568816:57
openstackgerritAndreas Jaeger proposed openstack-infra/zuul-jobs master: Add use_python3 variable to sphinx role and job  https://review.openstack.org/52568817:00
mordredpabelanger: +2 from me on https://review.openstack.org/#/c/521324 - leaving un +A'd so jeblair can re-review17:04
pabelangermordred: tyty17:06
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Add use_python3 variable to sphinx role and job  https://review.openstack.org/52568817:35
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Update sphinx jobs to use python3  https://review.openstack.org/52569017:37
*** sshnaidm|rover is now known as sshnaidm|off17:47
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Update sphinx jobs to use python3  https://review.openstack.org/52569017:54
openstackgerritMonty Taylor proposed openstack-infra/zuul feature/zuulv3: Use python3 for docs publication  https://review.openstack.org/52571817:55
openstackgerritFabien Boucher proposed openstack-infra/zuul feature/zuulv3: WIP: Git driver  https://review.openstack.org/52561418:07
openstackgerritDavid Shrewsbury proposed openstack-infra/zuul feature/zuulv3: WIP: Add finger gateway  https://review.openstack.org/52527618:15
openstackgerritAndreas Jaeger proposed openstack-infra/zuul feature/zuulv3: Update sphinx jobs to use python3  https://review.openstack.org/52569018:19
openstackgerritAndreas Jaeger proposed openstack-infra/zuul feature/zuulv3: Use python3 for docs publication  https://review.openstack.org/52571818:21
*** jpena is now known as jpena|off18:35
*** JasonCL has joined #zuul19:16
*** electrofelix has quit IRC19:27
Shrewsanyone want to +2A https://review.openstack.org/525177 to finally get rid of alien-list?20:00
*** JasonCL has quit IRC20:29
*** JasonCL has joined #zuul20:39
mordredShrews: done20:45
openstackgerritMerged openstack-infra/nodepool feature/zuulv3: Remove alien_list command  https://review.openstack.org/52517720:54
Shrewsw00t20:55
*** JasonCL has quit IRC21:03
tobiashis the gearman client certificate needed on zuul side?21:07
tobiashah, that's for client authentication21:09
*** JasonCL has joined #zuul21:11
*** jkilpatr has quit IRC21:35
openstackgerritDavid Moreau Simard proposed openstack-infra/zuul-jobs master: Add sphinx_python variable to sphinx role and job  https://review.openstack.org/52576321:39
openstackgerritDavid Moreau Simard proposed openstack-infra/zuul-jobs master: Add sphinx_python variable to sphinx role and job  https://review.openstack.org/52576321:43
openstackgerritJeremy Stanley proposed openstack-infra/zuul feature/zuulv3: Correct exception for missing job dependencies  https://review.openstack.org/52577322:01
*** threestrands has joined #zuul22:05
*** threestrands has quit IRC22:05
*** threestrands has joined #zuul22:05
dmsimardjeblair: btw still seeing an END OF STREAM (although I guess for a shorter duration now?) with that job starting message22:07
openstackgerritJeremy Stanley proposed openstack-infra/zuul feature/zuulv3: Correct exception for missing job dependencies  https://review.openstack.org/52577322:09
jeblairdmsimard: i haven't checked to see if it's running on ze* yet; did it land before friday?22:09
dmsimardYeah it does22:11
dmsimardWell, er, yes it landed22:12
dmsimardAnd it's printed in the consoles22:12
*** JasonCL has quit IRC22:23
jesusaurwanna see something interesting? here's a change that got a full +2 +2 +1 but didn't merge: https://review.openstack.org/#/c/517658/22:25
jesusaurare there any known issues around changes not merging after passing the gate jobs?22:27
dmsimardjesusaur: when that happens it's usually because either the parent patch has not merged or a Depends-On patch hasn't merged. I don't think that's the case here, looks like a legit issue.22:28
dmsimardjesusaur: I think an obvious attempt at working around the issue would be to rebase the patch and +W again22:31
dmsimarduntil someone gets a chance to look at logs or something like that.22:31
dmsimardthe patch mentions the parent is out of date so it can be rebased, maybe it failed a merge silently ?22:32
dmsimard¯\_(ツ)_/¯22:32
jesusaurwell, zuul was able to rebase otherwise it wouldn't start the gate jobs, right?22:34
jesusaurbut maybe it's that jgit vs cgit merge issue? I'll try rebasing22:34
dmsimardjesusaur: It's probably not that, hard to tell without looking at logs :(22:35
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Correct exception for missing job dependencies  https://review.openstack.org/52577322:40
*** JasonCL has joined #zuul22:45
*** JasonCL has quit IRC22:51
SpamapSdo we have examples of the best "zuulv3 way" for jobs to have one job build an artifact, and many jobs consume that artifact?22:56
SpamapSLike I want to build a docker image, and have it used by 3 integration test jobs.22:56
SpamapSoh! zuul_return will dump them in, neat.22:57
dmsimardSpamapS: there's been different discussions about that, there is nothing yet built in afaik23:10
dmsimardIn v2 for RDO we have a parent job that uploads it's log at a predictable location and then the child jobs are able to reconstruct that location to download the artifacts23:11
dmsimardWould need to look at chat logs but jeblair and mordred definitely have something in the backlog23:12
dmsimardSpamapS: those are other ideas we had discussed back in Denver in the particular use case of container images https://etherpad.openstack.org/p/rdo-registry-ideas23:13
*** hashar has quit IRC23:15
*** JasonCL has joined #zuul23:33
mordreddmsimard: what did I do?23:35
mordredSpamapS, dmsimard: ah - yah, that old chestnut ...23:36
*** JasonCL has quit IRC23:39
SpamapShttps://docs.openstack.org/infra/zuul/feature/zuulv3/user/jobs.html#return-values23:41
SpamapSSuggests that I should get anything that isn't under zuul: will get set as a variable in child jobs23:42
* SpamapS has broken brain23:42
*** JasonCL has joined #zuul23:58

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!