Thursday, 2017-03-23

SpamapS	aaaand EOD strikes again	00:00
* SpamapS vows to return		00:00
openstackgerrit	Jesse Keating proposed openstack-infra/zuul feature/zuulv3: Query changes only via relevant sources https://review.openstack.org/448257	00:45
jlk	^^ That was a 31 line (outside of tests) patch for 2.5, down to a 5 line patch for v3.	00:52
openstackgerrit	K Jonathan Harker proposed openstack-infra/zuul feature/zuulv3: Perform pre-launch merge checks https://review.openstack.org/446275	01:33
jesusaur	jlk: wow, awesome, that's so much simpler	01:34
SpamapS	oh how embarassing	05:24
* SpamapS accidentally started refactoring on master.. that makes a lot of sense why it made no damn sense at all		05:24
openstackgerrit	Clint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Refactor out Changeish https://review.openstack.org/448913	06:08
SpamapS	jeblair: please disregard my master-brain-fart ... ^^ is the real slim shady	06:08
SpamapS	though now I have 65 fails instead of 26	06:08
*** isaacb has joined #zuul		06:17
*** jasondotstar has quit IRC		07:03
*** tristanC has quit IRC		07:03
*** dmellado has quit IRC		07:03
*** rattboi has quit IRC		07:04
*** tristanC has joined #zuul		07:05
*** rattboi has joined #zuul		07:05
*** jasondotstar has joined #zuul		07:08
*** dmellado has joined #zuul		07:08
*** isaacb has quit IRC		07:13
*** isaacb has joined #zuul		07:23
lennyb	Hi, is there a way to skip files in zuul, but still be able to merge the code? I have a gating job #link http://paste.openstack.org/show/603873/ but I want to skip checking some files. currently I see a note of 'Staring job' in gerrit, but since no job was triggered, there is no SUCCESS/FAIL message and no code merge.	08:40
tobiash_	lennyb: if you filter out every job but still want zuul to gate/merge stuff you can add additionally the "noop" job	08:49
*** openstackgerrit has quit IRC		09:03
*** hashar has joined #zuul		09:03
lennyb	tobiash_, thanks.	09:25
lennyb	tobiash_, you mean adding 2 jobs, one that will listen only to files that I want to skip, run noop and vote/merge, and another one to listen to all the rest?	09:51
tobiash_	lennyb: no, just add noop to your project	10:07
tobiash_	lennyb: noop is a reserved 'do nothing' job zuul provides and doesn't consume any resources like slaves	10:08
tobiash_	lennyb: adding this then just assures that at least the noop job is run	10:08
tobiash_	lennyb: in case your job is not filtered then your job plus noop 'run'	10:09
tobiash_	lennyb: in case your job is filtered just the 'noop' job 'runs' which means zuul just checks if the patch can merge	10:09
* lennyb went to implement tobiash_ 's instructions.		10:15
lennyb	I guess there is still no way to reload layout.yaml with restarting zuul	10:31
*** bhavik1 has joined #zuul		10:47
lennyb	tobiash_, thanks. noop works.	10:55
lennyb	is there a way to configure zuul, that 'gate' job will abort/cancel ordinary 'check' pipeline/job that was already started?	10:56
tobiash_	lennyb: how does that happen? normally gate only starts when check is done	10:58
lennyb	tobiash_, if I understood you correctly, noop still fails on merge conflict. I am trying to prevent bad code merges.	10:59
lennyb	I triggered gate pipeline without waiting for check to be finished	11:00
lennyb	and added precedence: high to gate pipeline	11:00
tobiash_	lennyb: yes, noop still fails on merge conflict	11:01
tobiash_	lennyb: zuul cannot cancel check jobs when gate starts	11:01
lennyb	tobiash_, thanks. your assistance and help highly appreciated	11:04
tobiash_	yw	11:05
*** bhavik1 has quit IRC		11:08
pabelanger	lennyb: SIGHUP on zuul will reconfigure jobs. No need to stop the process	11:27
lennyb	pabelanger thanks	11:27
*** openstackgerrit has joined #zuul		12:26
openstackgerrit	David Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Add check for valid zk attribute before disconnect https://review.openstack.org/449110	12:26
openstackgerrit	David Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Do not require secure file for nodepoold https://review.openstack.org/449118	12:37
pabelanger	Shrews: jeblair: re: 446683. If I try / except for closing the socket, is it safe to assume if the socket fails to close, not to try closing Transport(), since it uses the socket? Or just be safe and try / except both individually	12:51
Shrews	pabelanger: not sure. we should probably try to close it anyway. i think clarkb (i may be mistaken) mentioned something somewhere about leaking paramiko connections causing issues... maybe i read that in #infra	12:53
Shrews	i can't remember much about it	12:53
pabelanger	safe is better	12:54
openstackgerrit	Paul Belanger proposed openstack-infra/nodepool feature/zuulv3: Remove SSH support from nodepool https://review.openstack.org/446683	12:56
openstackgerrit	David Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Docs: Remove "jenkins" reference https://review.openstack.org/449129	13:02
openstackgerrit	David Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Docs: Clarify secure file usage https://review.openstack.org/449130	13:06
openstackgerrit	David Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Docs: Remove cron references https://review.openstack.org/449140	13:21
openstackgerrit	David Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Docs: Correct availability-zones documentation. https://review.openstack.org/449147	13:30
Shrews	hrm, our nodepool dsvm jobs seem to be timing out a lot now on ssh access	13:33
openstackgerrit	David Shrewsbury proposed openstack-infra/nodepool feature/zuulv3: Docs: Remove refs to removed nodepool commands https://review.openstack.org/449152	13:39
mordred	Shrews: I was just about to ask you if that was a thing you were seeing too	13:42
Shrews	mordred: are you seeing this outside of nodepool?	13:42
Shrews	well, i guess not	13:42
mordred	Shrews: not outside of nodepool -but I noticed it on a shade patch	13:43
Shrews	fun	13:43
mordred	yah. I'm super excited about that	13:43
Shrews	mordred: guess it's time to start digging into devstack changes	13:44
tobiash_	how does nodepool check if the node has fully booted in the future if ssh support gets removed?	13:45
mordred	Shrews: I just pinged in qa channel	13:45
tobiash_	via keyscan?	13:46
pabelanger	tobiash_: currently yes	13:46
mordred	tobiash_: I think the ssh-keyscan will still get us that - the keyscan shouldn't work unless ssh daemon is running and booted	13:46
pabelanger	right, a little trade off, as that doesn't really show us the inside of the VM	13:47
tobiash_	ok, I think that will work as long as the user is preexisting in the image	13:48
pabelanger	I'm kinda on the fense of the removal of it, cause there is some value on doing SSH things in nodepool today. But, I understand why we want to remove it	13:48
tobiash_	less code is better code :)	13:49
pabelanger	ya	13:49
pabelanger	Happy to try it out too	13:49
tobiash_	just curious, is ansible capable of retrying ssh connections?	13:50
pabelanger	I think so	13:50
pabelanger	check ansible.cfg	13:50
mordred	I actually do not see a setting for that	13:51
mordred	although ssh -o arguments can be passed through	13:51
pabelanger	https://github.com/ansible/ansible/blob/devel/lib/ansible/constants.py#L374	13:52
pabelanger	seems to be undocumented	13:52
Shrews	jeblair: rbergeron: I have made a pass through the current nodepool docs and put up reviews for inconsistencies I found.	13:54
Shrews	rbergeron: if you're interested, https://review.openstack.org/#/q/status:open+project:openstack-infra/nodepool+branch:feature/zuulv3+topic:docs	13:55
tobiash_	would have saved me some time if this cool feature would have been documented :-/	13:56
Shrews	mordred: wow. every review i have up is failing dsvm jobs	14:00
mordred	Shrews: yah - I have this feeling something has recently broken in openstack	14:01
Shrews	pabelanger: "Exception closing paramiko: local variable 't' referenced before assignment"	14:02
Shrews	pabelanger: http://logs.openstack.org/83/446683/9/check/gate-dsvm-nodepool/0dd848b/logs/screen-nodepool.txt.gz	14:02
pabelanger	Shrews: k	14:09
openstackgerrit	Paul Belanger proposed openstack-infra/nodepool feature/zuulv3: Remove SSH support from nodepool https://review.openstack.org/446683	14:13
openstackgerrit	Monty Taylor proposed openstack-infra/nodepool master: Add novaclient and keystoneauth debug logging https://review.openstack.org/449173	14:24
openstackgerrit	Monty Taylor proposed openstack-infra/nodepool master: Fetch server console log if ready_script fails https://review.openstack.org/358730	14:48
openstackgerrit	Monty Taylor proposed openstack-infra/nodepool master: Fetch server console log if ready_script fails https://review.openstack.org/358730	14:49
clarkb	Shrews: pabelanger ya you need explicit closes on paramiko clients now or they leak connections	14:54
rbergeron	shrews: aye, will peek -- also, i am in RDU on monday / tuesday next week if you want to get together at some point in there (though my schedule is still in flux for things being done while there)	15:10
openstackgerrit	Monty Taylor proposed openstack-infra/nodepool master: Fetch server console log if ssh connection files https://review.openstack.org/358730	15:34
*** isaacb has quit IRC		15:49
jlk	o/	16:23
*** isaacb has joined #zuul		17:11
openstackgerrit	Monty Taylor proposed openstack-infra/nodepool master: Fetch server console log if ssh connection fails https://review.openstack.org/358730	17:14
* mordred o/ at jlk		17:54
*** hashar has quit IRC		17:58
*** isaacb has quit IRC		18:05
openstackgerrit	Jesse Keating proposed openstack-infra/zuul feature/zuulv3: Add support for github enterprise https://review.openstack.org/449258	18:06
jeblair	clarkb, mordred, Shrews: regarding http://lists.openstack.org/pipermail/openstack-infra/2017-January/005018.html should ipv6-preferred be a provider or pool attribute?	19:58
mordred	jeblair: neither - we should just leave that in clouds.yaml	20:00
clarkb	jeblair: thats a hard one.	20:00
clarkb	mordred: but its nodepool config as nodepool also needs to be able to speak ipv6	20:00
clarkb	so I think clouds.yaml is the wrong place for it	20:00
mordred	I disagree	20:00
mordred	the current logic is that shade detects whether the calling host can speak ipv6	20:00
clarkb	if a cloud can speak ipv6 that doesn't reflect on whether or not nodepool can too	20:00
mordred	and if so, it defaults to ipv6	20:01
mordred	ipv6-preferred is only used to turn off ipv6 communication	20:01
jeblair	oh i was assuming it was still a thing since https://review.openstack.org/410802 is still around, but a confess i don't know the current status of that	20:01
mordred	jeblair: yah - I want to land that for v3	20:01
mordred	jeblair: I stopped pushing it for v2 because v3 was coming	20:01
jeblair	mordred: but that still has the option?	20:01
mordred	that's because I was trying to not break backwards compat too badly	20:02
mordred	but we're already breaking in v3 - so I think cleaning out the things where nodepool and shade duplicate each other would be stellar	20:02
mordred	maybe I should respond to your email with a write up of the bits I think shoudl all no longer be in nodepool.yaml so we can discuss	20:02
clarkb	mordred: so you'd proviude different clouds.yamls based on local host networking? (that seems wrong but I guess it would work so maybe there isn't anything wrong with it)	20:02
jeblair	mordred: might be easier to review https://review.openstack.org/447647	20:03
mordred	clarkb: not really - you should normally never need the setting - it's only a release valve in case something is broken in your setup	20:03
mordred	jeblair: kk	20:03
clarkb	mordred: which is super common with ipv6 (which is why I am concerned)	20:03
clarkb	mordred: I had to turn off ipv6 globally at home when comcasting more often than I care to remember	20:03
mordred	clarkb: right - but so far none of our clouds with ipv6 have been comcast	20:04
mordred	but if they were, we'd just add "force-ipv4: true" into the clouds.yaml for our comcast cloud	20:04
mordred	THAT SAID - I still may be wrong here	20:04
mordred	since the host we're using to create the VM and the host we're actually using it from are, in fact, different hosts	20:05
mordred	so the shade-level detection of ipv6 capabilities is not going to work for us	20:05
clarkb	ya I think that is functional, just weird we'd use different clouds.yamls for the same cloud user/tenant depending on where we use the clients	20:05
jeblair	mordred: but still, in that case, we could set the ipv4 thing in clouds.yaml, right?	20:05
mordred	clarkb: I do not think that's what I was suggesting	20:05
mordred	jeblair: yes	20:05
mordred	clarkb: on our servers, if one of our servers we're using to launch things had broken ipv6, I'd expect us to deconfigure it fro that server's network config	20:06
mordred	and not have broken ipv6 configured live on a production server	20:06
clarkb	gotcha	20:06
clarkb	good pint	20:06
jeblair	clarkb: good pint? cheers! ;)	20:07
clarkb	prost	20:07
mordred	jeblair: so - I think there are a few different concerns	20:07
jeblair	so i think what i will do is leave it as a provider attribute in the patch i'm writing, then remove it as a followup, so it doesn't get buried. but now i will wait to hear concerns. :)	20:08
mordred	1) servers launching and using clouds having broken ipv6 - can be dealt with as mentioned above, don't think we need an option - if you have broken ipv6, stop configuring broken	20:08
mordred	2) clouds giving broken ipv6 - clouds.yaml has force-ipv4, we're fine	20:08
mordred	3) an installation of zuul v3 which is not ours in which the nodepool launchers are on ipv6 enabled hosts but the zuul executors are not	20:09
mordred	I think I'd rather let people who try to run in that configuration be broken adn complain to us and then justify why they should not fix their environment	20:09
jeblair	mordred: 3 can still set force-ipv4, right?	20:10
mordred	jeblair: yes - 3 could also still set force-ipv4 - good point	20:10
mordred	so there is a release valve for those people	20:10
mordred	cool	20:10
jeblair	we can even put that in the manual with a lecture. :)	20:10
mordred	+100	20:10
mordred	jeblair: I will respond to email and patch with thoughts on config things to remove	20:10
jeblair	okay. will go with plan above. back to the yaml mines.	20:10
jeblair	mordred: okay. check the patch first -- that may already remove most of the things.	20:11
mordred	jeblair: I did, and it does, but I left a few more comments	20:17
jeblair	mordred: i'm building an immensely complicated change on that; can we do those in a followup?	20:18
mordred	jeblair: oh, absolutely!!!	20:19
mordred	jeblair: I mostly listed them there for reference - maybe I shoudl go say that	20:19
jeblair	mordred: cool, i'd like to just fix the bug in that change, then land it and mine, then continue cleanup. i think it's great to reference them there. but maybe with a +2 :)	20:19
mordred	jeblair: ++	20:20
*** hashar has joined #zuul		20:30
* SpamapS opens up the box of 65 broken tests and starts digging in		21:18
openstackgerrit	K Jonathan Harker proposed openstack-infra/zuul feature/zuulv3: Perform pre-launch merge checks https://review.openstack.org/446275	21:19
jesusaur	^that^ isn't quite working yet, but i would appreciate a review to know if i'm way off base	21:22
jeblair	jesusaur: i don't think you should need to change the executor? this should all happen before launching a job.	21:37
jesusaur	jeblair: i think i may need to for the second test case i'm adding, but you're right that it was the wrong avenue for the original test case	21:44
jhesketh	Morning	21:44
jesusaur	i suppose i can split it into two changes, if that would make reviewing easier	21:44
jesusaur	jeblair: but either way, shouldn't the executor handle the case of a merge failure?	21:45
jeblair	jesusaur: gotcha. yes -- though we may want to send the specific commits that the merger used to the executor. then the executor can create exactly what the merger did. this has two advantages: it solves the problem you pointed it, and it also ensures that all the jobs for a given change will run with the same code. that's something we have today, and i think it would be a regression if we let that slip for v3.	21:51
jesusaur	jeblair: good point	21:52
openstackgerrit	James E. Blair proposed openstack-infra/nodepool feature/zuulv3: Update nodepool config syntax https://review.openstack.org/448814	21:52
jeblair	Shrews, mordred, clarkb: https://review.openstack.org/448814 passes tests locally	21:54
mordred	jeblair: woot	21:54
mordred	BWAHAHAHAHA	21:54
mordred	https://review.openstack.org/#/c/448814/2/nodepool/tests/fixtures/clouds.yaml	21:54
mordred	that makes me happy	21:54
jeblair	heh	21:55
mordred	jeblair: out of curiosity ... (I;m sure we've both talked about this and it's likely also written down somewhere) ... why "diskimages" instead of just "images"?	21:57
jeblair	mordred: in provider or at root?	21:58
mordred	in provider. in root makes total sense - the builder builds diskimages	21:59
jeblair	mordred: it's the section that says what diskimages to upload. it's a direct reference to the object specified in the root 'diskimages' section, so this better ties the two together. label then references the provider diskimage object.	21:59
mordred	yah. ok	21:59
mordred	makes sense	21:59
jeblair	mordred: if we wanted to launch nodes from non-diskimage based labels, i would probably set an 'image:' attribute on the provider label.	21:59
jeblair	mordred: (that doesn't really preclude calling provider.diskimages provider.images, however, i think if we did add non-diskimage image support to labels, having the diskimage portion of provider called diskimage makes that more clear)	22:00
mordred	++	22:00
mordred	jeblair: I like this patch a lot so far - the only things that stand out to my brain are the things we were talking about earlier that will be good follow ups	22:01
jeblair	ya, i'm going to do a doc shuffling patch, then those	22:01
openstackgerrit	James E. Blair proposed openstack-infra/nodepool feature/zuulv3: Update docs for configuration syntax change https://review.openstack.org/449339	22:18
mordred	jeblair: +2 on both - comments on both +2s	22:22
openstackgerrit	James E. Blair proposed openstack-infra/nodepool feature/zuulv3: Remove api-timeout and provider.image-type https://review.openstack.org/449342	22:29
jeblair	mordred: cool, will check. there's 2 out of 3 removals ^	22:29
jeblair	mordred: for your network question -- are you saying you can say "network: NETWORK_ID" as well as "network: NETWORK_NAME"? or are you referring to the "network: net-id" and "network: net-label" forms?	22:34
jeblair	mordred: (the second is a deprecated form)	22:34
mordred	jeblair: well - it sort of depends I guess	22:35
mordred	jeblair: we can totally just do network: NETWORK_NAME_OR_ID and pass that in to the create_server call in shade and shade will figure it out. we can also collect network.name or network.id if we wanted to ...	22:36
jeblair	mordred: i vote #1	22:36
* mordred hangs head for not having cleaned up the network bits in the create_server call yet		22:36
mordred	awesome	22:36
mordred	that's what I vote too	22:36
jeblair	i'm writing it right now	22:36
jeblair	though, sorry, it distracted me from the ipv6 change	22:36
mordred	and if we hit a point in the future where someone needs to do more complex things with network connections and constructors, we can cross that bridge when we get there	22:36
jeblair	mordred: should that be a list or a single string?	22:37
jeblair	(it's a list right now)	22:37
Shrews	mordred: did you find anything on that node you held today for the nodepool dsvm failures?	22:38
Shrews	jeblair: did you want to -A https://review.openstack.org/447647 until we get the dsvm fails figured out?	22:39
Shrews	eh, the parent is still up for review	22:40
jeblair	Shrews: not especially -- the current patchset passed them.	22:41
Shrews	jeblair: umm, no?	22:41
jeblair	Shrews: yep. Patch set 5 passed them on march 20 at 21:19	22:42
Shrews	oh, yeah. i was still on the parent	22:42
Shrews	duh	22:43
jeblair	there was a recheck since then which has failed along with everything else, but i'm happy than 647 passes	22:43
mordred	jeblair: I think a list	22:48
mordred	jeblair: one sec ...	22:48
SpamapS	does anything ever remove zuul refs from mergers?	22:55
jeblair	SpamapS: no. it's a problem. in v3 we should stop setting refs in mergers, then it won't be.	22:56
SpamapS	wellllllllll	22:57
jeblair	SpamapS: (we haven't done that yet)	22:57
SpamapS	so there's a problem with Github	22:57
SpamapS	If you push -f over the source branch for a PR	22:57
SpamapS	GitHub GC's aggressively	22:57
SpamapS	so having commits on the merger is nice for a specific case which is tracking down abusers.	22:57
mordred	jeblair: it can be either a name or a list	22:58
mordred	:param network: (optional) Network dict or name or ID to attach the	22:58
mordred	server to. Mutually exclusive with the nics parameter.	22:58
mordred	Can also be be a list of network names or IDs or	22:59
SpamapS	jeblair: basically as long as we can archive-on-gc we'll be fine.	22:59
mordred	network dicts.	22:59
mordred	jeblair: so it can be a name_or_id, a dict containing either an id or name key, or a list of any of those things	22:59
jeblair	mordred: cool, thanks	22:59
mordred	jeblair: basically - you pick - we'll handle it :)	22:59
jeblair	SpamapS: we might want to see about finding another way to handle that. i think even with the current behavior, the intent is that mergers aren't reliable long-term data stores.	23:00
jeblair	SpamapS: (part of the problem we're solving with removing refs from the mergers is that repos with many refs get very very slow)	23:01
SpamapS	jeblair: right, just as long as we can archive-on-removal that should be fine.	23:02
jeblair	SpamapS: that sounds great, but i don't know what that means. :)	23:02
SpamapS	however we identify refs that we're going to remove..	23:02
jeblair	SpamapS: oh, i'm saying that in v3 we should not create them in the first place	23:02
SpamapS	at that point, I want a hook to be able to run an archive command with access to the commit	23:02
SpamapS	yeah, if we don't make refs, that' fine too, just give me an archive-all-the-merges option. :)	23:03
SpamapS	I am signing up to write it.	23:03
SpamapS	when the time comes to get rid of the refs	23:03
mordred	I bet you could do it as a site-specific pre-playbook	23:03
mordred	inyour base job	23:03
SpamapS	but basically, since Github is mutable, unlike Gerrit, we have to do extra work (thanks github!)	23:04
SpamapS	mordred: that's what I'm hoping actually	23:04
SpamapS	if there's a way to do a final playbook that runs before all others, that's where I'd do it.	23:04
mordred	just have a place you want to push refs to - and you could, before you do your rsync of the repos to the remote hosts, just also push to your archival location	23:04
SpamapS	bah	23:04
SpamapS	that did not parse well	23:04
mordred	I get what you're saying :)	23:04
SpamapS	a playbook, that is marked in the job config as final ..	23:04
SpamapS	because yeah, ideally I can just do a 'git: src={{local_repo}} dest={{archive_remote}}/{{ item }}' or whatever the right incantation is.	23:06
SpamapS	actually ideally	23:06
jeblair	at the moment at least, i think that could still be bypassed by not inheriting from the base job, so we might have to add something to enforce that.	23:06
SpamapS	I can convince GitHub to make refs on PRs	23:06
SpamapS	and end the madness ;)	23:06
jeblair	however, we may still want to consider doing this at the merger	23:06
jeblair	because that's 1:1 vs 1:N where N=number of jobs	23:07
*** hashar has quit IRC		23:07
SpamapS	jeblair: you can't create jobs that don't exist in a config repo, .zuul.yml, right?	23:07
SpamapS	jeblair: you can't create jobs that don't exist in a config repo, in .zuul.yml, right?	23:07
mordred	jeblair: good point	23:07
jeblair	SpamapS: you can create jobs in any (.?)zuul.yaml in any kind of repo.	23:07
SpamapS	oh ok	23:07
SpamapS	so then we make base do something that enables nodes to work at all	23:08
mordred	or - put in a merger hook	23:08
SpamapS	yeah	23:08
jeblair	yeah, i think for this problem at least, we should focus on the merger.	23:08
mordred	cause like jeblair says, we make the merging once, one merge may have multiple jobs	23:08
SpamapS	we'll cross that bridge when we come to it.	23:08
SpamapS	Think of me when you remove refs. :)	23:08
SpamapS	weeeeee down to 14 failed tests	23:09
jeblair	and having the merger push the archive to a non-merger git repo lets us keep the merger lean and fast, so we don't have the problem we have today.	23:09
openstackgerrit	Clint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Refactor out Changeish https://review.openstack.org/448913	23:10
openstackgerrit	James E. Blair proposed openstack-infra/nodepool feature/zuulv3: Remove deprecated networks syntax https://review.openstack.org/449354	23:10
jlk	only 14? sounds like progress!	23:10
jeblair	mordred: ^ that actually doesn't switch to relying on shade for that, but it does make the config change.	23:10
jeblair	mordred: i think the shade change should be an easy followup though	23:11
mordred	jeblair: cool- I can do the switch-to-shade patch	23:11
mordred	yah	23:11
jeblair	mordred: cool, thx	23:11
mordred	jeblair: oh - speaking of	23:11
mordred	the server_console on failure patch did not work	23:11
SpamapS	Ran 202 (-26) tests in 883.258s (+120.385s)	23:11
SpamapS	damn	23:11
SpamapS	26 less probably means a lot timed out :-P	23:11
jeblair	mordred: even after the fix of the fix?	23:11
SpamapS	+120s too ohwell	23:12
mordred	because the shade get_server_console call relies on a shade extension to the task_manager api ... which I will fix soon	23:12
SpamapS	jlk: s/14/40/ ... but still, better than 65	23:12
mordred	jeblair: the reasons are all bong, they aren't good, but they're trivial to fix, so I'm just going to do that rather than bore you with why any of the problems exist in the first place	23:12
jeblair	mordred: can i convince you to redo your ipv6-preferred change on the end of my stack? it needs updating since we've decided to break backwards compat, and i think you'd have an easier time with the port than i.	23:14
mordred	jeblair: yup	23:15
mordred	jeblair: I will do both ipv6-preferred and network	23:16
mordred	and also server-console	23:16
SpamapS	hrm.. most problems are AttributeError: 'FakeGerritConnection' object has no attribute 'connection'	23:17
mordred	SpamapS: have you checked to see if FakeGerritConnection has an attribute called connection ?	23:18
mordred	SpamapS: also, have you tried turning it off and turning it back on again?	23:18
* mordred lobs a marmot at SpamapS		23:18
SpamapS	No this is just leftover master stupidity from porting my patch	23:18
pabelanger	mordred: did you see this: http://logs.openstack.org/30/358730/5/check/gate-dsvm-nodepool/af54c12/logs/screen-nodepool.txt.gz#_2017-03-23_20_08_20_043	23:20
openstackgerrit	Clint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Refactor out Changeish https://review.openstack.org/448913	23:20
pabelanger	for console-log	23:20
SpamapS	woooooot	23:20
* SpamapS thinks this might work now		23:20
* SpamapS notes two pep8 problems argh.		23:20
mordred	pabelanger: yup	23:21
mordred	pabelanger: working on fix now	23:21
pabelanger	ack	23:21
mordred	oh for the love of - I'm going to have no way to land it	23:21
mordred	sigh	23:21
mordred	this is going to be fun	23:21
openstackgerrit	Monty Taylor proposed openstack-infra/nodepool master: Fetch server console log if ssh connection fails https://review.openstack.org/358730	23:26
mordred	pabelanger: that may workaround that issue	23:26
pabelanger	ah	23:27
jeblair	mordred: how do i specify an image format in clouds.yaml?	23:28
mordred	jeblair: image_format: qcow2	23:29
mordred	jeblair: it defaults to qcow2, and each vendor profile sets it	23:30
mordred	so it's rare you actually need to set it	23:30
jeblair	mordred: i don't see that in the docs?	23:30
mordred	jeblair: that's becaues i'm a bad person	23:30
jeblair	mordred: yeah, i'm trying to fix up the nodepool tests which use image format	23:30
mordred	jeblair: http://git.openstack.org/cgit/openstack/os-client-config/tree/os_client_config/defaults.json	23:30
mordred	jeblair: those are the default values every cloud object starts with	23:31
jeblair	mordred: oh, i found it	23:31
mordred	woot!	23:31
jeblair	mordred: it was the - to _ issue	23:31
jeblair	i thought i wrote a patch to fix that	23:31
mordred	\o/	23:31
SpamapS	oh neat... I have native ipv6 so I was able to telnet into the OSIC node that ran my tests	23:32
mordred	SpamapS: isn't that awesome?	23:32
jeblair	mordred: i guess it doesn't work	23:32
SpamapS	We're livin in the future!	23:32
SpamapS	mordred: I'm always shocked when I run a 'ss -tn' and I see that there are more ipv6 than not.	23:32
SpamapS	mostly from my browser	23:32
jeblair	or maybe it was just constructor args?	23:33
SpamapS	Ran 184 (-18) tests in 752.955s (-130.301s)	23:33
SpamapS	FAILED (id=19, failures=4 (-10), skips=22)	23:33
SpamapS	two Alarm Clocks	23:33
openstackgerrit	James E. Blair proposed openstack-infra/nodepool feature/zuulv3: Remove api-timeout and provider.image-type https://review.openstack.org/449342	23:34
openstackgerrit	James E. Blair proposed openstack-infra/nodepool feature/zuulv3: Remove deprecated networks syntax https://review.openstack.org/449354	23:34
jeblair	mordred: ^ updated	23:35
mordred	woot	23:35
mordred	jeblair: it occurs to me that I should make an api call for that so that we're not just reading out of the config dict	23:35
SpamapS	I forget now how to switch to soft timeouts	23:35
jeblair	mordred: what is raw?	23:36
mordred	jeblair: one of the image formats?	23:37
jeblair	mordred: taskmanager	23:37
mordred	oh - it's a flag inside of shade that tells it not to interpret the result but instead return the result object unaltered ... it showing up here is _clearly_ a layer violation and I will kill it with fire	23:38
mordred	but I can't land patches to shade until we fix the dsvm job	23:38
mordred	so that in that nodepool patch is purely so we might be able to get debugging to figure out why the nodepool dsvm jobs are broken	23:39
jeblair	mordred: ok. i -1d with a request to add a TODO, but maybe we wait until we get the results before we push up a fix for that. :)	23:40
mordred	yah.	23:41
jeblair	(just so we don't forget to remove it)	23:41
mordred	jeblair: I'm ... so super happy that we're as deep on sorting out that issue as we are	23:41
clarkb	we should be able to push a self testing change with hacks around the raw issue to get the output right?	23:44
clarkb	also has anyone just tried running reproduce.sh to reproduce?	23:44
jeblair	clarkb: yeah, mordred's latest PS is that. i guess we could start a depends-on stack, but this is probably simpler for now.	23:46
clarkb	oh I didn't see there was a new patchset	23:47
mordred	there is also a held node	23:48
mordred	I put in a nodepool auto-hold	23:48
mordred	but have not been able to dive in to that level yet	23:48
clarkb	if I'm betting and guessing something related to dib 2.0 that makes the image not boot right	23:50
clarkb	198.72.124.55 is the held server, I am going to take a quick look at it	23:51
mordred	clarkb: oh. I didn't even think about dib 2.0	23:52
mordred	clarkb: aren't we using that in prod already?	23:52
pabelanger	right, but we've been using 2.0 for a while now. Maybe a week	23:53
clarkb	mordred: http://paste.openstack.org/show/603999/ there you go	23:53
clarkb	glean appears to be at fault	23:54
clarkb	I will look quickly at patching it	23:54
pabelanger	oh, where is that?	23:54
clarkb	pabelanger: in console log of nested VM off of auto held nodepool vm that ran the job	23:55
clarkb	IP is up a couple lines	23:55
clarkb	huh glean release is 6 months old	23:55
pabelanger	ah, I see issue in code	23:55
clarkb	oh wait its the src job	23:55
pabelanger	ya	23:56
clarkb	so its src glean?	23:56
clarkb	doesn't the regular job fail too though?	23:56
pabelanger	I think we need to see config-drive	23:56
openstackgerrit	Jesse Keating proposed openstack-infra/zuul feature/zuulv3: Add trigger capability on github pr review https://review.openstack.org/449365	23:56
pabelanger	clarkb: it looks like we have an interface with no type	23:57
pabelanger	eg: ipv4 / ipv6	23:57
clarkb	ya	23:57
clarkb	I think that may be normal and why there is a continue there	23:58
clarkb	we just don't handle it properly	23:58
pabelanger	ya	23:58
clarkb	https://review.openstack.org/449366 lets see if that does better (it will run the nodepool test too ya?)	23:59
pabelanger	ya	23:59
pabelanger	so, we should see something different	23:59

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!