Wednesday, 2016-11-30

clarkbpabelanger: will also be interesting to dig through the logs and see what our ratio between failed uploads and successful uploads is00:02
openstackgerritIan Wienand proposed openstack-infra/nodepool: Activate virtualenv before running dib  https://review.openstack.org/40448700:40
adam_gjeblair: i dont have handy ATM, but it was item.change.refspec and item.change.branch that causes them at https://git.openstack.org/cgit/openstack-infra/zuul/tree/zuul/launcher/client.py?h=feature/zuulv3#n4701:20
adam_gjeblair: item.change there is either a zuul.model.Change or zuul.model.Ref, depending on event. Only a Change has branch and refspec attributes.01:22
adam_gim kinda working backward from assumptions that are encoded in the test, so im not sure if the variance in type of item.change is a bug, or if its just something that needs to be fixed up with the Changish/Change/Ref work you mentioned01:23
mordredadam_g: the tl;dr on the Changeish/Change/Ref thing (which I can't remember if it's written down yet)01:36
mordredadam_g: is to rework them so that Ref is the base class and Change is a type of ref, rather than Changeish being the base class and Change and Ref each being types of that01:36
jeblairmordred: https://storyboard.openstack.org/#!/story/200078101:37
mordredadam_g: it's not an essential thing, and is pretty invasive, so I think it was a thing we weren't prioritizing - but when we did a data model walkthrough a couple of months ago - it jumped out as a thing that could be more understandable01:37
mordredjeblair: zomg01:38
mordredadam_g: so - ignore me and just read the actual story in storyboard01:38
jeblair(or add text to it if it should be more clear :)01:38
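Editor's sketch of the rework mordred describes above, with Ref as the base class and Change as a kind of Ref. The class fields and the merger_item helper are hypothetical illustrations, not Zuul's actual zuul.model code. The only detail taken from the discussion is that branch and refspec exist on a Change but not on a plain Ref.

```python
class Ref:
    """Hypothetical base class: something identified by a git ref."""

    def __init__(self, project, ref):
        self.project = project
        self.ref = ref


class Change(Ref):
    """Hypothetical subclass: a Ref that is also a reviewable change."""

    def __init__(self, project, ref, branch, refspec):
        super().__init__(project, ref)
        self.branch = branch
        self.refspec = refspec


def merger_item(change):
    """Build a dict that works for either type (illustrative only)."""
    item = {"project": change.project, "ref": change.ref}
    if isinstance(change, Change):
        # Only a Change carries branch and refspec.
        item["branch"] = change.branch
        item["refspec"] = change.refspec
    return item


post = Ref("openstack-infra/zuul", "refs/heads/master")
check = Change("openstack-infra/zuul", "refs/zuul/master/Zabc",
               "master", "refs/changes/66/404466/1")
print(merger_item(post))
print(merger_item(check))
```

With this shape, code that receives either type can rely on the Ref attributes and only touch branch and refspec after an isinstance check.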
openstackgerritJamie Lennox proposed openstack-infra/nodepool: Accept user-home in config validator  https://review.openstack.org/40451903:17
*** bhavik1 has joined #zuul03:58
adam_gmordred: ok, yah. makes sense. if its non-priority atm, maybe the hax in https://review.openstack.org/#/c/404466/ are ok for now? as it is i dont see post jobs workin05:07
*** saneax-_-|AFK is now known as saneax05:19
*** _ari_ has quit IRC05:42
*** _ari_ has joined #zuul05:44
*** abregman has joined #zuul05:59
openstackgerritTristan Cacqueray proposed openstack-infra/zuul: Add hostname to ZUUL_URL when running with append_hostname  https://review.openstack.org/40457406:42
openstackgerritJoshua Hesketh proposed openstack-infra/nodepool: Merge branch 'master' into feature/zuulv3  https://review.openstack.org/40053607:45
*** abregman is now known as abregman|mtg08:52
*** hashar has joined #zuul09:13
*** hashar has quit IRC09:36
*** hashar has joined #zuul09:57
*** abregman|mtg is now known as abregman09:59
openstackgerritJoshua Hesketh proposed openstack-infra/nodepool: Merge branch 'master' into feature/zuulv3  https://review.openstack.org/40053610:00
*** gundalow_ has joined #zuul10:18
openstackgerritJoshua Hesketh proposed openstack-infra/nodepool: Merge branch 'master' into feature/zuulv3  https://review.openstack.org/40053610:19
*** anteaya has quit IRC10:19
*** gundalow has quit IRC10:19
*** olaph has quit IRC10:19
*** hashar has quit IRC10:24
*** hashar has joined #zuul10:30
*** Cibo_ has quit IRC10:31
*** anteaya has joined #zuul10:31
*** olaph has joined #zuul10:36
*** Cibo_ has joined #zuul10:37
*** bhavik1 has quit IRC11:23
*** jlk has quit IRC11:31
*** jlk has joined #zuul11:31
*** rmoe has quit IRC11:31
*** jlk has quit IRC11:31
*** jlk has joined #zuul11:31
*** rmoe has joined #zuul11:31
*** willthames has quit IRC12:21
*** willthames has joined #zuul12:52
mordredjhesketh: woot! the merge patch seems to be passing13:18
jheskethmordred: yep, sorry I didn't get to it earlier in the day13:18
jheskeththe sooner we can merge it the better because the worlds keep shifting13:19
jheskethditto the zuul merge13:19
openstackgerritMerged openstack-infra/zuul: Re-model the job auth  https://review.openstack.org/39964513:20
mordredjhesketh: yup13:42
mordredjhesketh: just +A'd the zuul merge13:48
openstackgerritMerged openstack-infra/zuul: Merge branch 'master' into workingv3  https://review.openstack.org/38947013:52
mordred\o/13:53
*** abregman is now known as abregman|mtg14:02
openstackgerritMerged openstack-infra/zuul: Re-enable requirement-newer-than tests  https://review.openstack.org/40101914:12
openstackgerritMerged openstack-infra/zuul: Re-enable requirement-older-than tests  https://review.openstack.org/40102714:12
openstackgerritMerged openstack-infra/zuul: Re-enable requirement-username tests  https://review.openstack.org/40102814:12
openstackgerritMerged openstack-infra/zuul: Re-enable requirement-email tests  https://review.openstack.org/40102914:12
Shrewsanyone know how nb01 is looking this morning?14:17
pabelangero/14:33
pabelangerShrews: all images built14:33
pabelangerand almost all were uploaded14:34
pabelangerhttp://paste.openstack.org/show/590964/14:34
pabelangercurrent output14:34
pabelangergoing to review: https://review.openstack.org/#/c/400536/14:36
pabelangersee if we can land that first thing this morning14:36
mordredpabelanger: that's good news!14:41
Shrewspabelanger: those that are "uploading" are left over from the older version of the builder?14:42
pabelangermordred: indeed14:42
pabelangerShrews: yes, believe so14:42
pabelangerthe first 4 build attempts for all images had problems14:43
pabelangerDIB issues14:43
Shrewspabelanger: let's also try to land https://review.openstack.org/404438 today14:44
*** willthames has quit IRC14:47
Shrewsmaybe we should also try to clean up the ordering of that output to include provider and build and upload IDs14:48
pabelanger++14:51
openstackgerritMerged openstack-infra/nodepool: Make build/upload worker sleep time configurable  https://review.openstack.org/40443814:57
*** saneax is now known as saneax-_-|AFK14:58
pabelangerShrews: clarkb: jeblair: This is likely a good time to fix nodepool-builder having to be stopped / started for each nodepool.yaml change.15:31
pabelangerchecking to see if this is still the case15:31
Shrewspabelanger: it picks up changes15:32
pabelangerah neat15:32
pabelangerthanks, makes me happy again15:33
*** abregman|mtg is now known as abregman15:36
Shrewspabelanger: mind reviewing 400836?15:47
*** abregman has quit IRC15:53
*** abregman has joined #zuul15:55
pabelangerShrews: +2, so jeblair mordred can review16:06
openstackgerritMerged openstack-infra/nodepool: Re-enable alien-image-list command and tests  https://review.openstack.org/40083616:25
*** gundalow_ is now known as gundalow16:26
*** gundalow is now known as gundalow_16:37
*** gundalow_ is now known as gundalow16:38
openstackgerritJames E. Blair proposed openstack-infra/zuul: Correct logic problem with job trees  https://review.openstack.org/40045616:40
openstackgerritJames E. Blair proposed openstack-infra/zuul: Fix variants not picking up negative matches.  https://review.openstack.org/39987116:40
openstackgerritJames E. Blair proposed openstack-infra/zuul: Define the internal noop job  https://review.openstack.org/40486416:40
openstackgerritMerged openstack-infra/nodepool: Merge branch 'master' into feature/zuulv3  https://review.openstack.org/40053616:49
pabelanger\o/16:49
pabelangergrabbing a shower, then relocating to coffee shop for today16:50
pabelangernb01.o.o looks to be idle now16:50
*** abregman has quit IRC16:54
*** hashar has quit IRC17:13
openstackgerritAdam Gandelman proposed openstack-infra/zuul: Re-enable test_success_pattern as test_success_url  https://review.openstack.org/40045517:15
jeblairadam_g, pabelanger: i'm looking into the post test / merger items thing.  i think i have my head wrapped around it; should have something to show soon.17:17
pabelangergreat17:25
clarkbthe images that failed are likely real failures right?17:29
adam_gjeblair: oh, cool17:29
clarkbthat will give us a good idea of how unreliable this is over time(and maybe we feed that back to $clouds and glance)17:29
openstackgerritPaul Belanger proposed openstack-infra/nodepool: Add image_name to UploadWorker INFO message  https://review.openstack.org/40489117:35
pabelangerclarkb: yes, I can see some shade exceptions17:35
pabelangerI haven't looked into it more17:35
pabelangerI should restart nodepool-builder to pick up the latest commits17:36
mordred2016-11-30 07:02:46,498 ERROR shade.exc: Image creation failed: Error finding address for https://image.compute.bhs1.cloud.ovh.net/v2/images/a19664de-ee96-4c4e-b651-2f47594239d1/file: Unable to establish connection to https://image.compute.bhs1.cloud.ovh.net/v2/images/a19664de-ee96-4c4e-b651-2f47594239d1/file: The write operation timed out17:37
mordred:)17:37
pabelangerokay, restarted17:38
mordredthat seems to be the error17:38
pabelangerya17:39
openstackgerritPaul Belanger proposed openstack-infra/nodepool: Use diskimage-builder checksum files  https://review.openstack.org/40443217:41
pabelangermordred: clarkb: ^ would be great to get that landed today too17:41
pabelangerhttps://review.openstack.org/#/c/400311/ landed too, which will do checksum in parallel now17:42
pabelangerwill ask openstack-dib for a release17:42
mordredpabelanger: I think you're going to hit a pep8 barg on that ...17:43
pabelangerboo17:43
pabelangerlocal looks good17:43
mordredoh - ok. ignore me then :)17:43
mordredthe patch looks good17:43
pabelangerOh17:44
pabelangerI think I see17:44
pabelangermissing comma17:44
mordredpabelanger: https://review.openstack.org/#/c/404432/2/nodepool/provider_manager.py line 287/28817:44
openstackgerritPaul Belanger proposed openstack-infra/nodepool: Use diskimage-builder checksum files  https://review.openstack.org/40443217:44
mordredpabelanger: also, jeblair had a comment on ps2 that I agree with17:45
pabelangeryup17:45
pabelangerupdating now17:45
openstackgerritPaul Belanger proposed openstack-infra/nodepool: Use diskimage-builder checksum files  https://review.openstack.org/40443217:47
openstackgerritPaul Belanger proposed openstack-infra/nodepool: Use diskimage-builder checksum files  https://review.openstack.org/40443217:48
pabelangerokay, should be ready now17:48
openstackgerritJames E. Blair proposed openstack-infra/zuul: Don't merge post-merge items  https://review.openstack.org/40490318:06
jeblairadam_g, pabelanger: this is the source of my confusion: http://git.openstack.org/cgit/openstack-infra/zuul/tree/zuul/scheduler.py#n149118:07
jeblair(that's in master)18:07
jeblairwe never called _makeMergerItem (the master equivalent of the zuulv3 branch's make_merger_item) for post-merge changes, which is why we never saw those errors18:08
jeblairthe fact that it kinda-sorta halfway attempted to handle post-merge changes must be leftover from earlier usage18:08
jeblairi considered removing support for them completely from zuulv3 and sending an empty 'items' list to the launcher to say "don't merge anything when you run this job"18:09
jeblairbut i thought that might unduly restrict us later -- we may want the launcher to know about the item being run, and if we did that, we'd just end up adding the data back18:09
jeblairso i think going with the solution of making the item list robust as adam_g did, but then having the launcher-merger filter out items that don't actually need to be merged (since they are not pre-merge changes) is probably the cleaner approach18:10
jeblairso that's what i did in  https://review.openstack.org/404903 and adam_g's change is good as-is i think18:11
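A minimal sketch of the filtering idea jeblair describes: the scheduler sends every item, and the launcher-side merger skips those that are not pre-merge changes. The dict layout and the use of a missing refspec to mark a post-merge item are assumptions for illustration, not the code in https://review.openstack.org/404903.

```python
def items_to_merge(items):
    """Return only the items that still need a speculative merge.

    Assumption: a post-merge item has no 'refspec' because it
    refers to a ref that already exists in the repository.
    """
    return [item for item in items if item.get("refspec")]


items = [
    # Post-merge item: nothing to merge.
    {"project": "org/project", "ref": "refs/heads/master"},
    # Pre-merge change: needs merging before the job runs.
    {"project": "org/project", "ref": "refs/zuul/master/Z1",
     "branch": "master", "refspec": "refs/changes/01/1/1"},
]
print(items_to_merge(items))
```

Keeping the full item list and filtering late means the launcher still sees which item triggered the job, which is the flexibility jeblair wanted to preserve.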
openstackgerritMerged openstack-infra/zuul: Enable test_post*  https://review.openstack.org/40446618:16
mordredjeblair: I love when your changes have an order of magnitude more in the commit message than the patch itself https://review.openstack.org/#/c/400456/18:16
pabelangerjeblair: cool, thanks for the info18:18
jeblairmordred: i really hate code.  it's full of bugs.  we need less of it, not more!  :)18:20
mordredjeblair: ++18:20
mordredjeblair: code is the worst18:20
*** Shuo has joined #zuul18:22
rcarrillocruzfolks, i rebased the d-g changes on top of mordred nice output change18:23
mordredrcarrillocruz: woot18:23
rcarrillocruzmind reviewing the chain starting on https://review.openstack.org/401975 ?18:23
* mordred goes to review18:23
rcarrillocruzthey all got +118:23
rcarrillocruzthanks18:23
* rcarrillocruz goes afk a bit, picking up wife at work18:23
mordredrcarrillocruz: this doesn't look right: https://review.openstack.org/#/c/402208/9/playbooks/roles/start_fresh_logging/defaults/main.yaml,unified18:26
mordredrcarrillocruz: hrm - also - http://logs.openstack.org/75/401975/7/check/gate-tempest-dsvm-neutron-full-ubuntu-xenial/eb0291a/logs/devstack-gate-setup-host.txt.gz#_2016-11-30_12_05_37_912 isn't printing the name you added to that task - I think we need to fix the callback plugin18:29
openstackgerritMerged openstack-infra/nodepool: Use diskimage-builder checksum files  https://review.openstack.org/40443218:30
pabelangerSo, just thinking out loud here. How do we default our image build to start at 10:00 UTC now for nb01.o.o?19:07
clarkbpabelanger: aiui we don't19:08
clarkbpabelanger: its going to queue things up once they timeout past the last build time19:08
pabelangerright, which is good but our image build times are going to shift over time19:10
pabelangerespecially if a build fails for whatever reason19:10
pabelangerlets see what happens today19:11
timrcOkay, I think I found my calling: to be a truck driver - https://aws.amazon.com/snowmobile19:12
pabelangerhttp://paste.openstack.org/show/591003/19:12
pabelangerthat is our current age of images19:12
clarkbpabelanger: right that was all intentional19:12
*** saneax-_-|AFK is now known as saneax19:12
clarkbpabelanger: so in ~2hours fedora23 will start19:13
pabelangerclarkb: ya, just trying to see how that is going to come into play when we need to land a change to all images for example19:13
clarkbthen jessie after it and so on19:13
pabelangerya19:13
clarkbpabelanger: you would manually trigger the builds yourself in that case19:13
clarkbwhich is still supported aiui19:13
clarkb(I read code that seemed to support it at least)19:13
pabelangerright19:13
pabelangerokay, I'm happy to wait and see what happens today19:14
pabelangeralso excited to get nb02.o.o online too when we are ready :)19:14
jeblairclarkb, pabelanger: yes.  :)  we decided to not bother trying to tie things to a schedule because that hasn't really worked for us for a long time anyway (it takes a long time to build and upload the images, so the idea of "images come on line around 10 o'clock" just doesn't work past one or two providers and images)19:22
jeblairclarkb, pabelanger: manual build triggers, or if you're feeling daring, deleting one of the old images will serve to speed up the process of getting something in place.  if the issue is correcting an erroneous build, deleting the most recent build is actually the right answer there anyway.19:23
pabelangerYup, happy to adapt.19:24
pabelangerthink I'm just going to wait until the normal schedule builds happen19:25
pabelangermake sure it works as expected19:25
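A rough sketch of the age-based behaviour clarkb and jeblair describe, where a rebuild is queued once the newest build of an image passes an age limit instead of at a fixed time of day. The 24-hour limit and the function and variable names are assumptions, not nodepool-builder's actual settings or code.

```python
from datetime import datetime, timedelta


def images_due(last_built, now, max_age=timedelta(hours=24)):
    """Return image names whose newest build is at least max_age old,
    oldest first. The 24-hour max_age is an assumed value."""
    due = [(built, name) for name, built in last_built.items()
           if now - built >= max_age]
    return [name for _, name in sorted(due)]


now = datetime(2016, 11, 30, 21, 0)
last_built = {
    "fedora-23": datetime(2016, 11, 29, 20, 30),
    "debian-jessie": datetime(2016, 11, 29, 22, 0),
}
print(images_due(last_built, now))
```

Because the next start depends on when the previous build finished, a failed or slow build pushes that image's schedule later, which is the drift pabelanger mentions.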
*** SpamapS has quit IRC19:26
rcarrillocruzmordred: hey, thx, that should default to /opt/stack, def a bug19:39
rcarrillocruzas for the callback, yeah, noticed it19:39
rcarrillocruzi'll take a stab at it19:39
*** SpamapS has joined #zuul19:42
*** jeblair is now known as inserted_human19:45
*** inserted_human is now known as jeblair19:45
*** hashar has joined #zuul19:55
*** jesusaur has joined #zuul20:21
*** saneax is now known as saneax-_-|AFK20:30
pabelangernext build should start in 20mins20:59
pabelangerusing latest version of code too20:59
mordred\o/21:01
openstackgerritJames E. Blair proposed openstack-infra/zuul: Correct logic problem with job trees  https://review.openstack.org/40045621:03
openstackgerritJames E. Blair proposed openstack-infra/zuul: Fix variants not picking up negative matches.  https://review.openstack.org/39987121:03
adam_gjeblair: i think that all makes sense (the make_merge_items comments), thanks for clarifying21:04
jeblairwhew!21:04
*** hashar has quit IRC21:09
pabelangerand fedora-23 build started21:25
pabelangerbut, first build today failed21:25
pabelangerchecking21:25
pabelanger2016-11-30 21:22:11,451 INFO nodepool.image.build.fedora-23: Error: Failed to synchronize cache for repo 'updates'21:27
pabelangerlooks like a repo failure21:27
pabelangernext build looks better21:27
openstackgerritPaul Belanger proposed openstack-infra/zuul: Revert "Revert "Store ssh_host_key of remote node""  https://review.openstack.org/36067421:46
openstackgerritPaul Belanger proposed openstack-infra/nodepool: Make diskimage-builder command configurable for testing  https://review.openstack.org/40497621:53
pabelangerwant to start writing some tests for when our dib-command fails, figured ^ will give me an easy way to start using /usr/bin/false as the command21:54
greghayneswe had a deal for injecting failure where we'd just set DIB_SHOULD_FAIL in the config21:55
pabelangerya, that too21:56
pabelangershould check it again21:56
pabelangerSHOULD_FAIL as image meta field21:59
pabelangerah, seen an issue21:59
pabelangerif images: [] in providers, we cannot use it22:00
pabelangerlet me play with it today22:00
pabelangerbut now, I relocate home for food22:00
jeblairpabelanger: we should be able to use the DIB_SHOULD_FAIL thing anywhere we need to.  and while the configurable dib command is a way to configure things for testing, we actually get more flexibility with DIB_SHOULD_FAIL which lets us specify that some images should fail and others should not.22:03
jeblairi actually kind of like the configurable dib command because it lets us remove the fake check from the actual production code -- but i just want to make sure we don't lose sight of setting DIB_SHOULD_FAIL as the way to actually trigger failures.22:04
greghaynesWFM, just making sure we didnt forget ;)22:41
clarkbjeblair: not only that but the configurable dib command doesn't actually solve the intended issue22:45
clarkb(it feels dirty to expose it to users as being for that)22:45
clarkbbut maybe that's a documentation thing "if dib is installed to a virtualenv it is not sufficient to run the disk-image-create binary from that venv, you must also source the env as well"22:46
clarkbor something22:46
jeblairright, well, *today* we're talking about it in the context of tests, so i'm just focusing on that.  :)22:46
pabelangerjeblair: Ya, I'll look into DIB_SHOULD_FAIL too23:02
pabelangerI liked swapping out the dib_cmd to /usr/bin/false, as an easy way to reproduce when disk-image-create fails for an unknown reason23:03
jeblairpabelanger: sure, but that's exactly what DIB_SHOULD_FAIL does -- should be even easier :)23:07
pabelangerjeblair: SHOULD_FAIL? I only see that23:10
jeblairyeah i'm just repeating what greghaynes said :)  whatever looks vaguely like that is the thing23:10
pabelangerack23:11
pabelangerI'll give that a go23:11
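The two failure-injection approaches discussed above can be compared in a small sketch. Everything here is hypothetical: build_image, its should_fail parameter, and the stand-in commands are illustrations, and the log itself leaves open whether the real setting is named DIB_SHOULD_FAIL or SHOULD_FAIL.

```python
import subprocess
import sys

# Stand-ins for a working builder and for /usr/bin/false.
OK_CMD = [sys.executable, "-c", "raise SystemExit(0)"]
FAIL_CMD = [sys.executable, "-c", "raise SystemExit(1)"]


def build_image(name, should_fail=False, dib_cmd=None):
    """Return True if the (fake) build of image `name` succeeds."""
    if should_fail:
        # Per-image flag: fail this image without running anything.
        return False
    cmd = list(dib_cmd or OK_CMD)
    return subprocess.run(cmd).returncode == 0


# Per-image flag: one image fails while another succeeds.
print(build_image("fedora-23", should_fail=True))
print(build_image("debian-jessie"))
# Swapped command: every image built with it fails.
print(build_image("fedora-23", dib_cmd=FAIL_CMD))
```

The per-image flag gives the finer control jeblair points out, while the swapped command keeps the fake check out of the build function entirely.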
*** willthames has joined #zuul23:15
pabelangerjeblair: so, if we set images: [] in providers section, we lose the ability to determine the image-type to pass to disk-image-create. Was thinking if img_types was empty, maybe just default it to qcow2 (this is the default if we remove -t from disk-image-create)23:23
pabelangerin reference to: https://etherpad.openstack.org/p/um87VzudKq23:24
jeblairpabelanger: hrm, maybe we should modify the etherpad then and say that in that case we don't build an image (but we also don't delete it)23:25
pabelangerjeblair: if we do that, I see no way to only build an image23:29
pabelangeractually23:29
pabelangerI guess we'd add a provider but have it be paused23:29
pabelangerI'll look at it again in the morning23:29
jeblairpabelanger: yeah, i don't think it's that important.23:30
jeblairthe idea of building without uploading sounds nice, but if we don't know what formats to build, then maybe we just shouldn't do it.23:30
pabelangerYa, don't have a real world example of build only today23:34
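The two options weighed above for an empty provider image list can be sketched as follows. The function name and the fallback switch are hypothetical; qcow2 as the fallback comes from pabelanger's remark that it is what disk-image-create uses when -t is omitted.

```python
def formats_to_build(provider_formats, fallback_to_qcow2=False):
    """Decide which image formats to build.

    provider_formats: formats requested by the providers that
    use this image (may be empty).
    """
    formats = sorted(set(provider_formats))
    if not formats and fallback_to_qcow2:
        # pabelanger's proposal: build anyway, defaulting to qcow2.
        return ["qcow2"]
    # jeblair's suggestion: an empty list means "do not build".
    return formats


print(formats_to_build(["raw", "qcow2", "raw"]))
print(formats_to_build([]))
print(formats_to_build([], fallback_to_qcow2=True))
```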
