Thursday, 2013-11-21

morganfainberg ^ IANA assigns ports up to 49151 00:00
*** ^d has quit IRC00:00
jog0lifeless jeblair: good thing we have some TC folks here to make sure this type of thing gets fixed00:00
jog0sdague: this should be a req for incubation: port number stuff00:00
lifelesslol00:00
lifelessIANA concludes that this request and the modification request (ticket00:00
lifeless#585246) will be administratively resolved without prejudice.00:00
lifelessso linux ephemeral range needs fixing00:01
lifelesswe can fix that in tripleo00:01
lifelessand we should file bugs on Ubuntu RedHat and Suse00:01
*** ftcjeff has quit IRC00:01
sdaguejog0: swift has the same issue, it uses X ports00:01
morganfainbergit looks like keystone (openstack-id) is the only openstack project with a registered port.00:01
*** nati_ueno has quit IRC00:01
morganfainbergas far as i see.00:02
clarkbmorganfainberg: ya I can't find others00:02
*** yamahata_ has joined #openstack-infra00:02
morganfainberg somewhat interesting (but not super relevant)00:02
harlowjamordred yt00:02
harlowjaa question about pbr if u are around00:02
notmynamesdague: is that a thing that's causing issues right now?00:02
sdaguenotmyname: I don't think so00:02
*** CaptTofu has quit IRC00:03
sdagueit was just in reference to jog0's comment above about making sure projects had port number registrations00:03
clarkbit is a thing causing problems with keystone00:03
clarkbbut before I go fixing keystone I need to check the other projects00:03
jog0notmyname: just a tiny issue00:03
*** CaptTofu has joined #openstack-infra00:03
jog0very tiny00:03
morganfainbergclarkb, if you set the floor to 49152 you wont hit any other registered ports00:04
morganfainbergand i am _fairly_ certain no project uses over 49151 as it's listener00:04
mikal jog0: yeah, that revert still gives us a 20%ish fail rate for 1251920 00:04
mikal jog0: so I don't think it's the answer 00:04
jeblairso, raise the ephemeral port floor, or just configure keystone to use port 5000 like it used to?00:05
jog0 mikal: I am taking the nuclear option https://review.openstack.org/#/c/57566/1 00:06
morganfainberg jeblair, we use(d) 5000 and 35357 00:06
mikaljog0: oh good, that was my next step00:06
morganfainbergso i think raising the floor is the correct answer.00:06
*** jhesketh_ has quit IRC00:06
mikaljog0: cause like totally out of ideas00:06
jeblairmorganfainberg: oh one is admin, the other user?00:06
morganfainbergthat was how it was in grizzly00:07
mikaljog0: send four more reviews of that... (ie remove the change id and reupload four more times). That way we can have five running in parallel and get more testing done.00:07
morganfainbergin havana they are ... the same thing really.00:07
morganfainberg(v3)00:07
jeblairmorganfainberg: is only 35357 used in havana?00:07
morganfainbergjeblair, i think we still use both00:07
jeblairmorganfainberg: oh, but they provide the same service now00:07
morganfainbergyeah00:07
morganfainbergcompatibility00:08
morganfainberghopefully we can migrate to just one port.00:08
morganfainbergideally that would be our IANA number00:08
morganfainbergbut i can't speak 100% to that plan.00:08
jeblairmorganfainberg: gotcha00:09
*** derekh has quit IRC00:09
morganfainbergjeblair (and lurking in this channel pays off again! woo)00:09
jog0mikal: excellent idea00:09
fungiclarkb: ephemeral port collision... great eye!00:10
clarkb jeblair: I am going to shift the ephemeral port floor to 49152 00:10
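
A minimal sketch of the port arithmetic discussed above, assuming the Linux default ephemeral floor is 32768 (the figure implied later in the log by "use a port < 32768") and that the IANA dynamic range starts at 49152. The helper name and the 65535 ceiling are illustrative assumptions, not part of devstack or keystone.

```python
# Sketch only: check whether a service's listen port can collide with
# the kernel's ephemeral (local) port range.
LINUX_DEFAULT_FLOOR = 32768   # assumed default floor of ip_local_port_range
IANA_DYNAMIC_FLOOR = 49152    # IANA dynamic/private ports start here
PORT_CEILING = 65535          # simplification: highest valid TCP port


def in_ephemeral_range(port, floor, ceiling=PORT_CEILING):
    """Return True if `port` could be handed out as an ephemeral port."""
    return floor <= port <= ceiling


if __name__ == "__main__":
    for port in (5000, 35357):
        before = in_ephemeral_range(port, LINUX_DEFAULT_FLOOR)
        after = in_ephemeral_range(port, IANA_DYNAMIC_FLOOR)
        print(port, "collides with default floor:", before,
              "| collides after raising floor:", after)
```

With these assumptions, 35357 sits inside the default range but outside it once the floor is raised to 49152, while 5000 is outside both.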
dolphmmorganfainberg: jeblair: as of the j-release, we'll be able to provide a default configuration that uses only one port (pending deprecating and removal of the v2 api, which behaves differently on each port)00:10
morganfainbergthere we go, the official word.00:11
*** dolphm is now known as dolphm_afk00:12
jog0mikal: there00:12
jog05 copies running00:12
jog0 mikal: https://review.openstack.org/#/q/status:open+project:openstack/nova+branch:master+topic:revert,n,z 00:13
mordredharlowja: what's up?00:13
*** amotoki is now known as __amotoki__00:14
*** CaptTofu has quit IRC00:14
harlowjamordred just was wondering if there is anyway for pbr to have 2 different set of requirements (for example eventlet in py2.6/py2.7 and no eventlet in 3.3) or if there is another recommended practice00:14
*** CaptTofu has joined #openstack-infra00:15
*** senk has joined #openstack-infra00:15
harlowjain taskflow i made the eventlet usage optional, but its still a valid requirement in 2.6/2.700:15
harlowjabut since there is only 1 requirements.txt file, that requirements.txt file has to be the one that works with 3.3 (and can't include eventlet)00:16
*** fbo is now known as fbo_away00:16
*** Ryan_Lane has quit IRC00:17
mordredharlowja: well....00:18
mordredharlowja: there is a small and a large answer00:18
mordredthe answer to both is no - but there are different details00:18
notmynameharlowja: you should ask mordred to add a decent python dependency resolver to pbr. :-)00:21
*** yamahata_ has quit IRC00:22
* mordred throws a goat at notmyname00:22
*** yamahata_ has joined #openstack-infra00:22
notmynameharlowja: FWIW, I like your ideas and wish to subscribe to your newsletter. I have "issues" and other "feature requests" for managing the requirements file00:23
harlowjanewsletter, haha00:23
*** MarkAtwood has quit IRC00:24
harlowjamaybe a special issue for u notmyname00:24
fungishort answer yes with an if, long answer no with a but00:24
harlowjamordred is there any plans on that, especially as we get more and more 3.3 and 2.6/2.7 compat, it becomes a little weird00:24
mordredharlowja: yeah, I have some plans00:24
harlowjanp00:25
mordredthat are involved with metadata 2.000:25
harlowjacan i subscribe to your newsletter mordred which i can then forward to notmyname as my newsletter (i'll change the newsletter photo), ha00:25
harlowjareplace with a pict of me, ha00:25
harlowjalol00:25
notmynameI just saw a patch land in swift. did it sneak through or does that mean stuff is starting to be readded to the gate queue?00:26
clarkbnotmyname: a few things have snuck onto the gate queue. There were also a couple swift changes that fixed bugs that we put in the gate queue00:26
*** salv-orlando has quit IRC00:26
jog0clarkb: we have another patch to add to the VIP list00:26
notmynameharlowja: I already get mordred's newsletter. it comes out quarterly, but it normally has booze with it00:26
jog0 https://review.openstack.org/#/c/57572/ 00:26
mordredharlowja, notmyname: part of them involve markerlib00:27
clarkbjog0: sounds good to me. now we just need someone to approve that change :)00:27
clarkbjeblair: quick question. for shifting the local port range. I wanted to do that in devstack, but would have to do that as one of the first things devstack does for it to have any hope of being useful. Is that ok with you?00:28
mordred harlowja, notmyname: http://www.python.org/dev/peps/pep-0345/#version-specifiers 00:28
harlowja interesting, didn't know about markerlib 00:28
clarkbjeblair: I also think I will create a puppet change that shifts the range on all of our slaves using /etc/sysctl.d/60-keystone-port-shift00:28
harlowjaneat, thx mordred00:28
mordred or actually, http://www.python.org/dev/peps/pep-0345/#environment-markers 00:28
mordredharlowja: ^^00:28
mordredthe idea00:28
mordred Requires-Dist: bar; python_version == '2.4' or python_version == '2.5' 00:29
mordredis that ^^00:29
fungijeblair: clarkb: heard back from rackspace... their environment has inherent limitations of 16 xen virtual block devices per domu, so the two built-in plus 14 cinder volumes is the maximum we can expect from them00:29
harlowjainteresting00:29
mordredso, instead of requirements.txt itself, we'll have a list of things potentially in setup.cfg itself00:29
mordredwhich can express using the above format00:29
mordred(or both)00:29
harlowjagotcha00:29
harlowjamakes sense00:29
mordred but requirements.txt is not really intended to have the flexibility that the above stuff does as part of metadata 2.0 00:30
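
A rough sketch of the idea behind environment markers as described above: one requirement list, with some entries applying only to certain Python versions. This is not pbr's API or the PEP 345 marker parser; `select_requirements`, `COMMON` and `CONDITIONAL` are hypothetical names, and "six" is a placeholder package chosen for illustration.

```python
import sys

# Hypothetical requirement data; package names are placeholders.
COMMON = ["six"]
CONDITIONAL = [
    # (requirement, predicate over a (major, minor) version tuple)
    ("eventlet", lambda version: version < (3, 0)),
]


def select_requirements(version=None):
    """Return the requirements that apply to the given Python version."""
    if version is None:
        version = tuple(sys.version_info[:2])
    selected = list(COMMON)
    selected += [name for name, applies in CONDITIONAL if applies(version)]
    return selected


if __name__ == "__main__":
    print("py2.7:", select_requirements((2, 7)))
    print("py3.3:", select_requirements((3, 3)))
```

Under these assumptions eventlet is selected for 2.6/2.7 and omitted for 3.3, which is the split harlowja asks about for taskflow.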
harlowjareminds me of rpm :-/00:30
mordredrealistically, I will not get to this until Jan00:30
mordredso in the mean time, you can do a local in-tree pbr plugin00:30
harlowjaits ok, just was wondering the thoughts around it00:30
jog0 clarkb: https://etherpad.openstack.org/p/critical-patches-gatecrash-November-2013 00:31
jog0jeblair fungi: ^00:31
jog0to keep track of things00:31
fungi jog0: thanks--tracking 00:31
jog0I may be missing some00:31
mordredharlowja: if you look at neutron/hooks.py00:31
harlowjacool00:31
mordredharlowja: in the neutron tree00:31
mordredthere is an example00:31
*** boris-42 has joined #openstack-infra00:32
mordredharlowja: you register it in setup.cfg (so look at that in setup.cfg in neutron too)00:32
*** atiwari has quit IRC00:32
fungijog0: those were the three i was aware of plus the two latecomers00:32
*** matsuhashi has joined #openstack-infra00:32
harlowjanice, i'll check it out mordred00:32
jog0fungi: cool00:32
jog0anteaya: any from neutron we should track?00:32
*** dolphm_afk is now known as dolphm_for_real_00:35
*** dolphm_for_real_ is now known as dolphm_really_af00:35
*** dolphm_really_af is now known as dolphm_reallyafk00:36
openstackgerrit Tim Daly, Jr. proposed a change to openstack-infra/config: Disable python33 check for tomograph for now  https://review.openstack.org/57573 00:36
*** julim has joined #openstack-infra00:37
pabelangerAny other reviewers or feedback for import zuul packaging into -infra?00:37
pabelanger https://review.openstack.org/#/c/56107/ 00:37
*** dcramer_ has joined #openstack-infra00:38
mordredpabelanger: I think the team has been heads-down on the gate today00:39
mordredpabelanger: however, I spoke with both zul and zigo_ today and both are interested in having larger discussions with you on here00:39
pabelangermordred, no problems. No rush on this00:39
pabelangermordred, Ya, I'm tracking along a debian thread now about it.00:39
mordredossum00:40
*** arata has joined #openstack-infra00:42
anteayajog0: 57290 is still looking to merge00:42
zulpablenager: although i dont know how much i can help depending on my work load00:42
jog0anteaya: thanks, on to the list it goes00:43
*** CaptTofu has quit IRC00:43
anteayathanks00:44
openstackgerrit Clark Boylan proposed a change to openstack-infra/config: Shift local port range to avoid IANA conflict.  https://review.openstack.org/57575 00:44
*** sdake_ has quit IRC00:44
*** CaptTofu has joined #openstack-infra00:44
anteayagoing to be pushing the other bugs mentioned in your gate blocking bugs email, again tomorrow00:44
clarkbactually I need to reload the service when that file changes...00:44
anteayahave more momentum, so thanks for that00:44
pabelangerzul: I think if we get a plan together and people are all on board, I'll be able to commit a fair bit of time to it. I have some code already working locally, but likely need to get a design scope going that everybody is happy with00:44
clarkball of the d-g images are updated now00:44
*** pcrews has quit IRC00:44
zulpabelanger:  cool lemme know00:45
fungiclarkb: reload what service?00:45
fungiclarkb: reread sysctl.conf?00:45
fungii guess distros do treat that action as a service reload via initscripts00:45
clarkbfungi: yeah reread the sysctl.conf and sysctl.d files00:46
*** julim has quit IRC00:46
fungiit dawned on me that's what you meant. my brain just doesn't connect that with the word "service" for some reason. ignore me00:47
clarkbfungi: ubuntu has a procps "service" but centos doesn't appear to have one00:47
clarkbI may just use sysctl -p directly00:47
*** cody-somerville has quit IRC00:48
anteaya when I say pushing I mean haranguing -neutron devs 00:48
anteayaall things considered this week is going better than last week00:48
jog0 anteaya: heh, I just hope we get everything in today (https://etherpad.openstack.org/p/critical-patches-gatecrash-November-2013) 00:49
openstackgerrit Clark Boylan proposed a change to openstack-infra/config: Shift local port range to avoid IANA conflict.  https://review.openstack.org/57575 00:50
clarkbthat should be better00:50
anteayawell we got no new movement on 1249065, and 1251448 so I will be picking that mission up again tomorrow00:50
anteayaalready posted that I would be talking about it again tomorrow as a heads up in -neutron tonight00:51
jog0anteaya: thanks00:51
anteayathank you00:51
anteayahad a dream about you last night00:51
anteayayou were on a bike and I was on a bike behind you00:52
fungiclarkb: puppet lint will still hate on you there00:52
anteayapedalled as fast as I could, couldn't keep up00:52
jog0haha00:52
anteayathen you took this wicked complex trail and I couldn't follow you00:52
anteayathen you disappeared in a pool00:52
jeblairclarkb: why shift it in puppet?00:53
anteayaI kept getting people to look for you, I was upset you had drowned00:53
anteayain the pool - nodepool00:53
anteayaah dreams00:53
clarkbjeblair: because something may grab that port before devstack runs00:53
clarkb jeblair: I am going to update devstack too, but devstack itself may not be sufficient 00:53
clarkbfungi: :/00:53
jeblairclarkb: if not, then i really think we should look into changing the port in the devstack configuration00:54
clarkbjeblair: ugh00:54
clarkbjeblair: that makes sense sort of. We really should be testing as we expect keystone to be deployed00:54
clarkb(granted changing the port is a minor thing)00:54
jeblairclarkb: we should have a simple config that will work with "./stack.sh" on our systems as well as a random dev00:55
*** CaptTofu has quit IRC00:55
*** CaptTofu has joined #openstack-infra00:55
clarkbjeblair: should we change the default in devstack itself or have d-g move it for us?00:55
fungii suppose asking iana for a do-over which doesn't conflict with linux's incorrectly-chosen ephemeral ports range isn't going to fly00:56
clarkb it seems silly to have an IANA assignment and not use it 00:56
jeblairclarkb: devstack itself i think.00:56
jeblairclarkb: i agree, but it's also silly to have a config that may not work by default...00:56
*** nati_ueno has joined #openstack-infra00:56
jeblairclarkb: this is a bad situation with no good answers i fear00:56
clarkbya00:56
*** nati_ueno has quit IRC00:57
fungiso a section in devstack wrapped in a comment which says "really ugly workaround" and then goes in and adjusts your kernel00:57
clarkbfungi: the problem with that is you may already have a thing running on your box using the port00:58
fungiyep00:58
clarkb so jeblair's suggestion is use a port < 32768 00:58
clarkbwhich is an easy change in devstack actually00:58
mikaljog0: bad news00:58
fungitrue. much easier than playing with kernel knobs00:58
mikaljog0: that big revert you did still has the console log problem00:58
mikal jog0: http://logs.openstack.org/66/57566/1/check/check-tempest-devstack-vm-full/f61db56/console.html 00:58
clarkblet me whip that up and we can argue over it in gerrit :)00:58
jeblairclarkb: ++00:59
jog0mikal: WAT :(00:59
jog0everything else worked which is pretty amazing00:59
*** nati_ueno has joined #openstack-infra01:00
*** nati_ueno has quit IRC01:00
jog0mikal: so that means it wasn't a nova patch that did it01:00
fungiin better news, our swift friend 57019 is 8 minutes and a neutron py26 unit test run away from merging01:00
jog0at least not on its own01:00
jog0fungi: :)01:00
jog0mikal: I really thought this was going to work :/01:00
*** ericw has joined #openstack-infra01:01
jog0so if its not nova, and we don't *think* its the new tempest01:01
jog0what are the other candidates?01:01
jog0we could revert back another whole week01:01
jog0just to be extra safe01:01
jog0but don't see what that could possibly do01:01
fungi jog0: mikal: did new libvirt get ruled out for solid reasons, or just on the merit of its changelog? 01:01
jog0fungi: it got ruled out based on timing01:01
openstackgerrit Khai Do proposed a change to openstack-infra/config: Setup a private gerrit instance for security reviews  https://review.openstack.org/47937 01:01
fungiahh, okay01:02
*** matsuhashi has quit IRC01:02
jog0not a strong rule out just a, how could this timing line up01:02
fungithat's pretty strong, all things considered01:02
clarkbjog0: could be the new package01:02
*** matsuhashi has joined #openstack-infra01:02
clarkbthough that seems very far fetched01:02
*** cody-somerville has joined #openstack-infra01:02
jog0clarkb: I did a diff of apt-get01:02
jog0and only change was libvirt01:02
clarkbya the new libvirt package, sorry I stopped typing too soon01:03
fungiclarkb: you ruled out anything having to do with timing being coincidental to when we started putting wheels in pypi.o.o right?01:03
fungii know you mentioned it as maybe a thing at one point01:03
clarkbfungi: yes, we are not consuming the wheels at all01:04
clarkbthey are just on the mirror for when we want to consume them01:04
fungi okay, awesome 01:04
*** nati_ueno has joined #openstack-infra01:05
jog0mikal: I will try reverting this test01:05
jog0 https://review.openstack.org/#/c/54363/ 01:05
jog0 touches the same tempest file and matches up timing-wise 01:05
jog0and yes I am just guessing now01:06
jog0and will abandon my revert the last week of nova patches01:06
*** nati_ueno has quit IRC01:06
*** nati_ueno has joined #openstack-infra01:06
*** jhesketh has joined #openstack-infra01:06
*** matsuhashi has quit IRC01:07
anteaya jog0: this patch of dims passed check and I think we should have logging were we don't have it: https://review.openstack.org/#/c/56316/ 01:07
anteaya*where01:07
jog0anteaya: makes sense to me01:07
jog0if we think it will help01:07
*** senk has quit IRC01:08
jog0 mikal: latest attempt https://review.openstack.org/#/c/57578/ 01:09
clarkb jeblair: https://review.openstack.org/#/c/57577/ 01:09
fungi57373 isn't going to make it this time around01:10
sdague jog0: so... my proposed fix has some other issue 01:10
jog0sdague: :(01:10
sdaguelet me be more clever01:10
*** reed has quit IRC01:10
*** zul has quit IRC01:11
jog0 mikal: so https://review.openstack.org/#/q/status:open+project:openstack/tempest+branch:master+topic:57578,n,z  https://review.openstack.org/#/c/57578/ 01:11
anteayajog0: I don't see more logging hurting anything01:11
*** gyee has quit IRC01:12
anteayayay 57290 made it01:13
clarkbwoot01:13
*** thomasem has quit IRC01:13
*** dcramer_ has quit IRC01:13
anteayathat is all that -neutron has to offer this round01:13
clarkb jog0: https://review.openstack.org/#/c/57373/ 01:14
clarkbjog0: that change is going to fail the gate01:14
*** matsuhashi has joined #openstack-infra01:15
jog0clarkb: sigh01:15
fungimost of them came through in that pass, marked in the etherpad as merged now01:15
jog0another kick01:15
jog0fungi: WOOT!01:15
clarkbooh 57509 got in though01:16
jog0wow we got most of em01:16
anteayafungi you are fast on the etherpad01:16
jog0very01:16
* fungi wins at etherpad01:16
clarkbso we are getting there01:16
clarkb1251920 still hates us, but removing the other noise should help a lot I bet01:16
jog0clarkb: yeah01:16
anteayawhat is the call?01:16
jog0 hoping this may work https://review.openstack.org/#/q/status:open+project:openstack/tempest+branch:master+topic:57578,n,z 01:16
anteayarequeue or still on lockdown?01:17
jog0wait01:17
anteayawaiting01:17
jog0 920 is a hard one 01:17
clarkbyeah we need to sort 920 if at all possible01:17
*** boris-42 has quit IRC01:20
mikaljog0: it occurs to me that there are two other things we can try01:21
mikaljog0: running modern code with the Havana tempest (I know that's sort of what you're doing)01:21
mikaljog0: and reverting any deltas in the requirements files since Havana01:21
mikaljog0: given that Havana seems to just work for us01:22
jog0mikal: ohh requirements didn't really look at that too much01:23
jog0although did a pip freeze diff01:23
jog0 mikal: anyway at this point any idea you have, do it! 01:23
mikaljog0: dude, that's been my strategy for days... :P01:23
jog0mikal: :)01:24
*** DennyZhang has joined #openstack-infra01:24
jog0so clarkb did stable code vs trunk tempest01:24
jog0we can try modern nova vs havana tempest01:24
mikalI think that's worth a try01:24
jog0do a squash of all changes since havana and propose to stable01:24
jog0easy enough01:24
jog0and can't hurt01:24
mikalI don't see anything obvious in requirements01:24
mikal New netaddr and Babel basically 01:25
jog0I do like the modern code on havana01:26
*** dcramer_ has joined #openstack-infra01:26
clarkbjeblair: heh devstack really didn't like that port change01:27
clarkb doesn't look like we change the port in the config 01:30
*** nosnos has joined #openstack-infra01:30
*** xchu has joined #openstack-infra01:31
*** yaguang has joined #openstack-infra01:31
lifelessrighto, how is the gatelooking?01:31
clarkblifeless: it is a bit better. we got ~4-5 changes in that fix a variety of problems but the console problem (bug 1251920) is still outstanding01:32
uvirtbot Launchpad bug 1251920 in nova "Tempest failures due to failure to return console logs from an instance" [Critical,Fix committed] https://launchpad.net/bugs/1251920 01:32
sdaguejog0: so I'm lacking a reasonable test env right now, but I know the conceptual fix for this if you are able to code it up?01:33
jog0sdague: sorry I am about to go AWOL for a bit01:33
jog0but lifeless just popped back in01:33
sdagueno worries01:34
sdague so the conceptual fix - https://github.com/openstack-dev/devstack/blob/master/lib/tempest#L145 needs to conditionally create those flavors if they don't exist 01:34
sdaguewe are failing creating them the 2nd time01:34
clarkbjeblair: new patchset on the devstack change should fix the problem (it wasn't being iniset)01:34
lifelessbecause they already exit?01:34
lifelessexist?01:34
sdaguelifeless: yes01:35
clarkbsdague: this is for the grenade failures?01:35
sdaguenova-flavor create fails on second attempt01:35
sdagueclarkb: yes, after that's done, my other grenade patch will take us through the right part of that conditional01:35
lifelessso a flavor-show check should be sufficient01:36
sdaguelifeless: yes01:36
sdagueI just don't trust myself with the syntax without functioning test env :)01:36
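
A minimal sketch of the "create only if it doesn't already exist" fix described above. The real change would live in devstack's shell code; this Python version uses a hypothetical in-memory `FakeFlavorAPI` that, like the behaviour sdague describes, fails when the same flavor is created a second time. None of these names come from novaclient.

```python
class FlavorExists(Exception):
    """Raised by the fake API when a flavor name is already taken."""


class FakeFlavorAPI:
    """Hypothetical stand-in for a flavor service; not the nova API."""

    def __init__(self):
        self._flavors = {}

    def show(self, name):
        # Returns the flavor dict, or None when it does not exist.
        return self._flavors.get(name)

    def create(self, name, ram_mb):
        if name in self._flavors:
            raise FlavorExists(name)
        self._flavors[name] = {"name": name, "ram_mb": ram_mb}
        return self._flavors[name]


def ensure_flavor(api, name, ram_mb):
    """Create the flavor only when a lookup shows it is missing."""
    existing = api.show(name)
    if existing is not None:
        return existing
    return api.create(name, ram_mb)


if __name__ == "__main__":
    api = FakeFlavorAPI()
    ensure_flavor(api, "m1.nano", 64)
    ensure_flavor(api, "m1.nano", 64)  # second run is a no-op, no error
    print(api.show("m1.nano"))
```

The lookup-then-create shape is what makes a second devstack run (as in the grenade case) safe; the flavor name and RAM size here are placeholders.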
*** arata has left #openstack-infra01:37
*** sarob has joined #openstack-infra01:37
*** mriedem has joined #openstack-infra01:37
sdaguethere is a better refactoring here as well, but honestly, I won't have the concentration for it until I'm back home01:38
clarkblifeless: were you going to whip up a patch?01:38
clarkbI don't have a proper devstack/tempest testbed but have no problems throwing something at jenkins >_>01:39
jeblairclarkb: why does 57577,2 reference KEYSTONE_SERVICE_PORT in the else?01:41
clarkbjeblair: so that you can override that value too01:42
clarkbwithout it only the auth port can be overridden in the config01:43
jeblairclarkb: so it's a minor nice enhancement but not strictly needed for the patch?01:43
lifelessclarkb: you should01:44
lifelessclarkb: I'm being torn hither and yon01:44
lifelessclarkb: and I too would be tossing at jenkins and waiting01:44
clarkbjeblair: correct01:44
clarkb we really don't make it easy to parse the output of flavor-list 01:45
*** matsuhashi has quit IRC01:45
*** matsuhashi has joined #openstack-infra01:46
sdaguejenkins tossing is fine01:46
lifelesssdague: so how come it is a sporadic fail?01:47
jog0I'm out for the day01:48
jog0thanks everyone01:48
jog0we made decent progress considering01:48
sdaguebecause we're using default guest sizes that are too large01:48
sdagueso if you get too many of them up at the same time, you run out of memory01:48
*** jerryz has quit IRC01:48
sdagueall depending on which come and go in which order01:49
anteayathanks jog001:49
anteayabeware the pools01:49
*** pcrews has joined #openstack-infra01:49
*** wenlock has quit IRC01:50
*** ryanpetrello has quit IRC01:52
clarkb sdague: lifeless: something like https://review.openstack.org/57584 01:52
* anteaya cheers for 57584 01:53
sdagueclarkb: yep01:57
sdagueif it passes tests, I'll rush it through01:58
*** sarob has quit IRC01:58
jeblairi double checked some of the shell logic against rackspace output01:58
clarkbjeblair: ya I was doing that locally too01:59
clarkbjeblair: I can't figure out why the keystone port change is still unhappy though. Looks like installing or starting tempest may be erroring01:59
clarkbsdague: how does tempest know what port to use for keystone?01:59
clarkb tempest.conf seems to only use port 5000 02:00
*** nati_uen_ has joined #openstack-infra02:00
clarkbthe lights just went out in this room. I am going to head home now02:02
sdagueclarkb: it uses the uri02:02
sdaguein the config02:02
sdaguedefaulting to the 5000 one02:02
sdaguebut it's settable02:02
sdagueand, yeh, need to head out here as well and regroup for dinner02:02
clarkbsdague: but I shouldn't need to change it if I change the keystone auth port?02:02
clarkb moving from 35357 to 32357 02:03
clarkb that particular bug is probably a lower priority since chance says 35357 will be available 02:03
sdaguehmmm... good question, and honestly my brain is a bit shot02:04
clarkbjust not always02:04
clarkbsdague: thats fair, I am going to afk and rest the brain02:04
jeblair clarkb: http://logs.openstack.org/77/57577/2/check/check-tempest-devstack-vm-full/21f7af6/console.html 02:04
jeblairclarkb: er, yeah, i don't understand what is failing there.  as in literally -- what command is failing?02:04
*** nati_ueno has quit IRC02:04
*** DennyZhang has quit IRC02:05
clarkbjeblair: even the devstack log ends there02:05
clarkbI probably need to run my change locally02:05
jeblairclarkb: oh, there are a bunch of "ERROR: Unauthorized" lines in there02:05
clarkband get hands on with it02:05
clarkbI bet we are hardcoded to 35357 in other places02:05
*** bingbu has joined #openstack-infra02:09
*** sgran has quit IRC02:09
jeblairclarkb: yeah the nova admin commands are emitting that02:09
lifelessclarkb: reviewed; minor quibble02:10
*** sgran has joined #openstack-infra02:10
*** michchap_ has joined #openstack-infra02:11
*** moted has quit IRC02:11
*** ericw has quit IRC02:12
*** michchap has quit IRC02:14
*** nati_uen_ has quit IRC02:15
*** nati_ueno has joined #openstack-infra02:16
*** dolphm_reallyafk is now known as dolphm02:17
anteayaI can't hold -neutron in lockdown much longer, our devs in the other side of the world who aren't on irc are waking up and submitting patches02:17
anteayaand most of the other core devs are offline02:18
*** sarob has joined #openstack-infra02:18
*** ArxCruz has quit IRC02:18
*** michchap has joined #openstack-infra02:21
*** dkliban_ has quit IRC02:21
*** moted has joined #openstack-infra02:22
*** michchap_ has quit IRC02:24
anteayalifeless did you want to comment again on 57584, it looks like it might pass check02:24
*** boris-42 has joined #openstack-infra02:24
lifelessanteaya: hmm?02:24
anteaya lifeless: https://review.openstack.org/#/c/57584/1 02:25
anteayajeblair asked a question02:26
anteayaI have my +1 handy02:26
anteayawould be great if the two of you came to agreement02:26
lifelessMy comment is a comment, not a -1.02:27
*** ftcjeff has joined #openstack-infra02:27
*** sdake_ has joined #openstack-infra02:30
clarkbI am walking home will check comment from there02:32
*** loq_mac has quit IRC02:32
*** llu has joined #openstack-infra02:34
*** masayukig has quit IRC02:37
*** llu has left #openstack-infra02:37
*** masayukig has joined #openstack-infra02:37
*** dolphm has quit IRC02:38
*** jhesketh__ has quit IRC02:38
*** wenlock has joined #openstack-infra02:38
anteayalifeless: okay02:41
*** jhesketh__ has joined #openstack-infra02:42
*** yamahata_ has quit IRC02:42
clarkblifeless: is there less work to do? http GETs are relatively expensive compared to a regex right?02:43
lifelessclarkb: it's a tradeoff; in this case - meh -, but when dealing with all neutron ports for instance, individual GETs will be much cheaper.02:44
lifelessclarkb: in principle a single GET of a flavor is one digital signature + a one row DB query.02:45
jeblairit takes 1.5 seconds against rax prod.02:45
*** dolphm has joined #openstack-infra02:45
lifelessclarkb: however openstack hasn't put much effort into optimisation yet...02:45
*** dkranz has joined #openstack-infra02:45
lifelessjeblair: :(02:47
lifelessanyhow, I don't think it needs changing, it will work.02:48
lifeless it's more a principle thing, avoiding unnecessary work. 02:48
*** guohliu has joined #openstack-infra02:51
*** senk has joined #openstack-infra02:51
anteaya go go 57584 02:52
anteayaif/after that merges are we still on gate lockdown?02:52
anteayawho is keeping track?02:52
jeblair anteaya: https://etherpad.openstack.org/p/critical-patches-gatecrash-November-2013 02:53
anteayayes02:53
anteaya I'll add 57584 02:53
anteayaif it merges, are we open for business again?02:54
*** sarob has quit IRC02:55
*** sarob has joined #openstack-infra02:56
*** sdake_ has quit IRC02:56
anteayaI need to go to sleep soon, but I need to know what to tell -neutron before I go offline02:57
anteayathey have been really co-operative and I want to give them good information02:57
clarkbanteaya: we need 57584 then the change it depends on02:57
clarkbthen a revert of another change so ~3 changes to go02:57
anteayaokay02:57
*** wenlock has quit IRC02:57
clarkb1251920 is still an open issue though we seem to actually be able to merge things now after the various other fixes have gone in02:58
*** dkliban_ has joined #openstack-infra02:58
anteayaclarkb: 57584 doesn't have any dependencies02:58
anteayawhat is the change it depends on?02:58
clarkbanteaya: a change in a different project, this is noted on the etherpad02:58
anteayasorry yes, now I see it02:59
*** thomasem has joined #openstack-infra02:59
anteayaclarkb: do we have a url for the revert?03:00
clarkbanteaya: I don't think anyone has proposed it yet03:00
*** sarob has quit IRC03:00
anteayaso do we leave the gate in lockdown all night?03:00
anteayaor open it up and then lock it again tomorrow03:01
anteayaor are people going to stay up?03:01
clarkbanteaya: considering the success of us asking people to stop I am half tempted to keep it this way until the last few really annoying bugs get fixed03:01
anteayaokay03:01
*** thomasem has quit IRC03:01
clarkbbut I also think we need to be fair and let other people do normal work03:01
clarkbso I honestly don't have a strong opinion either way03:01
anteayaI vote for opening it up and going to sleep - for me03:02
anteayaand assessing the situation tomorrow03:02
clarkbya worst case we shut it down again and do what we did today03:02
anteayayes03:02
cyeoh anteaya: the "dependent" one is https://review.openstack.org/#/c/57572/ 03:02
clarkbwhich wasn't too painful03:02
anteayathe poor folks who submit to neutron and who are not on irc are going to have a heck of a confusing work day otherwise03:03
cyeohclarkb: is the revert one you are referring to the v3 disable patch?03:03
anteayacyeoh: thanks03:03
anteayaI picture the poor man who emailed mikal looking for his account credentials03:04
*** dolphm has quit IRC03:04
clarkb cyeoh: ya we should unrevert the v3 disable patch if the other grenade things get us going 03:04
anteayaeasier to let them submit and take their chances03:04
*** sdake_ has joined #openstack-infra03:04
anteayaclarkb: and yes this was successful, for a rare event03:04
*** matsuhas_ has joined #openstack-infra03:05
cyeohclarkb: yep, agreed. I'll propose the revert and leave a comment not to approve it until we know the gate is ok.03:06
clarkbsounds good03:06
*** matsuhashi has quit IRC03:08
*** nati_uen_ has joined #openstack-infra03:10
*** dcramer_ has quit IRC03:13
*** nati_ueno has quit IRC03:14
*** fifieldt has quit IRC03:16
sdagueok, so what is the magic invocation to use python-novaclient against rax without any of their crazy gorp?03:17
clarkbsdague: uh uh uh I think you use passwords03:17
clarkbso you must avoid all the api key stuff03:17
sdagueit didn't like that03:17
*** fifieldt has joined #openstack-infra03:18
sdagueI'm trying to use baked in python-novaclient on 13.1003:18
notmynameanteaya: clarkb: so stuff that was blocked on the gate should be "reverify no bug" right?03:19
sdagueI carved off a user on my rax account, and tried to use the devstack-gate script03:19
clarkbnotmyname: yes03:19
anteayaclarkb: are we opening the gate again?03:19
clarkbmordred: do you have an answer for sdague? I think you have actually tested it03:20
sdagueERROR: Invalid OpenStack Nova credentials.03:20
clarkbanteaya: meh, I would like those 3 changes to go through but we have been dealing with a bit of leakage all day anyways03:20
clarkbnotice the many ceilometer changes in the gate...03:20
anteayayes I saw03:21
*** sdake_ has quit IRC03:21
notmynameanteaya: it's slow now anyway. what's the worst that could happen? ;-)03:21
anteayaI have a neutron-core in Japan -2 everything -neutron that has been submitted03:21
anteayawell if we are I will communicate as such to -neutron and go to bed03:22
anteayaI just want to comply03:22
anteayasince that is the message I am bringing to -neutron03:22
anteayaand they are getting it03:22
clarkbanteaya: I think it is ok to start letting things through03:22
anteayavery good03:22
anteayathank you03:23
clarkbbecause most people aren't able to babysit bug fixes anyways03:23
clarkband if we get a giant backlog again we will just kill it and repeat03:23
sdagueonce that devstack change lands, can someone trigger a recheck on - https://review.openstack.org/#/c/57572/ - if anyone is up03:23
clarkbsdague: ya I can do that03:23
*** fifieldt has quit IRC03:23
sdagueI'm about to head to dinner, and if that's green when I get back, I'll push it to the gate03:23
clarkbk03:23
anteayanight all03:23
sdagueman, what I wouldn't give for that Depends-On: logic :)03:24
anteayathanks for all the hard work today03:24
clarkbyup thanks everyone03:24
*** DennyZhang has joined #openstack-infra03:25
*** dkliban_ has quit IRC03:25
*** melwitt has quit IRC03:30
*** svarnau has quit IRC03:30
sdaguemaybe I can troll mikal about it03:32
*** wenlock has joined #openstack-infra03:37
*** nati_uen_ has quit IRC03:39
*** nati_ueno has joined #openstack-infra03:40
notmynamebad news? https://review.openstack.org/#/c/57582/03:41
notmynamefailed because of bug 125192003:41
uvirtbotLaunchpad bug 1251920 in nova "Tempest failures due to failure to return console logs from an instance" [Critical,Fix committed] https://launchpad.net/bugs/125192003:41
notmynamecheck, not gate03:41
*** pcrews has quit IRC03:45
*** fifieldt has joined #openstack-infra03:46
*** dkliban_ has joined #openstack-infra03:48
*** wenlock has quit IRC03:48
clarkbnotmyname: that is going to be the worst outstanding bug that we haven't managed to fix yet03:49
clarkbso we will still probably see a relatively high incidence of it. hopefully easier to debug now that a lot of the other problems have been fixed03:49
notmynameok, thanks03:49
*** loq_mac has joined #openstack-infra03:50
*** CaptTofu has quit IRC03:54
*** CaptTofu has joined #openstack-infra03:55
*** michchap has quit IRC03:55
*** boris-42 has quit IRC03:57
*** nati_uen_ has joined #openstack-infra03:57
*** boris-42 has joined #openstack-infra03:57
*** nati_uen_ has quit IRC03:58
*** matsuhas_ has quit IRC04:00
*** nati_ueno has quit IRC04:01
*** matsuhashi has joined #openstack-infra04:02
zaroanybody know if there's a way to search openstack-infra mailing list?04:04
*** ftcjeff has quit IRC04:04
clarkbzaro: google04:04
*** mriedem has quit IRC04:04
clarkbsite:lists.openstack.org and go from there04:05
zaroclarkb: google groups or just plain google?04:05
clarkbjust plain google, they should be indexing the mail archives04:05
*** chandankumar has joined #openstack-infra04:07
*** sandywalsh has quit IRC04:07
zarotoo much crap.. just gonna holla.04:08
zarofungi: do you have a link to that security gerrit question i sent to infra list?  the one asking about permissions on groups for security gerrit?04:09
clarkbzaro: http://lists.openstack.org/pipermail/openstack-infra/2013-October/000314.html that one?04:10
*** matsuhashi has quit IRC04:14
*** matsuhashi has joined #openstack-infra04:19
*** DinaBelova has joined #openstack-infra04:22
portanteclarkb: still around?04:22
clarkbportante: ya04:23
*** odyssey4me has joined #openstack-infra04:23
portanteso the gate jobs seem odd04:23
*** DinaBelova has quit IRC04:23
portantelooking at the top one, the jenkins output is at "running testr"04:24
zaroholla! u da best clarkb04:24
portanteis that right?04:24
clarkbportante: looking04:24
clarkbportante: which one can you link? the one I grabbed isn't doing that04:25
portanteI was looking at https://jenkins02.openstack.org/job/gate-tempest-devstack-vm-large-ops/16508/console04:26
clarkbportante: I have noticed in the past that since testr by default is pretty quiet the jenkins console output will appear to hang there but it is just buffering the data and waiting for enough to come back to display it to you04:27
portanteand it seems I just caught it after 04:18:0904:27
clarkbportante: I think that is what happened in that test04:27
clarkbbasically testr wasn't writing enough stuff to the console to have jenkins show it to you04:27
portanteI saw that in a few jobs, so just checking04:27
*** dkranz has quit IRC04:35
*** clayg has left #openstack-infra04:35
*** loq_mac has quit IRC04:35
*** dkranz has joined #openstack-infra04:36
*** senk has quit IRC04:37
*** sandywalsh has joined #openstack-infra04:37
*** sdake_ has joined #openstack-infra04:38
clarkb1251920 you are my nemesis04:41
clarkbsdague: my devstack change hit 1251920, I will reverify, but ya ugh04:41
portanteclarkb: so we are not out of the woods yet04:44
clarkbportante: no not completely, most of the problems other than 1251920 that we had identified were fixed04:44
clarkbso today was good, but not enough04:45
clarkbproblem with 1251920 is I don't think anyone knows why it is happening04:45
clarkband most theories have been debunked04:45
portantehmm, if somebody wants to walk me through it, I can take a look tomorrow04:45
portanteI say that naively, just happy to help04:46
clarkbportante: I am pretty naive to it as well. tl;dr is tempest has a test that requests the console output from a qemu VM via nova. For some reason at a relatively high incidence rate nova fails to return that data04:47
clarkbmikal has done some good investigating. he may be able to chime in and give more info04:47
mikalLast I heard jog0 was trying with the havana tempest04:53
mikalNot sure how that went04:53
*** nati_ueno has joined #openstack-infra04:57
*** DennyZhang has quit IRC04:59
*** michchap has joined #openstack-infra05:01
*** matsuhashi has quit IRC05:02
*** odyssey4me has quit IRC05:03
portanteclarkb, mikal: so tempest was changed sometime after Havana, and how was it tested to ensure it works?05:04
*** afazekas has quit IRC05:05
clarkbportante: I rebased tempest master onto tempest havana, squashed that into one commit, then proposed several changes with that code so that icehouse tempest would be run against everything else havana05:07
portanteoh, and how was that received?05:08
clarkbportante: https://review.openstack.org/#/c/57504/ https://review.openstack.org/#/c/57506/ and https://review.openstack.org/#/c/57507/05:08
clarkbthe jobs don't pass but no incidence of 125192005:08
portantethat is a good start05:08
portanteseems like we need to consider more closely what changes to tempest mean05:09
clarkbyeah, we might need to go through it with a comb instead of my brute-force squash-everything-together method05:09
portantei'd be happy to help05:09
portante... tomorrow ... :)05:09
clarkbI am about to sign off for the night as well05:10
portanteping when you are ready05:10
clarkbwill do05:10
portante'night05:10
*** svarnau has joined #openstack-infra05:13
*** loq_mac has joined #openstack-infra05:17
*** loq_mac has quit IRC05:19
*** loq_mac has joined #openstack-infra05:20
*** odyssey4me has joined #openstack-infra05:20
*** loq_mac has quit IRC05:20
*** loq_mac has joined #openstack-infra05:21
*** odyssey4me2 has joined #openstack-infra05:21
*** wenlock has joined #openstack-infra05:21
*** odyssey4me3 has joined #openstack-infra05:23
*** odyssey4me has quit IRC05:25
*** odyssey4me2 has quit IRC05:26
jog0mikal: reading scrollback05:27
*** afazekas has joined #openstack-infra05:27
*** afazekas has quit IRC05:27
jog0mikal: https://review.openstack.org/#/q/status:open+project:openstack/tempest+branch:master+topic:57578,n,z looks promising05:28
*** dkranz has quit IRC05:29
jog0mikal: can you babysit that patch for US night05:29
*** dkranz has joined #openstack-infra05:29
jog0and if it looks good, propose a patch with a proper commit message05:29
jog04 runs and no consolelog == promising05:30
jog0only hitting that grenade bug05:30
*** svarnau has quit IRC05:31
clarkbjog0: ooh05:32
* clarkb looks05:32
jgriffithjog0: not core but I can run rechecks if there's a threshold you want to hit on it05:32
jgriffithLet me know... I'll be up for a bit05:32
jog0jgriffith: yes!05:32
clarkbjog0: any idea why that may help? or was that a stab in the dark?05:32
jog0clarkb: stab in dark05:32
jog0jgriffith: lets say if we get 10 collective runs without console log05:33
*** loq_mac has quit IRC05:33
jog0jgriffith: and we have 4 patches to babysit05:33
jog0https://review.openstack.org/#/c/57578/  https://review.openstack.org/#/q/status:open+project:openstack/tempest+branch:master+topic:57578,n,z05:33
jgriffithaye...05:34
jgriffithwell I'm happy to help monitor on and off05:34
jgriffithwithout knowing which is root it's kinda dicey though05:34
jog0btw http://paste.openstack.org/show/53720/05:35
jog0getting better05:35
*** mihgen has joined #openstack-infra05:35
jog0jgriffith: we are looking for https://bugs.launchpad.net/nova/+bug/125192005:36
uvirtbotLaunchpad bug 1251920 in nova "Tempest failures due to failure to return console logs from an instance" [Critical,Fix committed]05:36
jog0mainly  AssertionError: Console output was empty.05:36
jgriffithYep, caught up on that at least05:36
jog0but yeah I don't know the root05:36
jgriffithjog0: I'll help out if I can, I can at least be the "recheck" monkey05:36
jog0clarkb: that patch touched the right file at the right time05:36
jog0jgriffith: thanks05:36
jgriffithjog0: looking at the test/code it seems reasonable05:37
*** zul has joined #openstack-infra05:37
jgriffithanyway...05:37
jog0jgriffith: I didn't finish it but I was writing a recheck monkey myself05:37
clarkbjog0: gotcha05:37
jog0http://paste.debian.net/66940/05:37
jog0jgriffith: you can just finish that up instead if you want05:37
jog0gerritlib can comment on patches05:37
* jgriffith has been replaced by software :)05:37
*** sarob has joined #openstack-infra05:39
*** sarob has quit IRC05:43
*** SergeyLukjanov has joined #openstack-infra05:48
jgriffithjog0: cool... so if I'm following your intent here05:53
jgriffithjog0: replace the 57357 with a list of the other 4 patches05:54
jog0yeah05:54
jgriffithjog0: add the handler to detect and leave comment on event05:54
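The handler jgriffith describes could look roughly like the sketch below. It is an illustration only, not the script from the paste: it assumes events arrive as dicts shaped like Gerrit's stream-events JSON (`type`, `change.number`, `author.username`), that the CI account's username is `jenkins`, and that the watch list holds 57578 plus whichever other changes are being babysat. `should_recheck` and `WATCHED` are hypothetical names, and actually posting the recheck comment (for example through gerritlib) is left out.

```python
# Hypothetical watch list: 57578 comes from the log; add the other changes here.
WATCHED = {57578}


def should_recheck(event, watched=WATCHED, ci_user="jenkins"):
    """Return True when a CI result comment lands on a watched change.

    `event` is assumed to be one decoded Gerrit stream-events dict.
    """
    if event.get("type") != "comment-added":
        return False
    if event.get("author", {}).get("username") != ci_user:
        return False
    try:
        # Older Gerrit versions send the change number as a string.
        number = int(event.get("change", {}).get("number"))
    except (TypeError, ValueError):
        return False
    return number in watched


# Made-up event for illustration; real events carry many more fields.
example = {
    "type": "comment-added",
    "author": {"username": "jenkins"},
    "change": {"number": "57578"},
}
print(should_recheck(example))
```

A caller would loop over incoming events and leave a recheck comment whenever this returns True.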
clarkbjog0: ken'ichi appears to be on the same track (see email)05:54
clarkbso this has me very hopeful05:54
jgriffithI'll have to wait til I get one to dissect/understand the results it gives back but should be straightforward05:55
jgriffithplaying in the lib code now to try and get a preview05:55
jog0clarkb: cool05:55
jog0so we have at least 8 successes then05:56
jog0ken'ichi's and ours05:56
jog0another 2 and I say go for it05:56
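The threshold jog0 describes (10 collective runs with no empty console log) amounts to a simple tally. A minimal sketch follows, assuming each run is recorded as a boolean meaning "this run hit the console-log failure"; the `verdict` helper, the labels, and the 4 + 4 split of the 8 clean runs are assumptions for illustration.

```python
def verdict(results, threshold=10):
    """Summarise recheck results.

    `results` maps a patch label to a list of booleans,
    where True means that run hit the console-log failure.
    """
    runs = [hit for hits in results.values() for hit in hits]
    if any(runs):
        return "bug still reproduces"
    missing = threshold - len(runs)
    if missing <= 0:
        return "go for it"
    return f"need {missing} more clean runs"


# 8 clean runs so far, split across two patches (split is assumed).
so_far = {"ours": [False] * 4, "ken'ichi's": [False] * 4}
print(verdict(so_far))
```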
clarkbjog0: should probably do ken'ichi's change as it skips rather than removes the test (though a broken test should maybe be removed?)05:56
jog0clarkb: same difference to me05:57
clarkbjog0: oh the skip includes the bug, I like that05:59
jog0clarkb: that sounds better05:59
clarkbjog0: oh those tests will use the same server06:00
clarkbI bet if they run in the same process the backup test is interfering with the console test06:01
jog0clarkb: ahhh that makes sense so depending on order the server is in a bad state06:01
jog0clarkb: cool06:01
jog0if you can debug that a little more lets merge06:01
clarkbbut when run in different processes its ok because they should use different servers06:01
* clarkb looks at subunit logs06:01
jog0clarkb: also the original patch failed for this bug a few times06:05
jog0which confirms my theory06:05
jog0if gate is flaky people ignore who causes the flakiness06:05
*** jhesketh has quit IRC06:05
clarkbjog0: argh06:06
clarkbjog0: so my theory is at least partially wrong because all tempest tests belonging to the same class go in the same test process06:06
clarkbjog0: but it may have to do with test order06:06
clarkbso looking at that now06:06
jog0clarkb: yeah06:06
jog0thats what I was thinking06:06
jog0if console log is second it will break is my guess06:09
jog0why not sure06:09
clarkbso in one failure it is second06:09
clarkbgoing to look at success now06:09
clarkbalso XML and JSON both did backup then console output but only xml failed06:09
jog0ohh that could be part of the issue06:10
jog0xml vs json tests06:10
mikaljog0: yep, I will recheck it a couple of times before bed06:10
mikalUnless jgriffith beats me to it06:11
jgriffithscripts about done, just need to verify the message in the event06:11
mikalIts all good06:11
clarkbjog0: a passing test has backup then console for both XML and JSON so it may be more subtle than I was hoping06:11
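The ordering check clarkb is doing by hand against the subunit logs can be sketched as below. This is only an illustration: the test names are placeholders rather than confirmed tempest test IDs, and `ran_before` is a hypothetical helper.

```python
def ran_before(order, first, second):
    """True if a test whose name contains `first` ran before one containing `second`.

    `order` is the list of test names in the order one worker ran them.
    """
    first_idx = next((i for i, name in enumerate(order) if first in name), None)
    second_idx = next((i for i, name in enumerate(order) if second in name), None)
    if first_idx is None or second_idx is None:
        return False
    return first_idx < second_idx


# Placeholder names standing in for the backup and console-output tests.
failing_run = ["test_create_backup", "test_get_console_output"]
passing_run = ["test_create_backup", "test_get_console_output"]
print(ran_before(failing_run, "backup", "console_output"))
print(ran_before(passing_run, "backup", "console_output"))
```

Both runs report the same order, which matches the observation that ordering alone does not separate passing runs from failing ones.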
jog0clarkb: :/06:11
jog0either way looks like a good candidate and disabling the test is almost 0 risk06:12
jog0and we have 8 passes06:12
jog0clarkb: what's the status of the grenade fix?06:12
jgriffithmikal: if nothing else I know have a utility to tell me to go hit recheck :)06:12
jgriffiths/konw/now/06:12
jog0jgriffith: I was going to take it further and have it auto run recheck06:13
clarkbjog0: devstack change is in the gate06:13
clarkbit hit 125192006:13
jgriffithjog0: yeah, that's where I'm at06:13
jog0look at the code in elastic-recheck for how to do that06:13
jgriffithjog0: just wanted to see a success message example back from gerrit before trying to cod06:13
jgriffithahhhh06:13
jgriffithexcellet!06:13
jgriffithent06:13
jog0jgriffith: thats why i didn't do it yet either06:13
jgriffithLOL06:13
*** rpodolyaka1 has joined #openstack-infra06:16
sdagueclarkb: so I could always just take the risk and jump the grenade change into the gate06:18
jog0clarkb: I think we should do just A+  ken'ichi's patch06:19
jog0clarkb: and see if it merges after sdague jumps the grenade06:19
jog0then we should be in good shape06:19
sdaguewhere is ken'ichi's patch?06:19
jog0clarkb: ^06:19
clarkbhttps://review.openstack.org/#/c/57193/06:20
sdaguesorry, just got back from dinner06:20
clarkbyou might want an updated commit message06:20
jog0clarkb: we can just do that for him06:20
jog0his patch is better than mine06:20
clarkbhe is around too it looks like06:20
clarkbnot sure if on irc06:20
*** odyssey4me3 has quit IRC06:21
sdagueyeh, I'm not sure what his nick is06:23
*** loq_mac has joined #openstack-infra06:23
jog0https://launchpad.net/~oomichi06:24
jog0don't see him around06:24
jog0ok I'm out for the night06:26
jog0hopefully by the time I am online tomorrow the gate queue will be reloaded06:26
sdagueso if someone wants to un WIP his patch, I can gate hop it06:28
clarkbsdague: k06:29
clarkbsdague: pushed06:31
sdagueclarkb: +A06:31
*** rpodolyaka1 has quit IRC06:33
sdagueok, that's all the damage I can do tonight. I'll take a look in the morning and see what's merged.06:34
clarkbsdague: now we just hope that no other things prevent it from getting in :)06:34
clarkbI will try to get the changes in in the morning if they haven't made it in by then06:34
sdagueyeh, honestly, I'm tempted to snipe out all those swift and ceilometer changes06:34
clarkbsdague: the only way to do that other than restarting zuul again is pushing new patchsets06:35
clarkbwhich is far less than ideal06:35
sdagueyep06:35
sdagueI've done it before06:35
clarkb:)06:35
sdaguebut I'll give it overnight06:35
*** loquacities has joined #openstack-infra06:36
*** loq_mac has quit IRC06:36
openstackgerritRussell Bryant proposed a change to openstack-infra/reviewstats: Fix disagreement percentage calculation  https://review.openstack.org/5760606:37
*** loquacities has quit IRC06:40
*** loq_mac has joined #openstack-infra06:40
*** pcrews has joined #openstack-infra06:42
*** michchap has quit IRC06:43
*** michchap has joined #openstack-infra06:44
*** rpodolyaka1 has joined #openstack-infra06:44
*** nosnos has quit IRC06:48
*** michchap has quit IRC06:48
*** nosnos has joined #openstack-infra06:48
*** chandankumar_ has joined #openstack-infra06:50
*** chandankumar has quit IRC06:51
*** amotoki has joined #openstack-infra06:53
*** dstanek has quit IRC06:54
*** loq_mac has quit IRC06:56
*** masayukig has quit IRC06:59
*** SergeyLukjanov has quit IRC06:59
*** yolanda has joined #openstack-infra07:08
*** jcoufal has joined #openstack-infra07:14
*** rpodolyaka1 has quit IRC07:17
jgriffithmikal: jog0 http://paste.openstack.org/show/53726/07:18
jgriffithseems to work ok07:19
jgriffithI'm heading offline for a bit07:19
jgriffithmay make it back on depending on how my other project goes07:19
jgriffithrather than leave it running on my machine I figure I'd give it to somebody else in case there's a bug :)07:19
ogelbukhis there a blogpost or some article anywhere about how gerrit is scaled in OpenStack infra?07:22
clarkbjgriffith: we went ahead and approved one of those changes07:26
clarkbogelbukh: there are the docs at http://ci.openstack.org07:27
clarkbgerrit itself isn't hard to scale. one semi beefy machine and a few parameters tuned is all we do07:27
*** denis_makogon_ has joined #openstack-infra07:28
ogelbukhclarkb: thanks, aligns to my understanding07:28
*** wenlock has quit IRC07:32
mikaljgriffith: I'm going to keep rechecking jog0's change, so go out with vigour07:34
*** nati_ueno has quit IRC07:40
*** nati_ueno has joined #openstack-infra07:43
*** nicedice has quit IRC07:43
*** dstanek has joined #openstack-infra07:46
*** mihgen has quit IRC07:49
*** dstanek has quit IRC07:51
*** hdd has joined #openstack-infra07:52
*** salv-orlando has joined #openstack-infra07:55
clarkbmikal no need07:56
clarkbwe made the decision to approve one. though more rechecks give more confidence07:57
*** fifieldt has quit IRC08:12
*** Mithrandir has quit IRC08:21
*** SergeyLukjanov has joined #openstack-infra08:21
*** flaper87|afk is now known as flaper8708:21
*** nosnos_ has joined #openstack-infra08:30
*** mihgen has joined #openstack-infra08:30
*** nosnos has quit IRC08:33
*** boris-42 has quit IRC08:41
*** mkerrin has joined #openstack-infra08:42
*** Bada has joined #openstack-infra08:43
*** ruhe has joined #openstack-infra08:47
*** osanchez has joined #openstack-infra08:49
*** yassine has joined #openstack-infra08:52
*** derekh has joined #openstack-infra08:58
*** Ng has joined #openstack-infra09:06
*** plomakin has joined #openstack-infra09:06
*** Ng has quit IRC09:06
*** Ng has joined #openstack-infra09:07
*** masayukig has joined #openstack-infra09:14
*** yamahata_ has joined #openstack-infra09:14
*** jpich has joined #openstack-infra09:18
*** ruhe has quit IRC09:22
*** dizquierdo has joined #openstack-infra09:23
*** ljjjustin has joined #openstack-infra09:24
*** pblaho has joined #openstack-infra09:26
*** thomasbiege has joined #openstack-infra09:28
openstackgerritSergey Lukjanov proposed a change to openstack-infra/config: Setup devstack-gate tests for Savanna  https://review.openstack.org/5731709:28
*** pblaho has quit IRC09:38
*** pblaho has joined #openstack-infra09:38
*** yamahata_ has quit IRC09:40
*** fbo_away is now known as fbo09:42
*** johnthetubaguy has joined #openstack-infra09:44
*** thomasbiege has quit IRC09:44
*** boris-42 has joined #openstack-infra09:48
*** boris-42 has quit IRC09:52
matelHi all, where can I get information about when the infra team will use TripleO to test hypervisors?09:53
mikalclarkb: oh, cool09:54
*** plomakin has quit IRC09:55
*** mkerrin has quit IRC09:55
*** nati_ueno has quit IRC09:55
*** plomakin has joined #openstack-infra09:55
*** ruhe has joined #openstack-infra09:58
ttxmordred/jeblair: you should chime in on markwash's thread on client major releases09:58
*** SergeyLukjanov is now known as _SergeyLukjanov09:59
*** Bada has quit IRC10:07
*** xchu has quit IRC10:08
*** jcoufal has quit IRC10:09
*** odyssey4me3 has joined #openstack-infra10:10
*** afazekas has joined #openstack-infra10:12
*** masayukig has quit IRC10:17
*** bingbu has quit IRC10:19
*** yamahata_ has joined #openstack-infra10:21
*** odyssey4me3 has quit IRC10:21
BobBallfungi: What's your usecase?  Why do you want more than 14 cinder volumes?  Depending on exactly what you want, we might be able to get more.10:25
BobBallfungi: I guess I really am asking how important is it :)10:25
*** Mithrandir has joined #openstack-infra10:29
*** ljjjustin has quit IRC10:29
openstackgerritJulien Danjou proposed a change to openstack/requirements: Bump to using SQLAlchemy migrate 0.8.2.  https://review.openstack.org/5666210:30
*** arata has joined #openstack-infra10:31
*** arata has left #openstack-infra10:31
*** odyssey4me3 has joined #openstack-infra10:31
*** ruhe has quit IRC10:31
*** guohliu has quit IRC10:32
*** yamahata_ has quit IRC10:33
*** nati_ueno has joined #openstack-infra10:37
*** ruhe has joined #openstack-infra10:41
*** mihgen has quit IRC10:52
openstackgerritRoman Prykhodchenko proposed a change to openstack-infra/config: Adds devstack-gate tests for Ironic  https://review.openstack.org/5391710:54
*** mihgen has joined #openstack-infra10:56
*** DinaBelova has joined #openstack-infra10:56
*** SergeyLukjanov has joined #openstack-infra10:59
*** jhesketh__ has quit IRC11:02
*** adalbas has joined #openstack-infra11:11
*** odyssey4me3 has quit IRC11:11
*** ArxCruz has joined #openstack-infra11:12
*** jhesketh__ has joined #openstack-infra11:14
openstackgerritYuuichi Fujioka proposed a change to openstack-dev/hacking: Add metaclass for Python3 compatibility  https://review.openstack.org/5689011:18
*** nosnos_ has quit IRC11:20
*** nosnos has joined #openstack-infra11:21
*** nicedice has joined #openstack-infra11:22
*** nosnos has quit IRC11:25
*** syerrapragada has quit IRC11:32
*** syerrapragada has joined #openstack-infra11:33
*** odyssey4me3 has joined #openstack-infra11:33
*** mihgen has quit IRC11:37
*** loq_mac has joined #openstack-infra11:40
*** mihgen has joined #openstack-infra11:42
*** boris-42 has joined #openstack-infra11:45
openstackgerritSergey Lukjanov proposed a change to openstack-infra/config: Setup devstack-gate tests for Savanna  https://review.openstack.org/5731711:45
*** ericw has joined #openstack-infra11:47
*** loq_mac has quit IRC11:48
*** odyssey4me3 has quit IRC11:48
*** rfolco has joined #openstack-infra11:49
*** ericw has quit IRC11:49
*** yamahata_ has joined #openstack-infra11:50
openstackgerritYuuichi Fujioka proposed a change to openstack-dev/hacking: Add metaclass for Python3 compatibility  https://review.openstack.org/5689011:52
openstackgerritJaroslav Henner proposed a change to openstack-infra/jenkins-job-builder: Add properties testing.  https://review.openstack.org/5765411:55
openstackgerritJaroslav Henner proposed a change to openstack-infra/jenkins-job-builder: Add batch_tasks support.  https://review.openstack.org/5746911:55
openstackgerritJaroslav Henner proposed a change to openstack-infra/jenkins-job-builder: Add seealso to batch_tasks from promoted_build.  https://review.openstack.org/5747311:55
*** yaguang has quit IRC11:58
*** weshay has joined #openstack-infra11:58
*** Guest97079 is now known as cyril__12:05
*** salv-orlando has quit IRC12:08
*** rfolco has quit IRC12:08
*** ericw has joined #openstack-infra12:13
*** rfolco has joined #openstack-infra12:14
*** ruhe has quit IRC12:15
*** salv-orlando has joined #openstack-infra12:16
*** syerrapragada has quit IRC12:17
*** syerrapragada has joined #openstack-infra12:17
*** nati_ueno has quit IRC12:18
openstackgerritJaroslav Henner proposed a change to openstack-infra/jenkins-job-builder: Add batch_tasks support.  https://review.openstack.org/5746912:18
openstackgerritJaroslav Henner proposed a change to openstack-infra/jenkins-job-builder: Add seealso to batch_tasks from promoted_build.  https://review.openstack.org/5747312:19
*** ruhe has joined #openstack-infra12:26
*** michchap has joined #openstack-infra12:28
*** amotoki has quit IRC12:40
*** pcm_ has joined #openstack-infra12:47
*** mihgen has quit IRC12:54
*** pblaho has quit IRC12:54
*** pblaho has joined #openstack-infra12:57
*** sergmelikyan has joined #openstack-infra13:07
sergmelikyanIs zuul down?13:07
*** alcabrera has joined #openstack-infra13:08
ogelbukhno, it's not13:09
ogelbukhthough status page is somewhat slow to render13:10
*** ruhe has quit IRC13:12
anteayayes my status page was slow to render as well13:14
anteayaif that is the only problem today, we are going to have a great day13:14
ogelbukh)13:15
*** ruhe has joined #openstack-infra13:17
*** mriedem has joined #openstack-infra13:18
*** dstanek has joined #openstack-infra13:18
*** w_ has joined #openstack-infra13:23
*** olaph has quit IRC13:25
*** thomasem has joined #openstack-infra13:27
anteayaopenstack.org is slow13:27
openstackgerritSergey Lukjanov proposed a change to openstack-infra/config: Add merge-release-tags job to Savanna  https://review.openstack.org/5766713:31
*** mihgen has joined #openstack-infra13:35
*** CaptTofu has quit IRC13:39
*** CaptTofu has joined #openstack-infra13:39
*** hashar has joined #openstack-infra13:40
*** sandywalsh has quit IRC13:40
*** chandankumar_ has quit IRC13:40
*** dizquierdo has quit IRC13:42
openstackgerritSergey Lukjanov proposed a change to openstack-infra/config: Enable #openstack-climate IRC channel logging  https://review.openstack.org/5767513:45
*** jergerber has joined #openstack-infra13:45
*** zul has quit IRC13:50
*** zul has joined #openstack-infra13:51
*** jergerber has quit IRC13:52
*** dprince has joined #openstack-infra13:53
*** sandywalsh has joined #openstack-infra13:53
*** ilyashakhat has quit IRC13:55
*** jergerber has joined #openstack-infra13:55
*** yamahata_ has quit IRC13:57
mordredttx: done13:57
*** changbl has quit IRC13:57
*** yamahata_ has joined #openstack-infra13:58
*** bpokorny has quit IRC13:59
*** dkranz has quit IRC14:03
*** w_ is now known as olaph14:05
sdagueok, so I'm thinking of sniping the stuff out ahead of that grenade change14:06
*** yamahata_ has quit IRC14:07
*** yamahata_ has joined #openstack-infra14:08
*** DinaBelova has quit IRC14:09
*** mfer has joined #openstack-infra14:10
*** herndon_ has joined #openstack-infra14:10
*** rongze has joined #openstack-infra14:11
*** arosen has joined #openstack-infra14:14
*** dolphm has joined #openstack-infra14:14
sdagueman, zuul is taking a long time to react to these14:15
arosenHi, I was wondering if others thought it might be useful to have the CI also upload a copy of a mysqldump of the databases at the end of a run. I'm trying to track down one of the gate failures and it seems like that would be somewhat useful/helpful.14:15
*** yaguang has joined #openstack-infra14:16
*** dkliban_ has quit IRC14:16
sdaguearosen: I think that would be fine14:17
sdagueit would be added to devstack-gate14:17
*** DinaBelova has joined #openstack-infra14:20
ttxmordred: we are making rootwrap standalone and I now have a repo that should make a good base for openstack/oslo.rootwrap... what's the best first stage for this ? Push to github ?14:21
arosenthis might also be a dumb question but I've been using chrome to view the log files and search there, but it's really slow because the log files are so large, so I've been downloading them locally and using vi to search them in the gz format, but vim doesn't auto load all of the file in the gz format.14:21
arosenWhen I go to extract those files I get an error14:22
ttxI'd definitely like my filter-tree and addition of packaging support files to get reviewed by people before we push it anywhere14:22
arosengunzip  < screen-n-cpu.txt.gz  | tar xvf -14:22
arosengzip: stdin: not in gzip format tar: This does not look like a tar archive tar: Exiting with failure status due to previous errors14:22
*** markmc has joined #openstack-infra14:22
*** julim has joined #openstack-infra14:22
openstackgerritDan Prince proposed a change to openstack-infra/config: Drop the saz-gearman module (we don't use it)  https://review.openstack.org/5752714:22
*** julim has quit IRC14:23
arosenanyone have any work flow tips there?14:23
fungiso much scrollback buffer14:24
*** rongze_ has joined #openstack-infra14:24
mordredttx: yeah. github is the best bet14:24
*** wenlock has joined #openstack-infra14:26
*** rongze has quit IRC14:27
*** lchen has joined #openstack-infra14:28
*** rongze has joined #openstack-infra14:30
*** yamahata_ has quit IRC14:30
*** yamahata_ has joined #openstack-infra14:31
*** dolphm has quit IRC14:33
*** rongze_ has quit IRC14:33
*** xeyed4good has joined #openstack-infra14:34
mattymofungi, I just wanted to thank you for the help back during Summit for our silly branching issues14:34
mattymoit really made a big difference for us14:34
*** xeyed4good has left #openstack-infra14:35
fungimattymo: you're welcome--always glad to help14:35
*** pblaho has quit IRC14:36
anteayaarosen: I haven't run into that problem myself since I just use the browser and ctrl-f for searching14:39
anteayaany vim users downloading the gz log format able to give an earnest -neutron bug fixer a hand?14:40
sdagueman zuul takes some time to reschedule14:40
*** changbl has joined #openstack-infra14:40
*** julim has joined #openstack-infra14:40
fungiBobBall: well, we started out adding 0.5tb cinder devices to a lvm2 vg on a vm, but eventually needed more than we could attach to it. long-term we're looking at finding a way to front-end that data in and out of swift, but near-term options are to migrate the pvs individually from 0.5tb to 1tb to increase available space in the vg or hope there's a way to add more cinder devices to the vm14:42
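The capacity trade-off fungi describes is simple arithmetic. A rough sketch, assuming all 14 attachable volumes (the figure from BobBall's question) are used as PVs and ignoring LVM metadata overhead; `vg_capacity_tb` is a hypothetical helper.

```python
def vg_capacity_tb(device_count, device_size_tb):
    """Raw volume-group capacity in TB for equally sized devices."""
    return device_count * device_size_tb


print(vg_capacity_tb(14, 0.5))  # current layout: 7.0 TB
print(vg_capacity_tb(14, 1))    # after migrating every PV to 1 TB: 14 TB
```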
*** marun has joined #openstack-infra14:43
fungiarosen: i believe sdague added the server-side filtering cgi on logs.o.o specifically to help make that easier14:43
fungiarosen: though to get it to not convert those files to uncompressed text on the fly you need to pass a get variable i think... checking14:45
*** dcramer_ has joined #openstack-infra14:46
sdagueyeh, if you wget it it will come down as real text14:46
*** ryanpetrello has joined #openstack-infra14:46
sdagueit does content negotiation14:46
*** sergmelikyan has quit IRC14:46
arosenHrm I've been doing wget of : http://logs.openstack.org/22/55722/8/check/check-tempest-devstack-vm-neutron/eb06ca6/logs/screen-n-cpu.txt.gz  maybe i should drop the gz from the end.14:47
fungiarosen: add ?content-type=text/plain14:47
sdaguearosen: it14:47
anteayaarosen: running gunzip screen-n-cpu.txt.gz works for me14:47
anteayabut the file is html though doesn't render as html14:47
anteayawhen I open it with a browser it is just a text file with html tags14:48
sdaguearosen: yeh, pipe it through gzip -d locally14:48
sdagueit's not a tar, it's just a gzip14:48
sdagueotherwise it's *really* slow over the network14:48
sdaguewe get about a 15x compression on the logs14:49
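Since the downloaded file may be a real gzip stream or may already be plain text despite the `.gz` name, a small reader that checks the gzip magic bytes avoids guessing. This is a sketch using only the Python standard library; `read_log` is a hypothetical helper, not part of any OpenStack tooling.

```python
import gzip
import os
import tempfile

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream


def read_log(path):
    """Return the log text whether the file is gzip-compressed or plain."""
    with open(path, "rb") as fh:
        is_gzip = fh.read(2) == GZIP_MAGIC
    opener = gzip.open if is_gzip else open
    with opener(path, "rt", encoding="utf-8", errors="replace") as fh:
        return fh.read()


# Demo with a throwaway file: a gzip stream, no tar layer involved.
with tempfile.TemporaryDirectory() as tmp:
    demo = os.path.join(tmp, "screen-n-cpu.txt.gz")
    with gzip.open(demo, "wt", encoding="utf-8") as fh:
        fh.write("example log line\n")
    print(read_log(demo), end="")
```

The same call works on a file that came down as plain text, because the magic-byte check falls back to a normal `open`.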
*** yamahata_ has quit IRC14:50
fungimy note on ?content-type=text/plain seems to have been wrong. not helping after all14:50
*** CaptTofu has quit IRC14:50
sdaguesounds like I need to sort some docs for the header14:51
*** yamahata_ has joined #openstack-infra14:52
*** dkranz has joined #openstack-infra14:52
fungioh, right, there's a separate flag to wget to force specific content negotiation14:52
sdaguefungi: that param should work14:52
arosenI see, if i drop the .gz from the file it works much faster in vim. I guess it's already a txt file the way i'm downloading it though the extension says. nvm.. Thanks14:52
fungisdague: wget -O screen-n-cpu.txt.gz 'http://logs.openstack.org/22/55722/8/check/check-tempest-devstack-vm-neutron/eb06ca6/logs/screen-n-cpu.txt.gz?content-type=text/plain' gets me something which file reports as ASCII text, with very long lines14:53
sdaguearosen: it actually works for an odd reason of the way the configs work, not why you think. But if it's working for you for now, all good :)14:53
sdagueoh, yeh, wget will be text/plain anyway14:53
sdagueyou need to 0 out the Accept-Encoding header14:53
sdagueto drop gz14:54
sdaguegzip streams14:54
fungithat's what i was forgetting. thx14:54
sdagueso the top 2 patches in the gate should make some of the things better14:55
*** dkliban_ has joined #openstack-infra14:56
anteayasdague: \o/14:56
sdaguebut the first one might fail without the second, so it's a race14:56
sdagueif the first one does reset, I think we have to save off the zuul queues and just work on getting those through14:57
anteaya57584 and 57572?14:57
sdagueyeh14:57
* anteaya cheers for them14:57
*** wenlock has quit IRC14:57
anteayaI wonder if we should set up a pipeline for gate fixing patches14:58
anteayaso if we have to do this again, we just invoke that pipeline with infra or qa cores identifying which patches go in14:58
*** markmcclain has quit IRC14:58
sdagueyeh, some sort of escape valve has been talked about in the past14:59
anteayaand devs can push away but their patches either bounce or get stored in a list14:59
*** ftcjeff has joined #openstack-infra15:00
lchenHi, could anyone tell me how the docs gate is done? Any help and hints would be much appreciated15:02
lchenI mean where the code resides15:04
anteayalchen: is this what you are looking for? http://git.openstack.org/cgit/openstack-infra/config/tree/modules/openstack_project/files/zuul/layout.yaml#n21015:05
*** rcleere has joined #openstack-infra15:06
lchenanteaya: Thank you! I think so...15:07
anteayaokay great15:07
*** ftcjeff has quit IRC15:08
*** ruhe has quit IRC15:09
*** ^d has joined #openstack-infra15:11
*** pblaho has joined #openstack-infra15:14
sdaguehttps://jenkins02.openstack.org/job/gate-grenade-devstack-vm/17163/console why the grenade changes are stalling forever is something we need to sort15:16
*** ruhe has joined #openstack-infra15:16
*** DinaBelova has quit IRC15:17
*** nati_ueno has joined #openstack-infra15:20
*** yamahata_ has quit IRC15:21
*** blamar has quit IRC15:22
anteaya57584 got in \o/15:23
*** markmcclain has joined #openstack-infra15:25
*** dolphm has joined #openstack-infra15:27
fungisdague: perhaps if we dumped some date commands into the wrap script around various expensive functions we'd get a little granularity there?15:28
sdagueyeh, there is a pattern for it15:28
sdaguejust needs to be done15:28
lchenanteaya: one more question ;)  Could you point me into the direction where I can find how gate-heat-docs is defined?15:31
*** changbl has quit IRC15:33
*** dizquierdo has joined #openstack-infra15:33
*** DinaBelova has joined #openstack-infra15:35
*** CaptTofu has joined #openstack-infra15:35
anteayafungi: would gate-heat-docs be in jenkins-job-builder?15:36
*** datsun180b has joined #openstack-infra15:36
*** rongze_ has joined #openstack-infra15:37
fungianteaya: yeah, it's probably generated from a job template, maybe part of the python-jobs group15:37
*** wenlock has joined #openstack-infra15:37
fungii can dig it up if you wind up not being able to find it15:37
*** rongze has quit IRC15:37
anteayaI'll look15:38
*** afazekas has quit IRC15:39
*** jhesketh__ has quit IRC15:40
*** herndon_ has quit IRC15:41
lchenthank you for all the help15:41
*** __amotoki__ is now known as amotoki15:42
*** yassine has quit IRC15:43
*** yassine has joined #openstack-infra15:44
anteayafungi: I can't find anything for a gate-heat-docs definition in config or jenkins-job-builder15:45
*** hashar has quit IRC15:45
*** rpodolyaka has joined #openstack-infra15:46
fungianteaya: right, it'll probably be a job template so it won't have "heat" in the template name. i'll dig up the details for you15:46
anteayaah15:46
anteayathanks15:46
*** boris-42 has quit IRC15:48
fungianteaya: here you can see heat instantiates all jobs and job templates which are members of the python-jobs group https://git.openstack.org/cgit/openstack-infra/config/tree/modules/openstack_project/files/jenkins_job_builder/config/projects.yaml#n41015:48
*** markmcclain has quit IRC15:48
fungianteaya: the python-jobs groups includes as a member the 'gate-{name}-docs' job template https://git.openstack.org/cgit/openstack-infra/config/tree/modules/openstack_project/files/jenkins_job_builder/config/python-jobs.yaml#n22015:49
anteayaah ha, I should have been looking in jenkins_job_builder _inside_ of config15:49
anteayasilly me15:49
* anteaya goes to look15:49
fungianteaya: and you can see that template's details at https://git.openstack.org/cgit/openstack-infra/config/tree/modules/openstack_project/files/jenkins_job_builder/config/python-jobs.yaml#n14115:49
*** hashar has joined #openstack-infra15:50
fungiin particular, it uses the "docs" builder and passes in the github-org and project parameters to that https://git.openstack.org/cgit/openstack-infra/config/tree/modules/openstack_project/files/jenkins_job_builder/config/macros.yaml#n1115:51
fungianteaya: and the script you see being invoked in that builder can be seen at https://git.openstack.org/cgit/openstack-infra/config/tree/modules/jenkins/files/slave_scripts/run-docs.sh15:52
fungiin particular, line #28, which is the meat of that script, runs 'tox -e$venv -- python setup.py build_sphinx' within a checkout of that project's git repo15:53
*** ruhe has quit IRC15:53
fungiwhere $venv is 'venv' (set up on line #24)15:53
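[editor's sketch] The tour above boils down to string substitution: a project name is filled into the 'gate-{name}-docs' template, and the builder script ends up running tox with $venv set to 'venv'. The following is a rough illustration only, not Jenkins Job Builder's actual implementation; the helper names `expand_job_names` and `docs_command` are hypothetical.

```python
def expand_job_names(project_name, templates):
    """Fill the {name} placeholder of each job template for one project."""
    return [template.format(name=project_name) for template in templates]


def docs_command(venv):
    """Build the command line quoted above from run-docs.sh."""
    return "tox -e{venv} -- python setup.py build_sphinx".format(venv=venv)


if __name__ == "__main__":
    # heat instantiates the templates that belong to the python-jobs group.
    print(expand_job_names("heat", ["gate-{name}-docs"]))
    print(docs_command("venv"))
```

This is why searching for "gate-heat-docs" finds nothing: the literal job name exists only after the template has been expanded for a given project.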
*** markmcclain has joined #openstack-infra15:54
fungilchen: hopefully all the above ^ is of help to you as well15:55
anteayafungi you are the best!! I really appreciate the tour, I have been wondering how this worked for some time15:55
*** SergeyLukjanov has quit IRC15:55
fungiit seems a bit like spaghetti until you bang your head against it for a while15:55
anteayabeen banging my head, the tour relieves all the pain15:56
anteayathank you15:56
lchenfungi: Sure. That's really helpful. Thanks for the information.  Though I may still need some time to understand it, I am new to infra ;)15:56
*** hashar has quit IRC15:56
*** kgriffs_afk is now known as kgriffs15:57
fungilchen: well, welcome! feel free to ask any other questions you have, and don't take offense if we're busy or missing and don't get you an answer right away (just keep asking if that happens)15:57
*** afazekas has joined #openstack-infra16:00
lchenfungi:  yup.  Thanks a lot!16:00
*** DinaBelova has quit IRC16:01
anteayayay 57572 merged16:01
*** atiwari has joined #openstack-infra16:03
*** datsun180b_ has joined #openstack-infra16:04
*** datsun180b_ has quit IRC16:04
*** datsun180b has quit IRC16:04
*** datsun180b has joined #openstack-infra16:04
*** afazekas is now known as _afazekas16:05
*** afazekas has joined #openstack-infra16:09
*** kgriffs is now known as kgriffs_afk16:10
fungii've rechecked 57589 now16:10
*** blamar has joined #openstack-infra16:10
fungiwe'll see whether it fares any better16:10
* anteaya crosses her fingers16:12
*** CaptTofu has quit IRC16:17
*** CaptTofu has joined #openstack-infra16:17
*** rongze_ has quit IRC16:17
*** CaptTofu has quit IRC16:18
*** CaptTofu has joined #openstack-infra16:19
*** mrodden has joined #openstack-infra16:20
*** rongze has joined #openstack-infra16:23
*** CaptTofu has quit IRC16:23
*** bpokorny has joined #openstack-infra16:25
*** mihgen has quit IRC16:26
*** CaptTofu has joined #openstack-infra16:28
*** zaro0508 has quit IRC16:28
*** dkranz has quit IRC16:29
*** kgriffs_afk is now known as kgriffs16:29
*** markmcclain has quit IRC16:29
*** MarkAtwood has joined #openstack-infra16:30
*** ruhe has joined #openstack-infra16:31
*** ruhe has quit IRC16:31
*** amotoki is now known as amotoki_zzz16:33
*** branen_ has joined #openstack-infra16:34
*** branen_ has quit IRC16:35
clarkbwhat is the situation on 1251920? I see one fix didn't merge and that fix got -1'd because another fix was merged?16:39
fungiclarkb: i think we're still trying to keep the situation up to date in https://etherpad.openstack.org/p/critical-patches-gatecrash-November-201316:40
*** kgriffs is now known as kgriffs_afk16:40
*** pblaho has quit IRC16:41
clarkb57193 is the fix I am talking about (sorry on phone and not super useful)16:43
clarkbsdague ^16:46
fungiclarkb: that one is #4 from the head of the gate now16:46
fungiso either it needs a -2cdrv/0aprv/new patchset or it's likely to land shortly16:47
fungid'oh, one ahead of it just reset, so we've got a while still16:47
*** yassine has quit IRC16:47
clarkb:/ ok once that is in 1251920 should stop failing all the things16:48
*** yassine has joined #openstack-infra16:48
zarois gate open for approval yet?16:53
*** yassine has quit IRC16:54
notmynamewhere is the elastic recheck page that shows what's been detected?16:54
clarkbnot quite. couple more things need to get in16:54
clarkbnotmyname: http://status.openstack.org/elastic-recheck16:55
notmynamethanks16:55
*** svarnau has joined #openstack-infra16:58
*** mriedem1 has joined #openstack-infra16:58
openstackgerritMathieu Gagné proposed a change to openstack-infra/jenkins-job-builder: Ensure jobparams and group_jobparams are dict  https://review.openstack.org/5752516:58
*** mrodden1 has joined #openstack-infra16:59
*** mriedem has quit IRC16:59
*** dkranz has joined #openstack-infra17:00
*** changbl has joined #openstack-infra17:00
*** mriedem1 has quit IRC17:01
*** mriedem has joined #openstack-infra17:01
*** mrodden has quit IRC17:01
*** mrodden1 has quit IRC17:01
*** mrodden has joined #openstack-infra17:03
jog0clarkb: looks like we are getting close to the end17:07
jog0with the final patch(s) in the queue17:07
jog0do you want to start writing up a report of what happened in etherpad?17:07
jog0I will start doing that in a bit myself (first relocating to Berkeley for the day)17:08
*** markwash has quit IRC17:11
*** reed has joined #openstack-infra17:13
jog0clarkb: I think we need a email saying how we got here (bug by bug) and the fixes17:13
*** reed has quit IRC17:13
*** reed has joined #openstack-infra17:13
openstackgerritMathieu Gagné proposed a change to openstack-infra/jenkins-job-builder: Ensure jobparams and group_jobparams are dict  https://review.openstack.org/5752517:14
*** boris-42 has joined #openstack-infra17:16
*** jpich has quit IRC17:18
*** osanchez has quit IRC17:18
*** kgriffs_afk is now known as kgriffs17:21
*** SergeyLukjanov has joined #openstack-infra17:22
clarkbjog0 ya I can start that17:22
*** markmcclain has joined #openstack-infra17:24
sdaguejog0: I sniped out stuff that was in my way this morning, as I was annoyed the grenade patch was still chugging17:25
*** ekarlso has quit IRC17:30
jog0clarkb: cool maybe put it in the same etherpad17:30
jog0I will work on it in a bit17:31
*** fbo is now known as fbo_away17:31
mordredjog0, clarkb: neat. we're in good shape now?17:31
clarkbmordred: almost 57193 needs to merge as well as the grenade cleanup changes17:32
mordredwoot17:32
mordredyou guys are awesome17:32
dprincesdague: am I still waiting on someone to push a branch for this? https://review.openstack.org/#/c/57066/ who did you say was working on it?17:32
dprincesdague: the grenade baseline bump to use Havana instead of Griz17:32
sdaguemaurosr17:32
*** mriedem1 has joined #openstack-infra17:35
*** mrodden1 has joined #openstack-infra17:36
*** dolphm is now known as dolphm_afk17:37
*** ekarlso has joined #openstack-infra17:37
*** mriedem has quit IRC17:37
*** Ryan_Lane has joined #openstack-infra17:38
*** ruhe has joined #openstack-infra17:39
*** mrodden has quit IRC17:39
pabelangerSo, where can one go to see the stats for nodepool?  I know status.openstack.org/zuul/ has some graphs but is there anyplace else?17:40
*** mfer has quit IRC17:41
openstackgerritJaroslav Henner proposed a change to openstack-infra/jenkins-job-builder: Add batch_tasks support.  https://review.openstack.org/5746917:41
openstackgerritJaroslav Henner proposed a change to openstack-infra/jenkins-job-builder: Add seealso to batch_tasks from promoted_build.  https://review.openstack.org/5747317:41
openstackgerritJaroslav Henner proposed a change to openstack-infra/jenkins-job-builder: Add properties testing.  https://review.openstack.org/5765417:41
jeblairpabelanger: there are some more metrics at http://graphite.openstack.org/ under stats.nodepool17:42
*** mihgen has joined #openstack-infra17:42
pabelangerjeblair, great, thank you17:44
*** reed has quit IRC17:45
yolandahi jeblair, did you have any feedback for the licensecheck bug?17:45
*** markwash has joined #openstack-infra17:47
jeblairmordred: ^ please see question from yolanda17:49
jeblairmordred: you filed the bug, i'd like you to decide if it's redundant17:50
jeblairmordred: bug 95040717:50
uvirtbotLaunchpad bug 950407 in openstack-ci "jenkins should run licensecheck on all projects" [Low,Triaged] https://launchpad.net/bugs/95040717:50
*** CaptTofu has quit IRC17:52
*** julim has quit IRC17:53
*** rpodolyaka1 has joined #openstack-infra17:55
clarkbjog0: anteaya https://etherpad.openstack.org/p/critical-patches-gatecrash-November-2013 has a first draft of the situation tl;dr. There are a couple spots that I think you guys can help fill in as my familiarity with those particular problems isn't very good17:57
clarkbalso feel free to edit whatever, I am not too happy with the current state of the tl;dr17:57
*** herndon has joined #openstack-infra17:58
mordredjeblair, yolanda hrm. I think I agree with jog0 - I think it's an old bug and hacking is handling this now17:59
mordredyolanda: sorry - I should have closed that17:59
yolandanp18:00
*** mrmartin has joined #openstack-infra18:00
*** sarob has joined #openstack-infra18:00
jeblairclarkb: i'm curious about the bugs that caused things, and how they got in....18:01
clarkbjeblair: yeah going to start filling that data in18:02
jeblairclarkb: since there's one change in your etherpad i can track back...18:02
openstackgerritA change was merged to openstack-infra/jenkins-job-builder: Provide default ConfigParser object  https://review.openstack.org/4879018:02
*** derekh has quit IRC18:02
jeblairhttps://review.openstack.org/#/c/54363/ looks like the start of 125192018:02
jeblairand indeed... there are _3_ 'reverify no bug' comments18:03
jeblairsdague: ^ i think examining the comment log of that change will be instructive18:03
jeblairjog0: ^18:03
mordredok - just so everyone knows - I just wrote a script to -1 all of the PEP-427 related patches in the system18:03
yolandamordred, jeblair, so is there some low-hanging-fruit or easy bug where i could collaborate?18:04
mordredif anyone asks, we do not need to do this: https://review.openstack.org/#/c/57127/ in any way to support wheels18:04
fungiyeah, the reverifies on that were pointed out last night. amusing possible confirmation of our suspicions on psychology of nondeterminism in gating18:04
jeblairmordred: thank you18:04
rpodolyaka1mordred: in tripleo we already merged those patches... and kind of released projects since then... should I go and revert those patches and do point releases?18:05
*** arosen has quit IRC18:05
*** boris-42 has quit IRC18:05
*** melwitt has joined #openstack-infra18:05
rpodolyaka1mordred: or it's not critical and can be put to the next releases (we are doing them weekly)18:06
jeblairyolanda: do any other bugs here look interesting? https://bugs.launchpad.net/openstack-ci/+bugs?field.tag=low-hanging-fruit18:06
*** sarob has quit IRC18:07
clarkbjeblair: the last paragraph in the wrapup now has more info. That bug appears to be a case of forcing it through the gate multiple times until it passes18:07
*** sarob has joined #openstack-infra18:07
yolandajeblair, maybe that one https://bugs.launchpad.net/openstack-ci/+bug/118371618:08
uvirtbotLaunchpad bug 1183716 in openstack-ci "delete old jobs with Jenkins Job Builder" [Medium,Triaged]18:08
*** gyee has joined #openstack-infra18:08
*** markmcclain has quit IRC18:09
jeblairclarkb, jog0, mordred, sdague: after mulling it over, and seeing this evidence, i think "remove 'reverify no bug'" is one of the next steps we should take.18:09
Mithrandiryolanda: that one is fixed now.18:09
mordredrpodolyaka1: it's not critical, but yes please revert them18:09
Mithrandir(there's a --delete-old or something you need to pass, though)18:10
mordredrpodolyaka1: well - let me be more specific -18:10
clarkbjeblair: note that particular change only triggered 1251920 in one of the three failed gate attempts18:10
clarkbjeblair: but still I agree18:10
mordredrpodolyaka1: are the things that you merged them on actually py2/py3 compat?18:10
jeblairi think it's a good incremental step; and maybe we keep going after that, but it may help with this kind of situation18:10
rpodolyaka1mordred: I don't think so :(18:10
yolandai also looked at that one https://bugs.launchpad.net/openstack-ci/+bug/1193444 , but needed clarification18:10
uvirtbotLaunchpad bug 1193444 in openstack-ci "jenkins-job-builder doesn't properly invalidate cache" [Low,Triaged]18:10
mordredrpodolyaka1: ok. then yes please revert - but don't worry about re-releasing18:10
mordredit won't be a problem because we're not cutting wheels yet18:10
rpodolyaka1mordred: cool. thank you for clarifying!18:11
*** senk1 has joined #openstack-infra18:11
jeblairmordred: Unfortunately, I am not able to sign a CLA, so I can't contribute a patch. This puts me in a lousy position to complain about implementations18:11
jeblairmordred: https://bugs.launchpad.net/openstack-ci/+bug/119344418:12
uvirtbotLaunchpad bug 1193444 in openstack-ci "jenkins-job-builder doesn't properly invalidate cache" [Low,Triaged]18:12
*** sarob has quit IRC18:12
clarkbdoes JJB require a CLA >_>18:12
openstackgerritRussell Bryant proposed a change to openstack-infra/config: Add gate-solum-devstack job  https://review.openstack.org/5709818:12
Shrewsfungi, olaph: Where am I drinking away my sorrows tonight?18:13
Mithrandirclarkb: did when I wanted to contribute to it18:14
mordredjeblair: sigh18:14
clarkbmordred: k18:14
clarkber Mithrandir18:14
Mithrandir(not a complain, a data point)18:15
Mithrandircomplaint18:15
olaphShrews: flying burrito / lynnwood grill18:15
jeblairMithrandir: please feel free to complain...18:15
anteayaclarkb: I took a stab at it, not because I understand it - I just summarized the commit messages from the bug fix patch and two dependencies18:16
clarkbolaph: you guys have burritos that fly in NC?18:16
clarkbanteaya: thanks18:16
anteayaclarkb: let me know if you think I need more eyes on it18:16
jeblairfor that matter if you or anyone you know of has problems contributing because of the CLA, please let us or stefano (reed) know; we need data on that point.18:16
anteayanp, thanks for organizing the wrap-up, we need a summation18:16
Mithrandirjeblair: meh, given I've signed the CLA, I don't really care.  I don't see the point in CLAs in general (I believe that you give the recipient about the same rights by submitting a patch as it it), but it's not really the role of random person who rocks up to complain about the procedures of a project18:17
MithrandirIMO, at least.18:17
olaphclarkb: you have to have them flown into NC.  domestic production is prohibited18:17
jeblairMithrandir: tbh, i'm no longer sure what rights our cla gives.  when i read it, it certainly didn't match up with what people say about it.18:18
*** hogepodge has joined #openstack-infra18:18
*** boris-42 has joined #openstack-infra18:18
*** boris-42 has quit IRC18:19
Mithrandirjeblair: most people aren't trained in reading legal documents18:19
Mithrandirso you might get pretty wild interpretations18:19
fungiMithrandir: i think there are a lot of us who would like to see potential contributors who have legal or philosophical misgivings about agreeing to our cla speak up, and loudly18:21
jeblairfungi: ++ we need data -- i'm not asking people to complain for no purpose -- i'm asking people to speak up (like that person did in the bug) so that we know about concerns18:22
fungiat the moment we're enforcing a legal agreement for contribution, and one of the primary reasons it persists, i believe, is because there's a perception that it doesn't get in the way of attracting capable contributors to the project18:22
fungiso if it really does, then that's something we absolutely need to know18:23
portantefolks, should we be holding off approvals still?18:23
clarkbportante: let me check the gate18:23
portante44 looks like "a lot"18:23
*** afazekas has quit IRC18:23
fungiportante: yes, we've still got a couple of fixes percolating in the gate18:23
clarkbportante: 57193,2 is second in the gate queue and will work around 125192018:23
clarkbso having things behind that is probably mostly ok18:24
portantek, thx18:24
clarkbassuming it doesn't get bumped out due to some other failure :/18:24
jeblairmgagne: yolanda wants to work on https://bugs.launchpad.net/openstack-ci/+bug/1193444 can you help determine the current state of that bug?18:24
uvirtbotLaunchpad bug 1193444 in openstack-ci "jenkins-job-builder doesn't properly invalidate cache" [Low,Triaged]18:24
fungiright, which will most likely mean we dump the gate again so we can requeue it at the head18:24
clarkbportante: we are trying to summarize the various issues at https://etherpad.openstack.org/p/critical-patches-gatecrash-November-2013 any chance you can fill in some of the blanks around the swift things?18:24
clarkbportante: I started a paragraph for the swift things but it is very incomplete18:24
Mithrandirfungi: well, I think it adds a barrier, and I don't think it gives you particularly much in terms of legal shielding so I think it should go, at least for more peripheral projects.18:24
Mithrandirit's not a problem _for me_ because I work in small companies that are flexible with those kinds of things.18:25
MithrandirI know really capable people who work for large universities who can't sign CLAs for instance, but I have no idea if they'd be interested in openstack. :-)18:26
Mithrandir(Stanford comes to mind)18:26
fungiMithrandir: i *personally* would be fine seeing it go away everywhere. some openstack member companies are a fan of it, so it's there at least until we have real evidence it's a problem18:26
fungiMithrandir: and yes, i'd love rra to say he wants to contribute to one of our projects but is disallowed18:26
Mithrandirooi, where do you know rra from?18:27
Mithrandir(and yes, he's one of the people I was thinking of)18:27
Mithrandiryou'll always have under-reporting of it too, since people who work for such an institution who'll see the CLA and then just turn away.18:27
notmynameclarkb: it would be nice to order (or at least mention) the relative weight of these issues. perhaps using the elastic recheck results to show their relative weight in causing issues.18:27
fungidebian. he sponsors stuff into the archive because i'm too lazy^H^H^H^Hbusy to go through nm18:27
*** rnirmal has joined #openstack-infra18:27
clarkbnotmyname: good idea, I will add that info18:28
jeblairMithrandir: everyone knows rra :)18:28
Mithrandirfungi: ah, right.18:28
*** markmcclain has joined #openstack-infra18:29
jeblairMithrandir: he maintained the gnu project's news server and mail-news gateway... handles afs and kerberos packaging (he made installing afs easy!)... he's super-human.  :)18:29
fungiyes, he is rather an amazing guy18:29
jeblairand yeah, when i worked at UC Berkeley, i would have had trouble contributing.  i might have been able to -- with quite a lot of work.18:30
Mithrandirjeblair: I know, it's great.18:30
Mithrandir(all that rra does, not that you'd have trouble contributing)18:30
*** mfer has joined #openstack-infra18:30
*** julim has joined #openstack-infra18:30
jeblairit's kind of daft of us to say we don't really care for contributions from people at stanford and berkeley and, well, most r1 universities.18:30
jeblairheh :)18:31
jeblairyolanda: i think bug 1193444 may be heading into a somewhat-unresolvable place... i think there are different ideas of how jjb should be used...18:32
uvirtbotLaunchpad bug 1193444 in openstack-ci "jenkins-job-builder doesn't properly invalidate cache" [Low,Triaged] https://launchpad.net/bugs/119344418:32
jeblairyolanda: my personal feeling is that the cache support is probably as good as it can get, and if you don't want it, then you need to turn it off, and there's a flag to do that now.18:33
jeblairyolanda: so i think we may want to close that bug too, now that '--ignore-cache' has been added18:33
yolandaok18:34
jeblairi'd want another jjb developer who's more familiar with that to weigh in (like mgagne), but i'm starting to think it's probably not a good one.18:34
jeblairgee, sorry.  :(18:34
jeblairi think maybe on our next bug triage day, we should actually pay much closer attention to the low-hanging-fruit bugs, because this isn't a very good experience for a new contributor18:35
jeblair(i think stale lhf bugs are worse than stale advanced bugs for this reason)18:35
mgagnejeblair: there is already a way to bypass cache. An additional feature would be to allow the user to permanently disable the cache with a config.18:36
clarkb++ also we should schedule our bug days now that we know the release schedule18:36
jeblairmgagne: oh, that's a good idea18:36
fungiShrews: olaph: yes, flying burrito is next to the raleigh grande theater on grove barton rd (just off glenwood/us70 and lynn rd). lynnwood grill is right across grove barton rd from flying burrito, so we can head there second if you want. still planning on 6:30 pm est?18:37
jeblairyolanda: ^ want to add a config file option that does the same as --ignore-cache and use that to close 1193444?18:37
jgriffithAnybody know how to reset a job that's apparently *stuck*: https://review.openstack.org/#/c/55923/18:37
jgriffithstarted gate on the 19th and never returned18:38
fungijgriffith: is it still showing on status.openstack.org/zuul? if so, push a trivial rebase or similar minor patchset update18:38
jgriffithfungi: it's not, and will do thanks18:38
fungithat was one of the changes being tested by jenkins01 when it spontaneously died on monday18:39
jeblairjgriffith: if it's not showing, just 'recheck no bug'18:39
clarkbnotmyname: I have filled out some numbers for the things that I have good info on in the etherpad18:39
notmynameclarkb: thanks (looking..)18:39
clarkbnotmyname: still waiting on swift things (I can dig them up if I have to, but need to run and do errands here shortly so won't get to that soon)18:39
jgriffithfungi: yeah, thought that might be the case.  Anywho... thanks for the tip18:39
fungijgriffith: and yeah, as jeblair said, if it's not showing on the status page then a recheck ought to work fine18:40
notmynameclarkb: ya, I was hoping for some numbers on the DBConnection timeout issue. my understanding is that it was a suspect, and may have shown up a couple of times, but perhaps only affected swift patches (not openstack-wide ones)18:40
fungijgriffith: but may take a few minutes to show up, what with the current load18:40
yolandajeblair, not sure if i follow you18:40
lifelesshow's the gate looking this morning?18:40
notmynameclarkb: I didn't see it on the elastic recheck page, so I don't know where to look18:40
*** rongze has quit IRC18:40
* portante runs a kibana search on it18:41
clarkbportante: feel free to fill in info at https://etherpad.openstack.org/p/critical-patches-gatecrash-November-201318:41
notmynameportante: ah, thanks18:41
fungilifeless: improving, and the second change from the head of the integrated queue should improve it substantially as well18:41
portantehttp://logstash.openstack.org/#eyJzZWFyY2giOiJtZXNzYWdlOlwibG9ja2VkXCIgQU5EIGZpbGVuYW1lOlwibG9ncy9zeXNsb2cudHh0XCIiLCJmaWVsZHMiOltdLCJvZmZzZXQiOjAsInRpbWVmcmFtZSI6ImN1c3RvbSIsImdyYXBobW9kZSI6ImNvdW50IiwidGltZSI6eyJmcm9tIjoiMjAxMy0wOS0wMVQxNzo0NzoxMCswMDowMCIsInRvIjoiMjAxMy0xMS0yMFQxNzo0NzoxMCswMDowMCIsInVzZXJfaW50ZXJ2YWwiOiIwIn0sInN0YW1wIjoxMzg1MDU5Mjc1OTQwLCJtb2RlIjoiIiwiYW5hbHl6ZV9maWVsZCI6IiJ918:42
lifelesswas the bad oslo local in nova an issue, or a false fix?18:42
jeblairyolanda: mgagne suggested that it would be useful to have a config file option that did the same thing as the '--ignore-cache' command line option, so you could permanently set it for your system if you didn't want to use a cache.18:42
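[editor's sketch] One way the suggested option could behave: the command-line flag and a config file setting are combined, so either one disables the cache. This is a sketch under stated assumptions, not jenkins-job-builder's code. Only the '--ignore-cache' flag comes from the discussion; the section name `job_builder`, the option name `ignore_cache`, and the function `resolve_ignore_cache` are hypothetical.

```python
import argparse
import configparser


def resolve_ignore_cache(argv, config_text):
    """Return True if the cache should be bypassed for this run."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--ignore-cache", action="store_true")
    args = parser.parse_args(argv)

    config = configparser.ConfigParser()
    config.read_string(config_text)
    # Hypothetical section/option names; a missing section or option
    # falls back to False, which preserves today's caching behaviour.
    from_config = config.getboolean("job_builder", "ignore_cache", fallback=False)

    # The flag can only turn the bypass on, so a per-run flag and a
    # permanent config setting never conflict.
    return args.ignore_cache or from_config
```

With this shape, a user who never wants the cache sets the option once, while everyone else keeps the existing default.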
clarkblifeless: I have numbers up at https://etherpad.openstack.org/p/critical-patches-gatecrash-November-2013 looks like it may have fixed the biggest problem18:43
yolandaok, sounds useful18:43
yolandai'll take a look at it18:43
* lifeless does the I was useful dance18:43
portantenotmyname: there is no record of these happening before November 8th18:44
lifelessand can we now stop copying code into projects?18:44
portantethese being "database is locked" errors from swift18:44
notmynameportante: was that when the pypy refactor went in?18:44
portanteI am not sure18:45
portantewell18:45
portanteno, that happened before havana was wrapped up, if I remember right18:45
*** yaguang has quit IRC18:45
clarkbthough it looks like that locked stuff is still happening?18:46
notmynameportante: all of those seem to have a build_status of SUCCESS18:47
clarkbportante: note we only have 2 weeks of indexed logs in elasticsearch18:47
notmynamehow do I filter? build_Status:!SUCCESS breaks it18:47
clarkbnotmyname: NOT build_status:"SUCCESS"18:47
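[editor's sketch] The filter clarkb gives can be appended to an existing search as one more AND clause. The helper below is hypothetical and only assembles the query string; the field names `message`, `filename`, and `build_status` are the ones that appear in this discussion, and the example values are assumed to match the search being shared here.

```python
def failed_builds_query(message, filename):
    """Build a Lucene-style query that excludes successful builds."""
    clauses = [
        'message:"{}"'.format(message),
        'filename:"{}"'.format(filename),
        # Negation is spelled NOT field:"value"; the field:!value
        # spelling is what broke the search.
        'NOT build_status:"SUCCESS"',
    ]
    return " AND ".join(clauses)


if __name__ == "__main__":
    print(failed_builds_query("locked", "logs/syslog.txt"))
```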
portanteclarkb: oh18:47
notmynameclarkb: thanks18:47
notmynameportante: http://logstash.openstack.org/#eyJzZWFyY2giOiJtZXNzYWdlOlwibG9ja2VkXCIgQU5EIGZpbGVuYW1lOlwibG9ncy9zeXNsb2cudHh0XCIgYW5kIE5PVCBidWlsZF9zdGF0dXM6XCJTVUNDRVNTXCIiLCJmaWVsZHMiOltdLCJvZmZzZXQiOjAsInRpbWVmcmFtZSI6ImN1c3RvbSIsImdyYXBobW9kZSI6ImNvdW50IiwidGltZSI6eyJmcm9tIjoiMjAxMy0xMS0wMVQxNzo0NzoxMCswMDowMCIsInRvIjoiMjAxMy0xMS0yMFQxNzo0NzoxMCswMDowMCIsInVzZXJfaW50ZXJ2YWwiOiIwIn0sInN0YW1wIjoxMzg1MDU5NjcwMTU3LCJtb2RlIjoiIiwiYW5hbHl6ZV9maWVsZ18:48
notmynameCI6IiJ918:48
clarkbportante: I want to extend that more, but I have to be careful because disk18:48
clarkbalso RAM18:48
notmynamehmm...need a URL shortener on those18:48
*** ryanpetrello has quit IRC18:49
notmynameclarkb: it's like you need a scalable place to put large amounts of unstructured data18:49
portanteclarkb: understood18:49
portante:)18:49
portantegot get 'em PTL of Swift!18:49
portantego18:49
clarkbnotmyname: I am not sure how inverted indexes on swift would do18:49
clarkbreally the big problem is the more data you put on disk the more data it wants to load into memory to perform queries18:50
jog0jeblair: I agree we should get rid of reverify no bug and I would like to get rid of recheck no bug as well18:50
clarkbeventually you need bigger machines than rax provides18:50
*** krtaylor has quit IRC18:50
notmynamejog0: counterpoint is that opening the gate was all "reverify no bug"18:51
*** mrodden1 has quit IRC18:51
jog0notmyname: fair enough, I think this is a good place to start18:51
*** mriedem1 has quit IRC18:51
clarkbnotmyname: core reviewers and infra can open the gate without reverify no bug18:51
clarkbjust leave +A votes18:51
jeblairnotmyname: well, it was a lot of 'reverify bug 1251920'18:51
uvirtbotLaunchpad bug 1251920 in nova "Tempest failures due to failure to return console logs from an instance" [Critical,Fix committed] https://launchpad.net/bugs/125192018:52
jeblairnotmyname: breaking the gate was a lot of 'reverify no bug'; see https://review.openstack.org/#/c/54363/ for an example18:53
dprincejeblair: I'm not a fan of getting rid of the 'no bug' options. Sure they get overused... but in some cases (for dependency related failures) they are very useful18:53
notmynameI'd just hate to see removing functionality as a reaction to a current bad situation. I agree that it should include bug numbers, but it seems similar to "OMG somebody deleted all the data, let's disable deletes"18:53
jeblairdprince: dependency related failures?18:53
clarkbfwiw I think we completely get rid of reverify18:53
*** xeyed4good1 has joined #openstack-infra18:53
* jog0 reads clarkb's wrapup18:53
dprincejeblair: yes. Like I push two branches across different projects. One is a WIP that depends on the other to land first.18:53
jog0oh clarkb so everything big merged?18:54
clarkbjog0: not yet18:54
jog0clarkb: whats left?18:54
jog0920?18:54
clarkbjog0: the fix for 1251920 is still in the gate18:54
dprincejeblair: when the one lands I recheck no bug and would expect to see a pass.18:54
dprincejeblair: it isn't a bug... rather an expected failure18:54
clarkbjog0: and the change to reenable nova v3 changes needs hand holding18:54
jeblairdprince: i said 'reverify no bug' -- not 'recheck no bug' :)18:54
clarkber nova v3 tempest tests18:54
jog0clarkb: the v3 test enabling should be done after this is all over18:54
jeblairdprince: but i see jog0 brought up 'recheck no bug'18:54
jog0aka when we are back to normal18:54
clarkbjog0: ok, just want to make sure it doesn't get forgotten18:55
fungi57193 is very close to merging though18:55
jog0clarkb: agreed18:55
jeblairdprince: and indeed, that is one of the reasons i think i'm not ready to get rid of 'recheck no bug'.  i also used it today on an old patch.18:55
dprincejeblair: Fair enough. I'm concerned we'll take it too far.18:55
jog0I think cyeoh is on it18:55
fungipotentially 15 minutes if it and the change ahead of it don't step in something18:55
jog0cyeoh: ^18:55
jog0jeblair: I don't want to get rid of recheck no bug and not have anything else18:55
jeblairjog0: k18:56
jog0but recheck thisisn'tabugIswear18:56
jog0or something like that18:56
fungicross my hear and hope to die, stick a revert in my eye18:56
fungiheart18:56
*** pete5 has joined #openstack-infra18:56
clarkbrechecks aren't the issue imo18:56
jog0because recheck no bug when it is a bug18:56
jog0is very bad18:56
clarkbrechecks happen prior to merging and the gate18:56
jog0clarkb: let me rephrase18:57
clarkbthe big problem here is you can force bad changes into the gate if they are bad only 25% of the time18:57
jog0rechecks aren't bad yes, unknown bugs in the gate are bad18:57
jog0so maybe recheck no bug isn't the answer but devs ignoring gate bugs is bad18:57
*** dkliban_ has quit IRC18:58
jeblairclarkb: though the extra test runs from the check queue are useful, and when 'recheck no bug' is used there out of laziness, it's as bad as a 'reverify no bug' in the gate.18:58
*** mrodden has joined #openstack-infra18:58
jeblair(eg, if the check caught a bug in the patch but it was ignored)18:58
jeblair(and the gate subsequently misses it)18:58
jog0jeblair: right18:58
*** mriedem has joined #openstack-infra18:58
clarkbjeblair: hmm good point18:59
fungiif we did keep "no bug" maybe we should allow (require?) comments in the pattern. i know that i always feel like i need to justify why i recheck/reverify no bug and end up doing it in a second gerrit review comment instead18:59
jog0fungi: I can get behind that19:00
jog0fungi: cool 8 minutes away (I hope) from 920 being fixed19:01
nikhil__hi, can someone help me resolve jenkins issues on this MP https://review.openstack.org/#/c/54198/ ?19:02
nikhil__seems like, it's failing consistently due to different kinds of issues19:02
funginikhil__: welcome to the party ;)19:02
nikhil__was wondering how to do a recheck with many bugs?19:02
nikhil__fungi: heh :)19:02
funginikhil__: the recheck pattern only allows one bug number19:03
fungifor now, feel free to just pick one, though hopefully those should cease being a problem here shortly (i think several already may be solved)19:03
nikhil__oh, gotcha. Thanks fungi19:04
nikhil__would you rather want me waiting while the issues in the gating process are resolved?19:04
nikhil__or incoming recheck requests are fine?19:04
funginikhil__: if you don't mind, that would be great. your chances of getting a passing verify score from jenkins will be going up considerably here shortly, we hope19:05
openstackgerritDouglas Mendizabal proposed a change to openstack-infra/config: Rename Project Barbican channel  https://review.openstack.org/5774119:05
*** mrmartin has quit IRC19:05
sdaguejeblair: +1 to removing reverify no bug19:05
nikhil__sure fungi . I will wait (anything that can be helpful :) )19:05
jog0FWI http://paste.openstack.org/show/53765/19:05
*** sarob has joined #openstack-infra19:06
jog0things are still pretty shake19:06
jog0bad19:06
jog0grenade is still failing all the time19:06
*** senk1 has quit IRC19:06
dprincesdague: maurosr is unresponsive. Can you clue me in on what exactly needs to happen w/ grenade. I see no reason to wait around?19:06
*** senk has joined #openstack-infra19:07
*** dolphm_afk is now known as dolphm19:07
*** thomasbiege has joined #openstack-infra19:07
notmynamejog0: I think we just had a "no bug" fail in swift, BTW19:07
*** xeyed4good1 has quit IRC19:08
notmynameso if a gate job failed due to a timeout, is that "no bug"?19:09
portantenotmyname: are you talking about: https://review.openstack.org/5462019:09
rfolcoHello folks. Is there an easier way to skip multiple Tempest tests than using the @skip_because decorator?19:10
notmynameportante: ya19:10
portantethat was weird, though19:10
portantethe swift functional tests failed on the check job, for some weird reason19:10
notmynameportante: functests failed due to a tcp drop, and grenade was killed after 60 minutes19:10
notmynameportante: ya. error talking to git19:10
portantebut there was a check and a gate job running at the same time19:10
jog0notmyname: if something times out at least file a  bug for hey it timed out19:11
jog0notmyname: plus gate isn't fixed yet19:11
portanteif you look there was a reverify no bug by you, then dfg did a recheck bug 125192019:11
uvirtbotLaunchpad bug 1251920 in nova "Tempest failures due to failure to return console logs from an instance" [Critical,Fix committed] https://launchpad.net/bugs/125192019:11
*** rongze has joined #openstack-infra19:11
jog0notmyname: we are in bug fixes only in gate now19:11
portantethat reverify no bug came from last night19:12
portantethat was not done this morning19:12
jog0ahh19:12
jog0we were in gate freeze last night too19:12
fungigah. 57193 restarted because the change ahead of it failed19:12
notmynameportante: ya, that's when it was ok to try some stuff because it was probably ok19:12
maurosrdprince: sdague hi sorry, two meetings in the last hours, pushing it19:13
fungiMismatchError: 9 != 10 (which one was that again?)19:13
notmynameyay, the patch at the front of the gate just failed. does that mean another 60 minutes for the good patch to get in?19:13
sdaguemaurosr: great19:13
jog0clarkb: I am just going to do a brain dump for the wrapup and hope someone can help me turn it into something coherent19:13
dprincemaurosr: thanks19:13
*** jgrimm has joined #openstack-infra19:14
maurosrsdague: I still have lots of refactors to avoid duplication, but will do them in a separate commit; they were breaking my patch, so I need to think it through better19:14
sdaguemaurosr: ok, well if you have a rev out there, I can help with it as well19:15
sdagueit would be good to unblock dprince on this19:15
*** senk has quit IRC19:16
maurosrsdague: the ones moving each release to a separate file are out; the refactor is just some functions that exist in every single file and could be reused. I don't think it is a blocker because they are not really necessary19:16
sdagueok, cool, lets see how they handle tests19:16
jog0sdague: is gerande worging again?19:16
sdagueI'll review post lunch19:16
jog0working*19:17
sdaguejog0: well, my fix landed19:18
*** hogepodge_ has joined #openstack-infra19:18
*** rongze has quit IRC19:18
*** hogepodge has quit IRC19:19
*** hogepodge_ is now known as hogepodge19:19
sdaguethe last grenade fail I see is unrelated - http://logs.openstack.org/05/55405/5/check/check-grenade-devstack-vm/7d7de61/console.html19:19
jog0sdague: cool19:20
anteayayay sdague19:20
jog0sdague: when did it land?19:20
sdague2 hours ago19:21
jog0sdague: ahh that makes sense then19:21
jog0too early to tell19:21
sdagueso the timeout is still 60?19:21
sdaguethe slow node setup time makes me think we should up that regardless19:22
jog0as far as I know19:22
jog0++19:22
jeblairi'm looking into slow node setup time19:22
sdaguejeblair: cool19:22
sdaguethat last change I posted, 30 minutes to get to the main run19:22
zaroclarkb: would you be able to reply to paul's comment? https://review.openstack.org/#/c/47937/10/modules/openstack_project/manifests/review_security.pp19:22
jeblairi think it might be time to remove the zuul repos (and remove their zuul refs)19:24
jeblairso if we restart zuul again, we should 'rm -fr' its working directory19:24
fungijeblair: noted19:24
*** johnthetubaguy has quit IRC19:25
*** herndon has quit IRC19:25
zarofungi: i have reread this.. http://lists.openstack.org/pipermail/openstack-infra/2013-October/000314.html19:26
funginotmyname: 53 minutes now (so sayeth zuul's estimate)19:26
jeblairhttp://cacti.openstack.org/cacti/graph_image.php?action=view&local_graph_id=388&rra_id=319:26
jeblairis an absurd graph.  it's mostly due to git upload packs19:26
zarofungi: i'm not sure what the hold up is, can you refresh my memory?19:27
fungizaro: sure--rereading now so i can refresh mine first ;)19:27
sdaguejeblair: heh19:28
jog0jeblair: what do you think about a zuul mode that only merges code with a bug associated with it?19:28
jog0telling 200 people not to approve patches .. is hard19:29
jog0clarkb: how long was the gate queue when we flushed it?19:30
notmynamejog0: 127 (give or take a few) IIRC19:31
fungizaro: mmm, everything seems covered in that thread if you didn't have any other follow-up there. i think i just need to review that outstanding change again19:31
jeblair14519:31
*** ryanpetrello has joined #openstack-infra19:32
zarofungi: cool, i was just about to git review new patch.  it's # 4793719:32
fungizaro: yep, was just looking at it. seemed to be failing tests and wip19:32
lifelessis there some way to stop a meeting someone else started?19:33
zarofungi: ohh, looks like clarkb is not around.  would appreciate it if you could reply to paul's comment.19:33
lifelesssee #openstack-meeting19:33
morganfainberglifeless, #endmeeting ?19:33
lifelessmorganfainberg: 'see #openstack-meeting' :)19:33
openstackgerritKhai Do proposed a change to openstack-infra/config: Setup a private gerrit instance for security reviews  https://review.openstack.org/4793719:33
*** dkliban_ has joined #openstack-infra19:34
zarofungi: was in wip because i thought we were in holding pattern. new patch fixes the white space chars.19:35
jog0notmyname jeblair: remember how long it was in hours?19:36
notmynamejog0: 18?19:36
jog0notmyname: that sounds about right19:36
morganfainbergbefore i go and reverify/recheck things for keystone, wanted to do a temperature check in here.  not sure where we are sitting wrt gate this morning, (not seeing an explicit all clear)19:37
morganfainbergyou know, not making a bad situation worse.19:38
fungizaro: replied to the inline comment19:39
*** thomasbiege has quit IRC19:39
jog0morganfainberg: please wait19:40
morganfainbergjog0, figured as much, this is why i asked :)19:40
jog0morganfainberg: the last big bug patch 57193 is at the top of the queue19:40
jog0but until it's merged we want to hold off turning the floodgates on19:40
jog0morganfainberg: thank you very much for asking19:40
morganfainbergah cool.  will keep my eye on that one, thanks for all the awesome work the last day.19:40
zarofungi: cool. thx.  i guess the thing i'm really not sure of is config in All-Projects-review_security.config19:41
*** kgriffs has left #openstack-infra19:43
fungizaro: yeah, i need to give the whole thing a detailed review. but as for that file we probably want to turn it into a section in docs/source/gerrit.rst instead i think (or add a similar file for review-security.rst or something)19:43
*** gyee has quit IRC19:43
fungizaro: though we could test it as is and see if applying that works19:43
*** rongze has joined #openstack-infra19:45
*** ruhe has quit IRC19:45
zarofungi: not sure what you mean.  are you suggesting that we just need to document it? or replace what's in the review?19:45
fungizaro: there's a chicken-and-egg problem with trying to puppet gerrit from scratch and add an all-projects acl19:46
fungizaro: we work around that at the moment by simply documenting what the initial administrator should configure there19:46
openstackgerritSergey Lukjanov proposed a change to openstack-infra/jeepyb: Savanna client using separated LP project now  https://review.openstack.org/5775219:47
*** krtaylor has joined #openstack-infra19:48
zarofungi: ahh.  so instead of puppeting all-projects those configs need to be set manually?  it seems like puppeting is doable, no?19:48
fungizaro: not safely... at least certainly not initially19:48
fungizaro: to be able to apply the acl you have to push it through gerrit's ssh git interface19:48
fungizaro: and that acl is what determines who and how changes are allowed to be pushed19:49
*** mihgen has quit IRC19:50
fungizaro: it *can* be applied directly on the filesystem via local git operations instead, but that skips gerrit's acl validation, group uuid creation and so on19:50
*** rongze has quit IRC19:50
*** rpodolyaka1 has quit IRC19:51
*** fbo_away is now known as fbo19:51
*** sarob has quit IRC19:51
fungizaro: yolanda worked out a way to do it, but i think it involves using git to locally apply a very minimal and rigorously tested all-projects acl along with corresponding database queries to emulate the group uuid mapping which corresponds to that, then as a second stage push the more complex all-projects acl through the normal update mechanism in gerrit19:52
*** sarob has joined #openstack-infra19:52
fungiwhile it might be nice to have, it's tricky and certainly not trivial to implement19:52
*** mihgen has joined #openstack-infra19:54
*** yolanda has quit IRC19:55
*** whayutin_ has joined #openstack-infra19:56
*** weshay has quit IRC19:56
zarofungi: ok.  i was just setting up locally on my machine so didn't realize the complexity.  i guess i should change all-projects config to documentation instead?19:56
*** sarob has quit IRC19:56
fungizaro: i think so. that's what we already do for our public gerrit instance on review.o.o (merely document what all-projects should look like)19:57
*** sarob has joined #openstack-infra19:59
*** sarob has joined #openstack-infra19:59
*** ericw has quit IRC20:01
*** danger_fo_away is now known as dangers20:01
*** whayutin_ is now known as weshay20:02
fungizaro: i left a comment on the change summarizing20:02
zarofungi: excellent.  thanks.20:04
jog0anyone want to help me build the bug timeline?20:04
*** hashar has joined #openstack-infra20:07
*** herndon has joined #openstack-infra20:10
*** vipul is now known as vipul-away20:15
*** vipul-away is now known as vipul20:15
jog0looks looks like https://review.openstack.org/#/c/57193/ will merge20:19
mgagnezaro: any experience with jenkins slaves running windows?20:20
fungijog0: there's still a grenade failure behind it though20:20
*** ^d has quit IRC20:21
notmynamejog0: unless it times out... ;-)20:22
jog0fungi: link?20:22
jog0notmyname: :(20:22
jog0jeblair sdague: is the timeout bump for grenade in the queue20:22
mikaljog0: so... how goes operation WTF Console Log?20:23
jog0mikal: http://status.openstack.org/zuul/ it's at the top of the queue20:23
jog0its all up to greande20:23
jog0grenade20:23
fungijog0: looks like this one is trying to apt-get over the network. and running into connectivity issues... https://jenkins02.openstack.org/job/gate-grenade-devstack-vm/17202/console20:23
jog0fungi: ahh20:24
fungiso unrelated20:24
jog0uh oh20:25
jog017201 has 2 minutes20:25
jog0and is on hpcloud20:25
fungijog0: the console log on this one is a bit on the brief side (haven't looked at what uploaded to logs.o.o yet): https://jenkins01.openstack.org/job/gate-tempest-devstack-vm-neutron-large-ops/8656/console20:25
jog0fungi: yeah that console log is short20:25
fungisame for https://jenkins01.openstack.org/job/gate-tempest-devstack-vm-neutron/21816/console20:25
jog0ohh really short20:25
jog0looks like a network bug20:26
fungishort log with a looooong gap20:26
jog0lol20:26
jog0mikal: looks like https://review.openstack.org/#/c/57193/ passed20:26
notmynamejog0: does this mean you'll have to flush the whole gate again to not get stuck behind 46 other changes (all tested without this patch)20:26
jog0as long as it beats the buzzer20:26
fungijog0: came in just under the wire20:26
notmynamesuccess!20:27
jog0notmyname: wrong patch20:27
jog0mikal: ^20:27
jog0https://jenkins02.openstack.org/job/gate-grenade-devstack-vm/17202/console puppetlabs20:27
jog0WAT20:28
jog0why20:28
jog0fungi: ^20:28
jog0fungi: so if apt.puppet goes down we do t ?20:28
jog0to20:28
fungijog0: very good question. i think we don't want that, no20:28
jog0clarkb jeblair sdague: I think we are out of critical mode20:29
jog0turn off the ceremonial bug fix only mode?20:29
jog0mikal: ^20:29
fungijog0: we install puppet from there to use for configuring the slave image. devstack then wants to update package lists20:29
jog0clarkb: and reload the queue20:29
zaromgagne: did i hear windows?20:29
* jog0 files a bug against infra20:29
mgagnezaro: sort of. lets say it's a non-linux OS.20:29
*** CaptTofu has joined #openstack-infra20:31
jog0fungi: ^20:31
jog0fungi: https://bugs.launchpad.net/openstack-ci/+bug/125377420:31
uvirtbotLaunchpad bug 1253774 in openstack-ci "Reduce number of apt sources that must be up for gate to work" [Undecided,New]20:31
fungijog0: it might be possible to remove the puppet sources.list entry after we build the image, before it gets snapshotted20:31
fungijog0: but i think we should probably also have a close look at the list to make sure there aren't other things in a similar situation and clean them all up consistently20:31
zaromgagne: i have some experience with that sorta thing.20:32
jog0fungi: agreed, I filed a bug because this isn't a fix this ASAP issue20:32
*** mrodden has quit IRC20:32
fungijog0: great--thanks! i'll triage and get some thoughts in there in case i don't work on it right away20:32
fungi(which seems likely)20:32
jog0fungi: thanks, and with regard to turning on the full gate flood?20:32
fungijog0: i'm in favor, but looking for consensus20:33
jog0fungi: agreed20:33
anteayaneutron has been business as usual since last night20:33
fungijog0: though it might be nice to watch one more iteration of jobs to see how fast the current 45 drop20:33
jog0true20:34
anteayaI had to give the Japanese core some sense of direction20:34
anteayacouldn't leave them hanging and then go to bed20:34
fungianteaya: i think the question is more with regards to auto-reverifying the 100+ we had from the original dump20:34
anteayaah20:34
jog0fungi: so can you babysit zuul20:34
jog0I am going to keep working on the report20:35
fungijog0: for a few hours, yeah20:35
anteayasome folks are coming to me and asking for recheck reverify, which I have been doing manually20:35
mgagnezaro: I think I found what I was looking for. I'll give it a try until I cut myself.20:35
anteayacan we do stages of 25 patches at a time?20:35
*** mriedem1 has joined #openstack-infra20:35
*** mrodden1 has joined #openstack-infra20:35
jog0fungi: cool, I think we should decide about gate by the end of the hour20:35
fungijog0: agreed20:35
jog0and ping me if anything fails20:35
fungiwill do20:35
jog0fungi: we should also prep an email and the requeue old stuff scripts just in case20:36
clarkbjog0: sorry just got back from errands (woo new lease)20:36
clarkbjog0: is there a tl;dr on the info you need?20:36
jog0clarkb: yeah20:37
anteayaclarkb: congrats on new lease20:37
jog0clarkb: https://review.openstack.org/#/c/57193/ merged20:38
clarkbwoot!20:38
*** vipul is now known as vipul-away20:38
*** vipul-away is now known as vipul20:38
jog0clarkb: so fungi and I decided to watch zuul till the end of the hour20:38
jog0and decide if we call an all clear and requeue the stuff we bumped20:38
*** mriedem has quit IRC20:38
clarkbok20:39
jog0clarkb: and I am working on the report20:39
fungibasically if it looks like the gate is clearing quickly, then it should be safe to dump in the remainder20:40
fungiotherwise, there's still work to do20:40
fungii've poked at the bug timeline a little, but not sure what you're looking for next (an analysis of what changes caused them to emerge?)20:40
zaromgagne: don't try too hard. x64 win is not real nice with jenkins.20:41
jog0fungi: I first want to list when each bug went into play20:41
fungiahh, okay20:42
jog0for the wrapup I was going to explain how we got into the state we did (bug by bug blow type thing)20:42
jog0and our fixes20:42
jog0just to give people a better idea of how bad it was20:42
mgagnezaro: ... :-/20:43
*** rongze has joined #openstack-infra20:47
*** markmcclain has quit IRC20:47
jog0wow this timeline is a little scary20:50
jog0essentially just 2 bugs did us in20:50
*** markmc has quit IRC20:50
*** rongze has quit IRC20:51
*** vipul is now known as vipul-away20:53
openstackgerritJames E. Blair proposed a change to openstack-infra/devstack-gate: Add timestamps to devstack-gate output  https://review.openstack.org/5777020:53
*** vipul-away is now known as vipul20:54
jeblairwow we run a lot of jobs on dg changes :)20:54
fungijeblair: and remember we only just stopped running stable/folsom compat jobs on it last week ;)20:55
fungithere were more than a few of those as well20:55
clarkbjog0: 1251920 and the nova oslo thing?20:55
*** rfolco has quit IRC20:56
jog0clarkb: let me rephrase20:57
jog02 bugs put us over the edge20:57
clarkbI see20:57
jog01251920 and parallel grenade and everything that involved20:57
*** markmcclain has joined #openstack-infra20:57
jog0we had enough underlying bugs that that's all it took20:57
jog0and 1251920 hit everything20:57
*** datsun180b has quit IRC20:58
jog0clarkb fungi: so how does gate look20:58
fungijog0: all green so far20:58
mikaljog0: so just so I am clear, you think you've fixed 1251920?20:59
mgagnezaro: what a pain...20:59
jog0fungi: none of the important tests reported back yet20:59
mikaljog0: cause I'd like to never have to do that again please20:59
jog0mikal: fixed, no disabled yes20:59
fungino failures on any gate jobs since the fix to 1251920 merged (there's a merge conflict a ways down, but that's uninteresting)20:59
lifelessclarkb: can I forward that mail to the larger internal list?20:59
jog0we found the test that triggered it and turned it off20:59
clarkblifeless: oh I did a reply only, yeah thats fine20:59
mikaljog0: do we understand why that test triggered it yet?20:59
anteayaw00t20:59
jog0mikal: :(20:59
fungijog0: quite a few have reported back so far (devstack/tempest/grenade) and are all success stories20:59
jog0mikal: https://review.openstack.org/#/c/57193/21:00
jog0fungi: if you're happy, I am happy21:00
fungii take that back... no grenades yet (hasn't been long enough)21:00
*** johnthetubaguy has joined #openstack-infra21:00
jog0fungi: yeah21:01
fungijog0: current average completion time on grenade seems to be ~58 minutes21:01
jog0mikal: so we still need to fix that bug21:02
jog0so happy hunting to us :)21:02
fungibut jenkins may just have not adjusted its expectations downward yet21:02
jog0fungi: can you push a patch to bump the timeout21:02
jog0fungi: ohh21:02
*** SergeyLukjanov has quit IRC21:02
fungijog0: i can. note that it won't affect any already-running jobs, only jobs which start after the config change is applied21:02
*** alcabrera has quit IRC21:03
fungiso we might as well wait to see if it's still that slow first21:03
jeblairi'd like to remove the zuul repos before adjusting the timeout21:03
jog0fungi: ack21:03
*** dprince has quit IRC21:03
jog0jeblair fungi: so it sounds like you have that covered cool21:03
*** denis_makogon_ is now known as denis_makogon21:04
openstackgerritA change was merged to openstack-infra/jenkins-job-builder: fix jjb configuration documentation  https://review.openstack.org/5706221:04
jeblairit's currently taking 2.5 minutes to determine that a ref doesn't exist, and a significant part of that is number of refs in the repos; longer term solution is to push zuul refs to a load balanced set of servers21:04
fungieee, yeah... i wonder if some way to expire zuul refs older than a week or a month or something wouldn't be too hard as a hot periodic sort of task21:05
notmynameshould I be worried that check jobs are still getting hit with 1251920?21:06
funginotmyname: they probably started before the fix for that merged and are based on an earlier state21:06
notmynamefungi: https://review.openstack.org/#/c/57753/21:07
* fungi checks21:07
notmynameperhaps. but it's close21:07
*** mrodden1 is now known as mrodden21:08
fungilooks like the job started at 20:1721:08
fungifix for it merged at 20:2721:08
notmynamekk21:08
fungijog0: stuff seems to be merging and not ejecting... down to 42 in the gate pipeline now21:10
jog0fungi: yeah things are all green :)21:10
jog0so back to business as usual?21:10
fungiin about 10-15 minutes we should have a whole swath of changes finishing, several with grenade jobs21:11
jog0fungi: cool21:11
clarkbjog0: in the wrapup stuff, can you fill in the swift details?21:11
clarkbjog0: they are a bit fuzzy for me because portante managed to just do them :) (which is a good thing)21:12
jog0fungi: sounds like a good plan, I am just getting antsy about letting people get back to work21:12
portantewhat'd i do21:12
jog0clarkb: yeah I will fill that in, in a few minutes21:12
portantewhich way did he go21:12
jog0portante: you want to do it instead21:12
jog0https://etherpad.openstack.org/p/critical-patches-gatecrash-November-201321:12
*** gyee has joined #openstack-infra21:12
jog0so this is the second gate crash that I know of hehe21:12
clarkbportante: near the bottom of that etherpad is a paragraph on the swift failures, with a sentence asking joe to fill in detail. can you take a stab at that?21:12
jeblairi don't like 'gate crash' for the same reason sdague doesn't like 'flakey test' :)21:13
clarkbjeblair: ++21:13
portanteyes21:13
notmynamejeblair: the "jim failure"? ;-)21:13
jog0jeblair clarkb: it was supposed to be a reference to *gate (watergate etc)21:14
jog0but fair enough21:14
jog0jeblair: what catchy name do you propose?21:14
notmynamejeblair: the day the music stopped: gerritgate21:14
jog0notmyname: ^_^21:15
jeblairnotmyname: heh, yeah, let's go all out and just say i singlehandedly broke openstack.  while on vacation, no less.  :)21:15
notmynamejeblair: no harm intended :-)21:16
mikaljog0: so is it time to send an all clear on people approving changes again?21:16
mikaljog0: or am I missing something?21:16
jeblairjog0: i think describing it as wedged or stuck is reasonable21:16
fungimikal: just watching gate jobs for a bit to make sure there's nothing else slowing it up21:16
*** sandywalsh has quit IRC21:16
jeblairnotmyname: i understand -- collaborative jesting is difficult in irc and it looks like i don't quite have the knack yet.  :)21:16
fungimikal: so far it's a sea of green, though grenade timeouts may still bite us21:17
jog0mikal: not yet, we are waiting a few more minutes21:17
jeblairhow about we let a bunch of things merge, then stop zuul, remove the repos, restart, and then send an all clear21:17
jog0jeblair: sounds good, but its not as catchy as the day the music stopped: gerritgate21:17
mikaljog0: ok, cool21:17
jeblairthat way we don't send an all clear and then stop zuul.  :)21:17
clarkbjeblair: I like that plan21:17
mikaljog0: so I should sneakily approve my stuff now before the flood?21:17
clarkbjeblair: my morning errands are all now complete so I can help with that too :)21:18
fungiworth noting, jenkins has adjusted its grenade completion estimates down quite a bit21:18
jog0mikal: heh21:18
clarkbfungi: what was the grenade timing problem?21:18
*** flaper87 is now known as flaper87|afk21:18
portanteclarkb, notmyname: how does that paragraph read?21:18
mikaljog0: you think I'm joking!21:18
jog0mikal: I don't actually21:18
mikaljog0: I'm going to "test" the gate with this here approved patch of mine21:18
jog0mikal: seriously we don't need more tests21:18
jog0we have 40odd21:19
mikal:(21:19
clarkbportante: looks good. can you add links to gerrit changes and lp bugs too?21:19
jeblairmikal: it won't help since i'm about to dump the queue anyway; likely in 16 minutes21:19
fungiclarkb: jeblair has evidence to support zuul ref checking in the setup taking longer than needed because of all the zuul ref buildup in zuul's git repos21:19
jog0mikal: look at zuul btw21:19
jog0about to merge 4 patches21:19
jeblairfungi, clarkb: that combined with the sheer number of simultaneous jobs pulling zuul refs21:19
clarkbfungi: ooh fun, basically git slowdowns due to ETOOMANYREFS?21:19
jog0if grenade finishes21:19
mikaljog0: but none of them mine!@21:19
jog0mikal: I think one of my patches is in the gate21:20
jog0someone reapproved it :(21:20
*** reed has joined #openstack-infra21:20
fungihere we go!21:20
fungi4 changes about to cram through21:20
jog0fungi: \o/21:20
fungiand a bunch more right on their heels21:20
* jog0 has never been so happy to see things work21:20
zarofungi: you sure security-gerrit all-projects should get included with the general docs?21:21
fungiso much greeeeen. so beauuuutiful21:21
fungizaro: well, we will use a shadow gerrit, and we need to know how to rebuild it if needed21:21
jog0fungi: http://www.amsterdam-mamas.nl/wp-content/uploads/2013/02/kermit-Frog.jpg21:22
clarkbfungi: zaro: I think manage-projects should manage that for us21:22
notmynameportante: I'm not sure how to word it seeing as there still seems to be a DB lock error in swift21:22
fungizaro: i'm unconvinced whether that goes in gerrit.rst as an additional section (or several), additional paragraphs in existing sections, or a whole new rst file separate from it21:22
*** mindjive1 is now known as mindjiver21:22
fungiclarkb: should manage the all-projects acl?21:22
portantenotmyname: oh, can you show me?21:22
clarkbfungi: zaro: we can still document why and how in the docs though, but the actual management of it can be in manage-projects21:22
clarkbfungi: ya21:22
notmynameportante: see links in -swift21:22
fungiclarkb: how?21:22
clarkbfungi: its just another gerrit project :)21:22
clarkbso need an All-Projects entry in projects.yaml21:23
fungiclarkb: we need at least some stub all-projects acl configuration to be able to add the accounts necessary21:23
fungiclarkb: so that manage-projects will work21:23
*** marun has quit IRC21:23
clarkbfungi: oh good point21:23
clarkbsilly eggs and their chickens21:23
portantenotmyname: I see21:24
zarofungi: even as a separate rst file, wouldn't you want to include it into gerrit.rst?21:24
fungiclarkb: i was telling zaro, yolanda worked out how to do it, but requires writing a well-tested stub acl via local git operations, making mysql queries to create initial group uuid mappings, and then you can push a more complex acl in on top of it through the normal workflow21:24
notmynameportante: point being, while something that needs to get fixed, it seems the gate issues were around the devstack config, not the db lock21:24
clarkbfungi: gotcha, ya I think for now we should document it like we do for the other gerrit, so that we can bootstrap but we should also automate it beyond the bootstrapping21:25
fungizaro: if it were a separate rst file, it would just go in the list of server/service-specific rst docs on ci.o.o21:25
jeblairi would love that automation21:25
*** sandywalsh has joined #openstack-infra21:25
jeblairbut agree we don't neet to block on it if it's super hard21:25
jeblairneed21:26
portantenotmyname: so you think we rushed that change through because of the gate issues?21:26
portantethat had been in the works well before these gate issues21:26
notmynameportante: yes, I do.21:26
portanteoh, what gives you that impression?21:26
zarofungi: ahh. ok.  that sounds good. i'll create new rst for it.21:27
*** ^d has joined #openstack-infra21:27
fungijog0: so we have a gate reset a ways down courtesy of https://jenkins02.openstack.org/job/gate-oslo.messaging-python27/159/console (looks like it ran too long)21:28
notmynameportante: 2 things: 1) the original bug was introduced in swift on july19 and 2) the gate *appears* stable now, even though the subsequent patch shows there are still issues21:28
portantehuh?21:28
*** hdd has quit IRC21:29
zarofungi: ohh how do you put the new rst in the list of server/service-specific rst docs on ci.o.o ?21:29
portantecan you share the july19th thing, and have you looked at: https://bugs.launchpad.net/swift/+bug/1224253?21:30
*** che-arne has joined #openstack-infra21:30
uvirtbotLaunchpad bug 1224253 in swift "test_object_upload_in_segments fails with OperationalError database is locked" [Undecided,New]21:30
fungijog0: also looks like we have a bunch of grenade and tempest jobs up near the head which are getting very close to 60 minutes21:30
portanteI have seen these errors since at least the 12th of September21:30
portantethey might have been overshadowed by other errors, as rechecks choose one bug to use, when there are multiple in place21:31
notmynameportante: ya, see the commit message in the patch https://review.openstack.org/#/c/57019/21:31
notmynameportante: the referenced commit landed on jul1921:32
fungijog0: jeblair: yup, timeout at the head of the gate on https://jenkins01.openstack.org/job/gate-grenade-devstack-vm/15750/console21:32
portanteI see that21:32
jog0fungi: :(, will that self correct21:32
*** hashar has quit IRC21:33
jog0portante: see status.openstack.org/elastic-recheck/ for better numbers21:33
fungiso this is probably a good time to dump the zuul state and start with fresh repos jeblair? or were you still prepping?21:33
notmynamejog0: bug 1243973 doesn't show up there21:33
uvirtbotLaunchpad bug 1243973 in swift "Simultaneous PUT requests for the same account or container causes server error response" [Undecided,Fix committed] https://launchpad.net/bugs/124397321:34
*** pcm_ has quit IRC21:34
jeblairfungi: agreed, i'll save the queue now21:34
portantebut it would only show up if folks triaged to the point that they found this problem, and most find other jobs failing that are known issues, this one just did not rise to the top21:34
jog0notmyname: ahh if you make a query you can get it added21:35
portanteIt may be just me, but I don't see that as a rush job; an incomplete fix, but not a rush job21:35
jeblairokay, queue saved21:36
jeblairstopping zuul now21:36
* portante back in a bit21:36
jeblairremoving zuul git repos21:37
jeblairfungi, clarkb: want to kill some jenkins jobs?21:38
fungion it21:38
clarkbjeblair: yup I will start on 0221:38
fungii'm sworking up from the bottom on 0121:38
fungisworking. snark21:38
fungiload average on zuul has dropped like a stone too21:39
fungijenkins01 is all clear21:41
clarkbjeblair: 02 is clear21:42
clarkbjeblair: but there are a bunch of stuck jobs there21:42
clarkbmaybe we should restart 02 too?21:42
clarkbjeblair: how did you unstick the jobs that were stuck yesterday?21:43
jeblairclarkb: ok let's restart 0221:43
jeblairclarkb: i deleted their nodes in jenkins, then nodepool21:43
clarkbjeblair: can you delete them when the jobs are running?21:43
* clarkb tries this21:43
jeblairclarkb: yes21:43
jeblairnodepool can't though, so i had to do it through the jenkins webui21:44
fungishall i generate a quick list of all nodepool nodes associated with jenkins02 and ask nodepool to start deleting those?21:44
jeblairfungi: i think we only need to worry about the problem ones21:44
fungiokay21:45
clarkbI am starting at the top of the list if you want to work bottom up21:45
fungiclarkb: which was the first one you deleted? there were 7 but by the time i started opening windows for them there were only 621:45
clarkb708541 and 708544 are being deleted by me21:46
clarkbnow 70854521:46
jeblair70854721:46
fungioh, i had those. maybe one deleted on its own21:46
jeblair70855221:47
*** rongze has joined #openstack-infra21:47
openstackgerritSolly Ross proposed a change to openstack-infra/os-loganalyze: Support Setting The Path Using an ENV Variable  https://review.openstack.org/5778321:48
openstackgerritSolly Ross proposed a change to openstack-infra/os-loganalyze: Introduce Generic Parsing/Filtering Framework  https://review.openstack.org/5778421:48
openstackgerritSolly Ross proposed a change to openstack-infra/os-loganalyze: Introduce Console Version, Move Common Code  https://review.openstack.org/5778521:48
openstackgerritSolly Ross proposed a change to openstack-infra/os-loganalyze: Use JS to Up-Filter  https://review.openstack.org/5778621:48
clarkb02 looks clean now21:48
clarkbshould I restart it?21:48
jeblairclarkb: wait a sec21:48
clarkbk21:48
jeblairnodepool may still be trying to delete nodes, i want to give it a chance to finish21:49
fungiit is21:49
fungisome of them were already deleted according to the tracebacks it gave me21:49
fungibut i also only explicitly deleted the ones which were being removed by hand in jenkins21:50
jeblairok, the only nodes left on jenkins02 that are offline were ones it failed to delete from a long time ago21:50
fungilooks like it thinks there are still over 100 associated with jenkins0221:51
jeblairclarkb: i believe it's safe to restart jenkins02 now21:51
*** mihgen has quit IRC21:51
clarkbok shutting it down now then will start again21:51
jeblairfungi: yeah, fixing the nodepool cleanup thread is next on my list21:51
clarkbits starting now21:51
fungijeblair: did you see my change from a couple days ago?21:51
jeblairfungi: no!21:52
fungijeblair: https://jenkins02.openstack.org/21:52
fungier, that's not the right url :/21:52
jeblairhttps://review.openstack.org/#/c/57364/21:52
*** rongze has quit IRC21:52
fungithat one, yes21:52
fungino idea if that's what you had in mind21:52
*** CaptTofu has quit IRC21:52
jeblairfungi: that's exactly what i had in mind; any chance that's tested?21:52
fungii would say there's probably very close to 0 chance that's tested21:53
*** CaptTofu has joined #openstack-infra21:53
jeblairfungi: i'm willing to test that in production.  :)21:53
fungishould i just spin up a local nodepool and then... i'm not sure on how to make it block the thread21:54
fungiokay21:54
fungii'm guessing i'd have to run that under a debugger to properly test it21:54
* clarkb looks at it21:54
clarkbjeblair: fungi: I am happy giving that a shot as well21:55
clarkbits simple and I am pretty confident it can't make the problem worse21:55
fungioh, right, i wouldn't simulate the thread deadlock issue, that's a symptom, not what we were solving (only just looked back at the change myself)21:56
jeblairclarkb: jenkins02 up?21:56
clarkbjeblair: looks like it21:56
jeblairstarting zuul21:56
clarkbI can get the web ui21:56
*** mfer has quit IRC21:56
jeblairnow we wait for it to clone all the things21:56
jeblairoh right, we clone on demand now, don't we?21:57
jeblairoh, no there's still a clone everything step on startup21:57
fungiat least it's giving nodepool a chance to stock back up for the coming onslaught21:58
*** svarnau has quit IRC21:59
clarkbjeblair: https://review.openstack.org/#/c/52689/ and its child may interest you21:59
*** jhesketh has joined #openstack-infra21:59
jeblairclarkb: very much so, thanks :)22:00
jeblairwe have a lot of repos.22:01
fungiand it just keeps increasing22:01
Shrewsfungi: yep, 6:3022:02
jeblairit's done22:02
fungii see jobs running22:02
*** julim has quit IRC22:03
jeblairshall i send the reverifies for the gate queue?22:03
mikaljeblair: how are you selecting reverifies to send? Or is it just everything approved but unmerged?22:04
*** lcestari has quit IRC22:04
jeblairmikal: everything that was in the gate queue before i stopped it22:04
clarkbwait for it, mikal is going to bribe us to put his changes in first22:05
openstackgerritA change was merged to openstack-infra/nodepool: Skip periodic cleanup if the node is not stale  https://review.openstack.org/5736422:05
jeblairyoink22:05
mikalclarkb: no bribes22:05
fungijenkins02 seems to be running jobs successfully22:05
mikalclarkb: just rage quitting debugging 1251920 unless my demands are met22:05
clarkbfungi: yup it has jobs and is happy22:05
clarkbmikal: nice. I like the way you swung that around on me22:06
jeblairmikal: go aprv your changes now; i'll be slow.22:06
jeblair#status ok22:06
*** ChanServ changes topic to "Discussion of OpenStack Project Infrastructure | Docs http://ci.openstack.org/ | Bugs https://launchpad.net/openstack-ci | Code https://git.openstack.org/cgit/openstack-infra/"22:06
mikaljeblair: heh22:06
mikalIts ok22:06
fungiit's a race to the gate now22:06
mikalI'VE ONLY BEEN WAITING A WEEK FOR THEM TO MERGE22:06
jeblairalso, i put 5 second sleeps in my script22:06
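The throttled queue reload jeblair describes can be sketched roughly as below. This is only an illustration, not his actual script: the function name, the `(change, patchset)` pair shape, and the injected `send` callable are all hypothetical, and how the reverify comment is actually posted to Gerrit is left to the caller.

```python
import time


def replay_queue(changes, send, delay=5.0, sleep=time.sleep):
    """Re-submit each saved change in order, pausing between submissions.

    changes: iterable of (change_number, patchset) pairs (hypothetical shape)
    send:    callable that posts the reverify comment for one change
    delay:   seconds to wait between submissions (5 s, as mentioned in the log)
    sleep:   injectable so the pacing can be tested without waiting
    """
    sent = []
    for index, (change, patchset) in enumerate(changes):
        if index:  # no pause before the very first submission
            sleep(delay)
        send(change, patchset)
        sent.append((change, patchset))
    return sent
```

Spacing the submissions out keeps the freshly restarted scheduler from receiving the whole saved gate queue in a single burst.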
mikaljeblair: seriously though, do you want things to start being approved again?22:07
mikaljeblair: or wait a bit for the queue to clear?22:07
jog0mikal: gate is empty22:07
clarkbmikal: go for it22:07
jeblairmikal: is fine, i think we're all agreed on 'back to normal' now22:07
fungithis graph is nothing short of spectacular, btw... (particularly on the weekly version) http://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=388&rra_id=all22:07
jog0so who will send the email out?22:07
fungiat one point in the past 24 hours, zuul had a 5-minute load average of 143.9822:08
jog0mikal russellb: ^22:08
jeblairjog0, clarkb: i think one of you should, what with having written most of it.  :)22:08
russellbfungi: niiiice22:08
mikaljog0: I can if you want22:08
jog0jeblair: well, we need two emails22:08
mikaljog0: or you can take credit, I don't mind22:08
jeblairjog0: oh, the all-clear email22:08
jog0mikal: can send out the response to his stop working email22:08
jog0jeblair: yeah22:08
jog0just that for now22:08
jeblairmikal, jog0: ++22:09
mikaljog0: ok, doing that now22:09
jog0mikal: thanks22:10
*** yamahata_ has joined #openstack-infra22:10
jog0clarkb: are you going to reload the saved jobs?22:10
clarkbjog0: jeblair is doing that22:10
jog0clarkb: cool beans22:10
jeblairjog0: i am, but i'm being slow about it for some reason :)22:10
mikalDone22:11
jog0mikal: thanks22:12
mikaljog0: np22:12
mikaljog0: thanks for being awesome and stuff22:12
jeblairoh, there we go, now my script is running.  :)22:12
jog0mikal: any time22:12
*** fifieldt has joined #openstack-infra22:12
jeblairclarkb, fungi: i'm manually nodepool-deleting a bunch of nodes in the building state for 48 hours22:13
jog0so we aren't worried about grenade timeout anymore22:13
clarkbjog0: I think we should keep an eye on it but the zuul recloning should fix that for now22:13
jog0cool22:13
jeblairclarkb, fungi: that should help with nodepool responsiveness (until we restart with fungi's patch)22:13
*** eharney has joined #openstack-infra22:14
pabelangerfungi, gebus. Surprised the system didn't tip over22:14
jog0mikal: https://review.openstack.org/#/c/55605/22:15
jog0failed pep8 :)22:15
*** mfer has joined #openstack-infra22:15
jog0mikal: good thing zuul scheduler can deal with that22:16
mikaljog0: but how? I passed it in check22:16
jog0mikal: my guess is the config file stuff22:16
jog0not sure22:16
jeblair2013-11-21 22:14:25.282 | E: nova.conf.sample is not up to date, please run tools/config/generate_sample.sh22:16
mikaljog0: oh yeah, that stupid generator22:17
jog0https://jenkins01.openstack.org/job/gate-nova-pep8/12659/console22:17
*** changbl has quit IRC22:17
jog0mikal: yeah :/22:17
jog0thats hit me a bunch too22:17
mikalUgh22:17
mikalThat's probably going to break all my patches22:17
mikalDammit22:17
mikalYeah, it is the config file thing22:17
jeblairqueue reload and nodepool deletes both finished22:19
mikaljeblair: have you guys had a try of the gate on performance flavours yet?22:19
mikaljeblair: because I'd be super interested in results22:19
mordredmikal: no22:19
clarkbmikal: no, mordred was writing changes to support that22:19
mikalCool.22:19
mordredmikal: I put up half of the change needed, haven't done the other half22:19
mordredjeblair: there is a nodepool patch up to add the capability for it22:20
mikalmordred: is it possible to move a portion of the pool in a region to a different flavor, or is it all or nothing?22:20
*** dolphm has quit IRC22:20
mordredit is possible to trial balloon things for sure, which is what we'll do22:20
mordredmikal: we already only have a couple of rax nodes in the pool because they're slow22:20
mikaljeblair: https://review.openstack.org/#/c/56118/ is a small review waiting for a comment from you when you have some idle cycles22:20
mordredso if we just move those couple of things, we should be able to test22:20
mikalmordred: oh, cool22:20
*** mrodden has quit IRC22:20
mikalmordred: so the move would be small (in terms of instance count)22:21
pabelangerHow long does it take to bootstrap a node?22:21
mordredyes22:21
clarkbpabelanger: image builds take ~20 minutes to an hour22:21
clarkbpabelanger: booting off of the snapshots of those images is relatively quick, we timeout after 2 minutes22:21
*** bknudson has joined #openstack-infra22:21
mikalclarkb: https://review.openstack.org/#/c/56158/ is one for you too... I addressed your concerns and then ignored it during the recent unpleasantness22:21
clarkbmordred: I think we should try and get new hpcloud region in today22:21
mordredclarkb: I agree22:22
pabelangerclarkb, how often are snapshots built?22:22
bknudsonit would be nice to get this keystoneclient change merged quickly because it's breaking keystone -- https://review.openstack.org/#/c/57583/22:22
clarkbpabelanger: daily per cloud region/az22:22
clarkbmordred: do you have time today to address my -1 of the nodepool config change?22:22
clarkbmordred: I can just fix it if you don't22:22
bknudsonnever mind... I see it's in the queue already.22:23
mordredclarkb: can you just fix it? I'm in a stupid meeting22:23
clarkbmordred: sure22:23
pabelangerbknudson, interesting concept, somehow adding a priority flag into Zuul.22:23
bknudsonjust let me "bump" it.22:23
mordredclarkb: actually, hold off on the new hp region22:24
clarkbmordred: oh? should I still fix the nodepool config change?22:24
mordredclarkb: yeah - but let's hold off on putting it through22:24
clarkbok22:24
openstackgerritClark Boylan proposed a change to openstack-infra/config: Add new HP Cloud region  https://review.openstack.org/5626022:25
clarkbmordred: ^ is the updated config22:25
mordredclarkb: yes22:26
openstackgerritTim Daly, Jr. proposed a change to openstack-infra/config: Make python33 check for tomograph non-voting  https://review.openstack.org/5757322:26
*** ken1ohmichi has joined #openstack-infra22:27
*** mrodden has joined #openstack-infra22:30
*** ^d has quit IRC22:30
*** loq_mac has joined #openstack-infra22:30
*** fbo is now known as fbo_away22:30
*** mfer has quit IRC22:30
*** ^d has joined #openstack-infra22:30
jeblairmikal: commented, thx.22:32
jeblairi have attempted to review patches today.  i reviewed some that were apprx 1 month old, so it may be a bit before i'm regularly reviewing recent patches.22:32
mordred:)22:32
jeblairzuul load is not obscene22:33
clarkbI am so far behind on reviews, I have hope that next week will be quiet with the holiday and I can find good chunks of review time22:33
*** dkranz has quit IRC22:34
*** jhesketh has quit IRC22:35
*** thomasem has quit IRC22:36
*** jhesketh has joined #openstack-infra22:36
*** ryanpetrello has quit IRC22:37
*** michchap has quit IRC22:39
jog0gate is looking pretty decent at the moment22:40
jog0merging a patch now :)22:40
*** markmcclain has quit IRC22:40
jog0mikal: its one of yours22:40
clarkbjeblair: for the zuul refs, should we have a monthly job that kills all refs older than a month?22:41
mikaljog0: yay!22:41
clarkband pair that with a gc probably22:41
mordredclarkb: I was going to give you a hard time about all of my patches that need reviews22:41
*** michchap has joined #openstack-infra22:42
mikaljeblair: if you're really behind do you love or hate the idea of people highlighting ones which are blocking on specifically you?22:42
mordredclarkb: but then you worked on fixing the gate and I didn't22:42
mordredclarkb: so, I'll keep my mouth shut22:42
clarkbmikal: I like it, if I am specifically requested to review a thing I try to get to it because I know someone is following along22:42
mordredclarkb: is gate ok enough that I can recheck patches of mine that were trapped by earlier gate break?22:43
clarkbmikal: a lot of reviews seem to go in a black hole so I like it when the chance of that happening is low22:43
clarkbmordred: I think so22:43
mordredclarkb: ok. is it ok to recheck no bug them? I know that they are pre-gate-fix blockages that I just left alone22:43
mordred(these are the pbr integration script patches)22:43
clarkbmordred: I think so, I don't think we can get away from that until after the backlog has caught up22:44
mordredclarkb: ok. cool. thank you.22:44
mordredand thank you to everyone who worked on this - I'm sorry I was AFK for all of it22:44
mikalclarkb: cool22:45
mikalI ask because in nova land people tend to avoid requesting reviews from people specifically22:45
mikalIts seen as rude in 99% of cases22:45
clarkbmikal: I have also been known to say I will get to it when I can :)22:46
clarkbbut it fits into my starring things review pattern really well22:46
mgagnezaro: ping22:47
mordredmikal: we tend to have a STUPIDLY HIGH amount of patches - so help is always welcome22:47
*** rongze has joined #openstack-infra22:48
jog0mordred: so it took us 26 hours to unbreak things22:49
jog0even with drastic measures22:49
jog0just a fun fact22:49
mordredjog0: I'm impressed that you fixed things in 26 hours, tbh22:50
jog0mordred: we got 6 bugs fixed22:50
jog0and mordred the top two bugs on our gate failing list22:50
jog0so not too bad22:51
jog0why is horizon doc hanging22:51
jog0https://jenkins02.openstack.org/job/gate-horizon-docs/1439/22:51
jog0ohh its just slow22:52
*** dcramer_ has quit IRC22:52
*** jhesketh has quit IRC22:53
*** jhesketh has joined #openstack-infra22:54
*** rongze has quit IRC22:54
*** loq_mac has quit IRC22:56
zaromgagne: sup?22:58
mgagnezaro: sorry, found it ^^'22:59
mgagnezaro: I now have a slave running Windows. Now is time to puppetize this thing. ^^'23:01
*** thomasem has joined #openstack-infra23:01
zaromgagne: nice!23:02
*** eharney has quit IRC23:02
*** ^d has quit IRC23:02
*** ^d has joined #openstack-infra23:02
*** weshay has quit IRC23:04
*** dizquierdo has quit IRC23:06
*** bpokorny has quit IRC23:10
openstackgerritKhai Do proposed a change to openstack-infra/config: Setup a private gerrit instance for security reviews  https://review.openstack.org/4793723:10
zarofungi: ^23:10
jeblairi restarted nodepool23:11
fungizaro: awesome. i'm about to disappear for the night (places to be) but will add it to my pile for tomorrow23:13
*** thomasem has quit IRC23:13
zarofungi: in review-security.rst, i only documented high levels.  I just copied project.config setting from general gerrit.23:13
jgriffithjeblair: jog0 in the spirit of discontinuing "reverify no bug" do we want to log new bugs for random well understood issues?23:14
zarofungi: I didn't understand how the gerrit groups in the doc map to high level users so i was wondering if we could iterate and refine project.config?23:14
jgriffithjeblair: jog0 ie failure to connect to puppet repo23:14
jgriffithseems like a no bug is appropriate for now until we have a general category or something23:14
fungijeblair: i'll keep an eye on nodepool for the next few days and see if we witness similar stale node behavior to suggest that maybe it wasn't that race (or additional causes)23:14
jeblairjgriffith: yes, and it'll help us know if that's a problem; it can be a ci bug23:15
jgriffithjeblair: alright, logging one now23:15
*** wenlock has quit IRC23:15
jeblairjgriffith: (i think we'll just have to learn to live with some bugs that stay open for a long time because they aren't very actionable)23:15
jeblairjgriffith: oh wait23:15
jgriffithagreed23:15
jgriffithbut tracking metrics are always helpful23:15
jeblairfungi: didn't someone file a bug about that?23:15
jeblairjog0: ? ^23:16
jgriffithjeblair: fungi I haven't searched yet... gimmie a sec23:16
fungizaro: yeah, more details on project.config would be great23:16
clarkbjgriffith: jeblair: what is the error?23:16
fungijeblair: jgriffith: yeah, jog0 just filed it a couple hours ago23:16
jeblairjgriffith: i think there was a suggestion that we remove some apt repos before snapshotting on devstack runs, so actually, that's a pretty actionable bug.  :)23:16
* fungi finds23:17
jeblairclarkb: i think 'apt-get update' fails because we have a puppet apt repo, which is less reliable than the rax mirror we use for the os23:17
jgriffithjeblair: coolio23:17
jgriffithjeblair: that's correct... in the grenade test23:17
clarkbjgriffith: gotcha23:17
fungijgriffith: jeblair: bug 125377423:17
uvirtbotLaunchpad bug 1253774 in openstack-ci "Reduce number of apt sources that must be up for gate to work" [Medium,Confirmed] https://launchpad.net/bugs/125377423:17
jgriffithhttp://logs.openstack.org/68/54068/23/gate/gate-grenade-devstack-vm/e036eb6/console.html#_2013-11-21_20_19_18_74923:17
clarkbshould the nodepool image build scripts remove that apt repo as one of the last things it does?23:17
jeblairthat's it!23:17
jgriffithfungi: nice... I'll log it against that one23:17
jgriffithfungi: jeblair clarkb thanks!23:18
jeblairclarkb: i think that's the way i'd do it.23:18
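The idea agreed on here (drop the less reliable apt source as one of the last steps of the image build, so `apt-get update` on test nodes no longer depends on it) might look something like the sketch below. It is an assumption-laden illustration rather than the real nodepool build script: the function name is hypothetical, and the `puppet*.list` pattern is a guess at how the source file is named under `/etc/apt/sources.list.d`.

```python
import fnmatch
import os


def drop_apt_sources(sources_dir, pattern="puppet*.list"):
    """Delete apt source files in sources_dir whose names match pattern.

    Returns the sorted list of file names that were removed. The default
    pattern is an assumed name for the puppet repo's source file.
    """
    removed = []
    for name in sorted(os.listdir(sources_dir)):
        if fnmatch.fnmatch(name, pattern):
            os.remove(os.path.join(sources_dir, name))
            removed.append(name)
    return removed
```

On a real image this would be pointed at `/etc/apt/sources.list.d` just before the snapshot is taken, after everything that needs the extra repository has been installed.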
*** krtaylor has quit IRC23:18
fungiclarkb: you should suggest that in the bug ;)23:18
clarkbfungi: I will23:18
jeblairfungi: i think you already did suggest that? :)23:18
fungiyes, i did23:19
jeblairfungi, clarkb: good ideas all around! :)23:19
fungiagreement in the bug is just as good though23:19
* fungi really disappears now. can't keep Shrews and olaph waiting23:19
clarkboh heh I read the bug now :)23:20
*** mrodden has quit IRC23:24
*** reed has quit IRC23:24
*** mrodden has joined #openstack-infra23:25
*** yamahata_ has quit IRC23:25
*** CaptTofu has quit IRC23:25
*** CaptTofu has joined #openstack-infra23:26
*** herndon has quit IRC23:27
*** lchen has quit IRC23:29
*** CaptTofu_ has joined #openstack-infra23:30
*** CaptTofu has quit IRC23:31
*** wenlock has joined #openstack-infra23:32
clarkbI haven't seen any obvious grenade failures23:32
*** changbl has joined #openstack-infra23:35
jeblairnodepool started its periodic cleanup at 2315, and hasn't hit an exception or stopped yet23:36
clarkbjeblair: I have been thinking about removing git zuul refs, is the best way to do that with a find -mtime then a git gc?23:39
clarkber find -mtime delete?23:40
jog0jgriffith: thanks for getting on board with the new world view23:41
*** sarob has quit IRC23:44
*** sarob has joined #openstack-infra23:45
jeblairclarkb: er, it's slightly more complicated, and i'm not sure it's safe to do while zuul is running23:45
jeblairclarkb: i'll try to dig up my script23:45
clarkbjeblair: is it more complicated because the refs themselves can get packed?23:46
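One way to sidestep the packed-refs concern is to ask git for the refs instead of walking the filesystem: packed refs live in the `packed-refs` file rather than as loose files, so `find -mtime` would not see them, while `git for-each-ref` lists both kinds. The following is a minimal sketch under several assumptions, not jeblair's script: the `refs/zuul` prefix and the 30-day window are illustrative, the committer date of the referenced commit is used as an approximation of the ref's age, and (per the caution above) it should not be assumed safe to run while zuul is active.

```python
import subprocess
import time


def stale_refs(for_each_ref_output, now, max_age_days=30):
    """Pick ref names whose commit timestamp is older than the cutoff.

    Expects lines of "<refname> <unix timestamp>" as produced by
    git for-each-ref --format='%(refname) %(committerdate:unix)'.
    """
    cutoff = now - max_age_days * 86400
    stale = []
    for line in for_each_ref_output.splitlines():
        refname, _, stamp = line.strip().rpartition(" ")
        if refname and stamp.isdigit() and int(stamp) < cutoff:
            stale.append(refname)
    return stale


def prune(repo, prefix="refs/zuul", max_age_days=30):
    """Delete stale refs under prefix in repo, then garbage-collect."""
    out = subprocess.check_output(
        ["git", "for-each-ref",
         "--format=%(refname) %(committerdate:unix)", prefix],
        cwd=repo, text=True)
    for ref in stale_refs(out, time.time(), max_age_days):
        # update-ref -d handles both loose and packed refs
        subprocess.check_call(["git", "update-ref", "-d", ref], cwd=repo)
    subprocess.check_call(["git", "gc", "--quiet"], cwd=repo)
```

Keeping the selection logic in `stale_refs` separate from the git calls makes it possible to dry-run the choice of refs before deleting anything.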
*** atiwari has quit IRC23:49
*** gyee has quit IRC23:49
*** sarob has quit IRC23:49
*** gyee has joined #openstack-infra23:50
*** rongze has joined #openstack-infra23:50
*** atiwari has joined #openstack-infra23:50
*** jgrimm has quit IRC23:51
*** jhesketh has quit IRC23:53
clarkblifeless: https://jenkins02.openstack.org/job/gate-neutron-python26/3379/console the html output conversion for neutron is taking forever under python26 (it takes 10 minutes on python27 which is pretty bad too). The subunit files are really large, any thoughts on making it go quicker?23:53
clarkbhmm are we not converting to subunitv2 anymore /me checks23:53
lifelessget me a copy of them ?23:53
*** rcleere has quit IRC23:54
*** dkliban_ has quit IRC23:54
*** rnirmal has quit IRC23:54
clarkblifeless: nevermind we aren't converting to v2 first23:54
*** pete5 has quit IRC23:54
*** ken1ohmichi has quit IRC23:54
clarkblifeless: that code got disappeared somehow, I will correct it (v1 going through v2 parser is slow and that is a known problem)23:54
clarkbhmm did that change never get merged?23:55
clarkbgah it happened when we moved to run-unittest.sh23:56
*** rongze has quit IRC23:56
*** michchap has quit IRC23:58
*** michchap has joined #openstack-infra23:58
