Wednesday, 2018-05-30

*** ChanServ has quit IRC00:12
myoungrlandy|rover: ack thx00:15
*** yolanda_ has quit IRC00:19
*** yolanda_ has joined #oooq00:20
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.00:22
*** rlandy|rover has quit IRC00:33
*** ChanServ has joined #oooq00:33
*** barjavel.freenode.net sets mode: +o ChanServ00:33
*** atoth has quit IRC00:35
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.02:22
*** ykarel|away has joined #oooq02:46
*** rlandy has joined #oooq02:51
chandankumarrfolco: I have asked release delivery guys to tag it under unittesting repo03:55
*** udesale has joined #oooq03:58
*** ykarel|away is now known as ykarel03:59
*** rlandy has quit IRC04:05
*** ykarel is now known as ykarel|afk04:07
*** links has joined #oooq04:11
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.04:23
chandankumarrfolco: future is also available under unittesting repo you are good to go04:38
*** marios has joined #oooq04:40
*** pgadiya has joined #oooq04:42
*** pgadiya has quit IRC04:44
*** ykarel|afk is now known as ykarel05:18
*** quiquell|off is now known as quiquell05:33
Tenguhello! small question: is there a way to mimic tripleo-ci-centos-7-scenario001-multinode-oooq-container with quickstart? (or any CI env/tests)05:56
quiquellTengu: the reproducer now works with libvirt and RDO cloud05:59
Tenguquiquell: oh?06:01
Tenguthat's really, really good news. Will check how to do that.06:01
quiquellTengu: Do you have a failing build from zuul ?06:02
Tenguquiquell: yep06:03
Tenguhttp://logs.openstack.org/27/570627/15/check/tripleo-ci-centos-7-scenario007-multinode-oooq-container/69c694f/logs/06:03
Tenguso I basically download the reproduce-quickstart.sh file - I'll read it in order to find out how to use a libvirt host.06:04
quiquellyep there is a --with-libvirt or similar06:04
Tengu... but first I have to retake my builder host, apparently it went down -.-06:05
quiquellquiquell: It's kinda new, so you may find some problems; let us know06:05
Tenguok :)06:05
quiquellsshnaidm: You there ?06:12
quiquellsshnaidm: Going to resize the ruck-rover-dashboard to a bigger flavor06:15
Tenguquiquell: erf... apparently it lacks all the env setup... Moreover, it would be great to be able to pass the virthost IP (i.e. I'm not running that on my laptop, I have a desktop dedicated to this kind of heavy task)06:15
Tenguquiquell: I just edited the script and replaced the IP address, not a blocker, but the ansible errors...06:15
quiquellTengu If you use libvirt you have to run it at your desktop06:16
quiquellIt will be the libvirt host06:16
Tenguquiquell: hmmm ok. so it doesn't work like quickstart, copying and running the recipes on the remote node, right?06:16
quiquellTengu It does, but the remote is just the undercloud, which is a libvirt image running in the host06:17
quiquellSo the remoting is really inside the virthost but from host to image06:18
Tenguquiquell: hmm wait - do I have to run the reproducer from an undercloud node, or from a "naked" BM node with libvirt?06:19
quiquellTengu the place where libvirt is running06:19
Tenguok, so my BM06:20
quiquellYep06:20
Tengulet's try it then :)06:20
quiquellSure, let me know how it goes (I haven't tried the libvirt option yet)06:20
Tenguthere are some steps to ensure prior to running it: ssh access to localhost without password, sudo without password and so on. best doing this in a dedicated user you can drop later.06:22
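The prerequisites Tengu lists can be checked before launching the reproducer. The sketch below is not part of tripleo-quickstart; the names `PREREQ_CHECKS` and `failed_prereqs` are hypothetical. It assumes that `sudo -n true` and `ssh -o BatchMode=yes localhost true` exit non-zero whenever a password prompt would be needed.

```python
import subprocess

# Each command is expected to succeed only if no password prompt is required.
# "sudo -n" and ssh's "BatchMode=yes" both refuse to prompt interactively.
PREREQ_CHECKS = {
    "passwordless sudo": ["sudo", "-n", "true"],
    "passwordless ssh to localhost": ["ssh", "-o", "BatchMode=yes", "localhost", "true"],
}


def failed_prereqs(run=subprocess.call):
    """Return the names of the checks whose command exits non-zero.

    `run` takes a command list and returns its exit code; it is injectable
    so the logic can be exercised without touching sudo or ssh.
    """
    return [name for name, cmd in PREREQ_CHECKS.items() if run(cmd) != 0]
```

Calling `failed_prereqs()` as the dedicated user on the virthost would return an empty list when both prerequisites are in place.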
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.06:23
*** jtomasek has joined #oooq06:25
quiquellTengu: I usually run a tmux in the virthost, it helps to continue with work06:26
Tenguquiquell: ah, well, of course ;). I always run a tmux on remote computers :).06:27
quiquellTengu: Cool cool06:28
*** jtomasek has quit IRC06:28
*** ratailor has joined #oooq06:33
*** holser__ has joined #oooq06:43
Tenguquiquell: small question: where's the git for the "tripleo-ci"? I'd like to see how it works :).06:59
quiquellopenstack-infra/tripleo-ci07:00
Tenguah, good07:00
Tenguthank you!07:00
quiquellyw07:00
*** tosky has joined #oooq07:06
*** ccamacho has joined #oooq07:11
*** jaosorior has joined #oooq07:15
Tenguquiquell: also, how long are the build logs kept on the CI infra (if you know that, of course)? I guess it becomes pretty heavy regarding disk storage and has to be cleaned on a regular basis...07:16
quiquellTengu: I have to leave, back in a few07:17
quiquellTengu: Don't know those details, you can ask at #openstack-infra07:18
*** quiquell is now known as quiquell|afk07:18
Tenguquiquell|afk: np, thanks :)07:18
*** zz_saneax has joined #oooq07:18
*** zz_saneax is now known as saneax07:19
sshnaidmquiquell|afk, ok07:19
*** sshnaidm is now known as sshnaidm_pto07:20
*** tesseract has joined #oooq07:20
*** apetrich has quit IRC07:21
*** apetrich has joined #oooq07:23
*** florianf has joined #oooq07:23
*** jtomasek has joined #oooq07:25
*** amoralej|off is now known as amoralej07:27
*** jtomasek has quit IRC07:31
*** skramaja has joined #oooq07:32
*** ykarel is now known as ykarel|lunch07:36
*** ykarel|lunch has quit IRC07:47
*** quiquell|afk is now known as quiquell08:11
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.08:23
*** ykarel|lunch has joined #oooq08:48
*** brault has joined #oooq08:55
*** ykarel|lunch is now known as ykarel08:57
*** ykarel is now known as ykarel|away09:06
*** ykarel|away has quit IRC09:12
*** jtomasek has joined #oooq09:24
*** jtomasek has quit IRC09:25
*** zoli is now known as zoli|lunch09:31
*** udesale_ has joined #oooq09:31
*** udesale_ has quit IRC09:32
*** udesale_ has joined #oooq09:32
*** udesale_ has quit IRC09:32
*** udesale_ has joined #oooq09:33
*** udesale has quit IRC09:33
*** dtantsur|afk is now known as dtantsur09:38
*** sanjayu_ has joined #oooq10:01
*** udesale__ has joined #oooq10:02
*** udesale__ has quit IRC10:02
*** udesale has joined #oooq10:03
*** udesale_ has quit IRC10:05
*** jaosorior has quit IRC10:14
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.10:23
*** jaosorior has joined #oooq10:31
*** sanjayu_ has quit IRC10:39
*** saneax has quit IRC10:46
*** saneax has joined #oooq10:48
*** sanjay__u has quit IRC10:54
*** zoli|lunch is now known as zoli11:00
*** quiquell is now known as quiquell|lunch11:01
*** moguimar has joined #oooq11:08
*** ykarel has joined #oooq11:28
*** udesale_ has joined #oooq11:28
*** udesale has quit IRC11:31
*** udesale_ has quit IRC11:33
*** ykarel has quit IRC11:37
*** amoralej is now known as amoralej|lunch11:38
*** rlandy has joined #oooq11:56
ssbarneaapparently using quickstart with ANSIBLE_STRATEGY=debug is a PITA, triggers debugger on lots of tasks.12:00
rlandyarxcruz|ruck: ping - I'm on the platform meeting12:03
arxcruz|ruckrlandy: me too12:03
*** rlandy is now known as rlandy|rover12:03
arxcruz|ruckrlandy: already updated the doc12:03
rlandy|roverarxcruz|ruck: do you need me or should I drop?12:03
arxcruz|ruckrlandy|rover: you can drop if you want, we are green :)12:03
arxcruz|ruckit will be fast12:04
rlandy|roverarxcruz|ruck: what's the latest on https://bugs.launchpad.net/tripleo/+bug/1773445?12:06
openstackLaunchpad bug 1773445 in tripleo "tripleo-quickstart-extras-gate-newton-delorean-full-minimal fails to install undercloud - Access denied for user 'heat'@'192.168.24.1" [High,Triaged]12:06
arxcruz|ruckrlandy|rover: I updated the bug, i have a PR on puppet-certmonger12:06
arxcruz|ruckwaiting for jaosorior's friends to approve it, then we can move on to the openstack side to fix it12:07
arxcruz|ruckrlandy|rover: https://github.com/saltedsignal/puppet-certmonger/pull/2012:07
rlandy|rovercool12:07
*** panda|off is now known as panda12:10
*** ratailor has quit IRC12:11
pandadoes someone have today's candidate resume ?12:13
*** atoth has joined #oooq12:16
*** quiquell|lunch is now known as quiquell12:17
rlandy|roverpanda: not sure we got one12:18
arxcruz|ruckpanda: i have his facebook :P12:18
pandaarxcruz|ruck: even better12:18
quiquellhumm... did you guys find my facebook before my interview ?12:20
arxcruz|ruckquiquell: of course, it was the only reason we hired you lol12:22
pandaquiquell: I really liked how you decorated the house.12:23
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.12:23
pandahubbot: thanks for checking stable/ocata twice12:23
hubbotpanda: Error: "thanks" is not a valid command.12:23
rlandy|roverok - this is starting to get creepy12:23
pandamyoung: do you know if it's checking stable/pike at all ? ^12:24
arxcruz|ruckhubbot: kill12:25
hubbotarxcruz|ruck: Error: "kill" is not a valid command.12:25
arxcruz|ruckat least kill is not a valid command12:25
arxcruz|ruckhubbot: love12:25
hubbotarxcruz|ruck: Error: "love" is not a valid command.12:25
*** udesale has joined #oooq12:29
pandaI think we need a completely different approach with this candidate. I'm looking at his reviews12:29
*** trown|outtypewww is now known as trown12:31
*** ykarel has joined #oooq12:38
myoungpanda: afaik, it's checking the following patches12:41
*** amoralej|lunch is now known as amoralej12:41
myoung%config plugins.GateStatus.changeIDs12:41
hubbotmyoung: I0cbf9ffb8552411e4dd891c38702ff8d1f6db5b1 I214272a6f25feb75496e44eb0a16269c6ee4cfe2 I4c5bdf00ce8cf7eabf669b248b99cb8443e82fab If12c8fe9bd0bea98a4842f279399285344f2224612:41
myoungpanda: this is...12:42
myoung    TQE, https://review.openstack.org/#/c/560445, I214272a6f25feb75496e44eb0a16269c6ee4cfe212:42
myoung    THT, https://review.openstack.org/#/c/567224, I0cbf9ffb8552411e4dd891c38702ff8d1f6db5b1, stable/queens12:42
myoung    THT, https://review.openstack.org/#/c/564285, If12c8fe9bd0bea98a4842f279399285344f22246, stable/pike12:42
myoung    THT, https://review.openstack.org/#/c/564291, I4c5bdf00ce8cf7eabf669b248b99cb8443e82fab, stable/ocata12:42
myoung%gatestatus12:42
pandamyoung: ok, so it's hubbot that is reporting incorrectly12:42
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.12:42
myoungyeah something's a little weird there12:42
myoungyeah arxcruz|ruck noticed this yesterday12:43
myounghttps://review.openstack.org/#/c/564285 isn't being rechecked12:43
myoungarxcruz|ruck: do we have a LP for that?12:43
myoung^^ that's a bug / problem12:44
myoungarxcruz|ruck, rlandy|rover, if I had to guess it's maybe a typo on the .conf file on hubbot instance?12:44
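The mismatch discussed above can be made explicit by comparing the branches hubbot reports with the branches of the watched changes. This is a sketch only, not hubbot's code; `WATCHED` and `compare` are hypothetical names. The change IDs and branches come from myoung's list, except that no branch was given for the TQE change, so `master` is an assumption.

```python
from collections import Counter

# Change-Id -> branch, as listed by myoung above.
# The TQE change had no branch listed; "master" is an assumption.
WATCHED = {
    "I214272a6f25feb75496e44eb0a16269c6ee4cfe2": "master",
    "I0cbf9ffb8552411e4dd891c38702ff8d1f6db5b1": "stable/queens",
    "If12c8fe9bd0bea98a4842f279399285344f22246": "stable/pike",
    "I4c5bdf00ce8cf7eabf669b248b99cb8443e82fab": "stable/ocata",
}


def compare(reported):
    """Return (branches reported more than once, watched branches never reported)."""
    counts = Counter(reported)
    duplicated = sorted(branch for branch, n in counts.items() if n > 1)
    missing = sorted(set(WATCHED.values()) - set(reported))
    return duplicated, missing


# Branch list taken from the hubbot status line in this log.
reported = ["stable/queens", "stable/ocata", "stable/ocata", "master"]
duplicated, missing = compare(reported)
print("duplicated:", duplicated)
print("missing:", missing)
```

With the status line from this log, stable/ocata shows up as duplicated and stable/pike as missing, which matches what panda and myoung observed.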
*** udesale has quit IRC12:47
*** udesale has joined #oooq12:47
arxcruz|ruckmyoung: we don't but i'll create :)12:49
quiquellpanda: I have some logs about the n -> n + 1 do you have time to look at them with me ?12:49
myoungarxcruz|ruck: thx12:52
pandaquiquell: after the interview12:58
quiquellok12:58
trownrlandy|rover: joining interview?13:00
rlandy|rovermyoung: sshnaidm_pto: hate to ask this question again - any idea why this job was disabled since 09/05 if it was passing at that point? https://ci.centos.org/job/tripleo-quickstart-promote-ocata-rdo_trunk-minimal/13:04
rlandy|roverthere must have been a reason to do so13:04
myoungamoralej: ^^ ?13:05
myoungrlandy|rover: thats in rdo1, I don't have visibility.  checking git history13:05
amoralejmmmm13:05
rlandy|roverit's our (RDO CI) job13:05
rlandy|rovernot amoralej's13:05
myoungrlandy|rover: I just mean I don't recall patching anything to disable it13:05
rlandy|roveralthough if he knows, I'd be grateful13:05
amoraleji've seen it failing lately13:05
amoralejbut i'm not sure why it was disabled13:06
myoungrlandy|rover: and I/we don't have access to either the nodes it runs on, or the jenkins server13:06
rlandy|roveramoralej: I'm following this failing job here: https://bugs.launchpad.net/tripleo/+bug/177407913:06
openstackLaunchpad bug 1774079 in tripleo "[ocata promotion] phase1 (ci.centos) job tripleo-quickstart-promote-ocata-rdo_trunk-minimal fails introspection/deploy "No valid host found"" [Critical,Triaged]13:06
rlandy|roverI am considering dropping it as a promotion criterion but I'd prefer not to13:06
rlandy|roverproblem is the failure is not consistent13:07
rlandy|rovermyoung: the only reason I am asking you guys is that I am guessing the disable happened during the ruck/rover shift you had13:08
myoungrlandy|rover: i'm going thru git history now, was the job disabled by hand using jenkins UI in admin mode (we don't have this)?  I'm not seeing any commits that would disable the job during that period13:09
myoungalso looking back thru notes from that sprint13:10
ykarelrlandy|rover, is that ocata job reproducible locally?13:10
ykareli mean has anyone tried that13:10
rlandy|roverykarel: we have no access to the exact hardware - and we don't create a reproducer there13:11
rlandy|roverso the closest we can get is a virt job on our own hardware13:11
* myoung reconstructs history (http://sol.usersys.redhat.com/dlrnapi-reports/ocata-combined.txt)13:13
*** ykarel_ has joined #oooq13:13
ykarel_i think local reproducer would help getting to the root cause13:15
myoung2018-05-09 10:28:33, https://trunk.rdoproject.org/centos7-ocata/b8/d5/b8d5d3b2f3937e2063192a4fb3b97e8eabe56383_1edce40d, current-tripleo-rdo looks to be the last rdo1 promotion, with the next current-tripleo promotion not happening until 5/2213:15
myoung2018-05-22 02:41:04, https://trunk.rdoproject.org/centos7-ocata/5c/13/5c13ff81a5466b3ee8a23e3b910f8cc7a66995b6_5e0d17be, current-tripleo13:15
myoungwhich spawned https://ci.centos.org/job/rdo_trunk-promote-ocata-current-tripleo/45213:15
myoungwhich had the sub-job https://ci.centos.org/job/tripleo-quickstart-promote-ocata-rdo_trunk-minimal/334, which is the next job after 5/913:15
myoungso nothing it appears was disabled per se between 5/9 and 5/22, we just didn't have a current-tripleo promotion13:16
myoungrlandy|rover: ^^13:16
*** ykarel has quit IRC13:16
rlandy|roveroh ok - nothing from 0913:16
rlandy|roverok13:16
myoungi've also confirmed that JJB for those job didn't change13:16
myounglooks like 13 attempts upstream to promote tripleo-ci-testing --> current-tripleo, there's a good deal of gap13:18
myoungif i recall correctly this was also during the period where master and queens were broken due to container issues, as well as the concurrent centos 7.5 release (and the hilarity in gates for a week) during that sprint, so why ocata upstream wasn't promoting between 5/9 and 5/22 isn't clear...our focus wasn't there.  I'm reviewing sprint notes to see if there are some clues...13:19
*** ykarel_ is now known as ykarel13:21
myoungrlandy|rover: finished diving thru notes, and we were focused on master/queens, and mostly DOA rdo2 from beginning of sprint.  the ocata virt minimal job appears to have been failing in introspection starting 5/22 (https://ci.centos.org/artifacts/rdo/jenkins-tripleo-quickstart-promote-ocata-rdo_trunk-minimal-334/undercloud/home/stack/overcloud_prep_images.log.gz) and persisting with same error until today https://ci.centos.org/artifacts/rdo/jenkins-tripleo-quickstart-promote-ocata-rdo_trunk-minimal-353/undercloud/home/stack/overcloud_prep_images.log.gz13:28
*** links has quit IRC13:29
rlandy|rovermyoung: thanks - yep - sometimes we fail introspection and sometimes we fail deploy13:29
myoung2018-05-29 21:17:56.478 20823 INFO workflow_trace [-] Workflow 'tripleo.baremetal.v1.introspect_manageable_nodes' [RUNNING -> ERROR, msg=Failure caused by error in tasks: fail_workflow13:29
myoung^^ https://ci.centos.org/artifacts/rdo/jenkins-tripleo-quickstart-promote-ocata-rdo_trunk-minimal-353/undercloud/var/log/mistral/engine.log.gz13:29
*** links has joined #oooq13:30
myoungthat error looks to be preceded by13:30
myoung2018-05-29 21:11:56.173 20896 INFO swiftclient [-] REQ: curl -i https://192.168.24.2:13808/v1/AUTH_cd277c3ea8e142aebf89d4c008ac886f/overcloud -I -H "X-Auth-Token: gAAAAABbDcIXgFli..."13:30
myoung2018-05-29 21:11:56.173 20896 INFO swiftclient [-] RESP STATUS: 404 Not Found13:30
myoung2018-05-29 21:11:56.174 20896 INFO swiftclient [-] RESP HEADERS: {u'Date': u'Tue, 29 May 2018 21:11:56 GMT', u'Content-Length': u'0', u'Content-Type': u'text/html; charset=UTF-8', u'X-Openstack-Request-Id': u'tx862953ad45dc456aaea3a-005b0dc21b', u'X-Trans-Id': u'tx862953ad45dc456aaea3a-005b0dc21b'}13:30
myoung2018-05-29 21:11:56.177 20896 WARNING mistral.actions.openstack.base [-] Traceback (most recent call last):13:30
myoung  File "/usr/lib/python2.7/site-packages/mistral/actions/openstack/base.py", line 127, in run13:30
myoung    result = method(**self._kwargs_for_run)13:30
myoung  File "/usr/lib/python2.7/site-packages/swiftclient/client.py", line 1735, in head_container13:30
myoung    return self._retry(None, head_container, container, headers=headers)13:30
myoung  File "/usr/lib/python2.7/site-packages/swiftclient/client.py", line 1673, in _retry13:30
myoung    service_token=self.service_token, **kwargs)13:30
myoung  File "/usr/lib/python2.7/site-packages/swiftclient/client.py", line 977, in head_container13:30
myoung    resp, 'Container HEAD failed', body)13:30
myoungClientException: Container HEAD failed: https://192.168.24.2:13808/v1/AUTH_cd277c3ea8e142aebf89d4c008ac886f/overcloud 404 Not Found13:31
myoungin executor log13:31
myoungsry i'm probably just retracing your debug steps and spamming the channel13:31
myoungcould this just be swift / infra on the nodes up there and not a product issue?13:32
arxcruz|ruckmyoung: could be swift didn't start ?13:34
arxcruz|ruckbecause it's 40413:34
* myoung is looking at swift logs too13:35
myoungarxcruz|ruck, rlandy|rover, so swift is up and running, here's the request coming in afaict...13:38
myoung2018-05-29 21:11:56.173 20896 INFO swiftclient [-] REQ: curl -i https://192.168.24.2:13808/v1/AUTH_cd277c3ea8e142aebf89d4c008ac886f/overcloud -I -H "X-Auth-Token: gAAAAABbDcIXgFli..."13:38
myoung2018-05-29 21:11:56.173 20896 INFO swiftclient [-] RESP STATUS: 404 Not Found13:38
myoung2018-05-29 21:11:56.174 20896 INFO swiftclient [-] RESP HEADERS: {u'Date': u'Tue, 29 May 2018 21:11:56 GMT', u'Content-Length': u'0', u'Content-Type': u'text/html; charset=UTF-8', u'X-Openstack-Request-Id': u'tx862953ad45dc456aaea3a-005b0dc21b', u'X-Trans-Id': u'tx862953ad45dc456aaea3a-005b0dc21b'}13:38
myoung2018-05-29 21:11:56.177 20896 WARNING mistral.actions.openstack.base [-] Traceback (most recent call last):13:38
myoung  File "/usr/lib/python2.7/site-packages/mistral/actions/openstack/base.py", line 127, in run13:38
myoung    result = method(**self._kwargs_for_run)13:38
myoung  File "/usr/lib/python2.7/site-packages/swiftclient/client.py", line 1735, in head_container13:38
myoung    return self._retry(None, head_container, container, headers=headers)13:38
myoung  File "/usr/lib/python2.7/site-packages/swiftclient/client.py", line 1673, in _retry13:38
myoung    service_token=self.service_token, **kwargs)13:38
myoung  File "/usr/lib/python2.7/site-packages/swiftclient/client.py", line 977, in head_container13:38
myoung    resp, 'Container HEAD failed', body)13:38
myoungClientException: Container HEAD failed: https://192.168.24.2:13808/v1/AUTH_cd277c3ea8e142aebf89d4c008ac886f/overcloud 404 Not Found13:38
arxcruz|ruck:(13:38
myoungshiz13:38
myoungpaste fail13:38
myoungthis rather13:38
arxcruz|ruckpastebin13:38
myoungMay 29 21:11:55 undercloud haproxy[15814]: Connect from 192.168.24.3:37426 to 192.168.24.3:35357 (keystone_admin/HTTP)13:38
myoungMay 29 21:11:56 undercloud proxy-server: 192.168.24.1 192.168.24.1 29/May/2018/21/11/56 HEAD /v1/AUTH_cd277c3ea8e142aebf89d4c008ac886f/overcloud HTTP/1.0 404 - python-swiftclient-3.3.0 gAAAAABbDcIXgFli... - - - tx862953ad45dc456aaea3a-005b0dc21b - 0.5826 - - 1527628315.588342905 1527628316.170984983 -13:38
myoung^^ https://ci.centos.org/artifacts/rdo/jenkins-tripleo-quickstart-promote-ocata-rdo_trunk-minimal-353/undercloud/var/log/swift/swift.log.gz13:38
arxcruz|ruckso the overcloud swift is missing13:39
rlandy|roverI'd like to blame this on infra13:39
myoungrlandy|rover: yeah...i bet if we ran this job on one of our beefier virthosts in RDU it would pass, or at least not fail like this13:40
rlandy|rovermyoung: the fun thing is that pike/master/queens pass13:40
myounghuh...13:41
* myoung wonders if a backport --> stable/ocata was missed somewhere13:41
myoungor if this is an ocata only issue13:41
rlandy|roveronly ocata13:41
myoungwe could ping OSP qe folk, or do a quick bz search to see if it's been found / observed there13:42
* myoung looks to see last puddle import and flips to #rhos-delivery13:42
myoungi do have fond memories of a pile of introspection random fails in ocata timeframe13:42
myoungand by "fond" I mean "My brain is blocking the memories to protect itself"13:43
myoung:)13:43
rlandy|roverI have been searching13:43
myoungrlandy|rover: so imports to ocata have not occurred in 21 days13:44
myoungso the last OSP puddle for ocata would be prior to when we started to hit this...13:44
*** links has quit IRC13:44
* myoung confirms13:44
myoungrlandy|rover: huh...dashboard might be off...seeing a pile of puddles for 1013:48
myoungflipping to internal channel13:48
rlandy|rovermyoung: let's bj  - when review is over13:48
rfolcochandankumar, hi, still don't see python2-{future,stestr} on http://download-node-02.eng.bos.redhat.com/rel-eng/repos/rhos-12.0-rhel-7-testdeps/x86_64/13:48
rlandy|roverboard is correct we have not promoted ocata13:48
Tenguhello there! for information, this patch will impact quickstart: https://review.openstack.org/#/c/570627/  - care to have a look, as well as its quickstart-extras part: https://review.openstack.org/#/c/570841/ (especially since there will be a version check)? thank you :)13:51
myoungrlandy|rover: aye...but they have been pulling changes and making puddles anyway, see chatter in rhos-delivery -13:52
ssbarneaout of curiosity, is it normal to see index.html.gz on ara reports? the browser no longer opens the HTML file as HTML if it is archived. see https://ci.centos.org/artifacts/rdo/jenkins-tripleo-quickstart-gate-newton-delorean-quick-basic-5031/ara_oooq/13:58
arxcruz|ruck!gate14:02
openstackarxcruz|ruck: Error: "gate" is not a valid command.14:02
arxcruz|ruck!check14:02
openstackarxcruz|ruck: Error: "check" is not a valid command.14:02
rlandy|rovermyoung: arxcruz|ruck: want to bj?14:02
panda%gatestatus14:02
arxcruz|ruckrlandy|rover: sure14:02
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.14:02
myoungrlandy|rover, arxcruz|ruck, so https://code.engineering.redhat.com/gerrit/#/c/139256, imports from ocata --> OSP are no longer (ever) happening as osp11 is now officially EOL anyway14:02
myoungsure14:02
arxcruz|ruckpanda: i want to check which changes hubbot is listening to14:03
pandassbarnea: yes, I would solve it using class.mro14:03
myoungimho we care even less about this14:03
myoungarxcruz|ruck: fyi you can also spam hubbot privately in a direct message to debug14:03
arxcruz|ruckrlandy|rover: myoung which bj ?14:03
pandaarxcruz|ruck: %config plugins.GateStatus.changeIDs14:03
rlandy|roverhttps://bluejeans.com/u/rlandy/14:03
rlandy|roverarxcruz|ruck: myoung: ^^14:03
arxcruz|ruckrlandy|rover: let me just grab more water14:03
myoungcool incoming, i need 120 sec tho14:03
myoungbrt14:03
myoungmaybe 180 sec14:03
quiquellpanda: Have to go to the kindergarden14:08
*** EmilienM_PTO is now known as EmilienM14:08
pandaquiquell: oh, ok, we just finished ...14:08
pandaquiquell: you're here tomorrow morning ?14:08
quiquellpanda: Yep, looks like n -> n + 1 featureset050 is working with rlandy|rover change14:09
pandaquiquell: already ???14:09
quiquellI see build-test-packages and install repo at undercloud install, but have to check with someone else14:10
pandaquiquell: oh, but without change injection probably14:10
pandaoh, ok14:10
pandayeah14:10
quiquellpanda: This is the change, take a look also to the Depends-On https://review.openstack.org/#/c/570888/14:10
*** marios has quit IRC14:16
myoungrlandy|rover: what's your meeting id14:16
myoungthe #14:16
*** quiquell is now known as quique14:16
*** quique is now known as quiquell|off14:16
*** marios has joined #oooq14:16
arxcruz|ruckrasca: around ?14:18
myoungarxcruz|ruck: rlandy|rover https://code.engineering.redhat.com/gerrit/#/c/13925614:18
ssbarneapanda: thanks for the MRO, added a bookmark. I also know why I didn't know it: I avoided multiple inheritance like the plague. I know one case where I avoided it by monkey patching Requests.session to implement retries on it.14:21
pandassbarnea: yes, exactly the case in which it's used, you have your base class but also need to override an existing API14:22
pandassbarnea: anyway it's something you encounter also when you start dealing with metaclasses14:23
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.14:23
pandabut, advanced and very rare usage, hence the question was "bonus"14:23
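The case panda describes (a base class plus a need to override an existing API) can be sketched with a mixin that relies on the method resolution order. All class names below are hypothetical; `FlakyBase` is a stand-in for a real client such as a Requests session, not its actual API.

```python
class FlakyBase:
    """Stand-in for an existing client class: fails twice, then succeeds."""

    def __init__(self):
        self.calls = 0

    def get(self, url):
        self.calls += 1
        if self.calls < 3:
            raise IOError("transient failure")
        return "ok:" + url


class RetryMixin:
    """Overrides get(); super() resolves to the next class in the MRO."""

    retries = 5

    def get(self, url):
        for attempt in range(self.retries):
            try:
                return super().get(url)
            except IOError:
                # Re-raise only once the last attempt has failed.
                if attempt == self.retries - 1:
                    raise


class Client(RetryMixin, FlakyBase):
    """RetryMixin comes first, so its get() wraps FlakyBase.get()."""


mro_names = [cls.__name__ for cls in Client.mro()]
client = Client()
result = client.get("x")
print(mro_names, result, client.calls)
```

Because `RetryMixin` precedes `FlakyBase` in `Client.mro()`, the retry logic wraps the original method without monkey patching it.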
ssbarneathe best bonus for me would be to avoid having to add jenkins groovy to my daily plate, i got enough of it, i find plain bash 10x better (and portable)14:26
ssbarneaout of curiosity, what is your policy/practice regarding ansible compatibility? how bleeding edge or conservative?14:29
*** skramaja has quit IRC14:29
*** moguimar has quit IRC14:32
pandassbarnea: we are currently one release behind the stable14:32
*** moguimar has joined #oooq14:34
ssbarneaand i suppose that's usually until they release 2.x.1 to fix the bugs affecting you. i was considering creating a job downstream to test with prereleases in order to spot bugs before they break them.14:34
ssbarneai use devel locally sometimes, a source of joy :D14:35
pandassbarnea: we have to pin because moving means testing that every single job still works14:35
pandassbarnea: testing a single job is not enough unfortunately, we need 100% coverage.14:36
pandassbarnea: we know for example that 2.8 will be a pain, because they are deprecating some stuff we use14:36
rlandy|roverpanda: trown: are we going public with the libvirt reproducer? if so, I still have this doc review outstanding: https://review.openstack.org/#/c/566155/14:46
pandarlandy|rover: I think more than public, the first step is just test it to check  if we are breaking it. Like trown said, if we go public then we need to maintain it. A whole new level of commitment...14:47
*** saneax has quit IRC14:47
*** ykarel is now known as ykarel|away14:53
rlandy|roverquiquell|off: panda: can we w+1 https://review.openstack.org/#/c/568946/?14:56
rlandy|roverwes and Emilien voted14:56
rascahey arxcruz|ruck you pinged me above here... Do you still need me?14:57
rlandy|roverI'd like someone from the sprint to approve14:57
arxcruz|ruckrasca: nah, already have my questions answered14:57
rascaarxcruz|ruck, aCK14:59
pandarlandy|rover: approved, with comment.15:00
rlandy|roverp15:01
rlandy|roverpanda: thanks15:01
myoungarxcruz|ruck: do we have an LP tracking the promoter networking / dns issue(s) that are causing us to have to run a private promoter?15:01
*** ykarel|away is now known as ykarel15:02
pandarlandy|rover: no no, thank you.15:06
rlandy|rovermyoung: arxcruz|ruck: not as far as I know15:14
rlandy|roverI think we should try the rdocloud one again15:14
*** rfolco_ has joined #oooq15:21
*** rfolco has quit IRC15:23
arxcruz|ruckmyoung: no, do i need to open the bug for the ocata change ?15:29
myoungarxcruz|ruck: we need a LP to track the promoter issue(s), would like to return to using the tripleo-infra instance.  regarding ocata issue we already have https://bugs.launchpad.net/tripleo/+bug/177407915:34
openstackLaunchpad bug 1774079 in tripleo "[ocata promotion] phase1 (ci.centos) job tripleo-quickstart-promote-ocata-rdo_trunk-minimal fails introspection/deploy "No valid host found"" [Critical,Triaged]15:34
arxcruz|ruckmyoung: https://bugs.launchpad.net/tripleo/+bug/177422015:35
openstackLaunchpad bug 1774220 in tripleo "Promoter server is having DNS issues" [High,Triaged]15:35
arxcruz|ruckmyoung: i meant the hubbot one15:36
*** udesale has quit IRC15:36
myoungarxcruz|ruck: updated https://bugs.launchpad.net/tripleo/+bug/177422015:38
openstackLaunchpad bug 1774220 in tripleo "Promoter server is impacted by networking issues and is offline (running private instance presently)" [Critical,Triaged]15:38
arxcruz|ruckmyoung: nice english words :P15:39
myoungarxcruz|ruck: we suspect DNS, but don't know it...15:39
myoungarxcruz|ruck: regarding hubbot issues, IMHO yes, please create a LP for that.  I'm generally of the opinion that we should track all things in LP/storyboard (when we get there) as it's transparent, and helps us to understand where/how we spend time.  Things not tracked are invisible.  Lots of invisible effort yields burnout and other sorts of bad things.15:40
*** udesale has joined #oooq15:43
arxcruz|ruckmyoung: do you have the change number ?15:43
arxcruz|ruckhubbot: %config plugins.GateStatus.changeIDs15:44
hubbotarxcruz|ruck: Error: "%config" is not a valid command.15:44
arxcruz|ruck%config plugins.GateStatus.changeIDs15:44
hubbotarxcruz|ruck: I0cbf9ffb8552411e4dd891c38702ff8d1f6db5b1 I214272a6f25feb75496e44eb0a16269c6ee4cfe2 I4c5bdf00ce8cf7eabf669b248b99cb8443e82fab If12c8fe9bd0bea98a4842f279399285344f2224615:44
myoungarxcruz|ruck: see the top few lines of sprint-14 etherpad15:45
arxcruz|ruckk15:46
*** ykarel is now known as ykarel|away15:50
*** ykarel|away has quit IRC16:02
*** zoli is now known as zoli|gone16:04
*** zoli|gone is now known as zoli16:04
*** saneax has joined #oooq16:18
*** panda is now known as panda|off16:19
chandankumarrfolco_: is this one [rhelosp-12.0-unittest] enabled?16:21
chandankumarrfolco_: http://download-node-02.eng.bos.redhat.com/rel-eng/repos/rhos-12.0-rhel-7-testdeps/tagged both the packages are available there16:22
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.16:23
rfolco_chandankumar, ok I tried to physically find the rpm16:24
rfolco_http://download-node-02.eng.bos.redhat.com/rel-eng/repos/rhos-12.0-rhel-7-testdeps/x86_64/16:24
rfolco_chandankumar, looks like the tagged rpms are somewhere else16:24
*** trown is now known as trown|lunch16:26
chandankumarrfolco_: yup, try this one yum-config-manager --enable <unit test repo name>16:26
chandankumarit will work16:26
*** udesale has quit IRC16:26
*** marios has quit IRC16:27
rfolco_chandankumar, cool thanks :)16:27
*** rlandy|rover is now known as rlandy|rover|brb16:49
*** holser__ has quit IRC16:57
*** dtantsur is now known as dtantsur|afk17:00
*** amoralej is now known as amoralej|off17:07
*** tesseract has quit IRC17:10
*** trown|lunch is now known as trown17:39
myoungarxcruz|ruck: do we have the hubbot bug?17:40
*** rlandy|rover|brb is now known as rlandy|rover17:46
*** gvrangan has joined #oooq17:53
arxcruz|ruckmyoung: sorry, not yet, let me do that now for you, it's already 8pm here ;)17:54
rlandy|roverand now we have a 504 deployment error17:56
rlandy|roverit keeps changing17:56
arxcruz|ruckmyoung: actually, i think it's working, because i'm seeing logs for today, but no comments in gerrit, perhaps something changed ?17:57
myoungarxcruz|ruck: hubbot?18:02
myoung%gatestatus18:02
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.18:02
* myoung notes the duplicated stable/ocata18:02
arxcruz|ruckhmmm interesting18:03
arxcruz|ruckfunny part is stable/ocata has failures18:04
arxcruz|ruckhttps://review.openstack.org/#/c/564291/18:04
myoungarxcruz|ruck: afaik it looks for 2 failures in a row18:05
myoungso it sees the http://logs.openstack.org/91/564291/14/check/tripleo-ci-centos-7-undercloud-oooq/f420bbc, but isn't alerting until that same job fails a second time (I think)18:05
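The rule myoung recalls, alerting only once the same job has failed twice in a row, can be sketched as follows. This illustrates the idea only and is not taken from hubbot's source; the function name, the result strings, and the `threshold` parameter are assumptions.

```python
def should_alert(results, threshold=2):
    """Return True if the most recent `threshold` results are all failures.

    `results` is the ordered history of one job, oldest first, with
    hypothetical values "SUCCESS" or "FAILURE".
    """
    if len(results) < threshold:
        return False
    return all(result == "FAILURE" for result in results[-threshold:])


# One failure after a success does not alert; a second one in a row does.
single_failure = should_alert(["SUCCESS", "FAILURE"])
repeated_failure = should_alert(["SUCCESS", "FAILURE", "FAILURE"])
print(single_failure, repeated_failure)
```

Under this rule a single failed run, like the one linked above, would still be reported as "working fine" until the next run of that job also fails.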
arxcruz|ruckmyoung: ok18:06
arxcruz|rucki need to check the code for hubbot cuz i'm a bit lost :/18:06
myoungarxcruz|ruck: this might help18:07
myounghttps://trello.com/c/vdDrtoee/50-hubbot-is-private-code-running-on-a-private-server-lets-open-this-up-and-run-on-a-shared-instance18:07
myoungand this:18:07
myounghttps://etherpad.openstack.org/p/tripleo-ci-hubbot-configuration18:07
myoung^^ bj recording there as well18:07
myoungarxcruz|ruck: if I recall correctly https://trello.com/c/iWAz4ONC/73-hubbot-bot-add-two-dummy-changes-on-tht-to-watch#comment-5ae1a62c4600414413011626 was the last note/item on the hubbot config work18:08
myoung(until now)18:09
*** ccamacho has quit IRC18:11
rlandy|rovercan we manually kick the ocata promotion?18:20
rlandy|roverfrom rdocloud18:20
rlandy|roveron 24-hr cycle18:20
rlandy|rover4 days delayed18:20
rlandy|roverhttps://logs.rdoproject.org/openstack-periodic-24hr/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-ocata-upload/7175801/undercloud/home/jenkins/overcloud_validate.log.txt.gz18:21
rlandy|roverarxcruz|ruck: ^^18:21
rlandy|roverI'd like to rekick that once we figure out the dns issues18:21
rlandy|rovercould also be a dns problem18:21
rlandy|rovermyoung: actually  - can we rekick ocata on the promoter server?18:23
rlandy|roveryours?18:23
rlandy|roverpromotion should have happened on 05/2818:23
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.18:23
rlandy|roverto current-tripleo18:24
myoungrlandy|rover: ack...looking now18:37
rlandy|rovermyoung: last current-tripleo is 05/2618:37
rlandy|roverI think we should have had a successful promotion 05/2818:38
rlandy|rovermaybe that was the latest hash?18:38
rlandy|roverfailures over the last two days18:38
myoungrlandy|rover: here's past 3 days of promotions to ocata...18:39
myoung2018-05-30 00:00:27, https://trunk.rdoproject.org/centos7-ocata/00/91/00914c802ded15a6ad4643a1a8b277a5342ed5a6_e0df978b, tripleo-ci-testing18:39
myoung2018-05-29 00:00:35, https://trunk.rdoproject.org/centos7-ocata/d8/4a/d84a6ecd344b9c3513596b476172fa1c890a2fc6_1558157c, tripleo-ci-testing18:39
myoung2018-05-28 00:00:29, https://trunk.rdoproject.org/centos7-ocata/0a/d7/0ad78bda27846527d1b755cd7a248e35ef6c6932_787a5938, tripleo-ci-testing18:39
myoung2018-05-27 00:20:11, https://trunk.rdoproject.org/centos7-ocata/0a/d7/0ad78bda27846527d1b755cd7a248e35ef6c6932_787a5938, current-tripleo18:39
myounglooking at logs now18:39
rlandy|roverchecking hashes18:42
myoungrlandy|rover: most recent hash is missing fs218:42
myounghttps://paste.fedoraproject.org/paste/iiZmHsRP8Nd-cr9mvQV~dQ18:42
rlandy|rovermyoung: yeah - that is correct - that job failed18:43
rlandy|rovershould have passed on 05/28 though18:43
myoungthe hash prior (d84a6ecd344b9c3513596b476172fa1c890a2fc6_1558157c) is missing 22018-05-30 13:44:11,342 25536 INFO     promoter Skipping promotion of tripleo-ci-testing to current-tripleo, missing successful jobs: ['periodic-ovb-1ctlr_1comp-featureset002', 'periodic-ovb-1ctlr_1comp-featureset020', 'periodic-ovb-3ctlr_1comp-featureset001']18:43
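[Editor's note] The promoter log line above reports the required jobs that have no successful run for the candidate hash. A minimal sketch of that check, assuming it boils down to a set difference; the function names and the extra passing job are hypothetical and are not taken from the promoter's source.

```python
def missing_jobs(required_jobs, successful_jobs):
    """Return required jobs that have no successful run, sorted by name."""
    return sorted(set(required_jobs) - set(successful_jobs))


def can_promote(required_jobs, successful_jobs):
    """A hash is promotable only when nothing required is missing."""
    return not missing_jobs(required_jobs, successful_jobs)


# The three job names come from the log line; the fourth is a made-up
# placeholder standing in for any required job that did pass.
required = [
    "periodic-ovb-1ctlr_1comp-featureset002",
    "periodic-ovb-1ctlr_1comp-featureset020",
    "periodic-ovb-3ctlr_1comp-featureset001",
    "hypothetical-job-that-passed",
]
successful = ["hypothetical-job-that-passed"]

missing = missing_jobs(required, successful)
promote = can_promote(required, successful)
```

With these inputs the three periodic jobs are reported missing, so the hash would stay at tripleo-ci-testing.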
* myoung looks18:43
rlandy|rovermyoung: sorry to bother you again on this ... I'll check it tomorrow18:43
myoungnot a bother18:44
rlandy|roverthe hash is 4 days out18:44
rlandy|roverI would think it would be 218:44
rlandy|rovernvm18:44
rlandy|roverthis promotion is making me crazy :)18:46
myoungrlandy|rover: i don't see any others that passed tripleo-ci-testing that would be candidates for current-tripleo in past 6 days18:47
rlandy|roverhttps://logs.rdoproject.org/openstack-periodic-24hr/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-ocata-upload/b494da6/18:47
rlandy|roveron 05/28 all promote jobs passed18:47
myoungrlandy|rover: correct, that job was testing hash: 0ad78bda27846527d1b755cd7a248e35ef6c6932_787a5938 -->  https://logs.rdoproject.org/openstack-periodic-24hr/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-ocata-upload/b494da6/console.txt.gz#_2018-05-28_04_02_38_66918:51
myoungand was promoted --> 2018-05-27 00:20:11, https://trunk.rdoproject.org/centos7-ocata/0a/d7/0ad78bda27846527d1b755cd7a248e35ef6c6932_787a5938, current-tripleo18:52
myoungis currently https://trunk.rdoproject.org/centos7-ocata/current-tripleo/commit.yaml18:52
*** atoth has quit IRC18:55
*** yolanda has joined #oooq18:57
*** yolanda_ has quit IRC18:59
*** gvrangan has quit IRC20:09
rlandy|roverhttps://rhos-dev-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/rdo-promote-queens-rdo_trunk/20:10
rlandy|rovermyoung: ^^ why is rdo-queens-promote-rdo_trunk-build-images still marked as a failure?20:10
* myoung looks20:12
myoung00:00:30.367 cmd2 requires Python '>=3.4' but the running Python is 2.7.1320:13
myoung00:00:30.455 python setup.py install failed20:13
rlandy|roverI reran it20:13
myounghttps://rhos-dev-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/rdo-queens-promote-rdo_trunk-build-images/72/console20:13
rlandy|roverjob is marked as green20:14
myoung.ahh i see...sec20:14
rlandy|roverhttps://rhos-dev-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/rdo-queens-promote-rdo_trunk-build-images/73/consoleFull20:14
rlandy|roverit should kick the next test cycle20:14
myoungi'm guessing "retry" was kicked on the build job itself20:16
myoungof #7320:16
myoungya20:16
myoungStarted by Naginator after the failure of build #7220:16
rlandy|roveroh I did that20:16
rlandy|roverwhat should I have done?20:16
myoungif you run the top level multijob it should just run20:16
rlandy|roverretry there?20:16
myoungjenkins naginator / retry on an individual phase of a multijob will just run that phase20:16
rlandy|roveror rebuild?20:16
rlandy|roverretrying - thanks20:17
myoungthey are basically the same thing, at least at multijob level20:17
myoungso "retry" will rekick a job, feeding it the same parameters as before20:17
rlandy|rovercool20:17
EmilienMis wes on pto?20:17
myoungso if it's a build/test job, things like "current_build" get resent20:17
EmilienMyes he is20:17
myoung"rebuild" does the same thing, but will give the opportunity (via UI) to change input params20:18
EmilienMhe's always a pto20:18
myoungsince top level multijob has no params...retry = rebuild20:18
EmilienMin*20:18
myoungEmilienM: he's back Monday20:18
EmilienMhe's always back on Monday :D and then he leaves again20:18
EmilienMlol20:18
rlandy|roverwell - it's rekicking now20:18
myoungrlandy|rover: aye watching https://rhos-dev-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/rdo-promote-queens-rdo_trunk/93/20:18
rlandy|roverwe have as redo on https://bugs.launchpad.net/tripleo/+bug/175087420:18
openstackLaunchpad bug 1750874 in tripleo "[queens promotion] fs001 fails overcloud deploy with 'Authentication failed'" [Critical,Fix released] - Assigned to Alex Schultz (alex-schultz)20:18
rlandy|roveron queens20:18
rlandy|roverwhere is he going now?20:19
rlandy|roverEmilienM: welcome back20:19
EmilienMrlandy|rover: thanks :D20:19
rlandy|roverat least some people come back20:19
EmilienMrlandy|rover: what did I miss? :-P20:19
myoungEmilienM: heh20:19
rlandy|roverEmilienM: nothing :( ... still trying to get https://review.openstack.org/#/c/568946/ through gates20:20
EmilienMI see you folks doing well, lot of green everywhere20:20
EmilienMrlandy|rover: yeah but it's almost merged I think :D20:20
EmilienMthanks again for this work20:20
* myoung blames rlandy|rover for the green20:20
rlandy|roverEmilienM: the green is a lie20:20
EmilienMlol20:20
myoungthere's a joke in there somewhere20:20
rlandy|roverwe adjust the promotion criteria20:20
EmilienMexit 020:20
EmilienMrlandy|rover: how badly?20:20
rlandy|roverEmilienM: you don't want to know20:21
* rlandy|rover feels shame20:21
rlandy|roverbut not upstream20:21
rlandy|roverthat is truly green20:21
EmilienMlol20:21
EmilienMcan I help with something?20:22
* EmilienM late in the party20:22
rlandy|roveroh no - lots of party left for us all20:22
myoungto be fair...the criteria we've been using in RDO2 hasn't really changed...we just have cognitive dissonance between how we're modelling promoter config "these jobs must pass" with what reality looks like "this job, and one of these 3 must pass"20:22
* myoung fetches EmilienM a party hat20:22
rlandy|rovermyoung: you should be a political spokesman20:23
rlandy|roverie: we're not really cheating it's more "cognitive dissonance"20:23
myoungEmilienM: no sprint would be complete without the intermittent RDO Cloud networking stuff as well20:23
hubbotFAILING CHECK JOBS on stable/ocata: tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/56429120:23
myoungrlandy|rover: "mmmmm...what is cheating really...."20:23
myoung:)20:24
rlandy|roveroh whatever ...20:24
myoungi think TBH there's a bit of technical debt20:24
EmilienMI think we have seen worse situations at this stage of the cyle20:24
myoung(low priority)20:24
EmilienMcycle*20:24
myoungto have our promoter config match reality20:24
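[Editor's note] One possible way to make the config match the "this job, and one of these 3 must pass" reality myoung mentions is to express the criteria as groups, where every group needs at least one passing job. This is a sketch under that assumption only; the grouping format and all job names are hypothetical and do not describe the real promoter config.

```python
def criteria_met(criteria, successful_jobs):
    """Each group in `criteria` needs at least one job in `successful_jobs`."""
    passed = set(successful_jobs)
    return all(bool(passed & set(group)) for group in criteria)


# Hypothetical criteria: job-a must pass, plus any one of job-b/c/d.
criteria = [
    ["job-a"],
    ["job-b", "job-c", "job-d"],
]

strict_fail = criteria_met(criteria, ["job-b", "job-c", "job-d"])  # job-a missing
relaxed_ok = criteria_met(criteria, ["job-a", "job-c"])  # one of the three is enough
```

A flat "all of these must pass" list is just the special case where every group holds a single job.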
rlandy|roverright - so really - it's ocata and phase two that are not great now20:25
rlandy|roverthe rest is ok20:25
myoungphase 2 is mostly ok20:25
rlandy|roverocata is like a random experiment20:25
myoungheh20:25
rlandy|rovera new job a new error20:25
*** trown is now known as trown|outtypewww20:25
rlandy|rovernever a repeat20:25
myoungdid i see/parse correctly btw that now it's a 504?20:25
myoungvs. a 40420:26
myoungor a 50020:26
rlandy|roverlast job was 50420:26
rlandy|roverthe one before the prep-network error20:26
rlandy|roverthe one after the introspection error20:26
rlandy|roversee  - party party20:26
myounghttps://restlet.com/http-status-map20:27
myoung^^ where do you want to go today?20:28
myoungSPIN THE WHEEL, kick the job20:28
EmilienMoh nice20:28
rlandy|roveroh that's hysterical, sad but hysterical20:28
myoungif we couldn't laugh...we would have to cry20:28
* myoung goes back to attempting to get sprint 13 and 14 status out (late)20:29
*** florianf has quit IRC20:34
rlandy|rovermyoung:???21:50
rlandy|roverfire?21:50
myoungyo22:00
rlandy|roverugh  - someone just shoot me :(22:19
*** rlandy|rover is now known as rlandy|rover|bbl22:23
hubbotFAILING CHECK JOBS on stable/ocata: tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/56429122:23
*** tosky has quit IRC23:04
*** saneax has quit IRC23:33
*** sanjayu_ has joined #oooq23:33
