Tuesday, 2018-05-29

hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)00:21
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)02:21
*** yolanda_ has quit IRC02:41
*** rasca has quit IRC02:41
*** yolanda_ has joined #oooq02:42
*** rasca has joined #oooq02:42
*** apetrich has quit IRC02:44
*** rnoriega has quit IRC02:44
*** rfolco has quit IRC02:44
*** myoung has quit IRC02:44
*** apetrich has joined #oooq02:45
*** rnoriega has joined #oooq02:45
*** rfolco has joined #oooq02:45
*** myoung has joined #oooq02:45
*** bandini has quit IRC02:45
*** bandini has joined #oooq02:46
*** ssbarnea has quit IRC02:46
*** panda|pto has quit IRC02:46
*** dougbtv_ has quit IRC02:46
*** sai_ has quit IRC02:46
*** strattao has quit IRC02:46
*** ssbarnea has joined #oooq02:47
*** panda|pto has joined #oooq02:47
*** dougbtv_ has joined #oooq02:47
*** sai_ has joined #oooq02:47
*** strattao has joined #oooq02:47
*** jrist has quit IRC02:50
*** jschluet has quit IRC02:50
*** jaganathan has quit IRC02:50
*** weshay_pto has quit IRC02:50
*** fuzzball81 has quit IRC02:50
*** bandini has quit IRC02:50
*** sshnaidm has quit IRC02:50
*** jaosorior has quit IRC02:50
*** hamzy has quit IRC02:50
*** rook has quit IRC02:50
*** dalvarez has quit IRC02:50
*** jrist has joined #oooq02:50
*** jschluet has joined #oooq02:50
*** bandini has joined #oooq02:51
*** sshnaidm has joined #oooq02:51
*** jaosorior has joined #oooq02:51
*** hamzy has joined #oooq02:51
*** rook has joined #oooq02:51
*** dalvarez has joined #oooq02:51
*** jaganathan has joined #oooq02:51
*** weshay_pto has joined #oooq02:51
*** fuzzball81 has joined #oooq02:51
*** ykarel|away has joined #oooq03:46
*** ykarel|away is now known as ykarel04:18
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)04:21
*** udesale has joined #oooq04:26
*** ykarel has quit IRC04:31
*** ykarel has joined #oooq04:46
*** yolanda has joined #oooq05:33
*** yolanda_ has quit IRC05:35
*** quiquell|off is now known as quiquell05:36
*** pgadiya has joined #oooq05:41
*** pgadiya has quit IRC05:41
*** jfrancoa has joined #oooq05:41
*** marios has joined #oooq05:44
*** ratailor has joined #oooq05:46
*** kopecmartin has joined #oooq05:55
*** matbu has joined #oooq05:58
*** links has joined #oooq06:02
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)06:21
*** saneax has joined #oooq06:39
*** dmellado has joined #oooq07:12
*** tosky has joined #oooq07:14
*** tesseract has joined #oooq07:14
*** amoralej|off is now known as amoralej07:20
myoungarxcruz|ruck, quiquell, check mail about promoter07:24
myoungarxcruz|ruck, quiquell late friday night it was not working...so we've been running it locally on a box in westford over the long weekend.07:25
myoungdoes this mean we have 2 promoters running now?07:25
* myoung checks before going to bed07:25
myoungarxcruz|ruck: promoter is still running on my machine...07:26
*** bogdando has joined #oooq07:26
myoungarxcruz|ruck: check mail07:28
*** myoung is now known as myoung|zzz07:28
quiquellmyoung|zzz: We have two promoters07:30
quiquellmyoung|zzz: Has no email07:30
*** quiquell is now known as quiquell|afk07:30
*** jtomasek has joined #oooq07:39
*** ykarel is now known as ykarel|lunch07:40
*** zoli is now known as zoli|wfh07:45
*** zoli|wfh is now known as zoli07:45
*** jaosorior has quit IRC07:48
bogdandoo/ folks. Do you have any ideas how to debug https://logs.rdoproject.org/16/566916/5/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-master/Zca96f34da3724dd89cb0c5ae271cc953/undercloud/home/jenkins/overcloud_deploy.log.txt.gz#_2018-05-28_09_29_09 ? It can not be reproduced via the CI reproduce script on RDO cloud08:00
*** yolanda has quit IRC08:01
arxcruz|ruckbogdando: no idea...08:01
* bogdando sigh08:02
bogdandoat first glance the difference may come from the images used for jobs prolly08:02
bogdandobut I'm not sure08:02
bogdandofor the fs035 job vs that reproduce script does08:03
arxcruz|ruckweird, reproduce should 'reproduce' hehe :/08:03
bogdandoindeed :D08:03
arxcruz|ruck!gatestatus08:03
openstackarxcruz|ruck: Error: "gatestatus" is not a valid command.08:03
arxcruz|ruck!gate08:03
openstackarxcruz|ruck: Error: "gate" is not a valid command.08:03
bogdandotry elleven? :)08:04
arxcruz|ruckdidn't get the reference... :/08:04
*** dtantsur|afk is now known as dtantsur08:04
arxcruz|ruckoh, it's the cmd2 problem :/08:05
bogdandohttps://www.youtube.com/watch?v=NMS2VnDveP808:05
arxcruz|ruckLOL08:05
arxcruz|ruckbogdando: i know that from britsh elevator08:05
arxcruz|ruckyou said elleven i thought was something about stranger things08:05
bogdando:)08:06
chandankumar!hannah08:11
openstackchandankumar: Error: "hannah" is not a valid command.08:11
chandankumarhubbot: it is yes or no08:12
hubbotchandankumar: Error: "it" is not a valid command.08:12
*** yolanda has joined #oooq08:17
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)08:21
*** holser__ has joined #oooq08:30
*** jaosorior has joined #oooq08:32
*** ykarel|lunch is now known as ykarel08:35
arxcruz|ruckrasca: around ?08:43
rascaarxcruz|ruck, sure08:43
arxcruz|ruckrasca: cmd2 still failing08:43
*** d0ugal has joined #oooq08:43
rascaarxcruz|ruck, where exactly? Isn't it the mirror thing that bandini was talking about earlier?08:44
arxcruz|ruckrasca: https://review.rdoproject.org/jenkins/job/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-pike-branch/4916/console08:44
quiquell|afkarxcruz|ruck: I am going to shutdown the promoter in the promoter server08:44
arxcruz|ruckquiquell|afk: why ?08:48
arxcruz|ruckquiquell|afk: did you saw my email ?08:48
arxcruz|ruckrasca: it seems the problem is with ara08:48
quiquell|afkarxcruz|ruck: Looks like myoung has a promoter already running ?08:49
arxcruz|ruckquiquell|afk: seems so08:49
quiquell|afkarxcruz|ruck: That's why I want to shutdown the one at promoter server08:52
arxcruz|ruckquiquell|afk: ok08:52
rascaarxcruz|ruck, please help me to understand. The log you linked above is failing for the same reason, but my question is: is it quickstart or not?08:54
arxcruz|ruckrasca: that's what i'm trying to identify08:54
quiquell|afksshnaidm: If I want to force a n -> n + 108:54
quiquell|afkmeaning queens -> master08:55
quiquell|afkThis is enough ?08:55
*** bogdando has quit IRC08:55
quiquell|afkhttps://review.openstack.org/#/c/570823/08:55
arxcruz|ruckrasca: doesn't seems to be only us http://logs.openstack.org/03/565703/11/check/nova-live-migration/fd8785e/job-output.txt.gz08:56
*** quiquell|afk is now known as quiquell08:56
arxcruz|ruckrasca: 10:24 <CrayZee> Hi Infra, cmd2 0.9.1 was released about 10 hours ago and broke zuul. Is anyone looking at it? (See http://logs.openstack.org/61/566961/3/gate/neutron-fullstack/11b8515/job-output.txt.gz#_2018-05-28_15_56_36_548759)08:56
rascaarxcruz|ruck, I see, as I was saying yesterday... This will hit us hard08:56
arxcruz|ruckon o-infra08:56
arxcruz|ruckrasca: seems to be ara -> cliff -> cmd208:57
rascaarxcruz|ruck, well, at least for my deployments (rdophase2) it's just cmd2 and my patch does the trick08:57
arxcruz|ruckrasca: yeah, because you don't use ara right ?08:58
rascaarxcruz|ruck, in quickstart, inside the requirements ara==0.14.508:59
rascaarxcruz|ruck, so we can't be touched by this08:59
ykarelarxcruz|ruck, rasca in case u have missed updates about cmd2:- https://github.com/python-cmd2/cmd2/issues/421, no only mirrors needs to be fixed09:00
ykarelsomehow rdo mirror is fixed09:00
ykareland phase1 passed09:01
ykarelamoralej, do you know about ^^ mirror fixes?09:01
quiquellsshnaidm: +1v https://review.openstack.org/#/c/570167/ export  ZUUL_CHANGES09:02
amoralejykarel, what mirror fixes?09:02
ykarelamoralej, cmd2 0.9.1 is removed from pypi,09:03
ykarelso mirror should be updated for this removal, rdo mirror fixed somehow,09:03
ykarelbut openstack-infra mirrors still have this broken cmd209:03
*** skramaja has joined #oooq09:03
amoralejykarel, i still see 0.9.1 in pypi09:04
ykarelamoralej, https://github.com/python-cmd2/cmd2/issues/42109:05
*** ratailor has quit IRC09:05
*** bogdando has joined #oooq09:05
ykarelhmm got it on phase1 mirror was not used, that'w why it fixed there09:06
ykarelhttps://files.pythonhosted.org/packages/53/e5/5ec46c74d13b488dedc93d5ff951f459bd898f4315c7564fb67f30d7aec9/ is used09:06
ykarelamoralej, try: pip install cmd2==0.9.1,09:07
amoralejyes, i'm testing it09:07
*** ratailor has joined #oooq09:08
ykarelack09:08
amoralejyeah, it looks fixed now09:09
ykarelso they fixed it for python2, 0.9+ are available only for python309:09
amoralejyeah, that's what i see09:10
ykarelpip3 install cmd2==0.9.1 works09:10
amoraleji'm not sure how mirrors get updated09:10
amoralejin rdo-ci and upstream09:10
amoraleji just saw a job failing still arounc one hour ago https://review.rdoproject.org/jenkins/job/rdoinfo-tripleo-pike-release-centos-7-multinode-1ctlr-featureset006-nv/331/console09:10
ykarelhmm then we need to wait for infra guys, as those needs to be update09:10
arxcruz|ruckjaosorior: hey man, last time i touch a puppet file was 5 years ago, come on, i don't know how to do those stuff :P09:13
sshnaidmarxcruz|ruck, fyi: http://logs.openstack.org/67/570167/2/gate/tripleo-ci-centos-7-scenario001-multinode-oooq-container/cf9ccc3/job-output.txt.gz#_2018-05-29_09_13_53_49617309:22
arxcruz|rucksshnaidm: we are aware09:23
sshnaidmarxcruz|ruck, oh, just saw it in alerts09:23
arxcruz|rucksshnaidm: everything that uses ara is broken09:23
arxcruz|rucksshnaidm: https://review.openstack.org/#/c/570822/09:23
arxcruz|ruckthis will fix it09:23
*** jfrancoa has quit IRC09:24
sshnaidmgreat09:25
ykarelmay be they will go with mirror fix instead of ^^09:25
arxcruz|ruckykarel: no idea, whatever works, i'm okay :)09:27
ykarelyup09:27
*** moguimar has joined #oooq09:28
sshnaidmquiquell, need to think how better to detect/alert such issues in grafana ^^09:29
*** panda|pto is now known as panda09:32
quiquellsshnaidm: This goes beyond the build result09:32
sshnaidmdoes anybody have problems with internal IRC?09:32
sshnaidmquiquell, yeah, need some alert about check jobs too, not only gates..09:33
quiquellsshnaidm: But you don't know error is legit and what not09:33
sshnaidmquiquell, I think if specific job fails >3 times for various patches, it could be thought as broken09:34
sshnaidmquiquell, it's possible that 3 broken patches in a row, but chance is not big09:35
quiquellsshnaidm: We have to group per review and patchset09:36
quiquellpanda: Good come back09:36
pandaquiquell: hold it, this is a weird sprint for me, PTO tomorrow morning too09:37
quiquellpanda: Holy shit, what are you doing here today09:37
pandaquiquell: then all next week, then maybe some other days the one after09:37
pandaquiquell: it's a long story09:37
quiquellpanda: Ok09:38
*** moguimar has quit IRC09:38
pandaquiquell: and anyway, it's holy shift09:39
quiquellpanda: fs051 (overcloud upgrade) has a success in upstream09:40
quiquellMaybe we can add it to check/gate09:40
quiquellupgrade guys fixed it09:40
arxcruz|ruckrasca: do you have any idea here ? https://thirdparty.logs.rdoproject.org/jenkins-oooq-master-rdo_trunk-bmu-haa16-lab-float_nic_with_vlans-166/console.txt.gz#_2018-05-26_01_47_29_22609:42
rascaarxcruz|ruck, you don't have to worry about this one, this is an ha test failed, but the deployment was successful, and in any case the next job was fine09:43
rascaarxcruz|ruck, https://thirdparty.logs.rdoproject.org/jenkins-oooq-master-rdo_trunk-bmu-haa16-lab-float_nic_with_vlans-167/console.txt.gz09:43
arxcruz|ruckrasca: phase3 2 is 4 days behind09:44
rascaphase3 ?09:44
rascaarxcruz|ruck ^^^09:44
arxcruz|ruckrasca: phase 209:44
arxcruz|rucksorry09:44
arxcruz|ruckhehehe09:44
rascaarxcruz|ruck, uhm before job 167 (which as I said was successful) this maste job failed a gazillion of time... So why we care about it today?09:45
*** anande has joined #oooq09:49
arxcruz|ruckrasca: if you say we are good, i'm good, was just checking phase 2 jobs09:50
rascaarxcruz|ruck, we are good. At least on that side :)09:51
quiquellpanda: Reading https://trello.com/c/ZPNYHG3F/775-ci-job-make-job-50-gate09:53
quiquellThere is already a job that does that, how do we ensure that we move it to gate ?09:53
jaosoriorarxcruz|ruck: that's why I gave the docs :D. But yeah, unfortunately I don't have merge rights in that repo. I'm set as a collaborator but can't merge stuff :/09:54
pandaquiquell: you have to move it to the gate pipeline09:54
*** Tengu has joined #oooq09:54
Tenguhello there! I have a small question regarding a patch I'm submitting: https://review.openstack.org/#/c/570841/  in the comment, there's a remark from jaosorior - and I don't really know the answer. Care to have a look?09:55
arxcruz|ruckjaosorior: yeah, but i don't know exactly what to do, add in the service start in certmonger ?09:56
quiquellpanda: We have to ensure that is working first09:58
jaosoriorarxcruz|ruck: no, so, just add notify => Service['certmonger']09:58
jaosoriorarxcruz|ruck: and that'll restart certmonger if 'dbus' is restarted09:59
arxcruz|ruckjaosorior: it's not needed09:59
jaosoriorit isn't?09:59
arxcruz|ruckjaosorior: nope, because you will install certmonger first, then start the service09:59
arxcruz|ruckthe failure is just if dbus is updated without a reboot09:59
jaosoriorarxcruz|ruck: right, so lets say that happens09:59
pandaquiquell: and that is working at least 94% of the time09:59
jaosoriordbus is restarted, certmonger fails10:00
jaosorior* dbus is updated10:00
quiquellpanda: also fs050 is working now10:00
jaosoriorand certmonger fails10:00
jaosoriorwe then run the manifest10:00
quiquellpanda: Going to add a patch to put both in the gate10:00
jaosoriordbus is restarted... wouldn't we then need to restart certmonger?10:00
arxcruz|ruckjaosorior: no because certmonger isn't even installed yet10:00
jaosoriorOK, I'm getting confused. Is this issue happening on updates? wouldn't that mean that we had already an undercloud running with certmonger?10:01
jaosorior(this could also happen in an overcloud if TLS everyhwere is enabled)10:01
arxcruz|ruckjaosorior: hmmmm it might happen that certmonger is already installed10:01
arxcruz|ruckjaosorior: but if I add a Service['certmonger'] and the service doesn't exist, will it fail ?10:02
jaosoriorarxcruz|ruck: no, because it's defined below10:02
jaosoriorarxcruz|ruck: puppet first evaluates the whole manifest, then executes it. So it knows what's in the resource catalog10:02
arxcruz|ruckjaosorior: i'm not puppet master (pun intended) so i'll believe in you :P10:03
jaosoriorarxcruz|ruck: Tengu is :D we could double check.10:04
jaosoriorTengu: could you check this out https://github.com/saltedsignal/puppet-certmonger/pull/20 ?10:04
quiquellpanda: https://review.openstack.org/#/c/570882/  Added to the gates, if we forget about today (ara problem) the job is working10:10
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)10:22
Tengujaosorior: err yeah, sorry, was away10:28
Tenguarxcruz|ruck jaosorior added a small comment, take it or leave it, not a big issue.10:33
Tenguarxcruz|ruck: short note: puppet will build the catalog and evaluate the dependency tree as a whole.10:34
Tenguarxcruz|ruck: no real order is needed for that.10:34
Tenguthat said... wondering whether there isn't a puppet-dbus thingy.10:35
jaosoriorTengu: it might be, but we don't use it in OpenStack. It would be quite painful to introduce a new dependency on a puppet library just for a restart.10:35
Tenguok :)10:36
Tengualthough it might be good to check if there are subtilities.10:36
Tengu2s10:36
jaosoriorTengu: thansk for checking it out10:36
Tenguhttps://github.com/bodgit/puppet-dbus/blob/master/manifests/service.pp  arxcruz|ruck10:36
Tengumight be good to set the same parameters10:36
Tenguadded as a comment to the whole review.10:38
TenguI let you decide :).10:38
*** yolanda has quit IRC10:38
*** yolanda has joined #oooq10:42
arxcruz|ruckTengu: cool, thanks :)11:07
Tengunp :).11:07
arxcruz|ruckTengu: do i need to keep the notify ?11:08
Tenguarxcruz|ruck: the notify will kick in of the daemon is stopped and puppet start it11:09
Tenguas I understand, this is the goal of this patch right?11:09
Tengu*kick in IF the daemon - sorry11:09
arxcruz|ruckTengu: yup \11:09
Tengualso *startS - hungry, can't think, sorry11:10
*** lucasagomes is now known as lucas-hungry11:11
*** lucas-hungry is now known as lucasagomes11:11
quiquellpanda: do you have a minute ?11:23
pandaquiquell: yes11:24
quiquellDoing a noop change on tripleo-upgrade stable/queens11:24
quiquellto trigger something with STABLE_RELEASE=queens11:24
quiquellbut I don't see the fs050 check job running on it11:25
quiquellpanda: https://review.openstack.org/#/c/570823/11:25
pandaquiquell: probably the job definition is missing for queens11:26
quiquellDamn that's right11:27
quiquellpanda: Thanks man11:28
*** udesale_ has joined #oooq11:33
*** myoung|zzz is now known as myoung11:33
myoungquiquell: sent mail...11:33
quiquellmyoung: Just readed it11:33
myoungarxcruz|ruck: did you see mail from weekend?  should I halt the promoter I have running?11:33
quiquellmyoung: Good morning btw11:34
quiquellmyoung: I have stop the promoter at the promoter-server11:34
arxcruz|ruckmyoung: hey, quiquell already stop the promoter on the promoter-server11:34
*** udesale has quit IRC11:34
arxcruz|ruckmyoung: i did saw it, and fwd to quiquell11:34
arxcruz|ruckhe wasn't in the list :(11:34
myoungarxcruz|ruck: ya, it was a "world on fire late friday night" event...i had just put ruck/rover on11:35
myoungquiquell: sorry!11:35
myoungarxcruz|ruck, quiquell will be back in 1 hr...11:35
* myoung transmogrifies into a kindertaxi11:35
quiquellmyoung: No problem, too much fire11:35
arxcruz|ruckfire exclamation point, fire exclamation point11:35
myoung%gatestatus11:36
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)11:36
arxcruz|ruckmyoung: everything is messed11:36
quiquellmyoung: I have some proposals for the promoter, do you have a bj minute later ?11:36
myoungquiquell: sure...i have a prototype as well.  panda has some opinions here too.11:36
arxcruz|ruckmyoung: cmd2 release a new version python 3.4 only, mostly of infra are down, a patch is on the way to fix11:37
myoungquiquell: i was playing with a reimplementation over weekend11:37
myoungi need to dash to take kiddo to school.  back inna bit11:37
quiquellmyoung: Let's talk later on11:37
*** udesale_ has quit IRC11:37
*** ratailor has quit IRC11:42
*** anande has quit IRC11:44
quiquellHi, anyone knows why i have the repos disabled in the undercloud ?11:53
quiquell+(/opt/stack/tripleo-ci/toci_gate_test-oooq.sh:60): sudo rm -f '/etc/yum.repos.d/epel*'11:54
quiquell+(/opt/stack/tripleo-ci/toci_gate_test-oooq.sh:62): sudo yum clean all11:54
quiquellLoaded plugins: fastestmirror11:54
quiquellLoading mirror speeds from cached hostfile11:54
quiquellThere are no enabled repos.11:54
quiquell Run "yum repolist all" to see the repos you have.11:54
quiquell To enable Red Hat Subscription Management repositories:11:54
quiquell     subscription-manager repos --enable <repo>11:54
quiquell To enable custom repositories:11:54
quiquell     yum-config-manager --enable <repo>11:54
*** atoth has joined #oooq11:55
*** amoralej is now known as amoralej|lunch11:55
*** tcw has joined #oooq11:58
quiquellok12:00
quiquellsudo yum-config-manager --enable "*"12:00
*** sshnaidm has quit IRC12:06
*** sshnaidm has joined #oooq12:10
*** udesale has joined #oooq12:11
*** moguimar has joined #oooq12:13
*** sshnaidm has quit IRC12:15
*** udesale has quit IRC12:17
*** sshnaidm has joined #oooq12:18
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)12:22
*** sshnaidm has quit IRC12:24
*** rlandy has joined #oooq12:28
*** rlandy is now known as rlandy|rover12:29
rlandy|roverarxcruz|ruck: hello - how's it going?12:29
arxcruz|ruckrlandy|rover: hi, good12:30
rlandy|roverarxcruz|ruck: nice :)12:30
arxcruz|ruckrlandy|rover: we ha some issues today that block the entire openstack, not only tripleo12:30
rlandy|roverwas expecting some fun after the long weekend12:30
arxcruz|ruckbut it's fixed now, so we should start to see some problems soon :D12:30
Tengu:)12:30
rlandy|roverarxcruz|ruck:  any particular bug I need to look at now?12:31
arxcruz|ruckrlandy|rover: nope12:31
rfolcoarxcruz|ruck, is the cmd2 issue fixed ?12:31
arxcruz|ruckrlandy|rover: i digg into the newton problem, found solution and working on that12:31
rlandy|roverarxcruz|ruck: awesome12:31
arxcruz|ruckrfolco: yup, fixed in infra few minutes ago12:31
rfolcothanks12:31
arxcruz|ruckrfolco: i post comments in the bug12:31
rlandy|roverarxcruz|ruck: ocata is still an issue12:31
arxcruz|ruckrlandy|rover: ^12:31
arxcruz|ruckrlandy|rover: yeah, i was digging into it today, but the cmd2 cat me12:32
arxcruz|ruck:/12:32
arxcruz|ruckrlandy|rover: do you know the problem ?12:32
rlandy|roverarxcruz|ruck: and looks like phase212:32
rlandy|roverarxcruz|ruck: I'll look into phase 212:32
*** moguimar has quit IRC12:32
rlandy|roverwrt ocata12:32
arxcruz|ruckyeah, i saw phase 2 has 4 days12:32
rlandy|roverbeen trying to reproduce for days12:32
rlandy|roverit's a diff error every time12:32
*** moguimar has joined #oooq12:33
rlandy|roverI think the hardware is not reliable12:33
rlandy|roverbut i can't prove it12:33
rlandy|roverarxcruz|ruck: going to look into phase2 and ocata12:33
rlandy|roverping me if there is something else burning12:33
arxcruz|ruckrlandy|rover: i saw ssh failures12:33
rlandy|roversometimes introspection fails12:34
rlandy|roversometimes deploy12:34
arxcruz|ruckrlandy|rover: i also saw stack creation failures, but i check the tenant and the stacks are clean12:34
rlandy|roversometimes undercloud insta;;12:34
rlandy|roverstack creation failures?12:34
rlandy|roverphase 2?12:34
rlandy|roverok - I'm on it12:34
*** udesale has joined #oooq12:34
rlandy|roverykarel: can you review this one for me ... https://review.openstack.org/#/c/570694/12:35
rlandy|roverit's the redone patch12:35
rlandy|roverpanda: ^^12:35
rlandy|roverI tried to keep the order of args for playbooks12:35
arxcruz|ruckrlandy|rover: yeah, on phase2 but i log into dashboard and stacks are created properly12:35
rlandy|roverok - I'll look into it12:36
ykarelrlandy|rover, have u tried this with promotion job12:36
ykareli mean where PERIODIC=112:37
rlandy|roverykarel: will do - you had some other comments that you would check what else was wrong. not sure what you refered to there12:38
ykarelrlandy|rover, ok will look12:38
rlandy|rovermyoung: have you seen this on phase 2 before ""2018-05-29 08:26:09 | 2018-05-29 08:26:09,462 INFO: 12: Timeout on http://mirrorlist.centos.org/?release=7&arch=x86_64&repo=os&infra=stock: (28, 'Operation too slow. Less than 1000 bytes/sec transferred the last 30 seconds')12:56
rlandy|rover2018-05-29 08:26:09 | 2018-05-29 08:26:09,475 DEBUG: An exception occurred12:56
rlandy|rovermaybe we just need to rerun12:56
rlandy|rovermyoung: after the meeting, can we talk about phase 2 jobs in master12:57
rlandy|roverlooks like they have been failing for a long time12:57
rlandy|roveryet we promote12:57
myoungci squad: scrum!12:59
myoungrlandy|rover: ack, can chat after meeting12:59
*** jaganathan has quit IRC13:04
*** trown|outtypewww is now known as trown13:04
*** ykarel is now known as ykarel|away13:06
*** sshnaidm has joined #oooq13:10
*** ykarel|away has quit IRC13:10
*** amoralej|lunch is now known as amoralej13:18
*** links has quit IRC13:20
*** sshnaidm has quit IRC13:25
*** sshnaidm has joined #oooq13:27
*** zoli is now known as zoli|lunch13:31
*** dtantsur is now known as dtantsur|brb13:31
*** yolanda_ has joined #oooq13:40
*** yolanda has quit IRC13:42
myoungarxcruz|ruck: could you send me your key please, will add to the promoter I have running on internal server.  I'm moving logs now to the actual promoter from the weekend13:44
arxcruz|ruckmyoung: https://github.com/arxcruz.keys13:44
myoungarxcruz|ruck: you should have access now13:45
*** dtrainor has joined #oooq13:47
*** ykarel|away has joined #oooq13:51
*** ykarel|away is now known as ykarel13:55
*** moguimar has quit IRC13:58
*** sanjay__u has quit IRC14:02
*** moguimar has joined #oooq14:08
quiquellI have force a n -> n + 1 for fs050 and has work well14:08
quiquelljust reversing the releases stuff at toci_quickstart.sh14:09
quiquelland using rlandy|rover patch14:09
quiquellhttps://review.openstack.org/#/c/570823/14:09
*** skramaja has quit IRC14:11
quiquellWith stable_release as queens14:11
quiquellbuild-test-package works at undercloud install but not install-build-repo14:11
*** quiquell is now known as quiquell|off14:20
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)14:22
*** bogdando has quit IRC14:29
pandaI think we need to cap the initial presentation at 15 minutes14:31
trowninitial presentation?14:32
sshnaidmpanda, arxcruz|ruck rlandy|rover trown, should we discuss interview?14:32
arxcruz|rucksshnaidm: up to you14:33
sshnaidmarxcruz|ruck, it shouldn't be up to me :)14:33
arxcruz|ruckup to trown then14:33
arxcruz|ruck:D14:33
pandatrown: who we are, who you are what we do14:33
trownwe can discuss now I suppose... I thought we would discuss all 3 after tomorrow14:33
trownbut easier for me to say since I was taking notes14:34
trowni rejoined the bluejeans14:34
trownpanda: oh did we spend more than 15 minutes on that? I thought we had plenty of technical questions14:34
sshnaidmtrown, I won't be tomorrow14:34
pandatrown: 20 minutes14:35
trownpanda: arxcruz|ruck rlandy|rover sshnaidm: ok, lets rejoin and discuss now, and I will add more notes for our team discussion14:35
sshnaidmtrown, rejoin to the same bj?14:35
pandatrown: ok, cancelling the community meeting14:35
pandano on there again14:36
trownpanda: ya, seems to much with interviews anyways14:36
trownsshnaidm: ya14:36
chandankumarsshnaidm: myoung trown please have a look these patches https://review.openstack.org/#/q/topic:refstack-support+(status:open+OR+status:merged)14:37
arxcruz|ruckpanda: rlandy|rover https://bluejeans.com/4113567798/?src=calendarLink14:37
pandatrown: sshnaidm rlandy|rover there is also the community meeting to discuss the combination of injections14:39
pandatrown: sshnaidm rlandy|rover I'll need a summary of what you discussed before the next interview14:44
kopecmartinchandankumar, arxcruz|ruck myoung I'm sorry, I can't make it to today's meeting, I need to leave earlier14:49
kopecmartinI've updated my cards and write a summary for each of them,14:49
kopecmartin  tomorrow and the next day I'm on PTO but I'm checking emails if anything urgent :)14:50
chandankumarkopecmartin: enjoy your pto :-)14:50
kopecmartinchandankumar, thanks14:51
myoungarxcruz|ruck, rlandy|rover, the promoter logs until we move promoter off internal server are here: http://sol.usersys.redhat.com/promoter_logs/14:51
arxcruz|ruckmyoung: fancy, i like it14:51
myoungarxcruz|ruck: fancy?14:51
myoungarxcruz|ruck: ahh the apache config :)14:52
arxcruz|ruckmyoung: yup14:52
myoungarxcruz|ruck: for ruck stuff I find this useful too, http://sol.usersys.redhat.com/dlrnapi-reports14:52
myoung^^ to augment http://dashboards.rdoproject.org/{release}14:53
*** ykarel is now known as ykarel|away14:54
*** kopecmartin has quit IRC14:56
*** dtantsur|brb is now known as dtantsur15:00
*** zoli|lunch is now known as zoli|wfh15:04
rlandy|rovermyoung: 2018-05-29 11:08:26,223 11521 INFO     promoter Skipping promotion of current-tripleo-rdo to current-tripleo-rdo-internal, missing successful jobs: ['periodic-master-rdo_trunk-featureset020-1ctlr_1comp_64gb']15:10
rlandy|rovershould be succeddful?15:11
rlandy|roversuccessful?15:11
myoungrlandy|rover: ack, rdo2 criteria has generally been fs020 (which runs tempest in same manner as upstream) + BM pass15:11
myoungrlandy|rover: or do you mean "there is a success logged and promoter should have found it?"15:12
rlandy|roverwill ping in a bit - interview15:13
arxcruz|ruckmyoung: this failures that hubbot is reporting at https://review.openstack.org/564291 isn't getting rebased, is that correct?15:19
*** ccamacho has quit IRC15:20
myoungarxcruz|ruck: in TC meeting, responses delayed.  the hubbot job is watching the jobs listed in https://review.rdoproject.org/etherpad/p/ruckrover-sprint14 at the top15:21
myounghubbot check jobs:15:21
myoung    TQE, https://review.openstack.org/#/c/560445, I214272a6f25feb75496e44eb0a16269c6ee4cfe215:21
myoung    THT, https://review.openstack.org/#/c/567224, I0cbf9ffb8552411e4dd891c38702ff8d1f6db5b1, stable/queens15:21
myoung    THT, https://review.openstack.org/#/c/564285, If12c8fe9bd0bea98a4842f279399285344f22246, stable/pike15:21
myoung    THT, https://review.openstack.org/#/c/564291, I4c5bdf00ce8cf7eabf669b248b99cb8443e82fab, stable/ocata15:21
hubbotmyoung: Error: "check" is not a valid command.15:21
myoung%config plugins.GateStatus.changeIDs15:22
hubbotmyoung: I0cbf9ffb8552411e4dd891c38702ff8d1f6db5b1 I214272a6f25feb75496e44eb0a16269c6ee4cfe2 I4c5bdf00ce8cf7eabf669b248b99cb8443e82fab If12c8fe9bd0bea98a4842f279399285344f2224615:22
myoungarxcruz|ruck: ^^15:22
myoungarxcruz|ruck, or at least it should be.  IF we need/want to add patches to what's watched/monitored by the bot and automatically rechecked, that can be done via IRC or (better) by modifying the hubbot .conf files on the hubbot instance15:34
arxcruz|ruckmyoung: yeah, it seems that the change doesn't rebase since may 2415:34
*** moguimar has quit IRC15:38
*** marios has quit IRC15:48
*** jtomasek has quit IRC15:52
*** saneax has quit IRC15:53
*** udesale_ has joined #oooq15:54
*** jtomasek has joined #oooq15:54
rfolcoarxcruz|ruck, or chandankumar: python-stestr... where do I install this from ? you know ?15:56
chandankumarrfolco: on undercloud15:56
arxcruz|ruckchandankumar: i think he meant the repo right ?15:56
rfolcoyep, which repo15:56
*** udesale has quit IRC15:57
chandankumarrfolco: it is available in rhos-release 13 repo15:57
rfolcorhos-12 does not have it ?15:57
chandankumarrfolco: nope15:57
chandankumarrfolco: look for15:58
chandankumarrhos-12.0-rhel-7-override15:58
chandankumarrepo15:58
chandankumarenable it then you can install python-stestr15:58
chandankumarmyoung: I might not be able to standup today sorry for the late notice16:01
rfolcochandankumar, I think I have it already16:02
rfolcohttps://softwarefactory.usersys.redhat.com/logs/95/95/23/check/osp-rhel-7-undercloud-oooq/ffc333e/logs/undercloud/etc/yum.repos.d/rhos-release-12.repo.txt.gz16:02
rfolco[rhelosp-12.0-image-build-override]16:02
rfolcooh different one16:02
myoungchandankumar, also martin isn't here, shall we just cancel?16:04
myoungarxcruz|ruck: ^^16:04
chandankumarmyoung: yup16:04
myoungchandankumar: ack16:04
arxcruz|ruckmyoung: i'm in an interview16:04
arxcruz|ruckmyoung: plus, i'm rucking :P16:04
myoungchandankumar, arxcruz|ruck anything for tempest squad that needs doing/chasing of note?16:04
myoung"virtualmeetinggo"16:05
chandankumarmyoung: most of the work is in progress16:05
myoungchandankumar: ack.  just checking :)16:05
myoungif anything out of band16:05
chandankumarmyoung: I need reviews on this https://review.openstack.org/#/q/topic:refstack-support+(status:open+OR+status:merged)16:05
*** udesale__ has joined #oooq16:07
*** udesale_ has quit IRC16:10
*** udesale__ has quit IRC16:14
chandankumarrfolco: mburns will tag it under rhelosp-12.0-unittest repo16:14
chandankumarrfolco: then you will able to use it16:15
chandankumarjust make sure you enable repo16:15
rfolco[rhelosp-12.0-unittest]16:15
rfolconame=RHOS-12.0 Unit Test Dependency16:15
rfolcobaseurl=http://download.eng.bos.redhat.com/rel-eng/repos/rhos-12.0-rhel-7-testdeps/$basearch/16:15
rfolcochandankumar, waiting for your signal16:15
*** panda is now known as panda|off16:21
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)16:22
rlandy|rovermyoung: bug  triage?16:33
myoungyes incoming, thought yall were interviewing still16:34
rlandy|rovermyoung: I think panda|off is gone16:34
rlandy|roverlate for arxcruz|ruck as well16:35
*** holser__ has quit IRC16:52
rlandy|rovertrown: can we get a review on https://review.rdoproject.org/r/#/c/13945/17:02
rlandy|roverI am working on fixing envE for fs00117:03
rlandy|roversshnaidm: ^^ can you review? we're trying to get a pike promotion through17:08
rlandy|roverenvE hasn't passed since 05/0717:08
*** trown is now known as trown|lunch17:11
*** dsneddon has joined #oooq17:23
myoungrlandy|rover: https://review.rdoproject.org/r/#/c/1390417:29
chandankumarrfolco: good to go ahead17:36
chandankumarrfolco: sorry I went out for food17:36
chandankumarrfolco: https://brewweb.engineering.redhat.com/brew/buildinfo?buildID=61730917:36
rfolcochandankumar, does it take a while to show up in the repo ? http://download-node-02.eng.bos.redhat.com/rel-eng/repos/rhos-12.0-rhel-7-testdeps/x86_64/17:38
chandankumarrfolco: yup it will take some time to show up there17:38
rfolcochandankumar, cool. Thanks for building it ;)17:39
chandankumarrfolco: let me know if any more package are missed17:39
rfolcoI am very close to a successful run, hope this was the last one17:40
*** jaosorior has quit IRC17:49
*** zoli|wfh is now known as zoli|gone18:16
*** zoli|gone is now known as zoli18:16
*** atoth has quit IRC18:20
*** atoth has joined #oooq18:21
*** rlandy|rover is now known as rlandy|rover|brb18:21
rfolcochandankumar, python2-future is a dep of python-stestr. Had to install it manually.18:22
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-nonha-multinode-oooq, tripleo-ci-centos-7-scenario001-multinode-oooq, tripleo-ci-centos-7-scenario004-multinode-oooq, tripleo-ci-centos-7-scenario002-multinode-oooq, tripleo-ci-centos-7-scenario003-multinode-oooq, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master:  (5 more messages)18:22
*** trown|lunch is now known as trown18:22
*** dtantsur is now known as dtantsur|afk18:28
*** tesseract has quit IRC18:30
*** amoralej is now known as amoralej|off18:38
*** jaganathan has joined #oooq18:41
*** tcw1 has joined #oooq18:45
*** jjoyce has joined #oooq18:50
*** tcw has quit IRC18:52
*** weshay_pto has quit IRC18:53
*** fuzzball81 has quit IRC18:53
*** weshay has joined #oooq18:54
*** rlandy|rover|brb is now known as rlandy|rover19:03
rlandy|rovercurrent-tripleo-rdo-internal19:03
rlandy|rover379 MB19:03
rlandy|rover14 minutes ago19:03
rlandy|roverpike promotion ... hello19:04
*** holser__ has joined #oooq20:11
*** ykarel|away has quit IRC20:14
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.20:22
rlandy|roverwoohoo20:29
myoungrlandy|rover: boomsauce: 2018-05-29 16:26:05, https://trunk.rdoproject.org/centos7-pike/8e/96/8e9629b551b41fa689517dfd9d3be1359b203acd_940a986d, current-tripleo-rdo-internal20:37
myoungrlandy|rover: anything else a priority or should i kick off the normal loop of promotions?20:38
*** trown is now known as trown|outtypewww20:45
rlandy|rovermyou: we're all good except ocata21:01
rlandy|roverthank you21:01
rlandy|rovermyoung,: normal promotions21:01
rlandy|rovermyoung: do you have any access to the ci.centos nodes?21:02
rlandy|roverI can't figure this out21:02
rlandy|roversshnaidm: do you have any access to hold ci.centos nodes by any chance?21:11
rlandy|roverI think adarazs used to21:11
rlandy|roverocata phase 1 is doing us in21:11
sshnaidmrlandy|rover, nope :(21:11
rlandy|roversshnaidm: you reported the ocata phase1 failure  and I still can't get a reliable reproducer21:12
rlandy|roverI need to get on the nodes - asked david simard21:12
rlandy|roverwe really need envs we can reproduce :(21:12
sshnaidmrlandy|rover, doesn't it reproduce with usual libvirt?21:13
sshnaidmrlandy|rover, it should be libvirt job, right?21:13
rlandy|roversshnaidm: it doesn't even reproduce on itself21:13
rlandy|roverone job undercloud install fails21:13
rlandy|roverthen images fail21:13
rlandy|roverthen deploy fails21:13
sshnaidmmmm.. yeah, seems like infrastructure problem21:13
rlandy|roverthe best I can do is try debug why we get 'no available nodes' on the overclodu deployment21:14
rlandy|roverright21:14
rlandy|roverbut I can't prove it21:14
rlandy|roverand I have no access to the infra21:14
sshnaidmrlandy|rover, cpu/mem/disk is ok?21:14
*** holser__ has quit IRC21:14
sshnaidmno oom killers or kind of?21:14
rlandy|rovernot from what I can tell from looking at logs - but will recheck21:15
sshnaidmrlandy|rover, maybe also network problem, worth to check if we got failed yum installs21:15
sshnaidmrlandy|rover, is it the same slave or slave group?21:16
rlandy|rover sshnaidm: david has node access21:16
rlandy|rovergetting a node to debug21:16
sshnaidmok21:17
*** sanjay__u has joined #oooq21:19
rlandy|rover rdo-ci-cloudslave0521:19
rlandy|roverrdo-ci-cloudslave05 ( on previous job)21:19
*** jtomasek has quit IRC21:53
*** tosky has quit IRC22:02
*** tosky has joined #oooq22:06
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.22:22
myoungrlandy|rover: no we don't have access22:36
myoungwe can ping amoralej|off typically or yatin22:37
myoungrlandy|rover: do we already have a LP for the phase 1?  at this point we've dumped some time into debugging and coming up with we don't have access...  LP --> promotion-blocker --> CIX card, mobilize.  I think ruck/rover should have access to rdo1 nodes for debugging.  Until we do we're basically stuck on rdo122:38
myoung^^ (IMHO)22:38
rlandy|rovermyoung: agreed - I'm logging a LP with debug details22:59
myoungrlandy|rover: ack.  back inna bit...making banana bread with my tween-now-teen-omg23:08
rlandy|roverhappy birthday?23:09
myoungevery day is a good day for banana bread23:17
myounglol23:17
*** tosky has quit IRC23:25
rlandy|rovermyoung: https://bugs.launchpad.net/tripleo/+bug/177407923:29
openstackLaunchpad bug 1774079 in tripleo "[ocata promotion] phase1 (ci.centos) job tripleo-quickstart-promote-ocata-rdo_trunk-minimal fails introspection/deploy "No valid host found"" [Critical,Triaged]23:29
*** openstackstatus has joined #oooq23:42

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!