Tuesday, 2018-05-15

hubbotadarazs: An error has occurred and has been logged. Please contact this bot's administrator for more information.00:11
weshaypanda, you have creds to the te-broker/00:23
weshay?00:23
hubbotadarazs: An error has occurred and has been logged. Please contact this bot's administrator for more information.02:12
*** dougbtv_ has quit IRC02:56
*** udesale has joined #oooq03:10
*** udesale has quit IRC03:19
*** udesale has joined #oooq03:31
*** links has joined #oooq03:37
hubbotadarazs: An error has occurred and has been logged. Please contact this bot's administrator for more information.04:12
*** udesale has quit IRC04:28
*** udesale has joined #oooq04:39
*** udesale has quit IRC04:49
*** udesale has joined #oooq04:50
*** marios has joined #oooq05:08
*** saneax-_-|AFK is now known as saneax05:10
*** ykarel has joined #oooq05:10
ykarelOVB jobs failing with: +(/opt/stack/new/tripleo-ci/toci_gate_test.sh:252): sleep 120005:12
ykarel2018-05-15 04:07:03,339 - testenv-client - INFO - Received job : Couldn't retrieve env05:12
*** pgadiya has joined #oooq05:34
*** pgadiya has quit IRC05:34
*** udesale_ has joined #oooq05:36
*** udesale has quit IRC05:39
*** udesale_ has quit IRC05:40
*** udesale has joined #oooq05:40
*** jbadiapa has quit IRC06:00
hubbotadarazs: An error has occurred and has been logged. Please contact this bot's administrator for more information.06:12
*** udesale has quit IRC06:38
*** bogdando has joined #oooq06:40
*** udesale has joined #oooq06:40
*** brault has quit IRC07:01
*** brault has joined #oooq07:06
*** sshnaidm has joined #oooq07:09
*** tesseract has joined #oooq07:13
*** florianf has joined #oooq07:18
*** holser__ has joined #oooq07:20
*** sshnaidm is now known as sshnaidm|rover07:21
*** skramaja has joined #oooq07:23
*** kopecmartin has joined #oooq07:24
*** tosky has joined #oooq07:49
*** lucas-afk is now known as lucasagomes08:04
sshnaidm|roverpanda, trown|outtypewww please review reproducer patch change: https://review.openstack.org/#/c/565839/08:09
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-undercloud-upgrades, tripleo-ci-centos-7-undercloud-oooq @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata-branch, gate-tripleo-ci- (2 more messages)08:12
*** holser__ has quit IRC08:13
*** holser__ has joined #oooq08:13
*** ykarel is now known as ykarel|lunch08:15
*** florianf has quit IRC08:40
*** florianf has joined #oooq08:44
*** gkadam has joined #oooq08:51
*** ykarel|lunch is now known as ykarel08:51
*** trown|outtypewww has quit IRC08:52
*** trown has joined #oooq08:53
*** jbadiapa has joined #oooq09:15
*** udesale_ has joined #oooq09:33
*** udesale has quit IRC09:34
*** jaosorior has quit IRC10:00
*** jaosorior has joined #oooq10:01
*** udesale_ has quit IRC10:01
*** udesale has joined #oooq10:03
*** zoli is now known as zoli|lunch10:05
*** tosky has quit IRC10:11
*** tosky has joined #oooq10:12
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario003-multinode-oooq-container, tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-queens, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-queens, tripleo-ci-centos-7-scenario004 (2 more messages)10:12
*** zoli|lunch is now known as zoli10:24
*** tesseract is now known as info10:41
*** info is now known as tesseract10:41
*** zoli is now known as zoli|afk-tpb11:13
*** dtantsur|afk is now known as dtantsur11:14
*** atoth has joined #oooq11:20
*** holser__ has quit IRC11:32
*** holser___ has joined #oooq11:32
*** agopi has quit IRC11:46
*** zoli|afk-tpb is now known as zoli11:59
weshaysshnaidm|rover, ykarel you guys cool w/ me promoting queens?12:01
sshnaidm|roverweshay, no12:01
*** skramaja has quit IRC12:01
*** skramaja_ has joined #oooq12:01
weshaywhy?12:01
weshaysshnaidm|rover, it's 100% down upstream12:01
weshaythanks for fixing the dns issue btw12:02
sshnaidm|roverweshay, a lot of issues actually, the latest was rdo cloud networking problems, and seems like it's resolved now12:02
sshnaidm|roverweshay, promotion jobs should start right now12:02
weshaysshnaidm|rover, rdo cloud 3rd party jobs are failing for all kinds of reasons12:03
weshaysshnaidm|rover, what do you mean?12:03
sshnaidm|roverweshay, rdo cloud network issues was last problem12:03
weshaysshnaidm|rover, k.. and it's fixed?12:03
weshayso it's worth just to wait12:04
sshnaidm|roverweshay, promotion jobs will start in a minute, they should be ok afaik12:05
ykarelyup we can wait for next run12:05
weshaysshnaidm|rover, ok.. ur the boss, will wait12:05
ykarelweshay, btw which hash u were promoting12:05
sshnaidm|roverweshay, fixed all known issues for me at least..12:05
weshaysshnaidm|rover, where or how did you fix my dns f up?12:06
sshnaidm|roverweshay, I wrote in mail - removed it from network config12:07
sshnaidm|roverweshay, and then set up right dns in dns server12:07
weshaysshnaidm|rover, ok.  why did you have to change the dns server?12:08
sshnaidm|roverweshay, if you want to advertise dns to all it's ok, but you need to hack dhclient configuration of dns server not to receive DNS from dhcp server12:08
sshnaidm|roverweshay, bluejeans please12:09
weshaysshnaidm|rover, ya.. we can change the sysconfig to be static12:09
weshayand chattr resolv.conf12:09
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario003-multinode-oooq-container, tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-queens, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-queens, tripleo-ci-centos-7-scenario004 (2 more messages)12:12
*** udesale_ has joined #oooq12:18
weshayhttps://review.rdoproject.org/jenkins/job/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset010-queens/204/12:18
*** udesale has quit IRC12:19
weshayykarel, was master rdo1 impacted by rdo-cloud?12:20
weshaynetworking12:20
ykarelweshay, https://trello.com/c/1oq5yHGU12:28
ykarel3 tempest tests failed in last two runs,12:28
ykareli was not able to reproduce it locally, so rechecked again. I am not sure if failures were impacted by rdo-cloud12:29
weshayykarel, k.. does weirdo have the notion of a skip list?12:31
ykarelweshay, kind of12:32
ykarelweshay, https://github.com/openstack/puppet-openstack-integration/blob/master/run_tests.sh#L32212:32
ykarelbut weirdo don't have, we need to do that in poi or packstack12:32
*** dougbtv_ has joined #oooq12:32
*** rlandy has joined #oooq12:33
weshaycrud I think my networking is going down at my house lolz12:33
weshaycan't get to https://ci.centos.org/job/weirdo-master-promote-packstack-scenario001/1244/console12:34
ykarelweshay, me to can't get to ^^12:35
weshayykarel, heh12:35
weshayk12:35
weshayykarel, we'll be requiring a master promotion.. so if needed we need your card off the tripleo-ci board and onto the escalation board12:37
weshaycix board12:37
ykarelweshay, Ok12:38
weshaysshnaidm|rover, I thought we had fixed the introspection issue w/ an image rebuild https://review.rdoproject.org/jenkins/job/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/13864/console12:38
*** dougbtv_ has quit IRC12:40
weshaysshnaidm|rover, fyi.. rdo2 master was and probably still is blocked on https://review.openstack.org/#/c/566129/12:41
weshaycan somebody please double check the te-broker log http://38.145.34.41/testenv-worker.log as it's not fully rendering for me12:45
weshayit does look like we have log rotate on the box12:45
sshnaidm|roverweshay, not sure about last image.. I'm looking at random job and it's stuck on image prepare: https://review.rdoproject.org/jenkins/job/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/13868/console12:46
weshaysshnaidm|rover, ya.. I've seen that multiple times now12:46
sshnaidm|roverweshay, I configured logrotate today, but you have keys there and can log in12:46
weshaysshnaidm|rover, yup.. thank you sir.. I am in12:46
ykarelweshay, running phase1 packstack job again, last run failed due to Caused: java.io.IOException: Backing channel 'JNLP4-connect connection from n4-112.cloud.ci.centos.org/172.19.4.112:59406' is disconnected.12:50
weshayykarel, k.. thanks12:53
*** Goneri has joined #oooq13:06
myoung|rucko/  morning!13:09
myoung|ruckweshay, sshnaidm|rover yall still on bj?13:13
rlandytrown: hi - I removed the pylint stuff from https://review.openstack.org/#/c/568287/ - it's just pep8 now. If we want pylint later, the commit history is there to add it back. I removed the WIP as this shoudl pass now13:28
*** dougbtv_ has joined #oooq13:28
weshayI'm in a 1-113:30
weshayrlandy, https://review.rdoproject.org/r/#/c/13777/13:30
rlandyweshay: is there something wrong with my comment there?13:31
trownrlandy: cool, fixing up the comments on the base patch from quiquell|off now, but I think it should be good to go after that13:31
rlandytrown: yep - working on testing those patches together today13:32
rlandythe tests run in the tox jobs13:32
myoung|rucksshnaidm|rover: do you have a few to sync from last night?  when I left for the evening we were about to get a queens periodic. don't want to duplicate efforts...13:34
sshnaidm|rovermyoung|ruck, we didn't get promotions for queens because of dns issues, rdo cloud network issues, running promotion jobs atm13:35
myoung|ruckssh ya saw they just kicked a little while ago13:36
myoung|rucksshnaidm|rover: *13:36
myoung|rucksshnaidm|rover: just saw your mail from last night13:36
trownpanda: wrt https://review.openstack.org/567320 I was going to wire that in to the actual release dictionary in a later patch... would you rather me do it in the same patch?13:36
trownpanda: no real reason not to... when I was working on that, base patch was not close to being done though13:37
sshnaidm|rovermyoung|ruck, and we have problems with introspection again13:37
myoung|ruckarxcruz: were the dns changes you made yesterday to tempestmail instance doc'd anywhere?13:37
arxcruzmyoung|ruck: i'm still working on the ansible playbook for that13:38
arxcruzmyoung|ruck: give me some time :D working on nova stuff on tempestconf right now13:39
myoung|ruckarxcruz: cool, is that tracked by a card?  just had a note to check back from yesterday...no rush13:39
arxcruzmyoung|ruck: i can create a card, it's something out of sprint though13:39
sshnaidm|roverrlandy, did you see my comment in https://review.openstack.org/#/c/567060/ ?13:40
myoung|ruckarxcruz: stay on sprint...just a ping.13:40
rlandysshnaidm|rover: about your parallel effort? yes13:41
myoung|rucksshnaidm|rover: https://bugs.launchpad.net/tripleo/+bug/1771318 is concerning...that job totally needs to fail in that case13:41
openstackLaunchpad bug 1771318 in tripleo "No nova-compute image in docker after promotion" [High,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)13:41
rlandyreally that is waiting on the panda's patch13:42
rlandywhen the release file out is confirmed, will return to that13:42
rlandybut I will look at your work13:42
pandatrown: just really wanted to know where this was going to be used, this or another patch is fine.13:43
pandatrown: but I think you can ask requests directly to handle retries13:43
trownpanda: hmm I looked and didnt see it in requests doc... suppose I could google it :P13:44
rlandysshnaidm|rover: afaict, 567060 includes the basic elements of dumping env vars and the playbook commands that 565740 adds13:44
rlandyif you think functionality is missing, pls comment on 567060 and we will add it there13:44
pandatrown: found this today http://www.coglib.com/~icordasc/blog/2014/12/retries-in-requests.html https://www.peterbe.com/plog/best-practice-with-retries-with-requests13:46
rlandypanda: re: https://review.openstack.org/#/c/566565/13:47
rlandycan you complete that patch now?13:47
trownpanda: there is no built in option for retries... looking at those blogs... http://docs.python-requests.org/en/master/api/13:48
trownpanda: hmm that involves totally rewriting what I did to use a persistent session13:50
trownpanda: that seems all a bit more complicated than we need to download a 5 line file13:53
ykarelweshay, (packstack scenario 001)failed again, so good to move it to CIX13:53
*** agopi has joined #oooq13:54
pandatrown: ... no control: difficulty X. just a bit more control: difficulty 10^X13:57
pandatrown: leave the for then.13:58
myoung|ruckall: gentle reminder, #tripleo meeting starts now, CI community meeting starts immediatly after @https://bluejeans.com/7050859455, if anyone's chatty will have an open line in that room during #tripleo14:00
pandarlandy: the patch is missing and argument I can change. Then we need to bargain on what we want to do if the script fails14:00
rlandypanda: bargain? what are you offering???14:01
*** skramaja_ is now known as skramaja14:02
pandamy eternal and unlimited gratitude. And a pat in the back for free14:02
rlandyoh - I thought there was a possibility of a large donation to my swiss bank account in the deal but I'll settle eternal and unlimited gratitude for your14:03
trownpanda: those blogs have a cool url i had not seen though ... that httpbin url14:03
trownpanda: trying out my for loop with some of those14:03
rlandypanda: let's settle that so I can finish the work on 56706014:03
rlandylet me know when you can chat14:03
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario003-multinode-oooq-container, tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-queens, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-queens, tripleo-ci-centos-7-scenario004 (2 more messages)14:12
*** gkadam has quit IRC14:12
*** gkadam has joined #oooq14:14
rlandypanda: if the script fails, shouldn't we just bail out of the whole run?14:16
rlandythe script should be well tested itself14:17
rlandyso if it fails, the failure should be genuine14:17
pandarlandy: my concerd about this is that we are making all the jobs. ALL. OF. THEM. pass through a script that we implemented and tested in 10 days. Who's next ruck and rover ?14:19
rlandyidk - probably I am14:27
pandaOK, if we inject the script and make it work for everything and something brakes, we will probably know for sure the week after this gets merged. At that point our decision affect ruck and rover time so they need to be informed. Or team should use some of their time to fix it.14:32
myoung|ruckall: community meeting starts shortly, folks popping in.14:32
pandahope for the best, prepare for the worst14:36
*** apetrich has quit IRC14:36
rlandypanda: what is your alternative suggestion?14:37
*** trown is now known as trown|brb14:37
pandarlandy: run the script only for the jobs in scope14:38
rlandypanda: hmmm - but we nee release output now for all jobs14:38
rlandyneed14:39
pandarlandy: it's already defined as empty if the script output doesn't exist14:40
*** tosky has quit IRC14:40
rlandypanda: the biggest thing failing right now is the undefined script14:40
pandarlandy: and if the key is not found,then we use the default14:40
rlandyand path to output14:40
rlandyso ok to kick the script only with a few jobs14:40
rlandybut we need to lock down name of script and output14:41
rlandyso we can finish integration work14:41
*** trown|brb is now known as trown14:41
pandarlandy: I'm ok to put it for all the jobs too, if we prepare a contingency plan IF something goes wrong.14:41
rlandyfyi ... I am out on monday14:41
rlandypanda: I thought our original plan was all the jobs but I agree to a switch to remove it if things go wrong14:42
*** tosky has joined #oooq14:44
*** apetrich has joined #oooq14:48
sshnaidm|rovermyoung|ruck, weshay master containers build failed: http://paste.openstack.org/show/721017/14:58
ykarel:(15:00
*** ykarel is now known as ykarel|afk15:02
*** apetrich has quit IRC15:06
pandarlandy: sorry I had 4 meeetings in a row. I'm available to talk if you want15:12
pandameeeeting15:12
* panda gets some more eeee15:13
rlandypanda: k - in a few minutes - just fixing a review15:14
*** apetrich has joined #oooq15:15
sshnaidm|rovermyoung|ruck, seems like dashboard doesn't work: https://dashboards.rdoproject.org/master15:15
myoung|rucksshnaidm|rover: guessing the most recent rdocloud hiccups might have caused a reboot...looks like the feeder script isn't running15:16
*** skramaja has quit IRC15:18
*** links has quit IRC15:21
*** udesale_ has quit IRC15:25
weshaytrown, panda rlandy https://review.rdoproject.org/r/#/c/13790/115:29
weshaycan we get some eyes on that.. worked for me on cli15:29
weshaythanks sshnaidm|rover15:30
sshnaidm|roverweshay, myoung|ruck,  bbl, gonna pray for queens15:30
rlandypanda: ready to meet when you are15:31
myoung|rucksshnaidm|rover: looks great, in meeting and haven't had a chance to try / verify it yet15:31
*** sshnaidm|rover is now known as sshnaidm|bbl15:31
bogdandoPTAL folks https://review.openstack.org/#/c/56832615:31
weshaybaruch beashvile ha malca15:32
bogdandonot sure why my dependency graph is broken15:32
bogdando;(15:32
weshaybogdando, was about to ping you15:32
weshaybogdando, do you have 5 min?15:32
bogdandoweshay: sure thing!15:32
*** saneax is now known as saneax-_-|AFK15:33
bogdandobtw, an update http://lists.openstack.org/pipermail/openstack-dev/2018-May/130513.html15:33
bogdandojust sent15:33
weshayhttps://bluejeans.com/whayutin15:33
weshayk15:33
weshayI think I get it and like it.. just want to be sure15:33
pandarlandy: bj/gcerami ?15:36
rlandypanda: joining15:38
*** ykarel|afk is now known as ykarel|away15:39
pandaweshay: myoung|ruck in what occasion we upload the container but they are not there ?15:40
weshaypanda, last promotion15:44
*** ykarel|away has quit IRC15:44
myoung|ruckpanda: last night15:45
EmilienMweshay texted me, he had power outage15:52
EmilienMhe's relocating15:52
EmilienMtime to take the day off folks!!! wes is gone !!15:52
*** sanjay__u has quit IRC15:59
*** bogdando has quit IRC16:03
pandamyoung|ruck: and what was the reason ?16:03
pandaoh, wes is not here /nick panda|party16:04
myoung|ruckpanda: https://bugs.launchpad.net/tripleo/+bug/1771318 is tracking, I don't know that we've got root cause, but overnight sshnaidm|bbl confirmed it and manually pushed the nova container for last promotion.16:05
openstackLaunchpad bug 1771318 in tripleo "No nova-compute image in docker after promotion" [High,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)16:05
*** trown is now known as trown|lunch16:08
*** lucasagomes is now known as lucas-afk16:09
pandamyoung|ruck: ok thanks16:12
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-undercloud-upgrades @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata-branch, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp- (2 more messages)16:12
weshaypanda, back had a power outtage16:14
weshaypanda, back had a power outtage16:14
pandaweshay: oh, no /nick panda|noparty16:18
pandarlandy: https://review.openstack.org/566565 latest patchset should be final16:18
weshaypanda, what's up am I missing something?16:18
pandatrown|lunch: we can talk about this, since weshay is on PTO for the next two weeks, and having a plan in place means anyway we'll have to wait for fixes to merge, I would prefer to confine the use of the script to the 2 jobs in scope + 2 other jobs to test the backwards compatibility of the script. So we can make small but useful steps at a time16:20
weshaypanda, starting this coming wednesday16:20
weshaysorry next wednesday16:20
pandaweshay: nothing, just following up on EmilienM joke16:20
weshayend of sprint16:21
weshayEmilienM, is not funny EVER16:21
*** tesseract has quit IRC16:21
rlandypanda: k - thanks - will take a look in a bit16:21
EmilienMweshay: go pay your electricity bills dude16:23
weshayI'm broke as a joke16:23
weshayballer on a budget16:24
*** zoli is now known as zoli|gone16:26
*** zoli|gone is now known as zoli16:26
*** panda is now known as panda|bbl16:29
*** jaosorior has quit IRC16:31
*** dtantsur is now known as dtantsur|afk16:33
weshaysshnaidm|bbl, myoung|ruck probably going to promote queens.. fs10 timed out.. waiting on 20 and 37 neither is critical16:36
*** marios has quit IRC16:41
*** gkadam has quit IRC17:07
*** yolanda_ has quit IRC17:09
rlandyha - kicking tox on select file - finally!!17:16
*** kopecmartin has quit IRC17:17
*** yolanda_ has joined #oooq17:20
*** trown|lunch is now known as trown17:22
*** tosky has quit IRC17:33
*** tosky has joined #oooq17:38
myoung|ruckweshay: (back) and ack17:39
myoung|rucksshnaidm|bbl: when do you want to trade |ruck for |rover?17:40
* myoung|ruck wants to work on some bugz17:40
*** holser__ has joined #oooq17:51
*** ssbarnea has joined #oooq17:51
*** holser___ has quit IRC17:51
*** holser__ has quit IRC17:56
*** links has joined #oooq18:01
myoung|ruckweshay, sshnaidm|bbl, hrm...(from queens.log)18:11
myoung|ruck2018-05-15 09:27:32,098 25359 INFO     promoter FINISHED promotion process18:11
myoung|ruck2018-05-15 18:00:14,808 27234 DEBUG    promoter No other promoters running. Acquired lock and continuing with promot18:11
myoung|ruckwas down for a bit18:11
*** dougbtv_ has quit IRC18:11
*** ykarel|away has joined #oooq18:12
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata, tripleo-ci-centos-7-undercloud-upgrades @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata-branch, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp- (2 more messages)18:13
weshaysigh18:13
myoung|ruckin the wee hours18:14
*** florianf has quit IRC18:16
rlandyweshay: finally - this works https://review.openstack.org/#/c/568287/10/zuul.d/layout.yaml18:17
rlandyselected kicking of zuul tests18:17
weshaynice rlandy18:19
weshayrlandy, it's a little confusing to me why that's in layout.yml18:20
weshaywhen I look at other config...18:20
weshaybut who said that I had to understand it18:20
weshayrlandy, trown can you guys see if you can get fs50 to work w/ the release config.. keep fielding questions from E18:21
weshayhttps://bugs.launchpad.net/tripleo/+bug/176899718:21
openstackLaunchpad bug 1768997 in tripleo "containerized-undercloud-upgrades: dlrn tasks don't run" [Critical,Triaged]18:21
weshaynot sure where you guys are at w/ the flow18:21
*** ykarel|away has quit IRC18:22
weshayso maybe we can get some updates to the bug at the min.. and look at getting it fixed in the longer run18:22
rlandysame as https://review.openstack.org/#/c/563526/6/zuul.d/layout.yaml18:23
weshaymyoung|ruck, make sure that queens promotion goes through please :)18:23
weshayrlandy, ah k18:23
rlandyweshay: yep - just updating the dry-run review with panad's updated18:23
* weshay relocates, power should be back on18:23
weshayrlandy, thanks18:23
rlandythen will try it18:23
myoung|ruckweshay: aye looking at it now18:25
myoung|ruckweshay: /me is plowing thru http://38.145.34.55/queens.log as i type this18:26
*** holser__ has joined #oooq18:28
*** holser__ has quit IRC18:32
rlandypanda: ok - rebased my patch on 566565 latest - updating to only copy release file if it exists18:33
*** agopi has quit IRC18:41
myoung|ruckweshay: something funky going on...promoter isn't seeing your patch (https://github.com/rdo-infra/ci-config/commit/fe555e06c88911e810948540aa09b7b21ff7efe2)18:49
* myoung|ruck looks deeper, captured https://review.rdoproject.org/etherpad/p/ruckrover-sprint13 @ L8518:49
weshaymyoung|ruck, ya.. I saw that it still had fs10 in criteria18:50
* myoung|ruck is on the box18:50
myoung|ruckweshay: ci-config wasn't up to date...was still @ 11 may18:53
weshayk18:53
myoung|rucknext time around should pick up new criteria.  exploring why...should be updated all the times18:53
weshayrlandy, trown reminder to pick up this bug and comment https://bugs.launchpad.net/tripleo/+bug/1768997 :)18:54
openstackLaunchpad bug 1768997 in tripleo "containerized-undercloud-upgrades: dlrn tasks don't run" [Critical,Triaged]18:54
myoung|ruckweshay: also still had our edits from friday...stashed them18:54
weshayah.. that's probably why18:54
myoung|ruckyeah...guessing is silently failing to pull18:55
myoung|ruckI'm writing a RFE (in my spare time HAH) to track improvements.  when I flip to |rover I want to address it18:55
myoung|ruck(this and other stuff)18:55
myoung|rucksshnaidm|bbl: ^^18:55
myoung|ruckweshay killing the current promotoer script and rekicking.  why wait eh?18:56
weshay+118:56
rlandyweshay: what we are currently working on is the releases script18:57
rlandywhether it fixes picking up the right release per zuul change, not sure18:58
rlandywe can try it18:58
weshayrlandy, anything that would drive that forward.. all that needs to happen really.. is that the right release is passed to the update playbook18:58
rlandywhat happened to the fs naming :(18:58
weshayand the upgrade role uses the passed release.. and not a calculated one18:58
weshayrlandy, with regards to?  fs18:58
rlandytripleo-ci-centos-7-containerized-undercloud-upgrades18:59
weshayrlandy, it's an upstream job19:00
weshayrlandy, the upstream jobs still do not use fs$$$19:00
rlandyok - let's see what kicks with current changes19:01
weshayrlandy, note the upgrade role uses a var != release19:01
weshayI think19:02
rlandyweshay: k - just in the middle of commit a fix - will look in a few19:02
weshay%gatestatus19:09
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario003-multinode-oooq-container, tripleo-ci-centos-7-containers-multinode, tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario004-multinode-oooq-container, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-queens,  (2 more messages)19:09
*** agopi has joined #oooq19:10
*** holser__ has joined #oooq19:11
weshaymyoung|ruck, fyi https://bugs.launchpad.net/tripleo/+bug/177141419:16
openstackLaunchpad bug 1771414 in tripleo "RFE: install ovb-tenant-cleanup.sh script in tripleo infra tenant and execute via cron" [High,Triaged]19:16
weshaysshnaidm|bbl, ^19:16
*** holser__ has quit IRC19:16
myoung|ruckweshay: ahh thanks, crossing that one off my list19:16
myoung|ruck(to log)19:16
myoung|ruckweshay: so far container download seems to be moving right along19:17
myoung|ruckWe've pulled  *** 42 *** so far in just a few mins19:18
weshaymyoung|ruck, please update https://bugs.launchpad.net/tripleo/+bug/177097219:19
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)19:19
weshayhttps://logs.rdoproject.org/65/566565/7/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/Z14cd35a437b546d0a25ba0721c29b5d0/undercloud/home/jenkins/overcloud_prep_images.log.txt.gz19:19
*** links has quit IRC19:20
*** jaosorior has joined #oooq19:21
rlandyweshay: ok - looking into that bug now with our current sprint stuff19:24
weshayrlandy, thanks19:25
weshayEmilienM, ^19:25
EmilienMhi19:28
EmilienMcontext? -sorry-19:28
weshayEmilienM, your fs50 dlrn patch bug19:38
EmilienMah nice !19:47
*** jtomasek has quit IRC19:51
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario003-multinode-oooq-container, tripleo-ci-centos-7-containers-multinode, tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario004-multinode-oooq-container, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-queens,  (2 more messages)20:13
*** agopi has quit IRC20:25
*** holser__ has joined #oooq20:27
*** sshnaidm|bbl is now known as sshnaidm|rover20:42
weshaymyoung|ruck, k21:00
weshaydo you have bugs on scen001/00221:00
weshaymyoung|ruck, for master21:01
weshaylooks like the same issue21:01
myoung|ruckintrospection ovb...recreate is running now21:02
weshaymyoung|ruck, cool.. how about scen001/002?21:04
myoung|ruckwalking thru sova21:04
myoung|rucknow21:04
myoung|ruckscen001 has had 4 fails today, 2 are the ovb issue, looking at the other 221:05
*** trown is now known as trown|outtypewww21:05
myoung|ruckscen002 is on deck21:05
weshayscen001/002 are both failing on the same issue21:06
myoung|ruckgah...never mind.  details in etherpad...21:06
weshaymyoung|ruck, also failing in the gate.. so + alert21:06
sshnaidm|roverweshay, myoung|ruck so what is with queens?21:06
myoung|ruckit's about to promote21:07
weshaysshnaidm|rover, almost done promoting21:07
weshaymyoung|ruck, what line21:07
myoung|ruckit's pushed everything to docker21:07
myoung|rucksshnaidm|rover: cleaning up and about to push symlinks via dlrn21:07
weshaymyoung|ruck, what line in the etherpad21:07
myoung|ruckweshay: you are generally about 20 seconds ahead of me.  moment.21:08
myoung|ruckor hours.  i dunno21:08
myoung|ruck3121:08
weshayline 31?21:10
weshaythird party?21:10
weshayweird.. myoung|ruck I see http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/undercloud/home/zuul/overcloud_prep_containers.log.txt.gz#_2018-05-15_17_11_3521:16
myoung|ruckweshay: yes...that's the missing nova container i had thought we u/l last night21:17
myoung|ruckworming thru logs now here https://review.rdoproject.org/etherpad/p/ruckrover-sprint1321:17
myoung|ruck@3121:17
myoung|rucki need a few mins21:19
*** holser__ has quit IRC21:21
myoung|ruckweshay: ugh21:49
myoung|ruckqueens promotion just failed21:49
myoung|ruckfailed: [localhost] (item=[u'nova-placement-api', u'current-tripleo']) => {"changed": false, "item": ["nova-placement-api", "current-tripleo"], "msg": "Error removing image docker.io/tripleoqueens/centos-binary-nova-placement-api:current-tripleo - UnixHTTPConnectionPool(host='localhost', port=None): Read timed out. (read timeout=60)"}21:49
myoung|ruckit actually failed while cleaning up21:49
myoung|ruckwhich borked out the calling script...which failed the promoter and it gave up, instead of continuing with promotion workflow21:50
myoung|ruck2018-05-15 21:41:28,797 25051 ERROR    promoter Command '['ansible-playbook', '/home/centos/ci-config/ci-scripts/container-push/container-push.yml']' returned non-zero exit status 221:50
myoung|ruckTraceback (most recent call last):21:50
myoung|ruck  File "/home/centos/ci-config/ci-scripts/dlrnapi_promoter/dlrnapi_promoter.py", line 142, in tag_containers21:50
myoung|ruck    env=env, stderr=subprocess.STDOUT).split("\n")21:50
myoung|ruck  File "/usr/lib64/python2.7/subprocess.py", line 575, in check_output21:50
myoung|ruck    raise CalledProcessError(retcode, cmd, output=output)21:50
myoung|ruckit should loop around and try again.  I don't have root cause for why it failed to remove...that's a new one21:51
weshayfak21:51
myoung|ruckI can look later too...i'm beyond late at this point for "adulting have-to"21:51
myoung|ruckone of the RFE's im thinking of for the promoter is to leave a few promotions worth of images in the local registry, so in this specific case (if...err..WHEN...it happens again) we don't pay the tax for pulling / pushing all these image layers again21:52
myoung|ruckit'll just loop around and try again for the delta21:52
myoung|ruckpanda|bbl: ^^ have you seen this before (random failure to remove an image from promoter's local registry?21:53
* myoung|ruck transmogrifies himself into a taxi and will return later21:53
*** myoung|ruck is now known as myoung|ruck|afk21:53
*** Goneri has quit IRC21:53
* myoung|ruck|afk guesses this could have been an RDO cloud hiccup on i/o access to underlying FS?21:54
sshnaidm|roversecond time it should pass21:56
myoung|ruck|afkweshay, panda|bbl the really wierd thing is that the image is actually removed from the local registry...it's gonzo21:56
myoung|ruck|afkunless someone else just reached in and nuked it21:56
myoung|ruck|afknope...i'm the only one in there.21:56
myoung|ruck|afkoh wait... sshnaidm|rover is entos   pts/0        2018-05-15 09:29 (bzq-79-181-125-206.red.bezeqint.net) you?21:57
myoung|ruck|afksshnaidm|rover: did you remove?21:58
sshnaidm|rovermyoung|ruck|afk, yes21:58
sshnaidm|rovermyoung|ruck|afk, remove what?21:58
myoung|ruck|afknova-placement-api image (see above).  will be back later to check on queens take 221:58
* sshnaidm|rover did nothing21:59
myoung|ruck|afksshnaidm|rover: ack, just checking...strange phantom image :)22:00
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario003-multinode-oooq-container, tripleo-ci-centos-7-containers-multinode, tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario004-multinode-oooq-container, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-queens,  (2 more messages)22:13
*** saneax-_-|AFK is now known as saneax23:01
rlandyweshay: still around?23:22
rlandyweshay: so you can see the args passed ... http://logs.openstack.org/60/567060/17/check/tripleo-ci-centos-7-undercloud-upgrades/ec045e7/logs/playbook_executions.txt.gz23:23
rlandywe are passing the same release file to all23:23
rlandyfor fs047 we should be running the release script23:23
rlandywould have to try a review with a couple other combined23:25
*** tosky has quit IRC23:26
rlandyadded a test review - will watch jobs23:45

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!