Tuesday, 2018-06-26

*** rlandy has quit IRC00:00
*** yolanda_ has joined #oooq00:06
*** yolanda__ has quit IRC00:09
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722400:51
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722402:51
*** skramaja has joined #oooq02:59
*** jaganathan has joined #oooq03:00
*** yolanda__ has joined #oooq03:15
*** yolanda_ has quit IRC03:18
*** yolanda__ has quit IRC03:23
*** yolanda__ has joined #oooq03:29
*** yolanda_ has joined #oooq03:34
*** noama has joined #oooq03:34
*** yolanda__ has quit IRC03:36
*** yolanda__ has joined #oooq03:44
*** yolanda_ has quit IRC03:45
*** agopi has quit IRC03:49
*** agopi has joined #oooq03:49
*** ratailor has joined #oooq04:02
*** yolanda_ has joined #oooq04:03
*** yolanda__ has quit IRC04:06
*** yolanda has joined #oooq04:08
*** yolanda_ has quit IRC04:09
*** yolanda_ has joined #oooq04:10
*** agopi has quit IRC04:13
*** yolanda has quit IRC04:13
*** hamzy has quit IRC04:17
*** hamzy has joined #oooq04:18
*** hamzy has quit IRC04:22
*** hamzy has joined #oooq04:36
*** skramaja_ has joined #oooq04:50
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722404:51
*** skramaja has quit IRC04:51
*** ccamacho has quit IRC05:06
*** jaganathan_ has joined #oooq05:09
*** jaganathan has quit IRC05:13
*** ykarel has joined #oooq05:20
*** ratailor_ has joined #oooq05:23
*** ratailor has quit IRC05:27
*** waleedm has joined #oooq05:30
*** skramaja has joined #oooq05:34
*** skramaja_ has quit IRC05:34
*** quiquell has joined #oooq05:35
*** matbu has joined #oooq05:35
*** udesale has joined #oooq05:44
*** bogdando has joined #oooq05:49
quiquellarxcruz: https://bugs.launchpad.net/tripleo/+bug/177863706:03
openstackLaunchpad bug 1778637 in tripleo "mistral_tempest_tests.tests.api.v2.test_actions.ActionTestsV2: MismatchError: [u'aodh.alarm_create'," [Critical,Triaged] - Assigned to Quique Llorente (quiquell)06:03
ykarelso it happened again06:04
quiquellykarel: You recognize it ?06:07
ykarelquiquell, we have seen this earlier as wel06:07
ykarelis fs01706:07
quiquellykarel: Now we don't see more, beacous we have a lot of timeouts so tempest doesn't get executed06:08
quiquell(I think)06:08
ykarelquiquell, timeout ? fs020?06:08
*** ccamacho has joined #oooq06:09
quiquellykarel: Multiple scenearios, give a min to analyze06:09
ykarelquiquell, ack06:09
*** saneax has joined #oooq06:12
*** yolanda__ has joined #oooq06:21
*** yolanda_ has quit IRC06:24
*** pgadiya has joined #oooq06:30
*** pgadiya has quit IRC06:30
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722406:51
*** gkadam has joined #oooq06:58
*** noam has joined #oooq07:01
*** noama has quit IRC07:04
*** tesseract has joined #oooq07:14
*** quiquell is now known as quique|rover|afk07:16
*** zoli is now known as zoli|wfh07:16
*** zoli|wfh is now known as zoli07:16
*** pgadiya has joined #oooq07:22
*** pgadiya has quit IRC07:22
*** kopecmartin has joined #oooq07:23
*** florianf has joined #oooq07:24
*** florianf has quit IRC07:25
*** amoralej|off is now known as amoralej07:28
*** tosky has joined #oooq07:41
*** quique|rover|afk is now known as quiquell|rover07:42
*** noam__ has joined #oooq07:45
*** noam has quit IRC07:46
*** holser_ has joined #oooq08:01
arxcruzquiquell|rover: if you haven't yet, i can take a look08:10
quiquell|roverarxcruz: Go for it, I am looking at a RDO issue08:11
quiquell|roverarxcruz: I have also another one with a openstack returning 500 at temptest08:11
quiquell|roverarxcruz: But that feels less tempest related08:11
ajogi folks08:13
ajohi08:13
quiquell|roverHello ajo08:13
ajoanybody experienced failure to boot/ping the undercloud08:13
ajo?08:13
ajoquiquell|rover: In one of my servers it's fine, but on other (completely new from scratch) the undercloud doesn't boot08:13
*** noam__ has quit IRC08:13
ajoI'm starting virt-manager via VNC to check08:13
quiquell|roverajo: No issues here regarding this08:14
arxcruzquiquell|rover: this is in check, but i'll create an env08:14
ajoquiquell|rover: ack thanks08:14
quiquell|roverarxcruz: It's a gate job08:15
quiquell|roverarxcruz: from https://review.openstack.org/#/c/573142/08:15
arxcruzquiquell|rover: yup08:16
*** saneax has quit IRC08:20
*** saneax has joined #oooq08:29
*** jaosorior has quit IRC08:39
*** gkadam_ has joined #oooq08:44
*** gkadam has quit IRC08:45
*** gkadam_ has quit IRC08:45
*** gkadam_ has joined #oooq08:45
*** gkadam__ has joined #oooq08:48
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722408:51
*** gkadam_ has quit IRC08:51
*** brault has quit IRC08:57
*** brault has joined #oooq09:13
*** dtantsur|afk is now known as dtantsur09:18
*** ykarel_ has joined #oooq09:20
*** ykarel_ is now known as ykarel|away09:22
*** ykarel has quit IRC09:23
*** ratailor__ has joined #oooq09:24
*** ykarel|away has quit IRC09:25
*** ratailor_ has quit IRC09:26
*** yolanda_ has joined #oooq09:50
*** yolanda__ has quit IRC09:53
*** zoli is now known as zoli|lunch10:14
*** jaganathan_ has quit IRC10:19
*** jaganathan_ has joined #oooq10:19
*** jaosorior has joined #oooq10:29
*** sshnaidm|afk is now known as sshnaidm10:30
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722410:51
*** udesale has quit IRC11:33
arxcruzquiquell|rover: i wasn't able to reproduce the failure from http://logs.openstack.org/42/573142/4/gate/tripleo-ci-centos-7-scenario003-multinode-oooq-container/9973f47/logs/tempest.html.gz11:34
arxcruzit's passing on my env11:34
quiquell|roverarxcruz: Ok will recheck11:35
*** atoth has joined #oooq11:38
*** zoli|lunch is now known as zoli|wfh11:38
*** zoli|wfh is now known as zoli11:38
*** anande has joined #oooq11:45
*** amoralej is now known as amoralej|lunch11:59
*** ratailor__ has quit IRC12:21
*** trown|outtypewww is now known as trown12:24
*** rlandy has joined #oooq12:30
rlandymarios: hello - ping re: hardware box12:34
mariosrlandy: o/12:34
rlandymarios: still having trouble logging in?12:34
* rlandy will check it12:34
mariosrlandy: should i try again (I could get into ssh but no the drac console). trying on rdo cloud just now so not urgent12:34
mariosrlandy: thank you12:35
rlandymarios: root/calvin12:35
rlandymy mistake12:35
mariosrlandy: np trying12:35
mariosrlandy: thanks works12:36
rlandymarios: go to virtual console - launch virtual console12:36
rlandysay yes/ok to all options12:37
mariosrlandy: seem to be missing some plugin (chrome)12:37
rlandyyou can power down/up or reboot12:37
rlandyyep - so you will need to modify your browser to accept this window12:37
weshay|ruckquiquell|rover, https://review.openstack.org/#/c/576990/12:37
mariosrlandy: ack thanks (doing/found some relevant info)12:38
rlandymarios: you will need iced tea web to open it12:38
weshay|ruckquiquell|rover, https://review.openstack.org/#/c/577809/12:39
mariosrlandy: ack12:39
*** skramaja has quit IRC12:40
rfolco_marios, I am getting OOM on undercloud_reinstall... does this help ? https://review.openstack.org/57802312:50
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722412:51
mariosrfolco_: well it just adds meminfo into the logs (panda gave this to me as homework ;)12:55
mariosrfolco_: don't think it will solve your oom but it might give you more info about what is happening12:55
rfolco_marios, ack thanks12:56
pandamarios: do you find it helpful ?12:58
pandaquiquell|rover: any recent issues caused by mariadb crashes that you know of ?12:59
quiquell|roverpanda: Nope13:01
quiquell|roverpanda: Did you found something ?13:01
quiquell|roverpanda: Humm I have some internal errors at tempest13:01
quiquell|roverpanda: Could be caused by mariadb failing13:01
quiquell|roverpanda: https://bugs.launchpad.net/tripleo/+bug/177865513:02
openstackLaunchpad bug 1778655 in tripleo "An unexpected error prevented the server from fulfilling your request. (HTTP 500) (Request-ID: req-d89e9af9-9dff-4115-acb5-5246113ec8c6)" [Critical,Triaged] - Assigned to Quique Llorente (quiquell)13:02
pandaquiquell|rover: I have failures in my test job,13:02
pandaquiquell|rover: mariadb logs look clean in that job. doesn't seem to be the same13:04
quiquell|roverpanda: give it to me13:05
pandaquiquell|rover: give you what ?13:05
quiquell|roverpanda: the logs13:06
pandaquiquell|rover: nah, stay on the  bugs taht matter, the team has to solve this, since it's a new functionality13:06
quiquell|roverpanda: Ok, let me know if you need help13:07
*** tcw has quit IRC13:07
*** tcw1 has joined #oooq13:07
*** quiquell|rover is now known as quique|rover|lch13:08
pandarfolco_: oh, we have the same problem then13:10
pandarfolco_: upstream gets oom-killer too13:10
pandarfolco_: are you testing upstream ?13:11
pandarfolco_: but we're getting this during overcloud deploy13:11
pandarfolco_: thee oom-killer kills mariadb here13:11
mariospanda: yes thank you :)13:15
pandarfolco_: Jun 25 15:27:52 centos-7-rax-dfw-0000326756 kernel: Out of memory: Kill process 17196 (mysqld) score 43 or sacrifice child13:16
pandaJun 25 15:27:52 centos-7-rax-dfw-0000326756 kernel: Killed process 17196 (mysqld) total-vm:4803964kB, anon-rss:352740kB, file-rss:0kB, shmem-rss:0kB13:16
*** anande has quit IRC13:16
pandait takes alsmo 5G of RAM13:16
rfolco_panda, I get the OOM during undercloud_reinstall... since this is undercloud job only13:17
rfolco_undercloud deploy works well13:17
pandarfolco_: I'm testing the all in one job, undercloud install works well, there is no reinstall, then overcloud deploy fails13:18
pandahow is mairadb memory usage related to the reparenting ?13:19
pandarfolco_: the node flavor is exactly the same13:19
pandarfolco_: as the legacy node13:20
rfolco_panda, that's what I am trying to understand too13:21
rlandymyoung: quique|rover|lch: myoung: this is what I think is wrong with rhos-13 gates: https://paste.fedoraproject.org/paste/d7rP3stBA9BWkH4FVYUQZA13:22
*** dtantsur is now known as dtantsur|brb13:22
rlandyweshay|ruck: ^^13:22
rlandywhen you get to overcloud deploy, "no available hosts"13:23
rlandywhich is correct13:23
rlandywe have no flavor with a big enough disk13:23
rlandyrequirements for disk is 40 - you can't deploy on a flavor with 40 - has to be at least 4113:24
rlandycompare with RDO Cloud in the same paste13:24
rlandyhttps://rhos-dev-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/tq-gate-rhos-13-ci-rhos-ovb-featureset001 is set up the switch the gates to fs00113:25
* rlandy puts in a request to add flavors with big enough disk13:26
rlandymyoung: weshay|ruck: quique|rover|lch: afaict, we *do* have the correct image passed ...13:29
rlandy[stack@undercloud-86094 ~]$ cat /etc/redhat-release13:29
rlandyRed Hat Enterprise Linux Server release 7.5 (Maipo)13:29
rlandyso the image overrides may be a bug but it's not blocking gates13:29
*** quique|rover|lch is now known as quiquell|rover13:29
quiquell|roverrlandy: Hi13:30
rlandyquiquell|rover: hi - got to join another meeting now - will add you to the request for bugger flavors13:30
rlandybigger13:30
quiquell|roverrlandy: ack13:31
*** udesale has joined #oooq13:31
quiquell|roverrlandy: Thanks13:31
*** agopi has joined #oooq13:34
*** agopi has quit IRC13:34
rlandychandankumar: looking to join the office hours on #rdo - nothing going on there  - am I at wrong place/wrong time?13:35
rlandyYou are kindly invited to the meeting:13:35
rlandy   RDO Office Hours on 2018-06-26 from 13:30:00 to 14:30:00 UTC13:35
rlandyarxcruz: ^^??13:37
*** sanjay__u has quit IRC13:38
rfolco_panda, vm.swappiness = 60... it was 30 in the legacy parent. This is even safer for OOM... https://mariadb.com/kb/en/library/configuring-swappiness/... still looking13:42
*** agopi has joined #oooq13:43
pandarfolco_: where did the parent set the swappiness ?13:45
rlandyquiquell|rover: hi - ok - there was no meeting, I'm back - more questions on rhos-13? you agree with my flavor assessment?13:46
rfolco_panda, I think it comes with the default one (60)... not sure where it was set/changed in the legacy one.13:46
rfolco_I just checked extras13:47
rfolco_http://logs.openstack.org/56/576956/7/check/tripleo-ci-centos-7-undercloud-oooq/96eb01f/logs/undercloud/var/log/extra/sysctl.txt.gz13:47
pandarfolco_: we don't have any idea how much memory mariadb uses normally13:50
pandarfolco_: but 5G is definitely too much13:50
*** jaganathan_ has quit IRC13:51
weshay|ruckarxcruz, kopecmartin you guys making progress on doc?13:52
pandarfolco_: what does the oom-killer kills in your case ?13:52
trownI just dont get how that would be different based on not running devstack13:52
pandatrown: indeed13:52
kopecmartinweshay|ruck, we're working on that whole day, but if it's there a progress , well , good question13:53
weshay|ruckheh.. k :)13:53
rfolco_os-refresh-config, panda http://logs.openstack.org/56/576956/7/check/tripleo-ci-centos-7-undercloud-oooq/96eb01f/logs/undercloud/home/zuul/undercloud_reinstall.log.txt.gz#_2018-06-22_18_14_3513:54
pandarfolco_: oh, you don't get the ooom-killer,13:54
rfolco_cannot allocate mem13:54
pandarfolco_: you're just getting oom from the process13:54
pandarfolco_: so you're not really sure what is taking all that memory13:55
rfolco_dstat says yes13:55
pandarfolco_: we can assume it's mariadb13:55
rfolco_at that exact point I see mem graph at the peak13:55
rfolco_consuming all RAM available13:55
pandarfolco_: does dstat say who's the glutton ?13:57
rfolco_glutton... new word13:57
*** amoralej|lunch is now known as amoralej13:58
rfolco_panda, will look again13:58
*** jaosorior has quit IRC13:59
pandatrown: how does mariadb behaves in libvirt reproduction ? I don't have a live deployment atm14:00
pandamaridb logs are scarce :(14:01
myoungweshay|ruck, quiquell|rover: prepared basic # of days status for #tripleo meeting, if there's realtime / extra notes that make sense to convey for weekly please add https://etherpad.openstack.org/p/tripleo-ci-squad-meeting @ L5014:01
trownpanda: restarting my env now almost to undercloud install14:03
*** rfolco_ is now known as rfolco14:03
quiquell|rovermyoung: ack14:03
myoungquiquell|rover: (optional) - I just used the notes from yesterday's scrum14:05
quiquell|rovermyoung: Maybe you can add that we have timeout issues in the gates14:07
quiquell|rovermyoung: ass addendum of the using ara in more depth14:07
myoungquiquell|rover: aye, i have "using ara in more depth to diagnose timeout issues" already at L5514:08
quiquell|rovermyoung: ack14:09
rfolcopanda, swap is missing14:16
pandarfolco: yeah, we don't generally use swap14:16
rfolcopanda, devstack-gate has a role for it --> https://github.com/openstack-infra/devstack-gate/blob/101e0fbbc5e8851c53b4d09672bd26cee0099201/playbooks/roles/fix_disk_layout/tasks/main.yaml14:16
pandarfolco: and even with swap, mariadb would probably continue to eat memory until even the swap is filled14:17
pandammhh14:17
pandammmmmhhh14:17
pandaso this is masking a mariadb memory problem ?14:19
weshay|ruckpanda, https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/infra-setup/roles/te-broker/tasks/main.yml#L11614:32
*** quiquell|rover is now known as quiquell|off14:34
*** ccamacho has quit IRC14:35
*** ccamacho has joined #oooq14:35
mariospanda: http://logs.openstack.org/56/576956/7/check/tripleo-ci-centos-7-undercloud-oooq/96eb01f/logs/undercloud/var/log/host_info.txt.gz14:37
weshay|ruckmarios, http://logs.openstack.org/56/576956/7/check/tripleo-ci-centos-7-undercloud-oooq/96eb01f/logs/undercloud/var/log/extra/dstat.html.gz14:38
mariosweshay|ruck: thanks14:39
pandamarios: it's clearly mysqld14:39
trownpanda: on libvirt it is using 3.7G overcloud deploy is just starting though14:45
*** dtantsur|brb is now known as dtantsur14:45
mariosbandini: are you aware of any recent changes in mariadb that could be causing the memory spike we are investigating14:46
trownpanda: that is one difference on libvirt though.. we are using 16G RAM VMS14:46
pandatrown: yep, we need to add the swap, I see the latest requirements for undercloud is 16G, upstream flavor is only 8G14:49
bandinimarios: not that I know of, no. (but I have not followed stuff recently, am currently chasing this rabbitmqeddon)14:51
mariosbandini: ack thanks just checking14:51
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722414:51
pandatrown: rfolco weshay|ruck there is a standard role in zuul repo to configure swap, we should use that14:52
*** saneax has quit IRC14:52
rfolcopanda, hmmm will test it thanks14:53
mariospanda: rfolco there is even a env file in tht we can just include btw for swap14:53
mariospanda: rfolco 2 in fact (file vs partition)14:53
marioshttps://github.com/openstack/tripleo-heat-templates/blob/master/environments/enable-swap.yaml14:54
rfolcopanda, this ? openstack-zuul-jobs/roles/configure-swap/tasks/main.yaml14:54
marioshttps://github.com/openstack/tripleo-heat-templates/blob/master/environments/enable-swap-partition.yaml14:54
pandarfolco: yes14:54
trownmarios: I think it is an issue in the undercloud though14:56
mariostrown: ah right14:57
trownpanda: weird that mysqld is using an extra G of memory in CI though15:02
pandatrown: I'm looking beter at the top command, the memory is pretty much the same. 4.7G resident, 300M resident. I was reading resident wrong before, it's not 3G15:03
pandas/better//15:03
mariosrfolco: how do you manage to have such a dark background ? :) is it night time there?15:06
mariosrfolco: it looks cool15:07
mariosrfolco: i want it too :D15:07
mariosrfolco: do you literally have a blackboard behind you ?:)15:07
mariospanda: get yo swap on15:08
rfolcomarios, window is 2% open in front of me, I have a usb flash with light plugged :)15:08
mariosrfolco: ack15:11
pandamarios: rfolco updated PS15:11
weshay|ruckpanda, https://review.openstack.org/#/c/576904/15:11
weshay|ruckrlandy, trown ^15:11
pandatrown: uploaded patchset 20 adding the swap role from infra15:15
*** ccamacho has quit IRC15:15
trownpanda: cool, looking in my libvirt env heat is actually using double the memory of mysqld in terms of RSS15:16
trownpanda: it is split over 4 workers, but each uses ~250M15:16
trownpanda: still odd why we would need more memory just because we dont use devstack-gate... that still doesnt make sense15:18
*** waleedm has quit IRC15:18
rlandyquiquell|off: weshay|ruck: looking at the stack create failure on your review ... checked tenant - there are a bunch of delete_failed stacks there15:18
pandatrown: the nodes have nly 8G of RAM, and the legacy playbook was adding swap, we were missing that in our now setup15:20
weshay|ruckrlandy, https://review.rdoproject.org/r/1448215:20
weshay|ruckrlandy, :)15:21
weshay|ruckI'm cleaning up now15:21
trownpanda: ah that makes sense then15:21
weshay|ruckthanks15:21
pandatrown: the minimum requirement is 12-16G for the undercloud, so we need the swap15:21
weshay|ruckcheck it out :) http://logs.openstack.org/60/577960/2/check/tripleo-ci-centos-7-undercloud-containers/7f49be7/logs/ara_oooq_root/15:24
pandaugh, job in queue since 20 minutes15:30
pandano fast feedback today.15:31
weshay|ruckhttps://review.openstack.org/#/c/577960/ reviews please15:33
pandaswap role is running15:40
pandabut it's taking 4 extra minutes for something htat is maybe not relevant anymore15:41
panda2018-06-26 15:37:06.292986 | TASK [configure-swap : Copy old /opt]15:41
panda2018-06-26 15:40:55.218132 | primary | ok: Runtime: 0:03:48.36460115:42
pandawe need to wipe /opt and ensure we are not using anything there first if we don't want it to take too much15:44
*** trown has quit IRC15:44
*** trown|brb has joined #oooq15:46
*** bogdando has quit IRC15:46
*** ykarel|away has joined #oooq15:51
kopecmartinweshay|ruck, arxcruz I've pushed an update in docs about containerized tempest https://review.openstack.org/#/c/565161/15:53
weshay|ruckk.. thanks kopecmartin15:54
kopecmartinweshay|ruck, the bug can't be verified sooner than the doc is merged, can it?15:56
weshay|ruckkopecmartin, as long as we make sure it merges.. anyone checking the bug would probably not know15:57
weshay|ruck:)15:57
*** hamzy has quit IRC16:00
*** ykarel|away is now known as ykarel16:03
*** udesale has quit IRC16:09
*** trown|brb is now known as trown16:12
ajoweshay|ruck:  https://review.openstack.org/57814216:13
ajoif you can eyeball that ;) it'd be great16:13
ajoweshay|ruck: my undercloud was hanging on boot because of that16:13
ajodont ask me why..16:13
* weshay|ruck looks16:13
arxcruzrlandy: hey, i did not had that on my calendar :(16:14
rlandyarxcruz: it was en email from chandan - maybe a mistake16:16
rlandyarxcruz: anyways, thanks for the tempest tour last week  - I looked over he repos and reviews you pointed at16:16
arxcruzrlandy: cool :)16:17
rlandyarxcruz: are there any simple/not urgent tasks I can start working on - in my spare time :)?16:17
arxcruzrlandy: i need to check, i'll get back to you :)16:17
rlandyarxcruz: thanks - whenever you find some. pls ping me. no rush16:18
rlandyI just need to get my feet wet16:18
weshay|ruckmarios, you still on? I have a question16:18
*** tesseract has quit IRC16:40
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722416:51
pandatrown: we are 8 minutes into overcloud deploy, previous attempts failed in 5 minutes. Seems we are on a better spot al least16:54
trownpanda: nice16:54
*** panda is now known as panda|off17:01
weshay|ruckkopecmartin, arxcruz remarks in the review17:04
kopecmartinweshay|ruck, thanks, i also found issues, so fixing them too17:04
weshay|ruckthanks17:05
weshay|ruckrlandy, fyi. tenant is clean17:05
*** zoli is now known as zoli|gone17:09
*** zoli|gone is now known as zoli17:09
weshay|ruckrlandy, panda this is odd.. the tebroker log is not getting updated17:14
weshay|ruck17:12:35 +(/opt/stack/new/tripleo-ci/toci_gate_test.sh:285): ./testenv-client -b 192.168.103.254:4730 -t 17400 --envsize 4 --ucinstance 83906ba4-83eb-4a91-931b-62c2affb3402 --net-iso multi-nic -- ./toci_quickstart.sh17:14
weshay|ruck17:12:35 +(/opt/stack/new/tripleo-ci/toci_gate_test.sh:279): sleep 120017:14
weshay|ruck-rw-r--r--. 1 root root        0 Jun 26 14:20 testenv-worker.log17:14
weshay|ruckmaybe it takes a minute17:14
*** dtantsur is now known as dtantsur|afk17:15
rlandyyou pushed a change to te-broker?17:18
rlandypicks up changes once a day unless you manaullyc hange it17:19
weshay|ruckrlandy, I didn't push any changes17:21
weshay|ruckrlandy, got nothing going on here.. must be related to the rdo migration, but not sure17:22
weshay|ruckmaybe the tenant id changed17:22
rlandychecking tenant17:24
weshay|ruck systemctl -a | grep te_workers17:25
weshay|ruck● te_workers.service                                                          loaded    failed   failed    TE Workers17:25
weshay|ruckrestarted17:26
weshay|ruckand we're off17:26
weshay|ruckrlandy, http://38.145.34.41/testenv-worker.log17:27
rlandyenvs 2 and 3 are create_ in_progress17:28
rlandyand then nothing17:29
weshay|ruckappears to be working https://review.rdoproject.org/jenkins/job/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-pike/1073/console17:29
weshay|ruckya.. now 3 in create complete17:29
rlandy2018-06-26 17:27:36,155 - testenv-worker-11 - INFO - Getting new job...17:29
rlandy2018-06-26 17:27:36,910 - testenv-worker-1 - ERROR - + ENVNUM=117:29
rlandyarethere jobs to consume those envs17:30
rlandythere is something wrong here17:31
*** holser_ has quit IRC17:39
*** ykarel has quit IRC17:44
*** kopecmartin has quit IRC17:46
*** hamzy has joined #oooq17:50
*** amoralej is now known as amoralej|off17:56
*** trown has quit IRC18:05
*** jaosorior has joined #oooq18:13
weshay|ruckrlandy, https://github.com/openstack/browbeat/blob/master/ci-scripts/tripleo/microbrow-perfci.sh#L90-L10318:16
*** trown has joined #oooq18:23
*** dmellado has quit IRC18:32
*** hubbot has quit IRC18:33
*** atoth has quit IRC18:53
rookhttps://rhos-dev-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/user/agopi/my-views/view/Browbeat_view/job/browbeat-quickstart-queens-baremetal-mixed/25/console18:55
rookweshay|ruck: ^18:55
rookWe see the same error referenced in the launchpad agopi shared.18:55
rook18:48:04     SyntaxError: '<' operator not allowed in environment markers18:55
weshay|ruckhrm19:00
weshay|ruckk19:00
*** yolanda__ has joined #oooq19:04
*** yolanda_ has quit IRC19:07
*** yolanda_ has joined #oooq19:09
*** sshnaidm has quit IRC19:09
*** yolanda__ has quit IRC19:12
*** yolanda__ has joined #oooq19:15
*** yolanda_ has quit IRC19:18
*** holser_ has joined #oooq19:20
*** holser_ has quit IRC19:20
weshay|ruckrlandy, re: https://review.openstack.org/#/c/57690419:32
weshay|ruckIf other roles need those vars, the safest thing to do is to leave them in extras I think19:32
rlandyweshay|ruck: ok - I just thought I'd question it19:33
weshay|rucknot sure if there is a right / wrong here.. I do see why they could be in container prep too19:33
rlandyweshay|ruck: fine _ I made my little point19:33
weshay|ruckheh19:33
weshay|ruckI could put a patch on top19:33
weshay|ruckquickstart.sh is broken though19:33
rlandyweshay|ruck: no if quickstart is broken, push it through19:35
rlandymuch bigger deal19:35
weshay|ruck24hr queue :)19:35
rlandyoh no - again??19:37
rlandyweshay|ruck: https://github.com/openstack/tripleo-quickstart-extras/blob/master/config/environments/rdocloud.yml#L1519:37
rlandygoing to change this to m1.xlarge19:37
rlandyif CI is going with xlarge now19:37
rlandythey will have to match19:38
weshay|ruckrlandy, CI? which ci?19:38
rlandyweshay|ruck: we just discussed te-borker is using xlarge, no?19:38
rlandyfor the undercloud19:38
weshay|ruckya..19:39
rlandyhttps://github.com/openstack/tripleo-quickstart-extras/blob/master/config/environments/rdocloud.yml#L15 is for the reproducer19:39
weshay|ruckbrb19:39
*** agopi is now known as agopi|brb19:40
*** agopi|brb has quit IRC19:45
weshay|ruckback19:48
*** sshnaidm has joined #oooq19:50
rlandyon second thoughts ...19:50
rlandyhttps://github.com/openstack-infra/tripleo-ci/blob/master/scripts/prepare-ovb-cloud.sh#L1319:50
rlandyI am not sure it is xlarge19:51
rlandywill have to check it when rdocloud returns to us19:51
rlandyhmmm ... today was not a good day to try sell moving to rdocloud19:53
*** waleedm has joined #oooq19:58
weshay|rucklolz20:02
weshay|ruckno doubt20:02
weshay|ruckI think rdo-cloud was scared of Joe20:02
rlandyshame20:04
*** noama has joined #oooq20:18
weshay|ruckrlandy, when you have a sec https://docs.google.com/document/d/12XqodWjRUHd-AskAJJ543JtGLK6ZRzhxw_YlOqyhWdc/edit20:26
weshay|ruckplease just give it a quick glance before I share w/ Joe and the crew20:26
weshay|ruckrlandy, can you add me to that ticket please too :)20:28
weshay|ruckmyoung, rlandy anyone know what happened to the pike branch here? https://code.engineering.redhat.com/gerrit/gitweb?p=tripleo-environments.git20:33
weshay|ruckerr sorry20:33
weshay|ruckwrong git repo :) https://code.engineering.redhat.com/gerrit/gitweb?p=tripleo-quickstart.git20:34
* myoung looks20:34
weshay|ruckrook, https://bugs.launchpad.net/python-cliff/+bug/159784620:37
openstackLaunchpad bug 1597846 in cliff "Cannot install cliff 2.1.0 in Python 3.x" [Undecided,Invalid]20:37
rlandyweshay|ruck: I think you are on the ticket I forwarded20:44
weshay|ruckdon't see it in my list20:45
weshay|rucksearching for it by id does not bring it up20:45
rlandyweshay|ruck: browbeat notes look good20:47
weshay|ruckk. thanks20:47
rlandyfew minor comments20:47
*** hamzy has quit IRC20:47
rlandyweshay|ruck: you should have access to the ci-rhos flavors ticket now ... https://redhat.service-now.com/pnt?id=ticket&table=x_redha_pnt_devops_table&sys_id=deede177db321f00a9e306e2ca96199320:48
rlandychecking  the reconfig networking ticket20:49
weshay|ruckthanks20:50
weshay|rucksee it20:50
*** hubbot has joined #oooq20:51
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722420:52
*** waleedm has quit IRC21:00
*** trown is now known as trown|outtypewww21:01
rlandyweshay|ruck: https://redhat.service-now.com/pnt/?id=ticket&table=x_redha_pnt_devops_table&sys_id=6bd1c3f3db7e5f003306abc5ca96190d21:02
myoungweshay|ruck: w.r.t pike repo, what do you mean?  looks like is in sync with upstream etc..21:02
rlandyyou should see both now21:02
myoungweshay|ruck: is there an issue with stable/pike branch?21:02
weshay|ruckmyoung, no.. just the pip cache21:03
weshay|ruckin jenkins21:03
weshay|ruckdo you by chance recall how to disable that?21:03
* myoung looks21:03
myoungoh yes!21:03
myoungsec21:03
* myoung fetches details21:03
rlandywe ff'ed it to enable baremetal21:03
weshay|ruckrlandy, ya.. the branch is not the issue21:03
weshay|ruckwe're hitting https://bugs.launchpad.net/python-cliff/+bug/1597846 in jenkins only21:04
openstackLaunchpad bug 1597846 in cliff "Cannot install cliff 2.1.0 in Python 3.x" [Undecided,Invalid]21:04
myoungweshay|ruck: was this: https://bugs.launchpad.net/tripleo/+bug/177246021:05
openstackLaunchpad bug 1772460 in tripleo "rdo2: BM jobs failing b/c concurrent pip installs are failing due to sharing pip cache" [Critical,Fix released] - Assigned to Matt Young (halcyondude)21:05
myoungXDG_CACHE_HOME=$HOME/.cache/$EXECUTOR_NUMBER21:05
weshay|ruckk.. saw that in the config21:05
weshay|ruckhowever it needs to be cleared21:05
myoungso in jenkins workspace dirs (from which we run ansible) the pip cache should be located in ~/.cache/{ordinal} - so multiple jobs on same executor don't overlap21:06
weshay|ruckhrm21:06
weshay|ruckI removed the workspace on the slave21:06
weshay|ruckwell rm -Rf workspace/*21:06
myoungthe issue i hit was timing, when both were running pip installs and accessing a shared pip cache concurrently, was getting issues21:06
weshay|ruckpip install -r requirements works for me locally in python27 and 321:06
weshay|ruckya.. that is unrelated21:07
myoungaye21:07
myounghave link?  can look21:07
* myoung looks in bug21:07
weshay|ruckhttps://rhos-dev-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/periodic-pike-rdo_trunk-featureset020-1ctlr_1comp_64gb/65/console21:07
myoungahh 00:00:00.001 Started by upstream project "sbtest-tripleo-quickstart-pike-rdo_trunk" build number 4021:07
myoungweshay|ruck: ok so now these are running on py3?21:09
* myoung chuckles as he sees "Installed /Users/cleverdevil/.virtualenvs/wham/build/cliff" ... cleverdevil, wham, and cliff on the same line. omen?21:09
weshay|ruckI think it's phucked on the pip mirror21:12
myoungweshay|ruck: yeah bug ref's setuptools > 17.1, but we're pulling 39.2...21:12
*** yolanda_ has joined #oooq21:12
myoungweshay|ruck: can we (for now) freeze/fix python-cliff back a minor rev?21:13
myoungor is something requiring the new one?21:13
* myoung loks21:13
myoungahh wait...21:14
myoung00:01:21.954     Uninstalling setuptools-12.0.5:21:14
myoung00:01:22.027       Successfully uninstalled setuptools-12.0.521:14
myoungand from LP21:14
myoungThis is a serious issue and the current workaround is to pre-install `setuptools>=17.1` BEFORE running pip install.21:14
*** yolanda__ has quit IRC21:14
weshay|ruckf.. slave is on f2221:16
weshay|ruckmyoung, rlandy ya.. f... f22 is the reason21:19
myoungweshay|ruck: can hope to 22 --> 24 --> 26 pretty quickly21:20
myounghop*21:20
* myoung looks at which slave this is...thought we upgraded these already21:20
rlandycool21:20
rlandywe upgraded some21:21
myoungya rdo-manager-slave_rdo-ci-fx2-01-s6 is f2221:22
myoungchecking the others21:23
myoungrlandy, weshay|ruck, looks like fx2-01-s2 and fx2-01-s3 are also fedora 2221:26
myoungupdated descriptions in jenkins with a *f22* prefix21:26
myoungalso fx2-01-s121:27
*** noama has quit IRC21:28
myoungweshay|ruck, rlandy, https://rhos-dev-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/computer/rdo-manager-slave_rdo-ci-fx2-01-s5 is at f26 and is also now back in the rdo-manager-64 pool.  we can take out the f22's to a diff pool until upgraded to get these jobs rolling now...if ok i'll do that now21:29
*** myoung is now known as myoung|off21:56
*** yolanda__ has joined #oooq22:07
*** agopi has joined #oooq22:07
*** yolanda_ has quit IRC22:10
*** yolanda_ has joined #oooq22:21
*** yolanda__ has quit IRC22:24
*** dsneddon has quit IRC22:39
*** dsneddon has joined #oooq22:41
*** rlandy has quit IRC22:49
*** dsneddon has quit IRC22:50
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/56722422:52
*** tosky has quit IRC23:08
*** dsneddon has joined #oooq23:22

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!