Wednesday, 2018-03-21

*** wolverineav has quit IRC00:03
*** wolverineav has joined #tripleo00:04
openstackgerritSteve Baker proposed openstack/tripleo-docs master: WIP document workflow driven container prepare  https://review.openstack.org/55310400:07
*** ooolpbot has joined #tripleo00:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION00:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711100:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717400:10
*** ooolpbot has quit IRC00:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]00:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]00:10
*** sanjayu has joined #tripleo00:10
*** itlinux has joined #tripleo00:12
*** wolverineav has quit IRC00:18
*** wolverineav has joined #tripleo00:21
*** khyr0n has quit IRC00:28
*** wolverineav has quit IRC00:29
*** wolverineav has joined #tripleo00:30
*** jobcespedes has quit IRC00:32
*** wolverineav has quit IRC00:40
openstackgerritSteve Baker proposed openstack/tripleo-common master: Move build_service_filter to kolla_builder from tripleoclient  https://review.openstack.org/55473800:41
openstackgerritSteve Baker proposed openstack/tripleo-common master: WIP Perform multiple container image prepares and merge result  https://review.openstack.org/55473900:41
openstackgerritIan Wienand proposed openstack/tripleo-common master: Ensure output of shlex is quoted  https://review.openstack.org/55468400:52
*** wolverineav has joined #tripleo00:53
*** wolverin_ has joined #tripleo00:55
*** wolverineav has quit IRC00:59
*** mcornea has joined #tripleo01:05
*** mcornea has quit IRC01:06
*** ooolpbot has joined #tripleo01:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION01:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711101:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717401:10
*** ooolpbot has quit IRC01:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]01:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]01:10
*** wolverin_ has quit IRC01:13
*** jobcespedes has joined #tripleo01:13
*** wolverineav has joined #tripleo01:13
*** wolverineav has quit IRC01:17
*** jobcespedes has quit IRC01:18
*** dmacpher has joined #tripleo01:27
*** bfournie has joined #tripleo01:35
*** moshele has joined #tripleo01:39
*** fragatina has quit IRC01:39
*** fragatina has joined #tripleo01:41
openstackgerritIan Wienand proposed openstack-infra/tripleo-ci master: Install tripleo-common from source  https://review.openstack.org/55470501:41
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [DNM] testing 554684  https://review.openstack.org/55468501:42
*** fragatina has quit IRC01:46
*** jobcespedes has joined #tripleo01:48
*** ebarrera has quit IRC01:50
*** cshastri has joined #tripleo02:00
*** jobcespedes has quit IRC02:00
*** agopi has joined #tripleo02:03
*** ooolpbot has joined #tripleo02:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION02:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711102:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717402:10
*** ooolpbot has quit IRC02:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]02:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]02:10
*** atoth has quit IRC02:20
*** myoung|afk is now known as myoung02:23
*** myoung is now known as myoung|afk02:27
*** psachin has joined #tripleo02:39
*** wolverineav has joined #tripleo02:56
*** fragatina has joined #tripleo03:00
*** ooolpbot has joined #tripleo03:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION03:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711103:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717403:10
*** ooolpbot has quit IRC03:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]03:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]03:10
*** rlandy|bbl is now known as rlandy03:15
*** wolverineav has quit IRC03:16
*** wolverineav has joined #tripleo03:17
*** wolverineav has quit IRC03:21
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Fix default partition type  https://review.openstack.org/55477103:22
*** jaganathan has quit IRC03:27
*** jaganathan has joined #tripleo03:27
*** ramishra has joined #tripleo03:30
*** ykarel has joined #tripleo03:36
*** psahoo has joined #tripleo03:44
*** shreshtha has joined #tripleo03:48
*** dpawar has joined #tripleo03:52
*** skramaja has joined #tripleo03:53
*** skramaja_ has joined #tripleo03:58
*** tzumainn has quit IRC03:59
*** skramaja has quit IRC03:59
openstackgerrityatin proposed openstack/tripleo-quickstart master: [DNM] Enable network isolation for Queens+ releases in FS020  https://review.openstack.org/55452804:00
*** links has joined #tripleo04:00
*** udesale has joined #tripleo04:09
*** ooolpbot has joined #tripleo04:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION04:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711104:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717404:10
*** ooolpbot has quit IRC04:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]04:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]04:10
*** skramaja_ is now known as skramaja04:11
openstackgerritMerged openstack/tripleo-upgrade master: New major upgrade workflow implementation.  https://review.openstack.org/54833604:16
*** pdeore has joined #tripleo04:20
*** radeks has joined #tripleo04:29
*** radeks has quit IRC04:30
*** radeks has joined #tripleo04:30
*** pgadiya has joined #tripleo04:35
*** rlandy has quit IRC04:39
*** ratailor has joined #tripleo05:02
*** ratailor_ has joined #tripleo05:04
*** ratailor has quit IRC05:07
*** ooolpbot has joined #tripleo05:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION05:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711105:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]05:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717405:10
*** ooolpbot has quit IRC05:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]05:10
*** dpawar has quit IRC05:31
*** mdnadeem has joined #tripleo05:32
*** dpawar has joined #tripleo05:32
*** fragatina has quit IRC05:35
*** fragatina has joined #tripleo05:35
*** moshele has quit IRC05:39
*** dsariel has joined #tripleo05:45
*** akane_ has joined #tripleo05:58
*** assassin has joined #tripleo06:00
*** masco has joined #tripleo06:01
*** assassin has quit IRC06:04
*** karthiks has quit IRC06:06
*** agurenko has joined #tripleo06:06
*** ooolpbot has joined #tripleo06:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION06:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711106:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717406:10
*** ooolpbot has quit IRC06:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]06:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]06:10
*** jcoufal has joined #tripleo06:12
*** yprokule has joined #tripleo06:15
*** ihrachys has quit IRC06:15
*** jfrancoa has joined #tripleo06:18
*** jfrancoa has quit IRC06:19
*** jfrancoa has joined #tripleo06:19
*** jcoufal_ has joined #tripleo06:20
*** udesale has quit IRC06:21
*** jcoufal has quit IRC06:21
*** udesale has joined #tripleo06:21
*** ratailor_ has quit IRC06:22
*** karthiks has joined #tripleo06:23
*** ratailor has joined #tripleo06:24
*** marios has joined #tripleo06:27
*** jcoufal has joined #tripleo06:27
*** yprokule has quit IRC06:28
*** yprokule_ has joined #tripleo06:28
*** yprokule_ is now known as yprokule06:29
Tenguhello there06:29
*** radeks has quit IRC06:30
*** jcoufal_ has quit IRC06:30
*** moshele has joined #tripleo06:31
*** dpawar has quit IRC06:31
*** dbecker has quit IRC06:31
*** radeks has joined #tripleo06:32
*** dpawar has joined #tripleo06:32
*** ratailor_ has joined #tripleo06:32
*** waleedm has joined #tripleo06:33
openstackgerritOpenStack Proposal Bot proposed openstack/tripleo-ui stable/queens: Imported Translations from Zanata  https://review.openstack.org/55480606:34
*** ratailor has quit IRC06:35
*** karthiks has quit IRC06:35
*** StevenK has quit IRC06:35
*** sdake has quit IRC06:35
openstackgerritOpenStack Proposal Bot proposed openstack/tripleo-ui master: Imported Translations from Zanata  https://review.openstack.org/55480806:36
*** jbadiapa has joined #tripleo06:36
*** StevenK has joined #tripleo06:36
*** ratailor_ has quit IRC06:37
*** sdake has joined #tripleo06:37
*** sdake has joined #tripleo06:37
*** karthiks has joined #tripleo06:39
*** hjensas has quit IRC06:40
*** paramite_ has quit IRC06:41
*** ratailor has joined #tripleo06:41
*** dbecker has joined #tripleo06:46
*** agopi has quit IRC06:49
*** gkadam has joined #tripleo06:50
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart-extras master: WIP: Create import-role role.  https://review.openstack.org/55349206:53
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart-extras master: WIP: Add undercloud upgrades playbook.  https://review.openstack.org/55382706:53
openstackgerritJose Luis Franco proposed openstack-infra/tripleo-ci master: WIP: Run playbooks with custom args  https://review.openstack.org/55347407:02
openstackgerritJose Luis Franco proposed openstack-infra/tripleo-ci master: Use tripleo-upgrade role in undercloud upgrades job.  https://review.openstack.org/54897407:02
*** jaosorior has quit IRC07:05
*** oscar has joined #tripleo07:05
*** aufi has joined #tripleo07:06
*** ooolpbot has joined #tripleo07:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION07:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711107:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]07:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717407:10
*** ooolpbot has quit IRC07:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]07:10
*** akrivoka has joined #tripleo07:11
openstackgerritwaleed mousa proposed openstack/os-net-config master: Adding ethtool command after binding dpdk drivers in Mellanox nics  https://review.openstack.org/55404807:11
*** cylopez has joined #tripleo07:12
*** cylopez has quit IRC07:14
*** cylopez has joined #tripleo07:15
*** cylopez has left #tripleo07:15
*** rcernin has quit IRC07:21
*** pmannidi has quit IRC07:22
*** guits__ has joined #tripleo07:25
*** quiquell has joined #tripleo07:27
*** holser__ has joined #tripleo07:32
*** dpawar has quit IRC07:34
*** nyechiel_ has joined #tripleo07:35
openstackgerritHarald Jensås proposed openstack/python-tripleoclient master: Fix Genconfig - no HOME in environment  https://review.openstack.org/55467807:36
*** moshele has quit IRC07:36
*** hjensas has joined #tripleo07:38
*** hjensas has quit IRC07:38
*** hjensas has joined #tripleo07:38
*** dmacpher has quit IRC07:42
*** jaosorior has joined #tripleo07:44
openstackgerritHarald Jensås proposed openstack/tripleo-common master: Install python2-networking-baremetal in neutron-server  https://review.openstack.org/54545207:46
*** moshele has joined #tripleo07:46
openstackgerritHarald Jensås proposed openstack/instack-undercloud master: Use the new dnsmasq PXE filter in ironic-inspector  https://review.openstack.org/52394407:47
*** dpawar has joined #tripleo07:48
*** moshele has quit IRC07:50
*** moshele has joined #tripleo07:51
*** yamahata has joined #tripleo07:55
openstackgerritAdriano Petrich proposed openstack/tripleo-common master: Move password generation to deployment phase  https://review.openstack.org/54214308:02
*** ebarrera has joined #tripleo08:05
openstackgerritJose Luis Franco proposed openstack-infra/tripleo-ci master: Add multinode-overcloud-update playbook to run list  https://review.openstack.org/54705808:06
openstackgerritDamien Ciabrini proposed openstack/tripleo-heat-templates master: Fix update of pacemaker container images during major upgrade  https://review.openstack.org/54747608:09
*** ooolpbot has joined #tripleo08:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION08:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711108:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717408:10
*** ooolpbot has quit IRC08:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]08:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]08:10
*** ebarrera has quit IRC08:12
*** ebarrera has joined #tripleo08:13
*** florianf has joined #tripleo08:13
*** dparkes has joined #tripleo08:13
*** nyechiel_ has quit IRC08:14
*** nyechiel_ has joined #tripleo08:24
*** bogdando has joined #tripleo08:25
*** tesseract has joined #tripleo08:31
*** ffiore has joined #tripleo08:32
*** ratailor has quit IRC08:35
*** ratailor has joined #tripleo08:35
*** amoralej|off is now known as amoralej08:36
*** matbu has quit IRC08:37
*** ccamacho has joined #tripleo08:38
*** matbu has joined #tripleo08:39
openstackgerritHarald Jensås proposed openstack/tripleo-common master: Add ironic-neutron-agent container  https://review.openstack.org/54732108:39
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates stable/queens: WIP: fix fluentd upgrade tasks during ffu.  https://review.openstack.org/55483108:42
*** jpena|off is now known as jpena08:43
*** tosky has joined #tripleo08:43
*** matbu has quit IRC08:47
*** chem|eod is now known as chem08:48
*** matbu has joined #tripleo08:48
openstackgerritDamien Ciabrini proposed openstack/tripleo-heat-templates master: WIP Upgrade data on disk on mariadb major upgrade  https://review.openstack.org/54666608:49
*** tesseract has quit IRC08:51
*** tesseract has joined #tripleo08:52
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: New major upgrade workflow implementation.  https://review.openstack.org/55208208:53
*** tesseract has quit IRC08:54
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: DNM: Testing P->Q from queens branch.  https://review.openstack.org/55208008:55
*** skramaja has quit IRC08:56
*** tesseract has joined #tripleo08:57
*** lucas-afk is now known as lucasagomes08:59
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: Ensure ansible-pacemaker is present on the undercloud.  https://review.openstack.org/55423608:59
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: DNM: Add debug and don't fail on validation for ffu.  https://review.openstack.org/55206308:59
openstackgerrityolanda.robla proposed openstack/tripleo-upgrade master: Add the ability to limit the hosts where to apply FFU  https://review.openstack.org/55483309:00
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: DNM: Testing FFU from queens branch.  https://review.openstack.org/55206409:01
*** jpich has joined #tripleo09:02
*** arxcruz|off is now known as arxcruz09:04
bandinimarios: can you come at me on https://review.openstack.org/#/c/554306/, bro?09:04
bandini(fairly simple cherry-pick)09:04
*** marios has quit IRC09:05
*** marios has joined #tripleo09:05
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates stable/queens: Fix newton compat mode for deployed server.  https://review.openstack.org/55298009:08
openstackgerritMerged openstack/puppet-tripleo stable/ocata: Extract local CA if it expired  https://review.openstack.org/55442309:09
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates stable/queens: WIP: fix fluentd upgrade tasks during ffu.  https://review.openstack.org/55483109:09
*** ooolpbot has joined #tripleo09:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION09:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711109:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]09:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717409:10
*** ooolpbot has quit IRC09:10
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: DNM: Testing FFU from queens branch.  https://review.openstack.org/55206409:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]09:10
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-common master: Refactor setting default CA  https://review.openstack.org/55483509:11
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-common master: Read bytes for default CA  https://review.openstack.org/55483609:11
*** ukalifon has joined #tripleo09:14
bandiniccamacho: thank you, sir :)09:16
bandinimarios: unping, carlos did the magic!09:16
openstackgerritBogdan Dobrelya proposed openstack/paunch master: Allow to limit cgroup cpu shares  https://review.openstack.org/55453909:19
ccamachohey bandini np09:23
openstackgerritMerged openstack/puppet-tripleo stable/newton: Disallow TLS v1.0 from HAProxy  https://review.openstack.org/55442209:24
openstackgerritMerged openstack/tripleo-upgrade master: Ensure ansible-pacemaker is present on the undercloud.  https://review.openstack.org/55426209:24
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Standardise Workflow messaging and optionally persist messages  https://review.openstack.org/42506009:25
*** jaganathan has quit IRC09:26
bandinijaosorior: ENOTIME on https://review.openstack.org/551286 or other reasons?09:28
openstackgerritBogdan Dobrelya proposed openstack/paunch master: Allow to limit cgroup cpu shares  https://review.openstack.org/55453909:28
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Standardise Workflow messaging and optionally persist messages  https://review.openstack.org/42506009:28
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Allow for passing boot-time vars/args to OC nodes  https://review.openstack.org/55296709:29
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Add an openshift-cns service  https://review.openstack.org/54393309:29
jaosoriorbandini: your first guess09:30
bandinijaosorior: ah ok :)09:31
jaosorior:(09:31
openstackgerritJuan Antonio Osorio Robles proposed openstack/instack-undercloud master: Enable TLS by default  https://review.openstack.org/55238209:33
*** akane_ has quit IRC09:34
openstackgerritDamien Ciabrini proposed openstack/tripleo-heat-templates master: Make HA containers log to /var/log/containers after upgrade  https://review.openstack.org/55342409:34
*** derekh has joined #tripleo09:34
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Use the standard messaging in the baremetal workbook  https://review.openstack.org/55246009:36
mariosbandini: sorry missed it09:37
bandinimarios: you're still my favourite sadopanda09:37
* marios waves arms in the air09:38
bandinilol09:38
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Use the standard messaging in the fernet-key-rotate workbook  https://review.openstack.org/55455209:38
*** suuuper has joined #tripleo09:38
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Use the standard messaging in the networks workbook  https://review.openstack.org/55459309:39
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Use the standard messaging in the plan_management workbook  https://review.openstack.org/55459509:40
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Use the standard messaging in the package_update workbook  https://review.openstack.org/55459409:40
Tenguhmm. what would happen if I drop the content of /var/lib/heat-config/deployed/ directory prior trying my upgrade thing? any idea? marios maybe? :)09:41
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Use the standard messaging in the derive_parameters workbook  https://review.openstack.org/55454809:41
openstackgerritSaravanan KR proposed openstack/tripleo-heat-templates master: Remove the lacp bond mode constraints  https://review.openstack.org/55484609:43
mariosTengu: o/ why are you wanting to do that :). well if i recall you're doing a ocata to pike? at a guess it would no longer skip all those deployments that were already applied ;) are you having problems getting some particular software deployment (from the tht) to run09:44
Tengumarios: pike BM to pike container :D09:44
Tenguthe infamous migration that nobody wants to know about ;)09:45
mariosTengu: right you're doing pike to pike (noop cos no repo switch)09:45
openstackgerritOliver Walsh proposed openstack/instack-undercloud master: Set undercloud nova notification_format to 'unversioned'  https://review.openstack.org/55484709:45
mariosTengu: ;) you're on the bleeding edge man09:45
Tengumarios: yep. I'm hitting a nasty situation with pacemaker in fact: https://bugs.launchpad.net/tripleo/+bug/175687609:45
openstackLaunchpad bug 1756876 in tripleo "Upgrade "pike bm -> pike container": pacemaker issue" [Low,Triaged]09:45
Tengumarios: pacemaker got shut down on the controllers, and after that, puppet wants a quorum for the cluster before going forward. Of course, this won't work. like, at all.09:46
Tenguso I'm trying to find a way to kind of… well… force some re-deploy and such.09:46
mariosTengu: so during the upgrade steps the cluster goes down then up again, i.e. before running puppet/config09:46
Tengumarios: ah. it apparently doesn't goes up again…09:47
mariosTengu: let me check the bug and add some pointers but i see bandini has already checked in which is good to see09:47
Tengumarios: :) thanks for your time. I've re-armed the lab so that I'm ready for more :)09:47
Tenguah, Michele is bandini - good to know :)09:48
openstackgerritDougal Matthews proposed openstack/instack-undercloud master: Use the default queue when calling create_deployment_plan  https://review.openstack.org/55463009:51
*** akane_ has joined #tripleo09:51
*** panda|off is now known as panda09:52
*** gfidente has joined #tripleo09:54
*** gfidente has quit IRC09:54
*** gfidente has joined #tripleo09:54
mariosTengu: not sure if that helps, but really we need more info to be able to help (i.e. something went wrong on one of those upgrade tasks, i'll bet so that the cluster start is failing)09:56
Tengumarios: I'll comment up as soon as I'm ready for another run :)09:57
oscarHi, trying to deploy a pike overcloud but keep failing at overcloud.AllNodesDeploySteps.ControllerDeployment_Step3.0: "Error: /Stage[main]/Nova::Cell_v2::Discover_hosts/Exec[nova-cell_v2-discover_hosts]: Failed to call refresh: nova-manage  cell_v2 discover_hosts returned 1 instead of one of [0]". Does anyone know what could be causing that?09:57
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart-extras master: WIP: Add undercloud upgrades playbook.  https://review.openstack.org/55382709:57
Tengumarios: hmm so basically I should ensure pacemaker is up'n'running after step 4 of the upgrade_task related to controllers, right? I'll check that, as well as the other things you mention.09:58
mariosTengu: ack, wrt names/nicks... i totally spent 5 minutes thinking "Cédric" is 'one of tengus colleagues that must be helping out with this'09:58
TenguXD09:59
mariossorry i tend to remember and call people by their irc nicks in real life, ask bandini09:59
*** salmankhan has joined #tripleo09:59
chemjistr: hi, featureset037 is the one you're using for testing the update right ?09:59
Tengumarios: np, I tend to do the same09:59
oscarand if I try to run nova-manage cell_v2 discover_hosts manually I get a mysql error: Unknown column 'cn.host' in 'field list'09:59
jistrchem: yup09:59
Tengumarios: and ppl knowing my IRC nick usually call me "tengu".09:59
chemjistr: It was about to get deleted from rdo-cloud I think https://review.rdoproject.org/r/#/c/12160/9/jobs/tripleo-upstream.yml (currently reviewing it)10:00
Tenguah. at last I found how to make ansible wait for a server to come back to life after a reboot \o/10:01
jistrchem: oh... thanks for bringing this up. We shouldn't remove it, at least from master. cc myoung|afk10:01
chemjistr: yeah, I'll minus -1 when I'm done10:01
jistrchem, myoung|afk : but wait Matt's patch actually doesn't remove it from master. I think it's ok to remove it from the older branches, at least for now10:02
jistrwe can re-add if necessary, no need to waste resources for now10:02
chemjistr: heu ... it's removing it from master as far as I can tell10:03
chemjistr: http://paste.openstack.org/show/707327/ ?10:04
chemjistr: will remove master job no ?10:04
*** egallen has joined #tripleo10:06
*** skramaja has joined #tripleo10:07
jistrchem: actually i have no idea how this stuff works :D In the other file, i see the job still under `tripleo-upgrades-check-branchless` list, so i assumed it doesn't get removed, but yea maybe it does get removed if we remove it from the other file? I don't know what's the difference between those files.10:07
chemjistr: on has to match the other kindof stuff, anyway myoung|afk will know10:08
openstackgerritDougal Matthews proposed openstack/tripleo-common master: [Experimental] Unit Test Mistral Workflows  https://review.openstack.org/55338910:09
jistrchem: so one could be job definitions and the other a list of triggers? that'd make sense, just wondering why all the job definitions are under project section with `name: tripleo-quickstart`, but maybe that's ok10:09
*** ooolpbot has joined #tripleo10:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711110:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717410:10
*** ooolpbot has quit IRC10:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]10:10
*** salmankhan1 has joined #tripleo10:11
*** salmankhan has quit IRC10:11
*** salmankhan1 is now known as salmankhan10:11
chemjistr: yeah reviewing 15k lines of yaml in one file is always a thing of beauty10:12
chemjistr: I miss autogen10:12
jistrchem: yea :D btw looking at that file with job lists, we should probably add a section for tripleo-upgrade repo and add the tripleo-upgrades-check-branchless. I may just post it on top of myoung|afk patch and let folks review.10:13
*** assassin has joined #tripleo10:13
chemjistr: well not sure what this "tripleo-upgrades-check-branchless" means ... tripleo-upgarde is "branchfull"10:13
chemjistr: this is just plain confusing10:14
chemIMHO10:14
jistrchem: yea for me too, hopefully not so much for CI folks, we need to sync up with them10:14
*** ktibi has joined #tripleo10:15
openstackgerritJose Luis Franco proposed openstack/tripleo-upgrade master: Include new CLI changes for overcloud update.  https://review.openstack.org/55051710:16
*** dpawar has quit IRC10:17
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Optionally run config download at the end of stack create/update  https://review.openstack.org/55422410:18
chemjistr: oki the branchless is for them to be able to run the tripleo-upgrade check under oooq projects (which are branchless)10:18
ccamachohey mwhahaha sorry to bug you, quick question.  Im testing a few puppet-nova patches and im doing it patching tht and puppet-nova on deployment but it takes too much time for testing simple stuff.. Do you know if there is an easy way of running puppet-apply configuring the parameters together with the manifest to test ?10:19
ccamachoIm trying to do it but im not able to run it correctly10:19
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates master: Add cpu shares cgroup limits for neutron ovs agent  https://review.openstack.org/55486310:20
jistrchem: ok so i guess we shouldn't run the -branchless on tripleo-upgrade then? (which is branchful :) ) so i'll not post that patch10:25
*** marios has quit IRC10:25
*** marios has joined #tripleo10:25
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates master: Add cpu shares cgroup limits for neutron ovs agent  https://review.openstack.org/55486310:25
*** akane__ has joined #tripleo10:26
*** skramaja has quit IRC10:26
Tengumarios: small question: bandini talked about the order - if I push all the docker-related things to the last positions of my "-e" arguments, it should be OK, on that part, right?10:26
*** social has joined #tripleo10:28
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates master: Limit neutron_ovs_agent CPU to 15% in CI scenarios  https://review.openstack.org/55486910:29
*** akane_ has quit IRC10:29
*** salmankhan has quit IRC10:30
*** dpawar has joined #tripleo10:33
*** salmankhan has joined #tripleo10:34
*** egallen has quit IRC10:36
*** egallen has joined #tripleo10:37
*** fzdarsky has joined #tripleo10:41
Tenguo_O wow. just get another error. Never saw that one…10:45
*** egallen has quit IRC10:45
openstackgerrityolanda.robla proposed openstack/tripleo-upgrade master: Add the ability to limit the hosts where to apply FFU  https://review.openstack.org/55483310:46
Tengu-.- ok. a VM seems to have crashed. of course.10:46
openstackgerritmathieu bultel proposed openstack/tripleo-heat-templates master: Do not create NetworkVlanID is the value is not defined  https://review.openstack.org/55487210:46
*** ffiore_ has joined #tripleo10:47
*** ffiore has quit IRC10:47
*** zoli is now known as zoli|lunch10:49
*** dtantsur|afk is now known as dtantsur10:49
*** aputtur__ has quit IRC10:52
openstackgerrityolanda.robla proposed openstack/tripleo-heat-templates master: Fix queries for already installed packages  https://review.openstack.org/55449910:52
*** kmy has quit IRC10:52
*** yamahata has quit IRC10:52
*** kmy has joined #tripleo10:53
*** paramite_ has joined #tripleo10:53
openstackgerritFlorian Fuchs proposed openstack/tripleo-validations master: Fix overcloud services connectivity validation  https://review.openstack.org/55383211:02
*** udesale_ has joined #tripleo11:04
*** numans is now known as numans_afk11:05
*** udesale has quit IRC11:07
*** sshnaidm|sick is now known as sshnaidm11:08
myoung|afkjistr, chem, re the patch to remove old upgreade jobs, this is leftover work/debt from our sprint in feb when we were putting a toe in the water for upgrade jobs (https://trello.com/c/3UFgRWtk/565-remove-old-upgrade-jobs-in-sf).  Nothing's sacred, that patch was proactive cleanup, not to be merged until we had the final upgrade jobs in place.  If it's not necessary any more please patch over it or we can abandon.  I'm not sure (11:09
myoung|afkpersonally) what current state of upgrade job(s) are...11:09
openstackgerritFlorian Fuchs proposed openstack/tripleo-validations master: Fix overcloud services connectivity validation  https://review.openstack.org/55383211:09
myoung|afkjistr, chem, please advise :)  I'll be online (for real) in another 90...11:09
* myoung|afk wanders off to make morning coffee11:09
*** ooolpbot has joined #tripleo11:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION11:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711111:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717411:10
*** ooolpbot has quit IRC11:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]11:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]11:10
*** gyankum has joined #tripleo11:10
*** dpawar has quit IRC11:10
chemmyoung|afk: jistr and I think most of it is valid, if you don't mind then, I'll upload a newest version of it that reflect the current status11:11
jpichhonza: A few people mentioned "need to update stuff so the new endpoints work in containerised undercloud as well" about that instack patch, were you working on it? I'm going to look into it today if not, if that's fine by you11:12
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Implement MasqueradeNetworks services  https://review.openstack.org/55342711:13
*** udesale_ has quit IRC11:13
myoung|afkchem: don't mind at all!  one big team...feel free :)11:15
*** egallen has joined #tripleo11:20
*** cschwede has joined #tripleo11:23
honzajpich: i believe it's done --- but i shall double check11:25
openstackgerritMarios Andreou proposed openstack/tripleo-common stable/queens: Remove the noop deploystep for upgrade converge step  https://review.openstack.org/55488011:25
jpichhonza: Oh, cool! If you have a link to the review(s) I'd love to see how that's done11:25
*** kopecmartin has joined #tripleo11:26
*** egallen has quit IRC11:27
*** adarazs is now known as adarazs_lunch11:27
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart master: Ignore empty values for dlrn hashes  https://review.openstack.org/55488211:27
*** cshastri has quit IRC11:27
honzajpich: https://review.openstack.org/#/c/515490/27..3011:27
sshnaidmjfrancoa, ^^11:28
*** bfournie has quit IRC11:28
jpichhonza: Thanks!11:28
honzajpich: ps 27 is where i started it, and then other people fixed a few things11:28
jfrancoasshnaidm: wow! that was fast :-D11:28
*** bfournie has joined #tripleo11:28
jpichhonza: team work11:28
sshnaidmjfrancoa, beside of that you need to pass empty vars in you playbook to dlrn_hash_path and dlrn_hash_path_newest to zeroize them11:29
*** numans_afk is now known as numans11:29
jfrancoasshnaidm: ok, thanks a lot. I'll give it a try!11:29
*** skramaja has joined #tripleo11:30
*** bfournie has quit IRC11:33
*** dmacpher has joined #tripleo11:34
*** abishop has joined #tripleo11:35
chemmyoung|afk: jistr ack, will do then11:36
*** pdeore_ has joined #tripleo11:38
*** pdeore has quit IRC11:38
*** jpena is now known as jpena|off11:39
*** jpena|off is now known as jpena11:40
openstackgerritHarald Jensås proposed openstack/python-tripleoclient master: Contanerized Undercloud - Routed Spine-Leaf  https://review.openstack.org/54345511:41
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Use ironic::inspector::dnsmasq_ip_subnets  https://review.openstack.org/54358211:41
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Use IronicInspectorSubnets in undercloud.yaml  https://review.openstack.org/54732511:41
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Add static routes for routed ctlplane  https://review.openstack.org/54510911:41
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Add ctlplane networking for routed networks  https://review.openstack.org/54732611:41
honzajpich: /me dead => "Honza & co" :)11:41
jpich:-)11:42
openstackgerritJohn Fulton proposed openstack/tripleo-heat-templates master: DO NOT MEREGE: testing scenario001 without CephClient  https://review.openstack.org/55488411:43
bogdandoit seems folks that cgroups do not applied for https://review.openstack.org/#/c/554869/1 :o11:45
bogdandoI have a local libvirt env11:45
bogdandoif someone wants to look into11:45
bogdandothen, we could submit z bz or something11:45
bogdandoowalsh: ^^ perchance?..11:46
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart master: Remove unnecessary parameters from featureset047.  https://review.openstack.org/55385011:47
bogdandojaosorior: ^^ jfyi :) security squad might be interested as well11:47
jaosoriorbogdando: reading the docs https://docs.docker.com/config/containers/resource_constraints/#cpu  it says "This is only enforced when CPU cycles are constrained. When plenty of CPU cycles are available, all containers use as much CPU as they need. In that way, this is a soft limit. --cpu-shares does not prevent containers from being scheduled in swarm mode. It prioritizes container CPU resources for the11:48
jaosorioravailable CPU cycles. It does not guarantee or reserve any specific CPU access."11:48
bogdandoum11:48
bogdandoso we need to bundle this param with something more restrictive11:48
jaosoriorbogdando: we just need to figure out what11:48
bogdandojaosorior: well done, thanks! let's figure out something then11:48
* jaosorior reading docs11:48
bogdandoyeah11:48
owalshbogdando, jaosorior: is that a problem? Why don't we want to use the cycles if they are available?11:49
bogdandoowalsh: well, not sure I have really something available on my env :D11:50
bogdandoLA >3.5 for 2 vcpu11:50
owalshhmm, doesn't necessarily mean CPU is flat out, I/O bound maybe?11:51
bogdandointeresting...11:51
mwhahahaccamacho: no there really isn't especially with containers. You could go try and rerun the docker-puppet.py bits manually on an already deployed system after you patch the modules.11:51
openstackgerritMerged openstack/tripleo-heat-templates master: Fix newton compat mode for deployed server.  https://review.openstack.org/55292311:52
openstackgerritMerged openstack/instack-undercloud stable/queens: Mariadb online upgrade after yum update  https://review.openstack.org/55430611:52
openstackgerritMerged openstack/tripleo-upgrade stable/queens: New major upgrade workflow implementation.  https://review.openstack.org/55208211:52
bogdandoowalsh: https://pastebin.com/D11CedHu my env info11:53
bogdandowa 0, so that's not IO11:53
owalshbogdando: only using half the available CPU time11:54
bogdandothough, you're probably right, and 1.8 is ~2 which shows like my 2 vCPUs are not super busy11:54
owalsh100% for a process == 1 full CPU11:55
bogdandoI wonder if we should keep that as is, or add cpu counts for container as well11:55
bogdandoI feel like neutron is mining bitcoins on my env :<11:56
owalshwould guess that neutron-openvswitch is in a polling loop so 100% CPU is to be expected11:56
* owalsh is just guessing though, networking guys might have some input11:57
openstackgerritJuan Antonio Osorio Robles proposed openstack/paunch master: Allow configuring security options  https://review.openstack.org/55454211:57
*** dsariel has quit IRC11:58
*** jlabarre has quit IRC11:58
hjensasbogdando: We should get the neutron fix for the ovs issue soon.12:00
bogdandohjensas: ack, though we can still have some handy experience from that case :)12:01
bogdandofor future12:01
hjensasbogdando: sure. :)12:01
*** bfournie has joined #tripleo12:01
*** adarazs_lunch is now known as adarazs12:01
*** ansmith has joined #tripleo12:03
*** aputtur__ has joined #tripleo12:03
*** atoth has joined #tripleo12:04
*** rfolco has joined #tripleo12:05
*** dprince has joined #tripleo12:06
*** pchavva has joined #tripleo12:07
*** ooolpbot has joined #tripleo12:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711112:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717412:10
*** ooolpbot has quit IRC12:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]12:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]12:10
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade master: We need to be root to install package.  https://review.openstack.org/55488712:10
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade master: FFU: We need to be root to install ansible-pacemaker package.  https://review.openstack.org/55488712:10
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: Ensure ansible-pacemaker is present on the undercloud.  https://review.openstack.org/55423612:11
*** zoli|lunch is now known as zoli12:11
*** akane__ has quit IRC12:11
*** raildo has joined #tripleo12:12
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: Ensure ansible-pacemaker is present on the undercloud.  https://review.openstack.org/55423612:12
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: DNM: Add debug and don't fail on validation for ffu.  https://review.openstack.org/55206312:12
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: DNM: Testing FFU from queens branch.  https://review.openstack.org/55206412:12
*** pkovar has joined #tripleo12:13
*** akane has joined #tripleo12:13
openstackgerritMerged openstack/puppet-tripleo master: Fixes incorrect ownership of ODL TLS cert/key  https://review.openstack.org/55453712:13
*** dsariel has joined #tripleo12:13
*** raildo has quit IRC12:14
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: Ensure ansible-pacemaker is present on the undercloud.  https://review.openstack.org/55423612:14
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: DNM: Add debug and don't fail on validation for ffu.  https://review.openstack.org/55206312:14
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-upgrade stable/queens: DNM: Testing FFU from queens branch.  https://review.openstack.org/55206412:14
openstackgerritFlorian Fuchs proposed openstack/tripleo-validations master: Fix MySQL Open Files Limit validation  https://review.openstack.org/55488812:15
*** lucasagomes is now known as lucas-hungry12:16
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates stable/queens: Fix newton compat mode for deployed server.  https://review.openstack.org/55298012:17
sshnaidmjfrancoa, is it related to what you are doing now? https://bugs.launchpad.net/tripleo/+bug/173579212:18
openstackLaunchpad bug 1735792 in tripleo "tripleo-ci-centos-7-undercloud-upgrades is running tripleo.sh vs oooq" [High,Triaged] - Assigned to wes hayutin (weshayutin)12:18
jfrancoasshnaidm: exactly, all these patches are to move away that job from running tripleo.sh into tripleo-quickstart + tripleo-upgrade12:19
sshnaidmjfrancoa, ok, will update it then..12:19
jfrancoasshnaidm: I didn't know there was a bug, so I'll add the LP into all the related commits12:19
jfrancoaand assign it to me, please12:20
sshnaidmjfrancoa, cool, thanks12:20
jfrancoasshnaidm: np12:20
*** jpena is now known as jpena|lunch12:20
*** dsariel has quit IRC12:21
openstackgerritJose Luis Franco proposed openstack-infra/tripleo-ci master: Use tripleo-upgrade role in undercloud upgrades job.  https://review.openstack.org/54897412:23
openstackgerritCarlos Camacho proposed openstack/tripleo-quickstart-extras master: Collect installed cron jobs  https://review.openstack.org/55488912:24
*** raildo has joined #tripleo12:24
openstackgerritDougal Matthews proposed openstack/tripleo-common master: Verify the Swift container exists with a small utility workflow  https://review.openstack.org/52821312:27
*** pradk has quit IRC12:29
*** dprince has quit IRC12:29
*** artom has joined #tripleo12:29
*** pkovar has quit IRC12:29
*** panda is now known as panda|lunch12:30
*** pdeore_ has quit IRC12:30
*** masco has quit IRC12:30
*** artom has quit IRC12:31
*** psahoo has quit IRC12:32
Tengumarios: I think I have some news: apparently something goes wrong on 2 of the controllers, and pacemaker show them as "offline". After a quick check, the *network* seems the culprit: both nodes have an issue, apparently they are unable to ping their default gateway (at least)12:32
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart master: Add validation-errors-nonfatal and debug into updates job.  https://review.openstack.org/55489112:32
Tenguand they are apparently unable to ping the public IPs associated to the nodes… duh.12:33
openstackgerritJose Luis Franco proposed openstack-infra/tripleo-ci master: Add multinode-overcloud-update playbook to run list  https://review.openstack.org/54705812:33
*** trown|outtypewww is now known as trown|ruck12:34
*** artom has joined #tripleo12:34
dciabrin_morning o/ is anybody hitting https://bugs.launchpad.net/tripleo/+bug/1755485 atm ? it seems job tripleo-ci-centos-7-3nodes-multinode is broken in the gate12:35
openstackLaunchpad bug 1755485 in tripleo "Barbican tempest test failing to ssh to cirros image" [Critical,Triaged]12:35
*** rlandy has joined #tripleo12:35
*** lblanchard has joined #tripleo12:38
*** lblanchard has quit IRC12:39
trown|ruckdciabrin_: the gate queue does not look out of sorts... It is possible that is just a low failure rate intermittent issue12:39
dciabrin_trown|ruck, ack thx I'll see if it reproduces12:40
*** masco has joined #tripleo12:42
*** lblanchard has joined #tripleo12:42
openstackgerritHarald Jensås proposed openstack/instack-undercloud master: Fix help string for subnets option  https://review.openstack.org/55489512:42
*** pgadiya has quit IRC12:44
openstackgerritHarald Jensås proposed openstack/instack-undercloud master: Fix help string for subnets option  https://review.openstack.org/55489512:44
*** panda|lunch is now known as panda12:45
Tengumarios: apparently… I can go further now.12:47
Tengu… failed. so. what's the next error now :D12:48
*** aputtur has joined #tripleo12:49
*** florianf_ has joined #tripleo12:51
Tenguduh… stack broken :(12:52
Tenguthis might explain. ERROR: The specified reference "ControllerDeployment_Step1" (in WorkflowTasks_Step2) is incorrect.12:52
*** pdeore has joined #tripleo12:52
*** eck`gone is now known as eck`12:53
openstackgerrityolanda.robla proposed openstack/tripleo-heat-templates master: Fix queries for already installed packages  https://review.openstack.org/55449912:53
*** florianf has quit IRC12:53
*** pdeore has quit IRC12:53
Tengudarn! ceph!! X(12:54
*** tcw has joined #tripleo12:56
Tengugfidente: are you here? I have a "small" question regarding ceph.12:57
gfidenteTengu HEY12:57
Tengugfidente: \o/ great :)12:57
gfidenteTengu I saw the error, I guess you're trying to update a stack in UPDATE_FAILED state right?12:58
Tengugfidente: nope12:58
Tengugfidente: we had a small/quick talk at the PTG/Dublin - I'm trying to move from a baremetal pike to container pike - and in  that move, from puppet-ceph to ansible-ceph/containers12:58
gfidenteyep I remember that12:58
Tengugfidente: apparently, there are some checks/references that make mistral fail badly, even in advanced steps.12:59
openstackgerritHarald Jensås proposed openstack/instack-undercloud master: Fix next_hop for metadata service host route on local_subnet  https://review.openstack.org/55490812:59
Tengugfidente: even if I have a "watch rm -f cephstorage_extraconfig.json" on the ceph-storage nodes, apparently it wasn't enough and mistral got a hint there was a puppet stuff: Workflow 'tripleo.storage.v1.ceph-install' [RUNNING -> ERROR, msg=Ceph deployment stopped, puppet-ceph hieradata found. Convert it into ceph-ansible variables. [u'ceph::profile::params::osds']]12:59
jaosoriorlhinds: yo13:00
lhindsjaosorior: o/13:00
jaosoriorlhinds: Lets wait a bit for more folks to show up13:00
gfidenteTengu ah yeah we do that on purpose13:00
lhindssure!13:00
jaosoriord0ugal: around?13:00
*** mcornea has joined #tripleo13:00
Tengugfidente: do you have any idea if what I want to do is possible, and if so how? or should I just drop ceph-storage and re-deploy them with containers from scratch?13:00
gfidenteTengu you have to convert the old disks mapping from ceph::profile::params::osds to the cepha-ansible for13:00
gfidente*format13:00
owalshlhinds, jaosorior: o/13:00
Tengugfidente: well, I think I did it.13:01
gfidenteand *remove* ceph::profile::params::osds from the env files13:01
*** egallen has joined #tripleo13:01
gfidentewe wanted to make sure people did the conversion13:01
gfidentebecuse if they dont, ceph-ansibe might just lose all the data13:01
openstackgerritBrent Eagles proposed openstack/puppet-tripleo master: Adding wrapper scripts for neutron agent subprocesses  https://review.openstack.org/55022413:01
gfidenteso you have to convert it and remove the old one13:01
Tengugfidente: no env file specify that, and I drop the hiera file in addition.13:01
openstackgerritBrent Eagles proposed openstack/tripleo-heat-templates master: Generate and mount wrappers for neutron agent processes  https://review.openstack.org/55082313:01
raildoo/13:02
d0ugaljaosorior: yup!13:02
jaosorior#startmeeting TripleO Security Squad13:02
openstackMeeting started Wed Mar 21 13:02:20 2018 UTC and is due to finish in 60 minutes.  The chair is jaosorior. Information about MeetBot at http://wiki.debian.org/MeetBot.13:02
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.13:02
*** openstack changes topic to " (Meeting topic: TripleO Security Squad)"13:02
openstackThe meeting name has been set to 'tripleo_security_squad'13:02
gfidenteTengu can I see your cmdline?13:02
*** adarazs is now known as adarazs_afk13:02
jaosoriorHey! d0ugal, lhinds, owalsh13:02
Tengugfidente: 2s13:02
Tengugfidente: basically I did that: http://paste.openstack.org/show/707512/13:03
jaosoriorSo, today should be a shorter meeting than last time :D13:03
d0ugal:)13:03
jaosoriorshould I wait a bit more for other folks? or should we start already?13:03
lhindsI think we can kick off with d0ugal here now13:03
jaosoriorAlright!13:03
lhindsmistral is the first topic13:03
Tengujaosorior: oh, meeting? here?13:03
jaosorior#topic Mistral Secret Storage13:03
*** openstack changes topic to "Mistral Secret Storage (Meeting topic: TripleO Security Squad)"13:03
d0ugalapetrich, thrash, rbrady, toure ^ we are going to chat about mistral and secrets if you want to join.13:03
Tengugfidente: do you take part in the meeting?13:03
thrashd0ugal: ack13:04
jaosoriorTengu: yes. It's the weekly Security Squad meeting13:04
apetrichoh dear13:04
gfidenteTengu security squad?13:04
Tengujaosorior: oh. I'll go DM with gfidente then :)13:04
rbradyd0ugal: ack13:04
*** cdearborn has joined #tripleo13:04
jaosoriorSo, we've been talking a while about needing secret storage for mistral13:04
jaosoriorThis is due to the fact that we store a  LOT of sensitive information there13:04
jaosoriorthe overcloud private keys and passwords namely13:05
openstackgerritTim Rozet proposed openstack/puppet-tripleo stable/queens: Fixes incorrect ownership of ODL TLS cert/key  https://review.openstack.org/55490913:05
*** dprince has joined #tripleo13:05
jaosoriorBeing TripleO an active user of mistral, I would like it to "beta" or take into use any solution that we have in mind13:05
jaosoriorAlso, having talked to thrash in the PTG, I also volunteer to help out on the coding side of mistral if more hands are needed.13:06
jaosoriorBut I would like to talk and understand what are the main challenges on this side13:06
d0ugalso, first I think we need to clarify exactly what is stored and why.13:07
jaosoriorsure13:07
d0ugalMistral has a database that is mostly in-flight only. We store all the heat parameters etc. while the workflow is being executed13:07
d0ugalThey are then stored for 48 hours afterwards13:07
d0ugalMistral does log lots of information, and parameters may be logged at times - but I think this has been reduced (or possibly stopped)13:08
thrashI think the more sensitive stuff is stored in a mistral environment, is it not?13:08
d0ugalthrash: no, it is stored in Swift now13:08
thrashd0ugal: ack13:08
apetrichd0ugal, parameters are logged in debug only now13:08
apetrichas with most sensitive info AFAIK13:09
d0ugalThe only information stored in mistral long term is two different "environments" - blobs of json basically13:09
d0ugalThese are the ssh keys for overcloud nodes, iirc13:09
*** dpawar has joined #tripleo13:09
d0ugaland ..13:09
*** pkovar has joined #tripleo13:09
*** egallen has quit IRC13:09
jaosoriord0ugal: which environments?13:10
d0ugalundercloud_ceilometer_snmpd_password and undercloud_db_password13:10
d0ugaltripleo.undercloud-config and "ssh_keys"13:10
*** ooolpbot has joined #tripleo13:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION13:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711113:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717413:10
*** ooolpbot has quit IRC13:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]13:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]13:10
d0ugalThey can be viewed with...13:10
jaosoriord0ugal: why do we specifically store those passwords in mistral and not swift?13:10
d0ugal$ mistral environment-get tripleo.undercloud-config13:10
d0ugal$ mistral environment-get ssh_keys13:10
openstackgerritMarius Cornea proposed openstack/tripleo-upgrade master: Include connectivity check prepare scripts during FFU  https://review.openstack.org/55491413:11
d0ugaljaosorior: good question. Mostly for legacy reasoning I think. They could be moved to swift13:11
owalshssh_keys is the heat-admin key?13:11
d0ugalowalsh: I believe so, but I am not sure.13:11
jaosoriord0ugal: would be great if we would keep all the passwords in one place. So we can secure that one place at some point.13:11
Tengu(use gopass + gpg :D)13:11
d0ugaljaosorior: The tripleo.undercloud-config environment is related to the undercloud itself, rather than a plan - I think that is why it is in mistral.13:11
d0ugaljaosorior: +113:12
*** udesale has joined #tripleo13:12
openstackgerritCarlos Camacho proposed openstack/tripleo-quickstart-extras master: Collect installed cron jobs  https://review.openstack.org/55488913:12
d0ugalI think the ssh_keys environment was added out of simplicity, we didn't have a better plan at the time.13:12
jaosoriorthrash, apetrich does anybody know what ssh_keys actually is? is it the keys for heat-admin?13:12
trozetcan another core help out with reviewing https://review.openstack.org/#/c/553788/1 please?13:13
*** dpawar has quit IRC13:13
dtantsurfolks.. I know it may sound provocative, but is it possible to add configuration steps to a service template that are NOT written in puppet13:13
dtantsur?13:13
d0ugaljaosorior: I can find out.13:13
thrashjaosorior: I think so. Would need to double check.13:13
dtantsurI don't really want to spend half of cycle doing a trivial thing like 'call a command, get its result'13:13
d0ugalor shadower and mandre would know if they are around13:13
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Add ctlplane networking for routed networks  https://review.openstack.org/54732613:13
jaosorioreither way, there's a private key there, which would be considered sensitive info. So we need to secure it somehow13:14
d0ugaljaosorior: +113:14
hjensasderekh: ^^ Can you have a look at the python script there? Make sure I don't mess up the ipv6 stuff again?13:14
dtantsurEmilienM: hey, maybe you know (re my question above)13:14
jaosoriord0ugal, thrash: One option would be to move all that to swift. And rely on swift encryption (which we don't have right now, but we could enable)13:14
apetrichjaosorior, during ping test (and I think tempest as well but not 100% sure) the keys to the created servers are stored in an env in mistral13:14
thrashjaosorior: +100013:15
d0ugaljaosorior: I didn't know swift had that option, sounds like a good (and easy?) starting point.13:15
jaosoriorthrash, d0ugal, apetrich: Would you guys be able to dedicate some time to move those to swift?13:15
thrashjaosorior: somebody can, yes. :)13:15
jaosoriord0ugal, to be able to do that, we probably need barbican in the undercloud, but that's something alee and me can work on.13:16
d0ugaljaosorior: we are going to do some planning soon, so we could open a bug for this and consider it then13:16
jaosoriord0ugal, apetrich, thrash: So, having moved those environments to be stored in swift. Would that be the last bits of sensitive info stored in mistral?13:16
owalshif it's only used the the pingtest/tempest key do we care?13:16
thrashjaosorior: I think from a tripleo perspective, that's a good bet.13:17
apetrichowalsh, not only those keys unfortunately13:17
owalshapetrich: ack13:17
jaosoriorowalsh: it sure depends on the user that pingtest/tempest uses. If it's heat-admin it's problematic, since it's able to do sudo su.13:17
d0ugaljaosorior: do you could storing for 48 hours as storing? :)13:18
*** myoung|afk is now known as myoung13:18
owalshjaosorior: runs as stack AFAIK13:18
d0ugaljaosorior: we also probably need to do some checking of the logs and/or protection there against future leaks13:18
jaosoriord0ugal: I need to double check on that one. lhinds what do you think?13:18
jaosoriord0ugal: definitely13:18
lhindsjaosorior: just reading..13:18
lhindsI guess time could be configurable for now (if that's what you were refering to)13:19
*** masco has quit IRC13:19
lhindsor log integrity?13:20
jaosoriorLog integrity is something we should cover, so we should report any issues as mistral bugs and get those fixed.13:20
jaosoriorlhinds: but currently mistral stores the heat environments (which might contain sensitive info) for a limited time (48 hours)13:20
jaosoriorlhinds: is this something we can live with, or should we also avoid this?13:21
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Add ctlplane networking for routed networks  https://review.openstack.org/54732613:21
d0ugalFWIW, fixing this in Mistral will likely be very hard.13:21
*** jmelvin has joined #tripleo13:21
aleeo/13:21
lhindsso it would be difficult to encrypt the heat envs?13:21
lhinds(stored in mistal)13:22
d0ugallhinds: I think so, mistral internally duplicates them in a few places to optimize db lookup13:22
lhindsd0ugal: ack13:22
jaosoriord0ugal: I thought the generated heat environments were all stored in swift.13:23
lhindsso i think as far as time periods, any time window is a potential exploit window (although shorted better of course)13:23
d0ugaljaosorior: they are - but while the workflow is running and for 48 hours after they are also in Mistral13:23
jaosoriord0ugal: is it possible to disable that?13:23
d0ugaljaosorior: yes, they could be deleted when the workflow finishes, but it is extremely useful for debugging etc.13:24
d0ugalWe actually increased the time, the default is 1 hour irrc13:24
d0ugaliirc*13:24
*** ratailor has quit IRC13:24
jaosoriord0ugal: how is it useful for debugging?13:24
d0ugaljaosorior: when the execution is stored you can inspect it and find out exactly what happened, what inputs and outputs happened at every point in the workflow13:25
d0ugaljaosorior: you can even restart workflows in the middle etc.13:25
lhindshas there been any BP / LP for encrypting heat envs stored in mistral (so it's on the radar so to speak). I could take a look at the code, can't promise anything as new to mistral13:25
*** jpena|lunch is now known as jpena13:25
d0ugalit is a bit like having the interactive debugger you have in most programming languages (but via a rest api :))13:25
jaosoriord0ugal: What about making that attribute configurable? In the hardening docs we could then tell folks to lower that time, or disable it entirely.13:25
lhindsbut with a key in barbican, it should be doable.13:26
d0ugaljaosorior: it is configured by instack-undercloud, can users change those puppet settings?13:26
jaosoriorshould be possible13:26
jaosoriordepending on how it's configured13:26
*** amoralej is now known as amoralej|lunch13:27
jaosoriorNeed to double-check if the instack-undercloud hieradata takes precedence or the hieradata overrides do. but it should be doable.13:27
d0ugallhinds: there was a blueprint for mistral for securing secrets. I think both rbrady and thrash had a look at doing it. So they know more about that than me.13:27
jaosorior#action For now, we will document how to lower the time mistral stores heat environments and add it to the hardening guide.13:28
d0ugaljaosorior: FYI, here is the setting: https://github.com/openstack/instack-undercloud/blob/master/elements/puppet-stack-config/puppet-stack-config.yaml.template#L67113:28
jaosorior#link https://github.com/openstack/instack-undercloud/blob/master/elements/puppet-stack-config/puppet-stack-config.yaml.template#L67113:28
lhindsd0ugal / rbrady / thrash if you manage to dig it out (the BP) please paste if for me.13:28
d0ugallhinds: looking for it.13:28
lhindsthanks d0ugal13:29
d0ugallhinds: https://blueprints.launchpad.net/mistral/+spec/mistral-secure-sensitive-data13:29
lhindsso configurable as first port of call, and then ideal future functionaility to encrypt13:29
d0ugalSee the spec linked at the top and there was a patch, but I think that got stuck.13:29
*** lucas-hungry is now known as lucasagomes13:29
lhindsso there is a fair whack of code there, any reason for the abandon by Brad?13:30
d0ugaljaosorior: should I open a bug for the mistral environments?13:31
jaosoriord0ugal: that would be great13:31
d0ugalk, on it13:31
*** chlong has quit IRC13:31
aleed0ugal, I'm having trouble finding the actual spec ..13:32
d0ugalalee: https://specs.openstack.org/openstack/mistral-specs/specs/pike/approved/secure-sensitive-data.html13:32
lhindsalee: spec has gone missing, but some code here:13:32
lhindshttps://review.openstack.org/#/c/459747/13:32
jaosorior#link https://specs.openstack.org/openstack/mistral-specs/specs/pike/approved/secure-sensitive-data.html13:32
aleeah cool thanks13:32
d0ugalI think the spec was moved because it missed the openstack release13:32
*** pkovar has quit IRC13:32
d0ugalWhich is a bad idea it seems :)13:32
lhindsk, found the spec:13:33
jaosoriorAlright, but at least for the short term we have a plan13:33
lhinds#link https://github.com/openstack/mistral-specs/blob/master/specs/pike/approved/secure-sensitive-data.rst13:33
jaosorior* Move all sensitive data to swift (to have it all in one place)13:34
lhindsok, brad is thrash, got it now13:34
jaosorior* Document how to reduce time mistral stores heat environments)13:34
thrashlhinds: :D13:34
d0ugal#link https://bugs.launchpad.net/tripleo/+bug/175743013:34
openstackLaunchpad bug 1757430 in tripleo "The ssh_keys and tripleo.undercloud-config Mistral environments should be move to swift" [High,Confirmed]13:34
*** pkovar has joined #tripleo13:34
jaosoriorand then we can focus on securing swift instead, which already can encrypt with barbican.13:34
*** jlabarre has joined #tripleo13:34
jaosoriord0ugal: awesome13:35
jaosoriorthanks13:35
*** pkovar has quit IRC13:36
d0ugalnp13:36
jaosoriorAnything else someone wants to bring up about this topic?13:36
lhindsnothing from me this week13:37
jaosoriorok13:37
openstackgerritHarald Jensås proposed openstack/python-tripleoclient master: Fix Genconfig - no HOME in environment  https://review.openstack.org/55467813:37
*** adarazs_afk is now known as adarazs13:37
jaosoriorThanks d0ugal, thrash and apetrich for joining13:37
jaosorior#topic Work progress udpate13:37
*** openstack changes topic to "Work progress udpate (Meeting topic: TripleO Security Squad)"13:37
d0ugaljaosorior: np, thanks for the input!13:38
jaosoriorJust a heads up for folks in the squad, there are a bunch of reviews for different items in the etherpad https://etherpad.openstack.org/p/tripleo-security-squad (Maybe we need to come up with an easier way to track those)13:38
jaosoriorso reviews are appreciated13:38
jaosoriorRight now, most of the work that I've been doing has been on enabling TLS by default (which hopefully almost merges for the undercloud https://review.openstack.org/#/c/552382/ )13:39
*** pkovar has joined #tripleo13:39
jaosoriorI'm also working on enabling it by default in the overcloud, so if someone is intersted in joining that work or testing, let me know.13:39
jaosoriorthat's all on my side.13:40
aleejaosorior, I'll probably ping you about joining that work later today or tomorrow13:40
jaosorioralee: awesome13:40
jaosorior#topic Any other business13:41
*** openstack changes topic to "Any other business (Meeting topic: TripleO Security Squad)"13:41
jaosoriorAnything else someone wants to bring up to the squad?13:41
aleejaosorior, I think we wanted to do a quick meeting to identify secrets to be secured/ passwords etc.13:41
aleejaosorior, did we want to schedule that?13:41
jaosorioralee: that would be good.13:41
jaosorioralee: Any day/time preference?13:42
aleejaosorior, how about tommorow?13:42
jaosoriorworks for me13:42
openstackgerritMartin André proposed openstack/tripleo-common master: Pass connection info via ansible config file  https://review.openstack.org/55452613:42
aleemorning my time -- say 10 am EST?13:42
*** dtrainor has quit IRC13:42
jaosorioralee: that works for me. 2pm UTC13:43
jaosoriorlhinds: does that work for you?13:43
lhindsjaosorior: thats fine for me13:43
lhindsI have a work shop thing, but might be able to leave a little early13:44
lhinds(it's remote)13:44
jaosoriorlhinds, alee: I'll poke you tomorrow then before the time.13:45
jaosoriorAnybody else is welcome to join13:45
jaosoriorAnything else someone would like to bring up?13:46
jaosoriorAlright13:47
jaosoriorthanks everyone for joining!13:47
jaosorior#endmeeting13:47
*** openstack changes topic to "Welcome to Rocky. CI status - Promotions: Yellow; check/gate: Green; RDO CI jobs: Green | http://tripleo.org/ | https://docs.openstack.org/tripleo-docs/latest/"13:47
openstackMeeting ended Wed Mar 21 13:47:17 2018 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)13:47
openstackMinutes:        http://eavesdrop.openstack.org/meetings/tripleo_security_squad/2018/tripleo_security_squad.2018-03-21-13.02.html13:47
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/tripleo_security_squad/2018/tripleo_security_squad.2018-03-21-13.02.txt13:47
openstackLog:            http://eavesdrop.openstack.org/meetings/tripleo_security_squad/2018/tripleo_security_squad.2018-03-21-13.02.log.html13:47
*** psachin has quit IRC13:48
*** dtrainor has joined #tripleo13:48
*** jfrancoa has quit IRC13:50
*** ihrachys has joined #tripleo13:51
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates master: Implement MasqueradeNetworks services  https://review.openstack.org/55342713:53
openstackgerritMerged openstack/tripleo-ui stable/queens: Imported Translations from Zanata  https://review.openstack.org/55480613:53
openstackgerritMerged openstack/tripleo-ui master: Imported Translations from Zanata  https://review.openstack.org/55480813:54
*** dtrainor has quit IRC13:58
*** ktibi has quit IRC13:59
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-common master: WIP: TLS by default for the overcloud  https://review.openstack.org/55492614:00
*** cdearborn_ has joined #tripleo14:01
EmilienMdtantsur: hello, have you found an answer to your question?14:02
EmilienMdtantsur: otherwise I can help14:02
dtantsurEmilienM: hi, no, I haven't. I'm wondering if puppet is still our only option to do things install-time14:04
EmilienMdtantsur: can you tell me exactly what you want to do?14:04
dtantsurEmilienM: I need to run an 'openstack' command, parse its output and based on it run another command14:05
dtantsurwhich is like 5-10 lines everywhere expect for puppet, where it requires you to have a PhD14:05
dtantsur:)14:05
EmilienMlol14:05
mwhahahathat's not a puppet problem14:05
dtantsurso, I ended up with https://review.openstack.org/554885 but it makes my eyes bleed14:05
*** hjensas has quit IRC14:06
dtantsurmwhahaha: well, it's a problem only in puppet, so yes, it IS a puppet problem14:06
mwhahahaor you could integrate it in a command14:06
mwhahahaso that no one has to do 5-10 lines14:06
mwhahahaso no, it's not a puppet problem14:06
dtantsuryes, it's mine problem, because I have to use puppet >_<14:06
mwhahahawhy not do it in python and expose it ina  single command14:06
dtantsurs/mine/my/14:07
mwhahaharight so this is a common issue with openstack in that we provide a bunch of things that require an operator to wire info together14:07
mwhahahaand know what they need to do14:07
dtantsurwe cannot really patch openstackclient for any pattern that can some up14:07
mwhahahathis is a recurring pattern which is awful14:08
*** csmart has quit IRC14:08
* mwhahaha points to octavia14:08
dtantsuras a side note: I have no clue why temporary URLs even need configuring.. a question for swift folks, I guess14:08
dtantsuranyway, I have to do something. I've been stuck with this for months..14:09
*** csmart has joined #tripleo14:10
*** salmankhan has quit IRC14:10
*** ooolpbot has joined #tripleo14:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION14:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711114:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717414:10
*** ooolpbot has quit IRC14:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]14:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]14:10
*** dsariel has joined #tripleo14:11
mwhahahadtantsur: so what exactly is the issue with what you have in puppet other than providers are painful?14:11
openstackgerritSergii Golovatiuk proposed openstack/tripleo-quickstart master: Fix image_cache_expire_days  https://review.openstack.org/55462714:12
dtantsurmwhahaha: my problem is that some trivial things are very non-trivial (and yes.. providers are painful)14:12
dtantsuranyway, I'm open to any practical ideas to solve my problem14:12
mwhahahadtantsur: the point of puppet is to allow us to do it idempotently which is generally not considered in any other methods14:12
mwhahahaso it's not that it's a puppet problem, it's a deficiency in other toolings14:12
mwhahahaie shell has no idempotent concept14:13
dtantsurwell, as idempotently as you implement it, which is not any different from other toolings14:13
mwhahahaansible less so14:13
* mwhahaha shrugs14:13
dtantsurwell, providers are only idempotent if you make them idempotent. just like you bash scripts, ansible playbooks, etc14:13
dtantsuranyway14:13
mwhahahawell we're working on it14:13
mwhahahabut we keep having to fix things for other people cause no one else is helping14:13
dtantsuras I said, I'm open to whatever you suggest on doing it, including reviewing my patch and telling me how wrong I am ;)14:14
*** skramaja has quit IRC14:14
mwhahahadtantsur: would be useful to understand at a higher level what you're actually trying to do14:14
mwhahahadtantsur: the blueprint indicates swift/glance but i'm not sure why this temp url stuff can't be implemented elsewhere14:14
dtantsurmwhahaha: automate bullet points 2 and 3 of http://tripleo.org/install/advanced_deployment/ansible_deploy_interface.html#enabling-temporary-urls14:15
*** salmankhan has joined #tripleo14:15
*** itlinux has quit IRC14:16
*** itlinux has joined #tripleo14:17
*** cdearborn has quit IRC14:17
*** psahoo has joined #tripleo14:17
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: ironic/undercloud: align configuration with instack-undercloud  https://review.openstack.org/55063814:17
*** gkadam_ has joined #tripleo14:18
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart-extras master: undercloud: workaround for masquerading network in CI/OVB  https://review.openstack.org/55362014:18
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart-extras master: roles: rename overcloud-prep-containers to prep-containers  https://review.openstack.org/54301414:18
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart-extras master: prep-containers: include containerized undercloud bits  https://review.openstack.org/54302414:18
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart-extras master: undercloud: add missing TLS environments when preparing containers  https://review.openstack.org/54544414:18
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart-extras master: Remove adjust-interface-mtus script  https://review.openstack.org/54621614:18
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart-extras master: DO NOT REVIEW - Workarounds for containerized undercloud  https://review.openstack.org/54562814:18
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: undercloud: remove IronicInspectorCollectors in environment  https://review.openstack.org/55430214:18
mwhahahadtantsur: i think the correct provider would be on the swift account14:18
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Undercloud: inspection_runbench, inspection_extras  https://review.openstack.org/54662614:18
*** myoung is now known as myoung|rover14:19
dtantsurmwhahaha: I *think* I'm doing something like that, but puppet providers make me cry14:19
mwhahahadtantsur: so i think you're problem is that no one really touches puppet-swift as we have these types of concepts for properties in other providers already (like glance)14:19
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Move API cors config to their services  https://review.openstack.org/55438614:19
mwhahahadtantsur: kinda yea but you're basically running into the fact no one has bothered to keep up for the last 4 years14:19
*** salmankhan has quit IRC14:20
dtantsurle sigh14:20
*** salmankhan has joined #tripleo14:20
*** myoung|rover is now known as myoung|rover|mtg14:20
*** gkadam has quit IRC14:20
dtantsurI'm cargo-culting stuff from nova, so maybe it'll be fine14:21
mwhahahait won't be14:21
dtantsur\o/14:21
*** jaganathan has joined #tripleo14:21
mwhahahanova is probably the worst example :D14:21
dtantsur\o/ \o/14:21
* dtantsur jumps out of the window14:21
mwhahahadtantsur: you could ask the storage dfg to make it configurable :D14:21
*** itlinux has quit IRC14:22
EmilienMbogdando: thx for updating the patch, we'll see how that works14:22
dtantsurdo we have somebody here understanding swift?14:22
* mwhahaha points to cschwede14:22
mwhahahaso how do we handle this on the undercloud14:23
mwhahahaor wait maybe we are doing it when we run the commands that rely on this14:23
*** ktibi has joined #tripleo14:23
dtantsurwe're not doing this in the undercloud yet14:23
jistrmatbu, chem: please when you have some time take a look at the pre_upgrade_rolling_tasks review https://review.openstack.org/#/c/55207314:23
dtantsurbut given that we create networks in bash....14:24
mwhahahadtantsur: we leverage temp urls14:24
mwhahahain other ways14:24
*** cshastri has joined #tripleo14:24
dtantsurmmm, interesting14:24
mwhahahalet me see how we do this14:24
cschwedewhat's the issue with swift?14:24
gfidentedtantsur cargo-culting from nova14:24
gfidentedtantsur even you references are too much for me14:24
mwhahahacschwede: dtantsur is unhappily trying to manage temp urls for an account14:24
dtantsurcschwede: yeah, I essentially wonder why I even have to create a temporary URL key myself..14:25
*** derekh has quit IRC14:25
mwhahahadtantsur: so we get away with it by handling it in the action that interacts with swift, https://github.com/openstack/tripleo-common/blob/master/scripts/upload-swift-artifacts#L13514:25
dtantsurI understand why I may want to set it to something, but why not have a sane default?14:25
matbujistr: ack14:25
mwhahahadtantsur: so why can't the operation in ironic do it rather than it be preconfigured14:25
dtantsurmwhahaha: ah, I remember that. so yes, we do it in bash14:25
*** ykarel is now known as ykarel|away14:25
*** jfrancoa has joined #tripleo14:25
mwhahahadtantsur: right but it's done at the usage point, so is there a reason it can't be done in the action calling swift14:26
dtantsurmwhahaha: because it will be racy, if I understand it right. imagine several conductors do it simultaneously14:26
mwhahahadtantsur: or are you seting the key in a config14:26
dtantsurmwhahaha: we used to set it in the config, I've fixed it already14:26
cschwededtantsur: because you actually might not want a key at all? if there is no key, the feature is not working, which is indeed sth some users want14:26
cschwededtantsur: so depending on whom you ask, there are different "sane" defaults :)14:26
dtantsurcschwede: a weird way to disable a feature, if you ask me..14:26
openstackgerritmathieu bultel proposed openstack/python-tripleoclient master: WIP -- do not inherit converge from deploycommand class  https://review.openstack.org/55493414:26
mwhahahadtantsur: are you new to openstack? :D14:27
*** derekh has joined #tripleo14:27
dtantsurask chandankumar :D14:27
mwhahahaconsistency is not our forte14:27
*** bfournie has quit IRC14:27
cschwededtantsur: it's really up to the user, the operator typically enables it cluster-wide, and the user can decide if it is needed on an account or per-container basis14:27
mwhahahaneither is enabling/disabling features14:27
*** nyechiel_ has quit IRC14:27
dtantsurcschwede: well, $ openstack object store account set --temporary-urls-enabled would work so much better for me..14:28
dtantsurI think the issue is merging two actions into one: enabling temporary URLs and setting the key14:28
dtantsurthe former can be run idempotent, the latter, generally speaking, not14:28
*** bfournie has joined #tripleo14:29
cschwedebut the amount of requests is the same? either i create a random key and set it, or it is set by default and i need to read it from the metadata?14:29
dtantsurcschwede: what happens if two ironic conductors generate a random key and try to set it?14:29
dtantsurat the same time?14:29
mwhahahadtantsur: so if you just want to ensure it's set, it's a bootstrap exec command during the deployment14:30
*** ykarel|away has quit IRC14:30
mwhahahadtantsur: it doesn't have to be puppet necessarily14:30
cschwededtantsur: last one "wins" (assuming that the same time does not exist, there will be some microseconds between the requests)14:30
dtantsurmwhahaha: well, that's the question I started with: can I bypass puppet? :)14:30
mwhahahadtantsur: but it comes down to preping swift correctly, but is it done on the udnercloud/overcloud14:30
dtantsurcschwede: right, so one conductor will end up with an invalid key, right?14:31
cschwederight14:31
mwhahahadtantsur: well i wanted to know what you were actually trying to do :D14:31
dtantsurheh14:31
dtantsurmwhahaha: okay, so the "bootstrap exec command". what is it? do you have an example?14:31
cschwedeso talking about swift on the undercloud, there is already a key set during deployment? that could be used?14:31
mwhahahayou focused on how much puppet sucks rather than explaing what you were actually trying to do14:31
dtantsurcschwede: it's on a different account14:31
cschwedeah, got it14:31
mwhahahaie configure swift from a single host during the deployment14:31
* dtantsur puts aside his opinion on puppet14:32
cschwededtantsur: there is already an action that creates the key in mistral, can't that be reused for the other account?14:32
dtantsurcschwede: we just discovered that its done in bash14:32
cschwedeie before puppet et al are running?14:32
dtantsurwait14:32
dtantsurhow can we do anything with swift before swift is installed by puppet?14:33
mwhahahadtantsur: are you trying to configure this on the overcloud or udndercloud14:33
cschwededtantsur: oh sorry, i misunderstood. i thought it was after UC install14:33
mwhahahadtantsur: i think that changes the conversation14:33
dtantsurmwhahaha: both. we can start with either, if it's easier14:33
mwhahahawell the solution may be different14:33
mwhahahaso it matters14:33
mwhahahaif you will need to do it on both, then we need a deployment solution14:34
mwhahahaif you need to do it on the undercloud only then we already have mistral actions to do i think14:34
mwhahahaanyway sec14:34
dtantsurat least on the undercloud. ideally, both.14:34
mwhahahaalso containerized undercloud is probably the targeted solution right?14:34
*** aputtur has quit IRC14:35
mwhahahaor are you going to need to backport this14:35
mwhahahaif so then it has to be puppet14:35
dtantsurmwhahaha: no backports14:35
mwhahahak14:35
cschwededtantsur: so this one exists since Newton: https://github.com/openstack/tripleo-common/blob/master/tripleo_common/actions/swifthelper.py14:36
mwhahahadtantsur: example of bootstrap_host_exec https://github.com/openstack/tripleo-heat-templates/blob/master/docker/services/nova-api.yaml#L223-L23714:36
cschwededtantsur: which is used to set the tempurl key on the UC14:36
dtantsurcschwede: thanks14:36
dtantsurmwhahaha: nice. can it used overcloudrc credentials though?14:37
mwhahahadtantsur: so it runs during the deployment on the host. not sure what's available in terms of creds14:37
mwhahahadtantsur: so if it requires creds that's usually a post configuration of some sort14:38
dtantsurwell, kind of, yes14:39
mwhahahadtantsur: swift does something like... https://github.com/openstack/tripleo-heat-templates/blob/fefecf633ab42a9bf2e4fc95a5927db6e9a17153/docker/services/swift-proxy.yaml#L102-L12614:39
mwhahahato pull creds out of the config14:39
mwhahahaso if it has access to the creds you can craft a magical (terrible) shell script14:39
dtantsuraha!14:39
*** egallen has joined #tripleo14:40
openstackgerritMerged openstack/tripleo-heat-templates master: [FFU] Hook to allow user to pass a custom script for repo switching.  https://review.openstack.org/53950314:40
*** amoralej|lunch is now known as amoralej14:40
dtantsurmwhahaha: thanks, this is probably what I need. And it will prevent you from hearing more complaints about puppet, at least in the near future ;)14:42
* dtantsur cannot guarantee absence of complaints about kolla though14:42
mwhahaha:D14:42
mwhahahatrown|ruck:, myoung|rover|mtg: so do we have a bug for all the 3node tempest failures? seems to be ssh connection issues, is that an overlap of an existing bug?14:48
mwhahahatrozet: myoung|rover|mtg: example http://logs.openstack.org/29/550029/3/check/tripleo-ci-centos-7-3nodes-multinode/d413483/job-output.txt.gz#_2018-03-21_08_56_05_80019414:48
owalshmwhahaha: are my horrible hacky docker_config scripts reproducing?14:49
mwhahahaowalsh: you know it14:49
trozetmwhahaha: you mean trown^^^?14:49
mwhahahai do14:49
mwhahahatrozet: unless you want to fi xit14:49
trown|rucktrozet: you take it14:49
trown|rucktrozet: you got this14:49
trown|ruck:)14:49
trozettrown|ruck: i dont think im qualified :)14:50
mwhahahano one is qualified, we're all winging it14:50
openstackgerritAndy Smith proposed openstack/tripleo-heat-templates master: Support separate oslo.messaging services for RPC and Notification  https://review.openstack.org/50796314:50
* trown|ruck confirms this14:50
trown|ruckmwhahaha: seems possibly related to https://bugs.launchpad.net/tripleo/+bug/1755485 ... but maybe not14:51
openstackLaunchpad bug 1755485 in tripleo "Barbican tempest test failing to ssh to cirros image" [Critical,Triaged]14:51
trown|ruckmwhahaha: might not be just barbican that fails to ssh to cirros14:51
mwhahahatrown|ruck: it's possible14:51
*** gyankum has quit IRC14:51
mwhahahait might be an ovs issue or something, would point to a multinode issue14:52
trozethey guys what generates/controls logs in /var/log/containers?14:52
mwhahahatrozet: it's how the containers mount their logs i think14:52
mwhahahai think we map /var/log/containers/<container>/ as /var/log14:53
trozetmwhahaha: whats the difference then between that and docker logs14:53
trozetmwhahaha: do services just output same logs in stdout and the file?14:53
mwhahahatrozet: container/ logs are usually service output logs14:53
mwhahahawhich docker/ folder are you talking about?14:53
openstackgerritHonza Pokorny proposed openstack/tripleo-ui master: eslint: use as-needed for arrow-body-style  https://review.openstack.org/54670714:53
trozetmwhahaha: when you do docker logs <container> vs /var/log/containers/<service>/14:54
mwhahahadocker logs <container> is stdout i think14:54
owalshtrozet: I think jaosorior added something to t-h-t to control whether we log to /var/log/containers/<serivce> or the docker logs (stdout/err)14:54
owalshdefault is /var/log/containers/<service>14:55
*** ykarel|away has joined #tripleo14:55
*** ykarel|away is now known as ykarel14:55
trozetowalsh: yeah so for opendaylight, i only see docker logs work, theres nothing in /var/log/containers/opendaylight14:55
trozetowalsh: but i see other services there, so trying to figure out what is missing14:55
*** jaganathan has quit IRC14:56
dtantsurcan someone please remind me how start_order works: do smaller values get executed first?14:56
jaosoriortrozet: ultimately (when kubernetes comes) it would be better to mvoe to docker logs <service>14:56
owalshdtantsur: smaller first, default is 0 IIRC. NB it's host scope14:57
dtantsurthnx14:57
*** akane has quit IRC14:57
mwhahahatrozet: do you have a docker/services/logging/files/opendaylight.yaml?14:57
openstackgerritMerged openstack/instack-undercloud master: Enable TLS by default  https://review.openstack.org/55238214:57
trozetjaosorior: i see directories there for every service, some services have no logs in their directory though14:58
mwhahahatrozet: see https://github.com/openstack/tripleo-heat-templates/tree/107b610923ba5d39f90c3a6a63bf2d3642e1b35d/docker/services/logging/files14:58
trozetjaosorior: but there is no directory for ODL14:58
jaosoriortrozet: I didn't do the patches for ODL. So Id on't really know why that was done. But ultimately if you can access them via docker logs <odl container name>, it's in the right direction :D14:58
openstackgerritCarlos Camacho proposed openstack/tripleo-quickstart-extras master: Collect installed cron jobs  https://review.openstack.org/55488914:58
jaosoriortrown|ruck: could you check this out https://review.openstack.org/#/c/552781/ ?14:59
*** yamahata has joined #tripleo14:59
trozetjaosorior: we changed ODL to not output logs to a file anymore and only stdout so that docker logs works14:59
*** agopi has joined #tripleo15:00
trown|ruckjaosorior: moved it to top of my list15:00
*** nyechiel_ has joined #tripleo15:00
trozetjaosorior: so is it acceptable to not use this logging/files stuff in THT and just use docker logs?15:00
jaosoriortrozet: well, I think it is. That's ultimately where we wanna go.15:01
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-common master: WIP: TLS by default for the overcloud  https://review.openstack.org/55492615:01
rbradyWorkflows squad status meeting: https://etherpad.openstack.org/p/tripleo-workflows-squad-status15:01
trozetjaosorior: ok ty15:01
rbrady^^ rbrady,d0ugal,apetrich,thrash,toure,jtomasek15:01
*** dtrainor has joined #tripleo15:02
openstackgerritMartin Mágr proposed openstack/tripleo-common master: Add and fix healthcheck scripts for Octavia services  https://review.openstack.org/55494615:02
d0ugalrbrady: omw15:02
*** nyechiel_ has quit IRC15:03
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates master: Add support to ironic "direct" deploy interface  https://review.openstack.org/52934215:05
*** mdnadeem has quit IRC15:06
*** cshastri has quit IRC15:08
*** agurenko has quit IRC15:08
*** moshele has quit IRC15:09
*** ooolpbot has joined #tripleo15:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION15:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711115:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717415:10
*** ooolpbot has quit IRC15:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]15:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]15:10
*** wojdec has joined #tripleo15:11
*** thrash is now known as thrash|biab15:13
*** gkadam_ has quit IRC15:15
*** abishop has quit IRC15:19
*** etingof has quit IRC15:19
*** ukalifon has quit IRC15:20
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci master: DNM - test containerized undercloud upgrades  https://review.openstack.org/55363315:26
*** oscar has quit IRC15:27
*** egallen has quit IRC15:28
*** egallen has joined #tripleo15:30
*** kbyrne has quit IRC15:30
*** egallen has quit IRC15:30
*** agopi is now known as agopi|lunch15:31
openstackgerritMerged openstack/tripleo-quickstart-extras master: Ensure gated packages are installed during upgrade.  https://review.openstack.org/54941415:31
openstackgerritMerged openstack/tripleo-upgrade master: FFU: We need to be root to install ansible-pacemaker package.  https://review.openstack.org/55488715:31
*** chem has quit IRC15:32
openstackgerritMartin Mágr proposed openstack/tripleo-common master: [WIP] Activate another set of healthchecks  https://review.openstack.org/55050815:34
*** psahoo has quit IRC15:34
openstackgerritMartin Mágr proposed openstack/tripleo-common master: Add and fix healthcheck scripts for Octavia services  https://review.openstack.org/55494615:35
*** liverpooler has joined #tripleo15:35
*** kbyrne has joined #tripleo15:35
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart master: Remove unnecessary parameters from featureset047.  https://review.openstack.org/55385015:37
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart master: Remove yum update from repo_cmd_after.  https://review.openstack.org/55495115:37
openstackgerritLukas Bezdicka proposed openstack/tripleo-heat-templates stable/queens: [FFU] Hook to allow user to pass a custom script for repo switching.  https://review.openstack.org/55495315:39
*** florianf_ has quit IRC15:39
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart-extras master: WIP: Add undercloud upgrades playbook.  https://review.openstack.org/55382715:40
*** florianf has joined #tripleo15:42
*** thrash|biab is now known as thrash15:43
thrashmwhahaha: would you say this validation no longer serves a purpose? https://github.com/openstack/tripleo-common/blob/master/workbooks/validations.yaml#L274-L33715:44
openstackgerritMartin Mágr proposed openstack/tripleo-common master: [WIP] Activate another set of healthchecks  https://review.openstack.org/55050815:44
mwhahahathrash: has it been moved to tripleo-validations?15:45
thrashmwhahaha: Was starting that work... But I don't even feel like it is necessary?15:46
mwhahahathrash: i'm unsure, it's basically checking that the ironic stuff is properly loaded before it gets kicked off. I kinda think that's important15:46
thrashmwhahaha: Ack. I think I made the mistake of checking against an ovb env. :)15:47
thrashmwhahaha: I'll continue with what I was doing then. :D15:47
mwhahahayes carry on :D15:47
openstackgerritCarlos Camacho proposed openstack/tripleo-docs master: WIP: Add FFU docs  https://review.openstack.org/54989215:48
*** suuuper has quit IRC15:51
*** suuuper has joined #tripleo15:52
*** suuuper has quit IRC15:52
*** suuuper has joined #tripleo15:52
*** khrystoph has quit IRC15:53
openstackgerritAttila Darazs proposed openstack/tripleo-quickstart-extras master: GATE CHECK for quickstart-extras  https://review.openstack.org/47260715:56
*** paramite_ has quit IRC15:57
openstackgerritMerged openstack/tripleo-ui master: eslint: use as-needed for arrow-body-style  https://review.openstack.org/54670715:58
*** yamahata has quit IRC16:00
*** khrystoph has joined #tripleo16:02
*** dparkes has quit IRC16:03
*** jlabarre has quit IRC16:07
*** itlinux has joined #tripleo16:07
openstackgerritMichele Baldessari proposed openstack/puppet-pacemaker master: WIP Fix up fence_compute parameters  https://review.openstack.org/55497516:08
itlinuxhello all and good morning from Cali rainy day today!16:08
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates master: Add purge to Nova cleanup cron.  https://review.openstack.org/55196616:09
*** khrystoph has quit IRC16:10
*** ooolpbot has joined #tripleo16:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION16:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711116:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717416:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175747416:10
*** ooolpbot has quit IRC16:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]16:10
*** etingof has joined #tripleo16:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]16:10
openstackLaunchpad bug 1757474 in tripleo "periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset027-master fails with undefined groups.overcloud" [Critical,Triaged]16:10
*** aufi has quit IRC16:11
*** itlinux has quit IRC16:15
*** waleedm has quit IRC16:16
*** yolanda_ has joined #tripleo16:19
*** yolanda has quit IRC16:19
aleeweshay, arxcruz looks like the mtu patch did not fix the barbican CIX issue16:21
arxcruzalee: :(16:21
*** khrystoph has joined #tripleo16:21
arxcruzalee: so, let's try to reproduce it again16:21
*** derekh has quit IRC16:21
arxcruzans see if we can reach the root cause16:21
*** derekh has joined #tripleo16:22
aleearxcruz, any other ideas?  yeah -- I'm going to run the reproducer script16:22
arxcruzalee: no ideas, once you have the env, let me know, i can digg a little bit16:22
*** wolverineav has joined #tripleo16:22
aleebeagles, maybe you could take a look?  https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset017-queens/6e4ffba/undercloud/home/jenkins/tempest/tempest.html.gz16:22
*** dsariel has quit IRC16:23
aleesame problem as before -- looks like we get an instance, attach a floating ip and a volume , and try to ssh to the instance and that fails.16:23
openstackgerritMerged openstack/puppet-tripleo stable/queens: Create vhost_socket_dir with proper permissions  https://review.openstack.org/55378816:23
ccamachohey owalsh o/ its faster here :) so, the thing is that we can play with ruby on the puppet manifest, but at the end it should be translated to something like16:23
ccamacho * * * * * nova-manage db purge --before <here a date>   and that date will be fixed.. maybe we will need another parameter like --expire <number>16:23
aleeFailed to establish authenticated ssh connection16:23
ccamachoand the number can be translated to the current date - n days16:24
*** kopecmartin has quit IRC16:24
aleebeagles, can't see any errors in the nova or neutron logs16:24
*** khrystoph has quit IRC16:26
openstackgerritJose Luis Franco proposed openstack/tripleo-upgrade master: Include new CLI changes for overcloud update.  https://review.openstack.org/55051716:27
*** trown|ruck has quit IRC16:28
*** suuuper has quit IRC16:28
*** dpawar has joined #tripleo16:28
aleearxcruz, actually -- whats the password to connect supposed to be?16:29
alee502 1326 ERROR barbican_tempest_plugin.tests.scenario.manager User: cirros, Password: None16:29
*** yprokule has quit IRC16:30
arxcruzalee: it's supposed to use ssh keys, not password, nevertherless the cirros password is cubswin:)16:31
*** egallen has joined #tripleo16:32
aleearxcruz, yeah - thats prob fine then ..16:32
*** abishop has joined #tripleo16:33
*** ramishra has quit IRC16:37
*** hjensas has joined #tripleo16:38
*** karthiks has quit IRC16:38
*** pkovar has quit IRC16:40
*** thrash is now known as thrash|biab16:40
mwhahahaccamacho: you could just add a bash date generation command in the cron entry16:41
*** dmacpher has quit IRC16:41
*** agopi|lunch has quit IRC16:41
ccamachomwhahaha \o/ yeah!16:41
ccamachothanks!16:41
*** agopi|lunch has joined #tripleo16:42
openstackgerritJiri Stransky proposed openstack/python-tripleoclient stable/pike: Get message from websocket instead from zaqarclient directly  https://review.openstack.org/55498616:42
mwhahahaccamacho: date +%Y-%m-%d -d "-7 days"16:42
mwhahahaseems to work16:42
*** egallen has quit IRC16:42
openstackgerritJiri Stransky proposed openstack/python-tripleoclient stable/pike: Get message from websocket instead from zaqarclient directly  https://review.openstack.org/55498616:45
trozetmwhahaha: do i need to recheck this (3rd party ci failure) or can you set workflow? https://review.openstack.org/#/c/554909/16:47
*** myoung|rover|mtg is now known as myoung16:48
mwhahahamyoung|rover|mtg, weshay: rdo cloud die again?16:48
beaglesalee, I wonder if there something wrong happening with the metadata16:48
mwhahahabeagles: are you looking into the ssh timeout thing? We're also seeing it ont he 3 node jobs16:48
* myoung flips a coin and peers at mwhahaha16:48
arxcruzlol16:48
mwhahahamyoung: dat reliability16:48
beaglesmwhahaha, do we know if the VFms are actually coming up?16:49
beaglesVMs16:49
mwhahahabeagles: not sure i hadn't looked, just noticed that we seem to be hitting something very similar to the barbican tempest problem in the 3 node jobs16:49
beaglesI guess we wouldn't be getting authentication errors if they wren't16:49
beaglesoic16:49
beaglesmwhahaha, sorry I thought you meant generally :) so it is barbican specific16:50
aleebeagles, yeah - I noticed that the metadata was not being retrieved ..16:50
mwhahahawell i wasn't sure if it was or not16:50
*** itlinux has joined #tripleo16:50
beaglesmwhahaha, k16:50
mwhahahatrozet: i +A'd it. no need for the 3rd party on those16:50
trozetmwhahaha: ty16:51
mwhahahabeagles: http://logs.openstack.org/13/554213/1/check/tripleo-ci-centos-7-3nodes-multinode/85d6763/job-output.txt.gz#_2018-03-21_12_47_13_935784 example, i'll go poke at the nova logs16:51
aleemwhahaha, beagles sorry I'm confused -- did you say you were seeing this in other scenarios too - or just in the barbican test case?16:51
mwhahahaalee: i'm seeing an ssh timed out in 3 node jobs a bunch today16:52
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates master: Default environment/services/* to docker  https://review.openstack.org/55006016:52
mwhahahaalee: so if that's what you're seeing in the tempest results, then possibly16:52
*** myoung is now known as myoung|food16:53
*** quiquell has quit IRC16:54
weshaymwhahaha, not sure16:54
mwhahahaweshay: https://review.rdoproject.org/jenkins/job/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-queens/221/console wasn't looking promissing (failure from 30 mins ago)16:55
mwhahahaResourceInError: resources.baremetal_server: Went to status ERROR due to "Message: No valid host was found. There are not enough hosts available., Code: 500"16:55
*** karthiks has joined #tripleo16:55
weshayk16:55
weshaymwhahaha, I'll check the tenant16:56
mwhahahaalee, beagles is this what you were talking about for failed metadata: http://logs.openstack.org/13/554213/1/check/tripleo-ci-centos-7-3nodes-multinode/85d6763/logs/subnode-3/var/log/containers/nova/nova-compute.log.txt.gz#_2018-03-21_12_45_21_48016:57
openstackgerritMarios Andreou proposed openstack/tripleo-docs master: WIP: Add docs for Q upgrade workflow  https://review.openstack.org/53585916:58
mwhahahaI see: failed to get http://169.254.169.254/2009-04-04/user-data16:58
beaglesmwhahaha, yeah - I'm wondering if the ssh key isn't getting configured because metadata isn't available17:00
*** etingof has quit IRC17:00
gfidentetherve was looking with fultonj into https://review.openstack.org/#/c/551920/17:00
gfidented0ugal ^^17:00
* mwhahaha checks against a successful job17:00
gfidenteI see you wrote there it can interrupt regular workflows17:00
aleeright17:00
gfidentewas trying to understand why that is?17:00
mwhahahahmm i don't see the same output on success17:01
mwhahahaprobably cause we don't call get console output17:02
mwhahahaso we don't bother logging it17:02
mwhahahabrilliant17:02
*** udesale has quit IRC17:03
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: duplicate fs20 for libvirt  https://review.openstack.org/55499117:03
*** hjensas has quit IRC17:05
mwhahahabeagles: i'm seeing requests in the api-metadata log17:05
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates master: Add purge to Nova cleanup cron.  https://review.openstack.org/55196617:06
beaglesmwhahaha, mm well that's something17:06
mwhahahathe last time this happened the undercloud was serving the metadata up17:06
* mwhahaha checks it's not that again17:06
openstackgerritGiulio Fidente proposed openstack/tripleo-common master: Force ANSIBLE_LOAD_CALLBACK_PLUGINS to False for collect_nodes_uuid  https://review.openstack.org/55263617:06
mwhahahabeagles: for example in the job i'm looking at, http://logs.openstack.org/13/554213/1/check/tripleo-ci-centos-7-3nodes-multinode/85d6763/logs/subnode-3/var/log/containers/nova/nova-api-metadata.log.txt.gz#_2018-03-21_12_40_21_48917:07
*** salmankhan has quit IRC17:08
*** marios has quit IRC17:08
*** marios has joined #tripleo17:08
*** trown has joined #tripleo17:09
beaglesmwhahaha, ack17:10
*** ooolpbot has joined #tripleo17:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION17:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711117:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717417:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175747417:10
*** ooolpbot has quit IRC17:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]17:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]17:10
openstackLaunchpad bug 1757474 in tripleo "periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset027-master fails with undefined groups.overcloud" [Critical,Triaged]17:10
*** panda is now known as panda|off17:10
*** trown is now known as trown|lunch17:11
EmilienMbogdando: I probably missed something but I don't see tripleo-undercloud-passwords.yaml generated anymore17:11
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates master: Add purge to Nova cleanup cron.  https://review.openstack.org/55196617:11
openstackgerritBogdan Dobrelya proposed openstack/puppet-tripleo stable/queens: Replace perl with awk  https://review.openstack.org/55459917:12
weshaymwhahaha, when you have moment of clarity and peace, I would like to destroy that by running that bug about network-isolation by you17:12
openstackgerritBogdan Dobrelya proposed openstack/puppet-tripleo stable/pike: Replace perl with awk  https://review.openstack.org/55499317:12
mwhahahaweshay: pfft clarity is overrated17:12
*** hjensas has joined #tripleo17:12
*** hjensas has quit IRC17:12
*** hjensas has joined #tripleo17:12
mwhahahaweshay: whatcha got17:12
EmilienMbogdando: I think that's because of https://review.openstack.org/#/c/54287517:12
*** salmankhan has joined #tripleo17:12
EmilienMbogdando: it broke the undercloud upgrades to be containerized17:13
weshaymwhahaha, so we've been poking at this by turning net-iso on/off https://bugs.launchpad.net/tripleo/+bug/175711117:13
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]17:13
weshaymwhahaha, yatin has some interesting comments to read through on https://review.openstack.org/#/c/554528/17:13
bogdandoEmilienM: ._.17:13
*** dsneddon has quit IRC17:13
EmilienMbogdando: I think there are things not backward compatible in that patch17:14
mwhahahaweshay: i would like to understand why we're now making net-iso required (it wasn't previously) so something's changed and probably not for the better17:14
weshaymwhahaha, we're not trying to make it required17:14
EmilienMbogdando: but tripleo-undercloud-passwords.yaml is no more handled on ~ directory17:14
bogdandoEmilienM: let's revert then17:15
weshaymwhahaha, what we noticed is that w/ net-iso everything works.. everything being a full tempest run.. w/o net-iso we see networking issues that cause a lot of tempest failures.. around 5017:15
bogdandoI'm not also sure what the comment in https://review.openstack.org/#/c/542875/47/tripleoclient/constants.py means17:15
weshaymwhahaha, so in the interest of keeping non-net-iso deployments working in queens, I'm bringing this to your attention17:15
weshayqueens/master17:15
mwhahahaweshay: what do networking folks say?17:16
weshaywhat ever happened, happend recently..17:16
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Allow for passing boot-time vars/args to OC nodes  https://review.openstack.org/55296717:16
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Add an openshift-cns service  https://review.openstack.org/54393317:16
weshayI'll go get them fair point17:16
EmilienMbogdando: the file is now on .undercloud-heat-installer/tripleo-undercloud-passwords.yaml17:16
mwhahahaweshay: cause it seems to be a regression somewhere, i'm wondering if it's the same probelm we're seeing with the ssh stuff17:16
bogdandoEmilienM: would a small symlink patch restored the backwards compat then?17:16
EmilienMbogdando: why did you put files into .undercloud-heat-installer directory?17:17
EmilienMand not HOME ?17:17
bogdandoEmilienM: it comes from the comments17:17
bogdandoand proposals...17:17
bogdandoand my imagination :D17:17
bogdandowrt the implementation17:18
*** zoli is now known as zoli|gone17:18
*** zoli|gone is now known as zoli17:18
EmilienMbogdando: well, the tripleo-undercloud-passwords.yaml generated is no longer based on existing ~/undercloud-passwords.conf17:18
EmilienMand it breaks upgrades17:18
*** marios has quit IRC17:19
mwhahahaweshay: did you know that the dhcp client stuff was udpated recently in centos (wonder if related)17:19
owalshccamacho: yea, +1, was just about to suggest what mwhahaha already had17:20
*** chem has joined #tripleo17:20
weshayI did not know that17:20
mwhahahaweshay: in comparing 77 vs 78 for queens17:20
mwhahahathe dhcp client stuff changed17:20
*** dsneddon has joined #tripleo17:20
mwhahahaweshay: https://www.diffchecker.com/KCbL3JUz17:21
EmilienMbogdando: what do we do?17:24
*** hjensas has quit IRC17:25
mwhahahaweshay: though that's supposed to just be branding, i guess the next step is to go through the few components and see if there is anything that's changed in those17:25
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Make running workflows more robust  https://review.openstack.org/54975117:25
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Make Validation actions use startWorkflow  https://review.openstack.org/55008617:25
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Make Nodes workflow actions use startWorkflow  https://review.openstack.org/55023217:25
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Refactor RolesActions to use startWorkflow  https://review.openstack.org/55253217:25
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Refactor LoggerActions to use startWorkflow  https://review.openstack.org/55254517:25
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Refactor PlansActions to use startWorkflow  https://review.openstack.org/55263817:25
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart master: Ignore empty values for dlrn hashes  https://review.openstack.org/55488217:26
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart master: Remove unnecessary parameters from featureset047.  https://review.openstack.org/55385017:26
d0ugalgfidente: what's up?17:26
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart-extras master: WIP: Add undercloud upgrades playbook.  https://review.openstack.org/55382717:26
*** bogdando has quit IRC17:26
*** ykarel is now known as ykarel|afk17:26
gfidented0ugal https://review.openstack.org/#/c/552452/ why is it affecting regular workflows too?17:27
mwhahahagfidente: it needed to be on another queue17:27
mwhahahagfidente: otherwise i think it's injecting a failure17:27
d0ugalgfidente: Yeah, I think it just meant the messages were confusing.17:28
gfidentemwhahaha yeah but my point is, it seems to be affecting workflows which don't use the zaqar queue17:28
d0ugalagreed17:28
d0ugalbut really, clients should filter by execution id in the workflow messages :)17:28
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Add space above Edit Configuration button  https://review.openstack.org/55450117:28
d0ugalthat is what tripleoclient does17:28
*** rbowen has quit IRC17:28
*** agopi|lunch is now known as agopi|17:28
*** agopi| is now known as agopi17:28
gfidented0ugal wait, I am saying that the ceph-ansible workflow, which does not use any zaqar queue17:29
*** rbowen has joined #tripleo17:29
weshaymwhahaha, yatin pointed out https://review.openstack.org/#/c/548554/  /me checking the rpms in the working job17:29
openstackgerritJose Luis Franco proposed openstack/tripleo-heat-templates stable/pike: DNM: Test undercloud upgrades P->Q.  https://review.openstack.org/55499617:29
gfidentewas affected by the issue caused by the cron trigger on a completely different workflow17:29
d0ugalgfidente: indeed, it shouldn't impact that workflow17:29
mwhahahaweshay: that wason't merged on queens until 7 days ago https://review.openstack.org/#/c/55096517:29
mwhahahaweshay: not likely the cause of the master issues17:29
d0ugalgfidente: are you telling me it was affected?17:30
d0ugalgfidente: or are you asking me if it was17:30
*** florianf has quit IRC17:30
gfidented0ugal I think it was17:30
ccamachoowalsh thanks :)17:30
d0ugalgfidente: if it was affected, can you give me more details? in what way?17:30
*** holser__ has quit IRC17:30
d0ugalgfidente: I gotta run in a minute, but I'd like to look into it. because it 100% shouldn't have been affected :)17:30
gfidented0ugal ack, we might be able to collect logs17:31
mwhahahait's likely that it broke the calling workflow and not actually the ceph one17:31
*** lucasagomes is now known as lucas-afk17:31
mwhahahaso it breaks the deployment one17:31
gfidentemwhahaha yeah which is probably overcloud_deploy17:31
mwhahaharight17:31
gfidentebut in the engine log I saw the req- for ceph-install fail17:31
gfidentetiming out after execution17:31
gfidenteeven though ansible-playbook returned 017:31
gfidenteI'll see if I can collect good logs17:32
*** jpich has quit IRC17:33
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Change default output-dir to be $HOME  https://review.openstack.org/55499717:34
*** moshele has joined #tripleo17:34
mwhahahaweshay: so i'm not seeing anything that sticks out in that diff for queens between 77 and 78 which makes me thing it's probably more rdo cloud than anything17:35
weshaymwhahaha, I can recreate the issue outside of rdo-cloud17:35
mwhahahaweshay: that would also help explain why net-iso vs non-net-iso solves it17:35
weshaymwhahaha, via libvirt17:35
mwhahahaorly17:36
mwhahahaweshay: i wonder if i this is fallout from the docker iptables stuff17:36
weshaymwhahaha, I have two libvirt deployments for fs20 one w/ net-iso one w/o and the deployment w/o fails17:36
mwhahahaweshay: if so i would like to punch peoples17:36
*** dpawar has quit IRC17:37
*** NobodyCam has quit IRC17:37
*** Tyrantelf_ has quit IRC17:37
*** Hazelesque has quit IRC17:38
*** Hazelesque has joined #tripleo17:38
*** alee_ has joined #tripleo17:38
*** v1k0d3n has quit IRC17:38
*** Tyrantelf has joined #tripleo17:39
*** alee has quit IRC17:39
*** andreaf has quit IRC17:39
*** NobodyCam has joined #tripleo17:39
*** andreaf_ has joined #tripleo17:39
*** v1k0d3n has joined #tripleo17:40
*** moshele has quit IRC17:41
mwhahahaweshay: so do the neutron folks have anything to say why the port binding is failing, https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-queens/4cf9425/overcloud-novacompute-bar-0/var/log/containers/nova/nova-compute.log.txt.gz#_2018-03-21_07_03_53_85517:41
*** andreaf_ is now known as andreaf17:41
d0ugalgfidente, mwhahaha - it shouldn't break the calling workflow either - workflows don't read from zaqar (well, other than the UI logging one)17:41
d0ugalgfidente: I'll inspect logs tomorrow :)17:41
d0ugaland I need to read the bug, I never fully understood the change17:42
mwhahahait was a side effect of i think how we changes the reading of teh queue17:42
mwhahahaor something where it gets an error it just pukes17:42
gfidented0ugal which hopes had I to understand it then17:42
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Change default output-dir to be $HOME  https://review.openstack.org/55499717:42
d0ugalmwhahaha: yeah, but that still doesn't make sense to me :)17:43
gfidenteI support ehe pukes idea though17:43
d0ugalanyway, I really gotta run - guests arrived at my house17:43
gfidentetell them17:43
gfidenteabout it17:43
mwhahahaso it may not be the workflow that dies, but the client thinks it fails and then nukes things17:43
mwhahahagfidente: he might like the person, i wouldn't subject anyone i know to our problems17:43
*** myoung|food is now known as myoung17:45
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci master: DNM - test containerized undercloud upgrades  https://review.openstack.org/55363317:46
openstackgerritEmilien Macchi proposed openstack/tripleo-upgrade master: DNM - containerized undercloud upgrade  https://review.openstack.org/55362917:47
openstackgerritEmilien Macchi proposed openstack/tripleo-upgrade master: DNM - containerized undercloud upgrade  https://review.openstack.org/55362917:47
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci master: DNM - test containerized undercloud upgrades  https://review.openstack.org/55363317:48
*** thrash|biab is now known as thrash17:49
mwhahahaweshay: honestly it looks like openvswitch problems17:49
weshayya17:49
mwhahahaweshay: i traced it from nova, to neutron and to the metadata agent and there's some errors in ovs-vswitchd17:50
*** ffiore_ has quit IRC17:53
weshayi see this on the compute node17:53
weshayMar 21 06:36:44 overcloud-novacompute-bar-0 ovs-vsctl[18050]: ovs|00001|db_ctl_base|ERR|unix:/var/run/openvswitch/db.sock: database connection failed (No such file or directory)17:53
EmilienMdprince, mwhahaha : when you have time please take a look at https://review.openstack.org/#/c/550608/5/doc/source/install/containers_deployment/3rd_party.rst17:53
*** pickle has quit IRC17:53
*** pickle has joined #tripleo17:53
weshayhrm... but that is containerized now17:54
mwhahahaweshay: openvswitch is not containerized17:55
mwhahahanever has been17:55
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Add environment to enable Designate  https://review.openstack.org/55500617:56
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Deploy Designate in scenario003  https://review.openstack.org/55500717:56
weshayoh sorry.. was looking at the agent.. not the service17:56
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Begin adding environments with all params for a service  https://review.openstack.org/47592417:56
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Add ability to generate an environment index  https://review.openstack.org/49192517:56
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: WIP: Add service config env with all Designate settings  https://review.openstack.org/55500817:56
beaglesweshay, mwhahaha, is OVS not running on the. server?17:56
*** haleyb has quit IRC17:56
*** ebarrera has quit IRC17:56
mwhahahabeagles: which server17:56
beaglescompute17:56
beaglesjust wondering from weshay's comment17:57
weshayopenvswitch-2.8.2-1.el7.x86_6417:57
weshayopenvswitch-ovn-central-2.8.2-1.el7.x86_6417:57
weshayopenvswitch-ovn-common-2.8.2-1.el7.x86_6417:57
weshayopenvswitch-ovn-host-2.8.2-1.el7.x86_6417:57
dprinceEmilienM: ack17:57
weshayhttps://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-queens/4cf9425/overcloud-novacompute-bar-0/var/log/extra/rpm-list.txt.gz17:57
mwhahahabeagles: i don't see logs for it17:57
mwhahahaso maybe17:57
beaglesmmm...17:57
mwhahahait's not17:58
beaglesI predict things will not be happy if there is no OVS running on the compute ;)17:58
* mwhahaha wishes we had some sort of basic service validation before we did anything else17:59
* beagles nods17:59
weshaydon't see it running https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-queens/4cf9425/overcloud-novacompute-bar-0/var/log/extra/pstree.txt.gz17:59
mwhahahayea i don't see it in the sysctl service list from host_info either17:59
mwhahahahttps://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-queens/4cf9425/overcloud-novacompute-bar-0/var/log/host_info.txt.gz18:00
*** derekh has quit IRC18:00
mwhahahawhere did it go18:00
Tenguhello guys :)18:01
weshaybest data I have atm.. is to compare w/ pike18:02
weshayhttps://logs.rdoproject.org/openstack-periodic-24hr/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-pike/5c746b0/overcloud-novacompute-bar-0/var/log/extra/pstree.txt.gz18:02
weshayand it's there18:02
weshayhttps://logs.rdoproject.org/openstack-periodic-24hr/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-pike/5c746b0/overcloud-novacompute-bar-0/var/log/openvswitch/18:02
beaglesah whew18:02
beaglesobw18:02
beaglesyou are showing pike18:03
dprinceEmilienM: nice job on the new docs btw. My -1 is really for minor things, just so you notice it18:03
Tenguoh, hello beagles :). do you have a few minutes for a network issue?18:03
weshaybeagles, ya.. that was my only reference to something working atm18:03
EmilienMdprince: ok good, I'll look18:03
beaglesweshay, ah oka18:03
beagleslooks like ovs isn't being started18:03
beaglesTengu, depends :)18:03
Tengubeagles: doing an *upgrade* on pike (pike BM -> pike container - upgrade, not update), network seems broken on the three controllers I have. I heard you might know about it.18:04
Tengubeagles: if not, it's not a big issue, since apparently restarting the "network" service on the three controllers seems to correct the situation.18:05
beaglesTengu, interesting18:05
*** pblaho has quit IRC18:05
Tengubeagles: I know this usage isn't supported and it's kind of weird, but… ;)18:05
Tengubeagles: I just respawned my lab in order to re-run the upgrade process - if you're really interested, I can come back tomorrow with logs.18:06
beaglesTengu, not sure exactly what would cause that. The thing I'm looking at mainly has to do with process lifetimes and containers18:06
beaglesTengu, I don't think restarting networking would effect it18:06
*** quiquell has joined #tripleo18:06
beaglesTengu, it's worth reporting/cataloging !18:07
Tengubeagles: hmm ok. well, symptoms: controllers can't ping their default route anymore, nor do DNS resolutions. fun part, management network still work - I can ssh from the undercloud.18:07
Tengubeagles: so I'll let the script run and crash, and report back tomorrow the logs. What kind of logs would you need? /var/log/messages I guess, and… ?18:08
Tenguneutron maybe?18:08
beaglesTengu, yeah, sounds about right18:09
Tenguok. so stay tuned :).18:09
Tengupreparing the lab and fire.18:09
openstackgerritCarlos Goncalves proposed openstack/tripleo-heat-templates master: Containerize Neutron LBaaS service plugin  https://review.openstack.org/55501118:09
Tengu(it's soooo good to have a lab that can be respawned at will…)18:09
mwhahahaweshay: i thought ovs was enabled on the image18:09
EmilienMmwhahaha: do you hav ea package diff from before (working) to now?18:09
mwhahahaEmilienM: no cause someone broke logging18:10
*** ooolpbot has joined #tripleo18:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION18:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711118:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717418:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175747418:10
*** ooolpbot has quit IRC18:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]18:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]18:10
openstackLaunchpad bug 1757474 in tripleo "periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset027-master fails with undefined groups.overcloud" [Critical,Triaged]18:10
EmilienMlet me see RDO history18:10
mwhahaha(of course they did)18:10
weshaymwhahaha, ya.. logging here is killing this18:10
mwhahahaso at the moment it looks like ovs isn't running on the compute node18:10
mwhahahai thought that was enabled by default from the image18:10
mwhahahaso i'm trying to track down where that's handled18:10
EmilienMso we updated in QUeens: https://review.rdoproject.org/r/#/c/12580/18:11
EmilienMbut looking for pike now18:11
*** quiquell has quit IRC18:11
mwhahahain pike we see https://logs.rdoproject.org/openstack-periodic-24hr/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-pike/5c746b0/overcloud-novacompute-bar-0/var/log/journal.txt.gz#_Mar_21_05_10_4918:11
mwhahahabut iun queens it's missing, https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-queens/4cf9425/overcloud-novacompute-bar-0/var/log/journal.txt.gz#_Mar_21_06_30_3018:12
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart-extras master: Add support to mixed upgrade for overcloud-prep-container role.  https://review.openstack.org/54047318:13
EmilienMwhy do we have openvswitch-2.8.2-1.el7.x86_64 ?18:13
*** jpena is now known as jpena|off18:14
EmilienMwe should have openvswitch-2.7.3-1.1fc27.el718:14
EmilienMif I read rdoinfo18:14
mwhahahawe bumped to 2.8 a while ago18:14
EmilienMah nevermind, I read queens logs18:14
EmilienMthe issue is only on pike right?18:14
weshayya.. in queens.. we don't see Mar 21 05:10:49 localhost.localdomain ovs-ctl[743]: Creating empty database /etc/openvswitch/conf.db [  OK  ]18:15
mwhahahaEmilienM: No queens + master18:15
EmilienMahh18:15
*** dtantsur is now known as dtantsur|afk18:15
mwhahahanetwork templates regression?18:16
mwhahahadoes os-net-config enable ovs?18:16
*** atoth has quit IRC18:16
beaglesit might as a side effect18:17
weshaymwhahaha, I have a box w/ this18:18
beaglesifup-ovssystem or something18:18
weshayif you want to jump on18:18
mwhahahaweshay: sure18:18
beaglesifup-ovs18:18
*** nyechiel_ has joined #tripleo18:19
*** trown|lunch is now known as trown18:19
mwhahahaweshay: so that one has the service running18:20
mwhahahaoh wait it failed18:20
*** EmilienM is now known as mimi18:21
*** mimi is now known as EmilienM18:21
*** gfidente is now known as gfidente|afk18:22
trownlol @mimi18:22
trownthat is my Mom's grandma name18:22
weshaysame18:22
weshaytrown, we're live debugging the fs20 issue18:22
weshayif you want to jump on18:22
trownsure18:22
mwhahahawat the hell18:23
mwhahahai wonder if this is related to the permissions changes we did for ODL18:23
* mwhahaha looks at trozet 18:24
mwhahahaor was it sriov18:24
mwhahahasec let me go look for the patch18:24
trozetmwhahaha: what's up?18:24
weshaytrozet, openvswitch is not starting on the compute nodes18:25
mwhahahatrozet: did we change something around vswitch for ODL or was that nfv18:25
*** jfrancoa has quit IRC18:25
* beagles vaguely recalls something nfv related18:25
trozetweshay, mwhahaha: there was a bug where neutron certs were not created on compute nodes so neutron-openvswitch-agent wouldnt start, we fixed that though18:25
trozetweshay: openvswitch wont start or neutron-ovs agent?18:26
weshayopenvswitch18:26
trozetweshay: TLS deployment or no?18:26
weshayah good question18:27
mwhahahaMar 21 18:22:03 overcloud-novacompute-bar-0 ovsdb-server[129745]: ovs|00005|ovsdb_jsonrpc_server|ERR|punix:/var/run/openvswitch/db.sock: listen failed: Is a directory18:27
mwhahahawonder if the socket is a folder which is causing the problems18:27
trownhmm like somethiing mkdiring var/run/openvswitch/db.sock ?18:28
weshaythis is not tls afaict18:28
mwhahahadocker will create it as a folder if it doesn't exist18:28
mwhahahai vaguely remember something abotu this18:28
trozetmwhahaha: i thought OVS is not containerized?18:28
mwhahahait's not18:28
mwhahahabut it might be getting hit by something18:29
*** ffiore has joined #tripleo18:29
trozetmwhahaha: i dont see why anything would make db.sock in tripleo18:29
ykarel|afkmwhahaha, can you check my comment on the patch: i think it's relevant18:30
ykarel|afkhttps://review.openstack.org/#/c/554528/18:30
trozetweshay: if you can give me login to the setup I don tmind taking a look18:30
*** haleyb has joined #tripleo18:30
ykarel|afkmwhahaha, running puppet to start ovs from container is causing it i think, few days back before containerizing neutron-ovs agent it started ovs18:31
mwhahahaykarel|afk: yea but ovs i think used to be started outside of puppet18:32
ykarel|afkmwhahaha, from the log i only found that it's either started by os-net-config or ovs-agent18:32
*** hjensas has joined #tripleo18:32
* mwhahaha isn't sure which is supposed to be starting it18:33
weshaywhat did you do to start it?18:33
weshayjust start?18:33
mwhahaharestarted it a few times18:34
ykarel|afkmwhahaha, https://github.com/openstack/puppet-neutron/blob/master/manifests/agents/ml2/ovs.pp#L21118:34
ykarel|afkpuppet start it manage_vswitch is true, which is true by default18:34
ykarel|afkafter containerizing it's not working as specified in the https://review.rdoproject.org/paste/show/87/18:35
openstackgerritMarius Cornea proposed openstack/tripleo-upgrade master: DNM: use rhos-release role pin_puddle option  https://review.openstack.org/55501818:35
mwhahahaykarel|afk: hmm ok so we were inheriting previously and we need to account for the thing that needs to be starting ovs18:35
ykarel|afkmwhahaha, yes18:36
mwhahahak i'll poke at it a bit more after my meeting18:36
trozetmwhahaha: ovs will actually be started before this18:36
trozetmwhahaha: os-net-config18:37
mwhahahawell it should be18:37
mwhahahabut isn't18:37
mwhahahaso somewhere we lost that startup18:37
mwhahahaor something18:37
* beagles guesses that it is running on the controller due to the presence of an OVS based interface/bridge (.e.g. br-ex)18:37
beaglesifup'ing that would start it up I think18:37
openstackgerritMerged openstack-infra/tripleo-ci master: Allow custom sequence of playbooks  https://review.openstack.org/54650118:38
dsneddonmwhahaha, beagles: Yeah, os-net-config writes out the ifcfg files that define the OVS bridges, then running ifup on the bridge starts OVS.18:38
mwhahahaso if it doesn't start until if up and a container started before it's up'ed the db.sock is a directory18:38
*** khyr0n has joined #tripleo18:38
mwhahahapreventing it from starting18:38
openstackgerritMerged openstack/tripleo-upgrade stable/pike: Remove ceph osd hieradata during upgrade  https://review.openstack.org/55357218:38
mwhahahawhich may be the problem18:38
mwhahahait needs to be started before all the containers18:39
dsneddonmwhahaha, You might have to put something in firstboot to start OVS, then, because os-net-config runs pretty early in the process18:39
ykarel|afkmwhahaha, but on compute without network isolation there is nothing created on /etc/os-net-config/config.json18:39
mwhahahait' snot relying on vswitch18:39
mwhahahaIIUC18:40
dsneddonmwhahaha, ykarel|afk: Without network isolation, the compute nodes use net-config-noop.yaml, which leads to an empty config.json18:40
ykarel|afkdsneddon, yes what should be done in this case to start ovs?18:40
dsneddonykarel|afk, As I was suggesting, I think a firstboot script to start OVS would work.18:40
ykarel|afkOk18:41
ykarel|afkor there is something called host-prep-task, won't that work18:41
ykarel|afki don't know much about it18:41
dsneddonykarel|afk, It might, I don't know anything about that either18:42
trozetdsneddon: why wouldnt you just systemctl enable openvswitch on the disk?18:42
dsneddonykarel|afk, Here is how to write a firstboot script: https://docs.openstack.org/tripleo-docs/latest/install/advanced_deployment/extra_config.html18:42
dsneddontrozet, Yeah, if you want to modify the image that's another easy fix18:42
trozetdsneddon: since the dataplane relies on openvswitch its safe to always have it enabled18:42
dsneddontrozet, Yeah, agreed18:43
ykarel|afktrozet, but so everywhere ovs is used, no other option like linuxbridge, etc18:43
dsneddontrozet, We already have logic to restart OVS with DPDK, which is the one case where os-net-config actually runs "systemctl restart openvswitch"18:43
openstackgerritBrent Eagles proposed openstack/puppet-tripleo master: Adding wrapper scripts for neutron agent subprocesses  https://review.openstack.org/55022418:43
trozetdsneddon: if the os-net-config is empty, then maybe OVS is getting started by https://github.com/openstack/puppet-vswitch/blob/master/manifests/ovs.pp#L9418:44
ykarel|afktrozet, but from containers ^^ is not working18:44
trozetykarel|afk: i dont understand what you mean by containers18:44
trozetykarel|afk: oh because tis the ML2?18:45
ykarel|afktrozet, https://review.rdoproject.org/paste/show/87/18:45
trozetykarel|afk: yeah this is why i said a while ago this needs to be removed from ML218:45
trozetykarel|afk: and tried to make openvswitch its own service in tripleo18:45
dsneddontrozet, My assumptions about OVS getting started because of the ifup script could be incorrect. Maybe it was started to begin with?18:45
ykarel|afktrozet, ack18:46
trozetykarel|afk: so in the container this is skipped becuase it isnt in puppet tags18:46
*** rbowen has quit IRC18:46
ykarel|afktrozet, systemctl don't work in chroot18:46
ykarel|afk()[root@overcloud-novacompute-bar-0 /]# systemctl status openvswitch18:47
ykarel|afkRunning in chroot, ignoring request.18:47
trozetykarel|afk: yeah but i dont think it should even attempt it, right? only the right puppet tags will get executed18:47
trozetykarel|afk: so then what tried to bring up OVS :) ?18:47
ykarel|afktrozet, haven't checked that yet18:47
dsneddontrozet, I know that I'm going to have to modify os-net-config to somehow restart the OVS container (which won't be restarted with "systemctl restart openvswitch"), so I'm curious about how to do that with an OVS container.18:47
Tengubeagles: just started the upgrade script. Will open an issue on launchpad once I get some information.18:48
beaglesTengu, ack thx18:48
trozetdsneddon: can you just use docker python api and restart it?18:49
dsneddontrozet, I suppose that would work.18:50
*** aputtur__ has quit IRC18:50
*** aputtur has joined #tripleo18:50
dsneddontrozet, That seems like it would depend on knowing the name of a specific container, but I'm not sure there is anything more standardized than that.18:50
dsneddonykarel|afk, I think trozet was right, the openvswitch service gets enabled here: https://github.com/openstack/puppet-vswitch/blob/master/manifests/ovs.pp#L8518:51
ykarel|afkdsneddon, and where this puppet module called up?18:51
ykarel|afkand when18:51
trozetdsneddon: i dont know why ovsdb-server is listed there i think that service is started when OVS is started18:51
mwhahahait is started by openvswitch18:52
mwhahahaas a dependency service18:52
*** raildo has quit IRC18:52
trozetmwhahaha: yeah so thats kind of weird18:52
*** fragatina has quit IRC18:53
*** nyechiel_ has quit IRC18:53
itlinuxhello guys, I want to build a bond with a specific nic.. using the mac address what's the option I should look at? Thanks18:55
itlinuxso I can add that to my template.18:56
*** jlabarre has joined #tripleo18:57
*** ebarrera has joined #tripleo18:58
itlinuxsince I could not find the answer in the https://docs.openstack.org/tripleo-docs/latest/install/advanced_deployment/network_isolation.html18:58
dsneddonitlinux, There is a way to do that, using mapping.yaml. I'm trying to find the documentation.18:58
itlinuxthanks dsneddon:18:58
trozetykarel|afk: called from neutron-ovs-agent.yaml which as you mentioned is a container18:59
ykarel|afkand that's not working19:00
trozetykarel|afk: with puppet_tags: neutron_config,neutron_agent_ovs,neutron_plugin_ml219:00
trozetykarel|afk: so that doesnt start ovs or try to19:01
trozetykarel|afk: ah but you know what19:01
mwhahahaykarel|afk, trozet, beagles: so i think we need a hostprep task for the neutron-vos-agent to ensure ovs is started https://github.com/openstack/tripleo-heat-templates/blob/a175c9e6aaf5d35f653fd14f05cf04ba069ea710/docker/services/neutron-ovs-agent.yaml19:01
trozetykarel|afk: nvm was going to say the neutron-ovs-agent systemd service depends on openvswitch, but that doesnt matter cause no systemd in the container19:01
mwhahahabecause we were inheriting the ovs service getting managed via the agent19:01
mwhahahaand not explicitly doing it anywhere19:02
mwhahahawhich worked when it wasn't containerized19:02
trozetmwhahaha: so i was just going to ask how you didnt hit this before...is it because this is the first attempt at containerizing the ovs agent?19:02
dsneddonitlinux, Can you read this? https://access.redhat.com/solutions/294002119:02
mwhahahacause i think it's specifically this causing problems: https://github.com/openstack/tripleo-heat-templates/blob/a175c9e6aaf5d35f653fd14f05cf04ba069ea710/docker/services/neutron-ovs-agent.yaml#L13819:02
dsneddonitlinux, If you don't have an account, I can paste the contents to paste.openstack.org19:02
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates master: Add support to ironic "direct" deploy interface  https://review.openstack.org/52934219:03
mwhahahai think the mounting of /var/run/openvswitch/db.sock before the service is causing problems19:03
mwhahahatrozet: evidently we weren't actually properly ensuring all the services were containerized19:03
mwhahahauntil recently19:03
* mwhahaha looks around19:03
beaglesmmm.. no that's not quite right19:03
ykarel|afkmwhahaha, i tried without it as well, and got Running in chroot, ignoring request.19:03
ykarel|afki mean without mounting of /var/run/openvswitch/db.sock19:04
beaglesthe ovs agent has been running in a container for quite some time (all of the octavia work was done this way)19:04
mwhahahabeagles: it might not have been in fs02019:04
beaglesmwhahaha, ah I see19:04
trozetmwhahaha: oh now i see the problem, you guys mount db.sock int eh docker service19:04
trozetmwhahaha: now it is clear :)19:04
mwhahaharight19:05
mwhahahaso when docker comes along and mounts it it's a directory19:05
mwhahahawhich is the error i saw from the service19:05
*** brault has joined #tripleo19:05
mwhahahaso we need to ensure ovs is running before any of the docker bits19:05
trozetmwhahaha: yep thats it19:05
itlinuxthanks dsneddon: I do not have an account.. I did when I was at Red Hat :)19:05
ykarel|afkmwhahaha, default containerization done 13 days ago: https://review.openstack.org/#/c/548554/ and it started failing after it19:05
mwhahahaor change the mount19:05
mwhahahaykarel|afk: right so we must have 'fixed' something when we switched it19:05
ykarel|afkmwhahaha, yes19:06
trozetmwhahaha: i think the *right* way is to use an openvswitch service to control openvswitch19:06
mwhahahathat was previously still inheriting a baremetal thing19:06
*** salmankhan has quit IRC19:06
dsneddonitlinux, Here is the file that actually does the work: https://github.com/openstack/tripleo-heat-templates/blob/master/firstboot/os-net-config-mappings.yaml19:06
mwhahahatrozet: well yes, but we don't have an openvswitch service officially19:06
* mwhahaha shrugs19:06
trozetmwhahaha: https://github.com/openstack/tripleo-heat-templates/blob/a175c9e6aaf5d35f653fd14f05cf04ba069ea710/puppet/services/openvswitch.yaml19:06
openstackgerritMarius Cornea proposed openstack/tripleo-upgrade master: DNM: use rhos-release role pin_puddle option  https://review.openstack.org/55501819:06
mwhahahais that missing?19:06
itlinuxok..19:06
trozetmwhahaha: that service should call puppet-vswitch to start ovs and configure it19:06
mwhahahafrom fs02019:06
beaglesmwhahaha, that was a dpdk only kind of deal19:07
itlinuxso I should use the -e ...templates/ option to get this right19:07
trozetmwhahaha: no its just the way it is done now that service is inherited by neutron-ovs-agent, and does not actually control OVS19:07
trozetmwhahaha: but it should in the future19:07
*** brault_ has quit IRC19:07
mwhahahayea probably19:07
trozetmwhahaha: especially if in the future you may want to containerze ovs, it needs to be its own legit service19:07
mwhahahathat's the missing config bits19:08
dsneddonitlinux, The script will loop through all nodes (node1, node2, etc.), and when it finds a matching MAC, it will lay down the mapping for that node (whatever node number it is, doesn't matter).19:08
mwhahahasince it was always just inheirieting19:08
*** shreshtha has quit IRC19:08
mwhahahaanyway i much lunch, i shall continue investigating later19:08
dsneddonitlinux, I'll paste the instructions for you19:08
itlinuxthanks dsneddon:19:09
trozetmwhahaha: https://bugs.launchpad.net/tripleo/+bug/165609619:09
openstackLaunchpad bug 1656096 in tripleo "[RFE] Split Open Vswitch its own service" [Wishlist,In progress] - Assigned to Tim Rozet (trozet)19:09
trozetykarel|afk:^19:09
dsneddonitlinux, http://paste.openstack.org/show/707901/19:09
*** ooolpbot has joined #tripleo19:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION19:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711119:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717419:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175747419:10
*** ooolpbot has quit IRC19:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]19:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]19:10
openstackLaunchpad bug 1757474 in tripleo "periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset027-master fails with undefined groups.overcloud" [Critical,Triaged]19:10
ykarel|afktrozet, ack19:11
itlinuxdsneddon: this file does not exist /usr/share/openstack-tripleo-heat-templates/firstboot/os-net-config-mappings.yaml19:11
*** jcoufal has quit IRC19:12
dsneddonitlinux, Interesting. What version of TripleO are you using? Does /usr/share/openstack-tripleo-heat-templates exist?19:13
itlinuxpike19:13
itlinuxone sec..19:14
itlinuxI was on the controller dam a$$19:14
dsneddonitlinux, It's pretty easy. Here's an example of what you add to network-environment.yaml to make it work: http://paste.openstack.org/show/707913/19:18
itlinuxthanks19:19
itlinuxso I could just add my option nic to my config like this..19:19
dsneddonitlinux, In that example, only the last line in resource_registry and the NetConfigDataLookup: block are what you add to your own file19:20
*** myoung is now known as myoung|biab19:22
*** ykarel|afk is now known as ykarel|away19:22
*** dprince has quit IRC19:23
*** jaosorior has quit IRC19:23
*** dprince has joined #tripleo19:23
*** ykarel|away has quit IRC19:28
openstackgerritMerged openstack/tripleo-heat-templates master: Add pre_upgrade_rolling_tasks  https://review.openstack.org/55207319:34
openstackgerritMerged openstack/tripleo-heat-templates master: Update service readme files  https://review.openstack.org/55332119:34
openstackgerritMerged openstack/tripleo-heat-templates master: Fixes ODL container failing to start due to missing etc config  https://review.openstack.org/55307919:34
openstackgerritMerged openstack/tripleo-validations master: Fix MySQL Open Files Limit validation  https://review.openstack.org/55488819:34
itlinuxdsneddon: so node1 I assume that is like compute1 and etc..19:34
itlinuxhow do I specify the diff between compute / controller?19:34
*** liverpooler has quit IRC19:34
*** radeks_ has joined #tripleo19:36
itlinuxdsneddon: I guess this looks correct? http://paste.openstack.org/show/707932/19:36
dsneddonitlinux, It really doesn't matter. The script will loop through each node and as soon as it finds a matching MAC address it knows it has found the right node and will write that node's mapping to disk.19:37
*** tesseract has quit IRC19:38
dsneddonitlinux, What you have there looks fine, but you also need the reference to os-net-config-mappings.yaml in the resource_registry: section of that same file19:38
*** radeks has quit IRC19:38
*** pickle is now known as dhill_19:39
itlinuxok so I just put all nodes in seq and then the role is assigned by the other file I have http://paste.openstack.org/show/707936/ and I could add them here. instead..19:39
*** radeks_ has quit IRC19:42
dsneddonitlinux, It really doesn't matter what sequence the nodes are in NetConfigDataLookup:, the script will loop through on each node, and find the matching MAC address.19:42
*** radeks_ has joined #tripleo19:43
*** ffiore has quit IRC19:43
dsneddonitlinux, It doesn't rely on the order that you put nodes into NetConfigDataLookup, only that a MAC address matches for each node.19:43
itlinuxok here is my new script then :) http://paste.openstack.org/show/707938/19:43
itlinuxif you can give it a blessing :) dsneddon:19:43
*** dprince has quit IRC19:43
dsneddonitlinux, Not quite. This line goes under "resource_registry:" OS::TripleO::NodeUserData: /usr/share/openstack-tripleo-heat-templates/firstboot/os-net-config-mappings.yaml19:43
dsneddonitlinux, And you have too much indentation in the NetConfigDataLookup block19:44
*** jaosorior has joined #tripleo19:44
itlinuxok fixing now..19:44
*** sri_ has quit IRC19:46
alee_mwhahaha, beagles -- just getting back to this -- any progress>19:46
alee_?19:46
itlinuxhttp://paste.openstack.org/show/707944/ dsneddon:19:47
dsneddonitlinux, That looks right. If you want to be super-picky, you can add one whitespace in front of the OS::TripleO... to line it up with the rest.19:48
itlinuxthanks will do now19:48
*** athomas has quit IRC19:48
itlinuxfinal version http://paste.openstack.org/show/707946/19:49
itlinuxthanks again dsneddon: much appreciated!19:50
*** rfolco is now known as rfolco|ruck19:51
openstackgerritTim Rozet proposed openstack/tripleo-heat-templates stable/queens: Fixes ODL container failing to start due to missing etc config  https://review.openstack.org/55503519:55
*** moshele has joined #tripleo19:55
*** eck` is now known as eck`gone19:57
mwhahahaalee_: so we left it at it seems like the openvswitch service is not properly started on the system20:03
mwhahahaalee_: so there's probably a few ways to tackle that20:03
mwhahahai'm going to look at something to see if it's that we just need to stop mounting /var/run/openvswitch/db.sock and mount the dir instead20:03
*** chem has quit IRC20:04
mwhahahathere's another problem in that we appear to be doing vsctl commands and it wasn't running and we didn't fail20:05
*** chem has joined #tripleo20:05
*** chem has quit IRC20:06
*** moshele has quit IRC20:06
weshaymwhahaha++20:06
alee_mwhahaha, how do we know that the openvspwitch service was not properly started?20:06
mwhahahaalee_: it's not running and there are no logs20:06
mwhahahaalee_: from weshay's reproducer we see that it's failing to launch because /var/run/openvswitch/db.sock is a folder evidently20:06
mwhahahaMar 21 18:22:03 overcloud-novacompute-bar-0 ovsdb-server[129745]: ovs|00005|ovsdb_jsonrpc_server|ERR|punix:/var/run/openvswitch/db.sock: listen failed: Is a directory20:07
*** chem has joined #tripleo20:07
*** radeks_ has quit IRC20:07
mwhahahaweshay: i'm off that box now20:07
* mwhahaha goes to fiddle with code20:07
weshayk.. thanks20:08
alee_mwhahaha, I wonder if thats the same case in https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset017-queens/6e4ffba/undercloud/home/jenkins/tempest/tempest.html.gz20:08
alee_mwhahaha, which is the case in the trello board20:08
*** akrivoka has quit IRC20:09
alee_mwhahaha, I see openvswitch logs there I think ,,20:09
Tengubeagles: stoll there? apparently OVS has some issues, its log if full of  2018-03-21T19:30:10.742Z|00359|rconn|WARN|br-ex<->tcp:127.0.0.1:6633: connection failed (Connection refused) lines (with other br-FOO)20:09
Tengu*sitll20:09
mwhahahaalee_: oh right so that's different, i was looking at the fs020 error20:09
mwhahahaalee_: let me check that one20:09
Tengu… darn. should go to bed, can't type anymore.20:09
mwhahahatoo many failures20:09
alee_mwhahaha, ah - and I thought you'd solved my isssue :)20:10
mwhahahaalee_: so that one is different20:10
mwhahahaalee_: but that one looks like my 3node failures20:10
*** ooolpbot has joined #tripleo20:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION20:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711120:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717420:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175747420:10
*** ooolpbot has quit IRC20:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]20:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]20:10
openstackLaunchpad bug 1757474 in tripleo "periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset027-master fails with undefined groups.overcloud" [Critical,Triaged]20:10
mwhahahaalee_: if you look in the ovs-vswitchd you see errors about no such device20:10
mwhahahaalee_: so we might need an ovs expert on that one20:10
mwhahahaalee_: https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset017-queens/6e4ffba/subnode-2/var/log/openvswitch/ovs-vswitchd.log.txt.gz20:10
mwhahahaalee_: 2018-03-21T06:49:47.454Z|00056|netdev_linux|WARN|tap13c9ce98-76: removing policing failed: No such device20:10
mwhahahastuff like that20:10
Tenguhmmmmm. miht be linked to the LBaaS… its log is also full of issues. duh.20:10
Tenguanyway. will open the issue tomorrow, it's late here now.20:11
*** radeks_ has joined #tripleo20:11
alee_mwhahaha, ok  - who do we need to look at that?20:12
mwhahahathe mystical networking folks20:12
* mwhahaha waves hands20:12
* mwhahaha has no idea20:12
alee_beagles, ?20:13
alee_mwhahaha, ok - I see that on ykarel's reproducer machine too20:14
* bnemec would argue that none of the ci categories is green right now20:15
ianwhi everyone, https://review.openstack.org/554705 seems to be failing in the gate ... it's a bit of an issue because it blocks dib20:15
bnemecIn case anyone else is wondering if they should recheck20:15
bnemecianw: It's a known problem.  I think mwhahaha is looking into it.20:16
*** mwhahaha changes topic to "Welcome to Rocky. CI status - Promotions: RED; check/gate: RED; RDO CI jobs: Questionable | http://tripleo.org/ | https://docs.openstack.org/tripleo-docs/latest/"20:16
beaglesalee_, that's pretty wild20:17
beaglesalee_, but...20:17
ianwok, thanks.  i might have to drop the tripleo test from dib for a while as we need a new release20:17
beaglesalee_, it reeks of network namespace issues20:18
bnemecmwhahaha: Thanks20:19
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Remove tripleo jobs  https://review.openstack.org/55503720:19
mwhahahaweshay: in the promotion jobs, where are the images build from? is there a cached set i could pull?20:23
weshaymwhahaha, https://images.rdoproject.org/master/rdo_trunk/20:24
weshaymwhahaha, tripleo-ci-testing would be the very latest that didn't pass20:24
mwhahahak20:24
mwhahahai'm so sick of looking into images20:25
mwhahahanext time it's an image problem i'm leaving20:25
* mwhahaha shakes fists at libguestfs20:25
slaglewhat color is Questionable20:26
slaglelet's use Chartreuse20:26
mwhahahaworks for me20:28
bnemecChartreuse is pretty close to green though.20:30
bnemecNot that most people probably know that. :-)20:30
* bnemec has chartreuse fishing lures20:30
mwhahahasoooo20:31
mwhahahawe no longer automagically enable openvswitch20:31
mwhahahaas of queens20:31
mwhahahait's missing from the overcloud image20:31
mwhahahain pike, /etc/systemd/system/multi-user.target.wants/openvswitch.service is defined20:32
mwhahahanot so much for master/queens20:32
mwhahahai have no idea how any of this has worked for the last few months20:32
* mwhahaha gives up20:32
hjensasslagle: Purple is the color most associated with ambiguity. Like other colors made by combining two primary colors, it is seen as __uncertain__ and equivocal. ("Eva Heller, Psychologie de la couleur: effets et symboliques)20:33
*** jaosorior_ has joined #tripleo20:33
* bnemec looks at http://tripleo.org/20:34
bnemecYep, color scheme checks out.20:34
hjensaslol20:34
*** eck`gone is now known as eck`20:36
*** aputtur has quit IRC20:37
*** jaosorior has quit IRC20:37
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates master: Add HostsEntry to undercloud's /etc/hosts  https://review.openstack.org/55504120:37
weshaymwhahaha, hrm.. so now we a check on DIB?20:41
openstackgerritMarius Cornea proposed openstack/tripleo-upgrade master: Use rhos-release role pin puddle option  https://review.openstack.org/55501820:41
mwhahahaso anyone want to take a guess as to how openvswitch was previously enabled by default on the overcloud-full.qcow220:41
mwhahahaweshay: I am here: ¯\_(ツ)_/¯20:41
weshayweee20:42
* mwhahaha assumes packaging20:42
* mwhahaha has no idea20:42
mwhahahacause pike had 2.7.320:42
mwhahahaand queens has 2.8.220:42
weshayrc.fit20:42
mwhahahabut i didn't spot anything in the spec20:42
*** amoralej is now known as amoralej|off20:43
mwhahahaah ha20:45
mwhahahahttps://github.com/openstack/tripleo-image-elements/blob/master/elements/openvswitch/install.d/74-openvswitch20:45
mwhahahawhich hasn't changed in 2 years20:45
* mwhahaha blames bnemec 20:46
mwhahahahe who last touches, it supports it for life20:46
*** myoung|biab is now known as myoung20:47
bnemecWe were still using that?!20:47
mwhahahai have no idea20:48
mwhahahabut that's the only reference i can find to enabling openvswitch :D20:48
mwhahahathough the service name is probably wrong now20:48
mwhahahaneed to go find the build logs20:48
*** radeks_ has quit IRC20:48
bnemecI mean, it wouldn't be the first time something I thought we had removed was still in use.20:48
mwhahahaweshay: do we keep the build logs for the images stuff around somewhere?20:49
weshaygetting20:49
openstackgerritJames Slagle proposed openstack/tripleo-common master: Add support for writing hostnames to inventory  https://review.openstack.org/55504920:49
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Begin adding environments with all params for a service  https://review.openstack.org/47592420:49
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Add ability to generate an environment index  https://review.openstack.org/49192520:49
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: WIP: Add service config env with all Designate settings  https://review.openstack.org/55500820:49
weshaymwhahaha, https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-queens-upload/e4728dd/undercloud/home/jenkins/overcloud_image_build.log.txt.gz20:49
openstackgerritJames Slagle proposed openstack/tripleo-common master: Add support for writing hostnames to inventory  https://review.openstack.org/55504920:51
mwhahahabnemec: so yea we were still using it20:52
mwhahahahttps://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-pike-upload/c54e735/undercloud/home/jenkins/overcloud_image_build.log.txt.gz#_2018-02-12_15_07_1520:52
mwhahahaexists in pike20:52
mwhahahadoes not in queens20:52
*** khyr0n has quit IRC20:52
openstackgerritSteve Baker proposed openstack/python-tripleoclient master: Use container_images_file for all image prepare  https://review.openstack.org/55467620:54
openstackgerritSteve Baker proposed openstack/python-tripleoclient master: Do container image prepare during undercloud deploy  https://review.openstack.org/54602420:54
openstackgerritSteve Baker proposed openstack/python-tripleoclient master: Use the build_service_filter from kolla_builder  https://review.openstack.org/55505120:54
weshaytrown, fyi ^20:55
openstackgerritJames Slagle proposed openstack/tripleo-validations master: Add --use-hostnames to tripleo-ansible-inventory  https://review.openstack.org/55505220:55
mwhahahawe lost a bunch of stuff20:56
mwhahahapike https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-pike-upload/c54e735/undercloud/home/jenkins/overcloud_image_build.log.txt.gz#_2018-02-12_15_07_1620:56
mwhahahavs queens https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-queens-upload/e4728dd/undercloud/home/jenkins/overcloud_image_build.log.txt.gz#_2018-03-21_06_18_2520:56
trowninteresting20:57
bnemecmwhahaha: Oh, I bet it's: https://github.com/openstack/tripleo-image-elements/blob/98b9c6a5145bbcb46a6d22265f63e219199599ba/elements/os-net-config/element-deps20:57
bnemecWe removed that recently, right?20:57
bnemecopenvswitch was previously getting pulled in as a dep of os-net-config.20:58
mwhahahawe did?20:58
mwhahahawe probably did20:58
mwhahahaso we need to add it back in or something20:58
bnemecThought so.  Let me look.20:58
bnemecmwhahaha: https://github.com/openstack/tripleo-common/commit/bce76efbcdf39383cae627c46634f2ca1b9aaf6b#diff-28d0b8f7801642ca03031c884319bc5a20:59
bnemecNo idea how any of this has worked since then though.20:59
mwhahahamagic20:59
mwhahahacause we weren't properly containerizing things20:59
mwhahahaso it was getting started else where20:59
mwhahahagood i didn't have anything to do with that change so i can blame everyone else20:59
* mwhahaha points fingers20:59
bnemeclol21:00
weshayit's his favorite thing21:00
* bnemec 's favorite thing is "I told you so"21:00
bnemecNot sure it applies in this case though.21:00
weshayhttps://goo.gl/images/v4d7re21:00
mwhahahawell i guess it's time to add openvswitch back in21:01
mwhahahawe should have explicitly had that defined anyway21:01
mwhahahai'm more concerned how this worked at all downstream21:02
*** trown is now known as trown|outtypewww21:02
mwhahahabut that's a whole other issue21:02
*** fragatina has joined #tripleo21:06
mwhahahaheh if we were using the overcloud-realtime-compute images we wouldn't have these problems21:07
weshaymwhahaha, it worked due to net-iso covering it up21:07
weshaynot sure  if there is a mix of jobs w/ and w/o net-iso21:08
mwhahahawell i'm surprised that the lack of openvswitch starting didn't show up somewhere else21:08
weshaymwhahaha, also not sure if lon's selinux patch is a valid21:08
weshaymwhahaha, meh.. who needs networking21:08
mwhahahadevstack is a single node, it's good enough for prod right21:08
*** moshele has joined #tripleo21:09
*** ooolpbot has joined #tripleo21:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION21:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711121:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,Triaged]21:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717421:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175747421:10
*** ooolpbot has quit IRC21:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]21:10
openstackLaunchpad bug 1757474 in tripleo "periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset027-master fails with undefined groups.overcloud" [Critical,Triaged]21:10
*** gfidente|afk has quit IRC21:10
openstackgerritAlex Schultz proposed openstack/tripleo-common master: Add openvswitch element back in  https://review.openstack.org/55505621:10
mwhahahaweshay, bnemec -^21:10
jaosorior_weshay: multinode jobs don't use netiso afaik21:11
*** pchavva has quit IRC21:11
mwhahahajaosorior_: yea cause it's only a single node21:11
weshayjaosorior_, yup.. either do a couple of the ovb jobs21:11
mwhahahaso it's geting started elsewhere21:11
mwhahahait's just the compute nodes where this manifests itself21:11
jaosorior_mwhahaha: ovb doesn't use netiso? wtf21:11
weshaymwhahaha, backporting to queens as well?21:11
mwhahahaweshay: yea will be21:11
mwhahahaonce we verify it work sin master21:11
weshaysinner21:12
mwhahahapretty much21:12
*** bnemec is now known as sin-master21:12
sin-masterAnd it's not even Friday yet!21:12
weshayrfolco|ruck, look for master fs20 first.. no queens backport yet21:12
*** sin-master is now known as bnemec21:12
rfolco|ruckweshay, ack21:12
mwhahahaok so now we need someone from networking to fix the barbican/3node thing in ovs21:13
* mwhahaha moves on to that issue next so we can downgrade to rainbow from qustionable21:13
weshaylolz21:14
mwhahahaalright now where did i put those logs for that issue21:15
*** raildo has joined #tripleo21:15
openstackgerritLiz Blanchard proposed openstack/tripleo-ui master: Remove h1 page header on deployment plan page  https://review.openstack.org/55506021:16
weshaymwhahaha, fwiw.. it might be easier / faster to test the fix on queens than master21:16
weshayas we'll surely hit some other bs on master21:16
mwhahahawell if we can actually land the stupid thing21:17
* mwhahaha sighs21:17
weshaylol21:17
mwhahahathe 3node thing blocks the thing that'll block that from landing21:17
mwhahahasince the image building job is dorked21:17
mwhahahawhich is what ianw was talking about21:18
mwhahahaso we need to figure out why 3node is flakey21:18
weshayya.. saw that21:18
weshayhrm21:19
*** chem has quit IRC21:20
EmilienMsorry for spam21:21
EmilienMbut I hav eno choice to rebase21:21
*** chem has joined #tripleo21:21
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: undercloud: workaround for masquerading network in CI/OVB  https://review.openstack.org/55362021:21
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: undercloud: workaround for masquerading network in CI/OVB  https://review.openstack.org/55362021:21
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: roles: rename overcloud-prep-containers to prep-containers  https://review.openstack.org/54301421:22
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: prep-containers: include containerized undercloud bits  https://review.openstack.org/54302421:22
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: undercloud: add missing TLS environments when preparing containers  https://review.openstack.org/54544421:22
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: Remove adjust-interface-mtus script  https://review.openstack.org/54621621:22
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: DO NOT REVIEW - Workarounds for containerized undercloud  https://review.openstack.org/54562821:22
* mwhahaha blames EmilienM 21:23
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart master: Run ovb-ha (fs001) with a containerized undercloud  https://review.openstack.org/54255621:23
EmilienMagain?21:23
mwhahahayou were on that os-net-config patch review, i can blame you :D21:24
EmilienMright21:25
EmilienMit was my ghost21:25
*** bfournie has quit IRC21:25
mwhahahathe ghost of reviews past21:25
*** ansmith has quit IRC21:26
* bnemec both +2'd and WIP'd it21:27
bnemecSo...neutral?21:27
*** ktibi has quit IRC21:27
bnemecI believe I also explicitly said it could merge when it wouldn't break CI. :-P21:27
EmilienMwhy our CI jobs didn't fail?21:28
mwhahahathey did, eventually21:28
mwhahahahouse of very thick cards21:29
weshaycastle of cards?21:31
mwhahahawith a moat!21:31
weshayin fact we do have a moat21:32
weshayEmilienM, you still in europe?21:32
EmilienMno I'm in canadaland21:33
openstackgerritHarald Jensås proposed openstack/python-tripleoclient master: Contanerized Undercloud - Routed Spine-Leaf  https://review.openstack.org/54345521:40
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Add ctlplane networking for routed networks  https://review.openstack.org/54732621:41
*** tcw has quit IRC21:42
EmilienMomg this is awesome https://beagle-hound.readthedocs.io/en/latest/21:44
*** agopi is now known as agopi|dinner21:46
*** rbrady is now known as rbrady-afk21:46
mwhahaharfolco|ruck, weshay: so has no one opened a bug for the flakey 3node yet?21:47
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Manage upgrades to a containerized undercloud  https://review.openstack.org/54962421:47
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Manage upgrades to a containerized undercloud  https://review.openstack.org/54962421:48
weshaymwhahaha, I'm talking to rfolco|ruck about it right now21:48
weshaywe'll have one up shortly21:48
mwhahahak i think it also needs some eyes from the networking dfg on it21:48
* mwhahaha doesn't see anything glaring21:48
mwhahahaother than some ovs-vswitchd warnings about No such device21:49
*** moshele has quit IRC21:50
*** itlinux has quit IRC21:51
weshaymwhahaha, I see timeouts in some of the neutron agents.. where do you see the no such device21:52
EmilienMin the journal21:52
EmilienMno?21:53
* weshay looks21:53
mwhahahano21:53
EmilienMhave we kidnapped ihrachys yet?21:53
mwhahahahttp://logs.openstack.org/13/554213/1/check/tripleo-ci-centos-7-3nodes-multinode/85d6763/logs/subnode-3/var/log/openvswitch/ovs-vswitchd.log.txt.gz21:53
mwhahaha2018-03-21T12:35:35.875Z|00077|bridge|INFO|bridge br-int: added interface qg-08994f8f-0d on port 321:53
mwhahaha2018-03-21T12:35:36.525Z|00078|bridge|INFO|bridge br-int: added interface tapce4be856-e6 on port 421:53
mwhahaha2018-03-21T12:35:37.452Z|00079|netdev_linux|INFO|ioctl(SIOCGIFHWADDR) on qg-08994f8f-0d device failed: No such device21:53
*** agopi|dinner has quit IRC21:53
EmilienMI think "No such device" is garbage21:54
ihrachyswat. scrolling up21:54
ihrachysyeah no such device happens all the time21:54
weshaylots of neutron errors in http://logs.openstack.org/22/531322/13/gate/tripleo-ci-centos-7-3nodes-multinode/fb044de/logs/subnode-3/var/log/extra/errors.txt.gz21:54
weshaytime outs though21:54
* weshay checks concurrency.. again21:54
mwhahahathere is usally one on start21:55
mwhahahasince we lost service containment with the docker switch21:55
mwhahaharabbit may or maynot be started21:55
ihrachysright. or host is overloaded so it will eventually back off timeout and hopefully manage to get reply21:55
*** pcaruana has quit IRC21:55
weshayheh.. just the one test there. so it's not overloaded21:57
*** rfolco|ruck is now known as rfolco|off21:57
mwhahahayea that's a service startup error21:57
ihrachyswhat's the definition of 'flakey'21:57
mwhahahaihrachys: 25% failure21:57
mwhahahaso not 100%21:57
mwhahahabut enough to be blocking21:58
ihrachysok21:58
mwhahahaihrachys: so according to http://cistatus.tripleo.org/ we failed 38% today21:59
mwhahahatripleo-ci-centos-7-3nodes-multinode21:59
mwhahahaon the gate, http://cistatus.tripleo.org:8000/ we've failed 18%21:59
openstackgerritSteve Baker proposed openstack/python-tripleoclient master: Use container_images_file for all image prepare  https://review.openstack.org/55467621:59
openstackgerritSteve Baker proposed openstack/python-tripleoclient master: Use the build_service_filter from kolla_builder  https://review.openstack.org/55505121:59
openstackgerritSteve Baker proposed openstack/python-tripleoclient master: Do container image prepare during undercloud deploy  https://review.openstack.org/54602421:59
mwhahahathe last 3 have been the tempest fails22:00
*** jmelvin has quit IRC22:00
ihrachys"tempest.lib.exceptions.SSHTimeout: Connection to the 192.168.24.100 via SSH timed out."22:02
ihrachysI love those22:02
ihrachysso informative /s22:02
ihrachysprobably half of issues I ever look at start with this22:02
mwhahahathe last time this was the stupid thing where the metadata was leaking to the undercloud22:02
ihrachyswell this is catch-all thingy22:03
ihrachys"SOMETHING HAPPENED"22:03
* mwhahaha is familiar with this poor error messaging concept 22:03
ihrachysso fip created, it gets to ACTIVE, but ssh times out. classic.22:04
mwhahahamagic black hole22:06
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates master: Mount openvswitch dir rather than socket  https://review.openstack.org/55507722:06
*** cdearborn_ has quit IRC22:08
openstackgerritMarius Cornea proposed openstack/tripleo-upgrade stable/queens: Set osd_scenario and journals during ceph params conversion  https://review.openstack.org/55507922:08
* mwhahaha wanders off22:08
weshayihrachys, fyi https://bugs.launchpad.net/tripleo/+bug/175755622:10
openstackLaunchpad bug 1757556 in tripleo "timeouts in neutron are causing ssh failures in tempest test instances" [Critical,Triaged]22:10
*** ooolpbot has joined #tripleo22:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION22:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711122:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,In progress] - Assigned to Alex Schultz (alex-schultz)22:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717422:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175747422:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175755622:10
*** ooolpbot has quit IRC22:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]22:10
openstackLaunchpad bug 1757474 in tripleo "periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset027-master fails with undefined groups.overcloud" [Critical,Triaged]22:10
*** raildo has quit IRC22:15
ihrachysmwhahaha, weshay the test claims ping worked22:16
ihrachysbut ssh timed out22:16
ihrachysso probably security groups?22:17
weshayas in they were not open?22:18
weshayihrachys, so.. I'll bring up an env.. so we can poke at it22:18
weshayeasier that way22:18
weshayfyi ihrachys http://logs.openstack.org/22/531322/13/gate/tripleo-ci-centos-7-3nodes-multinode/fb044de/logs/reproducer-quickstart.sh22:19
*** yamahata has joined #tripleo22:23
ihrachysweshay, as in maybe port is closed, which may be either test doesn't configure it (since it passes I don't think it's the issue) or ovs agent fails to configure it22:24
*** rcernin has joined #tripleo22:25
ihrachysthe test creates those rules at ~16:17:32,713 so that side is ok22:25
*** liverpooler has joined #tripleo22:26
mwhahahaIt works sometimes22:26
mwhahahaSo it seems like a race somewhere22:26
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Revert "Prepare t-h-t for undercloud in a work dir"  https://review.openstack.org/55508522:28
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Revert "Prepare t-h-t for undercloud in a work dir"  https://review.openstack.org/55508522:29
*** threestrands has joined #tripleo22:29
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Manage upgrades to a containerized undercloud  https://review.openstack.org/54962422:29
*** threestrands has quit IRC22:30
*** threestrands has joined #tripleo22:30
*** d0ugal has quit IRC22:34
*** ccamacho has quit IRC22:35
openstackgerritMarius Cornea proposed openstack/tripleo-upgrade master: Include connectivity check prepare scripts during FFU  https://review.openstack.org/55491422:36
*** ccamacho has joined #tripleo22:36
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Manage upgrades to a containerized undercloud  https://review.openstack.org/54962422:37
*** d0ugal has joined #tripleo22:37
*** thrash is now known as thrash|g0ne22:39
*** mcornea has quit IRC22:57
*** raildo has joined #tripleo23:06
*** Goneri has quit IRC23:06
*** dparkes has joined #tripleo23:07
*** jtomasek has quit IRC23:08
*** ooolpbot has joined #tripleo23:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION23:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175711123:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175717423:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175747423:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/175755623:10
openstackLaunchpad bug 1757111 in tripleo " fs020(both queens/master) tempest tests failing while booting an instance" [Critical,In progress] - Assigned to Alex Schultz (alex-schultz)23:10
*** ooolpbot has quit IRC23:10
openstackLaunchpad bug 1757174 in tripleo "tripleo-buildimage-overcloud-full-centos-7 failing with diskimage_builder.element_dependencies.MissingElementException: Element 'size=4096'' not found" [Critical,Triaged]23:10
openstackLaunchpad bug 1757474 in tripleo "periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset027-master fails with undefined groups.overcloud" [Critical,Triaged]23:10
openstackLaunchpad bug 1757556 in tripleo "timeouts in neutron are causing ssh failures in tempest test instances" [Critical,Triaged]23:10
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Fix default partition type  https://review.openstack.org/55477123:11
*** itlinux has joined #tripleo23:20
*** tosky has quit IRC23:28
*** dparkes has quit IRC23:31
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Manage upgrades to a containerized undercloud  https://review.openstack.org/54962423:44
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates master: Add HostsEntry to undercloud's /etc/hosts  https://review.openstack.org/55504123:48
openstackgerritJames Slagle proposed openstack/tripleo-common master: Add support for writing hostnames to inventory  https://review.openstack.org/55504923:48
openstackgerritJames Slagle proposed openstack/tripleo-common master: Retry previously failed deployments  https://review.openstack.org/55427623:57

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!