*** ooolpbot has joined #tripleo | 00:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 00:10 |
---|---|---|
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713832 | 00:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 00:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714755 | 00:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 00:10 |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 00:10 |
*** ooolpbot has quit IRC | 00:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 00:10 |
openstack | Launchpad bug 1714755 in tripleo "CI: telemetry tempest tests fail for scenario 001 in periodic jobs" [Critical,In progress] - Assigned to Arx Cruz (arxcruz) | 00:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 00:10 |
EmilienM | pabelanger: have a safe trip | 00:20 |
EmilienM | stevebaker: I won't -1 https://review.openstack.org/#/c/500646/1/docker/services/database/mongodb.yaml but you have a trailing space | 00:25 |
*** achadha has joined #tripleo | 00:31 | |
stevebaker | EmilienM: ok, I think we want to leave it in ci/environments/* because we want to apply mongodb-disabled.yaml | 00:39 |
EmilienM | stevebaker: how would we apply it? | 00:40 |
*** psahoo has joined #tripleo | 00:40 | |
stevebaker | mongodb-disabled.yaml is the default in overcloud-resource-registry-puppet.j2.yaml, so including OS::TripleO::Services::MongoDb in the controller should apply it | 00:41 |
*** leitan has quit IRC | 00:44 | |
*** daidv has joined #tripleo | 00:45 | |
EmilienM | stevebaker: ah right, so we're good | 00:48 |
EmilienM | I missed that piece, thanks | 00:48 |
stevebaker | EmilienM: I think so | 00:48 |
EmilienM | stevebaker: wait, look at scenario002-multinode.yaml: we override it to ../../puppet/services/database/mongodb.yaml | 00:49 |
EmilienM | we need to drop that | 00:49 |
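The override discussed above can be pictured with a small sketch. It is a toy model, not Heat's implementation: it assumes only that environment files are merged in order and that a later resource_registry entry replaces an earlier one. The template paths are the ones quoted in the chat; the helper name `merge_registries` is hypothetical.

```python
# Toy model of resource_registry merging: later environments win.
# This is an illustration only, not how Heat is implemented.

def merge_registries(*registries):
    """Merge registries left to right; later mappings override earlier ones."""
    merged = {}
    for registry in registries:
        merged.update(registry)
    return merged


# Default mentioned in the chat: the MongoDb service maps to the disabled template.
default_registry = {
    "OS::TripleO::Services::MongoDb": "mongodb-disabled.yaml",
}

# Override seen in scenario002-multinode.yaml, as quoted in the chat.
scenario002_registry = {
    "OS::TripleO::Services::MongoDb": "../../puppet/services/database/mongodb.yaml",
}

effective = merge_registries(default_registry, scenario002_registry)
print(effective["OS::TripleO::Services::MongoDb"])
```

With the scenario override in place the disabled default never takes effect, which is why dropping the override was suggested.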
stevebaker | EmilienM: mmm, unless that scenario is supposed to have mongo? | 00:50 |
EmilienM | stevebaker: tbh, I don't know | 00:50 |
*** gbarros has joined #tripleo | 00:51 | |
EmilienM | we'll need to check with pradk when he's online tomorrow | 00:51 |
EmilienM | on stable/pike, the upgrade times out: http://logs.openstack.org/96/500596/1/check/gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv/e7f818e/logs/undercloud/home/jenkins/overcloud_deploy.log.txt.gz#_2017-09-04_19_15_17 | 00:52 |
EmilienM | I'm more concerned about this one for now | 00:52 |
EmilienM | I'm wondering if we're setting the right variable to deploy ocata | 00:53 |
stevebaker | me too | 00:54 |
EmilienM | stevebaker: before tackling scenarios, let's get gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv working on stable/pike | 00:54 |
EmilienM | it would be huge progress | 00:54 |
EmilienM | I'm pretty sure it's something in quickstart or quickstart-extras | 00:54 |
*** leitan has joined #tripleo | 00:55 | |
EmilienM | see http://logs.openstack.org/96/500596/1/check/gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv/e7f818e/logs/rpm-qa.txt.gz | 00:55 |
EmilienM | we're deploying pike instead of ocata | 00:56 |
EmilienM | oh that's the undercloud, let me check overcloud | 00:56 |
EmilienM | yeah same :/ | 00:56 |
*** dixiaoli has joined #tripleo | 00:57 | |
EmilienM | we need to look here http://logs.openstack.org/96/500596/1/check/gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv/e7f818e/console.html | 00:58 |
EmilienM | why we deploy pike instead of ocata | 00:58 |
EmilienM | probably something in oooq or oooq-extras | 00:59 |
*** colonwq has joined #tripleo | 00:59 | |
EmilienM | stevebaker: I'm going to dinner now, I'll catch up later | 01:00 |
stevebaker | EmilienM: ok, will look | 01:00 |
EmilienM | hopefully we can solve this one first, I think it's high prio | 01:00 |
EmilienM | stevebaker: I think getting gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv working on stable/pike will be a green light for the final pike release | 01:00 |
EmilienM | we can get the scenarios working afterward, I remember we had the same thing during the ocata release | 01:01 |
EmilienM | but if we know the basic workflow is in place, we can make the job voting to avoid regressions | 01:01 |
* EmilienM bbl | 01:01 | |
*** bkopilov has quit IRC | 01:28 | |
*** StevenK_ is now known as StevenK | 01:30 | |
*** dmacpher has joined #tripleo | 01:48 | |
*** fzdarsky_ has joined #tripleo | 01:51 | |
openstackgerrit | Merged openstack/tripleo-heat-templates stable/pike: Add clustercheck to service list for scenarios https://review.openstack.org/499133 | 01:54 |
*** fzdarsky|afk has quit IRC | 01:55 | |
stevebaker | EmilienM: I think we want to set STABLE_RELEASE=ocata http://logs.openstack.org/25/500625/1/check/gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv/a1c7297/console.html#_2017-09-04_19_26_24_530553 | 02:02 |
stevebaker | EmilienM: oh, wait | 02:03 |
*** achadha has quit IRC | 02:08 | |
*** lblanchard has quit IRC | 02:09 | |
*** gkadam has joined #tripleo | 02:15 | |
*** gkadam has quit IRC | 02:17 | |
*** dougbtv_ has quit IRC | 02:18 | |
*** gkadam has joined #tripleo | 02:18 | |
openstackgerrit | Tom Barron proposed openstack/tripleo-heat-templates stable/ocata: manila: set "host" to "hostgroup" https://review.openstack.org/499111 | 02:18 |
openstackgerrit | Steve Baker proposed openstack/tripleo-quickstart master: Add missing pike-undercloud-ocata-overcloud.yml https://review.openstack.org/500657 | 02:32 |
openstackgerrit | Steve Baker proposed openstack/tripleo-heat-templates stable/pike: Upgrade CI test on stable/pike - never merge https://review.openstack.org/500625 | 02:33 |
stevebaker | EmilienM: let's see what ^ does | 02:34 |
*** leitan has quit IRC | 02:36 | |
*** bkopilov has joined #tripleo | 02:46 | |
*** jlabarre has quit IRC | 02:52 | |
*** gbarros has quit IRC | 03:07 | |
*** noslzzp has quit IRC | 03:11 | |
*** noslzzp has joined #tripleo | 03:14 | |
*** noslzzp has quit IRC | 03:16 | |
*** shreshtha has joined #tripleo | 03:19 | |
*** shreshtha has quit IRC | 03:24 | |
*** shreshtha has joined #tripleo | 03:34 | |
*** shreshtha has quit IRC | 03:35 | |
*** shreshtha has joined #tripleo | 03:35 | |
*** cshastri has joined #tripleo | 03:37 | |
EmilienM | stevebaker: ok | 03:45 |
EmilienM | back from late dinner | 03:46 |
EmilienM | stevebaker: it's deploying the overcloud, let's see :) | 03:47 |
*** ramishra has joined #tripleo | 03:48 | |
*** links has joined #tripleo | 03:51 | |
EmilienM | stevebaker: https://review.openstack.org/#/c/500657/1/config/release/pike-undercloud-ocata-overcloud.yml is an excellent idea but the code itself will need some work. I'll post a comment; it's just about using mirrors, so nothing bad, but I think you found it :) | 03:52 |
EmilienM | stevebaker: I'm not sure config/release/pike-undercloud-ocata-overcloud.yml is used by quickstart jobs but I might have missed something | 03:54 |
*** ykarel has joined #tripleo | 03:56 | |
EmilienM | stevebaker: I thought config/release/tripleo-ci/pike-undercloud-ocata-overcloud.yml would be enough | 03:57 |
EmilienM | so on master, upgrade job deploys config/release/tripleo-ci/master-undercloud-pike-overcloud.yml | 04:04 |
EmilienM | and on stable/pike: config/release/tripleo-ci/pike-undercloud-ocata-overcloud.yml | 04:06 |
EmilienM | so your patch isn't useful, sorry :( | 04:06 |
*** achadha has joined #tripleo | 04:09 | |
EmilienM | stevebaker: but we have a first upgrade job working on stable/pike, from ocata | 04:10 |
*** achadha has quit IRC | 04:13 | |
*** garyk has joined #tripleo | 04:18 | |
*** udesale has joined #tripleo | 04:18 | |
*** ratailor has joined #tripleo | 04:18 | |
*** psachin has joined #tripleo | 04:21 | |
EmilienM | stevebaker: I saw something weird, when we upgrade, in stable/pike THT is on https://github.com/openstack/tripleo-heat-templates/commit/a4fa1c3 | 04:22 |
EmilienM | which is 11 days ago | 04:22 |
EmilienM | which is our RC1 I think | 04:22 |
EmilienM | and if you look at https://dashboards.rdoproject.org/rdo-dev | 04:24 |
EmilienM | it looks like we haven't had a promotion in 11 days | 04:24 |
EmilienM | so it means CI picks packages from the promoted repo instead of current | 04:24 |
EmilienM | (packages like THT have to come from current) | 04:24 |
EmilienM | http://logs.openstack.org/45/500145/3/check/gate-tripleo-ci-centos-7-scenario004-multinode-oooq-container-upgrades-nv/aa5bbe4/logs/undercloud/etc/yum.repos.d/delorean-current.repo.txt.gz | 04:25 |
EmilienM | I wonder if the indentation breaks it | 04:25 |
EmilienM | stevebaker: ok I found it | 04:28 |
EmilienM | it's a problem in the quickstart repos | 04:28 |
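One way to check which repos a node would actually pull packages from is to list the enabled sections of a yum .repo file. The sketch below is a minimal illustration using Python's standard configparser. The repo names and the example.invalid URLs in the sample are hypothetical, and the helper `enabled_repos` is not part of quickstart.

```python
import configparser


def enabled_repos(repo_text):
    """Return {section: baseurl} for repos that are not explicitly disabled."""
    # Interpolation is turned off so '%' characters in URLs are left alone.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(repo_text)
    repos = {}
    for section in parser.sections():
        # A repo with no 'enabled' key is treated here as enabled.
        if parser.get(section, "enabled", fallback="1").strip() == "1":
            repos[section] = parser.get(section, "baseurl", fallback="")
    return repos


# Hypothetical sample: a promoted repo is enabled, the current repo is not.
SAMPLE = """\
[delorean]
name=delorean-promoted
baseurl=https://example.invalid/promoted/
enabled=1

[delorean-current]
name=delorean-current
baseurl=https://example.invalid/current/
enabled=0
"""

for name, url in sorted(enabled_repos(SAMPLE).items()):
    print(name, url)
```

If no "current" repo shows up in the result, packages such as THT can only come from the promoted repo, which matches the symptom described above.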
openstackgerrit | Emilien Macchi proposed openstack/tripleo-quickstart master: ocata2pike: add missing current-pike repo https://review.openstack.org/500671 | 04:34 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-quickstart master: ocata2pike: add missing current-pike repo https://review.openstack.org/500671 | 04:34 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Upgrade CI test on stable/pike - never merge https://review.openstack.org/500625 | 04:35 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Upgrade CI test on stable/pike - never merge https://review.openstack.org/500625 | 04:35 |
EmilienM | stevebaker: sorry to push over ^ but I think my patch will deploy the correct packages | 04:35 |
*** kristaps_ has quit IRC | 04:37 | |
*** kristaps_ has joined #tripleo | 04:37 | |
EmilienM | mandre, stevebaker: before I go to bed, I commented on https://bugs.launchpad.net/tripleo/+bug/1714905/comments/3 with my last findings. | 04:38 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 04:38 |
*** pdeore has joined #tripleo | 04:42 | |
*** pdeore has quit IRC | 04:43 | |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Switch manila-share to pacemaker version in scenario004/containers https://review.openstack.org/500314 | 04:43 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Remove bgp-vpn from scenario004-multinode-containers https://review.openstack.org/499626 | 04:43 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Add support for Dell EMC VMAX Manila Backend https://review.openstack.org/499199 | 04:44 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Add support for Dell EMC Isilon Manila backend https://review.openstack.org/499195 | 04:44 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: TLS proxy for redis https://review.openstack.org/499997 | 04:44 |
*** marios has joined #tripleo | 04:44 | |
*** yprokule has joined #tripleo | 04:45 | |
*** dparkes has quit IRC | 04:46 | |
*** yamahata has joined #tripleo | 04:47 | |
*** ratailor has quit IRC | 04:49 | |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-common stable/pike: Add missing OVN container service entries https://review.openstack.org/500582 | 04:51 |
*** pcaruana has joined #tripleo | 04:52 | |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Add CephExternal role for ceph-ansible https://review.openstack.org/500581 | 04:52 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-common stable/pike: Parse ceph_client_ansible_vars in ceph-ansible workbook https://review.openstack.org/500580 | 04:52 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Unset default value for the DockerCephDaemonImage https://review.openstack.org/500150 | 04:52 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-common stable/pike: Make curl healthchecks work with internal TLS https://review.openstack.org/500149 | 04:52 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: manila: set "neutron_admin_auth_url" correctly https://review.openstack.org/500145 | 04:52 |
openstackgerrit | Emilien Macchi proposed openstack/puppet-tripleo stable/pike: Support for Dell EMC VNX Manila Driver https://review.openstack.org/499439 | 04:52 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Persist containerized services httpd logs https://review.openstack.org/499235 | 04:52 |
*** pgadiya has joined #tripleo | 04:53 | |
* EmilienM out | 04:53 | |
*** pgadiya has quit IRC | 04:54 | |
*** cmyster has joined #tripleo | 04:58 | |
*** cmyster has quit IRC | 04:58 | |
*** cmyster has joined #tripleo | 04:58 | |
*** skramaja has joined #tripleo | 04:58 | |
*** dpawar has joined #tripleo | 05:00 | |
*** jtomasek has joined #tripleo | 05:02 | |
*** jaosorior has quit IRC | 05:08 | |
*** shreshtha has quit IRC | 05:09 | |
*** ooolpbot has joined #tripleo | 05:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 05:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713832 | 05:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 05:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714755 | 05:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 05:10 |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 05:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1715029 | 05:10 |
*** ooolpbot has quit IRC | 05:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 05:10 |
openstack | Launchpad bug 1714755 in tripleo "CI: telemetry tempest tests fail for scenario 001 in periodic jobs" [Critical,In progress] - Assigned to Arx Cruz (arxcruz) | 05:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 05:10 |
openstack | Launchpad bug 1715029 in tripleo "[overcloud.AllNodesDeploySteps.ControllerDeployment_Step3]: CREATE_FAILED Resource CREATE failed: Operation cancelled" [Critical,Triaged] | 05:10 |
*** ratailor has joined #tripleo | 05:12 | |
*** shreshtha has joined #tripleo | 05:13 | |
*** liverpooler has joined #tripleo | 05:16 | |
openstackgerrit | garyk proposed openstack/puppet-tripleo stable/pike: Change references from nsx_v3 to nsx https://review.openstack.org/500674 | 05:17 |
*** dsariel has joined #tripleo | 05:22 | |
*** liverpooler has quit IRC | 05:27 | |
*** liverpooler has joined #tripleo | 05:27 | |
*** masco has joined #tripleo | 05:29 | |
*** jaosorior has joined #tripleo | 05:30 | |
*** dparkes has joined #tripleo | 05:39 | |
*** jfrancoa has joined #tripleo | 05:40 | |
*** dpawar has quit IRC | 05:41 | |
*** janki has joined #tripleo | 05:43 | |
*** dpawar has joined #tripleo | 05:43 | |
openstackgerrit | Steve Baker proposed openstack/tripleo-heat-templates master: Containerized mongodb, disable by default, fix upgrade https://review.openstack.org/500646 | 05:47 |
*** jaosorior has quit IRC | 05:47 | |
*** colonwq has quit IRC | 05:53 | |
*** colonwq has joined #tripleo | 05:53 | |
openstackgerrit | Rajat Sharma proposed openstack/python-tripleoclient master: tripleoclient uses print rather than LOG.warning https://review.openstack.org/500680 | 06:05 |
*** iranzo has joined #tripleo | 06:10 | |
*** pdeore has joined #tripleo | 06:13 | |
*** jaganathan has joined #tripleo | 06:15 | |
kristaps_ | morning fols | 06:16 |
kristaps_ | folks | 06:16 |
*** dhill|brb has quit IRC | 06:20 | |
*** dhill|brb has joined #tripleo | 06:20 | |
*** agurenko has joined #tripleo | 06:21 | |
*** jprovazn has joined #tripleo | 06:26 | |
*** jaosorior has joined #tripleo | 06:26 | |
*** fzdarsky_ is now known as fzdarsky | 06:26 | |
*** anshul has joined #tripleo | 06:28 | |
*** hjensas has joined #tripleo | 06:33 | |
kristaps_ | folks, after running overcloud deploy with network isolation I get this error: http://paste.openstack.org/raw/620359/ | 06:35 |
kristaps_ | any ideas? | 06:35 |
Tengu | you changed nickname, kristaps_ ? | 06:36 |
kristaps_ | which interfaces must I define in /etc/os-net-config/config.json? [ERROR] No interfaces defined in config: /etc/os-net-config/config.json | 06:36 |
kristaps_ | Tengu: it changed automatically | 06:37 |
Tengu | kristaps_: I think you'd need to include another env file setting up the networks/vlans. | 06:38 |
Tengu | something like net-multiple-nics.yaml or net-bond-with-vlans.yaml or any other net-* that suits your needs. maybe someone else in here might confirm or correct my statement. | 06:39 |
*** ccamacho has joined #tripleo | 06:39 | |
* Tengu still struggling with GPT support -.-' | 06:39 | |
*** pdeore has quit IRC | 06:41 | |
kristaps_ | Tengu: I created this controller.yaml with help from one of the IRC folks | 06:45 |
kristaps_ | Tengu: so I think the problem is not in this config | 06:45 |
*** rcernin has joined #tripleo | 06:46 | |
Tengu | kristaps_: in our case, we actually have -e network-isolation.yaml -e net-bond-with-vlans.yaml (and a custom environment.yaml file with some more registry entries and parameters) | 06:47 |
Tengu | and that just works fine. | 06:47 |
*** paramite has joined #tripleo | 06:47 | |
*** jlinkes has joined #tripleo | 06:47 | |
*** pdeore has joined #tripleo | 06:47 | |
kristaps_ | Tengu, can you paste your config files somewhere? | 06:49 |
Tengu | hmm. | 06:49 |
*** stendulker has joined #tripleo | 06:49 | |
*** pdeore_ has joined #tripleo | 06:50 | |
Tengu | we have a specific setup in here, with bonding and so on. | 06:50 |
kristaps_ | Tengu: paste just controller.yaml | 06:51 |
Tengu | don't have that file in here. | 06:51 |
*** pdeore has quit IRC | 06:52 | |
Tengu | anyway, you really want to include an environments/net-* file with the network-isolation… | 06:53 |
kristaps_ | Tengu: but I have already included it | 06:55 |
kristaps_ | Tengu: both files, network-isolation.yaml and network-env.yaml | 06:55 |
jaganathan | d0ugal, hi, please look into https://review.openstack.org/#/c/489168/ | 07:05 |
*** ebarrera has joined #tripleo | 07:07 | |
*** cylopez has joined #tripleo | 07:09 | |
*** tesseract has joined #tripleo | 07:13 | |
*** aditya_r has joined #tripleo | 07:16 | |
*** hewbrocca_afk is now known as hewbrocca | 07:16 | |
janki | Hi. Can I get reviews for https://review.openstack.org/#/c/493861/? mandre marios trozet EmilienM | 07:18 |
*** nyechiel has joined #tripleo | 07:18 | |
*** shreshtha_ has joined #tripleo | 07:21 | |
*** shreshtha has quit IRC | 07:21 | |
*** florianf has joined #tripleo | 07:22 | |
*** sshnaidm|off is now known as sshnaidm | 07:22 | |
*** cschwede_ has joined #tripleo | 07:23 | |
openstackgerrit | Janki Chhatbar proposed openstack/tripleo-common master: Add ODL container images to TripleO-common. https://review.openstack.org/498383 | 07:24 |
kristaps_ | guys, when my overcloud deploy fails i see this error in logs : No interfaces defined in config: /etc/os-net-config/config.json\n+ RETVAL=1\n+ set -e\n+ [[ 1 == 2 ]]\n+ [[ 1 != 0 ]]\n+ echo 'ERROR: os-net-config configuration failed.'\nERROR: os-net-config configuration failed.\n+ exit 1\n | 07:27 |
marios | ack janki | 07:27 |
kristaps_ | any ideas? | 07:27 |
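A quick way to see what os-net-config was actually handed is to inspect the generated file on the node. The sketch below is only an illustration: it assumes the file is JSON with a top-level `network_config` list, and the sample interface entry is hypothetical. The helper `defined_interfaces` is not part of os-net-config.

```python
import json


def defined_interfaces(config_text):
    """Return the entries under 'network_config', or [] if missing or empty."""
    data = json.loads(config_text) if config_text.strip() else {}
    if not isinstance(data, dict):
        return []
    return data.get("network_config") or []


# An empty list is the kind of content that would leave no interfaces defined.
EMPTY = '{"network_config": []}'

# Hypothetical single-interface config, for comparison.
ONE_NIC = '{"network_config": [{"type": "interface", "name": "nic1", "use_dhcp": true}]}'

for label, text in (("empty", EMPTY), ("one nic", ONE_NIC)):
    print(label, len(defined_interfaces(text)))
```

On a real node the text would come from reading /etc/os-net-config/config.json. An empty result suggests the nic config template for that role never reached the node.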
kristaps_ | I remember someone had the same issue a few days ago | 07:33 |
*** jpena|off is now known as jpena | 07:33 | |
*** marios has quit IRC | 07:34 | |
*** aufi has joined #tripleo | 07:35 | |
*** oidgar has joined #tripleo | 07:38 | |
*** agurenko has quit IRC | 07:42 | |
kristaps_ | folks, overcloud network deployment fails on : http://paste.openstack.org/raw/620374/ | 07:43 |
kristaps_ | any ideas? | 07:44 |
*** agurenko has joined #tripleo | 07:46 | |
*** jpich has joined #tripleo | 07:47 | |
openstackgerrit | Janki Chhatbar proposed openstack/tripleo-heat-templates master: Add param to configure snat mechanism https://review.openstack.org/493861 | 07:48 |
*** psahoo has quit IRC | 07:52 | |
*** mcornea has joined #tripleo | 07:52 | |
*** egonzalez has joined #tripleo | 07:55 | |
kristaps_ | why, when I look in /etc/os-net-config/config.json, do I see data generated from undercloud.conf rather than from controller.yaml? | 07:57 |
*** pdeore has joined #tripleo | 08:00 | |
*** janki is now known as janki|lunch | 08:02 | |
*** pdeore_ has quit IRC | 08:02 | |
*** nyechiel_ has joined #tripleo | 08:02 | |
*** udesale has quit IRC | 08:03 | |
*** shardy has joined #tripleo | 08:03 | |
*** nyechiel has quit IRC | 08:03 | |
shardy | Morning all | 08:04 |
shardy | could I get a second reviewer for https://review.openstack.org/#/c/500585/ if anyone has a moment, thanks! | 08:05 |
*** psahoo has joined #tripleo | 08:05 | |
*** udesale has joined #tripleo | 08:05 | |
jprovazn | shardy, hi, do you have a minute for composable networks question? | 08:09 |
jtomasek | d0ugal: hi, is it possible to see some detail of a workflow error in mistral execution-get command? or what is the recommended way to debug failing workflow? | 08:09 |
shardy | jprovazn: Hi, sure! | 08:10 |
d0ugal | jtomasek: getting the workflow output is usually the most useful | 08:10 |
jprovazn | shardy, thanks, I'm trying to add a new network for ganesha service, something like this: http://paste.openstack.org/show/620376/ | 08:11 |
d0ugal | jtomasek: mistral execution-get-output | 08:11 |
jtomasek | d0ugal: ah, of course, I knew there was something I was missing... thanks | 08:11 |
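The command d0ugal mentions can also be wrapped in a small script when several executions need checking. This is a rough sketch that simply shells out to the `mistral` CLI named in the chat. It assumes the CLI is installed and the usual credentials are loaded in the environment; the helper names are hypothetical.

```python
import subprocess


def output_command(execution_id):
    """Build the argv for fetching a workflow execution's output."""
    return ["mistral", "execution-get-output", execution_id]


def fetch_output(execution_id):
    """Run the CLI and return whatever it prints on stdout."""
    result = subprocess.run(
        output_command(execution_id),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the CLI exits non-zero
    )
    return result.stdout


# Show the command that would be run; <execution-id> is a placeholder.
print(" ".join(output_command("<execution-id>")))
```

Calling `fetch_output` with a real execution ID returns the raw text, which can then be searched for the failing task's error message.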
d0ugal | jtomasek: what workflow are you debugging? | 08:11 |
jprovazn | shardy, stack-create fails with error: ...overcloud/puppet/controller-role.yaml>.resources.NetworkConfig.properties: : Unknown Property StorageNFSIpSubnet | 08:11 |
jprovazn | shardy, when staring at templates I suppose it's because this line generates IpSubnet param for networkConfig: | 08:12 |
jprovazn | https://github.com/openstack/tripleo-heat-templates/blob/master/puppet/role.role.j2.yaml#L378 | 08:12 |
jtomasek | d0ugal: I am trying to help dbrusilo debug nodes registration error in one of their setup tools | 08:12 |
jprovazn | shardy, which then means that I would have to add StorageNFSIpSubnet parameters to all network templates? | 08:12 |
shardy | jprovazn: yes that's right - we don't yet j2 render the nic config templates, that's TODO for this next cycle | 08:13 |
shardy | so it's necessary to cp -r the networks/config/foo directory and add the network to each role network config | 08:13 |
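The manual step shardy describes amounts to making sure every role's nic config template declares a `<Network>IpSubnet` parameter for each network. A minimal sketch of that check follows, assuming the template's parameters section has already been loaded into a dict. The helper name and the sample data are hypothetical.

```python
def missing_subnet_params(network_names, nic_config_parameters):
    """Return the '<Network>IpSubnet' parameters a nic config does not declare."""
    expected = [name + "IpSubnet" for name in network_names]
    return [param for param in expected if param not in nic_config_parameters]


# Hypothetical parameters section of one role's nic config template.
controller_nic_params = {
    "ControlPlaneIp": {"type": "string"},
    "StorageIpSubnet": {"type": "string"},
}

# StorageNFS is the new network from the discussion above.
print(missing_subnet_params(["Storage", "StorageNFS"], controller_nic_params))
```

Any name in the result needs to be added to that template, otherwise the stack fails with an "Unknown Property" error like the one quoted above.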
*** marios has joined #tripleo | 08:14 | |
shardy | jprovazn: we decided to do that last because many/most "real" deployments will have customized nic templates anyway | 08:14 |
shardy | and dynamically generating them is going to be a fairly risky and operator visible change | 08:14 |
shardy | jprovazn: we need docs and better validation of this I guess | 08:14 |
* shardy has been meaning to do a docs patch | 08:14 | |
jprovazn | shardy, ack, I see, at this point it degrades the flexibility of composable networks a little bit, but once it's done it will be much easier | 08:15 |
kristaps_ | shardy: if I remember, you resolved the same issue as mine a few days ago... my deploy fails on http://paste.openstack.org/raw/620374/ and when I look into /etc/os-net-config/config.json I see data generated not from controller.yaml but from undercloud.conf? | 08:15 |
jprovazn | shardy, thanks | 08:15 |
openstackgerrit | Wei Zheng proposed openstack/python-tripleoclient master: Fix some reST field lists in docstrings https://review.openstack.org/500725 | 08:15 |
shardy | jprovazn: also, we need to think about how to maintain optional networks - IMO adding a bunch of disabled networks to the default network_data.yaml won't really scale, so we need to think about a composable model similar to what mwhahaha did with roles/* | 08:15 |
shardy | jprovazn: again, what we have here is a first step, so any help improving the interfaces is welcome :) | 08:16 |
shardy | jprovazn: yeah, agreed it's not ideal, but hopefully it's something we can iterate on | 08:16 |
*** marios_ has joined #tripleo | 08:16 | |
shardy | I know that jtomasek needs the dynamic nic templates for the UI network config wizard | 08:16 |
shardy | so I guess we'll get it done pretty soon, just not in time for pike, unfortunately | 08:17 |
jprovazn | shardy, cool | 08:17 |
* jtomasek reads back | 08:18 | |
shardy | kristaps_: are you using the deployed server feature for the overcloud deploy? | 08:18 |
shardy | kristaps_: bogdan had a similar issue but IIRC it was due to a conflicting deployed server ID | 08:19 |
openstackgerrit | Wei Zheng proposed openstack/python-tripleoclient master: Fix some reST field lists in docstrings https://review.openstack.org/500725 | 08:19 |
d0ugal | jtomasek: cool, let me know if I can help at all | 08:19 |
*** psahoo has quit IRC | 08:22 | |
shardy | skramaja: Hi, can you check https://review.openstack.org/#/c/499456/ when you get a moment? Test fix related to derive params, passing CI so I think it's good to land? | 08:25 |
*** dbecker has joined #tripleo | 08:25 | |
skramaja | sure.. checking now shardy | 08:25 |
skramaja | shardy: oops, it's on tripleo-common, I am not supposed to +2 it :( | 08:26 |
skramaja | but will check though | 08:27 |
*** udesale has quit IRC | 08:27 | |
*** udesale has joined #tripleo | 08:27 | |
kristaps_ | shardy: yes i used deployed server | 08:27 |
*** marios_ has quit IRC | 08:28 | |
*** marios has quit IRC | 08:28 | |
janki|lunch | marios, done, thanks :). Can I get 1 more review for https://review.openstack.org/#/c/500097/ | 08:28 |
*** marios has joined #tripleo | 08:29 | |
*** jaosorior has quit IRC | 08:29 | |
*** pdeore has quit IRC | 08:29 | |
*** lucas-afk is now known as lucasagomes | 08:32 | |
*** gfidente has joined #tripleo | 08:32 | |
*** gfidente has quit IRC | 08:32 | |
*** gfidente has joined #tripleo | 08:32 | |
shardy | skramaja: IMO it's fine to use discretion, since you were involved with this work - if you can +2 I'll approve it, thanks! :) | 08:32 |
skramaja | ok.. | 08:32 |
skramaja | shardy: done | 08:34 |
shardy | skramaja: thanks! | 08:34 |
*** janki|lunch is now known as janki | 08:34 | |
*** psahoo has joined #tripleo | 08:35 | |
Tengu | duh. I think I have a good solution for my new headache-ish question: how to use Let's Encrypt (and certbot) in order to have valid SSL certificates on three controllers when there's a VIP attached only to one of them :). | 08:42 |
*** derekh has joined #tripleo | 08:42 | |
Tengu | the solution might be as simple as: install some vault app with an API on the undercloud, make it listen only on the "deployment" interface, and push a simple script to the controllers letting them check the vault content and get the certificate if it exists, or request a new one if near EOL + upload it to the vault. | 08:43 |
*** dtantsur|afk is now known as dtantsur | 08:46 | |
*** pdeore has joined #tripleo | 08:46 | |
shardy | Tengu: I'd chat with jaosorior about it when he's back | 08:47 |
*** tosky has joined #tripleo | 08:48 | |
Tengu | shardy: I think custodia might be a good "client" for that usage. seems pretty simple. I already talked to jaosorior about that matter a few weeks ago. I was thinking about barbican+freeIPA backend. | 08:50 |
Tengu | but something lighter like custodia might be better/faster/easier. | 08:50 |
*** iranzo1 has joined #tripleo | 08:52 | |
*** iranzo has quit IRC | 08:52 | |
*** aditya_r has quit IRC | 08:58 | |
shardy | d0ugal: hey can you check https://review.openstack.org/#/c/489168 please? Looks good to me and clears another rc2 bug | 09:00 |
kristaps_ | shardy: sorry for disturbing, maybe some quick ideas on this : http://paste.openstack.org/raw/620378/ | 09:01 |
d0ugal | shardy: sure! | 09:03 |
openstackgerrit | Janki Chhatbar proposed openstack/tripleo-heat-templates stable/pike: Mount folders and log file https://review.openstack.org/500740 | 09:05 |
*** shreshtha_ has quit IRC | 09:06 | |
shardy | kristaps_: are you perhaps trying to create two overclouds? Or do you have the network resources still present from a previous deploy? | 09:07 |
*** shreshtha has joined #tripleo | 09:07 | |
openstackgerrit | Merged openstack/tripleo-heat-templates stable/pike: Switch manila-share to pacemaker version in scenario004/containers https://review.openstack.org/500314 | 09:08 |
kristaps_ | shardy: one overcloud , there is nothing left from previous deployment, i checked ( no networks, subnets or ports) | 09:09 |
openstackgerrit | Merged openstack/tripleo-quickstart master: Switch scenario004-container to run Tempest https://review.openstack.org/500423 | 09:10 |
*** ooolpbot has joined #tripleo | 09:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 09:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713832 | 09:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 09:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714755 | 09:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 09:10 |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 09:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1715029 | 09:10 |
*** ooolpbot has quit IRC | 09:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 09:10 |
openstack | Launchpad bug 1714755 in tripleo "CI: telemetry tempest tests fail for scenario 001 in periodic jobs" [Critical,In progress] - Assigned to Arx Cruz (arxcruz) | 09:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 09:10 |
openstack | Launchpad bug 1715029 in tripleo "[overcloud.AllNodesDeploySteps.ControllerDeployment_Step3]: CREATE_FAILED Resource CREATE failed: Operation cancelled" [Critical,Triaged] | 09:10 |
openstackgerrit | Merged openstack/tripleo-heat-templates stable/pike: Remove bgp-vpn from scenario004-multinode-containers https://review.openstack.org/499626 | 09:10 |
kristaps_ | shardy: http://paste.openstack.org/raw/620379/ | 09:10 |
*** milan has joined #tripleo | 09:11 | |
openstackgerrit | Giulio Fidente proposed openstack/tripleo-heat-templates stable/pike: Unset default value for the DockerCephDaemonImage https://review.openstack.org/500150 | 09:19 |
*** dpawar has quit IRC | 09:20 | |
*** dpawar has joined #tripleo | 09:20 | |
*** iranzo1 has quit IRC | 09:20 | |
*** dpawar has quit IRC | 09:22 | |
*** iranzo has joined #tripleo | 09:22 | |
*** tosky has quit IRC | 09:23 | |
openstackgerrit | Numan Siddique proposed openstack/python-tripleoclient master: Add neutron_driver value to prepare command https://review.openstack.org/498743 | 09:30 |
openstackgerrit | Marios Andreou proposed openstack/python-tripleoclient master: Fix py27 tests - expand the regex when adding 'when' to playbook https://review.openstack.org/500749 | 09:31 |
*** tosky has joined #tripleo | 09:31 | |
mrunge | is there someone to pester with a 5 lines review for oooq-extras? | 09:32 |
mrunge | https://review.openstack.org/#/c/495745/ | 09:32 |
mrunge | it has already one #2 | 09:32 |
mrunge | one +2 and another +1 | 09:32 |
shardy | mrunge: lgtm | 09:34 |
mrunge | thank you shardy | 09:35 |
openstackgerrit | Marios Andreou proposed openstack/python-tripleoclient stable/pike: Adds when in upgrade_tasks playbook written by config download https://review.openstack.org/500751 | 09:36 |
openstackgerrit | Marios Andreou proposed openstack/python-tripleoclient stable/pike: Convert step to integer in when statement for upgrade tasks https://review.openstack.org/500752 | 09:36 |
*** Vijayendra has quit IRC | 09:36 | |
openstackgerrit | Juan Antonio Osorio Robles proposed openstack/puppet-tripleo stable/pike: Enable TLS for rabbitmq's replication traffic https://review.openstack.org/500515 | 09:38 |
openstackgerrit | Juan Antonio Osorio Robles proposed openstack/tripleo-heat-templates stable/pike: Rabbitmq: Enable Erlang distribution TLS https://review.openstack.org/500516 | 09:39 |
marios | shardy: I just noticed this is still in review; it should be good to go now? https://review.openstack.org/#/c/485732/9 | 09:39 |
openstackgerrit | Juan Antonio Osorio Robles proposed openstack/tripleo-heat-templates master: Use DeployedSSLCertificatePath for public TLS via certmonger https://review.openstack.org/500517 | 09:40 |
shardy | marios: yeah I noticed the same - I tried rechecking to get the ovb ha job to pass but AFAICT it's unrelated | 09:40 |
marios | shardy: i am cherrypicking for https://review.openstack.org/#/c/499540 and posted https://review.openstack.org/500751 https://review.openstack.org/500752 (and noticed the role one as i wanted to also cherrypick it). i forgot tripleoclient is special and already had the stable/pike for a while | 09:41 |
shardy | so we could just approve if someone else can take a look | 09:41 |
shardy | marios: yeah I think we'll need to cherry-pick a bunch of things then do another tripleoclient release | 09:41 |
marios | shardy: i also posted this one https://review.openstack.org/500749 as i saw it locally, not sure why py27 in ci passed (added a comment on the commit message ) | 09:41 |
shardy | marios: ack, yeah would be good to figure out why CI didn't catch that... | 09:42 |
marios | shardy: yeah i will cycle back to it and see if i can figure out why | 09:43 |
marios | shardy: would be funny if the py27 for that review now fails :) lets see | 09:44 |
*** Vijayendra has joined #tripleo | 09:46 | |
therve | sshnaidm, https://bugs.launchpad.net/tripleo/+bug/1714005 isn't "fixed" by the related patch. It doesn't happen anymore? | 09:46 |
openstack | Launchpad bug 1714005 in tripleo "CI. periodic master job fails because "ERROR: Error in 77 output role_data: No function "#operator_+" matches supplied arguments"" [Critical,Fix released] | 09:46 |
*** gcerami has joined #tripleo | 09:48 | |
openstackgerrit | Merged openstack/puppet-tripleo stable/pike: Support for Dell EMC VNX Manila Driver https://review.openstack.org/499439 | 09:49 |
*** Vijayendra has quit IRC | 09:51 | |
openstackgerrit | OpenStack Proposal Bot proposed openstack/tripleo-ui master: Imported Translations from Zanata https://review.openstack.org/500758 | 09:53 |
*** shardy has quit IRC | 09:55 | |
sshnaidm | therve, I didn't see it happen again; do you know what the exact problem is there? | 09:56 |
*** dixiaoli has quit IRC | 09:56 | |
*** shardy has joined #tripleo | 09:56 | |
*** sshnaidm is now known as sshnaidm|afk | 09:58 | |
*** Vijayendra has joined #tripleo | 09:59 | |
openstackgerrit | OpenStack Proposal Bot proposed openstack/tripleo-ui stable/pike: Imported Translations from Zanata https://review.openstack.org/500267 | 10:00 |
*** dciabrin has quit IRC | 10:04 | |
*** dtantsur is now known as dtantsur|lunch | 10:06 | |
openstackgerrit | Giulio Fidente proposed openstack/tripleo-common master: Define ceph image in overcloud_containers.yaml.j2 https://review.openstack.org/499822 | 10:08 |
therve | sshnaidm|afk, Nope :/ | 10:09 |
*** ooolpbot has joined #tripleo | 10:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 10:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713832 | 10:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 10:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 10:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1715029 | 10:10 |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 10:10 |
*** ooolpbot has quit IRC | 10:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 10:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 10:10 |
openstack | Launchpad bug 1715029 in tripleo "[overcloud.AllNodesDeploySteps.ControllerDeployment_Step3]: CREATE_FAILED Resource CREATE failed: Operation cancelled" [Critical,Triaged] | 10:10 |
*** dixiaoli has joined #tripleo | 10:12 | |
*** dixiaoli has quit IRC | 10:13 | |
*** ratailor has quit IRC | 10:14 | |
*** akrivoka has joined #tripleo | 10:14 | |
*** ratailor has joined #tripleo | 10:15 | |
*** akrivoka has quit IRC | 10:15 | |
*** sri_ has joined #tripleo | 10:16 | |
jbadiapa | mandre, could you take a look at https://review.openstack.org/#/c/467072/? | 10:16 |
*** akrivoka has joined #tripleo | 10:19 | |
*** nyechiel has joined #tripleo | 10:20 | |
*** nyechiel_ has quit IRC | 10:20 | |
*** egonzalez has quit IRC | 10:20 | |
*** jaosorior has joined #tripleo | 10:23 | |
*** fzdarsky is now known as fzdarsky|lunch | 10:25 | |
kristaps_ | folks, I'm trying to deploy the overcloud with network isolation, but I get this error: http://paste.openstack.org/raw/620378/ | 10:29 |
kristaps_ | any ideas? | 10:29 |
*** sshnaidm|afk is now known as sshnaidm | 10:30 | |
*** bkopilov has quit IRC | 10:31 | |
sri_ | kristaps_: hi, can you give me more information on your environment: virtual or baremetal? What command are you using to deploy the overcloud, and is the network set up with a single NIC? | 10:37 |
*** udesale__ has joined #tripleo | 10:37 | |
*** udesale has quit IRC | 10:39 | |
*** udesale__ has quit IRC | 10:40 | |
*** udesale has joined #tripleo | 10:40 | |
sri_ | kristaps_: If you're using virtual env try this http://tripleo.org/install/advanced_deployment/network_isolation_virt.html?highlight=parameter_defaults | 10:40 |
sshnaidm | why do we have so many alerts? | 10:42 |
*** ratailor has quit IRC | 10:43 | |
sshnaidm | EmilienM, ^^ | 10:43 |
openstackgerrit | Brent Eagles proposed openstack/tripleo-heat-templates master: Add Neutron SR-IOV agent container https://review.openstack.org/469066 | 10:45 |
openstackgerrit | Ana Krivokapic proposed openstack/tripleo-ui master: Fix plan descriptions on plan cards https://review.openstack.org/496381 | 10:45 |
*** numans has quit IRC | 10:45 | |
kristaps_ | sri_: hi. I'm using a baremetal env (I have the controller and compute node all in one). The node has two physical NICs: the provisioning network is on one NIC, the external network on the second | 10:45 |
kristaps_ | sri_: the config files look like http://paste.openstack.org/raw/620039 | 10:46 |
kristaps_ | sri_: don't look at compute.yaml, I removed this node already | 10:47 |
openstackgerrit | Merged openstack/tripleo-heat-templates master: Add DhcpAgentNotification param to neutron base https://review.openstack.org/499395 | 10:48 |
shardy | sshnaidm: https://bugs.launchpad.net/tripleo/+bugs?field.tag=alert are the alerts, we can remove the alert tag if you think they are resolved | 10:48 |
sshnaidm | shardy, I mean we need to reconsider our alerts policy and current bugs, I'm not sure all of them are blocking CI or promotion. If the tag is used this much, it turns into noise | 10:49 |
kristaps_ | sri_: the command I'm trying to deploy with: openstack overcloud deploy --templates -e one.yaml -e netnew.yaml -e network-isolation.yaml --compute-scale 0 -vv | 10:50 |
shardy | sshnaidm: yeah we should perhaps discuss the criteria | 10:50 |
*** fzdarsky has joined #tripleo | 10:50 | |
sri_ | kristaps_: OK, as of now I've only tested with a virtual env, so I am not sure what you're missing in the configuration; you can ask shardy, or you can try to find the answer here https://www.youtube.com/watch?v=zYNq2uT9pfM | 10:51 |
kristaps_ | sri_: thanks anyway | 10:54 |
*** numans has joined #tripleo | 10:54 | |
*** egonzalez has joined #tripleo | 10:55 | |
*** gkadam has quit IRC | 10:57 | |
openstackgerrit | Merged openstack/tripleo-heat-templates master: Set mode for ansible written files https://review.openstack.org/500585 | 10:59 |
*** jkilpatr has joined #tripleo | 10:59 | |
*** psahoo has quit IRC | 11:00 | |
*** fzdarsky has quit IRC | 11:00 | |
openstackgerrit | Merged openstack/tripleo-common master: Ensure that GetHostCpusListAction.run() returns a deterministic result https://review.openstack.org/499456 | 11:01 |
*** jkilpatr_ has joined #tripleo | 11:01 | |
openstackgerrit | Merged openstack/tripleo-common master: Derive params network config stack exists fix https://review.openstack.org/489168 | 11:01 |
*** jaganathan has quit IRC | 11:02 | |
shardy | kristaps_: does ExternalNetCidr conflict with some other network created via neutron or on the host? | 11:02 |
kristaps_ | shardy: no, see http://paste.openstack.org/raw/620379/ | 11:03 |
*** tosky has quit IRC | 11:04 | |
*** jkilpatr has quit IRC | 11:05 | |
*** lucasagomes is now known as lucas-hungry | 11:05 | |
*** ecerquei_ has quit IRC | 11:06 | |
*** egonzalez has quit IRC | 11:07 | |
*** stendulker has quit IRC | 11:10 | |
*** ooolpbot has joined #tripleo | 11:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 11:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713832 | 11:10 |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 11:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 11:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 11:10 |
*** ooolpbot has quit IRC | 11:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 11:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 11:10 |
*** aufi has quit IRC | 11:10 | |
*** dpawar has joined #tripleo | 11:11 | |
jaosorior | larsks: around? | 11:15 |
*** dpawar has quit IRC | 11:16 | |
kristaps_ | shardy: in neutron server logs i see http://paste.openstack.org/raw/620391/ | 11:17 |
Tengu | jaosorior: heya! do you have 5 minutes so that I can expose an idea I got in order to share let's encrypt certificate between many controllers? | 11:18 |
*** dpawar has joined #tripleo | 11:19 | |
*** adarazs is now known as adarazs_afk | 11:19 | |
*** adarazs_afk is now known as adarazs_brb | 11:20 | |
openstackgerrit | Or Idgar proposed openstack/puppet-tripleo master: Adapting Octavia api to work with containerized environment https://review.openstack.org/500593 | 11:20 |
*** pkovar has joined #tripleo | 11:20 | |
Tengu | hmm in fact I might create an issue with that, and might become a blueprint. Maybe better that way. | 11:21 |
*** nyechiel has quit IRC | 11:22 | |
jaosorior | Tengu: sure, what's up? | 11:22 |
*** ansmith has quit IRC | 11:23 | |
Tengu | jaosorior: ah :). I was thinking about installing some vault app on the undercloud server, and making it listen only on the provisioning network. I stumbled on "custodia", which might fulfill the needs in a clever way (or barbican if we can ensure it has a secured backend). | 11:24 |
Tengu | jaosorior: once that vault service is up, a script is installed on the N controllers - it will do all the "magic": - check vault content for existing certificate and fetch it + install/activation if newer than the one already present | 11:24 |
jaosorior | Tengu: we actually have plans to enable custodia for the overcloud | 11:25 |
Tengu | \o/ | 11:25 |
*** aufi has joined #tripleo | 11:25 | |
openstackgerrit | Javier Peña proposed openstack/tripleo-quickstart-extras master: Allow pre-installed DLRN https://review.openstack.org/499117 | 11:25 |
jaosorior | Tengu: the lack of custodia is what brought up the current way of deploying TLS. With it we could potentially make things way easier. | 11:25 |
Tengu | jaosorior: among other things the script will do is the renewal - for that it will need to check if the VIP is attached to the server, if yes it will renew the certificate and update the new version to the vault - if not, it will fallback on vault checking/fetching/install | 11:26 |
Tengu | jaosorior: +42 - especially if it's also enabled on the undercloud in fact | 11:26 |
Tengu | jaosorior: that would prevent any chicken-and-the-egg issue, as the undercloud is already installed/configured when we deploy the overcloud | 11:26 |
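The per-controller script Tengu describes comes down to one decision per run, sketched here in Python. Everything named below is an assumption made for illustration: the 30-day renewal window, the action names, and treating "newer" as "expires later". It does not call custodia, certbot, or any real API.

```python
from datetime import datetime, timedelta
from typing import Optional

RENEW_BEFORE = timedelta(days=30)  # assumed renewal window, not from the log


def next_action(holds_vip: bool,
                local_expiry: Optional[datetime],
                vault_expiry: Optional[datetime],
                now: datetime) -> str:
    """Decide what one controller should do on this run (hypothetical)."""
    known = [e for e in (local_expiry, vault_expiry) if e is not None]
    newest = max(known, default=None)
    # No certificate anywhere counts as "near EOL" too.
    near_eol = newest is None or newest - now <= RENEW_BEFORE

    # Only the controller holding the VIP may renew, then upload to the vault.
    if holds_vip and near_eol:
        return "renew_and_upload"
    # Everyone else (or the VIP holder with a fresh cert) syncs from the vault.
    if vault_expiry is not None and (local_expiry is None
                                     or vault_expiry > local_expiry):
        return "fetch_from_vault"
    return "nothing"


now = datetime(2017, 9, 5)
print(next_action(True, now + timedelta(days=10), now + timedelta(days=10), now))
print(next_action(False, now + timedelta(days=10), now + timedelta(days=90), now))
```

Keeping the decision separate from the side effects (talking to the vault, running the ACME client, reloading haproxy) makes the VIP and fallback cases easy to test on their own.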
*** shardy is now known as shardy_lunch | 11:26 | |
jaosorior | Tengu: propose the blueprint, that would be good to bring up in the PTG. Are you going? | 11:27 |
Tengu | jaosorior: fine, I try to create the blueprint. Never done that, so be nice if I fail ;) | 11:27 |
jaosorior | Tengu: lol, don't worry about it. I'm sure it'll go fine. | 11:28 |
Tengu | jaosorior: and I'm glad you (the team) also consider custodia :) | 11:28 |
jaosorior | Tengu: the main focus we want to give custodia for now is actually to do passwordless openstack config files. | 11:28 |
Tengu | jaosorior: yup, that's part of custodia capabilities. and x509 keypair storage | 11:29 |
Tengu | so now… how may I call that blueprint? "shared x509 cert between multiple controllers" ? | 11:29 |
Tengu | ah, na | 11:30 |
*** aufi has quit IRC | 11:30 | |
*** egonzalez has joined #tripleo | 11:30 | |
Tengu | jaosorior: hmm. I must edit a wiki page in order to do the blueprint…? apparently it's not supposed to hold all the features/thoughts directly on launchpad | 11:31 |
openstackgerrit | Jaganathan Palanisamy proposed openstack/tripleo-common master: DPDK derive params network config validation https://review.openstack.org/488362 | 11:32 |
jaosorior | Tengu: where are you doing it? in launchpad? | 11:32 |
Tengu | jaosorior: well, yes… Is it the wrong place ? | 11:33 |
jaosorior | Tengu: no. I was just wondering what you meant by editing a wiki. | 11:33 |
Tengu | ah :). | 11:33 |
jaosorior | Tengu: but it's the right place. | 11:34 |
Tengu | "The URL of the specification. This is usually a wiki page." - apparently, the small textarea "Summary" is intended as a summary, not the full spec. | 11:34 |
jaosorior | Tengu: do the summary there, and folks usually use the tripleo-specs repo to propose blueprints. | 11:34 |
jaosorior | Tengu: https://github.com/openstack/tripleo-specs | 11:35 |
Tengu | jaosorior: hmm ok. May I do a pull request on that repository with the specs I already have in mind? | 11:35 |
*** rcernin has quit IRC | 11:36 | |
jaosorior | Tengu: well, it's also under openstack, so the github page is just a mirror. you would do git review -s ; and propose the change to gerrit. | 11:36 |
*** rcernin has joined #tripleo | 11:36 | |
jaosorior | Tengu: actually, just found this http://blog.nemebean.com/content/writing-tripleo-specs ; props to beekneemech for writing it. | 11:37 |
Tengu | jaosorior: hm. I must dig into that anyway. | 11:37 |
*** nyechiel has joined #tripleo | 11:37 | |
Tengu | jaosorior: ah, thanks for the link. I'll do the summary and will check "how to write a spec". | 11:37 |
*** dprince has joined #tripleo | 11:39 | |
*** thrash|g0ne is now known as thrash | 11:39 | |
Tengu | jaosorior: how is it? https://blueprints.launchpad.net/tripleo/+spec/shared-x509-certficates | 11:41 |
*** jpena is now known as jpena|lunch | 11:41 | |
*** nyechiel has quit IRC | 11:42 | |
*** gchamoul is now known as gchamoul|afk | 11:42 | |
openstackgerrit | Juan Antonio Osorio Robles proposed openstack/tripleo-heat-templates master: Mount public certificate in haproxy init container https://review.openstack.org/500779 | 11:42 |
*** nyechiel has joined #tripleo | 11:42 | |
jaosorior | afazekas: ^^ | 11:43 |
*** aufi has joined #tripleo | 11:43 | |
jaosorior | Tengu: well, we do use certmonger for TLS certs already though. Alternatively we could do a certmonger plugin that gets certs from custodia. | 11:45 |
Tengu | jaosorior: ah, yes, maybe. and that would be good as apache doesn't need to reload when nss container is updated. | 11:46 |
Tengu | jaosorior: "up to you" ;). I'm not an openstack dev, you know better than me how it might work together | 11:47 |
Tengu | goal of that blueprint is to start the discussion. Maybe I'll let you do the specs then, as I'm not aware of all the tls stuff already in place. | 11:47 |
*** rook|pto is now known as rook | 11:48 | |
jaosorior | Tengu: are you going to the PTG? | 11:48 |
Tengu | jaosorior: PTG ? (my question might answer yours ;)) | 11:48 |
jaosorior | lol, it sure does | 11:48 |
jaosorior | Tengu: https://www.openstack.org/ptg/ | 11:48 |
Tengu | uhu, ok. then nope, Denver is a bit out of reach | 11:49 |
jaosorior | Tengu: I'll bring up the blueprint with my team. One of the folks is a custodia contributor, so he'll be happy more folks are looking into using it. | 11:49 |
Tengu | oohh :) | 11:50 |
Tengu | jaosorior: if the IPA backend support for custodia can be marked as "stable", that would be just marvelous. | 11:50 |
jaosorior | Tengu: I'm not sure, would need to ask him. | 11:51 |
Tengu | jaosorior: small fact though: barbican will die if you push custodia, won't it? | 11:51 |
Tengu | or would barbican support custodia as a backend, allowing to do some ACL stuff with keystone ? | 11:52 |
jaosorior | actually, the opposite | 11:52 |
Tengu | the latter is probably the best | 11:52 |
jaosorior | Tengu: some folks did custodia with barbican as a plugin (backend) | 11:52 |
Tengu | jaosorior: hm, so we query barbican and it proxies custodia, or the reverse ? | 11:52 |
jaosorior | Tengu: query custodia and then it goes to barbican. | 11:53 |
Tengu | hmm. weird idea | 11:53 |
jaosorior | Tengu: not coded by us, but was an interesting presentation last summit. | 11:54 |
Tengu | ok | 11:54 |
jaosorior | Tengu: ultimately, what barbican provides is multitenancy, which custodia doesn't have. | 11:54 |
*** vpickard has quit IRC | 11:55 | |
Tengu | ah, yes. although custodia might provide it with an IPA backend :). | 11:55 |
jaosorior | Tengu: you could have an app using custodia, and authenticating to a certain backend that is mapped to an openstack project. | 11:55 |
Tengu | makes sense | 11:55 |
jaosorior | Tengu: but yeah, if using IPA, might as well use custodia -> IPA directly. | 11:56 |
*** marrusl has joined #tripleo | 11:56 | |
jaosorior | Tengu: when deploying barbican, we want to have IPA as a backend for it as well (well... barbican goes directly to dogtag) | 11:56 |
Tengu | jaosorior: in the meanwhile I'll probably do my scripting thing in order to get the wanted feature on pike already. i.e. configure a custodia on the undercloud, expose it, and do the script | 11:56 |
Tengu | jaosorior: yep, we talked about it earlier .) | 11:56 |
*** vpickard has joined #tripleo | 11:57 | |
jaosorior | Tengu: if you do, write a blog post about it :D | 11:57 |
*** dtantsur|lunch is now known as dtantsur | 11:57 | |
openstackgerrit | Numan Siddique proposed openstack/tripleo-quickstart master: DO NOT REVIEW : TESTING ONLY https://review.openstack.org/500609 | 11:57 |
Tengu | pretty sure my boss wants me to write something about "our entreprise TripleO experience with our internal export" XD | 11:57 |
Tengu | *expert | 11:57 |
Tengu | anyway. a bunch of buzzwords that might trigger SEO stuff, you know the thing I guess ;) | 11:58 |
jaosorior | lol sure | 11:58 |
*** abishop has joined #tripleo | 11:59 | |
Tengu | jaosorior: so I leave you with the blueprint "as is"? I guess you'll be able to produce a good spec with your team. | 11:59 |
Tengu | and if I can provide some information based on my "messy tests", I'll ping you ;) | 11:59 |
skramaja | dprince: could you take a look at https://review.openstack.org/#/c/499086/? | 11:59 |
*** vpickard has quit IRC | 12:00 | |
sshnaidm | mandre, hi, do you know if this is solved? https://trello.com/c/EUl9ACAF/229-cixlp1714412tripleociproa-upgrade-from-ocata-to-pike-fails-to-pull-containers-during-controllerdeploymentstep1 | 12:00 |
skramaja | stevebaker: dprince I have created a patch to include the kolla conf file by default in the tripleoclient image build - https://review.openstack.org/#/c/500475/, check if this approach is good. | 12:00 |
jaosorior | Tengu: lets do this, I'll bring it up with the folks, see if we can provide input. But I can't really promise we'll directly work on it (gotta prioritize a bunch of stuff first :/) ... however, we will most likely work on getting custodia for the overcloud, so that'll get you farther. | 12:01 |
Tengu | that said… I'm apparently unable to activate TLS for the public endpoints :/. weird. | 12:01 |
*** shardy_lunch is now known as shardy | 12:01 | |
jaosorior | Tengu: wha :/ | 12:01 |
Tengu | I probably missed something somewhere. | 12:02 |
*** vpickard has joined #tripleo | 12:02 | |
*** janki has quit IRC | 12:03 | |
*** bfournie has quit IRC | 12:04 | |
*** bfournie has joined #tripleo | 12:04 | |
*** aditya_r has joined #tripleo | 12:05 | |
Tengu | jaosorior: in fact, my script is installed, I have a certificate, but neither haproxy nor apache seem to get the proper TLS configuration. I'm pretty positive I have all the requested information. | 12:08 |
*** bfournie has quit IRC | 12:09 | |
*** jlabarre has joined #tripleo | 12:09 | |
openstackgerrit | Alan Bishop proposed openstack/tripleo-common stable/pike: Use CephAnsibleDisksConfig when deriving HCI parameters https://review.openstack.org/500784 | 12:09 |
*** fultonj has joined #tripleo | 12:10 | |
*** ooolpbot has joined #tripleo | 12:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 12:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713832 | 12:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 12:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 12:10 |
*** ooolpbot has quit IRC | 12:10 | |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 12:10 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 12:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 12:10 |
*** dpawar has quit IRC | 12:11 | |
*** sshnaidm is now known as sshnaidm|afk | 12:11 | |
openstackgerrit | Giulio Fidente proposed openstack/tripleo-heat-templates master: Move RadosGW config settings into ceph-rgw https://review.openstack.org/500785 | 12:12 |
openstackgerrit | afazekas proposed openstack/tripleo-heat-templates master: docker-puppet.py duplicated import https://review.openstack.org/500786 | 12:14 |
*** shreshtha has quit IRC | 12:15 | |
kristaps_ | shardy: no quick ideas? | 12:16 |
openstackgerrit | Marios Andreou proposed openstack/python-tripleoclient stable/pike: Convert step to integer in when statement for upgrade tasks https://review.openstack.org/500752 | 12:16 |
openstackgerrit | Marios Andreou proposed openstack/python-tripleoclient stable/pike: Fix py27 tests - expand the regex when adding 'when' to playbook https://review.openstack.org/500787 | 12:16 |
shardy | kristaps_: well the error is pretty clear, neutron thinks the physical network "external" is already in use - can you confirm neutron net-list and neutron net-show for all networks after the failure? | 12:17 |
shardy | your previous paste seems inconsistent, because all networks except external completed OK? | 12:17 |
*** lucas-hungry is now known as lucasagomes | 12:18 | |
shardy | https://github.com/openstack/neutron/blob/master/neutron/plugins/ml2/drivers/type_flat.py#L94 | 12:18 |
shardy | kristaps_: looking at the code which produces the error, I think the only thing which can conflict is the physical_network name, as this is the primary key used in the FlatAllocation db record | 12:19 |
* shardy not a neutron expert tho.. | 12:19 | |
*** leitan has joined #tripleo | 12:20 | |
mandre | sshnaidm|afk: it is fixed! I've moved the card to the 'done' column | 12:22 |
*** adarazs_brb is now known as adarazs | 12:23 | |
openstackgerrit | afazekas proposed openstack/tripleo-heat-templates master: flake8 rule mex line length https://review.openstack.org/500791 | 12:24 |
kristaps_ | shardy: this is the log after the failure: http://paste.openstack.org/raw/620400/ , and this is what my network-isolation files look like: http://paste.openstack.org/raw/620401/ | 12:25 |
*** udesale has quit IRC | 12:25 | |
*** ecerquei has joined #tripleo | 12:26 | |
openstackgerrit | OpenStack Proposal Bot proposed openstack/tripleo-heat-templates master: Updated from global requirements https://review.openstack.org/488148 | 12:26 |
shardy | kristaps_: yes I saw those, but you also posted http://paste.openstack.org/raw/620378/ | 12:27 |
shardy | which has e.g > 2017-09-05 08:51:30Z [overcloud.Networks.InternalNetwork]: CREATE_COMPLETE state changed | 12:27 |
shardy | So there should be a network in neutron net-list for all those networks which completed (all except the External one AFAICS) | 12:27 |
shardy | unless you have some other environment file which only enables the isolated external network? | 12:28 |
shardy | http://paste.openstack.org/show/620402/ | 12:28 |
shardy | kristaps_: it should look like that, but without the external network | 12:29 |
openstackgerrit | Or Idgar proposed openstack/puppet-tripleo master: Adapting Octavia api to work with containerized environment https://review.openstack.org/500593 | 12:29 |
shardy | otherwise the two pastes aren't consistent, and I can't explain why if it's the same undercloud | 12:29 |
*** maeca has joined #tripleo | 12:30 | |
*** maeca has quit IRC | 12:30 | |
openstackgerrit | garyk proposed openstack/tripleo-heat-templates stable/pike: Add DhcpAgentNotification param to neutron base https://review.openstack.org/500794 | 12:31 |
*** fzdarsky has joined #tripleo | 12:31 | |
*** rlandy has joined #tripleo | 12:33 | |
*** rhallisey has joined #tripleo | 12:34 | |
openstackgerrit | Juan Antonio Osorio Robles proposed openstack/instack-undercloud master: Use integer for rabbitmq port https://review.openstack.org/500798 | 12:35 |
kristaps_ | shardy: but only the external network is defined in my network-env.yaml | 12:35 |
jaosorior | shardy: could you check that out? ^^ there was an upgrade for puppet-rabbitmq and that's needed | 12:35 |
*** tosky has joined #tripleo | 12:35 | |
shardy | kristaps_: where? There's no resource_registry mappings in http://paste.openstack.org/raw/620401/ AFAICS? | 12:36 |
shardy | (other than the Net::SoftwareConfig ones) | 12:36 |
shardy | kristaps_: maybe start an etherpad and include *all* information about how you're deploying - figuring out fragments like this is kind of hard to keep track of? | 12:37 |
shardy | the neutron error is still very clear, somewhere, somehow, you have an overlapping flat network name | 12:38 |
shardy | so it's a question of looking carefully at the config and neutron CLI to figure it out I think | 12:38 |
jaosorior | mwhahaha: could you check this out https://review.openstack.org/#/c/500798/ ? | 12:38 |
shardy | jaosorior: ack will do | 12:38 |
jaosorior | thanks | 12:41 |
*** jaosorior has quit IRC | 12:41 | |
openstackgerrit | Raoul Scarazzini proposed openstack/tripleo-quickstart-extras master: Remove validate-ha from available roles https://review.openstack.org/500806 | 12:42 |
*** eck`gone is now known as eck` | 12:43 | |
*** ansmith has joined #tripleo | 12:44 | |
*** cshastri has quit IRC | 12:45 | |
*** dmsimard is now known as dmsimard|afk | 12:47 | |
*** gchamoul|afk is now known as gchamoul | 12:47 | |
*** bfournie has joined #tripleo | 12:49 | |
*** garyk has quit IRC | 12:50 | |
*** oidgar has quit IRC | 12:50 | |
*** thrash is now known as thrash|biab | 12:50 | |
*** ykarel has quit IRC | 12:51 | |
kristaps_ | shardy: sorry, but I don't understand what resource mappings I need besides Net::SoftwareConfig if I have only one node; I have defined the external network with ExternalNetCidr: 172.16.84.0/24 ExternalAllocationPools: [{'start': '172.16.84.60', 'end': '172.16.84.80'}] | 12:52 |
kristaps_ | what other mappings do I need? | 12:52 |
*** janki has joined #tripleo | 12:52 | |
*** ecerquei has quit IRC | 12:52 | |
*** tzumainn has joined #tripleo | 12:54 | |
*** catintheroof has joined #tripleo | 12:55 | |
*** catintheroof has quit IRC | 12:55 | |
*** catintheroof has joined #tripleo | 12:55 | |
*** skramaja has quit IRC | 12:55 | |
*** pchavva has joined #tripleo | 12:56 | |
shardy | kristaps_: you're trying to deploy with network isolation, right, even though it's one node? | 12:56 |
shardy | there are mappings for each network that enable creating the neutron networks | 12:56 |
shardy | https://github.com/openstack/tripleo-heat-templates/blob/master/environments/network-isolation.j2.yaml#L16 | 12:56 |
shardy | all of your networks except external work, but there's nothing in neutron net-list | 12:57 |
shardy | that can't be right | 12:57 |
*** jpena|lunch is now known as jpena | 12:57 | |
shardy | unless you've overridden the default resource_registry setup by network-isolation.j2.yaml? | 12:57 |
openstackgerrit | Jose Luis Franco proposed openstack/tripleo-heat-templates master: Add yaml validation in docker upgrade_tasks. https://review.openstack.org/497940 | 12:57 |
openstackgerrit | Jose Luis Franco proposed openstack/tripleo-heat-templates master: Add tags to baremetal cron removal tasks https://review.openstack.org/497936 | 12:57 |
*** jmelvin has joined #tripleo | 12:59 | |
openstackgerrit | Jose Luis Franco proposed openstack/tripleo-heat-templates master: Add yaml validation in docker upgrade_tasks. https://review.openstack.org/497940 | 13:00 |
*** nyechiel_ has joined #tripleo | 13:04 | |
*** nyechiel has quit IRC | 13:05 | |
*** oidgar has joined #tripleo | 13:05 | |
*** aditya_r has quit IRC | 13:06 | |
*** bkopilov has joined #tripleo | 13:07 | |
openstackgerrit | Martin André proposed openstack/tripleo-common master: Fix the path to HEALTHCHECK_SCRIPTS in healthcheck/ironic-api https://review.openstack.org/500495 | 13:08 |
openstackgerrit | Javier Peña proposed openstack/instack-undercloud master: Make the RabbitMQ port be an integer https://review.openstack.org/500814 | 13:08 |
*** Goneri has joined #tripleo | 13:08 | |
jpena | EmilienM: our check on where the puppet-rabbitmq bump would affect TripleO missed this ^^ | 13:08 |
shardy | jpena: I think jaosorior posted the same fix, sec | 13:09 |
shardy | https://review.openstack.org/#/c/500798/ | 13:09 |
openstackgerrit | Numan Siddique proposed openstack/tripleo-quickstart-extras master: Increase the value of tempest config 'validation.ping_count' to 3 https://review.openstack.org/500815 | 13:09 |
jpena | shardy: oh, yes | 13:09 |
shardy | jpena: thanks anyway :) | 13:09 |
kristaps_ | shardy: yes, I have one node and want to test network isolation. My network-isolation.yaml looks like http://paste.openstack.org/show/620407/ ; I took it from https://github.com/cybertron/tripleo-network-templates/blob/master/simple/network-isolation.yaml | 13:09 |
shardy | I suspect we're not catching it in CI as we're still installing the stable/pike modules? | 13:09 |
jpena | we are, see all the failures in https://review.openstack.org/500798 | 13:10 |
shardy | due to https://bugs.launchpad.net/tripleo/+bug/1714361 | 13:10 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 13:10 |
openstackgerrit | Numan Siddique proposed openstack/tripleo-quickstart master: Switch scenario007 to run Tempest https://review.openstack.org/494293 | 13:10 |
jpena | I mean, https://review.openstack.org/#/c/499117/2 | 13:10 |
*** ooolpbot has joined #tripleo | 13:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 13:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713832 | 13:10 |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 13:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 13:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 13:10 |
*** ooolpbot has quit IRC | 13:10 | |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 13:10 |
shardy | jpena: aha, so we have a dlrn workaround now then? Great! :) | 13:11 |
kristaps_ | shardy: I just need the external network to test TLS | 13:15 |
shardy | kristaps_: maybe chat with bnemec about those, but they've not been updated in 2 years and aren't tested in CI, I've never used them and thus have no idea at all if they work | 13:15 |
shardy | like, what's in /home/stack/external.yaml ? | 13:15 |
shardy | I can't possibly know since it's not in t-h-t, and this is another piece of incomplete information | 13:16 |
shardy | the etherpad I suggested earlier would help | 13:16 |
*** milan has quit IRC | 13:17 | |
* shardy needs a cup of tea | 13:17 | |
*** shardy is now known as shardy_afk | 13:17 | |
*** aditya_r has joined #tripleo | 13:17 | |
*** noslzzp has joined #tripleo | 13:20 | |
*** thrash|biab is now known as thrash | 13:22 | |
*** ykarel has joined #tripleo | 13:24 | |
gfidente | I think all ci is blocked on http://logs.openstack.org/85/500785/1/check/gate-tripleo-ci-centos-7-scenario001-multinode-oooq/40d5933/logs/undercloud/home/jenkins/undercloud_install.log.txt.gz#_2017-09-05_12_36_23 ? | 13:24 |
kristaps_ | shardy: what is not tested in CI? TLS? | 13:24 |
gfidente | I remember bandini fixing something about it? | 13:24 |
gfidente | bandini probably needs a fix in the undercloud as well? | 13:25 |
kristaps_ | shardy: /home/stack/external.yaml looks like http://paste.openstack.org/raw/620408/ ; there are only default parameters | 13:25 |
kristaps_ | shardy: ^ | 13:25 |
*** garyk has joined #tripleo | 13:26 | |
openstackgerrit | Giulio Fidente proposed openstack/instack-undercloud master: Change rabbitmq::port from string to integer https://review.openstack.org/500822 | 13:28 |
*** mrch has joined #tripleo | 13:28 | |
openstackgerrit | Marios Andreou proposed openstack/tripleo-heat-templates master: Remove package if service stopped and disabled https://review.openstack.org/479886 | 13:29 |
*** dsneddon has joined #tripleo | 13:29 | |
*** eck` is now known as eck`gone | 13:29 | |
jrist | rbrady: can you look at this last comment? https://review.openstack.org/#/c/469608/ | 13:32 |
*** numans has quit IRC | 13:32 | |
*** ykarel_ has joined #tripleo | 13:32 | |
*** rhallisey has quit IRC | 13:32 | |
*** eck`gone is now known as eck` | 13:33 | |
*** rhallisey has joined #tripleo | 13:33 | |
rbrady | jrist: yes, we'll dig into this further | 13:34 |
jrist | rbrady: much thanks. it's urgent | 13:34 |
*** ykarel has quit IRC | 13:34 | |
openstackgerrit | Giulio Fidente proposed openstack/tripleo-heat-templates master: Move RadosGW config settings into ceph-rgw https://review.openstack.org/500785 | 13:40 |
*** numans has joined #tripleo | 13:40 | |
*** mbeierl has quit IRC | 13:40 | |
*** artom_ has quit IRC | 13:41 | |
*** masco has quit IRC | 13:41 | |
EmilienM | hello | 13:43 |
*** jlinkes has quit IRC | 13:43 | |
*** lblanchard has joined #tripleo | 13:43 | |
*** jlinkes has joined #tripleo | 13:44 | |
EmilienM | jpena: sorry we missed that. Not sure how it's possible | 13:44 |
*** gbarros has joined #tripleo | 13:44 | |
*** pdeore has quit IRC | 13:44 | |
EmilienM | sshnaidm|afk: the alerts reflect critical issues we have right now in CI | 13:44 |
EmilienM | sshnaidm|afk: happy to re-discuss about criteria | 13:44 |
*** trown is now known as trown|brb | 13:45 | |
*** jaosorior has joined #tripleo | 13:45 | |
Lokesh_Jain__ | can someone please take a look at cherry-pick reviews for tripleo-heat-templates: https://review.openstack.org/#/c/492224/ and https://review.openstack.org/#/c/492245 | 13:47 |
shardy_afk | EmilienM: it's possible because we're not testing master in CI anymore | 13:47 |
*** shardy_afk is now known as shardy | 13:47 | |
EmilienM | shardy: master openstack you mean? | 13:48 |
*** ykarel_ is now known as ykarel | 13:48 | |
shardy | EmilienM: well also all the puppet modules, which I assume is why we promoted a puppet-rabbitmq which breaks us? | 13:49 |
EmilienM | I see all ovb jobs are red | 13:49 |
EmilienM | jaosorior: are you looking at ovb failures? | 13:50 |
*** milan has joined #tripleo | 13:51 | |
shardy | They were passing a couple of hours ago, e.g https://review.openstack.org/#/c/450708/ | 13:51 |
gfidente | fwiw I started something for the undercloud install https://bugs.launchpad.net/tripleo/+bug/1715152 | 13:51 |
openstack | Launchpad bug 1715152 in tripleo "CI blocked on undercloud install with Error while evaluating a Function Call, Class[Rabbitmq]: parameter 'port' expects an Integer value, got String" [Critical,In progress] - Assigned to Giulio Fidente (gfidente) | 13:51 |
gfidente | https://review.openstack.org/500822 | 13:51 |
shardy | other than the ha job, which seems to be running too close to the timeout | 13:51 |
jaosorior | EmilienM: yeah, haven't figured out what's missing in the fix | 13:52 |
*** shardy has quit IRC | 13:52 | |
jaosorior | EmilienM, mwhahaha seems to be an issue with rabbitmq's management interface | 13:52 |
gfidente | jaosorior ah I think patch and bug are duplicated | 13:53 |
gfidente | you should have set critical on yours | 13:53 |
gfidente | I didn't see it | 13:53 |
*** shardy has joined #tripleo | 13:54 | |
*** cdearborn has joined #tripleo | 13:55 | |
mwhahaha | fyi meeting in 5 mins, if you have anything for the agenda please add it https://etherpad.openstack.org/p/tripleo-meeting-items | 13:55 |
*** trown|brb is now known as trown | 13:57 | |
*** ykarel has quit IRC | 13:58 | |
*** ykarel has joined #tripleo | 14:00 | |
mwhahaha | #startmeeting tripleo | 14:00 |
openstack | Meeting started Tue Sep 5 14:00:13 2017 UTC and is due to finish in 60 minutes. The chair is mwhahaha. Information about MeetBot at http://wiki.debian.org/MeetBot. | 14:00 |
openstack | Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. | 14:00 |
*** openstack changes topic to " (Meeting topic: tripleo)" | 14:00 | |
openstack | The meeting name has been set to 'tripleo' | 14:00 |
EmilienM | o/ | 14:00 |
mwhahaha | #topic agenda | 14:00 |
*** openstack changes topic to "agenda (Meeting topic: tripleo)" | 14:00 | |
thrash | o/ | 14:00 |
cdearborn | o/ | 14:00 |
mwhahaha | * review past action items | 14:00 |
mwhahaha | * CI status | 14:00 |
mwhahaha | * bugs | 14:00 |
mwhahaha | * Projects releases or stable backports | 14:00 |
mwhahaha | * Specs / blueprints | 14:00 |
mwhahaha | * one off agenda items | 14:00 |
mwhahaha | * open discussion | 14:00 |
mwhahaha | Anyone can use the #link, #action and #info commands, not just the moderator! | 14:00 |
mwhahaha | Hi everyone! who is around today? | 14:00 |
beagles | o/ | 14:00 |
EmilienM | hola | 14:00 |
*** dtrainor has joined #tripleo | 14:00 | |
jrist | o/ | 14:00 |
lyarwood | o/ | 14:00 |
larsks | o/ | 14:00 |
marios | o/ | 14:00 |
ccamacho | o/ | 14:00 |
shardy | o/ | 14:00 |
rbrady | o/ | 14:01 |
fultonj | o/ | 14:01 |
gcerami | \o | 14:01 |
gfidente | o/ | 14:01 |
gfidente | :D | 14:01 |
*** sshnaidm|afk is now known as sshnaidm | 14:01 | |
sshnaidm | 0/ | 14:01 |
jaosorior | o/ | 14:01 |
jpich | o/ | 14:01 |
adarazs | ö/ | 14:01 |
trown | o/ | 14:01 |
mwhahaha | #topic review past action items | 14:02 |
*** openstack changes topic to "review past action items (Meeting topic: tripleo)" | 14:02 | |
mwhahaha | team to help with reviewing https://review.openstack.org/#/q/topic:bug/1691403 (CI alert fix) - done | 14:02 |
mwhahaha | EmilienM sends a note about Queens blueprints / specs on ML | 14:02 |
EmilienM | done | 14:02 |
jfrancoa | o/ | 14:02 |
mwhahaha | ok that's all we have from the last meeting | 14:02 |
mwhahaha | moving on | 14:03 |
mwhahaha | #topic CI status | 14:03 |
*** openstack changes topic to "CI status (Meeting topic: tripleo)" | 14:03 | |
EmilienM | pretty bad :D | 14:03 |
janki | o/ | 14:03 |
mwhahaha | so where are we at in terms of CI, it appears a whole bunch is broken at the moment | 14:03 |
EmilienM | I think there are some alerts that we can remove and treat them as "normal" bugs | 14:03 |
mandre | o/ | 14:03 |
*** numans has quit IRC | 14:04 | |
gfidente | I just moved https://bugs.launchpad.net/tripleo/+bug/1713659 back into 'Confirmed'/'Critical' | 14:04 |
openstack | Launchpad bug 1713659 in tripleo "Rabbitmq class expects ports as integers, not strings" [Critical,Confirmed] - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles) | 14:04 |
EmilienM | we need an alert for OVB jobs for sure, they look all red right now | 14:04 |
owalsh | o/ | 14:04 |
mwhahaha | https://bugs.launchpad.net/tripleo/+bug/1713832 | 14:04 |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 14:04 |
mwhahaha | marios: do we have an update on this issue? | 14:05 |
EmilienM | I think we can remove alert on this one | 14:05 |
mwhahaha | or is anyone looking at it? | 14:05 |
mwhahaha | k | 14:05 |
*** ebarrera has quit IRC | 14:05 | |
EmilienM | mwhahaha: not sure we have made progress on this one | 14:05 |
marios | mwhahaha: not since i discussed it last night with EmilienM ... it is still happening http://status.openstack.org/elastic-recheck/#1713832 | 14:05 |
EmilienM | gah | 14:05 |
mwhahaha | ok sounds like we need some more investigation from zaqar folks | 14:06 |
EmilienM | 1 fails in 24 hrs isn't terrible | 14:06 |
EmilienM | (we have seen worse lately) | 14:06 |
marios | mwhahaha: so whatever it is it is ongoing ... i went through recent zaqar/swift commits but can't see any obvious thing there | 14:06 |
mwhahaha | no it's not, let's drop the alert and poke the storage/zaqar folks to look into it | 14:06 |
EmilienM | mwhahaha: we'll need to discuss about marios's change that he wants to restore | 14:06 |
marios | EmilienM: well it seems to fail at about the same rate since the revert, as we discussed already (that is another issue, let's talk about it separately) | 14:06 |
EmilienM | and eventually backport | 14:06 |
*** oidgar has quit IRC | 14:06 | |
sshnaidm | EmilienM, where do you see ovb jobs failing? | 14:06 |
EmilienM | sshnaidm: tripleo.org/cistatus.html | 14:06 |
mwhahaha | let's go through the alert bugs first | 14:07 |
marios | mwhahaha: EmilienM for context/logs the revert we are talking about is discussed here http://lists.openstack.org/pipermail/openstack-dev/2017-September/121722.html | 14:07 |
mwhahaha | then we can talk about ovb | 14:07 |
EmilienM | ok | 14:07 |
EmilienM | the patch that jaosorior sent isn't passing CI jobs | 14:07 |
openstackgerrit | yolanda.robla proposed openstack/tripleo-image-elements master: WIP: Use volumes for security hardened images https://review.openstack.org/499588 | 14:08 |
mwhahaha | #action drop alert on Bug 1713832 and ping storage/zaqar people to investigate | 14:08 |
openstack | bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] https://launchpad.net/bugs/1713832 - Assigned to Marios Andreou (marios-b) | 14:08 |
*** ebarrera has joined #tripleo | 14:08 | |
mwhahaha | https://bugs.launchpad.net/tripleo/+bug/1714361 apetrich any update? | 14:08 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 14:08 |
jaosorior | EmilienM: trying to reproduce it locally to fix it up. | 14:08 |
shardy | jpena: does the latest dlrn have a workaround for the lack of master branch releases? ^^ | 14:09 |
*** pradk has joined #tripleo | 14:09 | |
shardy | mwhahaha: when I looked earlier there is a pending patch for mistral to release the bugfix, but AFAICT it's a general issue so we need to fix it for everything? | 14:09 |
mwhahaha | is this the missing semver patches to get versions bumped for queens | 14:09 |
jpena | shardy: it depends on the situation, more info needed :) | 14:10 |
* jrist really really wants this fixed | 14:10 | |
*** numans has joined #tripleo | 14:10 | |
*** ooolpbot has joined #tripleo | 14:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 14:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713659 | 14:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713832 | 14:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 14:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 14:10 |
openstack | Launchpad bug 1713659 in tripleo "Rabbitmq class expects ports as integers, not strings" [Critical,Triaged] - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles) | 14:10 |
shardy | mwhahaha: yeah AFAIK that's the issue here | 14:10 |
*** ooolpbot has quit IRC | 14:10 | |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 14:10 |
jrist | #link https://review.openstack.org/#/c/469608/ | 14:10 |
shardy | and I don't think it's restricted to mistral | 14:10 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 14:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 14:10 |
mwhahaha | the puppet modules we patched for the next version so those are good | 14:10 |
mwhahaha | this would be all the openstack services where they need a semver patch to bump numbers for the next cycle | 14:10 |
*** oidgar has joined #tripleo | 14:10 | |
shardy | jpena: apevec mentioned there may be a dlrn workaround for https://bugs.launchpad.net/tripleo/+bug/1714361 - there's a ML thread linked from it that has more context | 14:10 |
mwhahaha | if i recall this is always a struggle at the start of the cycle | 14:11 |
jpena | shardy: oh, I think I know what you mean. Having newer tags in stable/pike than master causes trouble for upgrades | 14:11 |
EmilienM | see https://review.openstack.org/#/q/owner:%22Emilien+Macchi+%253Cemilien%2540redhat.com%253E%22+semver | 14:11 |
shardy | basically we end up installing the old version from delorean-pike-testing | 14:11 |
EmilienM | that's what I did the last time to fix this up | 14:11 |
shardy | jpena: yeah | 14:11 |
EmilienM | I can do it again if you want but it's a fight :D | 14:11 |
*** mbeierl has joined #tripleo | 14:11 | |
shardy | mwhahaha: yeah, it'd be good to discuss again with the release team | 14:12 |
jpena | so apevec had a (partial) fix for review, but it was never complete and he has to revamp it | 14:12 |
mwhahaha | or should we make upgrade jobs non voting until m1 | 14:12 |
shardy | but is it only upgrade jobs? | 14:12 |
mwhahaha | in theory it should be | 14:12 |
EmilienM | yes | 14:12 |
EmilienM | I think? | 14:12 |
oidgar | o/ | 14:12 |
mwhahaha | or is it a stable/pike issue | 14:12 |
shardy | Not sure it is, the way we layer the repos means that I think all jobs are affected | 14:12 |
EmilienM | mwhahaha: upgrade jobs won't vote on master for now, we have no process to upgrade containers to containers | 14:13 |
shardy | perhaps we need to double check that though | 14:13 |
EmilienM | but we might want to enable voting upgrade jobs on stable/pike once they work | 14:13 |
shardy | like, how did that puppet-rabbitmq patch get promoted? | 14:13 |
mwhahaha | ok sounds like we need more investigation on the exact impact of the versions | 14:13 |
mwhahaha | anyone want to volunteer for that | 14:13 |
* mwhahaha watches as everyone disappears | 14:13 | |
trown | shardy: puppet modules we pull from master | 14:14 |
shardy | the logs I linked from the bug show the undercloud job using wrong versions | 14:14 |
gfidente | non-openstack only I suppose | 14:14 |
trown | shardy: they are treated like tripleo projects in that regard | 14:14 |
*** shreshtha has joined #tripleo | 14:14 | |
shardy | trown: Ah yeah I forgot it's not only puppet-openstack modules | 14:14 |
gfidente | eg. puppet-ceph was tested against tripleo | 14:14 |
gfidente | but was more an exception | 14:15 |
mwhahaha | ok so we need to move on, can anyone devote some time to investigate this more? | 14:15 |
trown | though we probably want to rethink that... I think we should only include packages that have not passed promote that we have a gate on | 14:15 |
EmilienM | trown: indeed | 14:16 |
shardy | I'm happy to help, but am currently unclear if we're going after a dlrn fix or a fight to release all the things approach | 14:16 |
Tengu | jaosorior: small question: in order to get Public Endpoint TLS, I only have to include the following env: openstack-tripleo-heat-templates/environments/ssl/tls-endpoints-public-dns.yaml and openstack-tripleo-heat-templates/environments/enable-tls.yaml right? Nothing more, nothing less ? | 14:16 |
*** shreshtha has quit IRC | 14:16 | |
shardy | the other approach is a whitelist for delorean-pike-testing | 14:16 |
mwhahaha | i think we need to figure out what is actually happening in the jobs before trying to come up with the fix. it seems to be several possible problems | 14:16 |
*** shreshtha has joined #tripleo | 14:16 | |
shardy | mwhahaha: Ok, well I already added some analysis to the bug but will do another pass looking at the different types of job | 14:17 |
*** shreshtha has quit IRC | 14:17 | |
mwhahaha | #action shardy to look into version impacts in ci jobs related to Bug 1714361 | 14:17 |
openstack | bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] https://launchpad.net/bugs/1714361 - Assigned to Adriano Petrich (apetrich) | 14:17 |
mwhahaha | https://bugs.launchpad.net/tripleo/+bug/1714905 | 14:17 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 14:17 |
*** shreshtha has joined #tripleo | 14:17 | |
mwhahaha | mandre: any update? | 14:17 |
EmilienM | I've been working on this one last night | 14:17 |
Tengu | jaosorior: sorry, tripleo-heat-templates/environments/ssl/enable-tls.yaml - the other one is deprecated - I've updated earlier and it didn't activate TLS :( | 14:18 |
EmilienM | i've pushed https://review.openstack.org/#/c/500671/ (which is partially related) | 14:18 |
EmilienM | sshnaidm: might need your help on this one if you can | 14:18 |
mandre | I need to sync up with EmilienM on this one, as it may be a few bugs mixed in together | 14:18 |
jaosorior | Tengu: depends; if you have to get the nodes to trust a CA, then you need to add another environment. | 14:18 |
EmilienM | on the bright side, I saw an upgrade job passing from ocata to pike on stable/pike | 14:19 |
mwhahaha | ok sounds like progress | 14:19 |
EmilienM | (the only one I think) 2 days ago | 14:19 |
sshnaidm | EmilienM, sure | 14:19 |
EmilienM | yes little progress | 14:19 |
mwhahaha | moving on | 14:19 |
gfidente | EmilienM curious if you got pingtest to pass scenario004/containers after moving manila-share to pacemaker? | 14:19 |
EmilienM | sshnaidm: let's sync after meeting | 14:19 |
sshnaidm | ok | 14:19 |
mwhahaha | https://bugs.launchpad.net/tripleo/+bug/1708832 | 14:19 |
openstack | Launchpad bug 1708832 in tripleo "DLRN build failures in gate" [High,In progress] - Assigned to wes hayutin (weshayutin) | 14:19 |
EmilienM | gfidente: yes and tempest as well | 14:19 |
gfidente | whoohooo | 14:19 |
openstackgerrit | Juan Antonio Osorio Robles proposed openstack/instack-undercloud master: Use integer for rabbitmq port and specify management IP https://review.openstack.org/500798 | 14:19 |
mwhahaha | anyone driving that since weshay_PTO is not around | 14:19 |
Tengu | jaosorior: not needed, public TLS will be signed by LE. | 14:19 |
jaosorior | Tengu: then that's it. | 14:19 |
*** jchhatbar has joined #tripleo | 14:19 | |
Tengu | hmm. | 14:19 |
Tengu | weid. | 14:19 |
Tengu | weird. | 14:19 |
EmilienM | this one should be closed | 14:20 |
mwhahaha | looks like dmsimard|afk had a patch that was merged, is it still open? | 14:20 |
adarazs | mwhahaha: that should be fixed. | 14:20 |
EmilienM | the fix was https://review.openstack.org/#/c/498074/ | 14:20 |
*** janki has quit IRC | 14:20 | |
EmilienM | and folks forgot to update launchpad :/ | 14:20 |
mwhahaha | ok looks like we need to set that bug to fix released then | 14:20 |
mwhahaha | https://bugs.launchpad.net/tripleo/+bug/1713127 | 14:20 |
openstack | Launchpad bug 1713127 in tripleo "tripleo fails to deploy in ci : Failed to call refresh: /usr/bin/clustercheck" [High,Triaged] | 14:20 |
mwhahaha | bandini: any thoughts -^ | 14:20 |
Tengu | jaosorior: though I have an idea: I suspect some receipt check for the existence of SSLCertificate and SSLKey - I didn't set them up, as the certificate will be generated on the (now unique) controller (I've overridden the OS::TripleO::NodeTLSData: resource) | 14:21 |
EmilienM | mwhahaha: 1708832 closed | 14:21 |
jaosorior | Tengu: you didn't? | 14:21 |
Tengu | jaosorior: nope. because the certificate isn't loaded like that :). | 14:22 |
EmilienM | mwhahaha: let's move forward | 14:22 |
mwhahaha | k i'm not sure that bug needs an alert | 14:23 |
mwhahaha | looking at it it seemed to be package downloads | 14:23 |
mwhahaha | will look at it more later | 14:23 |
mwhahaha | ok moving on | 14:23 |
mwhahaha | do we have a bug for the ovb failures (possibly related to rabbitmq) | 14:23 |
jaosorior | Tengu: you'll have issues: we explicitly check if it's set https://github.com/openstack/tripleo-heat-templates/blob/master/puppet/services/haproxy.yaml#L83 and set the necessary value if it is https://github.com/openstack/tripleo-heat-templates/blob/master/puppet/services/haproxy.yaml#L135 | 14:23 |
EmilienM | mwhahaha: no alert is needed | 14:23 |
mwhahaha | no, alert is needed or no alert is needed | 14:23 |
EmilienM | mwhahaha: https://bugs.launchpad.net/tripleo/+bug/1713659 | 14:23 |
openstack | Launchpad bug 1713659 in tripleo "Rabbitmq class expects ports as integers, not strings" [Critical,In progress] - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles) | 14:23 |
jaosorior | Tengu: you can work around it by creating something similar to this: https://github.com/openstack/tripleo-heat-templates/blob/master/puppet/services/haproxy-public-tls-certmonger.yaml but that merely just sets the path for tripleo::haproxy::service_certificate | 14:24 |
EmilienM | mwhahaha: for the clustercheck, no alert is needed | 14:24 |
mwhahaha | k | 14:24 |
*** ratailor has joined #tripleo | 14:24 | |
EmilienM | https://bugs.launchpad.net/tripleo/+bugs?field.tag=alert | 14:24 |
EmilienM | we have 3 alerts now, how does it look? | 14:24 |
*** ratailor has quit IRC | 14:24 | |
mwhahaha | better | 14:25 |
EmilienM | I'm not sure https://bugs.launchpad.net/tripleo/+bug/1714361 deserves an alert | 14:25 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 14:25 |
*** beekneemech is now known as bnemec | 14:25 | |
*** ratailor has joined #tripleo | 14:25 | |
EmilienM | it's a known issue and someone has to take some actions on it | 14:25 |
EmilienM | it doesn't block any gate, afaik? | 14:25 |
mwhahaha | shardy is going to take a look at it, we can discuss more after the meeting | 14:25 |
shardy | well it might mean we're not testing anything properly for master branches | 14:25 |
mwhahaha | any other CI status items? | 14:25 |
shardy | but we can remove the alert and discuss as we investigate | 14:26 |
Tengu | jaosorior: so if I set tripleo::haproxy::service_certificate for example in hieradata, it will work as expected? The path is the very same the "original" script provides | 14:26 |
*** nyechiel has joined #tripleo | 14:26 | |
*** nyechiel_ has quit IRC | 14:26 | |
mwhahaha | k moving on | 14:26 |
mwhahaha | #topic bugs | 14:26 |
mwhahaha | #link https://launchpad.net/tripleo/+milestone/pike-rc2 | 14:26 |
*** openstack changes topic to "bugs (Meeting topic: tripleo)" | 14:26 | |
EmilienM | I think we covered critical bugs | 14:27 |
EmilienM | (or the most criticals) | 14:27 |
mwhahaha | any other bugs to discuss? | 14:27 |
EmilienM | did we miss something? | 14:27 |
jaosorior | Tengu: correct. | 14:27 |
larsks | I just want to ask about https://bugs.launchpad.net/tripleo/+bug/1713240. We have a fix available with one +2, but could use another set of eyes... | 14:27 |
openstack | Launchpad bug 1713240 in tripleo "Fluentd configuration not correctly written to disk" [High,In progress] - Assigned to Lars Kellogg-Stedman (larsks) | 14:27 |
shardy | we have 200 bugs targeted to rc2 | 14:27 |
shardy | can we start deferring things that aren't release blockers, to help focus review attention? | 14:28 |
*** ratailor has quit IRC | 14:28 | |
Tengu | jaosorior: good to hear :). I can do that in the puppet-stack-config-fix.yaml I already have for some workarounds. | 14:28 |
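The hieradata workaround agreed on above might look roughly like the following environment file. This is only a sketch: `ControllerExtraConfig` is the tripleo-heat-templates parameter for per-role hieradata, but the certificate path shown is an assumption (the log only says it matches the path the original script provides), so substitute whatever path the deployment actually writes.

```yaml
# Hypothetical environment file, passed with -e on the overcloud deploy command.
parameter_defaults:
  ControllerExtraConfig:
    # Assumed path; the conversation does not state the actual value.
    tripleo::haproxy::service_certificate: /etc/pki/tls/private/overcloud_endpoint.pem
```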
mwhahaha | shardy: yea makes sense | 14:28 |
mwhahaha | i'll take a look this week | 14:28 |
mwhahaha | i was already planning on doing some bug work | 14:28 |
mwhahaha | #action mwhahaha to retarget rc2 bugs if not release critical | 14:29 |
shardy | ack, and I guess all folks doing triage or reporting bugs, please use queens-1 unless it's a blocker | 14:29 |
EmilienM | mwhahaha: I'll share my commands with you :D I use openstack release tools | 14:29 |
mwhahaha | any other bug related items? | 14:29 |
EmilienM | maybe we can move everything to Queens-1 with the script and manually move back to pike-rc2 what we actually want to solve this week | 14:29 |
larsks | mwhahaha: well, there's my question, if someone could commit to that or tell me to bug off or something... | 14:30 |
larsks | Any response, really. | 14:30 |
mwhahaha | larsks: we'll take a look afterwards, it looks ok and you already have 2 +2s | 14:30 |
mwhahaha | larsks: but ci is hosed so probably not a +A right now | 14:30 |
larsks | Ah, looks like that second +2 just landed. Thanks jaosorior! | 14:30 |
mwhahaha | #topic projects releases or stable backports | 14:31 |
jaosorior | larsks: it looked reasonable. But yeah, no +A because of the CI situation | 14:31 |
*** openstack changes topic to "projects releases or stable backports (Meeting topic: tripleo)" | 14:31 | |
larsks | No worries, thanks. | 14:31 |
EmilienM | so pabelanger mentioned about zuul v3 upgrade end of this week | 14:31 |
EmilienM | and suggested to release pike-rc2 by Thursday | 14:31 |
EmilienM | tbh, I don't think zuul upgrade really affects pike-rc2 release, it's just a tag we push | 14:31 |
mwhahaha | ok so it sounds like we need to get everything landed and CI fixed like today | 14:32 |
EmilienM | what we don't want is our CI more hosed I guess :D | 14:32 |
mwhahaha | well zuul upgrades usually break CI | 14:32 |
EmilienM | Sep 11 - Sep 15 (R+2) is the official limit for us to release final Pike | 14:32 |
EmilienM | so we have 9 days to release this final pike | 14:33 |
EmilienM | but next week is PTG so probably not the best time | 14:33 |
EmilienM | let's target this week and do our best to commit to it | 14:33 |
mwhahaha | indeed | 14:33 |
EmilienM | but let's keep this window of releasing next week open, just in case | 14:33 |
shardy | I think there's only 1 feature pending, so we could land that, release, then backport bugfixes and do an additional stable release after the GA? | 14:33 |
mwhahaha | which feature is still pending? | 14:34 |
shardy | unless we can identify release-blocker bugs of course | 14:34 |
EmilienM | shardy: yes, that's what we do usually. Good idea to do it again | 14:34 |
EmilienM | https://blueprints.launchpad.net/tripleo/+spec/websocket-logging | 14:34 |
EmilienM | there is one patch in instack-undercloud iiuc | 14:35 |
mwhahaha | k | 14:35 |
mwhahaha | https://review.openstack.org/#/c/469608/ | 14:35 |
mwhahaha | honza: are you going to be able to get that patch fixed up today or tomorrow? | 14:35 |
shardy | yeah honza can confirm but I think it was blocked on the old mistral in CI issue | 14:35 |
jpich | Yes, it's blocked due to one of the CI issues mentioned earlier IIUC, the one apetrich is working on | 14:36 |
mwhahaha | https://bugs.launchpad.net/tripleo/+bug/1714361 | 14:36 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 14:36 |
mwhahaha | so it looks like we need to figure that one out then | 14:36 |
mwhahaha | any other stable backports or release issues? | 14:37 |
mwhahaha | moving on | 14:37 |
mwhahaha | #topic specs / blueprints | 14:37 |
mwhahaha | #link https://review.openstack.org/#/q/project:openstack/tripleo-specs+status:open | 14:37 |
*** openstack changes topic to "specs / blueprints (Meeting topic: tripleo)" | 14:37 | |
*** links has quit IRC | 14:37 | |
mwhahaha | ptg is next week so a reminder to get your specs out there so we can discuss next week | 14:38 |
EmilienM | I think there is one topic related to specs in today's items | 14:38 |
mwhahaha | yup, so let's move on to that | 14:38 |
EmilienM | lyarwood: ^ | 14:38 |
mwhahaha | #topic one off agenda items | 14:38 |
mwhahaha | #link https://etherpad.openstack.org/p/tripleo-meeting-items | 14:38 |
*** openstack changes topic to "one off agenda items (Meeting topic: tripleo)" | 14:38 | |
*** mnaser has joined #tripleo | 14:38 | |
mwhahaha | lyarwood: the floor is yours | 14:38 |
lyarwood | WIP Skip level upgrade spec underway, reviews & comments welcome, also looking for a slot to discuss this at the PTG next week. | 14:38 |
lyarwood | https://review.openstack.org/#/c/497257/ | 14:39 |
lyarwood | ^ just added that to the agenda as a heads up ahead of the PTG next week | 14:39 |
EmilienM | let's look at the agenda https://etherpad.openstack.org/p/tripleo-ptg-queens | 14:39 |
EmilienM | Thursday, after Ceph integration future | 14:39 |
lyarwood | EmilienM: if there's time that would be great | 14:39 |
EmilienM | after 3.30pm | 14:40 |
EmilienM | there is no session after | 14:40 |
EmilienM | lyarwood: 40 min is good? | 14:40 |
EmilienM | 1h? | 14:40 |
lyarwood | EmilienM: 1h, I think the ceph session is until 16:30 no? | 14:40 |
*** mrch has quit IRC | 14:40 | |
EmilienM | https://calendar.google.com/calendar/embed?src=c1g5npdrsd3p37ods24s19gg0g%40group.calendar.google.com&ctz=America/Vancouver | 14:41 |
EmilienM | ah it changed, ok | 14:41 |
fultonj | 15.30 to 16.30 as per https://etherpad.openstack.org/p/tripleo-ptg-queens line 132 | 14:41 |
EmilienM | lyarwood: 16.30 to 17.15 is good? | 14:41 |
lyarwood | EmilienM: yup that would work | 14:41 |
EmilienM | ah, my calendar is in PST | 14:41 |
EmilienM | sorry | 14:41 |
*** paramite has quit IRC | 14:42 | |
EmilienM | I created your session, we're good | 14:42 |
lyarwood | EmilienM: thanks! | 14:42 |
mwhahaha | cool, any other topics? | 14:42 |
sshnaidm | yes | 14:43 |
sshnaidm | can we define which bugs deserve alerts? | 14:43 |
sshnaidm | because if many of them are alerting, it turns to be noise which people ignore | 14:43 |
sshnaidm | I know to set alerts on CI gate and promotion blockers | 14:43 |
sshnaidm | do we have any other criteria? | 14:43 |
*** udesale has joined #tripleo | 14:44 | |
*** ioggstream has joined #tripleo | 14:44 | |
ioggstream | gfidente: happy birthday! | 14:44 |
mwhahaha | I believe that to be the criteria | 14:44 |
EmilienM | sshnaidm: I agree it has been quite verbose this time but I also found it useful to make visible what's blocking the production chain | 14:44 |
gfidente | ioggstream++ :D love | 14:45 |
sshnaidm | EmilienM, ok, so production chain blockers too? | 14:45 |
*** jtomasek has quit IRC | 14:45 | |
*** jchhatbar_ has joined #tripleo | 14:46 | |
mwhahaha | so production chain blockers include ci gates/promotion issues | 14:46 |
*** jlabarre has quit IRC | 14:46 | |
EmilienM | yeah | 14:46 |
*** jtomasek has joined #tripleo | 14:46 | |
EmilienM | the problem is that people do blind rechecks | 14:46 |
EmilienM | it wastes CI resources and it doesn't actually solve any problem | 14:46 |
EmilienM | pabelanger did a huge amount of work in setting up logstash queries | 14:47 |
EmilienM | but now it's our turn to do it | 14:47 |
EmilienM | i've set alert tags on some bugs where we had a huge amount of hits in logstash | 14:47 |
sshnaidm | EmilienM, I doubt logstash helps to avoid rechecks.. | 14:47 |
EmilienM | because I think we want people to stop doing recheck and start helping more in our CI system | 14:47 |
*** udesale has quit IRC | 14:47 | |
*** marios has quit IRC | 14:47 | |
*** udesale has joined #tripleo | 14:48 | |
mwhahaha | those bugs affect the ci gates which is why it would be a warranted alert | 14:48 |
*** marios has joined #tripleo | 14:48 | |
mwhahaha | generally if it's alerting folks need to be looking at it and it is not noise | 14:48 |
therve | EmilienM, FWIW I made some progress on https://bugs.launchpad.net/tripleo/+bug/1713832 | 14:48 |
openstack | Launchpad bug 1713832 in tripleo "Object PUT failed for zaqar_subscription" [Critical,In progress] - Assigned to Marios Andreou (marios-b) | 14:48 |
mwhahaha | if you're ignoring the alerting bugs you're not helping | 14:48 |
EmilienM | therve: great :) | 14:48 |
therve | I don't know if it's blocking stuff, but I know of a possible quick fix if that helps | 14:48 |
*** jchhatbar has quit IRC | 14:49 | |
*** jchhatbar_ has quit IRC | 14:49 | |
EmilienM | therve: awesome | 14:49 |
*** udesale has quit IRC | 14:49 | |
*** dmarlin has joined #tripleo | 14:49 | |
shardy | there are still some generic issues causing lots of rechecks, like all the OVB jobs are really close to timing out | 14:49 |
EmilienM | therve: should we stop using swift backend? | 14:50 |
shardy | We'll have to look at ways to speed them up, or we'll be forced to reduce the coverage | 14:50 |
EmilienM | therve: is it safe to release final pike with swift backend for zaqar if it's racy? | 14:50 |
therve | EmilienM, I don't think that's possible at that point :/ | 14:50 |
shardy | e.g now we have scenarios, perhaps we don't need the ovb jobs to deploy so many services? | 14:50 |
shardy | that could save some time | 14:50 |
therve | We don't have mongo anymore, and I don't think redis is there either | 14:50 |
sshnaidm | right, when I get a timeout message from elastic recheck, it doesn't help me at all - it could be a bug, an infra problem or just bad luck | 14:50 |
EmilienM | shardy: yes, we already said OVB jobs should deploy minimal services (nova glance keystone neutron) | 14:51 |
sshnaidm | shardy, when moving ovb jobs to 3rd party we don't need to limit them to 180 minutes | 14:51 |
shardy | Yeah either would work I guess | 14:51 |
EmilienM | someone needs to spend time on looking at what ovb is deploying today and try to reduce it | 14:52 |
dprince | sshnaidm: do you really want a CI job that takes longer than 180 minutes though? | 14:52 |
shardy | but there's many examples like https://review.openstack.org/#/c/450708/ where we've rechecked for $weeks due to slow jobs timing out | 14:52 |
EmilienM | anyone can take this work? ^ | 14:52 |
EmilienM | dprince: I agree, we don't want ovb more than 180 min for sure | 14:52 |
marios | abishop: o/ thanks for checking i just revoted at https://review.openstack.org/#/c/496921/3 | 14:52 |
sshnaidm | dprince, it's better to take 240 mins than do recheck and waste 180*2=360 mins | 14:53 |
shardy | EmilienM: I'll try to push a wip patch with a smaller ControllerServices list, at least for the HA job | 14:53 |
shardy | I guess we don't really need many of the services deployed there at all, now that few are managed by pacemaker | 14:53 |
EmilienM | shardy: thanks | 14:53 |
therve | dprince, Yeah the "C" in "CI" is not super compatible with 3h jobs. Maybe Perpetual Integration instead | 14:53 |
*** aditya_r has quit IRC | 14:54 | |
dprince | yeah, with jobs longer than two hours it really means you have 2 or 3 changes per day, which gets kind of sad... and ends up "camping out" on limited resources too long | 14:54 |
*** oidgar has quit IRC | 14:54 | |
jfrancoa | EmilienM: I was also doing some work disabling some unused services (and I ran into deeper problems in fact) https://review.openstack.org/#/c/499182/, but I can try to help in making the CI jobs lighter | 14:54 |
mwhahaha | ok we have about 5 mins left, do we have anything else to talk about? | 14:54 |
*** aditya_r has joined #tripleo | 14:54 | |
*** aufi has quit IRC | 14:55 | |
larsks | I have a quick question... | 14:55 |
EmilienM | jfrancoa: you can probably pair with shardy | 14:55 |
mwhahaha | larsks: what's up? | 14:55 |
larsks | I want to clean up the fluentd service implementation, because it predates service_config_settings and is unnecessarily invasive because of that. | 14:55 |
abishop | marios: many thx! | 14:55 |
EmilienM | #action shardy to look at how to reduce # of services deployed on ovb | 14:55 |
sshnaidm | mwhahaha, maybe to add action item about testing "limited" ovb jobs | 14:55 |
sshnaidm | EmilienM, you were first) | 14:56 |
larsks | This is obviously a queens only thing. Would it be appropriate to submit these changes now (ish)? | 14:56 |
EmilienM | anyone can take actions, btw | 14:56 |
*** aufi has joined #tripleo | 14:56 | |
sshnaidm | ok | 14:56 |
mwhahaha | larsks: yea you can | 14:56 |
EmilienM | larsks: yes you can but no backports | 14:56 |
larsks | EmilienM: right, hence "queens only thing" :) | 14:56 |
EmilienM | queens cycle has been open! | 14:56 |
EmilienM | though any help to release pike is more than welcome | 14:57 |
larsks | Just wanted to make sure. I know that folks are busy right now with the pike release. | 14:57 |
EmilienM | kinda | 14:57 |
shardy | larsks: it'd be good to land any bugfixes before the refactor, but otherwise sounds good to me | 14:57 |
mwhahaha | larsks: do try and consistently tag it with a bug or something so we can keep an eye on the changes together. Since they are invasive they probably won't be merged for a while | 14:57 |
mwhahaha | but it would be good to get them out early at least for initial reviews | 14:57 |
EmilienM | we should probably target m1 though | 14:57 |
EmilienM | since it's invasive | 14:57 |
larsks | Okay, thanks all. | 14:57 |
larsks | Many of the cleanup changes won't be. Just the stuff that pulls things out of common/services.yml. | 14:58 |
mwhahaha | ok anything else? | 14:58 |
shardy | larsks: ah, I was wondering about removing some of those, will be happy to help with reviews etc when you're ready | 14:58 |
*** jlabarre has joined #tripleo | 14:58 | |
mwhahaha | ok thanks everyone | 14:59 |
mwhahaha | #endmeeting | 14:59 |
larsks | shardy: I will tag you on the reviews as they go up. | 14:59 |
*** openstack changes topic to "CI Status: Orange - see alerts | TripleO | https://docs.openstack.org/tripleo-docs/latest/ | http://tripleo.org" | 14:59 | |
openstack | Meeting ended Tue Sep 5 14:59:08 2017 UTC. Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4) | 14:59 |
openstack | Minutes: http://eavesdrop.openstack.org/meetings/tripleo/2017/tripleo.2017-09-05-14.00.html | 14:59 |
openstack | Minutes (text): http://eavesdrop.openstack.org/meetings/tripleo/2017/tripleo.2017-09-05-14.00.txt | 14:59 |
openstack | Log: http://eavesdrop.openstack.org/meetings/tripleo/2017/tripleo.2017-09-05-14.00.log.html | 14:59 |
mnaser | now that's done.. i was hoping to get some eyes on two oooq failures (which i would love to troubleshoot but have no idea how to): https://review.openstack.org/#/c/497320/ | 14:59 |
mnaser | is that related to the pending release (stable/pike change) or a known issue? | 14:59 |
ccamacho | hey gfidente congrats man! https://i.imgflip.com/dew09.jpg | 15:00 |
gfidente | ccamacho ahahah | 15:00 |
gfidente | you actually read the chan | 15:00 |
gfidente | ccamacho++ :D thanks | 15:00 |
ccamacho | heheheeh | 15:01 |
mwhahaha | mnaser: undercloud install broke, we have a known issue that we're looking at | 15:01 |
mwhahaha | mnaser: http://logs.openstack.org/20/497320/2/check/gate-tripleo-ci-centos-7-scenario001-multinode-oooq-puppet/4541838/logs/undercloud/home/jenkins/undercloud_install.log.txt.gz#_2017-09-05_14_11_30 | 15:01 |
larsks | shardy: would it make sense to register a blueprint for the fluentd cleanup? It's going to span both t-h-t and puppet-tripleo. | 15:02 |
mnaser | mwhahaha i was looking for ages for that undercloud_install.log file but there it is, now i know, thanks :p | 15:02 |
EmilienM | sshnaidm, mandre: in a mtg now - let's talk about upgrades later | 15:03 |
*** artom_ has joined #tripleo | 15:04 | |
openstackgerrit | Tim Rozet proposed openstack/tripleo-quickstart master: Updates OpenDaylight feature set 31 to use tempest https://review.openstack.org/500872 | 15:05 |
*** dhill|brb has quit IRC | 15:05 | |
*** ccamacho has quit IRC | 15:06 | |
*** ebarrera has quit IRC | 15:06 | |
*** dhill_ has joined #tripleo | 15:07 | |
*** Vijayendra has quit IRC | 15:07 | |
openstackgerrit | Tim Rozet proposed openstack/tripleo-heat-templates master: Add OPNFV scenario environment https://review.openstack.org/486905 | 15:07 |
tbarron | https://review.openstack.org/#/c/499111/ is ready for +W (stable/ocata) | 15:07 |
shardy | larsks: sure, I'd probably just go with a bug tho as it's easier if any discussion needs persisting | 15:10 |
*** ooolpbot has joined #tripleo | 15:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 15:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713659 | 15:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 15:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 15:10 |
*** ooolpbot has quit IRC | 15:10 | |
openstack | Launchpad bug 1713659 in tripleo "Rabbitmq class expects ports as integers, not strings" [Critical,In progress] - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles) | 15:10 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 15:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 15:10 |
tbarron | EmilienM: thanks | 15:10 |
shardy | some way to track the patches sounds good though | 15:10 |
*** mcornea has quit IRC | 15:10 | |
jaosorior | chandankumar: ping | 15:17 |
chandankumar | jaosorior: pong | 15:17 |
jaosorior | chandankumar: you recently asked about the novajoin tempest plugin | 15:17 |
openstackgerrit | mathieu bultel proposed openstack/tripleo-docs master: Update documentation for O to P upgrade https://review.openstack.org/496223 | 15:18 |
jaosorior | chandankumar: some folks from my team are taking it | 15:18 |
*** oidgar has joined #tripleo | 15:18 | |
chandankumar | jaosorior: awesome | 15:18 |
*** ykarel has quit IRC | 15:19 | |
jaosorior | chandankumar: can you pass again the link to the puppet-tempest example patch? | 15:19 |
*** iranzo has quit IRC | 15:20 | |
chandankumar | jaosorior: can you point me the right person so that i can guide him | 15:20 |
chandankumar | jaosorior: sure | 15:21 |
EmilienM | sshnaidm: do you have an idea for https://review.openstack.org/#/c/500671/ ? if yes, can you push over maybe? so we can try to make progress today (I'm stuck in meeting now) | 15:21 |
sshnaidm | EmilienM, in retro mtg now, will ping you | 15:21 |
EmilienM | gfidente: do we want https://review.openstack.org/#/c/489168/ in stable/pike? | 15:22 |
EmilienM | shardy: ditto for https://review.openstack.org/#/c/500585/ ? | 15:22 |
*** shreshtha_ has joined #tripleo | 15:22 | |
EmilienM | sshnaidm: ok np | 15:22 |
*** cylopez has quit IRC | 15:22 | |
*** Vijayendra has joined #tripleo | 15:23 | |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Set mode for ansible written files https://review.openstack.org/500884 | 15:23 |
gfidente | EmilienM looks sane to me | 15:23 |
EmilienM | shardy: I proposed, you can vote when you have time ^ | 15:23 |
shardy | EmilienM: I think that one is a release blocker because it's security related | 15:23 |
shardy | EmilienM: ah thanks, I was about to do it :) | 15:24 |
shardy | EmilienM: thanks, I'll approve when it passes CI | 15:24 |
*** shreshtha has quit IRC | 15:24 | |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-common stable/pike: Derive params network config stack exists fix https://review.openstack.org/500888 | 15:29 |
*** shreshtha_ has quit IRC | 15:31 | |
openstackgerrit | John Fulton proposed openstack/tripleo-heat-templates master: Add CephExternal role for ceph-ansible https://review.openstack.org/499627 | 15:33 |
*** dtrainor_ has joined #tripleo | 15:34 | |
*** dtrainor has quit IRC | 15:38 | |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-quickstart master: ocata2pike: add missing current-pike repo https://review.openstack.org/500671 | 15:38 |
EmilienM | sshnaidm: I pushed a patchset, so we use current for the tripleo projects (I think we want current (instead of consistent) but I'm maybe wrong, feel free to push over, I'm travelling today) | 15:39 |
*** anshul has quit IRC | 15:40 | |
dhill_ | gfidente, would you know, by any chance, how to make an overcloud deployment to go through step1 ? | 15:40 |
dhill_ | gfidente, I have a customer that tried to add manila/netapp cinder backend and now their stack is stuck at step1 | 15:41 |
*** yprokule has quit IRC | 15:41 | |
*** Vijayendra has quit IRC | 15:42 | |
* gfidente routing elsewhere | 15:42 | |
openstackgerrit | John Trowbridge proposed openstack/tripleo-quickstart master: Default update_images to false https://review.openstack.org/500898 | 15:43 |
sshnaidm | EmilienM, looking | 15:43 |
*** lucasagomes is now known as lucas-afk | 15:46 | |
shardy | mwhahaha: Hey did you already look at wiring in roles generate via quickstart? | 15:46 |
mwhahaha | shardy: no not yet | 15:46 |
mwhahaha | shardy: but we should | 15:46 |
shardy | I was thinking that's another CI cleanup - we shouldn't render resources for roles we don't deploy | 15:47 |
shardy | probably not a huge time benefit but it'll all help I guess | 15:47 |
*** agurenko has quit IRC | 15:47 | |
mwhahaha | the awkward thing is that in our ci we're overriding the services list | 15:47 |
mwhahaha | so we aren't really using proper roles | 15:47 |
EmilienM | we could change that ^ | 15:47 |
mwhahaha | that is something we should definitely clean up in queens | 15:47 |
shardy | mwhahaha: that's OK isn't it, the *Services parameters can override the roles_data defaults? | 15:47 |
shardy | but we still don't need to render e.g BlockStorage and ObjectStorage on roles which don't use them (which I think is basically all CI jobs) | 15:48 |
*** nyechiel has quit IRC | 15:49 | |
shardy | mwhahaha: my original idea with roles_data.yaml was for the service lists to be optional - the operator can still pass *Services as parameters if they want | 15:49 |
shardy | I guess we could also have a tool that generates a role from a list of services, but that seems like it's more complex than just passing a parameter to override the defaults to me | 15:49 |
shardy | anyway, open to ideas on it :) | 15:50 |
mwhahaha | well with the containers we do have that list | 15:50 |
mwhahaha | or at least some of that list | 15:50 |
mwhahaha | but yea we can reduce the roles_data.yaml to not include the extra roles we don't use | 15:50 |
mwhahaha | that would be a start | 15:51 |
shardy | Yeah, then we could either have a ControllerHAMinimal role, or a ControllerServices override that does the same | 15:51 |
shardy | featuresets don't really define roles or services atm, so I guess we'll need to change that in quickstart either way | 15:52 |
*** artom_ is now known as artom | 15:53 | |
Tengu | jaosorior: thanks for the tip, I now have TLS enabled as expected. | 15:53 |
mwhahaha | yea | 15:53 |
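The `ControllerServices` override discussed above could be sketched as an environment file like this one. The service names are real `OS::TripleO::Services::*` entries, but the selection is purely illustrative and deliberately incomplete: it is not a tested minimal set, and a working HA deployment would need further services that the conversation does not enumerate.

```yaml
# Hypothetical environment file overriding the Controller role's default
# service list from roles_data.yaml. Illustrative only, not a tested set.
parameter_defaults:
  ControllerServices:
    - OS::TripleO::Services::Keystone
    - OS::TripleO::Services::GlanceApi
    - OS::TripleO::Services::NovaApi
    - OS::TripleO::Services::NeutronApi
    - OS::TripleO::Services::MySQL
    - OS::TripleO::Services::RabbitMQ
    - OS::TripleO::Services::HAproxy
    - OS::TripleO::Services::Pacemaker
```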
*** cylopez has joined #tripleo | 15:53 | |
*** cylopez has quit IRC | 15:53 | |
Tengu | (and a Let's Encrypt SSL certificate, in addition :)) | 15:54 |
jaosorior | Tengu: great! :D | 15:54 |
*** psachin has quit IRC | 15:55 | |
Tengu | jaosorior: next step: we'll plug 2 more controllers (meaning: full re-deploy), and my job will be to enable TLS with the very same certificate on the three of them, using the service we spoke about earlier, running on the undercloud VM. | 15:55 |
*** aditya_r has quit IRC | 15:55 | |
Tengu | and that should just do it as expected. | 15:55 |
*** stendulker has joined #tripleo | 15:55 | |
*** aditya_r has joined #tripleo | 15:56 | |
Tengu | for now I'm using "staging LE" certificates as I didn't want to get blacklisted due to some thresholds, but fact is: mono-node is working just fine. | 15:56 |
*** ykarel has joined #tripleo | 16:00 | |
*** Vijayendra has joined #tripleo | 16:00 | |
*** aufi has quit IRC | 16:01 | |
*** egonzalez has quit IRC | 16:02 | |
*** hamzy has quit IRC | 16:04 | |
*** thrash is now known as thrash|biab | 16:04 | |
*** dtantsur is now known as dtantsur|afk | 16:07 | |
*** marios has quit IRC | 16:09 | |
Tengu | jaosorior: is there a better way to push a file on the remote nodes than a OS::Heat::SoftwareConfig resource in heat with a plain script in it? | 16:09 |
*** ooolpbot has joined #tripleo | 16:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 16:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713659 | 16:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 16:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 16:10 |
*** ooolpbot has quit IRC | 16:10 | |
openstack | Launchpad bug 1713659 in tripleo "Rabbitmq class expects ports as integers, not strings" [Critical,In progress] - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles) | 16:10 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 16:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 16:10 |
*** numans has quit IRC | 16:13 | |
*** yamahata has quit IRC | 16:14 | |
*** jmelvin has quit IRC | 16:15 | |
jaosorior | Tengu: a swift artifact? | 16:15 |
*** numans has joined #tripleo | 16:15 | |
*** dtrainor_ has quit IRC | 16:16 | |
*** dtrainor_ has joined #tripleo | 16:16 | |
shardy | Tengu: https://hardysteven.blogspot.co.uk/2016/08/tripleo-deploy-artifacts-and-puppet.html | 16:17 |
shardy | Tengu: that has some notes about the swift artifacts approach, it basically stores a tarball in swift on the undercloud | 16:18 |
shardy | Tengu: and we have a script that runs on every node to download and unpack it | 16:18 |
*** jlinkes has quit IRC | 16:18 | |
shardy | the script also supports installing an RPM from an http location: | 16:19 |
shardy | https://github.com/openstack/tripleo-heat-templates/blob/master/puppet/deploy-artifacts.sh#L14 | 16:20 |
*** dtrainor_ has quit IRC | 16:21 | |
Tengu | hmmm ok. | 16:22 |
*** ioggstream has quit IRC | 16:22 | |
*** dtrainor_ has joined #tripleo | 16:22 | |
rwsu | if I wanted to deploy a containerized overcloud with tripleo-quickstart (featureset010), which release should I be using? master, master-tripleo-ci, pike? | 16:23 |
Tengu | shardy jaosorior is there a way to limit the deploy to only a profile, for instance "controller" ? | 16:24 |
shardy | Tengu: currently no, you'd still need an ExtraConfig template with SoftwareConfig for that | 16:24 |
Tengu | hmm ok. | 16:25 |
shardy | Tengu: adding such an interface wouldn't be hard though, if needed | 16:25 |
Tengu | :) | 16:25 |
shardy | Tengu: the primary use-case for that so far has been to sync developer puppet modules to all nodes | 16:25 |
Tengu | might be cool in order to avoid having a cron.daily script on nodes that don't need it | 16:25 |
shardy | Tengu: yeah, I don't think there would be any objections to enabling it, patches welcome :) | 16:26 |
Tengu | I'm still thinking about the way I'll share a certificate between the controllers, and although I'm pretty sure the script isn't hard to code, the script deployment is my main concern | 16:26 |
Tengu | shardy: patch wouldn't hit pike anyway I think? | 16:26 |
shardy | Tengu: there are already per-role NodeUserData interfaces, but they will only run once, via cloud-init | 16:27 |
shardy | Tengu: yeah it's too late for pike at this point | 16:27 |
Tengu | ^^ | 16:27 |
shardy | Tengu: there are host_prep_tasks which are per-service | 16:29 |
Tengu | hmm, I think I read that somewhere indeed | 16:29 |
Tengu | but here again it will be executed only once, won't it? | 16:29 |
shardy | Tengu: those are ansible tasks which could be made to run e.g only on the Controller, either by adding the task to an existing service, or creating a new one | 16:29 |
*** itlinux has joined #tripleo | 16:29 | |
shardy | Tengu: No it'll be run every update | 16:29 |
Tengu | hmmm | 16:30 |
Tengu | I like that | 16:30 |
Tengu | I might even love that in fact :) | 16:30 |
shardy | https://github.com/openstack/tripleo-heat-templates/blob/master/docker/services/database/mysql.yaml#L203 | 16:30 |
shardy | Tengu: example | 16:30 |
shardy | you can do anything ansible supports, but note it runs the tasks in localhost mode | 16:31 |
Tengu | so I'd need to do some download in order to get the script itself, right? | 16:31 |
Tengu | not that hard, I can provide it from the undercloud VM through the provisioning network | 16:31 |
shardy | https://github.com/openstack/tripleo-heat-templates/blob/master/common/deploy-steps.j2#L186 | 16:31 |
shardy | Tengu: no, you just add the tasks to the service template, we join them all together per-role | 16:32 |
shardy | then run them early in the deploy (before trying to start any services either via puppet on the host or in containers) | 16:32 |
Tengu | right, but as it's executed locally, I can't use a copy | 16:32 |
Tengu | I think that will do it. | 16:33 |
shardy | Tengu: Oh right, yeah you'd need to host whatever is downloaded somewhere, unless it's passed as a parameter into the service template | 16:33 |
shardy | which I think is how most of the certs are done atm | 16:33 |
Tengu | true. but in my case it will be a beefy script | 16:33 |
Tengu | (check/update/fetch custodia, request new cert, renew, and so on) | 16:34 |
Tengu | hence I think the best way would be to deliver it through a download. | 16:35 |
shardy | Tengu: sure, there is also a get_file function in the heat templates, which can reference a local file on the undercloud (or rather wherever you run the overcloud deploy command) | 16:36 |
*** sri_ has quit IRC | 16:36 | |
shardy | we use that in a few places to consume scripts which are then run either via SoftwareConfig or ansible tasks | 16:36 |
Tengu | shardy: so I create a new heat file, and point to it in the registry entry OS::TripleO::NodeTLSData ? | 16:36 |
Tengu | oh | 16:36 |
Tengu | get_file would be lovely | 16:36 |
* Tengu takes notes for tomorrow :) | 16:37 | |
*** ecerquei has joined #tripleo | 16:38 | |
*** gbarros has quit IRC | 16:38 | |
Tengu | thanks a lot :) | 16:39 |
Tengu | Hopefully I'll be able to get something clean with all your inputs. | 16:39 |
shardy | Tengu: Hmm, probably chat with jaosorior about NodeTLSData, that isn't a composable service but rather a special hook to enable TLS things | 16:39 |
*** agurenko has joined #tripleo | 16:39 | |
shardy | it may be useful to you, but it won't support the host_prep_tasks approach I mentioned | 16:39 |
Tengu | shardy: would it support the get_file ? | 16:40 |
jaosorior | Tengu: I don't see why you wouldn't be able to use NodeTLSData. It should be fine. | 16:40 |
shardy | Tengu: yeah any heat template will support get_file | 16:40 |
Tengu | ok, so get_file will it be :) | 16:40 |
Tengu | cleaner I think. | 16:40 |
shardy | Ok so yeah you could just have a special puppet/extraconfig/tls/tls-cert-inject.yaml alternative | 16:41 |
Tengu | I already have that right now for my mono-controller let's encrypt setup | 16:41 |
Tengu | (and it's actually working \o/) | 16:41 |
shardy | the config: | can be replaced with config: {get_file: path/to/foo.sh} | 16:41 |
Tengu | cool | 16:42 |
Tengu | and the path is relative to what? foo.yaml ? | 16:42 |
shardy | Tengu: Ok, sounds like that's the thing to iterate on then, put the host_prep_tasks suggestion aside for now :) | 16:42 |
shardy | Tengu: yeah it's either absolute or relative to the template it's defined in | 16:42 |
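A minimal sketch of the get_file suggestion, assuming a custom template that replaces puppet/extraconfig/tls/tls-cert-inject.yaml and is mapped to OS::TripleO::NodeTLSData in the resource_registry of an environment file. The resource names and the script name renew-cert.sh are hypothetical, and the parameters and outputs the real hook expects are only partly shown. Because the path is relative, the script is looked up next to this template.

```yaml
heat_template_version: pike

description: Hypothetical NodeTLSData alternative that ships a local script.

parameters:
  server:
    description: ID of the node to apply this config to
    type: string

resources:
  CertRenewConfig:                      # hypothetical resource name
    type: OS::Heat::SoftwareConfig
    properties:
      group: script
      # Replaces an inline "config: |" block; the file is read on the machine
      # where the overcloud deploy command is run.
      config: {get_file: renew-cert.sh}

  CertRenewDeployment:                  # hypothetical resource name
    type: OS::Heat::SoftwareDeployment
    properties:
      name: CertRenewDeployment
      config: {get_resource: CertRenewConfig}
      server: {get_param: server}

outputs:
  deploy_stdout:
    description: Output of the script run on the node
    value: {get_attr: [CertRenewDeployment, deploy_stdout]}
```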
Tengu | :) perfect | 16:43 |
* shardy tries to remember if get_file also works with URLs | 16:43 | |
*** jmelvin has joined #tripleo | 16:43 | |
Tengu | oh well, if I can just drop my script in my local directory, alongside the template... | 16:43 |
shardy | yup that should do it | 16:43 |
Tengu | no need to fetch over an HTTP server ;) | 16:43 |
Tengu | hence no need to configure a hacky vhost on the undercloud, hence less problems | 16:43 |
*** ecerquei has quit IRC | 16:44 | |
*** ecerquei has joined #tripleo | 16:45 | |
*** dparkes has quit IRC | 16:46 | |
*** jpich has quit IRC | 16:47 | |
*** dtrainor_ has quit IRC | 16:48 | |
*** sri_ has joined #tripleo | 16:48 | |
Tengu | last question, for jaosorior I think: the TLS endpoint is managed by haproxy only, right? Just want to ensure I don't need to reload any other service, like httpd, upon cert renewal | 16:48 |
*** ecerquei_ has joined #tripleo | 16:48 | |
*** aditya_r has quit IRC | 16:48 | |
*** aditya_r has joined #tripleo | 16:48 | |
*** yamahata has joined #tripleo | 16:49 | |
*** ecerquei has quit IRC | 16:50 | |
beagles | EmilienM, mwhahaha: | 16:54 |
beagles | https://review.openstack.org/#/c/500798/ just passed CI , +A ? | 16:54 |
mwhahaha | yes | 16:54 |
EmilienM | how come it passed? | 16:55 |
EmilienM | but cool :D | 16:55 |
mwhahaha | management_ip_address | 16:55 |
EmilienM | I mean | 16:55 |
beagles | mwhahaha, it fixed the rabbitmq thing it was an instack-undercloud patch | 16:55 |
EmilienM | ah | 16:55 |
EmilienM | do we need backport to stable/pike? | 16:55 |
mwhahaha | i don't think so unless we bumped rabbitmq puppet module for pike | 16:55 |
mwhahaha | at which point we would | 16:56 |
EmilienM | yes I think we did, let me check | 16:56 |
EmilienM | yes we did | 16:56 |
mwhahaha | then yes | 16:56 |
*** gbarros has joined #tripleo | 16:56 | |
EmilienM | jaosorior: you confirm? | 16:56 |
EmilienM | mwhahaha: https://review.rdoproject.org/r/#/c/9234/ | 16:56 |
shardy | Tengu: you can deploy with either TLS to the haproxy or recently support was added for TLS everywhere (e.g also the internal network between services) | 16:56 |
EmilienM | I'm backporting | 16:56 |
shardy | https://docs.openstack.org/tripleo-docs/latest/install/advanced_deployment/ssl.html | 16:56 |
shardy | Tengu: ^^ has some more info, but jaosorior is the best contact for any questions | 16:56 |
openstackgerrit | Emilien Macchi proposed openstack/instack-undercloud stable/pike: Use integer for rabbitmq port and specify management IP https://review.openstack.org/500926 | 16:57 |
*** jpena is now known as jpena|off | 16:57 | |
EmilienM | beagles, mwhahaha: will need review on ^ as well | 16:58 |
Tengu | shardy: for now we won't set up internal TLS - for that one we're waiting for proper freeipa support in order to be able to request certificates signed with our internal CA | 16:58 |
*** hewbrocca is now known as hewbrocca_afk | 16:58 | |
beagles | EmilienM, ack | 16:59 |
Tengu | ah, I'll need to add our freeIPA ca in the process so that keystone can actually talk to it for the authentication. | 16:59 |
*** thrash|biab is now known as thrash | 17:00 | |
*** ykarel has quit IRC | 17:00 | |
*** derekh has quit IRC | 17:00 | |
*** shardy is now known as shardy_afk | 17:01 | |
*** gcerami has quit IRC | 17:01 | |
*** dtrainor_ has joined #tripleo | 17:03 | |
*** ebarrera has joined #tripleo | 17:03 | |
*** stendulker has quit IRC | 17:03 | |
*** ecerquei_ has quit IRC | 17:04 | |
*** aditya_r has quit IRC | 17:05 | |
*** aditya_r has joined #tripleo | 17:05 | |
*** dtrainor_ has quit IRC | 17:07 | |
*** dtrainor_ has joined #tripleo | 17:08 | |
*** achadha__ has joined #tripleo | 17:08 | |
*** ooolpbot has joined #tripleo | 17:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 17:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1713659 | 17:10 |
openstack | Launchpad bug 1713659 in tripleo "Rabbitmq class expects ports as integers, not strings" [Critical,In progress] - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles) | 17:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 17:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 17:10 |
*** ooolpbot has quit IRC | 17:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 17:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 17:10 |
*** thrash is now known as thrash|bbl | 17:10 | |
*** ebarrera has quit IRC | 17:12 | |
*** tosky has quit IRC | 17:12 | |
*** achadha__ has quit IRC | 17:19 | |
*** ramishra has quit IRC | 17:19 | |
*** achadha has joined #tripleo | 17:20 | |
*** dtrainor_ has quit IRC | 17:20 | |
*** hamzy has joined #tripleo | 17:21 | |
*** tesseract has quit IRC | 17:25 | |
jaosorior | Tengu: right; only haproxy | 17:25 |
*** rcernin has quit IRC | 17:25 | |
*** numans has quit IRC | 17:25 | |
Tengu | jaosorior: cool :). I think it will be almost easy, compared to all the other stuff I had to do in order to get a working overcloud :D | 17:26 |
*** achadha has quit IRC | 17:26 | |
*** numans has joined #tripleo | 17:28 | |
openstackgerrit | Tong Liu proposed openstack/tripleo-heat-templates master: Change all references of nsx_v3 to nsx. https://review.openstack.org/498143 | 17:30 |
openstackgerrit | Tong Liu proposed openstack/tripleo-heat-templates master: Change to boolean for boolean type params https://review.openstack.org/500934 | 17:30 |
*** achadha has joined #tripleo | 17:33 | |
EmilienM | sshnaidm: ping me when you have time | 17:34 |
*** oidgar has quit IRC | 17:34 | |
EmilienM | mandre: same ^ | 17:34 |
sshnaidm | EmilienM, here | 17:35 |
openstackgerrit | Lars Kellogg-Stedman proposed openstack/puppet-tripleo master: fluentd: support service_config_settings configuration mechanism https://review.openstack.org/500935 | 17:35 |
EmilienM | pradk: have you seen https://review.openstack.org/#/c/500250/ ? | 17:35 |
EmilienM | pradk: sounds like telemetry services aren't containerized in https://github.com/openstack/tripleo-heat-templates/blob/master/ci/environments/scenario001-multinode-containers.yaml | 17:36 |
*** dsneddon has quit IRC | 17:36 | |
pradk | looking EmilienM | 17:36 |
EmilienM | thx | 17:37 |
EmilienM | pradk: jd said it's because ceilo (probably other telemetry services) aren't containerized | 17:37 |
pradk | EmilienM, telemetry services are containerized of course | 17:37 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Add support for Dell EMC VMAX Manila Backend https://review.openstack.org/499199 | 17:37 |
pradk | he said wrong :) | 17:37 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: Add support for Dell EMC Isilon Manila backend https://review.openstack.org/499195 | 17:37 |
EmilienM | pradk: ahah | 17:38 |
*** liverpooler has quit IRC | 17:38 | |
pradk | EmilienM, i'll investigate why ci is failing | 17:38 |
*** dmsimard|afk is now known as dmsimard | 17:38 | |
EmilienM | pradk: thank you, feel free to file a bug or send patches | 17:38 |
sshnaidm | EmilienM, so, why do we need both current-tripleo and current in upgrade jobs..? | 17:39 |
*** achadha has quit IRC | 17:40 | |
*** achadha has joined #tripleo | 17:41 | |
*** jfrancoa has quit IRC | 17:41 | |
*** pkovar has quit IRC | 17:41 | |
EmilienM | sshnaidm: I think we need one, right? | 17:41 |
pradk | EmilienM, so the deployment is actually successful.. i see stack created http://logs.openstack.org/50/500250/2/check/gate-tripleo-ci-centos-7-scenario001-multinode-oooq-container/bf0b1b0/logs/postci.txt.gz | 17:42 |
EmilienM | sshnaidm: for tripleo packages we need current | 17:42 |
pradk | EmilienM, and telemetry service containers are up and running | 17:42 |
EmilienM | pradk: tempest is failing | 17:42 |
EmilienM | pradk: see tempest results | 17:42 |
sshnaidm | EmilienM, we use it in master, in stable branches only consistent | 17:42 |
EmilienM | pradk: http://logs.openstack.org/50/500250/2/check/gate-tripleo-ci-centos-7-scenario001-multinode-oooq-container/bf0b1b0/logs/tempest.html.gz | 17:42 |
EmilienM | sshnaidm: mhh ok | 17:42 |
pradk | EmilienM, right so we might have to tweak some timeout .. | 17:42 |
EmilienM | sshnaidm: ok let me show you something weird, give me a minute | 17:42 |
*** dsneddon has joined #tripleo | 17:44 | |
sshnaidm | EmilienM, lemme check too if I ain't wrong | 17:44 |
*** trown is now known as trown|lunch | 17:45 | |
EmilienM | sshnaidm: look http://logs.openstack.org/80/500580/3/check/gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv/f0bd3bb/logs/ | 17:45 |
EmilienM | sshnaidm: it's an upgrade job that deploys Ocata on BM and upgrade to pike in container (it's a CI job in stable/pike branch) | 17:45 |
EmilienM | sshnaidm: so we would expect THT to be the latest commit in stable/pike right? | 17:45 |
sshnaidm | EmilienM, yep | 17:46 |
EmilienM | now look http://logs.openstack.org/80/500580/3/check/gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv/f0bd3bb/logs/rpm-qa.txt.gz | 17:46 |
EmilienM | openstack-tripleo-heat-templates-7.0.0-0.20170902232701.6a80731.el7.centos.noarch | 17:46 |
EmilienM | it's a commit from 3 days ago | 17:46 |
EmilienM | mwhahaha, jaosorior : I'm removing the alert on https://bugs.launchpad.net/tripleo/+bug/1713659 since fixes are in gate | 17:47 |
openstack | Launchpad bug 1713659 in tripleo "Rabbitmq class expects ports as integers, not strings" [Critical,In progress] - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles) | 17:47 |
mwhahaha | k | 17:47 |
openstackgerrit | Steven Hardy proposed openstack/tripleo-heat-templates master: DNM - WIP testing OVB job speed improvements https://review.openstack.org/500942 | 17:49 |
EmilienM | sshnaidm: other topic: are we still blocked in the promotion pipeline? I haven't checked the logs yet but I'm going to | 17:53 |
EmilienM | ok I see it failing: http://status.openstack.org/openstack-health/#/g/build_name/periodic-tripleo-ci-centos-7-scenario001-multinode-oooq | 17:54 |
sshnaidm | EmilienM, no, today we built nova and tomorrow we can run promotion jobs finally with new dlrn hash | 17:55 |
EmilienM | pradk: could you take a look also? | 17:55 |
EmilienM | pradk: ah nevermind | 17:55 |
pradk | ok | 17:55 |
EmilienM | sshnaidm: ok, let me know. I'll be away tomorrow and Thursday (for work but away) - if it still fails, please ping pradk to investigate why - and OK for ignoring autoscaling test in the meantime | 17:56 |
*** numans has quit IRC | 17:56 | |
EmilienM | sshnaidm: but if you do that, please do it only on master (not on pike) | 17:56 |
*** tosky has joined #tripleo | 17:56 | |
sshnaidm | EmilienM, I'll keep an eye | 17:56 |
EmilienM | sshnaidm: thank you | 17:56 |
EmilienM | sshnaidm: so have you seen my notes? it's weird right? | 17:57 |
*** thrash|bbl is now known as thrash | 17:57 | |
*** shardy_afk is now known as shardy | 17:57 | |
sshnaidm | EmilienM, still looking there | 17:57 |
sshnaidm | EmilienM, can you please point me to what exactly happens there? | 17:58 |
*** numans has joined #tripleo | 17:58 | |
*** sri_ has quit IRC | 17:58 | |
sshnaidm | EmilienM, we have there https://trunk.rdoproject.org/centos7-pike/current-tripleo/delorean.repo and it should contain the latest IMHO | 17:59 |
openstackgerrit | Tong Liu proposed openstack/tripleo-heat-templates stable/pike: Change all references of nsx_v3 to nsx. https://review.openstack.org/500946 | 18:01 |
shardy | EmilienM: Should we consider merging https://review.openstack.org/#/c/485732/, https://review.openstack.org/#/c/450708 and https://review.openstack.org/#/c/450709 ? | 18:02 |
shardy | The last patch actually passed all CI, but the patch ahead of it timed out on the OVB HA job | 18:03 |
shardy | the first one is on the critical path for minor update support | 18:03 |
shardy | I've been holding off rebasing the final https://review.openstack.org/#/q/topic:bug/1635409+status:open patch as I'd like to avoid rebasing those two mostly green patches again | 18:04 |
shardy | I'll continue on the ovb speed investigations anyway, but I'm not sure we should block those any longer | 18:05 |
*** sshnaidm is now known as sshnaidm|afk | 18:05 | |
*** karthiks has quit IRC | 18:06 | |
EmilienM | shardy: ok for me | 18:06 |
*** numans has quit IRC | 18:07 | |
shardy | sshnaidm|afk: isn't current-tripleo the pin from the last promotion, and "current" is the latest? | 18:07 |
shardy | I know we whitelist current for most tripleo things in CI tho | 18:07 |
shardy | E.g if you look at https://trunk.rdoproject.org/centos7-pike/current/ it is stable/pike head | 18:08 |
*** ooolpbot has joined #tripleo | 18:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 18:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 18:10 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 18:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 18:10 |
*** ooolpbot has quit IRC | 18:10 | |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 18:10 |
pradk | EmilienM, is there a way to check the etc config file within the containers in ci? | 18:11 |
pradk | seems like it can get data from panko | 18:11 |
pradk | cant* | 18:12 |
EmilienM | pradk: yes, a sec | 18:14 |
*** shardy is now known as shardy_afk | 18:14 | |
EmilienM | pradk: http://logs.openstack.org/50/500250/2/check/gate-tripleo-ci-centos-7-scenario001-multinode-oooq-container/bf0b1b0/logs/subnode-2/var/log/config-data/panko/etc/panko/ | 18:15 |
EmilienM | and all in http://logs.openstack.org/50/500250/2/check/gate-tripleo-ci-centos-7-scenario001-multinode-oooq-container/bf0b1b0/logs/subnode-2/var/log/config-data/ | 18:15 |
EmilienM | that's where we write config and mount into /etc in the container, iiue | 18:15 |
EmilienM | iiuc* | 18:15 |
*** cschwede_ has quit IRC | 18:16 | |
openstackgerrit | Oliver Walsh proposed openstack/tripleo-common master: Add selinux policy rpms to nova_libvirt container image https://review.openstack.org/500951 | 18:17 |
EmilienM | pradk: also FYI https://bugs.launchpad.net/tripleo/+bug/1715031 | 18:18 |
openstack | Launchpad bug 1715031 in tripleo "Mongodb is default disabled on baremetal, enabled on containers" [High,In progress] - Assigned to Steve Baker (steve-stevebaker) | 18:18 |
pradk | EmilienM, cool, thx | 18:22 |
openstackgerrit | Tong Liu proposed openstack/tripleo-heat-templates master: Change to boolean for boolean type params https://review.openstack.org/500934 | 18:22 |
*** karthiks has joined #tripleo | 18:22 | |
EmilienM | mwhahaha: do you want me to move all bugs to queens-1 and we manually reschedule the ones for pike-rc2? | 18:25 |
*** numans has joined #tripleo | 18:27 | |
EmilienM | mwhahaha: also, I saw https://bugs.launchpad.net/tripleo/+bug/1715134 which is something you did for instack-undercloud but not THT, fyi | 18:27 |
openstack | Launchpad bug 1715134 in tripleo "Allow to increase docker daemon verbosity" [High,Triaged] | 18:27 |
EmilienM | dprince: could we have someone working on https://bugs.launchpad.net/tripleo/+bug/1715136 ? looks pretty good to have imho | 18:28 |
openstack | Launchpad bug 1715136 in tripleo "docker/docker-puppet.py should retry the pull" [High,Triaged] | 18:28 |
*** nyechiel has joined #tripleo | 18:29 | |
*** achadha_ has joined #tripleo | 18:30 | |
openstackgerrit | Oliver Walsh proposed openstack/tripleo-heat-templates master: Enable selinux within the nova_libvirt container https://review.openstack.org/500952 | 18:31 |
openstackgerrit | Tong Liu proposed openstack/tripleo-heat-templates stable/pike: Add DhcpAgentNotification param to neutron base https://review.openstack.org/500794 | 18:32 |
*** aditya_ra has joined #tripleo | 18:32 | |
*** aditya_r has quit IRC | 18:32 | |
*** achadha has quit IRC | 18:33 | |
owalsh | EmilienM: hola... looks like selinux is not enabled in *any* container | 18:33 |
EmilienM | owalsh: that sucks :) | 18:33 |
owalsh | EmilienM: not sure if it would cause any issues enabling it everywhere. Might "just work" | 18:34 |
*** achadha_ has quit IRC | 18:34 | |
EmilienM | owalsh: i'm pretty sure we'll hit more bugs if we don't set it up now | 18:34 |
*** tongl has joined #tripleo | 18:35 | |
owalsh | EmilienM: ack. dprince FYI ^^^ I though I saw a comment somewhere re disabling selinux for containers but I can't find it now. Does it sound familiar? | 18:36 |
owalsh | s/though/thought/ | 18:37 |
EmilienM | jrist: see my last comment on https://review.openstack.org/#/c/469608/ | 18:38 |
EmilienM | jrist: no worries if we miss the deadline, this one can be backported imho | 18:39 |
*** lblanchard1 has joined #tripleo | 18:42 | |
*** lblanchard has quit IRC | 18:42 | |
itlinux | hello all.. my gnocchi does not publish any data.. what should I look at? Ocata version TY | 18:45 |
*** pcaruana has quit IRC | 18:49 | |
openstackgerrit | Pradeep Kilambi proposed openstack/tripleo-heat-templates master: Update panko port in env ssl yaml files to correct one https://review.openstack.org/500957 | 18:51 |
*** agurenko has quit IRC | 18:52 | |
pabelanger | o/ | 18:53 |
EmilienM | pabelanger: you made it? | 18:54 |
*** aditya_ra has quit IRC | 18:56 | |
pabelanger | Yup | 18:56 |
*** aditya_r has joined #tripleo | 18:56 | |
pabelanger | EmilienM: anybody looking at gate failures? | 18:57 |
EmilienM | pabelanger: nothing active on https://bugs.launchpad.net/tripleo/+bug/1714361 right now but shardy_afk is your point of contact | 18:58 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 18:58 |
pabelanger | k bug 1713659 what I see | 18:58 |
openstack | bug 1713659 in tripleo "Rabbitmq class expects ports as integers, not strings" [Critical,In progress] https://launchpad.net/bugs/1713659 - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles) | 18:58 |
EmilienM | pabelanger: https://bugs.launchpad.net/tripleo/+bug/1714905 isn't gate but upgrades for now | 18:58 |
mwhahaha | pabelanger: do we have a good way of tracking failures in just the gate? | 18:58 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 18:58 |
EmilienM | mwhahaha: status.openstack.org/openstack-health/#/ | 18:58 |
EmilienM | I use it a lot (with RSS) | 18:58 |
pabelanger | mwhahaha: logstash will tell you too | 18:58 |
mwhahaha | i have too many tabs open already :D | 18:59 |
EmilienM | mwhahaha: I have a link, useful (feel free to bookmark) http://status.openstack.org/openstack-health/#/?groupKey=build_name&searchProject=gate-tripleo-ci-centos-7&duration=PT12H&resolutionKey=day | 19:00 |
openstackgerrit | Pradeep Kilambi proposed openstack/tripleo-heat-templates master: Run gnocchi statsd and metrcd at step 5 https://review.openstack.org/498135 | 19:00 |
EmilienM | but it tells you what failed in gate the last 12 hours | 19:00 |
mwhahaha | thanks | 19:00 |
EmilienM | note there is no OVB in there because OVB isn't in gate | 19:01 |
mwhahaha | seems like something a specific team should be responsible for | 19:01 |
* mwhahaha takes notes | 19:01 | |
EmilienM | but it still reflects a lot of the failures we had in the last 12 hours | 19:01 |
EmilienM | mwhahaha: the bar graph on the right is a good indicator of what is high prio to look at | 19:01 |
EmilienM | for ex, now it's gate-tripleo-ci-centos-7-undercloud-oooq obviously | 19:02 |
EmilienM | and I'm pretty sure if we look in "why", it's the rabbit thing | 19:02 |
EmilienM | http://logs.openstack.org/74/500674/1/gate/gate-tripleo-ci-centos-7-undercloud-oooq/e7ce0bf/logs/undercloud/home/jenkins/undercloud_install.log.txt.gz#_2017-09-05_18_57_33 | 19:02 |
EmilienM | bingo | 19:02 |
EmilienM | so openstack-health doesn't tell you WHAT fails but gives a kind of weather report on how things look | 19:02 |
EmilienM | while elastic-recheck tells you what failed | 19:03 |
EmilienM | I think we should use more openstack-health for our gate to feed elastic-recheck | 19:03 |
*** aditya_ra has joined #tripleo | 19:05 | |
EmilienM | pabelanger: I'm pretty sure you have more techniques, by just using elastic-recheck? | 19:07 |
EmilienM | gate-tripleo-ci-centos-7-scenario004-multinode-oooq-container is also in bad shape | 19:07 |
EmilienM | I saw a lot of timeouts lately | 19:07 |
EmilienM | I think it's https://bugs.launchpad.net/bugs/1715029 | 19:08 |
openstack | Launchpad bug 1715029 in tripleo "[overcloud.AllNodesDeploySteps.ControllerDeployment_Step3]: CREATE_FAILED Resource CREATE failed: Operation cancelled" [Critical,Triaged] | 19:08 |
pabelanger | EmilienM: ya, we talk more about it at PTG | 19:08 |
*** aditya_r has quit IRC | 19:08 | |
pabelanger | but mostly just use elastic-recheck, status.o.o/zuul and logstash | 19:08 |
EmilienM | pabelanger: i also found openstack-health useful, tbh | 19:09 |
EmilienM | pabelanger: the rss feature is quite cool | 19:09 |
pabelanger | EmilienM: ya, last I looked pingtest was 100% on some jobs, but haven't looked in a few weeks | 19:09 |
*** ooolpbot has joined #tripleo | 19:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 19:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 19:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 19:10 |
*** ooolpbot has quit IRC | 19:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 19:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 19:10 |
EmilienM | pabelanger: we enabled tempest now on ALL multinode jobs | 19:11 |
EmilienM | pabelanger: no pingtest anymore | 19:11 |
*** zidolar has quit IRC | 19:11 | |
pabelanger | EmilienM: k | 19:11 |
pabelanger | EmilienM: http://status.openstack.org/openstack-health/#/job/gate-tripleo-ci-centos-7-scenario004-multinode-oooq-container | 19:12 |
pabelanger | pingtest 100% failure, | 19:12 |
pabelanger | EmilienM: last I looked, it was because of https://review.openstack.org/495517/ | 19:12 |
*** gbarros has quit IRC | 19:12 | |
*** eck` is now known as eck`gone | 19:13 | |
dprince | EmilienM: yes, I will take 1715136 | 19:13 |
*** aditya_ra has quit IRC | 19:14 | |
openstackgerrit | Steven Hardy proposed openstack/tripleo-heat-templates master: DNM - WIP testing OVB job speed improvements https://review.openstack.org/500942 | 19:15 |
gfidente | EmilienM I think you got this passing https://bugs.launchpad.net/tripleo/+bug/1714310 ? | 19:15 |
openstack | Launchpad bug 1714310 in tripleo "scenario004 failing to create the Manila share in pingtest" [High,In progress] - Assigned to Jan Provaznik (jan-provaznik) | 19:15 |
*** gbarros has joined #tripleo | 19:16 | |
EmilienM | gfidente: yes, but it's timing out | 19:16 |
EmilienM | dprince: thx | 19:16 |
*** achadha has joined #tripleo | 19:17 | |
*** gbarros has quit IRC | 19:18 | |
*** hexo_ has joined #tripleo | 19:20 | |
*** dtrainor has joined #tripleo | 19:24 | |
*** gbarros has joined #tripleo | 19:25 | |
*** jtomasek has quit IRC | 19:26 | |
*** itlinux has quit IRC | 19:35 | |
*** ioggstream has joined #tripleo | 19:38 | |
*** eck`gone is now known as eck` | 19:41 | |
*** trown|lunch is now known as trown | 19:42 | |
openstackgerrit | Lee Yarwood proposed openstack/tripleo-specs master: WIP - Introduce skip level upgrades https://review.openstack.org/497257 | 19:43 |
*** dtrainor has quit IRC | 19:45 | |
*** dtrainor has joined #tripleo | 19:46 | |
*** nyechiel has quit IRC | 19:46 | |
pabelanger | mwhahaha: EmilienM: 500926 still need +3 | 19:48 |
mwhahaha | k | 19:49 |
dprince | owalsh: is this what you are after http://git.openstack.org/cgit/openstack/puppet-tripleo/tree/manifests/profile/base/docker.pp#n34 | 19:54 |
dprince | owalsh: we do this in Centos 7.3 as that version of docker has an older overlayfs2 driver that doesn't support selinux | 19:54 |
dprince | owalsh: w/ RHEL 7.4's docker it is newer and has patches to support it so we can enable it there | 19:55 |
dprince | owalsh: still a TODO is to try the docker-current package and see what happens there | 19:55 |
*** dtrainor has quit IRC | 19:56 | |
*** jprovazn has quit IRC | 19:59 | |
*** abishop has quit IRC | 20:10 | |
*** ooolpbot has joined #tripleo | 20:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 20:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 20:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 20:10 |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 20:10 |
*** ooolpbot has quit IRC | 20:10 | |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 20:10 |
*** dtrainor has joined #tripleo | 20:10 | |
*** rhallisey has quit IRC | 20:10 | |
*** toure|gone is now known as toure | 20:12 | |
openstackgerrit | Martin Kopec proposed openstack/tripleo-quickstart-extras master: Allow removing of options from tempest conf https://review.openstack.org/477079 | 20:13 |
owalsh | dprince: ah, yea that was it. Doesn't affect selinux within the container IIUC | 20:15 |
*** lblanchard1 has quit IRC | 20:15 | |
*** rlandy is now known as rlandy|brb | 20:17 | |
owalsh | dprince: which I need for https://launchpad.net/bugs/1715171 | 20:17 |
openstack | Launchpad bug 1715171 in tripleo "Live-migration fails from baremetal to containerized compute, selinux disabled in nova_libvirt container" [High,In progress] - Assigned to Oliver Walsh (owalsh) | 20:17 |
*** ebarrera has joined #tripleo | 20:18 | |
owalsh | dprince: any info on the overlayfs2 issues? https://review.openstack.org/500952 works for me on Centos 7.3.1611 | 20:20 |
dprince | owalsh: none other than what is already linked into puppet-tripleo profile for docker I linked above | 20:22 |
*** dtrainor has quit IRC | 20:24 | |
owalsh | dprince: hmm, ok. Not sure --selinux-enabled does what we think it does https://www.projectatomic.io/blog/2016/07/docker-selinux-flag/ | 20:24 |
*** shardy_afk has quit IRC | 20:25 | |
*** dtrainor has joined #tripleo | 20:25 | |
*** jkilpatr_ has quit IRC | 20:25 | |
owalsh | dprince: I'm deploying with openstack-selinux in the base image & /sys/fs/selinux in containers-common.yaml, see if anything obvious breaks | 20:29 |
dprince | owalsh: does adding --selinux-enabled to the docker daemon do the same thing as what you are adding with the custom volume? | 20:31 |
*** rlandy|brb is now known as rlandy | 20:32 | |
larsks | trown: Is "Evaluation Error: Error while evaluating a Function Call, Class[Rabbitmq]: parameter 'port' expects an Integer value, got String" the rabbitmq issue that came up at the meeting this morning? | 20:33 |
owalsh | dprince: doesn't sound like it does, but I'll check when the deploy finishes | 20:33 |
trown | larsks: indeed | 20:33 |
larsks | trown: okay, thanks. Where is that sort of thing logged so that I can tell without asking if I'm hitting a common error? | 20:34 |
larsks | I thought logstash.openstack.org but I guess it doesn't make it there... | 20:35 |
trown | larsks: just the bug really https://bugs.launchpad.net/tripleo/+bug/1713659 | 20:36 |
openstack | Launchpad bug 1713659 in tripleo "Rabbitmq class expects ports as integers, not strings" [Critical,In progress] - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles) | 20:36 |
*** artom has quit IRC | 20:36 | |
larsks | trown: ah, got it. | 20:36 |
openstackgerrit | David Sariel proposed openstack/tripleo-validations master: Add containers_sanity_check https://review.openstack.org/483403 | 20:40 |
*** jkilpatr has joined #tripleo | 20:41 | |
*** ansmith has quit IRC | 20:43 | |
*** rcernin has joined #tripleo | 20:44 | |
*** dtrainor has quit IRC | 20:44 | |
*** pchavva has quit IRC | 20:45 | |
*** dtrainor has joined #tripleo | 20:46 | |
*** gbarros has quit IRC | 20:46 | |
tbarron | EmilienM: just raised and triaged https://bugs.launchpad.net/tripleo/+bug/1715238 | 20:51 |
openstack | Launchpad bug 1715238 in tripleo "upgrade failure due to missing nat rule after undercloud upgrade" [High,Triaged] | 20:51 |
tbarron | EmilienM: for queens1 but since I also see it OtoP I want to make sure someone better informed than me | 20:51 |
tbarron | EmilienM: takes a look in case it might impact gate, I dunno enough about the latter to have a judgment | 20:52 |
tbarron | EmilienM: I mean, since I also see it NtoO ! | 20:52 |
*** dtrainor has quit IRC | 20:56 | |
*** oneswig has joined #tripleo | 20:56 | |
*** dtrainor has joined #tripleo | 20:56 | |
*** trown is now known as trown|outtypewww | 20:59 | |
*** florianf has quit IRC | 21:00 | |
*** noslzzp has quit IRC | 21:04 | |
*** nyechiel has joined #tripleo | 21:06 | |
*** noslzzp has joined #tripleo | 21:07 | |
*** ooolpbot has joined #tripleo | 21:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 21:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 21:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 21:10 |
*** ooolpbot has quit IRC | 21:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 21:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 21:10 |
*** leitan has quit IRC | 21:11 | |
*** itlinux has joined #tripleo | 21:13 | |
*** nyechiel has quit IRC | 21:16 | |
EmilienM | tbarron: ok let me look! | 21:17 |
EmilienM | curl: (6) Could not resolve host: trunk.rdoproject.org; Unknown error | 21:17 |
EmilienM | tbarron: do we hit that in CI? | 21:17 |
tbarron | EmilienM: I'm kinda dumb about CI but willing to learn. | 21:18 |
EmilienM | tbarron: how did you find this problem? by testing locally I guess? | 21:18 |
EmilienM | (if I read your bug report, yes I guess) | 21:19 |
tbarron | EmilienM: yes, I was trying to reproduce an LP bug that I was assigned, on OtoP | 21:19 |
bnemec | tbarron: The rule should not be added to BOOTSTACK_MASQ. | 21:19 |
EmilienM | ok | 21:19 |
bnemec | That gets overwritten by the undercloud every time the install/upgrade is run. | 21:19 |
tbarron | EmilienM: hit it there, then repro-ed NtoO, not expecting to | 21:19 |
*** colonwq has quit IRC | 21:20 | |
bnemec | It should get added directly to POSTROUTING to be persistent: sudo iptables -A POSTROUTING -s 10.0.0.0/24 ! -d 10.0.0.0/24 -j MASQUERADE -t nat | 21:20 |
tbarron | bnemec: so should that rule have been in POSTROUTING? | 21:20 |
tbarron | bnemec: jinx | 21:20 |
bnemec | Heh, yes. | 21:20 |
*** ebarrera has quit IRC | 21:21 | |
tbarron | bnemec: but I think I see it in /etc/sysconfig/iptables.save | 21:21 |
tbarron | bnemec: not an opinion on my part, just an observation | 21:21 |
bnemec | If it's added to BOOTSTACK_MASQ then that's kind of like editing a puppet-managed file - the next time the undercloud install happens the changes will be wiped out. | 21:21 |
*** bfournie has quit IRC | 21:22 | |
bnemec | tbarron: The undercloud install clears and re-creates that chain every time, so I don't think it matters. | 21:22 |
tbarron | bnemec: EmilienM so I haven't tracked down where these nat iptables updates are happening, color me naive | 21:22 |
bnemec | I'm assuming this one comes from quickstart. | 21:22 |
tbarron | bnemec: EmilienM probably, and may not be a big deal then | 21:23 |
bnemec | We've intentionally not added it to instack-undercloud because it's not a production-ready configuration. | 21:23 |
bnemec | Nobody should be routing their public network traffic through the undercloud in production. | 21:23 |
tbarron | bnemec: ok, so impact is on developers only right now? | 21:23 |
tbarron | bnemec: and not on CI either? | 21:23 |
bnemec | tbarron: Should be. | 21:23 |
bnemec | Our upgrade ci may route directly out from the overcloud node. | 21:24 |
tbarron | bnemec: then Queens1 is the right target for triage I think. | 21:24 |
bnemec | I'm not as familiar with how that networking is set up though. | 21:24 |
tbarron | EmilienM: bnemec I just wanted to check b/c I know there are upgrade problems for gate and I didn't want to put this one under a bushel if by chance it was relevant | 21:25 |
itlinux | hello pradk: are you around? | 21:26 |
pradk | itlinux, yea whats up | 21:26 |
itlinux | quick question as I was talking to mwhahaha.. my gnocchi does not show any data.. any tips? | 21:27 |
pradk | itlinux, hmm is ceilo configured to send data to gnocchi | 21:27 |
itlinux | I do have ceilometer | 21:28 |
itlinux | here is what I can see.. | 21:28 |
stevebaker | morning | 21:28 |
pradk | itlinux, does your pipeline have gnocchi:// as one of the publishers | 21:28 |
itlinux | http://paste.openstack.org/show/620442/ | 21:29 |
itlinux | http://paste.openstack.org/show/620443/ | 21:30 |
itlinux | this show the compute nodes | 21:30 |
itlinux | node | 21:30 |
pradk | itlinux, hmm ok, so you do have resources | 21:31 |
pradk | itlinux, do you see any errors in ceilometer agent logs when data is dispatched to gnocchi? | 21:31 |
itlinux | well i have the vm running | 21:32 |
itlinux | http://paste.openstack.org/show/620444/ | 21:32 |
itlinux | show metric | 21:32 |
pradk | itlinux, so you do have data in gnocchi .. why do you think gnocchi does not show any data then? | 21:33 |
itlinux | I want to see my cpu usage since I want to set up autoscaling, but there is no data I can retrieve.. can you guide me on that.. TY | 21:34 |
itlinux | I may be missing a step. | 21:34 |
pradk | itlinux, whats your vm running? | 21:37 |
pradk | cirros? | 21:37 |
itlinux | this is cirros but I tried ubuntu | 21:37 |
EmilienM | mwhahaha: looking at zuul, 500798 will merge in ~35 min - if you can please recheck patches we need for pike-rc2 | 21:37 |
EmilienM | (I might be offline at this time) | 21:38 |
itlinux | as well | 21:38 |
mwhahaha | EmilienM: yea i'll recheck later tonight if i'm not here as well | 21:38 |
EmilienM | stevebaker: good morning sir | 21:38 |
EmilienM | mwhahaha: lgtm | 21:38 |
*** ansmith has joined #tripleo | 21:38 | |
jrist | EmilienM: ok | 21:40 |
jrist | EmilienM: thanks | 21:40 |
*** akrivoka has quit IRC | 21:42 | |
*** catintheroof has quit IRC | 21:43 | |
pradk | itlinux, hmm so with cirros i think some metrics won't be gathered .. so some missing metrics are expected | 21:44 |
pradk | itlinux, but if you're seeing the same with ubuntu then i guess thats not the issue here | 21:44 |
itlinux | yes same in ubuntu.. | 21:44 |
pradk | itlinux, do you see cpu_util resource type in gnocchi resources.yaml? | 21:44 |
itlinux | do you have an image I can check | 21:44 |
itlinux | what's the best way to check the resource.yaml.. | 21:45 |
itlinux | I can do resource show and I see cpu_util | 21:45 |
pradk | itlinux, it should be in /etc/ceilometer/ | 21:45 |
itlinux | ok let me check | 21:45 |
itlinux | I do not have resources.yaml in mine | 21:46 |
itlinux | but gnocchi has it.. | 21:46 |
itlinux | http://paste.openstack.org/show/620445/ | 21:46 |
pradk | itlinux, yea that should do it | 21:47 |
pradk | itlinux, check if you have any issues in agent logs | 21:48 |
pradk | errors | 21:48 |
EmilienM | just a heads-up about upgrade jobs : | 21:48 |
EmilienM | we have some successful jobs on stable/pike, here's an example: http://logs.openstack.org/94/500794/2/check/gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv/bdc7c53/console.html | 21:49 |
itlinux | which folder.. I have ceilometer and gnocchi.. | 21:49 |
*** Goneri has quit IRC | 21:49 | |
EmilienM | I checked, and it deployed a baremetal ocata and upgraded to a containerized pike, pingtest ran successfully | 21:49 |
EmilienM | now I'm investigating why scenario upgrades are not working | 21:49 |
itlinux | ceilometer is stuck at the bottom and without much data | 21:49 |
itlinux | and I do not see any agent logs on the controllers /var/log/gnocchi/ | 21:50 |
pradk | itlinux, agent logs would be in /var/log/ceilometer | 21:50 |
itlinux | http://paste.openstack.org/show/620446/ | 21:51 |
EmilienM | also, I found out that upgrade jobs don't use mirrors: http://logs.openstack.org/94/500794/2/check/gate-tripleo-ci-centos-7-containers-multinode-upgrades-nv/bdc7c53/logs/undercloud/etc/yum.repos.d/delorean-current.repo.txt.gz | 21:51 |
*** rcernin has quit IRC | 21:53 | |
pradk | itlinux, hmm so you're not getting any metrics or just not from compute nodes? | 21:56 |
itlinux | do not see any metrics.. | 21:57 |
itlinux | what should I check to see them? | 21:57 |
pradk | metric list should show if you have any and then you can dig into if you're gathering measures/samples for that metric | 21:57 |
itlinux | I cannot see any data on any resources.. | 21:58 |
pradk | itlinux, if its just the compute, i would double check if the compute agent is running on the compute nodes | 21:58 |
itlinux | I can dig in but the data is always zero | 21:58 |
itlinux | ok let me check | 21:59 |
openstackgerrit | Ian Wienand proposed openstack/tripleo-common master: [DNM] Testing ansible setup of swap https://review.openstack.org/501006 | 21:59 |
*** numans has quit IRC | 21:59 | |
itlinux | what should I check on the compute nodes? | 22:01 |
itlinux | gnocchi folder is empty | 22:01 |
pradk | itlinux, check and make sure ceilometer-agent-compute is running.. you can try restarting | 22:02 |
itlinux | I see some errors, here is the link | 22:02 |
itlinux | http://paste.openstack.org/show/620449/ | 22:02 |
itlinux | http://paste.openstack.org/show/620450/ | 22:04 |
*** oneswig has quit IRC | 22:08 | |
pradk | itlinux, can you try restarting the ceilo-compute .. so it tries to force-poll the libvirt apis | 22:08 |
itlinux | ok one sec | 22:09 |
pradk | also | 22:09 |
itlinux | openstack-service restart | 22:09 |
pradk | whats the os_endpoint_type in ceilo.conf on compute | 22:09 |
itlinux | running this now.. | 22:09 |
openstackgerrit | Emilien Macchi proposed openstack/puppet-tripleo stable/ocata: firewall: generally accept "jump" param and use tripleo:firewall for log rule https://review.openstack.org/501013 | 22:09 |
itlinux | ok let me check | 22:09 |
openstackgerrit | Emilien Macchi proposed openstack/puppet-tripleo stable/newton: firewall: generally accept "jump" param and use tripleo:firewall for log rule https://review.openstack.org/501014 | 22:10 |
itlinux | are you looking for this interface=internalURL | 22:10 |
*** ooolpbot has joined #tripleo | 22:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 22:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 22:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714905 | 22:10 |
*** ooolpbot has quit IRC | 22:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 22:10 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 22:10 |
pradk | itlinux, yea if you're on newer its probably interface as endpoint might be deprecated .. that all looks ok | 22:13 |
*** jmelvin has quit IRC | 22:13 | |
itlinux | well this is the latest tripleo that passed ci | 22:13 |
itlinux | two weeks or so! | 22:13 |
itlinux | http://paste.openstack.org/show/620451/ | 22:14 |
itlinux | this is after the restart | 22:14 |
pradk | itlinux, is this pike? | 22:14 |
itlinux | ceilometer | 22:14 |
itlinux | ocata | 22:14 |
itlinux | sorry | 22:14 |
itlinux | typed too fast! | 22:15 |
pradk | itlinux, ok so you still have collector | 22:15 |
*** dparkes has joined #tripleo | 22:15 | |
pradk | can you check if any errors in /var/log/ceilo/collector.log | 22:15 |
itlinux | controller or compute | 22:15 |
itlinux | the compute does not have let me check controllers | 22:16 |
pradk | itlinux, controller | 22:16 |
EmilienM | mwhahaha: I'm removing alert on https://bugs.launchpad.net/tripleo/+bug/1714905 - I think we got enough eyes on it | 22:16 |
openstack | Launchpad bug 1714905 in tripleo "Composable scenarios BM -> Containers Upgrade jobs never deploy on BM" [Critical,In progress] - Assigned to Martin André (mandre) | 22:16 |
mwhahaha | k | 22:16 |
itlinux | so two controllers have errors one has hits.. | 22:16 |
itlinux | http://paste.openstack.org/show/620452/ | 22:16 |
itlinux | http://paste.openstack.org/show/620453/ controller1 | 22:17 |
itlinux | http://paste.openstack.org/show/620454/ controller2 | 22:17 |
pradk | itlinux, so that says ceilometer cannot talk to gnocchi and is getting a 503 | 22:18 |
itlinux | let me restart the services on controller1 | 22:18 |
itlinux | and then controller2 | 22:18 |
itlinux | one sec | 22:18 |
pradk | ok | 22:18 |
itlinux | ok controller1 came back with the same msg of controller0 | 22:20 |
*** dtrainor has quit IRC | 22:21 | |
itlinux | ok 3 came back with this now. | 22:22 |
itlinux | http://paste.openstack.org/show/620455/ | 22:22 |
itlinux | same as 1 and 0 | 22:22 |
*** dtrainor has joined #tripleo | 22:23 | |
pradk | itlinux, ok if its able to talk now.. metrics should appear in gnocchi within 10 mins or so | 22:23 |
itlinux | ok | 22:24 |
itlinux | sounds good | 22:24 |
itlinux | let me watch for it.. I will let you know TY | 22:24 |
pradk | ok cool.. i need to step out for a bit.. will check back later | 22:26 |
openstackgerrit | Oliver Walsh proposed openstack/tripleo-common master: Add selinux policy rpms to base container image https://review.openstack.org/500951 | 22:26 |
itlinux | thanks | 22:28 |
openstackgerrit | Merged openstack/instack-undercloud master: Use integer for rabbitmq port and specify management IP https://review.openstack.org/500798 | 22:38 |
*** numans has joined #tripleo | 22:39 | |
*** dprince has quit IRC | 22:42 | |
*** dmarlin has quit IRC | 22:43 | |
*** numans has quit IRC | 22:44 | |
EmilienM | merged ^ we can start doing recheck | 22:45 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-quickstart master: ocata2pike: add missing current-pike repo https://review.openstack.org/500671 | 22:47 |
openstackgerrit | Oliver Walsh proposed openstack/tripleo-heat-templates master: Enable selinux in containers https://review.openstack.org/500952 | 22:51 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates master: Containerized mongodb, disable by default, fix upgrade https://review.openstack.org/500646 | 22:52 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates master: Maintain ceph-osd package only on nodes hosting CephOSD service https://review.openstack.org/496921 | 22:55 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates master: Mount folders and log file https://review.openstack.org/500097 | 22:55 |
openstackgerrit | Emilien Macchi proposed openstack/puppet-tripleo stable/pike: Use TLS proxy for Redis' internal TLS https://review.openstack.org/499995 | 22:55 |
*** itlinux has quit IRC | 22:55 | |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates stable/pike: TLS proxy for redis https://review.openstack.org/499997 | 22:55 |
openstackgerrit | Ronelle Landy proposed openstack/tripleo-quickstart-extras master: Add the option to run the container-check script https://review.openstack.org/501028 | 23:01 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-quickstart master: scenario001-container: run autoscaling tests as well https://review.openstack.org/500250 | 23:03 |
*** dparkes has quit IRC | 23:05 | |
*** cdearborn has quit IRC | 23:05 | |
*** lbragstad has joined #tripleo | 23:07 | |
lbragstad | hey folks - i'm going through and updating projects that aren't affected by https://governance.openstack.org/tc/goals/queens/policy-in-code.html | 23:07 |
lbragstad | to the best of my knowledge, tripleo falls into that category | 23:07 |
lbragstad | if that is not the case, please feel free to ping me or leave a comment on https://review.openstack.org/#/c/501031/ | 23:08 |
lbragstad | thanks! | 23:08 |
*** ooolpbot has joined #tripleo | 23:10 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 23:10 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1714361 | 23:10 |
*** ooolpbot has quit IRC | 23:10 | |
openstack | Launchpad bug 1714361 in tripleo "mistral on gates seems old and does not have the required patchs" [Critical,Triaged] - Assigned to Adriano Petrich (apetrich) | 23:10 |
*** bfournie has joined #tripleo | 23:10 | |
*** bfournie has quit IRC | 23:12 | |
*** catintheroof has joined #tripleo | 23:26 | |
*** tosky has quit IRC | 23:29 | |
*** catintheroof has quit IRC | 23:31 | |
*** rlandy is now known as rlandy|bbl | 23:34 | |
openstackgerrit | Andreas Karis proposed openstack/tripleo-heat-templates master: Case insentitive MAC address matching in OsNetConfigMappings https://review.openstack.org/492244 | 23:42 |
*** dtrainor has quit IRC | 23:42 | |
*** dtrainor has joined #tripleo | 23:45 | |
*** tongl has quit IRC | 23:47 | |
*** artom has joined #tripleo | 23:56 |
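
The NAT discussion at 21:19-21:24 (bnemec's point that a rule added to BOOTSTACK_MASQ is wiped on every undercloud install/upgrade, while one appended directly to POSTROUTING persists) can be sketched as a small check-then-append helper. This is only an illustration under assumptions: the function names `iptables_argv` and `ensure_masquerade` are hypothetical, the subnet is the one from the command quoted in the log, and the helper relies on the standard `iptables -C` (check) and `-A` (append) options. It is not how quickstart or instack-undercloud manage the rule.

```python
import subprocess


def iptables_argv(action, subnet, chain="POSTROUTING"):
    """Build the argv for a nat-table MASQUERADE rule.

    action is "-A" (append) or "-C" (check whether the rule exists).
    The rule mirrors the one quoted in the log: masquerade traffic that
    leaves `subnet` for any destination outside `subnet`.
    """
    return [
        "iptables", "-t", "nat", action, chain,
        "-s", subnet, "!", "-d", subnet,
        "-j", "MASQUERADE",
    ]


def ensure_masquerade(subnet, run=subprocess.call):
    """Append the rule to POSTROUTING only if it is not already there.

    `run` is injectable so the logic can be exercised without root;
    by default it executes the real iptables binary.
    `iptables -C` exits 0 when the rule exists, non-zero otherwise.
    """
    if run(iptables_argv("-C", subnet)) != 0:
        return run(iptables_argv("-A", subnet))
    return 0


if __name__ == "__main__":
    # Print the command instead of running it; running needs root.
    print(" ".join(iptables_argv("-A", "10.0.0.0/24")))
```

Targeting POSTROUTING rather than BOOTSTACK_MASQ is the whole point of the sketch: per the log, the undercloud clears and re-creates BOOTSTACK_MASQ on each run, so anything placed there is lost. As bnemec also notes, routing public traffic through the undercloud is a developer convenience, not a production configuration.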
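
In the gnocchi thread (21:26-22:28), the decisive step was pradk reading the controller's collector log and spotting that ceilometer was getting a 503 from gnocchi. A rough sketch of that check is below. The log line format is not shown in the channel (it lives in the linked pastes), so the sample lines in the usage note are hypothetical, and `lines_with_503` and `scan_log` are made-up helper names; the sketch simply looks for a standalone `503` token on each line.

```python
import re

# Match 503 as a standalone token, so values like "15030" do not count.
STATUS_503 = re.compile(r"\b503\b")


def lines_with_503(lines):
    """Return the lines (newline stripped) that mention an HTTP 503."""
    return [ln.rstrip("\n") for ln in lines if STATUS_503.search(ln)]


def scan_log(path):
    """Scan a log file at `path` and return its 503 lines.

    The caller supplies the path; the log only says agent logs live
    under /var/log/ceilometer on the controller.
    """
    with open(path, errors="replace") as handle:
        return lines_with_503(handle)
```

For example, `lines_with_503(["ERROR dispatcher: 503 Service Unavailable", "INFO ok"])` keeps only the first (hypothetical) line. An empty result after restarting the services would be consistent with what itlinux saw once the controllers could talk to gnocchi again.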