Tuesday, 2019-01-29

*** macza has quit IRC00:20
*** DanyC has quit IRC00:23
*** sdake has quit IRC00:23
*** sdake has joined #openstack-ansible00:24
*** ansmith has joined #openstack-ansible00:34
*** cmart has quit IRC00:35
*** gyee has quit IRC00:51
*** ansmith has quit IRC01:03
*** ansmith has joined #openstack-ansible01:04
*** markvoelker has joined #openstack-ansible01:10
*** macza has joined #openstack-ansible01:10
*** markvoelker has quit IRC01:15
*** macza has quit IRC01:15
*** ThiagoCMC has joined #openstack-ansible01:25
ThiagoCMCJust a quick note... Looks like that new Rock OSA deployments with Cinder with Ceph is broken due to: https://bugs.launchpad.net/cinder/+bug/180615601:26
openstackLaunchpad bug 1806156 in Cinder "shared_targets_online_data_migration fails when cinder-volume service not running" [Undecided,Confirmed]01:26
ThiagoCMCMy cinder-volume is running and it failes anyway.01:26
*** sdake has quit IRC01:28
ThiagoCMCI'm just commenting out the Ansible block "Perform online database migrations", hope it's okay to not run it!01:28
ThiagoCMC=P01:28
*** ansmith has quit IRC01:32
*** tosky has quit IRC01:34
*** TxGirlGeek has quit IRC01:40
*** nurdie has joined #openstack-ansible02:16
*** nurdie has quit IRC02:21
*** cmart has joined #openstack-ansible02:21
*** sdake has joined #openstack-ansible02:31
*** sdake has quit IRC02:37
*** sdake has joined #openstack-ansible02:38
*** sdake has quit IRC02:58
*** sdake has joined #openstack-ansible02:59
*** bgmccollum has quit IRC03:00
*** bgmccollum has joined #openstack-ansible03:02
*** gkadam has joined #openstack-ansible03:11
*** sdake has quit IRC03:26
*** sdake has joined #openstack-ansible03:30
*** sdake has quit IRC03:42
*** udesale has joined #openstack-ansible04:02
*** macza has joined #openstack-ansible04:05
*** macza_ has joined #openstack-ansible04:07
*** macza has quit IRC04:09
*** macza_ has quit IRC04:11
*** jpward1981 has quit IRC04:11
*** macza has joined #openstack-ansible04:21
*** chkumar|out is now known as chandankumar04:22
*** macza has quit IRC04:25
*** cmart has quit IRC04:44
*** sdake has joined #openstack-ansible04:49
*** spsurya has joined #openstack-ansible05:05
*** shyamb has joined #openstack-ansible05:11
*** udesale has quit IRC05:29
*** nurdie has joined #openstack-ansible05:32
*** sdake has quit IRC05:41
chandankumarjrosser: odyssey4me \o/05:42
chandankumarjrosser: odyssey4me https://review.openstack.org/633655 will fix nova lxd tempest listing issue05:42
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_nova master: Use venv_packages_to_symlink to symlink to import libvirt-python  https://review.openstack.org/63347405:43
*** radeks_ has joined #openstack-ansible05:44
*** shyamb has quit IRC05:44
*** shyamb has joined #openstack-ansible05:48
*** sdake has joined #openstack-ansible05:49
*** udesale has joined #openstack-ansible05:51
*** hwoarang has quit IRC05:52
*** hwoarang has joined #openstack-ansible05:55
*** udesale has quit IRC05:59
*** udesale has joined #openstack-ansible06:00
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Update cirros from 3.5 to 3.6  https://review.openstack.org/63320806:14
*** markvoelker has joined #openstack-ansible06:20
*** markvoelker has quit IRC06:24
jrosserchandankumar: for the lxd patch, aren’t there other references to self.client...? That doesn’t look right?06:42
chandankumarjrosser: self.client is used at one place only06:43
chandankumarjrosser: let me pass the file url06:45
chandankumarjrosser: sorry correct, I think I need to fix more stuff06:47
*** shyamb has quit IRC06:52
*** shyamb has joined #openstack-ansible06:55
*** arbrandes has joined #openstack-ansible07:01
*** radeks_ has quit IRC07:01
*** arbrandes1 has quit IRC07:04
*** faizy98 has joined #openstack-ansible07:05
jrosserchandankumar: I spoke with nova-lxd yesterday (check yesterday morning #osa eavesdrop) and it looks like that tempest test duplicates low level tests which should only be done in nova-lxd gate, not a tempest plugin07:07
jrosserIt is not right for tempest to need to contact the compute host lxd daemon07:09
jrosserIMHO the right thing to do here is to disable the nova-lxd tempest plugin, if that is possible07:10
*** udesale has quit IRC07:10
*** jawad_axd has joined #openstack-ansible07:15
*** udesale has joined #openstack-ansible07:18
chandankumarjrosser: only way to do that to remove nova-lxd tempest plugin from there since it is not used then07:18
chandankumarjrosser: or let me find another way to disable tempest plugin07:21
jrosserCan we make a list of excluded plugins? That would be neat, then it can be done on a case by case basis07:21
*** udesale has quit IRC07:22
*** udesale has joined #openstack-ansible07:26
*** shyamb has quit IRC07:38
chandankumarjrosser: I think last year I have added a gate on tempest side to filter out broken plugin http://logs.openstack.org/98/631998/1/check/tempest-tox-plugin-sanity-check/d7bfeb3/07:45
chandankumarjrosser: But I didnot get a chance to cleanup its code07:46
chandankumarjrosser: will i remove the tempest plugin entry point from nova-lxd?07:46
chandankumarjrosser: it will be not discovered by tempest then07:47
chandankumarjrosser: just this part https://github.com/openstack/nova-lxd/blob/master/setup.cfg#L2607:47
jrosserPerhaps best to talk to tinwood about a long term fix07:48
jrosserHowever in the short term we need to unstick the osa nova tests07:48
*** radeks_ has joined #openstack-ansible07:49
jrosserIf the os_tempest role has a way of filtering out known troublesome plugins we can work around upstream issues07:50
chandankumarjrosser: I will take a look on blacklist plugin07:57
chandankumarjrosser: this one is good to go https://review.openstack.org/#/c/633513/07:57
*** shardy has joined #openstack-ansible08:02
*** sdake has quit IRC08:11
*** shardy has quit IRC08:11
*** shardy has joined #openstack-ansible08:12
*** kopecmartin|off is now known as kopecmartin08:19
evrardjpmnaser: could you have a look at https://review.openstack.org/#/c/631326/ ?08:20
*** markvoelker has joined #openstack-ansible08:20
chandankumarjrosser: these errors are known http://logs.openstack.org/08/633208/9/check/openstack-ansible-functional-centos-7/95a334e/logs/openstack/openstack1/neutron/neutron-l3-agent.log.txt.gz08:26
chandankumarand this one http://logs.openstack.org/08/633208/9/check/openstack-ansible-functional-centos-7/95a334e/logs/openstack/openstack1/nova/nova-api-wsgi.log.txt.gz#_2019-01-29_07_47_46_94108:27
chandankumar?08:27
*** electrofelix has joined #openstack-ansible08:35
*** django has quit IRC08:36
jrosserchandankumar: the first one looks like the neutron vhost setup on rabbitmq is not done until later08:36
jrosserchandankumar: the second one looks more fundamental08:37
*** django has joined #openstack-ansible08:40
*** tosky has joined #openstack-ansible08:42
chandankumarjrosser: is there a way to fix those issue?08:42
*** nurdie has quit IRC08:45
*** nurdie has joined #openstack-ansible08:45
*** shyamb has joined #openstack-ansible08:47
*** nurdie has quit IRC08:50
*** pcaruana has joined #openstack-ansible08:51
*** rgogunskiy has joined #openstack-ansible08:54
*** priteau has joined #openstack-ansible08:54
*** markvoelker has quit IRC08:54
openstackgerritJonathan Rosser proposed openstack/openstack-ansible-os_nova master: Disable nova-lxd tempest plugin during nova-lxd test  https://review.openstack.org/63367709:03
jrosserchandankumar: lets try that ^09:03
*** Darcidride_ has joined #openstack-ansible09:04
*** Darcidride_ has quit IRC09:04
chandankumarjrosser: yes, it will not clone nova-lxd repo, thanks :-)09:13
chandankumarjrosser: one more things, Is there a way to clone those depends on also in test-evirnonment https://review.openstack.org/#/c/633208/ for example this one?09:14
*** faizy98 has quit IRC09:18
*** DanyC has joined #openstack-ansible09:18
*** DanyC has quit IRC09:19
*** DanyC has joined #openstack-ansible09:19
*** DanyC has quit IRC09:22
*** DanyC has joined #openstack-ansible09:22
*** DanyC has quit IRC09:24
*** DanyC has joined #openstack-ansible09:24
*** DanyC has quit IRC09:28
*** DanyC has joined #openstack-ansible09:43
*** markvoelker has joined #openstack-ansible09:51
jrosserchandankumar: sorry i'm not sure about that - i'd need to spin up a vagrant box and start digging..... i'm not very familiar with the tox test stuff09:56
jrosserand afaik the depends-on is resolved by zuul, not the test code in the repo?09:57
chandankumarjrosser: ok09:58
*** PTO has joined #openstack-ansible10:02
*** CeeMac has quit IRC10:08
*** shyamb has quit IRC10:10
*** shyamb has joined #openstack-ansible10:11
*** exbob has joined #openstack-ansible10:16
*** markvoelker has quit IRC10:24
*** shyamb has quit IRC10:38
*** shyamb has joined #openstack-ansible10:40
*** shyamb has quit IRC10:45
jrosserchandankumar: so this https://review.openstack.org/#/c/633677/ unsticks the lxd bit10:59
jrosserwe are just left with whatever is upsetting centos10:59
chandankumarjrosser: we have kept one node on hold11:06
chandankumarfor debugging temepst issude11:06
jrosserexcellent - hopefully we can find whats up there11:06
chandankumarjrosser: but sshing into the node, it appears that all the tempest related venv files are not there11:06
*** d3n14l has joined #openstack-ansible11:06
*** mkuf_ has quit IRC11:07
chandankumarjrosser: does it wipes out all the stuff11:07
chandankumar?11:07
jrosserit should all stay afaik11:07
*** mkuf has joined #openstack-ansible11:07
chandankumarin /root there is only openrc11:08
*** udesale has quit IRC11:09
d3n14lHey guys - trying to deploy octavia (osa rocky) - I have set up the network and it looks good until my dhcp-Agent ports for the lbaas mgmt net are put into VLAN 4095 in OVS. Any hint is appreciated…11:09
*** d3n14l has quit IRC11:12
*** markvoelker has joined #openstack-ansible11:21
*** ansmith has joined #openstack-ansible11:33
*** slaweq has joined #openstack-ansible11:34
slaweqhi11:34
slaweqchandankumar11:34
chandankumarslaweq: jrosser Hello11:34
jrosserhi11:34
slaweqjrosser: hi11:34
chandankumarjrosser: slaweq is in temepst container11:35
jrosserso ssh fails from the tempest container to the vm11:35
*** shyamb has joined #openstack-ansible11:35
slaweqjrosser: I didn't check it yet but it looks so11:35
slaweqfor now I tried manually to create network/subnet/router and spawn vm11:35
jrosserhere is the error http://logs.openstack.org/77/633677/1/check/openstack-ansible-functional-centos-7/8d55690/logs/openstack/infra1/stestr_results.html11:35
slaweqall worked fine for me11:36
jrosserwhat is the ip of the vm?11:36
slaweqjrosser: my manually created vm is 4b6e7448-2ab4-49d5-bdf8-6cfcd1b3a90c11:37
slaweqjrosser: it pings from host but not from tempest-1 container11:38
jrosseri cant ping the ip of the router 73a5c739-1758-44ad-9ea7-891a043b9945 10.1.3.197 from inside the tempest container11:38
jrosseralso cannot ping router 6f50d731-c43e-4e55-80fd-d4f429e3c444 10.1.3.10111:39
slaweqyep, all is reachable from host but not from container11:40
chandankumarjrosser: slaweq this review was added https://github.com/openstack/openstack-ansible-tests/commit/fe6c8344d1cdf23add574a461af280d4033b8428 to use systemd networkd role11:42
slaweqjrosser: when I added IP address from "public" network inside container on eth12 it works fine11:43
slaweqip a a 10.1.3.254/24 dev eth1211:43
jrosserok i can now ping the router11:44
slaweq[root@tempest1 ~]# ping 10.1.3.19711:44
jrosseryes we did the same thing at the same tim e:)11:44
slaweqPING 10.1.3.197 (10.1.3.197) 56(84) bytes of data.11:44
slaweq64 bytes from 10.1.3.197: icmp_seq=1 ttl=64 time=0.924 ms11:44
slaweq:)11:44
slaweqso IMHO this is missing in container config11:44
jrossereth12 is in the same bridge on the host as br-vlan11:44
jrosser^ ykwim11:44
slaweqyes, it's on same bridge11:45
jrosseri wonder why this works on the other distros11:45
slaweqbut in container You don't have route to it11:45
jrosseri tried adding a route via eth0 but that didnt work11:45
slaweqso You have:11:45
slaweq[root@tempest1 ~]# ip route get 10.1.3.19711:45
slaweq10.1.3.197 dev eth0 src 10.100.100.5111:45
slaweq    cache11:45
chandankumarslaweq: https://github.com/openstack/openstack-ansible-os_tempest/search?q=eth12&unscoped_q=eth1211:45
jrosserperhaps the default route works on the other platforms11:45
slaweqand Your ping is going through eth011:45
slaweqwhen we added IP from same subnet on eth12 packets to IP from this network were send via eth1211:46
slaweqnot via default route and it works then11:46
slaweqjrosser: but I can't help with OSA containers config - I have zero experience with that :/11:46
jrosseri think we need to add vlan_address in here https://github.com/openstack/openstack-ansible-os_tempest/blob/8bc47db7c00d1dbdcba2464ff0c22a364f44faf8/tests/host_vars/tempest1.yml11:47
jrosserand then wire it into the setup of the tempest container11:48
jrosserlet me take a look at that - i have some meetings but will try to do it after lunch11:48
slaweqjrosser: thx11:48
chandankumarslaweq: jrosser thanks :-)11:48
jrosseri have no idea why only on centos this breaks though :/11:48
slaweqjrosser: I'm disconnecting from this nodes now11:48
slaweqif You would need help from neutron side, ping me or someone on neutron channel later11:49
slaweqI will be afk in few minutes11:49
chandankumarjrosser: let me know when you are done, so that we can inform the openstack-infra team for reuse :-)11:49
jrosseri'm logged out now, you can release the node unless we want it for anything else11:49
*** rgogunskiy has quit IRC11:49
chandankumarjrosser: sure11:50
*** ansmith has quit IRC11:52
*** markvoelker has quit IRC11:53
*** d3n14l has joined #openstack-ansible11:53
*** shyamb has quit IRC11:53
*** shyamb has joined #openstack-ansible11:55
*** kukacz has quit IRC12:04
*** kukacz has joined #openstack-ansible12:04
*** fdegir is now known as fdegir_12:10
odyssey4meThiagoCMC two things - if the repo masters group has no hosts, then something is broken in your inventory/host group config.... without that, there are no repo servers... and without those, you'll be running an OSA deployment i a very slow and non-repeatable way12:19
odyssey4meThiagoCMC also, online migrations for cinder are important - if not running them makes the playbook continue, then you have something broken somewhere12:19
chandankumarjrosser: so basically we need to centos7 job then we can able to land lxd changes12:20
*** mkuf has quit IRC12:20
odyssey4mechandankumar if https://review.openstack.org/633655 fixes the issue, then perhaps that means that we need to add pylxd as a package into the OSA venv, or we need to ensure the plugin includes it as a dependency?12:21
odyssey4mejrosser chandankumar given that our tests aren't using the nova-lxd tempest plugin, perhaps it's best for us to just remove it from the tempest role - or perhaps to prevent re-addition by mistake, we should comment it out of the lists with a note that it is intentionally left out12:23
chandankumarodyssey4me: +1 on removal12:23
chandankumarodyssey4me: jrosser is working on a patch to fix tempest container vxlan networking issue12:24
odyssey4memnaser evrardjp I noticed today that Stein is past M2, and OSA has not released M1 or M2, and as I recall, you have to release at least 2 milestones in order to be accepted for the final release. Is this something that we know about and are sorting out, or are we dropping the ball?12:24
evrardjpI don't think that's still the case12:25
evrardjpI think with the move to cycle-with-rc that requirement dropped12:26
evrardjpbut let me double check12:26
jrosserodyssey4me: yes we have two options, remove the nova-lxd plugin entirely or just disable it like this https://review.openstack.org/#/c/633677/12:26
nowsterSigh. Just been chasing a failure. If one reboots a compute node, nova starts before the libvirt framework, and promptly disables itself.12:26
*** mkuf has joined #openstack-ansible12:26
evrardjpodyssey4me: I have a confirmation12:27
evrardjpit's not needed anymore12:27
evrardjpodyssey4me: I will check if the docs in release is up to date12:29
*** fdegir has joined #openstack-ansible12:30
odyssey4meevrardjp oh ok, thanks - I didn't realise there was a governance change... I guess this takes the pressure off a bit12:30
evrardjpodyssey4me:  you can see the text was adapted in here: https://releases.openstack.org/reference/release_models.html#cycle-trailing . The cycle-with-milestones where this applied is considered legacy12:31
evrardjp(we moved to cycle-with-rc)12:31
odyssey4menowster hmm, which release is that - we had a patch go in some time ago to fic that12:31
evrardjpodyssey4me: I think the requirement for trailing was changed further longer ago, but anyway, long story short, we don't need to release that often, as it doesn't really make sense for us (we should rather point to master instead)12:32
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Remove nova_lxd tempest plugin  https://review.openstack.org/63371112:32
*** fdegir_ has left #openstack-ansible12:32
chandankumarodyssey4me: ^^12:32
chandankumarodyssey4me: will I remove the nova_lxd service available flag also?12:32
chandankumarevrardjp: cloudnull https://review.openstack.org/633513 is good to go12:33
evrardjpchandankumar: will have a look12:33
PTOIs the openstack ansible pike release dead?12:33
odyssey4mechandankumar I think it might be better to just set the enablement to 'false' like jrosser did in https://review.openstack.org/#/c/633677/, with the same note to ensure we remember why... but leaving that does mean we can still enable it later if we want to12:34
odyssey4mePTO nope12:34
chandankumarodyssey4me: sure12:34
odyssey4mePTO it passed the deploy test just yesterday: http://zuul.openstack.org/build/ddae77d295a944fea1593ab7a6759fe412:34
evrardjpPTO: ocata is "kinda" as we decided to not release anymore12:34
odyssey4mewell, this morning12:35
evrardjpPike and others are fine12:35
PTOI just tried to bootstrap the pike release again and some git repos are missing (have been deleted on github.com)12:35
evrardjpI haven't tagged Pike last week-end, which I had to do on monday which got pretty busy. But it's still on my todolist for today.12:35
evrardjpPTO which release of pike?12:35
odyssey4mePTO yeah, that's a known issue which is fixed at stable/pike, but not in a release yet12:36
evrardjpoh the ceph bit?12:36
odyssey4meyep12:36
evrardjpok12:36
evrardjpyeah I tried to find time on sunday, but I got sidetracked12:36
PTOthe ceph-defaults12:36
evrardjpI am only part time in this (0% of my full time:p )12:36
odyssey4mePTO the ceph folks deleted all the ceph-* role repositories.12:37
PTOI'm gonna upgrade very soon. Can i jump from pike to rocky in one step?12:37
PTOOr should i goto queens first?12:37
odyssey4mePTO I know we have done some tests - maybe antonym can comment when he comes online. However, the official response would be that you must go through each release.12:38
jamesdentonmornin' folks12:38
odyssey4meYou *might* be able to skip some things, but you'd have to do test runs of that in a lab env to qualify parts you can skip.12:38
PTOSo better be safe and jump to queens first - agree?12:39
odyssey4meYes, absolutely.12:39
PTOSo just follow the guide, checkout and run-update.sh12:39
odyssey4mePTO either that, or run the steps the script does yourself by hand - it's up to you and your config.12:40
odyssey4meAnd your uptime expectations.12:40
*** pcaruana has quit IRC12:40
PTOI think im gonna try the script (easy mode) and if it fails then go through the steps12:40
PTOShould I apply any minor updates in pike before the major upgrade to queens?12:41
jamesdentonPay special attention to the notes of neutron agent going baremetal (from container). odyssey4me worked out a lot of the automation to make that smooth, but any feedback is appreciated12:42
odyssey4mePTO yeah, pike->queens includes a consolidation of multiple containers per service to one per service, and a move of the neutron agents on to bare metal.12:43
PTOInteresting... I have not yet deployed all minor patches. Should these be deployed before attempting the major upgrade?12:44
odyssey4mePTO it's usually better to update to the latest release tag in the series, then do the upgrade, because that's closest to what we test... but you can do a validation test in a lab to see whether your current tag will just work... or of course you can look through the changes in the same series to see whether they look like they should be done before upgrading... it's really down to your specific use case12:46
PTOI was planning to update the minor releases, but im not able to bootstrap the pike package. Any ETA on when you will release a fix?12:47
ioniPTO, odyssey4me what's important and not mentioned in the upgrade documentation is to move the agents from container to bare metal before deleting the container12:47
PTO@ioni good point! I will write that down :-)12:48
ionii had issues with dhcp ports not being moved automatically to bare metal12:48
ionialso make sure to update network configuration12:48
ionibr-vxlan didn't had any ip on the controller12:48
PTOI assume the script will redeploy the agents on bare metal during the process - correct?12:48
odyssey4meioni oh? then that's a bug and we should fix that - if you can register the bug and the steps you had to do manually, then we can automate it in12:48
ionibecause it ip was inside the container12:48
ioninow you have to create the right configuration for br-vxlan and br-vlan(the eth12 pair)12:49
evrardjpyeah that sounds like a bug12:49
odyssey4mePTO you can just use stable/pike rather than a tag for now... or wait for the tag release12:49
*** pcaruana has joined #openstack-ansible12:50
ioniodyssey4me, well, i'm not sure if is a bug or not, on my 5 regions that i've done the upgrade, only one didn't moved automatically the port12:50
*** kaiokmo has joined #openstack-ansible12:50
ionii had to disable dhcp and reenable it12:50
odyssey4meioni oh, that's odd12:50
PTOodyssey4me: git checkout stable/pike12:50
*** markvoelker has joined #openstack-ansible12:50
odyssey4mePTO yep12:50
ionibecause the agent id was missing in order to move it using neutron commands like this: https://www.openstackfaq.com/openstack-migrate-routers-and-dhcp/12:51
PTOcool12:51
PTOI have bootstraped the queens package. Should I manually remove /etc/ansible/roles/* before?12:52
ionithere is a playbook for that12:52
ionihttps://docs.openstack.org/openstack-ansible/queens/admin/upgrades/major-upgrades.html12:52
odyssey4mePTO yeah, if you're using stable/queens then there is... I don't think the adjustment made it into the last queens release12:53
odyssey4mebut yes, it's advisable to wipe out /etc/ansible/roles/ceph* before doing the pike/queens upgrades12:53
ioniPTO, i was able to boostrap pike  16.0.2512:54
ionii  have a forked version of openstack-ansible that i manage into my private git12:54
ioniand from time to time a sync12:54
*** d3n14l has quit IRC12:55
odyssey4meI'll propose https://github.com/openstack/openstack-ansible/commit/d528daf069559c3686f05a26f9b4d68c84a34b77#diff-0e0b5a4ebeeb2dd9a60106998e218e0b to pike to make sure people know to do that12:55
PTOI'm currently running pike (16.0.9). Got the queens branch and did a bootstrap.12:55
*** shyamb has quit IRC12:55
evrardjpOk it seems there is some kind of urgency to release so I will stop what I am doing to do it12:56
evrardjpit thought it could wait a few more hours.12:56
PTOI can go with the stable/pike branch - no problems12:57
PTOJust want to know how to "roll back" the queens bootstrap12:57
*** mkuf_ has joined #openstack-ansible12:59
openstackgerritJesse Pretorius (odyssey4me) proposed openstack/openstack-ansible stable/pike: Add release note about ceph role changes  https://review.openstack.org/63371812:59
odyssey4mePTO to roll back the ansible bootstrap, you'll need to wipe /opt/ansible-runtime and wipe /etc/ansible/roles - then checkout stable/pike and do the bootstrap again13:01
odyssey4meevrardjp no urgency for a pike release just yet, because https://review.openstack.org/633718 should still go in first :)13:01
evrardjpok13:02
*** mkuf has quit IRC13:02
evrardjpwell13:02
openstackgerritJean-Philippe Evrard proposed openstack/openstack-ansible stable/rocky: Bump version to 18.1.4  https://review.openstack.org/63372013:02
evrardjpyour patch can merge fast13:03
evrardjplet me -w another patch then13:03
*** priteau has quit IRC13:04
*** priteau has joined #openstack-ansible13:04
evrardjpodyssey4me: I guess your patch should be merged first, so we should probably get another vote from cores?13:04
evrardjpmaybe jrosser?13:04
nowsterSigh². It appears that the VXLAN is being mapped to br-vxlan on the infra host, but br-mgmt's IP on the compute node.13:05
evrardjpanyway now that I am interrupted, I will just do the other releases now13:05
evrardjp:p13:05
nowsterwrong values in the linuxbridge conf.13:07
ioninowster, are you sure that on compute node  has an ip to br-vxlan  ?13:09
ionii had this issue on infra when i forgot to apply networking modification from pike to queens13:09
openstackgerritJean-Philippe Evrard proposed openstack/openstack-ansible stable/queens: Bump version to 17.1.8  https://review.openstack.org/63372513:09
nowsterioni: I'm checking now.13:10
*** priteau has quit IRC13:12
openstackgerritJonathan Rosser proposed openstack/openstack-ansible-tests master: Set a defined IP address range for tempest test public addresses  https://review.openstack.org/63372813:14
*** strattao has joined #openstack-ansible13:18
*** markvoelker has quit IRC13:20
*** gkadam has quit IRC13:20
openstackgerritJonathan Rosser proposed openstack/openstack-ansible-os_tempest master: Add an ip address to eth12 in OSA test containers  https://review.openstack.org/63373213:28
jrosserchandankumar: ^ that should fix it13:28
odyssey4mejrosser oh nice, and thank you so much for figuring it out - assuming the test passes ;)13:29
jrosserwell i hope so - this had been on my mind as the proxy scenario is failing, and i'd come to a very similar conclusion about why that was13:30
nowsterioni: the interface is there and has the right address13:32
jrossernowster: i think that this https://github.com/openstack/openstack-ansible/blob/master/playbooks/common-tasks/dynamic-address-fact.yml is used in the neutron setup to decide which IP to pick13:33
nowsterta. I fixed up the linuxbridge config and it seems to have done the right thing13:34
openstackgerritMerged openstack/openstack-ansible stable/pike: Add release note about ceph role changes  https://review.openstack.org/63371813:34
nowsterjrosser: looks like it picked the wrong one, as it had the ip address of br-mgmt in there13:36
*** ansmith has joined #openstack-ansible13:36
chandankumarjrosser: thanks, In a meeting, will take a look soon13:38
*** zul has quit IRC13:38
PTOodyssey4me: thx for clarifying13:38
jrossernowster: or the data being fed in is wrong, it will take a bridge name and return the IP, so if the openstack_user_config has br-mgmt somewhere instead of br-vlxan then the same thing will happen13:40
*** nurdie has joined #openstack-ansible13:42
*** mkuf has joined #openstack-ansible13:43
*** mkuf_ has quit IRC13:45
odyssey4meevrardjp you can release the -w on https://review.openstack.org/633348 now13:45
evrardjpreleasing the kraken!13:47
evrardjpand pike13:47
evrardjpI am sad release k was kilo and not kraken13:48
openstackgerritMerged openstack/openstack-ansible-tests master: Ensure selinux bindings are linked into the venv  https://review.openstack.org/63351313:48
PTOI have read somewhere that its possible to use swift with ceph as storage backend. Is this somethink you have looked at?13:51
odyssey4mePTO I don't know if that's ever been a thing, but ceph has ceph rgw, which provides a swift API to a ceph back-end13:52
*** cmart has joined #openstack-ansible13:53
ioninot related top OSA, but i know that you guys also operate public or private clouds.13:54
ionihow do you "disable" obsolete images ?13:54
ionii use --deactivate but this has problems with nova when instances want to resize, i got Not authorized for image and instances is then in error state13:54
*** CeeMac has joined #openstack-ansible13:55
PTOJust wanted to check if you had a code snippet in your stash for testing :-)13:55
CeeMacafternoon channel13:55
jrosserPTO i run ceph rgw with both swift and S3 API13:55
jrosseryou should find an example setup in the osa ceph test scenario iirc13:56
jrossercertainly for swift api13:56
ThiagoCMCodyssey4me, thanks for clarifying that! Not sure why repo masters aren't there, here is my conf: https://github.com/tmartinx/openstack_deploy/blob/master/openstack_user_config.yml - can you see something wrong?14:02
odyssey4meThiagoCMC odd, repo_infra-hosts should put it there.14:02
ThiagoCMCThe cinder thing finally worked! But Cinder still can't create volumes on Ceph. When I try to create a vol, the cinder-vlumes becomes "Down" but, daemon still running, very weird.14:02
ThiagoCMCodyssey4me, I'll do it now! Thanks!14:03
ThiagoCMCI already have "repo-infra_hosts:"14:03
ThiagoCMC:-/14:03
ThiagoCMCis it with underscore or dash?14:04
odyssey4meThiagoCMC ok, OSA is built in such a way that if it's got the right config it'll just work - so if something turns out broken, then usually it's missing config or missing underlying network/storage config... so I'd suggest ensuring that you get it running through setup-infrastructure without error before trying to fix setup-openstack14:04
odyssey4meThiagoCMC you have 'repo-infra_hosts' which is correct, so it should be there - if you run through the repo-server playbook, does it work?14:05
ThiagoCMCyeah, I just did this yesterday for the first time, setup-everything.yml worked without a single error.14:05
ThiagoCMCBut still, Glance can't upload images to Ceph (while openstack images list works), Cinder can't create volumes and Heat (openstack stack list) returns error 500.14:06
ThiagoCMCHard time... lol14:06
ThiagoCMCThe syntax check tells me about the repo masters with a warning.14:06
ThiagoCMCbut I can deploy Rocky anyway14:07
odyssey4meThiagoCMC yeah, the syntax check will return that - that's not a concern14:07
ThiagoCMCHmm... ok lol14:07
odyssey4meok, if glance can list but not upload images, then that points at some sort of inability to write to the back-end14:07
odyssey4mecheck the cinder-volume service log...14:08
ThiagoCMCNothing hits the cinder-volume logs.14:08
ThiagoCMCI was watching with journalctl, nothing.14:08
odyssey4meThiagoCMC ok, check the systemd journal on the cinder-volume host?14:08
ThiagoCMCyep, that's what I did14:09
odyssey4meok, try watching that and restarting the cinder-volume service?14:09
ioniquestions regarding keystone in rocky14:09
ioniis not used anymore 35357  ?14:09
odyssey4meioni nope14:09
ThiagoCMCjournalctl -f -u cinder-volume - after systemctl restart cinder-volume14:09
ioniso we have to delete the old nginx configuration for that port14:09
ioniis not going to work if updating from a version that had one14:09
ionii had this problem now14:09
ThiagoCMCI can see it is restarted and up, then, when I try to create a vol, it fails and status down but daemon still running.14:09
odyssey4meioni I think we baked that all in already14:10
ionicool14:10
ioniwainting for new tag then14:10
odyssey4meioni https://github.com/openstack/openstack-ansible-os_keystone/commit/ff63ec8a3ef0057eced6467980f4f5c4833e0db614:10
*** zul has joined #openstack-ansible14:10
ionicool14:10
ThiagoCMCI have to drive to office now, chat soon! I'm in desperate need for help!  ^_^14:10
ioniodyssey4me, so what happens for old endpoints that point to 35357?14:11
odyssey4meThiagoCMC hmm, so the cinder agent list shows it as down?14:11
odyssey4meioni hmm, I don't think we have something to delete it...?14:11
ioniodyssey4me, right now i have a mix setup with rocky and queens14:12
ionirocky is where the keystone is14:12
chandankumarjrosser: http://logs.openstack.org/32/633732/1/check/openstack-ansible-functional-centos-7/cff6c8f/job-output.txt.gz#_2019-01-29_14_01_52_15017014:12
nowsterjrosser: can't see where that might be wrong. I've just been through openstack_user_config.yml14:12
*** shyamb has joined #openstack-ansible14:12
chandankumarjrosser: failing at create container mac script14:13
odyssey4meioni yeah, it looks like we don't have a task to remove the old admin endpoint - could you register a bug or submit a patch for that?14:14
ioniodyssey4me, well, i think it got updated to 5000 for admin14:14
odyssey4meioni oh yes, that is right14:15
ioniodyssey4me, i see it on rocky region having 500014:15
ioniodyssey4me, i need to see if queens works with admin being 500014:15
odyssey4meoh, that's correct - there is still an admin endpoint, just on the same port as all other endpoints14:15
odyssey4meit's no longer a seperate wsgi app14:15
ionicool14:15
ioniless memory :D14:15
odyssey4meyep14:15
ionisorry for the noise, i wasn't up to date from git14:16
ionii'm testing the branch with latest commit being  Merge "Update all SHAs for 18.1.3" into stable/rocky14:17
jrosserchandankumar: yes just having a look14:18
*** samc-bbc has joined #openstack-ansible14:19
*** shyamb has quit IRC14:20
openstackgerritJonathan Rosser proposed openstack/openstack-ansible-os_tempest master: Add an ip address to eth12 in OSA test containers  https://review.openstack.org/63373214:21
jrosserchandankumar: i missed one of the containers, that should be better14:22
nowsterGood news after that is that I have IPv6 pings having fixed the vxlan binding.14:23
*** udesale has joined #openstack-ansible14:24
jrossernowster: what did you need to fix in the end - is there a bug we need to look at?14:29
*** gshippey has joined #openstack-ansible14:33
nowsterjrosser: it was "local_ip = 172.29.xxx.12" in /etc/neutron/plugins/ml2/linuxbridge_agent.ini on the compute node.14:33
nowsterxxx = 236 from ansible, I changed it to 240, and things meshed correctly14:34
nowster236 is mgmt, 240 is vxlan (as per example config)14:34
openstackgerritJean-Philippe Evrard proposed openstack/openstack-ansible stable/pike: Bump version to 16.0.26  https://review.openstack.org/63375114:36
*** SimAloo has joined #openstack-ansible14:37
openstackgerritJonathan Rosser proposed openstack/openstack-ansible-os_tempest stable/rocky: Update all plugin urls to use https rather than git  https://review.openstack.org/63375214:37
*** SimAloo has quit IRC14:37
*** sum12 has quit IRC14:38
*** sum12 has joined #openstack-ansible14:38
*** dave-mccowan has joined #openstack-ansible14:38
*** SimAloo has joined #openstack-ansible14:40
*** dave-mccowan has quit IRC14:45
*** pcaruana has quit IRC14:45
*** sdake has joined #openstack-ansible14:50
*** sdake has quit IRC14:51
*** pcaruana has joined #openstack-ansible14:53
evrardjpodyssey4me and cores: does it sounds reasonable to say bootstrap-ansible is always run when doing a minor update in a branch14:54
odyssey4meevrardjp absolutely, yes14:54
*** sdake has joined #openstack-ansible14:54
evrardjpI thought too.14:54
jrosserevrardjp: otherwise you dont get the roles checked out to the right points14:54
odyssey4meevrardjp shall I put a patch together to take the option of using ansible-galaxy out?14:55
evrardjpwell14:55
jrosseror a potential minor ansible version upgrade14:55
evrardjpjrosser: I thought ppl could use the play to fetch latest, and not update ansible for example14:55
evrardjpbut yeah, I think it's fair to say so14:55
evrardjpodyssey4me: good idea14:55
odyssey4meok, will do that now - then we can add a tracking branch into a-r-r too14:55
evrardjpyeah14:56
evrardjpexplicit is better than implicit :)14:56
evrardjpbut next to that I got an idea for the openstack-ansible wrapper that removes the need to update the update file. Simple :)14:56
evrardjpthe version file*14:57
evrardjpI found a bug in bootstrap ansible at the same time14:57
evrardjpso I will file a few things14:57
evrardjpwith that in mind we should do a first alpha release of master branch14:59
evrardjpthen we can go full auto14:59
evrardjp I am thrilled :)14:59
* nowster tries to work out what sets neutron_local_ip on each type of host15:00
odyssey4me:)15:00
nowsterbut meeting first15:00
*** nurdie has quit IRC15:00
*** nurdie has joined #openstack-ansible15:01
openstackgerritJesse Pretorius (odyssey4me) proposed openstack/openstack-ansible master: Remove ANSIBLE_ROLE_FETCH_MODE  https://review.openstack.org/63375615:04
odyssey4meevrardjp ^15:05
*** nurdie has quit IRC15:06
evrardjpwoot15:10
evrardjpI am in a meeting but reviewing15:10
chandankumarjrosser: thanks !15:11
jamesdentonnowster neutron_local_ip should be populated from tunnel_address for a given host/container15:11
antonymPTO: odyssey4me: i was able to jump from newton to queens (and separately rocky) last week in the lab by migrating all DBs for each release, and then only running the final release upgrade.  i pulled some cleanup stuff from queens along with the neutron bare metal migration scripts and everything still seemed to work afterwards, there were a few oddies that come up but nothing really major and easy15:13
antonymto fix up.  i'm continuing testing on it this week to automate15:13
odyssey4meantonym oh nice :) I'm guessing that's mostly just doing a subset of actions from each series?15:14
antonymyeah, i tossed together some playbooks that stitch together all of the actions migrations for each release and runs them from a venv, just loop through that for each release and then run the final target run-upgrade.sh... then just have to go back and pick up all the cleanup items from older upgrades15:15
openstackgerritJean-Philippe Evrard proposed openstack/openstack-ansible master: Define OSA clone dir in the openstack-ansible.sh script  https://review.openstack.org/63375915:17
nowsterthis is odd: http://paste.openstack.org/show/744170/15:17
antonymit's running all the ansible_fact_cleanup, config changes, secrets adjustments, etc too for each release so we're not missing anything15:17
nowsterI'd be expecting an "After=libvirtd.service" in there.15:18
*** udesale has quit IRC15:18
jrossernowster: can you give this a spin? https://review.openstack.org/#/c/633104/15:19
*** jwitko has joined #openstack-ansible15:22
nowsterjrosser: that seems sensible to me15:23
* nowster = meetinging15:24
*** jawad_axd has quit IRC15:27
*** jawad_axd has joined #openstack-ansible15:28
*** jawad_axd has quit IRC15:29
*** sdake has quit IRC15:32
evrardjpantonym: nice15:34
evrardjpodyssey4me: what do you think of https://review.openstack.org/#/c/633759/1 ?15:34
*** sdake has joined #openstack-ansible15:35
*** nurdie has joined #openstack-ansible15:38
openstackgerritJean-Philippe Evrard proposed openstack/openstack-ansible master: Mark OSA version in the wrapper script  https://review.openstack.org/63376215:43
openstackgerritJean-Philippe Evrard proposed openstack/openstack-ansible master: Use an env lookup to determine the OSA version  https://review.openstack.org/63376315:43
*** mattheca has joined #openstack-ansible15:44
odyssey4meevrardjp will look in a bit after my meeting15:46
openstackgerritJean-Philippe Evrard proposed openstack/openstack-ansible master: Use an env lookup to determine the OSA version  https://review.openstack.org/63376315:47
*** ztr has joined #openstack-ansible15:50
*** openstackgerrit has quit IRC15:51
*** openstackgerrit has joined #openstack-ansible15:51
openstackgerritFrancois Deppierraz proposed openstack/openstack-ansible-lxc_hosts stable/rocky: Increase LXC container default shutdown delay  https://review.openstack.org/63376715:51
*** mbuil has joined #openstack-ansible15:53
mbuilguys, could you check https://review.openstack.org/#/c/622216/ please? thx!15:53
openstackgerritFrancois Deppierraz proposed openstack/openstack-ansible-lxc_hosts stable/rocky: Increase LXC container default shutdown delay  https://review.openstack.org/63376715:55
CeeMacanyone got a fix for dhcp-agent unable to bind port, with vif_type=binding_failed noted in dhcp-agent logs?15:58
CeeMacenabled dubug, can't see anything more useful in there sadly :(15:59
mnasercloudnull, DimGR, d34dh0r53, hughsaunders, b3rnard0, palendae, odyssey4me, serverascode, rromans, erikmwilson, mancdaz, _shaps_, BjoernT, claco, echiu, dstanek, jwagner, ayoung, prometheanfire, evrardjp, arbrandes, scarlisle, luckyinva, ntt, javeriak, spotz, vdo, jmccrory, alextricity25, jasondotstar, admin0, michaelgugino, ametts, bgmccollum, darrenc, JRobinson__, colinmcnamara, thorst, adreznec, eil397,16:00
mnaserqwang,nishpatwa_, cathrichardson, drifterza, hwoarang, cshen, ullbeking, mnaser, nicolasbock, jrosser, cjloader, antonym, dcdamien, jamesdenton16:00
mnasermeeting time!16:00
mnaser#startmeeting openstack_ansible_meeting16:00
openstackMeeting started Tue Jan 29 16:00:29 2019 UTC and is due to finish in 60 minutes.  The chair is mnaser. Information about MeetBot at http://wiki.debian.org/MeetBot.16:00
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.16:00
*** openstack changes topic to " (Meeting topic: openstack_ansible_meeting)"16:00
openstackThe meeting name has been set to 'openstack_ansible_meeting'16:00
mnaser#topic rollcall16:00
mnasero/16:00
*** openstack changes topic to "rollcall (Meeting topic: openstack_ansible_meeting)"16:00
hwoarango/16:00
evrardjpo/16:00
prometheanfireo/16:01
mnaser(sorry for the past 2, i meant to send an email asking if someone could run it, only 2 weeks off :)16:01
mnasernot much attendance16:02
mnaser#topic last week highlights16:02
*** openstack changes topic to "last week highlights (Meeting topic: openstack_ansible_meeting)"16:02
mnasersection seems empty, is anyone around to share anything in specific?16:02
evrardjpnot really16:02
evrardjpmaybe jrosser or odyssey4me16:03
prometheanfiresure :D16:03
odyssey4meapologies - I'm stuck in another meeting16:03
mnaseri see some gentoo patches finally ;)16:03
prometheanfirehttps://review.openstack.org/#/q/topic:add-gentoo-support+status:open gentoo stuff is working, though we need systemd-241 to finally be released16:03
evrardjpdon't say it's a highlight of last week: p16:03
prometheanfireof course it is, for me :P16:03
evrardjp:)16:04
mnaserhehe, gentoo is an interesting deployment target16:04
evrardjpprometheanfire: if this isn;t a done deal yet, should we speak about that during open discussion?16:04
prometheanfireonce the dib change merges and dib release is made (I don't think os-infra builds from master) and then the gentoo image is rebiult osa-tests should pass16:04
evrardjpmnaser: so is tumbleweed ? :D16:04
hwoaranggentoo is always a highlight16:04
prometheanfireevrardjp: sure16:04
jrosserwe need to get the tempest and nova stuff unstuck, but there seems to be progress on that today16:04
mnaserevrardjp: you sold out! :P16:05
evrardjpmnaser: :)16:05
mnaseri was hoping to try and dive through, i know centos-7 is been bad16:05
mnaserand as part of that was just trying to rip out the container bits16:05
evrardjpI thought of comparing how much it would take me to build an arch linux OSA thing. Probably faster than doing it on gentoo :p but I will stop the flamebait there16:05
mnaserbut anyways, our triage list has grown big but i feel like everyone gets bored and disappears in triage :)16:06
evrardjpthat's kinda true16:06
mnaserso i'm proposing a short open discussion portion where we can talk about this stuff now, then we can do bug triage with whoever survives hah16:06
evrardjpit's sad16:06
evrardjpyeah that sounds fair16:06
evrardjpwhat about organising a bug killing day?16:06
mnaserevrardjp: sounds like a good idea, i'll try to gather up and see what everyone's availabilties seem like over the ML16:07
evrardjpI haven't done one in the last cycles, but I used to run one.16:07
evrardjpthanks16:07
mnaserjrosser: i see you had policy-in-code stuff in open discusison, was that from last week or meant for todays?16:07
jrosserit was for last week but we were time out16:07
*** jpward1981 has joined #openstack-ansible16:07
mnaseri assume it's removing all the hard coded stuff we ship in our roles16:08
jrosserjust really for someone who knows the deal there to update on what still needs to be done16:08
chandankumarodyssey4me: mnaser https://review.openstack.org/#/c/633732/ needs merges unblocks centos gates16:08
mnaserthere is a list of all projects that have moved to policy in code16:08
evrardjpoh yeah I have another topic for open discussion: releasing. Bumping is now automatic, and I have a few patches in to have automatic versioning with setuptools, which should be good enough to not change code anymore. Releases would still require manual intervention to say what/when to tag, until releases CLI is working for us at 100% (stein and above)16:09
chandankumarjrosser: it worked16:09
chandankumarjrosser: we are good to go now :-)16:09
chandankumarthanks to jrosser and slaweq for the gates fixes :-)16:09
odyssey4meyeah, I was thinking that perhaps we need to organise a hack day around each milestone and get agreement from our employers to do it16:10
jrossermnaser: there seem to have been a few bugs crop up which felt related to policy stuff16:10
mnaserodyssey4me: just sent an email to the ML about that, so that'd be cool :)16:10
odyssey4meit's been quite tough to get focused attention, and loads of bugs are just sitting there with no attention16:10
mnaser++16:10
mnaserchandankumar: good work on catching that, thank you.16:11
chandankumarmnaser: it's a team work, we jrosser odyssey4me and slaweq did it :-)16:11
mnaserevrardjp: i like that, simplifying our life is always a good thing.  we're all quite busy16:11
guilhermespnow with more time to take a look at the osa bugs we found during some deployments, this week Im focusing on a bunch of PR to submit. One of the focus is related to upgrade jobs https://review.openstack.org/#/c/627782/16:13
evrardjpshould that series of patches merge, stein will be able to be released fully automatically. The patches can still be backported for simpler releasing in older branches, but not 100% perfect solution there.16:13
guilhermespI'm going to take a look at the failures but me and mnaser agreed that the workspace fix is still not complete https://review.openstack.org/#/c/633549/16:13
mnaserright, upgrades has been rough for us, and i'm pretty sure there's a bug with the way we deploy rabbitmq too where a cluster failure results in the cluster not routing anything anymore unless you delete all queues16:14
mnaseri've seen this repeatedly over multiple rocky envs, so there's still some clean up and work to do16:14
odyssey4meouch16:14
mnaserdeleting a vhost isnt enough, you have to delete every single queue, and it just magically starts working again16:14
mnaserit's affected us and a few customers. i'm confident it's a confirmed issue by now as it's always been fixed this way. i haven't had time to dig deeper, but yeah.16:14
mnaseranyhow, subjects so far: releases, upgrades and hackday.16:15
mnaserreleases => we will try to use the new tooling that evrardjp worked on and then *IF* someone has time, we could backport i guess16:15
mnaserupgrades => guilhermesp is working on it and will continue to iterate, we're so so so close because it's failing in tempest after a full upgrade, so that's great news overall16:16
mnaserhackday => i sent an email to ML, so if you can respond to it, that'd be awesome :)16:16
odyssey4meyeah, let's see how it goes with stein - then work it back if it all goes well16:16
odyssey4mefor upgrades, I'm happy to help - although I need to focus back on figuring out the final bits for the python builds16:17
chandankumarmnaser: on centos Jobs, we find errors in neutron logs, is there any plan to get rid of that16:17
chandankumarin the morning, jrosser and I were discussing about that16:17
mnaserodyssey4me: i think your time is well invested in the python build to wrap it up, in the meantime i'll work with guilhermesp to get upgrades done, it should be minor things afaik16:17
mnaserchandankumar: do you mind explaining more about that?16:17
chandankumarmnaser: grabbing the logs16:18
chandankumarmnaser: http://logs.openstack.org/32/633732/2/check/openstack-ansible-functional-centos-7/a8cb2f1/logs/openstack/openstack1/neutron/neutron-dhcp-agent.log.txt.gz#_2019-01-29_15_09_31_52616:20
mnaserchandankumar: thats probably because the service goes up before we setup the mq's16:20
odyssey4meyeah, it'd be nice to sort that out16:21
odyssey4meit should be a straightforward fix - just re-ordering some tasks16:21
chandankumarmnaser: http://logs.openstack.org/32/633732/2/check/openstack-ansible-functional-centos-7/a8cb2f1/logs/openstack/openstack1/neutron/neutron-l3-agent.log.txt.gz#_2019-01-29_15_09_30_54416:21
chandankumarwe fixed libvirt import error issues16:21
chandankumarmnaser: on tripleo side, we have a role named collect-logs to dump all errors in a single file16:22
chandankumarmnaser: I will check with wes tomorrow how we can use it here16:22
mnaseroh that's super awesome.  yes, let's share tooling. chandankumar16:23
mnaseri have a subject -- evrardjp brought this up before but i think we should move to office hours instead of an actual meeting16:24
chandankumarmnaser: odyssey4me something like this http://logs.openstack.org/85/633185/8/check/tripleo-ci-centos-7-standalone/9c2e95c/logs/undercloud/var/log/extra/errors.txt.gz16:24
mnaserif you use a role to collect the logs, we can probably reuse it in the gate together16:24
chandankumarhttps://github.com/openstack/tripleo-quickstart-extras/tree/master/roles/collect-logs16:25
chandankumarthere was a plan to move it to a seperate project but stalled due to other priorities16:25
chandankumarI will check with team tomorrow and let you know16:25
mnaserok cool, it might be pretty beneficial in our gates too16:26
mnaseri mean like, in all of openstack16:26
*** fdegir has quit IRC16:26
mnaserso, thoughts about office hours instead of meetings?16:28
jrosseri would be concerned that the bug triage gets even more out of hand - how would we handle that?16:28
*** jawad_axd has joined #openstack-ansible16:28
jrosserimho it's quite a good way of socialising whats broken and how folks are using our stuff16:28
odyssey4mewhat's the difference between the two?16:29
odyssey4me(office hours vs meetings)16:29
mnaserjrosser: office hours is just a time where we try to all be available to discuss things (rather than async reaching each other), without a specific agenda, just a time where we're all there16:29
chandankumaroffice hours ~= meeting without predefined agenda16:30
mnaserthe bug triage, i'm hoping that we can do some sort of bug smash thing every here and there.16:30
mnaserthe difficult part is that it ends up being 1 or 2 people doing most of the triage16:30
odyssey4mewell, we kinda have office hours daily during the crossover time between UK and US16:30
ThiagoCMCI finally have OSA/Rock up and running with Ceph! At least Glance and Cinder are working!  Wheee!16:31
openstackgerritMerged openstack/openstack-ansible stable/pike: Bump SHAs for stable/pike  https://review.openstack.org/63334816:31
mnaserThiagoCMC: w00t16:31
ThiagoCMCTrying to boot a VM now16:31
ThiagoCMCI'm so happy!16:31
ThiagoCMC:-D16:31
odyssey4meI would rather try and do a bug triage/fix team rotation than let it slip to happening once every so often.16:31
jrosserare we struggling for people to attend the meeting due to $dayjob pressure?16:31
* redrobot sneaks in through the back16:31
mnaserjrosser: i'm not sure.  i don't have much of an explanation. but i think it's largely a time constraint16:32
mnaseri think its late in EU timezone, and conflicts with a lot of other meeting timeslots16:32
mnaseri often see people mention they're inbetween meetings (and that's fine, i understand people need to get their jobs done), but yeah16:32
odyssey4meI unfortunately have two meetings at the same time today - this one and my internal team meeting.16:32
mnaserright, i'm all for keeping doing bug triage, but it ends up being a subset of folks that do it.  we can either look into a rotation, or maybe we can come up with another time where we have more resources/people to help do it16:34
*** chandankumar is now known as chkumar|out16:35
*** jawad_axd has quit IRC16:35
jrosserperhaps we should look at some bugs?16:36
mnaseranyhow, we can defer this to next week and see how this weeks bug triage goes :)16:36
mnaser#topic bug triage16:36
*** openstack changes topic to "bug triage (Meeting topic: openstack_ansible_meeting)"16:36
mnaser#link https://bugs.launchpad.net/openstack-ansible/+bug/181366016:36
openstackLaunchpad bug 1813660 in openstack-ansible "Upgrade from Pike to Queens skips setup-hosts when running neutron on bare metal" [Undecided,New] - Assigned to Bjoern Teipel (bjoern-teipel)16:36
mnaserlooks like that's already assigned16:37
*** tstrul has joined #openstack-ansible16:37
jrosserthere may even be a patch for that16:37
mnaseryeah, i'm trying to search under that name :p16:37
mnaserhttps://review.openstack.org/#/q/owner:%22Bjoern+Teipel+%253Cbjoern.teipel%2540rackspace.com%253E%22 i don't think so16:37
guilhermespworth to ask updates for that guy?16:38
*** tstrul has quit IRC16:38
prometheanfireguilhermesp: he's a coworker, should I bug him about something specific?16:38
prometheanfire#1813660 ?16:38
mnaseryep16:39
mnaseri mean16:39
mnaserreported 19 hours ago16:39
jrosserodyssey4me: didnt you have a patch for this?16:39
prometheanfireya, kinda recent16:39
mnaserok so i think we can mark this down as confirmed medium16:40
mnaserand we'll have a patch soon :)16:40
prometheanfireya, pinged him16:40
odyssey4mejrosser sort-f, I made it work better - then for master I fixed it properly16:41
mnaseroh, so fixed?16:41
odyssey4mehang a sec16:41
odyssey4methe issue there is pike->queens, right?16:41
guilhermespyep odyssey4me16:41
odyssey4meok, I think that bug is relating to the thing I fixed - yes, lemme provide a review16:42
*** TxGirlGeek has joined #openstack-ansible16:42
odyssey4mehmm: https://review.openstack.org/62589816:42
odyssey4methat was rocky - there was a reason I didn't port that back to queens... but I can't remember what that reason is16:43
odyssey4mein master I did a bunch more: https://review.openstack.org/62477316:44
mnaserso we can safely triage this and figure out fix later? :)16:44
odyssey4meyeah, it's valid and already set to medium16:45
odyssey4meI'll comment what's already in place for queens & master. Bjoern can then decide what to do about Pike.16:46
*** sdake has quit IRC16:46
evrardjpodyssey4me: for once you don't remember? :p16:46
mnaser#link https://bugs.launchpad.net/openstack-ansible/+bug/181330016:46
openstackLaunchpad bug 1813300 in openstack-ansible "NFS mount point for Glance is created with wrong permissions" [Undecided,New]16:46
evrardjpthat rings me a bell ... haven't we changed that already in the past?16:47
evrardjpbut there is a patch included!16:47
odyssey4meYeah - I feel that this one keeps coming up, and a new patch goes in, and then another one later... and so on.16:47
chkumar|outmnaser: if we have time I want to discuss about using https://trunk.rdoproject.org/centos7-master/delorean-deps.repo in OSa for installing dependencies not maintained around openstack ecosystem16:50
chkumar|outmnaser: I was checking the openstack-ansible-tests code on nodepool test file but no clue how to use it16:51
chkumar|outmnaser: http://codesearch.openstack.org/?q=delorean-deps.repo&i=nope&files=&repos=16:51
chkumar|outmnaser: it is used in POI and tripleo16:51
chkumar|outmnaser: can we use it here also?16:52
odyssey4mechkumar|out I think we already do?16:52
chkumar|outodyssey4me: we only use delorean.repo only16:52
odyssey4meoh, I see16:53
odyssey4mewould this repo be used in production at all?16:53
chkumar|outodyssey4me: https://github.com/openstack/openstack-ansible-tests/blob/401fc3d5cdef09f99470f20256c2ecd7e36925fa/common-tasks/test-set-nodepool-vars.yml#L4916:53
chkumar|outodyssey4me: in downstream, we import packages from same16:53
mnaserconfirmed/medium for the nfs bug, i asked Juri if it's possible to work with them to get them to push it to gerrit16:54
chkumar|outodyssey4me: it is maintained here https://github.com/redhat-openstack/rdoinfo/blob/master/deps.yml16:54
mnaserchkumar|out: i'd be in favour, using delorean deps was very helpful and made our gate usually quite stable in poi times (it also helped crossgate with rdo)16:55
chkumar|outmnaser: I need some pointers and I can make the changes in openstack-ansible-tests16:55
mnaserchkumar|out: we can discuss post meeting if you're not "out" :)16:55
*** hamzy has quit IRC16:56
chkumar|outmnaser: may be tomorrow, I can ping you in evening from my time zone16:56
mnaserchkumar|out: great!16:56
mnaserwe're running close to time, maybe we can get one more triage in16:57
mnaser#link https://bugs.launchpad.net/openstack-ansible/+bug/181318716:57
openstackLaunchpad bug 1813187 in openstack-ansible "CentOS tempest test_server_basic_ops failure" [Undecided,New]16:57
mnaseroh, that was resolved by the patch listed above16:57
mnaserdone16:58
mnaser#link https://bugs.launchpad.net/openstack-ansible/+bug/181314916:58
openstackLaunchpad bug 1813149 in openstack-ansible "Missing git respo: https://github.com/ceph/ansible-ceph-defaults" [Undecided,New]16:58
prometheanfirecjloader: ^?16:59
odyssey4meja, that's all fixed16:59
mnaserdid we release since16:59
mnaserlooks like 16.0.24 is the tag the user used16:59
cjloaderyes was fixed16:59
odyssey4meocata: https://review.openstack.org/632182 & pike: https://review.openstack.org/63214217:00
*** hamzy has joined #openstack-ansible17:00
mnasernice work cjloader17:00
odyssey4meno release based on that yet, I think evrardjp did the release requests earlier today17:00
mnasercool, ill update the bug17:00
mnaserok, we're over time, but it looks like we don't need any bug triage cause everything just works ;) haha.17:01
*** pcaruana has quit IRC17:01
mnaserthanks everyone, and please please take time to respond to the hackday ML post on openstack-discuss17:02
mnaser<317:02
mnaser#endmeeting17:02
*** openstack changes topic to "Launchpad: https://launchpad.net/openstack-ansible || Weekly Meetings: https://wiki.openstack.org/wiki/Meetings/openstack-ansible || Review Dashboard: http://bit.ly/2xA1eZC"17:02
openstackMeeting ended Tue Jan 29 17:02:07 2019 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)17:02
openstackMinutes:        http://eavesdrop.openstack.org/meetings/openstack_ansible_meeting/2019/openstack_ansible_meeting.2019-01-29-16.00.html17:02
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/openstack_ansible_meeting/2019/openstack_ansible_meeting.2019-01-29-16.00.txt17:02
openstackLog:            http://eavesdrop.openstack.org/meetings/openstack_ansible_meeting/2019/openstack_ansible_meeting.2019-01-29-16.00.log.html17:02
cjloadermnaser: https://review.openstack.org/#/c/632182/17:02
cjloaderhttps://review.openstack.org/#/c/632142/17:03
*** gyee has joined #openstack-ansible17:03
prometheanfirequestion while people are here, nothing sets up the main nginx.conf, so nothing tells nginx to point to a sites-enabled type of directory17:04
prometheanfireis there a better way of handling this?     https://review.openstack.org/#/c/633423/1/tasks/keystone_nginx.yml@6217:04
prometheanfirehttps://review.openstack.org/#/c/633423/1/files/nginx.conf17:04
mnaserprometheanfire: i've been thinking that we should just rip out nginx from there17:05
odyssey4memnaser keystone requires a web server for federation support17:05
mnaserodyssey4me: we fallback to apache2 when we do federation17:06
odyssey4meso we can't just do uwsgi17:06
prometheanfiremnaser: I'd be fine with that, it seems default though (it was installed in my gentoo testing)17:06
mnaserbut for some reason we do both nginx *and* uwsgi for non-federated deployments17:06
odyssey4meyeah, we implemented nginx because the keystone team recommended we do so17:06
odyssey4methe plan was to switch the apache config for federation over to nginx too, whenever someone had the time to figure out how17:06
mnaserah i see17:07
*** macza has joined #openstack-ansible17:07
prometheanfiregentoo stuuf doesn't support federation yet, I'd have to package some things I think17:07
odyssey4mehowever, it seems that RDO still does apache/mod_wsgi - and a lot of openstack docs still help people config that way, so our config is confusing to many... I find myself wondering whether we shouldn't just confirm to what everyone else does as a default17:08
odyssey4me(even if it is a bit crappy)17:08
prometheanfiremeh (no strong opinion)17:10
jrosserthis is needed to make the tempest vm ssh stuff even more robust https://review.openstack.org/#/c/633728/17:11
*** hamzy has quit IRC17:11
* prometheanfire is testing if the same problem exists with apache17:12
prometheanfirejrosser: how does limiting it help?17:13
jrosserbecasue we now assign IPs to bridges on the containers in that subnet17:13
jrosserand neutron needs to know to not try to use one of those for an instance IP17:14
prometheanfireah, and could get a conflict17:14
jrosserif thats not clear enough perhaps we should reference the commit which consumes some of those IP in the commit msg?17:15
chkumar|outjrosser: regarding today's debugging we need to add some validation also17:16
chkumar|outjrosser: if something fails we should have some data to verify those stuffs17:16
jrosserwhat did you have in mind?17:17
*** sdake has joined #openstack-ansible17:17
chkumar|outjrosser: I will propose a patch tomorrow to test ping stiff from container17:17
jrosseri guess once the tempest resources are created you would expect to be able to ping the router17:17
jrossereven before the tests are run17:18
*** sdake has quit IRC17:18
prometheanfirejrosser: hardcoding the fix makes it more likely to fail in the future (though it's still useful)17:18
jrosserwell it's not a fix17:18
*** kopecmartin is now known as kopecmartin|off17:24
*** ThiagoCMC has quit IRC17:28
openstackgerritMerged openstack/openstack-ansible-os_tempest master: Add an ip address to eth12 in OSA test containers  https://review.openstack.org/63373217:31
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Update cirros from 3.5 to 3.6  https://review.openstack.org/63320817:34
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Added dependencies of os_tempest role  https://review.openstack.org/63272617:34
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Always generate stackviz irrespective of tests pass or fail  https://review.openstack.org/63196717:34
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Use tempest_cloud_name in tempestconf  https://review.openstack.org/63170817:34
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Added tempest.conf for heat_plugin  https://review.openstack.org/63202117:35
openstackgerritJonathan Rosser proposed openstack/openstack-ansible-os_tempest master: Add telemetry distro plugin install for aodh  https://review.openstack.org/63212517:36
jrosserodyssey4me: which way would you prefer this to go? do it in nova role test vars or in tempest role?17:39
jrosserhttps://review.openstack.org/#/c/633677/17:39
odyssey4mejrosser tempest role, I think - then it's universally applied17:42
odyssey4mejrosser rather than override a default - fix the default17:42
jrosserok, i'll fix that up17:43
odyssey4megreat, thanks17:43
*** sdake has joined #openstack-ansible17:44
openstackgerritJonathan Rosser proposed openstack/openstack-ansible-os_tempest master: Disable nova_lxd tempest plugin  https://review.openstack.org/63371117:57
openstackgerritMerged openstack/openstack-ansible-tests master: Set a defined IP address range for tempest test public addresses  https://review.openstack.org/63372817:58
openstackgerritJonathan Rosser proposed openstack/openstack-ansible-os_tempest master: Disable nova-lxd tempest plugin  https://review.openstack.org/63371117:58
*** sdake has quit IRC18:10
*** cmart has quit IRC18:19
*** electrofelix has quit IRC18:34
*** sdake has joined #openstack-ansible18:37
openstackgerritJacob Wagner proposed openstack/openstack-ansible-ops master: Add ability to deploy Designate (DNSaaS)  https://review.openstack.org/63380118:50
*** hamzy has joined #openstack-ansible18:52
*** hamzaachi has joined #openstack-ansible18:59
*** exbob has quit IRC19:01
*** cmart has joined #openstack-ansible19:08
*** strattao has quit IRC19:32
ioniquick questions, i hope that's not a stupid one but i'm curious19:36
ioniwhat's in openstack-ansible-ops?19:36
ioniand MNAIO ?19:37
*** ztr has quit IRC19:37
jamesdentonMNAIO is a 'multi-node all-in-one' deploy. Basically, a set of scripts that deploys OSA in a set of VMs. infra, compute, storage, etc19:57
*** ThiagoCMC has joined #openstack-ansible19:57
jamesdentonopenstack-ansible-ops is the kitchen sink repo19:58
ThiagoCMCGuys, I just finished a fresh OSA/Rocky deployment on top of Ubuntu 18.04 (all hosts deployed via MaaS) and almost everything is working! Except Heat.19:58
ThiagoCMCThe command `openstack stack list` returns: ERROR: Internal Error19:58
jamesdentonsounds like everything is good then. lol19:58
ThiagoCMCWith --debug: http://paste.openstack.org/show/744195/19:59
ionijamesdenton, thanks19:59
ThiagoCMCMaybe it is broken due to this: https://github.com/openstack/openstack-ansible-os_heat/commit/785fcfd33d29ddfee54f09cd6bf126990d64e4dd ?19:59
jamesdentonThiagoCMC you may want to check the heat api logs to see the traceback20:00
ThiagoCMCok20:00
jamesdentondoes openrc point to the internal or public uri for heat?20:01
*** cmart has quit IRC20:01
ThiagoCMC"OS_AUTH_URL=http://172.29.239.250:5000/v3" internal (br-mgmt20:02
*** cmart has joined #openstack-ansible20:02
ThiagoCMCSorry, you said "for heat"? It's all "internalURL"20:03
jamesdentonk20:05
ThiagoCMCNo errors on heat-api logs, only this: http://paste.openstack.org/show/744197/ - everytime that I run `openstack stack list`20:07
ThiagoCMCHere is ful `openstack stack list --debug` output: http://paste.openstack.org/show/744198/20:10
jamesdentonhmm, i'm sure. I'm about to head out, but may be worth filing a bug and hopefully someone can look at that if you don't figure it out beforehand20:12
jamesdenton*not sure, rather20:12
jamesdentonthe only thing i see is its hitting http url and complaining of an sslerror, so that needs to be looked at20:13
ThiagoCMCI see, thanks anyway!20:14
ThiagoCMCI'll chat with admin0 to see if he can help  =)20:14
*** fdegir_ has joined #openstack-ansible20:14
jamesdentoncool. see ya20:14
ThiagoCMCSee U!20:15
*** cmart has quit IRC20:31
*** strattao has joined #openstack-ansible20:40
*** strattao has quit IRC20:44
*** SimAloo has quit IRC20:45
*** sdake has quit IRC20:45
*** cmart has joined #openstack-ansible20:48
*** fdegir_ is now known as fdegir20:55
*** SimAloo has joined #openstack-ansible20:58
openstackgerritMerged openstack/openstack-ansible-os_tempest master: Adds tempest run command with --test-list option  https://review.openstack.org/63135120:59
*** DanyC has quit IRC21:04
*** DanyC has joined #openstack-ansible21:04
*** DanyC has quit IRC21:08
ThiagoCMCGuys, I'm seeing the following haproxy.log entry: "keystone_service-front-1/1: SSL handshake failure", any idea to where start looking?21:31
ThiagoCMCIt's on my third controller (the one with the VIPs)21:31
*** ansmith has quit IRC21:38
*** hamzy has quit IRC21:42
jrosserThiagoCMC: if it were me i'd start with very simple tools, like wget, from a host that is nothing to do with your openstack deploy but has connectivity to the external vip21:48
jrossertry wget https://<external endpoint>:500021:49
ThiagoCMCTrying it now...21:52
ThiagoCMCThe `wget --no-check-certificate https://172.29.235.250:5000/` just worked...22:05
ThiagoCMCThat's my br-public subnet IP22:05
ThiagoCMCThe SSL handshake problem (haproxy.log line message) is always close/before to the heat_api thing.22:07
ThiagoCMCAnd my `openstack stack list` is returning Error 50022:07
ThiagoCMCI believe that there is a bug on stable OSA/Rocky branch. Where heat_api is trying to talk clear text against a https endpoint.22:10
ThiagoCMCThis SSL handshake problem and Heat Error 500 might be related, because of this: http://paste.openstack.org/show/744211/22:11
ThiagoCMCAlways the two lines together.22:11
*** sdake has joined #openstack-ansible22:15
jrosserThiagoCMC: Jan 29 22:08:18 localhost haproxy[13236]: 10.0.3.41:54086......22:25
jrosseri don't like that, it suggests that eth0 on a container has been used to contact the external keystone endpoint, which just feels wrong22:26
jrossermnaser: are you sure about this patch? https://github.com/openstack/openstack-ansible-os_heat/commit/785fcfd33d29ddfee54f09cd6bf126990d64e4dd22:27
mnaseryep, why's that jrosser ?22:28
jrosseri'm concerned that there is an assumption that the mgmt network can talk to the external endpoint, which isn't necessarily the case22:28
mnaserso all those urls i actually provided are ones which are presented to the user22:28
mnaserbut not used for auth22:28
mnaserfor example, www_authenticate_uri is just exposed in the headers etc22:28
*** nurdie has quit IRC22:28
jrosserwhat about this http://paste.openstack.org/show/744211/22:28
jrosserwhere the external vip is hit from a 10.x address which looks like a container eth022:29
jrosseri.e mgmt net traffic to external endpoint must take the default rout22:29
jrossere22:29
openstackgerritMerged openstack/openstack-ansible-os_tempest master: Enable port security  https://review.openstack.org/61771922:29
mnaseri think this introduces another issue22:29
mnaserheat-agent actually uses that url to give to vms22:30
jrossersomehow there the wires look crossed between internal mgmt net and external22:30
mnaseros-cloud-config or whatever22:30
mnaserno, not os-cloud-config, grr22:30
jrosserlate here, i need to stop, but i think theres a few folks having trouble with heat22:31
jrosserand it just all has a bit of a smell of internal/external networks getting mixed up22:32
*** TxGirlGeek has quit IRC22:32
*** gyee has quit IRC22:35
ThiagoCMCAHA!22:36
*** slaweq has quit IRC22:36
ThiagoCMCjrosser, mnaser I just reverted https://github.com/openstack/openstack-ansible-os_heat/commit/785fcfd33d29ddfee54f09cd6bf126990d64e4dd and executed os-heat-install.yml, no more Erro 500! `openstack stack list` is finally working!22:38
mnaseryikes, that breaks magnum though22:38
mnaserThiagoCMC: can you try reverting the values one by one and seeing which one breaks?22:38
ThiagoCMCyes22:39
ThiagoCMCIt will take a few minutes to try again, waiting for ansible to finish...22:43
ThiagoCMCAlso, no more SSL handshake error message!   ;-)22:43
ThiagoCMCmnaser, I reverted only line 46, under [clients_keystone], auth_uri.22:58
ThiagoCMCThe others still points to the public one.22:58
ThiagoCMCI think that the bug is with clients_keystone that tries to talk in clear text over a https connection.22:59
*** SimAloo has quit IRC23:03
-openstackstatus- NOTICE: http://zuul.openstack.org is not working. https://zuul.openstack.org does work. Please use that while we investigate.23:12
*** radeks_ has quit IRC23:15
*** TxGirlGeek has joined #openstack-ansible23:28
*** TxGirlGeek has quit IRC23:29
*** TxGirlGe_ has joined #openstack-ansible23:29
*** cmart has quit IRC23:34
*** sdake has quit IRC23:35
*** sdake has joined #openstack-ansible23:37
*** cmart has joined #openstack-ansible23:43
*** errr_ has joined #openstack-ansible23:53
*** sdake has quit IRC23:55
*** sdake has joined #openstack-ansible23:55
*** errr has quit IRC23:56
*** hamzaachi_ has joined #openstack-ansible23:58

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!