Monday, 2017-08-07

*** thorst has joined #openstack-ansible00:08
*** thorst has quit IRC00:12
*** yifei has joined #openstack-ansible00:24
*** sdake_ has quit IRC00:32
*** sdake_ has joined #openstack-ansible00:32
*** hw_wutianwei has joined #openstack-ansible00:43
*** charcol has joined #openstack-ansible00:57
*** markvoelker has joined #openstack-ansible01:15
*** woodard has joined #openstack-ansible01:20
*** woodard has quit IRC01:25
*** charcol has quit IRC01:30
*** charcol has joined #openstack-ansible01:39
*** markvoelker has quit IRC02:02
*** markvoelker has joined #openstack-ansible02:02
*** thorst has joined #openstack-ansible02:13
openstackgerritzhongshengping proposed openstack/openstack-ansible-os_sahara master: Add ceph_client role  https://review.openstack.org/48500002:15
*** charcol has quit IRC02:17
*** thorst has quit IRC02:18
*** woodard has joined #openstack-ansible02:21
*** woodard has quit IRC02:25
*** markvoelker has quit IRC02:34
*** thorst has joined #openstack-ansible03:14
*** thorst has quit IRC03:19
*** gkadam has joined #openstack-ansible03:32
*** hybridpollo has quit IRC03:46
*** charcol has joined #openstack-ansible03:49
*** udesale has joined #openstack-ansible04:06
*** udesale has quit IRC04:15
*** udesale has joined #openstack-ansible04:18
*** woodard has joined #openstack-ansible04:22
*** woodard has quit IRC04:27
*** udesale has quit IRC04:40
*** udesale has joined #openstack-ansible04:41
*** zerda3 has joined #openstack-ansible04:44
*** zerda3 has quit IRC04:57
*** thorst has joined #openstack-ansible05:15
*** thorst has quit IRC05:20
*** Oku_OS-away is now known as Oku_OS06:09
*** tobberydberg has joined #openstack-ansible06:10
*** drifterza has joined #openstack-ansible06:23
*** woodard has joined #openstack-ansible06:24
*** woodard has quit IRC06:28
*** perniciouscaffei has quit IRC06:34
*** jvidal has joined #openstack-ansible06:35
*** udesale__ has joined #openstack-ansible06:41
*** cshen has joined #openstack-ansible06:42
*** cshen has quit IRC06:42
*** udesale has quit IRC06:43
*** udesale__ has quit IRC06:48
*** udesale has joined #openstack-ansible06:49
*** flaviosr has quit IRC06:52
*** flaviosr has joined #openstack-ansible06:52
*** armaan has joined #openstack-ansible07:01
*** charcol has quit IRC07:02
*** armaan has quit IRC07:02
*** armaan has joined #openstack-ansible07:03
*** shardy has joined #openstack-ansible07:07
*** mbuil has joined #openstack-ansible07:15
*** thorst has joined #openstack-ansible07:16
*** markus_z has joined #openstack-ansible07:18
*** yolanda has joined #openstack-ansible07:20
*** thorst has quit IRC07:21
*** udesale__ has joined #openstack-ansible07:22
*** udesale has quit IRC07:23
*** mpranjic has quit IRC07:23
*** woodard has joined #openstack-ansible07:25
*** lvdombrkr has joined #openstack-ansible07:25
*** mpranjic has joined #openstack-ansible07:28
*** woodard has quit IRC07:29
mpranjicmorning07:30
*** udesale__ has quit IRC07:42
*** udesale has joined #openstack-ansible07:42
*** xdfil has quit IRC07:45
lvdombrkrmorning, i have question about openstack-ansible re-deployment... if after deployment i want change some data in user_variables.yml what i need to do after data changed? depploy all infrostructure for begining or what?07:47
andymccrlvdombrkr: it depends on what you have changed - for example if you just want to change the nova_memory_allocation_ratio thats just a conf change and service restart within nova, so you could rerun the nova playbook only, or re-run it with the "nova-config" ansible tag to only adjust the config07:50
andymccrin general you shoudlnt need to redeploy unless you are changing some architectural that impacts everything (but then i'd question whether that change is required!) - saying that sometimes a redeploy is easier when you havent even got the deploy up and running, or if its just a test AIO and you are changing something really large.07:57
*** jafeha has joined #openstack-ansible08:08
evrardjpgood morning08:12
eoseqmorning, do you know how to disable setting "physical_interface_mappings" on a compute node and have it only on "neutron_l3_agent" group?08:13
andymccreoseq: what are you trying to do by changing that?08:17
*** markvoelker has joined #openstack-ansible08:17
evrardjpyeah, what lead you to do that, eoseq?08:19
*** markvoelker_ has joined #openstack-ansible08:19
eoseqI have such settings: http://paste.openstack.org/show/617643/08:21
eoseqso after ansibling I got "physical_interface_mappings = routers01:ethv22" in /etc/neutron/plugins/ml2/linuxbridge_agent.ini08:22
eoseqon compute nodes, but I want it to have it only on the l308:23
*** markvoelker has quit IRC08:23
eoseqon the compute node I get: ERROR neutron.plugins.ml2.drivers.linuxbridge.agent.linuxbridge_neutron_agent [-] Interface ethv22 for physical network routers01 does not exist. Agent terminated!08:24
evrardjpeoseq: our openstack_user_config already helps you by setting up the container interfaces, and their carrying interface on the host08:25
evrardjpyou shouldn't need this user_variables.08:25
*** woodard has joined #openstack-ansible08:25
eoseqevrardjp: I can't find how to set network_mappings: "routers01:ethv22" in openstack_user_config.yml08:26
evrardjphave a look at the host bind overrides too08:26
eoseqevrardjp: ok, so you mean that host overrides it the best/only way?08:26
eoseqis08:26
evrardjpno i meant global_overrides with provider_networks are good enough08:27
evrardjpsee for example: https://docs.openstack.org/project-deploy-guide/openstack-ansible/ocata/app-config-test.html#test-environment-config08:27
*** woodard has quit IRC08:30
openstackgerritManuel Buil proposed openstack/openstack-ansible-os_tacker master: Fill the tests directory  https://review.openstack.org/49044008:30
eoseqevrardjp: ok, so what I don't have now is the idea what variable should I put into that "global_overrides/providers_network/network" path to get it converted to "physical_interface_mappings" in the "neutron_l3_agent" group08:32
evrardjplet's take a step back again.08:33
evrardjpI know that's what you are trying to achieve, but why?08:33
evrardjpIsn't what ansible configures for you correct?08:34
evrardjpdo you have one specific host that's different? Why is it different? Is that a good long term approach?08:34
evrardjpeoseq: because ansible will generate the physical_interface_mappings for you08:36
evrardjplet me show you08:36
evrardjpthat is where we run neutron and gether provider networks data beforehand: https://github.com/openstack/openstack-ansible/blob/master/playbooks/common-playbooks/neutron.yml#L69-L8808:39
evrardjpand08:39
evrardjphttps://github.com/openstack/openstack-ansible-os_neutron/blob/a5357ecac5d25e9623e2dbf6fc2130454160cb41/templates/plugins/ml2/linuxbridge_agent.ini.j208:39
evrardjpis where we generate the physical_interface_mappings for ml2 lxb for those neutron_provider_networks08:39
evrardjpeoseq: with that in mind, global_overrides probably give you what you need.08:41
eoseqso it's configured on every linuxbridge agent08:41
evrardjpyeah, ovs do the same IIRC08:42
evrardjpbut you are LXB right?08:42
eoseqyes, LXB08:42
evrardjpok08:42
evrardjpso?08:42
evrardjpThere is something I still don't understand :D08:42
eoseqso I have to have that "external/public" bridge on every compute node08:43
evrardjpoh ok now I understand your problem08:44
eoseqbut I think I need that only on l3 agents08:44
eoseqso i.e. infra hosts08:44
evrardjpso how do you intend to communicate between your compute node and your l3 agent on this network?08:45
eoseqvxlan?08:45
evrardjpok08:45
evrardjpso08:45
evrardjpwhy don't you try to change the group_binds for the flat/br-vlan interfaces?08:46
evrardjpI have never tried myself, but that's what I'd try to do.08:47
evrardjpWe should probably document this08:47
evrardjpfor now we require br-vlan to be defined everywhere though: https://docs.openstack.org/project-deploy-guide/openstack-ansible/ocata/overview-network-arch.html08:48
eoseqevrardjp: thx, I'll try to play with a little bit more and will let you know :-)08:50
eoseqthe idea is to have br-vxlan on compute nodes and route it to flat network on l3 agent nodes08:51
evrardjpyou're heading down an undocumented path, but I understand why. We should technically be able to do it. If not, we should implement it.08:52
evrardjpyeah I guess you want to have a vtep in neutron agent container, and from then on speak to outside world08:52
evrardjpit makes sense.08:52
evrardjpdouble check your flat is well wired too on your neutron agents containers.08:53
eoseqso as for now that only way is to override infra host setting to keep it empty on compute nodes? :-)08:53
eoseqworks like a charm when I delete that08:54
eoseq"physical_interface_mappings" on a compute node08:54
eoseqevrardjp: thanks a lot for trying to understand our case :-)08:55
yifeiHi all, I want to create the LXCs locally (using a local image). How can I achieve this in OSA? Thanks08:59
yifeior is there anyone can help with building a lxc image server? Thanks09:00
evrardjplxc doesn't have a notion of "image server" itself yifei09:01
evrardjplxc has a template that downloads from an external source though09:01
evrardjpyifei: what are you trying to achieve?09:01
evrardjpif you want to have an webserver that holds your images, and point to it, it's possible.09:02
yifeievrardjp: but i found that "lxc_container_download_template_options" indicates the --server {{ lxc_image_cache_server }}09:02
evrardjpyeah the download template will automatically fetch from ubuntu server, but you can use this variable to bypass it indeed, and use your own server.09:03
evrardjpadvanced topic :p09:03
yifeievrardjp: yes I have. I have build all the deb and pip packages in the local docker. I would use it as a local source, so i can deploy osa offline09:03
openstackgerritManuel Buil proposed openstack/openstack-ansible-os_tacker master: Fill the tests directory  https://review.openstack.org/49044009:04
yifeievrardjp: how to build a lxc image server?09:04
yifeievrardjp: i have googled it, but no guide09:05
yifeievrardjp: sorry, got something now. Will try. Thanks for your help09:07
evrardjpyifei: sorry I was busy09:11
yifeievrardjp: got it now, thank you again, hah09:11
evrardjpok09:11
evrardjpwell, don't hesitate to share how you did it. We probably did it differently, so it's maybe better to come with one single approach in the future.09:12
odyssey4meyifei the lxc server has a very simple API to serve images, you can reverse engineer it by sniffing the traffic when creating instances using the default tooling09:12
odyssey4mewe will very likely be doing some work in Queens to move towards an approach where everything is built, then deployed rather than the current approach09:13
odyssey4meyifei if you'd like to see how we've put together artifacts for newton/ocata so far, feel free to inspect https://github.com/rcbops/rpc-openstack/tree/master/scripts/artifacts-building09:14
yifeiodyssey4me: I just found a command : lxc image copy09:14
yifeiodyssey4me: use it to copy the image to my webserver and configure it in lxc-create09:15
evrardjpyifei: pay attention to lxc command!09:15
evrardjplxc command is part of lxd tooling.09:16
evrardjpand lxd requires different metadata.09:16
evrardjpif you want pure lxc metadata, check at what odyssey4me pasted you.09:16
yifeiodyssey4me: wow it's great is there any doc or readme about it ?09:16
yifeievrardjp: currently just use lxc09:17
*** thorst has joined #openstack-ansible09:17
evrardjpyifei: we don't have docs at the moment.09:18
evrardjpfor that09:18
odyssey4meyifei the code is fairly well documented, but as that's a short term hack it is as it is09:19
yifeiodyssey4me: found the readme, checking its detail now09:19
yifeiodyssey4me: I took a lot time to build a webserver in docker for offline deployment of OSA.....09:20
yifeiodyssey4me: i am nearly crazy now......09:20
odyssey4meyes, an offline build is not simple at this time - it'd be ideal, though, to document everything needed so that we can share and improve it09:21
yifeiodyssey4me: how it could be good it if I asked it here eralier09:21
*** thorst has quit IRC09:22
evrardjpyeah :)09:24
yifeiodyssey4me: I wrote some scripts to scan the distro and pip packages in every role and used docker to download them and then used the packages to build a PPA and a pip repo.....09:25
odyssey4meyifei we already have a lookup that does that scan - you didb't need to write anything ;)09:25
evrardjp...09:26
odyssey4mealso, the pip repo is built as part of the normal deployment - so you could have just used that09:26
odyssey4meand the repo also contains a cache, so that could have perhaps been captured and used09:26
yifeiodyssey4me: i did not know that.. I found it in OSA wiki but did not found it, so i wrote them myself09:26
*** woodard has joined #openstack-ansible09:26
evrardjpOSA wiki?09:27
yifeievrardjp: openstack docs about osa09:27
yifeihttps://docs.openstack.org/openstack-ansible/latest/09:28
evrardjpoh the docs.09:28
evrardjpok09:28
evrardjpyeah09:28
evrardjpoffline install to be improved next cycle.09:28
*** fxpester has joined #openstack-ansible09:29
*** stuartgr has joined #openstack-ansible09:30
*** woodard has quit IRC09:31
andymccrmake it so evrardjp. make it so!09:37
evrardjpyeah. I'm looking forward to it :D09:39
*** jafeha has quit IRC09:46
*** charcol has joined #openstack-ansible09:49
*** chhavi has joined #openstack-ansible09:51
mbuilguys, I need a bit of help. I am trying to fix the tests executed in the tacker role using Vagrant. Things are deployed successfully but the tests fails. How can I run the tests manually? I am inside my "infra1" container of the Vagrant VM but I cannot find any ansible command or my tacker role repo with the playbooks :(10:03
*** thorst has joined #openstack-ansible10:05
openstackgerritAndy McCrae proposed openstack/openstack-ansible-os_sahara master: Implement uWSGI for sahara-api  https://review.openstack.org/49142110:06
*** charcol has quit IRC10:08
andymccrmbuil: the test is just doing a curl on http://localhost:8888 from inside the tacker containers10:08
andymccrmbuil: you could try that manually and see what response you get (should be a 300 based on the test you setup)10:08
mbuilandymccr: yes, but can I just run that test?10:09
mbuilandymccr: I mean, can I run the playbook "test-tacker-functional.yml"?10:09
andymccrmbuil: sure an easy way would be to edit the tests/test.yml and comment out all the includes except the test-tacker-functional.yml and then do a ./run_tests.sh10:10
*** thorst has quit IRC10:11
mbuilandymccr: ok. But there is no way to do "ansible-playbook test-tacker-functional.yml" inside the infra1 container or the tacker container?10:11
*** deadnull has joined #openstack-ansible10:11
openstackgerritManuel Buil proposed openstack/openstack-ansible-os_tacker master: Fill the tests directory  https://review.openstack.org/49044010:13
*** deadnull has quit IRC10:13
andymccrmbuil: wouldnt be inside the infra1 container, since thats where keystone/rabbit etc are deployed.10:15
andymccrthe tests/files etc would exist outside of the infra1 container10:15
mbuilandymccr: ok10:15
*** yifei has quit IRC10:17
*** lvdombrkr has quit IRC10:19
*** lvdombrkr has joined #openstack-ansible10:19
lvdombrkrguys, last weeek i created and lounch first instance without any problems today when i tried tou lounch istance i get error: Message10:27
lvdombrkr    No valid host was found. There are not enough hosts available.10:27
*** woodard has joined #openstack-ansible10:27
lvdombrkrthere is no problems with resources on node10:27
*** woodard has quit IRC10:31
andymccrlvdombrkr: check your nova logs - i think those logs go in the scheduler or the api-os-compute log.10:34
andymccrlvdombrkr: it should tell you why its filtering out the host10:34
*** cshen has joined #openstack-ansible10:35
lvdombrkrandymccr: nova_scheduler node log http://paste.openstack.org/raw/617667/10:41
*** pbandark has joined #openstack-ansible10:42
andymccrlvdombrkr: you'll need to learn how to debug the logs :) but that one looks like AggregateDiskFilter: (start: 1, end: 0)'] so something to do with the disk space on the compute host10:42
lvdombrkrandymccr: nova_conductor logs: http://paste.openstack.org/raw/617669/10:45
andymccrlvdombrkr: the scheduler filters hosts out based on criteria - https://docs.openstack.org/nova/latest/user/filter-scheduler.html10:47
mbuilandymccr: finally it worked :). Thanks for your suggestion ==> https://review.openstack.org/#/c/490440/10:47
andymccrmbuil: no problem!10:47
andymccrmbuil: makes sense that its a 200 instead of a 300 i guess :)10:47
mbuilandymccr: yes :)10:47
*** markvoelker has joined #openstack-ansible10:49
*** markvoelker_ has quit IRC10:53
*** markvoelker_ has joined #openstack-ansible10:54
*** markvoelker has quit IRC10:54
*** markvoelker has joined #openstack-ansible10:57
*** schwicht has joined #openstack-ansible10:58
*** markvoelker has quit IRC10:59
*** markvoelker_ has quit IRC11:00
*** markvoelker has joined #openstack-ansible11:01
*** strigazi_OFF is now known as strigazi11:05
*** markvoelker_ has joined #openstack-ansible11:12
*** markvoelker has quit IRC11:15
lvdombrkrandymccr:thank you, but looks there is no problem with disk space11:20
andymccrlvdombrkr: you'll need to check why that filter is rejecting the host then - perhaps you have some builds in error that are counting towards reserved space on the host11:21
andymccror some other issue along those lines11:21
*** markvoelker_ has quit IRC11:28
*** thorst has joined #openstack-ansible11:28
*** drifterza has left #openstack-ansible11:29
*** markvoelker has joined #openstack-ansible11:29
*** drifterza has joined #openstack-ansible11:29
*** drifterza has left #openstack-ansible11:30
*** drifterza has joined #openstack-ansible11:32
*** thorst has quit IRC11:33
*** markvoelker has quit IRC11:34
openstackgerritAndy McCrae proposed openstack/openstack-ansible-os_ironic master: Implement uWSGI for ironic-api  https://review.openstack.org/49145411:34
eoseqevrardjp: if there is no override for linuxbridge_agent.ini.j2 here: https://docs.openstack.org/project-deploy-guide/openstack-ansible/ocata/app-advanced-config-override.html#currently-available-overrides the only way is to modify the template?11:35
*** markvoelker has joined #openstack-ansible11:40
*** markvoelker_ has joined #openstack-ansible11:41
*** schwicht has quit IRC11:42
*** schwicht has joined #openstack-ansible11:42
*** markvoelker_ has quit IRC11:42
*** markvoelker_ has joined #openstack-ansible11:44
*** markvoelker has quit IRC11:44
odyssey4meeoseq that list is likely out of date, every ini file has an override11:44
eoseqodyssey4me: great, thx! :-)11:45
odyssey4mecheck the defaults for the role to find the current list11:45
odyssey4mewe should probably remove that list from the docs and tell people to do that11:45
eoseqneutron_linuxbridge_agent_init_overrides: {}11:45
odyssey4meif you could push a patch for that, or register a bug for it, that's be great11:45
*** markvoelker has joined #openstack-ansible11:47
eoseqodyssey4me: ok, I'll submit a bug11:47
*** pbandark1 has joined #openstack-ansible11:49
*** schwicht has quit IRC11:49
*** markvoelker_ has quit IRC11:50
lvdombrkrandymccr: thank you, can i try just reboot nova containers, or its senseless?11:50
*** markvoelker_ has joined #openstack-ansible11:51
*** holmsten has joined #openstack-ansible11:51
*** pbandark has quit IRC11:51
*** pbandark1 is now known as pbandark11:51
*** shardy is now known as shardy_lunch11:51
*** markvoelker has quit IRC11:52
*** markvoelker has joined #openstack-ansible11:53
*** markvoelker_ has quit IRC11:56
*** markvoelker_ has joined #openstack-ansible11:56
andymccrlvdombrkr: well it'd be the compute service on the host - but yes probably, you need to figure out why nova thinks you dont have enough space11:57
*** cshen has quit IRC11:59
*** markvoelker has quit IRC12:00
*** markvoelker has joined #openstack-ansible12:02
*** cshen has joined #openstack-ansible12:03
*** markvoelker_ has quit IRC12:03
*** thorst has joined #openstack-ansible12:05
jvidalhello12:08
jvidalI am getting failures on vagrant tests, and I saw some gate checks with the same failure12:09
jvidaltempest does not run due to missing dependencies12:09
jvidalis anybody else having the same errors?12:09
*** thorst has quit IRC12:09
jvidalhttp://logs.openstack.org/43/490543/3/check/gate-openstack-ansible-os_neutron-ansible-func-ubuntu-xenial/b16350e/console.html.gz12:10
jvidalexample of the failure ^12:10
*** mgariepy has quit IRC12:10
*** thorst has joined #openstack-ansible12:10
*** cshen has quit IRC12:12
*** gkadam has quit IRC12:22
lvdombrkrandymccr: in api_placement container in logs i found : 2017/08/07 15:21:42 [info] 136#0: *82 recv() failed (104: Connection reset by peer) while sending response to client, client: 172.16.84.124, server: , request: "HEAD / HTTP/1.0"12:24
lvdombrkrcan it be releated with my issue?12:24
*** cshen has joined #openstack-ansible12:24
*** randy^^ has joined #openstack-ansible12:26
*** prometheanfire has quit IRC12:27
randy^^hello/good afternoon.12:27
*** woodard has joined #openstack-ansible12:29
*** schwicht has joined #openstack-ansible12:32
*** woodard has quit IRC12:33
*** cshen has quit IRC12:35
*** schwicht has quit IRC12:39
*** mgariepy has joined #openstack-ansible12:42
*** hw_wutianwei has quit IRC12:44
mgariepygood morning everyone12:44
*** schwicht has joined #openstack-ansible12:47
*** shardy_lunch is now known as shardy12:52
evrardjphello mgariepy!12:54
mgariepywhat's up ?12:55
*** esberglu has joined #openstack-ansible13:03
*** schwicht has quit IRC13:04
cloudnullmornings13:10
*** smatzek has joined #openstack-ansible13:11
*** lucasxu has joined #openstack-ansible13:12
mgariepyodyssey4me, are you around ?13:16
odyssey4meo/ mgariepy cloudnull13:16
mgariepyso did you had time to dig out repo server issues ?13:17
odyssey4memgariepy I haven't been able to do any further testing. I think someone had a review up though.13:17
*** woodard has joined #openstack-ansible13:18
odyssey4memgariepy here it is https://review.openstack.org/48918813:19
*** woodard has quit IRC13:19
*** woodard has joined #openstack-ansible13:19
*** schwicht has joined #openstack-ansible13:20
mgariepyok great, i'll try to dig that out then.13:21
cloudnullare we seeing timeouts in general within the integrated gate?13:21
odyssey4mecloudnull yep, and even for roles tests13:24
odyssey4mewithout OSIC, it seems we're quite stuck13:24
cloudnull:'(13:25
openstackgerritAndy McCrae proposed openstack/openstack-ansible-os_sahara master: Implement uWSGI for sahara-api  https://review.openstack.org/49142113:25
openstackgerritJesse Pretorius (odyssey4me) proposed openstack/openstack-ansible master: Add subunit output to ARA reporting  https://review.openstack.org/49148913:27
*** drifterza has quit IRC13:29
openstackgerritAndy McCrae proposed openstack/openstack-ansible-os_ironic master: Implement uWSGI for ironic-api  https://review.openstack.org/49145413:30
mhaydendmsimard: yeah, i was noticing some slow gates w/o osic :/13:34
mhaydenhappy monday13:35
cloudnullI wonder if that is something being generally felt todya13:35
cloudnullor if it's just us13:35
cloudnull^ maybe we could/should reintroduce the hyperconverge things to try push runtimes back down ?13:35
*** lbragstad has joined #openstack-ansible13:36
dmsimardFYI I made a thing http://grafana.openstack.org/dashboard/db/openstack-ansible13:38
dmsimardBut I'm not sure it really works the way I think it was supposed to13:38
odyssey4mecloudnull I hate to inform you that the revert still hasn't merged, so your changes are still in place: https://review.openstack.org/47182213:38
* mhayden was wrestling with swift until late friday :/13:39
odyssey4mewe would have to consolidate even more, and considering that the role tests are all converged onto a minimal set of containers, I don't think it'll help all that much13:39
logan-http://grafana.openstack.org/dashboard/db/openstack-ansible?panelId=2&fullscreen&from=1499521222557&to=1502113222557 nice last 30 days view of gate timings13:40
odyssey4methe information is a little weird due to the lack of periodics13:42
odyssey4meif we implemented per-repo periodics then that might be a little more consistent and useful13:43
odyssey4mebut, I suppose we could build a dashboard for the periodic tests - I wasn't aware that this was available13:43
logan-it seems like there should be a lot more data points on that13:43
logan-maybe it is averaged per-day13:44
dmsimardodyssey4me: yeah I submitted those graphs just to try and experiment with the available data13:51
dmsimardI'm not sure why it's not picking up the centos jobs13:51
dmsimardsurely something silly in https://git.openstack.org/cgit/openstack-infra/project-config/tree/grafana/openstack-ansible.yaml13:52
andymccr+2 to the revert patch!13:55
andymccrmhayden: yeah - mines timing out :( even with the fix to ensure the erasure code stuff doesnt impact13:55
cloudnullodyssey4me: ah, so... maybe we need to do more of it ?13:56
odyssey4mecloudnull given the role results, I don't think it'll make that much difference13:57
cloudnullok13:57
odyssey4meit may or may not - no idea... th epoint is that the roles are already hyperconverged to use only two containers in most cases... and they're also timing out13:57
odyssey4metiming it with a shorter (1hr timeout), but still timing out13:57
andymccrwell centos is taking ridiculous amounts of time13:58
cloudnullmaybe we need to break up the installation into multiple scenarios ?13:58
andymccri had a swift job time out after an hour and it hadnt even installed rabbitmq yet on the infra container13:58
andymccrwhich suggests that could impact any role test - since we do that for almost all of them13:58
odyssey4methe rabbit and mariadb roles could be radically simplified to improve performance13:58
andymccras long as it has a centos job enabled13:58
dmsimardAye, I needed to turn the OSA jobs to non-voting in the ARA gate this weekend because I was clocking in on 8+ rechecks on a trivial patch13:58
andymccrdmsimard:  :( sorry13:58
mhaydenandymccr: i put in a patch to remove rabbitmq from the swift gate13:58
andymccrdmsimard: is that on the integrated test build?13:58
mhaydenbut it won't pass :|13:58
dmsimardso I'm not sure if there is any one problem or if the timeout just needs to be bumped13:58
*** asettle_ is now known as asettle13:59
dmsimardandymccr: no, osa-tests13:59
andymccrdmsimard: what is the timeout?13:59
odyssey4methe timeout is an hour for the roles13:59
dmsimardandymccr: different places, see here https://review.openstack.org/#/c/491029/13:59
odyssey4meand osa-tests only does two services IIRC13:59
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible master: Revert "Reduce container counts for cinder, ironic and trove"  https://review.openstack.org/47182213:59
dmsimardok, I lie, maybe it's getting timeouts running tempest14:00
dmsimardhttp://logs.openstack.org/29/491029/1/gate/gate-ara-integration-openstack-ansible-func-centos-7/a27f9c5/console.html#_2017-08-06_06_11_50_84886614:00
dmsimardhttp://logs.openstack.org/29/491029/1/gate/gate-ara-integration-openstack-ansible-func-centos-7/d302dba/console.html#_2017-08-06_02_03_01_87204314:00
dmsimardbut it starts running tempest with like 5 minutes left..14:00
dmsimardsounds awfully slow14:00
dmsimardthis one failed with timeout but just barely completed so it has logs http://logs.openstack.org/29/491029/1/check/gate-ara-integration-openstack-ansible-func-centos-7/748d366/console.html14:02
dmsimardsorting by duration in ara, we're seeing tempest tests at ~11 minutes but there is a lot of package installation tasks taking a long time14:03
andymccrdmsimard: for the purposes of ARA, do you need the CentOS gate? we are having issues with that - or have you seen the ubuntu gate fail too?14:04
dmsimardthere's almost 30 minutes worth just of yum install and pip install in there14:04
odyssey4medmsimard yeah, the centos gate specifically is known to take a very long time and often time out14:04
andymccrdmsimard: yea!14:04
andymccrdmsimard: yeah we're battling that rn, i know mhayden was doing some work on that already14:04
andymccrbut right now its still a bit bad and really inconsistent14:04
dmsimardyeah14:05
dmsimardtbh I don't *need* the OSA centos coverage so much as I like having coverage on a deb and a rpm distro14:05
andymccrdmsimard: agreed, for now though i think disabling the centos tests would work better :( until we can figure out the deal with that.14:06
dmsimardit's non-voting right now so it's okay, I'm not going to merge things if there is a legit issue14:06
TahvokHey guys! Currently I don't have any instances running on my environment (so I don't care about downtime). I would like to change the 'tunnel' cidr addresses. From what I saw only the neutron agent container has ip assigned from this cidr. How should I proceed on changing it? I've alreayd modified the openstack_user_config.yml file with correct cidr14:07
andymccrdmsimard: yeah - also from our perspective a lot will fail if that integrated test repo fails on centos :P14:08
dmsimardhm, I thought passing a list to "package" was supposed to do everything in one transaction ?14:08
dmsimardhttp://logs.openstack.org/29/491029/1/check/gate-ara-integration-openstack-ansible-func-centos-7/748d366/logs/ara/file/4ba2983d-4380-4ccb-8311-38f743706faf/#line-7714:08
dmsimardyields http://logs.openstack.org/29/491029/1/check/gate-ara-integration-openstack-ansible-func-centos-7/748d366/logs/ara/result/9d5c9db8-4b97-4302-9b34-9b744c44d3dd/14:09
*** marst has joined #openstack-ansible14:09
*** cshen has joined #openstack-ansible14:14
*** perniciouscaffei has joined #openstack-ansible14:18
openstackgerritMajor Hayden proposed openstack/openstack-ansible master: [WIP] Testing dnf with host/containers  https://review.openstack.org/49108414:23
*** schwicht has quit IRC14:24
*** drifterza has joined #openstack-ansible14:25
lvdombrkrandymccr: quick question after physical node restart all services must be up and running automatically?14:26
randy^^mhm, is there any documentation about "how to contribute to openstack-ansible"?14:31
*** hw_wutianwei has joined #openstack-ansible14:32
cloudnulllvdombrkr: yes the services should all automatically restart14:34
cloudnullrandy^^: what are you looking for?14:35
cloudnullwe've the developer docs14:35
cloudnullhttps://docs.openstack.org/openstack-ansible/newton/newton/developer-docs/contribute.html14:35
spotz+2 on the docs:)14:35
cloudnullif that's what you are looking for ?14:35
evrardjprandy^^: contributions welcomed :)14:36
*** schwicht has joined #openstack-ansible14:37
spotzAnd there's some good and improving docs for getting setup for the community as a hole but can help with that if you get stuck14:37
*** schwicht has quit IRC14:37
openstackgerritLogan V proposed openstack/openstack-ansible master: Run resolvconf role on hosts only after unbound is deployed  https://review.openstack.org/49150814:37
odyssey4mecloudnull that would unfortunately be some sort of outdated thing, this is the link: https://docs.openstack.org/openstack-ansible/latest/contributor/index.html14:37
evrardjpI think everythink is linked from that page spotz :)14:37
*** armaan has quit IRC14:37
evrardjpif not, let's fix it.14:38
*** armaan has joined #openstack-ansible14:38
spotzevrardjp: thingee is working on something new and cool:)14:38
evrardjpyeah I remember that part. We could link that from our guides.14:38
odyssey4meit's two links deep from the index page14:39
odyssey4methat's hardly worth changing the index for14:40
randy^^cloudnull: thanks. just want to report som ebugs - or better, patches :)14:42
*** armaan has quit IRC14:42
randy^^just minor things, missing environment settings in various roles14:42
*** woodard has quit IRC14:43
cloudnullcool14:44
cloudnullwell not cool that there are bugs , but nice to get them reported and even better to get them fixed :)14:44
evrardjprandy^^: yeah don't hesitate to report bugs and even patch them, according to practices! :D14:44
*** schwicht has joined #openstack-ansible14:45
evrardjpwe can help you in any case.14:45
*** tobberyd_ has joined #openstack-ansible14:45
evrardjpthere are variables that we don't define explicitly though.14:45
randy^^btw - is anybody of you from rackspace germany, sitting in munich office?14:46
* cloudnull US14:46
evrardjpI don't think so14:47
evrardjprandy^^: are you from germany?14:47
randy^^we swapped "office rooms" last month - RS DE got our 4 former offices and we (company) took theire 2 rooms :)14:47
randy^^i should get some nerfguns as well, than we can fight against them "over the floor" :>14:48
randy^^evrardjp: yes, munich.14:48
*** tobberydberg has quit IRC14:48
*** tobberyd_ has quit IRC14:50
holmstenHello! I'm looking for best practices for using OSA to deploy Openstack with 3rd party neutron agent and ml2 driver (more specifically Cisco ACI OpenStack Plugin). I'm afraid this isn't possible without rewriting current os_neutron playbook?14:52
dmsimardandymccr: there's really something weird going on with those package resources being split into different transactions14:55
dmsimardseems like #ansible-devel is of my opinion that there should not be different transactions14:56
evrardjpholmsten: you could contribute to our os_neutron role14:57
evrardjpthere are already different 3rd party neutron agents.14:57
logan-why does our specs page only show newton and older http://specs.openstack.org/openstack/openstack-ansible-specs/14:57
evrardjpwow.14:58
evrardjpgood catch logan-14:58
odyssey4medmsimard I think you may be seeing what made mhayden decide that dnf was the answer.14:58
logan-yeah i was just going to link dmsimard to the dnf spec14:58
logan-hhe14:58
logan-heh *14:58
evrardjpholmsten: you could also have your thing on the side, and tap into the process, but I think it would be a great contribution to the community to adapt the os_neutron role.14:58
mhaydendmsimard / odyssey4me: we have a few straggling package tasks that aren't passing as a list :/14:58
odyssey4melogan- 'cos we need to update the index I reckon14:59
mhaydenwhich is why i added that linter14:59
logan-dmsimard: http://docs-draft.openstack.org/24/488424/1/gate/gate-openstack-ansible-specs-docs-ubuntu-xenial/c933a16//doc/build/html/specs/pike/centos-and-dnf.html14:59
evrardjpmhayden: you added a linter?14:59
mhaydenhttps://review.openstack.org/49089914:59
mhaydenjmccrory is excited about it, but it needs more reviews ;)14:59
logan-mhayden: he put an example above that looked like it was using a list though and doing some weird stuff14:59
dmsimardmhayden: yeah but even though we're passing items as a list instead of with_items, it looks like the underlying module is still passing it as sort-of with_items14:59
logan-http://logs.openstack.org/29/491029/1/check/gate-ara-integration-openstack-ansible-func-centos-7/748d366/logs/ara/file/4ba2983d-4380-4ccb-8311-38f743706faf/#line-7715:00
evrardjpmarked15:00
logan--> http://logs.openstack.org/29/491029/1/check/gate-ara-integration-openstack-ansible-func-centos-7/748d366/logs/ara/result/9d5c9db8-4b97-4302-9b34-9b744c44d3dd/15:00
mhaydenswift still has a with_items on a package task -- can't get it merged :|15:00
dmsimardmhayden: yeah see logan-'s links15:00
odyssey4melogan- oh weird - there's a pike index and spec, but not published15:00
* mhayden is about ready to just call apt/yum/dnf/zypper directly with shell :P15:00
logan-haha15:00
dmsimardmhayden: ara is picking up multiple results just as if the task used with_items15:00
mhaydenweird15:00
evrardjpmhayden: I love it15:00
mhaydenlogan-: well, it all goes in one transaction,b ut ansible parses the output multiple times15:01
mhaydenbased on what i saw in the plugin code15:01
logan-oh right15:01
mhaydenevrardjp: feel free to rub a little workflow on that one if you like it15:01
mhaydenlogan-: ansible's yum/dnf modules read over that output bunches of times15:01
mhaydenoh, btw -- i updated https://github.com/major/osa-gate-parser so that it will output tasks that time out15:02
mhaydenexample -> https://paste.fedoraproject.org/paste/eBSjXRp223IMz4UlX7clng/raw15:02
*** tobberydberg has joined #openstack-ansible15:02
logan-very  nice15:03
*** markvoelker has quit IRC15:03
mhaydenevrardjp: i've gotta read up on how to write them linter tests15:04
odyssey4memhayden until the erasure encoding disable is merged, centos for swift is broken15:05
odyssey4meand that patch is failing due to some weird bindep thing15:05
mhaydeni might have to combine that with mine15:05
mhaydenthe rabbitmq removal one15:05
*** markvoelker has joined #openstack-ansible15:06
evrardjpmhayden: worst case scenario: call in our openstack-ansible-tests repo a linting script that runs a failing play on purpose, catch the output and return "OK" :p15:06
mhaydenodyssey4me: let me see if i can combine a few of these together15:06
dmsimardmhayden: sweet15:07
*** tobberydberg has quit IRC15:07
dmsimardmhayden: I could go and say that ara provides that already but /me shrug15:07
holmstenevrardjp: Thanks. If we end up writing our own thing we will most likely push it upstream (assuming it's good enough) to contribute to the community. :)15:08
evrardjpplease don't rewrite os_neutron if you write something external :p15:08
dmsimardmhayden: oh, wait, tasks that time out ? What do you mean ?15:09
mhaydendmsimard: so if a task is running with zuul kills it15:09
dmsimardoh, if the job timed out, not individual tasks15:09
mhaydenthe script catches that now15:09
mhaydensi15:09
*** drifterza has quit IRC15:09
evrardjpdmsimard: also long running tasks can "timeout"15:10
evrardjpthat's how async appeared with timeout:15:10
evrardjpIIRC15:10
evrardjpssh timeout15:10
evrardjpdue to(15:10
evrardjpgrr15:10
evrardjpI can't type15:10
openstackgerritMajor Hayden proposed openstack/openstack-ansible-os_swift master: Optimize swift gate  https://review.openstack.org/49108915:10
mhaydenandymccr: ^^ added your change15:10
dmsimardyeah and I guess ara report or database isn't available if job didn't collect logs15:11
dmsimardcollecting logs should be part of the job publisher, though, not the actual job -- so that you can still retrieve logs if the job times out ?15:11
openstackgerritMajor Hayden proposed openstack/openstack-ansible-os_swift master: Optimize swift gate  https://review.openstack.org/49108915:11
mhaydenokay, i jammed together three speedups for the swift gate -> https://review.openstack.org/49108915:12
odyssey4medmsimard yeah, we've always used an in-repo script so that we can modify it per-branch and not have to patch project-config all the time15:12
* palendae grabs swift gate by the lapels and shakes it15:12
mhaydenif it fails, imma move to a van down by the river15:12
*** Oku_OS is now known as Oku_OS-away15:12
dmsimardodyssey4me: you can still use that in-repo script, but have a publisher call it instead of a job builder15:12
odyssey4meonce zuul-v3 is ready for us we'll be able to achieve the same thing, but let it be in the post-test phase15:12
mhaydenor i might move to canada15:12
odyssey4meor yeah, that's an option15:12
dmsimardmhayden: come say hi if you do o/15:12
odyssey4meback then we didn't understand all this stuff15:13
mhaydenwill so15:13
palendaeSo you've rolled https://review.openstack.org/#/c/490462/ in to that?15:13
mhaydens/so/do/15:13
mhaydensi15:13
mhayden491089 has the bindep update + packages as list + erasure code fix15:13
palendaeA smorgasbord15:13
mhaydenyeah15:13
mhaydenhopefully it passes swiftly15:13
palendaeNo we use Python here15:14
openstackgerritMajor Hayden proposed openstack/openstack-ansible-os_swift master: Add dnf support  https://review.openstack.org/49015715:14
palendaeOk, what does 'dnf' stand for?15:14
palendaeCause I'm never gonna read it as anything but did not finish15:14
*** nollide has joined #openstack-ansible15:14
evrardjpmhayden: I don't know why I giggled on that joke.15:14
mhaydenhah15:15
mhaydenevrardjp: on the linter -- do we have tests for custom linters anywhere?15:15
dmsimardpalendae: https://dnf.readthedocs.io/en/latest/user_faq.html#what-does-dnf-stand-for15:15
odyssey4memhayden nope, so you can be the first to write one!15:15
evrardjpit's not a reason to not have them :D15:15
mhaydendamn15:15
mhaydenwalked right into that one15:15
evrardjpodyssey4me: D15:15
palendaedmsimard: lol15:15
evrardjpgood job there odyssey4me.15:15
odyssey4mewhat a great opportunity!15:15
evrardjpWe didn't even sync before doing it.15:16
openstackgerritJuan Vidal Allende proposed openstack/openstack-ansible-os_neutron master: Add option to deploy OpenDaylight SDN Controller  https://review.openstack.org/48012815:16
odyssey4meok, now that I know what dnf stands for - my previous state remains15:16
odyssey4mebut for a different reason15:16
mbuilodyssey4me: after fighting a bit, I managed to fix the tacker tests issues. Could you check once again please? Then I can continue with the other patch :) Thanks! ==> https://review.openstack.org/#/c/490440/15:17
*** jvidal has quit IRC15:22
odyssey4membuil builting a functional test is hard, but once it works it keeps you honest :)15:36
mbuilodyssey4me: Yes, I feel proud :)15:37
*** markus_z has quit IRC15:37
mbuilodyssey4me: how long do we have until the "code freeze" for Pike?15:37
odyssey4membuil it's already too late to have tacker in the integrated release - a new repo has to be included for two milestones15:38
*** holmsten has quit IRC15:38
mbuilodyssey4me ok I was more thinking about the changes that jvidal did to integrate ODL in the neutron role. I have several patches that I wanted to add on top to provide SFC functionality. Right now, I am working on a private branch15:39
odyssey4meah ok, as the patch is in review already and receiving some attention, it'll get in - no worries there15:40
odyssey4meyou can base patches on top of a patch in review, although obviously rebasing gets tricky15:40
mbuilyep, that's why I was waiting for it to be merged before adding my patches15:41
*** woodard has joined #openstack-ansible15:41
odyssey4membuil looks fine, just a whitespace nit and a question in https://review.openstack.org/#/c/490440 which may need a comment added15:42
mbuilok, good. I need to run but I'll probably create the patch then15:42
*** mbuil has quit IRC15:42
*** cshen has quit IRC15:46
cloudnullso looking over ara for our runtimes, it doesn't look like we're doing too bad in terms of tasks however the gate vms are swapping like crazy15:48
cloudnullhttp://paste.openstack.org/show/617694/15:48
cloudnullodyssey4me: I wonder if we can reduce some of that.15:48
cloudnull?15:48
odyssey4meyeah, OSIC was the only provider that wasn't overcommitted in terms of RAM and CPU15:48
*** drifterza has joined #openstack-ansible15:50
cloudnulleven so, the vms themselves are swapping pretty heavily under test conditions.15:50
odyssey4mewell, I expect that reducing the containers down to a smaller set will help - but that would mean continuing *and* finishing the hyperconvergence path15:51
odyssey4mewhich requires the spec15:51
odyssey4meunless anyone has better ideas15:51
cloudnullI've a local vm with 16GiB of ram and its still swapping which is quite impactful in terms of runtime15:51
palendaeYeah, I've got a few VMs that are maxing out at 16GiB too15:52
openstackgerritMajor Hayden proposed openstack/openstack-ansible-tests master: Add linter for passing pkgs as list  https://review.openstack.org/49089915:52
mhaydenevrardjp: ^^15:52
*** markvoelker has quit IRC15:53
odyssey4meperhaps the 'aio' scenario needs some converged env.d overrides15:54
cloudnullmaybe so15:54
cloudnulli'm poking around15:54
odyssey4mewe can then perhaps figure out what the right hyperconverged default is next cycle15:54
cloudnullmaybe there are some tunables we can use15:54
palendaecloudnull: Weren't you looking at combining a bunch of the OpenStack API containers?15:54
palendaeI seem to recall a mention of that15:54
cloudnullmhayden: and i were15:54
odyssey4mepalendae yep, that's the 'hyperconverged' thing15:54
palendaeGotcha15:55
* mhayden is running glance + nova hyperconverged on centos + ocata15:55
openstackgerritMerged openstack/openstack-ansible-os_sahara master: Add ceph_client role  https://review.openstack.org/48500015:55
mhaydenthe nova hyperconverged setup saves a bunch of hassle15:55
odyssey4meI'm totally in favor of moving to something far simpler - like one container per upstream project, or perhaps even simpler... but we need to ensure that we do so responsibly and provide an automated upgrade path.15:55
palendae^15:55
openstackgerritMajor Hayden proposed openstack/openstack-ansible master: [WIP] Testing dnf with host/containers  https://review.openstack.org/49108415:56
odyssey4meif we move to doing JIT container creation, then we'd also be able to prep per-role rootfs bases and build the containers using cow15:57
odyssey4methat would be super-fast15:57
odyssey4mebut that's work for queens really15:58
*** hw_wutianwei has quit IRC16:00
cloudnullanother interesting thing going on in the gate16:01
cloudnullhttps://snag.gy/1o3pru.jpg16:01
cloudnullcinder seems to be freaking out16:01
cloudnullodyssey4me: I've got a patch up for JIT install of containers.16:01
cloudnullit recently passed 1 gate but timedout on the other.16:02
odyssey4mecloudnull yep, saw that - I was working on one too, but you got further than me... so once youre is in I'll add my adjustments on top which will optimise further16:02
evrardjpmhayden: oh nice well have a look16:02
evrardjpmhayden: maybe running the ansible-lint with -T could also help by showing the rule is effectively loaded16:03
evrardjpIIRC16:03
evrardjplet me see a linter result :D16:03
mhaydenoh fudge, i missed half the commit16:04
openstackgerritMajor Hayden proposed openstack/openstack-ansible-tests master: Add linter for passing pkgs as list  https://review.openstack.org/49089916:04
mhaydennow it should make more sense16:04
cloudnullodyssey4me: https://snag.gy/afoGuh.jpg16:07
cloudnullthe first is with cinder having a panic attack16:07
cloudnullthe second is after I killed it16:07
cloudnullI suspect this is a major cause of our gate situation16:08
*** udesale has quit IRC16:08
*** udesale has joined #openstack-ansible16:08
cloudnullthe volume service and scheduler are being bad actors16:08
*** woodard has quit IRC16:08
palendaeHopefully there's a findable bug to report in there16:09
*** woodard has joined #openstack-ansible16:09
cloudnulli've not looked for a bug just yet, still investigating the issues16:09
palendaeYeah16:10
logan-quite a lot of swapping going on https://jenkins.lstn.net/job/openstack-upstream/job/openstack-ansible-check/82/artifact/logs/munin/localdomain/localhost.localdomain/memory.html16:10
logan-reducing memory usage would help some16:11
cloudnullafter restarting the cinder services they seem to be behaving, i wonder if there's something we're doing that is causing it to run crazy?16:13
palendaecloudnull: Before or after tempest?16:14
*** tobberydberg has joined #openstack-ansible16:14
*** drifterza has quit IRC16:18
*** tobberydberg has quit IRC16:18
andymccrahh mhayden nice work on the swift patch16:26
mhaydenyw16:26
andymccrmhayden: thoughts on applying that with_items -> package change to all repos? now that we're hitting feature freeze im tempted to apply some standards like that16:26
palendaeandymccr: +116:26
mhaydendamn, so 490157 passed but 491089 is going to fail due to the lxc image download :|16:26
*** cshen has joined #openstack-ansible16:26
mhaydenandymccr: well i did propose a linter ;)16:26
mhaydenso we will know where it is16:27
mhaydenor where those bad tasks are16:27
mhaydenandymccr: https://review.openstack.org/49089916:27
andymccrmhayden: why you stealing my ideas?!16:27
andymccrnice one :D16:27
mhaydengracias16:27
andymccrok cool i'll take a look at that - but lets get some standards, maybe we can discuss, i was thinking the service_list thing would be a good one to apply too16:27
palendaemhayden: Is lxc image download broken at the mirror level?16:28
palendaeI'm doing a kilo deploy for upgrade testing, and it won't get past downloading xenial amd64 images16:28
mhaydeni think it flaked out during that one job :/16:28
palendae(Kilo downloads everything in the repo, not scopred)16:28
palendaescoped*16:28
* cloudnull was getting coffee16:31
openstackgerritMajor Hayden proposed openstack/openstack-ansible-os_swift master: Optimize swift gate  https://review.openstack.org/49108916:31
cloudnullpalendae:  that was before tempest16:31
openstackgerritMajor Hayden proposed openstack/openstack-ansible-os_swift master: Add dnf support  https://review.openstack.org/49015716:31
*** udesale has quit IRC16:31
cloudnulllogan-:  that munin change, has that merged in the integrated gate yet ?16:31
logan-no16:31
cloudnullmaybe we can enable that by default?16:32
cloudnullhttps://review.openstack.org/#/c/491005/16:32
logan-yeah I'd like to eventually. it needs a lot of smoothing out16:32
logan-i'm not sure if munin is the right tool yet but I haven't found something better for resource monitoring + static metrics site generation16:33
logan-munin kinda sucks at the time period scoping with our relatively short gate job lifetimes16:33
*** nollide_ has joined #openstack-ansible16:34
logan-its not really a drop in, gonna have to hack around and see how to get the charts actually useful16:34
cloudnulllogan-: it looks like your job was having the same symptoms mine was https://jenkins.lstn.net/job/openstack-upstream/job/openstack-ansible-check/82/artifact/logs/munin/localdomain/localhost.localdomain/cpu.html16:34
logan-yeah16:34
logan-148% avg usage, so cinder is sitting there burning up a core16:35
*** nollide has quit IRC16:35
*** nollide_ is now known as nollide16:35
cloudnullwe have sar on the hosts.16:35
cloudnullI wonder if we could just use something like sargraph16:35
cloudnullor  we used to , maybe we dont any more16:39
cloudnulloh no, its still there.16:40
cloudnullin the openstack_host role16:40
logan-i also put in https://review.openstack.org/#/c/491151/ to help us with the charting16:40
cloudnullcool16:42
*** openstackstatus has quit IRC16:42
*** openstackstatus has joined #openstack-ansible16:42
*** ChanServ sets mode: +v openstackstatus16:42
logan-sargraph looks like it might do the trick16:45
logan-maybe not though, doesnt really look like it has a static site export type of setup16:45
logan-this person is making rrds from sar data https://labs.eleks.com/2013/12/rrd-and-rrdtool-sar-graphs-using-pyrrd.html16:46
*** cshen has quit IRC16:51
cloudnulloh thats pretty cool16:53
*** perniciouscaffei has quit IRC16:55
cloudnullso i can recreate the cinder freak out, detroy the cinder container on an aio, then rerun the cinder playbook.16:57
cloudnullwhen the scheduler starts16:57
cloudnullit goes 100% cpu and never calms down16:58
*** armaan has joined #openstack-ansible16:58
cloudnullit looks as if the scheduler starts before the api16:58
cloudnullwell it only looks like the first start is the issue17:01
cloudnullafter that restarting the services in any order does not exhibit the same behavior17:01
*** shardy has quit IRC17:04
*** woodard_ has joined #openstack-ansible17:10
mhaydencloudnull: any more feedback on https://review.openstack.org/490192 ?17:10
*** woodard has quit IRC17:13
*** woodard_ has quit IRC17:14
cloudnullnope, my questions are answered17:15
*** lostRhino has joined #openstack-ansible17:15
*** pcaruana has quit IRC17:15
*** nollide has quit IRC17:16
mhaydencloudnull: gracias17:17
cloudnullmucho si17:18
*** tobberydberg has joined #openstack-ansible17:18
spotzcloudnull: you're letting mhayden off to easy...17:20
lostRhinoall - we’re having issues with the novnc repo override (nova_novncproxy_git_repo:) - 2 weeks ago we updated our internal repos for stable/newton.  After the update of the repos the build for some reason still wants to go to GitHub.com no matter what we do.  I found a workaround that would pickup the internal repo if I specified the branch as stable/v0.6.  However the repo is still causing issues when we are in a fully disconnected environme17:22
lostRhinoCan someone help me understand why the repo server is not honoring the override in bothe the remote_package_parts and the remote_packages (where the repo points back to GitHub)??  thanks17:22
*** tobberydberg has quit IRC17:22
odyssey4melostRhino when you run the playbook there's a bunch of json output before it executes the role - can you pastebin that?17:23
lostRhinoodyssey4me: sure - do you want everything or just the corresponding information…. its A LOT ...17:24
logan-lostRhino: do you have both nova_novncproxy_git_repo AND nova_novncproxy_git_install_branch specified in your overrides file17:24
odyssey4megimme a sec17:24
lostRhinoyes, recently I had to use the branch to get the internal repo to pick up17:25
logan-i have seen the py pkgs filter ignore override definitions if you _only_ provide a repo url override, you also need to specify _install_branch in the same file, even if you're using the same sha as upstream OSA17:25
lostRhinovery very odd - didnt have to before… but things do change in 3 months17:25
odyssey4mecheck whether this piece of data has the right git repo info: http://logs.openstack.org/88/490988/1/check/gate-openstack-ansible-openstack-ansible-aio-ubuntu-xenial/fd610f9/console.html#_2017-08-04_17_23_40_50129617:25
mgariepyhmm odyssey4me with 14.2.7 tag. http://paste.openstack.org/show/617699/17:25
odyssey4memgariepy did you ensure you updated ansible and all that? never seen that before17:27
mgariepyyep i ran the bootstap ansible script17:27
lostRhinoour egg name is all lower- does that matter?17:27
mgariepyhttp://paste.openstack.org/show/617700/17:27
lostRhinonovnc vs noVNC17:28
*** nollide has joined #openstack-ansible17:28
odyssey4memgariepy this is suposed to lay it down: https://github.com/openstack/openstack-ansible-repo_build/blob/master/tasks/repo_build_prepare.yml#L1617:28
lostRhinothe odd thing that is happening is that it sees the correct repo under remote_package_parts / but under remote_packages it does not17:28
odyssey4melostRhino no, it does not17:29
logan-lostRhino: in the override url? no it does not matter17:29
odyssey4meoh really? well that's interesting17:29
lostRhinohttp://logs.openstack.org/88/490988/1/check/gate-openstack-ansible-openstack-ansible-aio-ubuntu-xenial/fd610f9/console.html#_2017-08-04_17_23_40_50497917:29
lostRhinois where its github17:29
lostRhinohttp://logs.openstack.org/88/490988/1/check/gate-openstack-ansible-openstack-ansible-aio-ubuntu-xenial/fd610f9/console.html#_2017-08-04_17_23_40_50138017:29
lostRhinois where its internal17:29
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible master: Change the cinder playbook to run API commands first  https://review.openstack.org/49155017:30
cloudnulllogan-: odyssey4me ^17:30
odyssey4mecloudnull hmm, lemme check that17:30
odyssey4meI know with some services I had to guess the order17:31
*** cshen has joined #openstack-ansible17:31
cloudnullwith that order changed I'm not seeing the 100% cpu anymore17:31
odyssey4mecloudnull well, something else is not right then - because https://docs.openstack.org/cinder/latest/upgrade.html#after-maintenance-window specifies that cinder-api should be last17:32
mgariepyodyssey4me, https://github.com/openstack/openstack-ansible-repo_build/blob/master/tasks/repo_build_prepare.yml#L2617:32
cloudnullodyssey4me: maybe its the speed at which the services are restarted ?17:33
cloudnullhowever I tried this three times, with the current order it ended in cinder running at 100%17:34
odyssey4meok, so basically it's in compatibility mode at that time - handling both RPC versions17:34
odyssey4mewhich kinda makes sense17:35
logan-even after the initial greenfield install it still does that if you restart in that order?17:35
odyssey4methis is something that should be troubleshot with the upstream guys I would think17:35
odyssey4meeither the docs are bad, or the code is bad, or we're doing something wrong17:36
odyssey4mewhichever way, I'd rather we made informed decisions instead of guess work17:36
openstackgerritMarc Gariépy proposed openstack/openstack-ansible-repo_build stable/newton: Add protection against no local facts being defined  https://review.openstack.org/49155217:36
logan-do we need that in ocata also mgariepy?17:37
cloudnulllogan-:  to recreate the issue on an AIO: destroy / recreate the cinder api container. stop the cinder-volume,backup services on the host. run the os-cinder-install.yml play17:37
mgariepylogan-, probably, but i'll just run a quick test to see something17:37
cloudnullat the end of the play, cinder scheduler will be running at 100%17:38
logan-ouch17:38
cloudnullit happens as soon as this task fires17:39
cloudnullhttps://github.com/openstack/openstack-ansible/blob/master/playbooks/os-cinder-install.yml#L16417:39
cloudnullodyssey4me: how do you want to proceed ?17:39
odyssey4mecloudnull ideally get hold of a cinder developer and dig deeper into why that's happening and whether it's expected or not17:40
odyssey4mewe need to know if our start/restart order needs adjusting, and if so then their docs need adjusting17:40
odyssey4meif it's done the right way, then what's causing the lock-up17:41
cloudnullodyssey4me: maybe there's a way to signify a greenfield deployment and skip all of this https://github.com/openstack/openstack-ansible/blob/master/playbooks/os-cinder-install.yml#L145-L23717:41
logan-i wonder if a reboot could trigger it too depending on container restart order17:42
odyssey4mecloudnull there are possibly, but then we start creating two code paths - one of which is never tested17:42
logan-it seems like a pretty bad bug even if the playbook ordering is "wrong" and the docs are adjusted17:42
odyssey4meit is better to just do what we're doing here and work out where the bugs lie and fix them17:42
odyssey4meworking around things all the time is not good17:42
andymccrif we can get a run down of the things im happy to talk to the cinder peeps if nobody else has bandwidth for it17:42
cloudnullandymccr:  the tl;dr is when the cinder services are hup'd / restarted in the following order HUP: [backup, volume, scheduler] RESTART: [api] -- we're seeing cinder scheduler run at 100% cpu17:44
openstackgerritMarc Gariépy proposed openstack/openstack-ansible-repo_build stable/ocata: Add protection against no local facts being defined  https://review.openstack.org/49155617:45
andymccrcloudnull: got it, but only on startup?17:45
cloudnullso a greenfield deployment with already running services then has that restart dance done and it begins to freakout.17:46
cloudnullthe scheduler never recovers until its restarted manually.17:46
andymccrok cool17:47
cloudnullodyssey4me: maybe we need to make the sheduler just restart, instead of hup?17:47
jmccrorycloudnull : just tried and seeing the same freakout. do you have this in your cinder-scheduler log too? 'Possible hang: wait is waiting for stop to complete'17:49
cloudnulljmccrory:  let me go look17:49
mgariepyarf. was a cache issue.. needed to rm /etc/openstack_deploy/ansible_facts/*repo_container*17:49
cloudnulljmccrory: yes17:50
cloudnullhttp://paste.openstack.org/show/617701/17:50
cloudnullso maybe the easier fix would be to restart the cinder-scheduler instead?17:52
cloudnullwe can hup the others and then restart that one17:52
cloudnullI guess we also should wait until we've heard back from the cinder folks17:52
cloudnullthis potential impacts ocata - https://github.com/openstack/openstack-ansible/blob/stable/ocata/playbooks/os-cinder-install.yml#L140-L162 and newton https://github.com/openstack/openstack-ansible/blob/stable/newton/playbooks/os-cinder-install.yml#L140-L16217:53
odyssey4mewell, the issue with that is that if any transactions have started, they will be lost17:53
odyssey4methe point of the hup is to not lose transactions17:53
odyssey4meand that matters to everything except the API17:54
*** markvoelker has joined #openstack-ansible17:54
cloudnullthis is true17:54
odyssey4meanyway, I'm out for the eve - time to get home17:54
cloudnulltake care odyssey4me17:54
odyssey4melooking forward to seeing whether you find any answers - if you don't, we can pick it up in the morning17:55
*** cshen has quit IRC17:55
spotznight odyssey4me17:55
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible master: Add support for OpenDaylight deployment in neutron  https://review.openstack.org/48013117:55
lostRhinoodyssey4me: can i follow up with you tomorrow?17:56
evrardjplostRhino: most probably :p17:57
lostRhinothanks17:57
*** cshen has joined #openstack-ansible17:57
*** gouthamr has joined #openstack-ansible17:58
*** lostRhino has left #openstack-ansible18:00
mhaydenandymccr: OMG MERGE IT https://review.openstack.org/49108918:03
mhaydenodyssey4me ^^18:04
*** poopcat has joined #openstack-ansible18:04
*** cshen has quit IRC18:04
*** albertcard1 has joined #openstack-ansible18:06
spotzmhayden: that a hint?18:08
mhaydenhaha finally passed the darned gate18:08
mhaydenbut barely18:08
*** poopcat has quit IRC18:08
*** cshen has joined #openstack-ansible18:09
*** albertcard1 is now known as poopcat18:12
spotzotherone failed still:(18:12
*** stuartgr has quit IRC18:13
openstackgerritMerged openstack/openstack-ansible-repo_build stable/newton: Add protection against no local facts being defined  https://review.openstack.org/49155218:13
LiterateHawkHi all - Is the Centos version of OSA functional with linuxbridges? I've installed a small deployment with ubuntu, but we're a ~100% centos/RHEL shop, so my next step is to see if I can't move nodes over to centos18:14
*** cshen has quit IRC18:14
LiterateHawkOf course, if it's still heavily under development, that might not be the best choice18:14
mgariepyLiterateHawk, Centos is working but it still under dev. and might be a little bumpy for upgrades.18:16
mgariepyif you have time to tests stuff it would be great tho.18:17
*** woodard has joined #openstack-ansible18:19
*** woodard has quit IRC18:19
*** armaan has quit IRC18:19
mgariepyLiterateHawk, mhayden, have a running centos cluster.18:19
*** woodard has joined #openstack-ansible18:19
*** armaan has joined #openstack-ansible18:19
*** sauloaislan has joined #openstack-ansible18:20
mgariepyI had one but it's not dismantled as I needed the HW for other purpose.18:20
sauloaislanMorning!18:21
*** cshen has joined #openstack-ansible18:23
sauloaislanIs anyone having trouble launching an OSA?18:23
*** pbandark has quit IRC18:24
*** tobberydberg has joined #openstack-ansible18:24
sauloaislanI can not launch, I'm having some errors18:24
*** prometheanfire has joined #openstack-ansible18:24
*** tobberydberg has quit IRC18:28
*** cshen has quit IRC18:28
*** markvoelker has quit IRC18:28
*** cshen has joined #openstack-ansible18:29
*** chhavi has quit IRC18:31
andymccrmhayden: nice! :D18:36
*** cshen has quit IRC18:37
LiterateHawkmgariepy: I don't have a problem putting in bug reports. My ansible knowledge is ~minimal, though...18:38
LiterateHawkmgariepy: Presumably I could drop/wipe a single master node and reinstall w/centos?18:39
LiterateHawkOr would it be best to just blow the whole thing away18:39
mhaydenthat darned pip_install role keeps giving me the sads https://paste.fedoraproject.org/paste/Rb78hph917HUVJZSsNp2sw/raw18:40
*** nollide has quit IRC18:40
*** cshen has joined #openstack-ansible18:41
sauloaislanI have fail in "Gather variables for each operating system" task18:42
odyssey4memhayden whatever happened to your experiment to install it in playbooks instead of roles?18:42
mhaydenstill scratching my head on that one18:42
*** cshen has quit IRC18:50
mgariepyLiterateHawk, I think it's easier to start over with centos instead of trying to migrate from ubuntu to centos18:53
*** nollide has joined #openstack-ansible18:53
*** tobberydberg has joined #openstack-ansible18:57
LiterateHawkmgariepy: Fair enough, I'm just running a shedload of cirros images anyway..18:59
*** hybridpollo has joined #openstack-ansible19:04
*** cshen has joined #openstack-ansible19:05
*** vishwanathj has quit IRC19:06
*** vishwanathj has joined #openstack-ansible19:06
*** tobberydberg has quit IRC19:07
*** nollide has quit IRC19:08
*** cshen has quit IRC19:08
*** cshen has joined #openstack-ansible19:09
*** nollide has joined #openstack-ansible19:10
*** hybridpollo has quit IRC19:13
*** hybridpollo has joined #openstack-ansible19:15
*** cshen has quit IRC19:15
*** nollide has quit IRC19:17
openstackgerritMerged openstack/openstack-ansible master: Add dnf support  https://review.openstack.org/49019219:19
*** tobberydberg has joined #openstack-ansible19:22
*** tobberyd_ has joined #openstack-ansible19:30
*** armaan has quit IRC19:30
*** tobberydberg has quit IRC19:31
openstackgerritMajor Hayden proposed openstack/openstack-ansible master: Add dnf support to ceph playbook  https://review.openstack.org/49158419:34
openstackgerritMajor Hayden proposed openstack/openstack-ansible master: [WIP] Testing dnf with host/containers  https://review.openstack.org/49108419:35
*** tobberyd_ has quit IRC19:37
*** nollide has joined #openstack-ansible19:41
openstackgerritMajor Hayden proposed openstack/openstack-ansible-pip_install master: Rename package install task  https://review.openstack.org/49158519:51
*** cshen has joined #openstack-ansible19:53
mhaydeni wonder if we could possibly have a directory bind mounted in the gate that would allow us to re-use wheels20:00
mhaydenfor example, we build cryptography a bunch of times on the neutron CI job20:00
*** thorst has quit IRC20:00
mhaydenbut if we could stash compiled wheels in a shared directory, we would only need to build it once20:00
*** thorst has joined #openstack-ansible20:03
*** thorst has quit IRC20:07
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible master: Change the cinder playbook to run API commands first  https://review.openstack.org/49155020:12
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible-os_swift master: Add dnf support  https://review.openstack.org/49015720:16
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible-os_neutron master: Add dnf support  https://review.openstack.org/49014420:16
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible-os_swift master: Disable Erasure Code test for CentOS/RHEL  https://review.openstack.org/49046220:16
*** gouthamr has quit IRC20:17
*** thorst has joined #openstack-ansible20:18
openstackgerritMajor Hayden proposed openstack/openstack-ansible-os_swift master: Add dnf support  https://review.openstack.org/49015720:19
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible master: Revert "Reduce container counts for cinder, ironic and trove"  https://review.openstack.org/47182220:20
openstackgerritMajor Hayden proposed openstack/openstack-ansible master: Apply pip_install after container creation  https://review.openstack.org/48520320:21
cloudnullmhayden: if we use the upstream wheel mirrors we'd not have to build it at all20:22
cloudnullhttps://github.com/openstack/openstack-ansible/blob/master/scripts/fastest-infra-wheel-mirror.py#L32-L4020:22
cloudnullwe could improve our gate test runtimes by pointing at these mirrors, if not already doing so20:23
*** gouthamr has joined #openstack-ansible20:24
*** asura has quit IRC20:25
*** tobberydberg has joined #openstack-ansible20:25
mhaydenhmm, are we using those?20:27
mhaydeni thought we had a task for that20:27
mhaydenperhaps we're using it for the integrated build20:27
mhaydenbut not roles?20:28
* mhayden scurries off to check20:28
mhaydenit seems like we are20:29
*** tobberydberg has quit IRC20:30
*** smatzek has quit IRC20:31
*** radafshar1983 has quit IRC20:35
cloudnullthen it should not be building any wheels unless they're not in the mirror20:36
cloudnullor we're somehow passing in the --isolated flag20:36
*** gouthamr has quit IRC20:36
mhaydenwell we don't take the binary for quite a few things, including cryptography20:39
openstackgerritMajor Hayden proposed openstack/openstack-ansible-openstack_hosts master: Rename package install task  https://review.openstack.org/49160721:01
*** lucasxu has quit IRC21:15
*** tobberydberg has joined #openstack-ansible21:17
openstackgerritJimmy McCrory proposed openstack/openstack-ansible master: Update Ansible to 2.3.2.0  https://review.openstack.org/48174521:25
*** markvoelker has joined #openstack-ansible21:26
*** tobberydberg has quit IRC21:30
mhaydenweird, got a commit where the short sha was all numbers21:47
mhaydenhad to do a double take21:47
mhaydenoh man, the bundle of swift patches just might merge21:48
mhaydenjust maybe21:48
openstackgerritJimmy McCrory proposed openstack/openstack-ansible-os_sahara master: Updated from OpenStack Ansible Tests  https://review.openstack.org/48847621:56
cloudnullmhayden: fingers crossed21:56
openstackgerritJimmy McCrory proposed openstack/openstack-ansible-os_sahara master: Update URLs in documents according to document migration  https://review.openstack.org/48370721:56
mhaydenit started the functional tests at 47 minutes in21:56
mhaydenwhich means it's likely to pass21:56
openstackgerritJimmy McCrory proposed openstack/openstack-ansible-os_sahara master: Updated from global requirements  https://review.openstack.org/48806821:56
mhaydenwhy do i know these things by heart21:56
mhaydenthis is sad21:56
*** cshen has quit IRC21:57
openstackgerritJimmy McCrory proposed openstack/openstack-ansible-os_sahara master: Implement uWSGI for sahara-api  https://review.openstack.org/49142121:57
openstackgerritMerged openstack/openstack-ansible-plugins master: Updated from OpenStack Ansible Tests  https://review.openstack.org/49093421:57
*** markvoelker has quit IRC22:00
mhaydencloudnull: test failed :(22:02
mhaydenno timeout, something blew up in functional tests22:02
*** sliver has quit IRC22:04
openstackgerritMajor Hayden proposed openstack/openstack-ansible master: Add dnf support to ceph playbook  https://review.openstack.org/49158422:07
openstackgerritMajor Hayden proposed openstack/openstack-ansible master: [WIP] Testing dnf with host/containers  https://review.openstack.org/49108422:08
*** esberglu has quit IRC22:09
*** esberglu has joined #openstack-ansible22:10
*** esberglu has quit IRC22:14
*** gtrxcb has joined #openstack-ansible22:15
*** tobberydberg has joined #openstack-ansible22:16
*** nollide has left #openstack-ansible22:16
openstackgerritMerged openstack/openstack-ansible-os_sahara master: Update URLs in documents according to document migration  https://review.openstack.org/48370722:19
*** tobberydberg has quit IRC22:23
*** esberglu has joined #openstack-ansible22:25
*** gouthamr has joined #openstack-ansible22:29
*** thorst has quit IRC22:32
*** markvoelker has joined #openstack-ansible22:32
*** marst has quit IRC22:36
*** tobberydberg has joined #openstack-ansible22:40
*** tobberydberg has quit IRC22:51
*** markvoelker has quit IRC22:54
*** markvoelker has joined #openstack-ansible22:54
*** tobberydberg has joined #openstack-ansible22:55
*** markvoelker has quit IRC22:59
*** tobberydberg has quit IRC22:59
openstackgerritLogan V proposed openstack/openstack-ansible-galera_server master: Allow external restart handler orchestration  https://review.openstack.org/49121123:00
openstackgerritLogan V proposed openstack/openstack-ansible master: Implement galera restarts in playbook  https://review.openstack.org/49121223:00
*** chhavi has joined #openstack-ansible23:08
openstackgerritLogan V proposed openstack/openstack-ansible master: Implement galera restarts in playbook  https://review.openstack.org/49121223:12
*** tobberydberg has joined #openstack-ansible23:29
-openstackstatus- NOTICE: The Gerrit service on review.openstack.org will be offline momentarily at 00:00 utc for a quick reconfiguration-related restart23:32
cloudnullmhayden: :'(23:33
cloudnulllogan-: jmccrory: when you get a chance https://review.openstack.org/#/c/491550/23:33
cloudnullif you wouldn't mind testing that23:33
cloudnullit seems to fix the cinder cpu freakout.23:33
*** tobberydberg has quit IRC23:33
cloudnullin my env but it'd be good to get validation on that point.23:33
*** schwicht has quit IRC23:36
jmccroryi was seeing cinder-backup spiking at 100% earlier too, will give this a shot and see though23:37
cloudnullI had that one go crazy too initially23:37
cloudnullhowever i've not been able to reproduce it23:37
cloudnulland i dont have that halt message in my error log23:37
cloudnullwe can make it restart instead of reload with that patch if we must23:37
cloudnulllooking back through ara nothing stands out as taking too long any more.23:38
jmccroryyeah, didn't have a halt message for it either23:38
cloudnullwe're still swapping like crazy in the gate23:40
cloudnullhttp://paste.openstack.org/show/617718/23:40
cloudnullhttp://logs.openstack.org/50/491550/2/check/gate-openstack-ansible-openstack-ansible-aio-ubuntu-xenial/2ffb40f/logs/openstack/instance-info/host_system_info_22-10-32.log.txt.gz23:40
cloudnullbut its at least succeeding23:40

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!