*** jtomasek has quit IRC | 00:14 | |
*** chlong has quit IRC | 00:20 | |
*** chlong has joined #tripleo | 00:20 | |
*** jtomasek has joined #tripleo | 00:27 | |
*** chlong has quit IRC | 00:41 | |
*** chlong has joined #tripleo | 00:41 | |
*** david-lyle has joined #tripleo | 00:42 | |
*** chlong has quit IRC | 00:48 | |
*** chlong has joined #tripleo | 00:48 | |
*** chlong has quit IRC | 00:53 | |
*** chlong has joined #tripleo | 00:53 | |
*** mkovacik has quit IRC | 01:09 | |
*** yuanying has joined #tripleo | 01:10 | |
*** yuanying_ has quit IRC | 01:11 | |
*** chlong has quit IRC | 01:55 | |
*** chlong has joined #tripleo | 01:55 | |
*** yuanying has quit IRC | 02:00 | |
*** tiswanso has joined #tripleo | 02:02 | |
*** sthillma has quit IRC | 02:03 | |
*** yuanying has joined #tripleo | 02:03 | |
*** david-lyle has quit IRC | 02:05 | |
*** tiswanso has quit IRC | 02:06 | |
*** tiswanso has joined #tripleo | 02:07 | |
*** trozet has joined #tripleo | 02:12 | |
*** davidlenwell has quit IRC | 02:14 | |
*** davidlenwell has joined #tripleo | 02:16 | |
*** ChanServ sets mode: +v davidlenwell | 02:16 | |
*** shivrao has quit IRC | 02:19 | |
*** yuanying has quit IRC | 02:25 | |
*** yuanying has joined #tripleo | 02:27 | |
*** chlong has quit IRC | 02:49 | |
*** chlong has joined #tripleo | 02:49 | |
*** superflyy has joined #tripleo | 02:50 | |
*** pradk has quit IRC | 03:02 | |
*** sbalukoff has quit IRC | 03:02 | |
*** pradk has joined #tripleo | 03:16 | |
*** chlong has quit IRC | 03:17 | |
*** chlong has joined #tripleo | 03:18 | |
*** chlong has quit IRC | 03:19 | |
*** yuanying_ has joined #tripleo | 03:19 | |
*** chlong has joined #tripleo | 03:19 | |
*** yuanyin__ has joined #tripleo | 03:20 | |
*** yuanying has quit IRC | 03:22 | |
*** yuanying_ has quit IRC | 03:23 | |
*** david-lyle has joined #tripleo | 03:25 | |
*** yuanyin__ has quit IRC | 03:29 | |
*** lazy_prince has joined #tripleo | 03:29 | |
*** yuanying has joined #tripleo | 03:29 | |
*** bnemec has quit IRC | 03:34 | |
*** bnemec has joined #tripleo | 03:38 | |
*** thrash is now known as thrash|g0ne | 03:39 | |
openstackgerrit | Steve Baker proposed openstack/python-tripleoclient: openstack overcloud software deployment show https://review.openstack.org/271113 | 03:42 |
---|---|---|
openstackgerrit | Steve Baker proposed openstack/python-tripleoclient: openstack overcloud failures https://review.openstack.org/271114 | 03:42 |
openstackgerrit | Steve Baker proposed openstack/python-tripleoclient: openstack overcloud software deployment list https://review.openstack.org/252670 | 03:42 |
*** yamahata has quit IRC | 03:43 | |
prometheanfire | what do you think about an element to specificly install cloud-init? | 03:46 |
prometheanfire | I'd rather not have to do so in the gentoo element because it'll be inherited even if not wanted | 03:46 |
*** tzumainn has quit IRC | 03:47 | |
*** yuanying has quit IRC | 03:57 | |
*** yuanying has joined #tripleo | 03:58 | |
*** superflyy has quit IRC | 04:01 | |
*** yuanying has quit IRC | 04:02 | |
*** yuanying has joined #tripleo | 04:08 | |
prometheanfire | well, made one, it only installs / sets it up for gentoo atm, but the format is simple, check if DISTRO_NAME (or another identifier) is something, then set up init or whatever | 04:08 |
prometheanfire | and it installs cloud-init via package-installs | 04:08 |
*** trozet has quit IRC | 04:21 | |
*** sbalukoff has joined #tripleo | 04:22 | |
*** lazy_prince has quit IRC | 04:29 | |
*** rlandy has quit IRC | 04:29 | |
*** tiswanso has quit IRC | 04:44 | |
*** pradk has quit IRC | 04:56 | |
*** yamahata has joined #tripleo | 04:56 | |
*** cody-somerville has quit IRC | 05:00 | |
*** ayoung is now known as ayoung_ZZZzzzz | 05:01 | |
*** pradk has joined #tripleo | 05:08 | |
prometheanfire | btw, here's my current diff... | 05:10 |
prometheanfire | http://paste.openstack.org/show/484952/ | 05:10 |
prometheanfire | good think for you is I think that's all the stuff needed in dib, the other changes to the elements are in nodepool | 05:11 |
*** masco has joined #tripleo | 05:18 | |
*** shivrao has joined #tripleo | 05:26 | |
*** masco has quit IRC | 05:27 | |
*** cmyster has quit IRC | 05:32 | |
*** cmyster has joined #tripleo | 05:33 | |
*** cmyster has quit IRC | 05:33 | |
*** cmyster has joined #tripleo | 05:33 | |
*** masco has joined #tripleo | 05:38 | |
*** dshulyak has joined #tripleo | 05:54 | |
*** masco has quit IRC | 06:02 | |
prometheanfire | . | 06:14 |
prometheanfire | is there a way for a user to pass in a package-installs.yml or the like? | 06:14 |
*** panda_ has quit IRC | 06:25 | |
*** panda_ has joined #tripleo | 06:26 | |
*** absubram has joined #tripleo | 06:40 | |
*** absubram_ has joined #tripleo | 06:41 | |
*** rbrady has quit IRC | 06:43 | |
*** ukalifon1 has joined #tripleo | 06:44 | |
*** absubram has quit IRC | 06:44 | |
*** absubram_ is now known as absubram | 06:44 | |
*** rbrady has joined #tripleo | 06:49 | |
*** rcernin has joined #tripleo | 06:53 | |
cmyster | prometheanfire: like in deploy --templates -e package-installs.yml ? | 06:58 |
prometheanfire | oh, I mean for dib | 06:59 |
*** absubram has quit IRC | 07:01 | |
*** oshvartz has joined #tripleo | 07:11 | |
*** cody-somerville has joined #tripleo | 07:19 | |
*** cody-somerville has joined #tripleo | 07:19 | |
*** nijaba has quit IRC | 07:27 | |
*** nijaba has joined #tripleo | 07:27 | |
*** jprovazn has joined #tripleo | 07:47 | |
*** liverpooler has joined #tripleo | 07:54 | |
*** rwsu has quit IRC | 07:58 | |
*** jhenner has quit IRC | 08:02 | |
*** jhenner has joined #tripleo | 08:02 | |
*** hjensas has quit IRC | 08:03 | |
*** fgimenez has joined #tripleo | 08:06 | |
*** shivrao has quit IRC | 08:09 | |
*** jhenner has quit IRC | 08:12 | |
*** gfidente has joined #tripleo | 08:14 | |
*** cody-somerville has quit IRC | 08:14 | |
*** sthillma has joined #tripleo | 08:14 | |
openstackgerrit | xin wu proposed openstack/python-tripleoclient: Install bigswitch networking agent by default https://review.openstack.org/271990 | 08:16 |
*** sthillma_ has joined #tripleo | 08:17 | |
*** devvesa has joined #tripleo | 08:18 | |
*** sthillma has quit IRC | 08:19 | |
*** sthillma_ is now known as sthillma | 08:19 | |
*** shardy has joined #tripleo | 08:24 | |
*** jcoufal has joined #tripleo | 08:26 | |
*** ifarkas has joined #tripleo | 08:30 | |
*** jhenner has joined #tripleo | 08:33 | |
*** jaosorior has joined #tripleo | 08:40 | |
*** mcornea has joined #tripleo | 08:50 | |
*** sthillma has quit IRC | 08:54 | |
*** aufi has joined #tripleo | 08:55 | |
*** derekh has joined #tripleo | 08:56 | |
openstackgerrit | Marios Andreou proposed openstack/tripleo-common: Adds a simple overcloud tenant vm ping test to tripleo.sh https://review.openstack.org/272191 | 08:57 |
jaosorior | marios: Was the only thing added the cherry-picked-from ^^ ? | 08:58 |
marios | jaosorior: yeah commented now. didn't think/expect it to kill the +2s for that tho :/ | 08:59 |
*** bvandenh has joined #tripleo | 08:59 | |
jaosorior | marios: Didn't expect that either, bummer | 08:59 |
jaosorior | marios: Though... it already had two +2, you might as well merge it | 09:00 |
marios | jaosorior: ci | 09:00 |
marios | jaosorior: plus, i woulnd't feel comfortable. i mean, i just uploaded the patch. it is one thing to +2 it (really the unwritten rule is you don't +2 you own reviews, or it doesn't count anyway) but wouldn't want to be the one to merge it, unless emergency | 09:01 |
marios | jaosorior: in this case, bnemec made the cherrypick, so I think is ok to +2 it ('yeah this looks like a good cherrypick from the master') | 09:02 |
marios | jaosorior: ("uploaded the patch" i mean revision 2) | 09:02 |
jaosorior | marios: You uploaded a patch that changes the commit message, so I didn't think it would count | 09:02 |
marios | jaosorior: yeah you're right, i mean there were no code changes. but still I wouldn't want to merge unless it was really necessary/no-one else about | 09:04 |
derekh | marios: +2 from me also | 09:04 |
*** mkovacik has joined #tripleo | 09:05 | |
marios | thx derekh o/ morning | 09:05 |
*** hjensas has joined #tripleo | 09:06 | |
derekh | marios: morning, I think we're good to merge this, the ping test worked for the nonha test https://jenkins07.openstack.org/job/gate-tripleo-ci-f22-nonha/435/consoleText | 09:07 |
derekh | marios: going to see why the other 2 failed and merge if its unrelated | 09:07 |
marios | derekh: thanks, yeah lgtm too /me looks at that console | 09:07 |
marios | derekh: is a nice thing to see in the output :) | 09:08 |
marios | https://jenkins07.openstack.org/job/gate-tripleo-ci-f22-ha/252/console still running | 09:10 |
derekh | marios: yup, it is, ok the ping test failed for the ceph test, but thats a problem with the cloud so I think we can still mere | 09:11 |
derekh | marios: thats for PS2 I'm looking at the results of PS1 | 09:12 |
*** lucas-dinner is now known as lucasagomes | 09:12 | |
marios | derekh: what is the 'ceph test' is there a bug #? or you're referring generally to current ha ci being broken? | 09:12 |
marios | derekh: yeah realised since was still running (we may as well wait for it no? is there an emergency?) | 09:13 |
marios | jaosorior: ^^ | 09:13 |
derekh | marios: need to look at the reason it failed, no bug yey that I know of | 09:13 |
*** fgimenez has quit IRC | 09:13 | |
derekh | marios: ok, lets wait, no emergency | 09:14 |
derekh | school run, bbiab | 09:14 |
jaosorior | marios: Wasn't it the case that stable/liberty CI is broken? And that CR was an attempt to fix it? | 09:14 |
marios | derekh: o/ in a bit | 09:14 |
marios | jaosorior: i don't know or missed that conversation. i know ci was broken for a couple of reasons last few days but they were fixed. I didn't realise the ping test fixed something there | 09:15 |
jaosorior | marios: IIRC stable/ci tries to run the ping test, and the test hadn't been packported yet | 09:15 |
*** fgimenez has joined #tripleo | 09:16 | |
*** fgimenez has joined #tripleo | 09:16 | |
*** dtantsur|afk is now known as dtantsur | 09:16 | |
marios | jaosorior: i see that makes sense and thanks for the info ... hmm i guess since landed in toci_instack now (for ci) | 09:17 |
marios | jaosorior: but this test adds it, so ci run for it should be ok if that was the problem | 09:17 |
marios | jaosorior: s/test/review | 09:17 |
marios | (pingtest on the brain) | 09:18 |
marios | jaosorior: I mean v1 passed the non-ha like at https://jenkins07.openstack.org/job/gate-tripleo-ci-f22-nonha/435/consoleText | 09:18 |
marios | jaosorior: let's see what happens with v2 and we merge? is that reasonable? | 09:18 |
*** jistr has joined #tripleo | 09:19 | |
jaosorior | marios: Sure man, no worries | 09:19 |
jistr | morning | 09:21 |
jaosorior | jistr: Sup dude | 09:21 |
*** mcornea has quit IRC | 09:24 | |
*** nico_auv has joined #tripleo | 09:26 | |
*** electrofelix has joined #tripleo | 09:28 | |
*** olap has joined #tripleo | 09:28 | |
*** akrivoka has joined #tripleo | 09:38 | |
*** dtantsur is now known as dtantsur|brb | 09:48 | |
*** olap has quit IRC | 09:49 | |
*** olap has joined #tripleo | 09:51 | |
shardy | https://review.openstack.org/#/c/272194/ combined with https://review.openstack.org/#/c/272191/ was supposed to fix stable CI | 09:52 |
shardy | I don't think the latter with pass without the former | 09:52 |
shardy | although 272191 still isn't passing the HA job, not yet looked into why | 09:53 |
shardy | marios: ^^ FYI | 09:53 |
jistr | tried to look a while ago but CI logs are unavailable | 09:55 |
jistr | some infra problem, they're talking about it on #openstack-infra | 09:55 |
shardy | Yeah I saw the -dev mail about unstable jobs | 09:56 |
shardy | wow we've had a rough time with CI lately :( | 09:56 |
jaosorior | bummer | 09:57 |
openstackgerrit | Derek Higgins proposed openstack-infra/tripleo-ci: [NO MERGY] Test a update in trunk repository version https://review.openstack.org/229789 | 09:58 |
derekh | jistr: is your looking for ci logs you can get the console log directly from jenkins , find the jobs you looking for here http://tripleo.org/cistatus.html | 10:00 |
derekh | jistr: then clock on the date | 10:00 |
derekh | *click | 10:00 |
jistr | derekh: cool, thanks! | 10:01 |
*** dshulyak has quit IRC | 10:01 | |
derekh | of course that doesn't help you find the other logs but at least its something | 10:02 |
*** dshulyak has joined #tripleo | 10:02 | |
*** mkovacik has quit IRC | 10:04 | |
jistr | http://fpaste.org/314802/53802605/ | 10:05 |
jistr | pingtest failing -- when checking for the tenant stack creation, it got 503 from heat? | 10:05 |
marios | shardy: jistr thanks looking again now (v1 was passing, was waiting for v2 at https://jenkins07.openstack.org/job/gate-tripleo-ci-f22-ha/252/ | 10:14 |
marios | well passing the non-ha afaik/saw | 10:16 |
jistr | marios: we probably need to recheck this one though, as it should fix HA and depends on the pingtest change: https://review.openstack.org/#/c/272194/ but for now openstack-infra advises not to recheck anything as they expect failures (see e-mail [openstack-dev] [all] UNSTABLE jobs - Jenkins failure - do not recheck/approve) | 10:18 |
marios | jistr: ack o/ morning | 10:19 |
jistr | morning :) | 10:19 |
*** rebrego has joined #tripleo | 10:22 | |
*** regebro is now known as Guest80927 | 10:22 | |
*** rebrego is now known as regebro | 10:22 | |
*** jaosorior has quit IRC | 10:23 | |
marios | jistr: sorry just re-read. this one is from the pingtest i think (i.e. https://jenkins07.openstack.org/job/gate-tripleo-ci-f22-ha/252/console) | 10:23 |
*** jaosorior has joined #tripleo | 10:24 | |
jistr | marios: yea but IIUC it cannot pass, as there's another bug which causes the HA job to fail (see the fix i linked above and its depends-on) | 10:25 |
*** athomas has quit IRC | 10:25 | |
*** panda_ has quit IRC | 10:25 | |
marios | jistr: ah ok thanks (yeah i am looking through the console output now). i was confused by the fpaste showing the pingtest fail (or does the pingtest run regardless of overcloud being deployed? wont we exit by then?) | 10:26 |
*** panda_ has joined #tripleo | 10:26 | |
marios | *try* to run and fail like you showed i mean | 10:26 |
*** paramite has joined #tripleo | 10:27 | |
jistr | marios: it tried to run the pingtest and failed on the HA change. That job had (at least i hope so :) ) both fixes present (the mysql HA fix and backport of pingtest to tripleo.sh). Not sure why it got 503... | 10:28 |
jistr | marios: the fpaste is from the latest run on https://review.openstack.org/#/c/272194/ | 10:28 |
marios | jistr: thx looking at http://logs.openstack.org/94/272194/2/check-tripleo/gate-tripleo-ci-f22-ha/21858d0/console.html | 10:29 |
marios | (and see it now) | 10:29 |
marios | jistr: well, it makes sense if 'there is no overcloud heat' | 10:30 |
marios | jistr: i mean ERROR: <html><body><h1>503 Service Unavailable</h1> | 10:30 |
*** athomas has joined #tripleo | 10:30 | |
jistr | just saw an e-mail to dev list re infra -- CI should be back to normal | 10:31 |
shardy | must pass controller_virtual_ip to Class[Tripleo::Loadbalancer] | 10:32 |
shardy | anyone seeing that error recently? | 10:33 |
marios | shardy: that rings a bell... thinking | 10:33 |
* shardy is failing to figure out what he's failed to update | 10:33 | |
shardy | I pulled latest t-h-t yesterday and rebuilt my images, but getting that since | 10:33 |
marios | shardy: ah gfidente from yesterday ^^^ | 10:33 |
marios | shardy: is this like 7.1/7.2 update | 10:33 |
marios | ? | 10:33 |
shardy | marios: no, it's upstream tripleo, I just pulled the latest templates etc | 10:34 |
shardy | was working before that so I assume something changed, I'm just not yet sure what :) | 10:34 |
*** fgimenez has quit IRC | 10:34 | |
gfidente | shardy, I noticed the ControlVirtualIP resource doesn't emit any IP if the underlying neutron port is deleted | 10:35 |
marios | shardy: ah ok we saw it yesterday cos the ControlFixedIPs: was passed in param_defaults vs params | 10:35 |
*** paramite is now known as paramite|afk | 10:35 | |
gfidente | shardy, in which case I got that very same error | 10:35 |
marios | gfidente: there was a bz i believe? | 10:35 |
gfidente | shardy, and I worked around it using replacement_policy: REPLACE_ALWAYS | 10:35 |
gfidente | so that it would recreate the neutron port | 10:35 |
gfidente | but I am not sure if/how the neutron port could get deleted in your case | 10:35 |
shardy | Yeah it's definitely not set in vip_data.yaml on the node | 10:36 |
shardy | gfidente: this is on a fresh stack create | 10:36 |
gfidente | marios, in the BZ they deleted the neutron port manually | 10:36 |
*** fgimenez has joined #tripleo | 10:36 | |
*** fgimenez has joined #tripleo | 10:36 | |
*** mcornea has joined #tripleo | 10:37 | |
gfidente | shardy, oh I thought it was on upgrade ... so you see the actual ip_address with resource-show on ControlVirtualIP ? | 10:37 |
shardy | Yeah control_virtual_ip has no IPs, it's CREATE_COMPLETE, but the status is DOWN | 10:42 |
shardy | I guess that explains it | 10:42 |
derekh | Looking at PS2 ci results for the pingtest (in liberty), NONHA the ping didn't work (but it did try to ping the instance), in the Ha test the overcloud failed to deploy. In PS1 we have a successful ping, I think we have enough to show that the actaul ping test works. The problems are with the actaully deployment being tested. I'm thinking we just merge it to make deciphering these failures easier. | 10:43 |
derekh | anybody disagree ? ^^ | 10:43 |
*** tosky has joined #tripleo | 10:43 | |
gfidente | shardy, not sure I have it in DOWN state too but works | 10:43 |
shardy | derekh: +1 | 10:43 |
shardy | gfidente: Hmm, so it's DOWN, but fixed_ips isn't empty? | 10:43 |
marios | derekh: sounds good to me | 10:44 |
*** yamahata has quit IRC | 10:44 | |
gfidente | shardy, not empty no | 10:44 |
shardy | fixed_ip's for both control_virtual_ip and redis_virtual_ip are empty for me | 10:44 |
shardy | hrm | 10:44 |
gfidente | that's exactly what I saw | 10:44 |
gfidente | when the port is empty, the resource is empy heat can't get the actual ip_address | 10:44 |
derekh | shardy: marios done | 10:44 |
gfidente | but in the BZ marios mentioned, people deleted their neutron port | 10:44 |
gfidente | and we recreated it using the replacement_policy | 10:45 |
gfidente | why it is empty in your case, I dunno | 10:45 |
shardy | I wonder if it's failing to delete the portws | 10:45 |
* shardy tries stack-delete again | 10:45 | |
gfidente | shardy, I have also another strange thing here | 10:45 |
gfidente | basically I seem to have a scenario which consistely reproduced the Timed out waiting for message problem | 10:45 |
gfidente | regardless of the number of cores and memory, I have enough | 10:46 |
shardy | gfidente: that's strange, I've only ever seen it with insufficient engines, or when one gets killed | 10:46 |
openstackgerrit | Merged openstack/tripleo-common: Adds a simple overcloud tenant vm ping test to tripleo.sh https://review.openstack.org/272191 | 10:46 |
gfidente | yeah I am trying to understand what is triggering it | 10:46 |
cmyster | can it be a missing signaling on failure to do something ? | 10:47 |
gfidente | I always get the timeout on the same resource | 10:47 |
gfidente | or better, when trying to update a specific resource | 10:47 |
* cmyster hmmms | 10:47 | |
shardy | gfidente: to clarify, you mean an RPC error in the logs right? | 10:47 |
shardy | timed out waiting for message... etc | 10:47 |
cmyster | sounds too familiar | 10:47 |
cmyster | gfidente: is there an open bug here with steps ? | 10:48 |
shardy | cmyster: it's different from the signalling I think | 10:48 |
gfidente | shardy, yeah rpc/client.py | 10:48 |
cmyster | shardy: I thought so, but this error message is familiar | 10:48 |
gfidente | but it says 'NotFound' on delete | 10:48 |
cmyster | yup | 10:48 |
cmyster | seen it on working on upgrading tests between different kilo versions | 10:49 |
cmyster | but I don't remember what I did in that case and it was sporadic and rare in my case | 10:49 |
shardy | cmyster: IIRC the downstream bugs with this error were fixed by increasing the undercloud resources | 10:50 |
shardy | folks were DoSing themselves by running a single heat-engine on a single-core VM, and/or running without enough RAM and no swap | 10:50 |
gfidente | yeah the interesting thing for me here is that I can reproduce it consistently | 10:50 |
shardy | gfidente: can you raise a bug with the steps? | 10:50 |
shardy | the NotFound thing sounds like a good clue to investigate | 10:51 |
cmyster | gfidente: and please add how many cpus/ram | 10:51 |
gfidente | shardy, cmyster here is a trace | 10:51 |
gfidente | http://paste.openstack.org/show/484977/ | 10:51 |
gfidente | # free -m | 10:51 |
gfidente | total used free shared buff/cache available | 10:51 |
gfidente | Mem: 7823 4507 1920 390 1395 2660 | 10:51 |
gfidente | Swap: 0 0 0 | 10:51 |
gfidente | ^^ that's the undercloud free | 10:51 |
gfidente | and it has 2 cores | 10:52 |
gfidente | I will try increasing the workers | 10:52 |
gfidente | just to see if that makes any difference | 10:52 |
cmyster | gfidente: its not enough... | 10:54 |
*** dtantsur|brb is now known as dtantsur | 11:00 | |
gfidente | cmyster, what is not enough? 2 cores? | 11:03 |
*** chlong has quit IRC | 11:05 | |
cmyster | back | 11:09 |
shardy | gfidente: It should be enough, but if we're still pinned to an old heat, you may want to bump the workers to 4 | 11:09 |
shardy | gfidente: and add some swap, just in case: | 11:09 |
*** chlong has joined #tripleo | 11:10 | |
cmyster | gfidente: 8gb RAM is a limit | 11:10 |
shardy | https://github.com/openstack-infra/tripleo-ci/blob/master/toci_instack.sh#L198 | 11:10 |
gfidente | shardy, yeah I am trying with 6 workers now | 11:10 |
shardy | I'm running a 6G undercloud with some swap, and it mostly doesn't need it | 11:10 |
shardy | although I mostly do 2 node nonha deployments | 11:10 |
cmyster | gfidente: FYI every time I cook images nowadays I use a guest image and add 2gb swap as a file in / | 11:10 |
gfidente | shardy, so any chance this might have more to do with parallelism rather than memory? | 11:11 |
shardy | gfidente: Yes, that was the main issue | 11:11 |
shardy | gfidente: 1 worker is definitely not enough, 4 or more should be | 11:11 |
shardy | otherwise you throw like 100 stacks at a single engine and it just falls over | 11:11 |
gfidente | shardy, ok so let me see how it goes with 6 workers, cause cpu average remained pretty low too ~0.6 | 11:11 |
gfidente | but I had 3 workers | 11:11 |
shardy | gfidente: If you had 3 heat-engine processes e.g via ps ax | grep heat-engine, that is only two workers | 11:12 |
shardy | the parent process doesn't actually do anything | 11:12 |
gfidente | ack, I had 2, one per core | 11:12 |
gfidente | 3 processes | 11:12 |
cmyster | legit | 11:12 |
*** mbound has joined #tripleo | 11:13 | |
gfidente | so I set it to 6 in the config, I want to see if it passes and if maybe we 'consume' the cpu better with such a configuration | 11:13 |
shardy | Yeah, I've not tested that, it may not be enough - the heat default is now a minimum of 4 | 11:13 |
shardy | gfidente: it's more that the Linux scheduler does much better multi-tasking than the one inside heat | 11:13 |
gfidente | shardy, yeah so for the undercloud we might want to use a higher default regardless of the cores count | 11:14 |
gfidente | let's see if it passes first though | 11:14 |
shardy | gfidente: The heat default is now a minimum of 4, the problem is we're still pinned to a really old heat | 11:15 |
shardy | I'm running heat-master locally FWIW and it works fine | 11:15 |
gfidente | shardy, I was scaling a 1ctrl+1cmp+1ceph to +1ceph | 11:15 |
gfidente | but interestingly scaling to +1cmp worked | 11:16 |
gfidente | as if the type of work needed to scale the ceph node was heavier | 11:17 |
gfidente | fwiw scaling ceph requires an update of a resource which is shared with the controllers too | 11:17 |
gfidente | while scaling computes doesn't afaik | 11:17 |
*** regebro has quit IRC | 11:26 | |
openstackgerrit | Derek Higgins proposed openstack/tripleo-common: split overcloud setup out of pingtest https://review.openstack.org/270631 | 11:26 |
*** regebro has joined #tripleo | 11:31 | |
openstackgerrit | Derek Higgins proposed openstack-infra/tripleo-ci: [NO MERGY] Test a update in trunk repository version https://review.openstack.org/229789 | 11:34 |
openstackgerrit | Derek Higgins proposed openstack-infra/tripleo-ci: Switch the overcloud pingtest to use the new heat client https://review.openstack.org/272479 | 11:34 |
*** mgould has joined #tripleo | 11:34 | |
*** fgimenez has quit IRC | 11:44 | |
*** paramite|afk is now known as paramite | 11:55 | |
*** fgimenez has joined #tripleo | 11:56 | |
*** fgimenez has quit IRC | 11:56 | |
*** fgimenez has joined #tripleo | 11:56 | |
*** paramite is now known as paramite|afk | 12:00 | |
*** paramite|afk is now known as paramite | 12:02 | |
*** Marga_ has quit IRC | 12:03 | |
*** Marga_ has joined #tripleo | 12:03 | |
*** jaosorior has quit IRC | 12:12 | |
*** jaosorior has joined #tripleo | 12:13 | |
openstackgerrit | Derek Higgins proposed openstack-infra/tripleo-ci: Nothing to see here https://review.openstack.org/111011 | 12:35 |
*** weshay has quit IRC | 12:35 | |
*** regebro has quit IRC | 12:39 | |
*** regebro has joined #tripleo | 12:39 | |
*** trellooobot has joined #tripleo | 12:41 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 12:41 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 12:41 |
trellooobot | | Title | URL | Members | Last Active | | 12:41 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 12:41 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 0 min | | 12:41 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 12:41 |
*** trellooobot has quit IRC | 12:41 | |
*** jhenner1 has joined #tripleo | 12:47 | |
*** jhenner has quit IRC | 12:50 | |
openstackgerrit | Giulio Fidente proposed openstack/tripleo-heat-templates: Support the deployment of Ceph over IPv6 https://review.openstack.org/272089 | 12:51 |
*** rhallisey has joined #tripleo | 12:51 | |
*** pradk has quit IRC | 12:52 | |
*** jhenner1 has quit IRC | 12:53 | |
mgould | hi everyone | 12:57 |
*** akrivoka has quit IRC | 12:58 | |
mgould | https://review.openstack.org/#/c/270869/ includes a change to a Jenkins script and was merged at 1900UTC last night, but a build run at 1141UTC this morning was still using the old code: http://logs.openstack.org/36/265336/4/check/check-osc-plugins/a7b06f6/ | 12:59 |
mgould | will that sort itself out eventually, or is there some process to update the code run by the Jenkins workers? | 12:59 |
*** dprince has joined #tripleo | 12:59 | |
*** trown|outttypeww is now known as trown | 13:04 | |
*** lucasagomes is now known as lucas-hungry | 13:07 | |
*** trellooobot has joined #tripleo | 13:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 13:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 13:10 |
trellooobot | | Title | URL | Members | Last Active | | 13:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 13:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 29 min | | 13:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 13:10 |
*** trellooobot has quit IRC | 13:10 | |
*** jayg|g0n3 is now known as jayg | 13:14 | |
*** weshay_xchat has joined #tripleo | 13:19 | |
d0ugal | mgould: I think you linked to the wrong patch | 13:21 |
*** jhenner has joined #tripleo | 13:22 | |
*** aufi has quit IRC | 13:22 | |
*** weshay_xchat is now known as weshay | 13:27 | |
*** larsks has quit IRC | 13:27 | |
*** akrivoka has joined #tripleo | 13:32 | |
gfidente | shardy, fwiw, 6 workers didn't make the trick here, timed out at the time of updating the very same resources | 13:33 |
gfidente | shardy, but with a different trace, no kidding | 13:35 |
gfidente | http://paste.openstack.org/show/484999/ | 13:35 |
gfidente | but hey cpu average is much higher, like 2.16 | 13:38 |
*** fgimenez has quit IRC | 13:39 | |
*** Goneri has joined #tripleo | 13:40 | |
*** fgimenez has joined #tripleo | 13:41 | |
*** fgimenez has quit IRC | 13:41 | |
*** fgimenez has joined #tripleo | 13:41 | |
shardy | gfidente: check the logs and ensure a heat-engine worker didn't get killed due to OOM | 13:42 |
shardy | that looks like it timed out trying to update a nested stack | 13:42 |
shardy | and/or add some swap and see if it still happens :) | 13:43 |
gfidente | no still have 7 processes | 13:44 |
gfidente | and plenty of ram | 13:44 |
gfidente | http://paste.openstack.org/show/485000/ | 13:44 |
shardy | hmm, strange | 13:45 |
shardy | Is there any engine backtrace just after the update of the nested stack happens? | 13:46 |
shardy | something must be going wrong handling the stack_resource update (which happens via RPC) | 13:46 |
marios | jistr /me palmface with the brno nordic themed meeting room names (who was it that thought this would be memorable/useful/fun? | 13:48 |
marios | jistr: heh, though i guess it *is* memorable if you know what they mean :) | 13:49 |
*** julim has joined #tripleo | 13:50 | |
marios | jistr: ddg tells me iceland | 13:50 |
jistr | marios: it was selection between proposals by a democratic vote :D (The participation wasn't too high IIRC, and people from all buildings voted, not just the ones who would be actually based in FBC2, which explains the result a bit :D) | 13:53 |
marios | jistr: i can only imagine the memolist flamefest fallout from an event like that | 13:54 |
jistr | yea :D | 13:55 |
*** tzumainn has joined #tripleo | 14:00 | |
dprince | meeting time | 14:01 |
*** eggmaster has joined #tripleo | 14:01 | |
derekh | trown: we got half a meatloaf song https://review.openstack.org/#/c/229789/ | 14:02 |
*** mkovacik has joined #tripleo | 14:02 | |
derekh | trown: one out of three aint bad | 14:02 |
trown | derekh: lol | 14:02 |
*** lucas-hungry is now known as lucasagomes | 14:06 | |
*** trellooobot has joined #tripleo | 14:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 14:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 14:10 |
trellooobot | | Title | URL | Members | Last Active | | 14:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 14:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 89 min | | 14:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 14:10 |
*** trellooobot has quit IRC | 14:10 | |
*** dmacpher has quit IRC | 14:11 | |
*** tiswanso has joined #tripleo | 14:13 | |
*** rlandy has joined #tripleo | 14:14 | |
gfidente | shardy, so yes there is an error | 14:16 |
*** pblaho has quit IRC | 14:17 | |
gfidente | but is logged right after stack is created | 14:17 |
gfidente | not during update | 14:17 |
gfidente | http://paste.openstack.org/show/485008/ | 14:17 |
gfidente | so in despite that error, stack creation is reported as COMPLETE | 14:18 |
*** pblaho has joined #tripleo | 14:18 | |
*** lblanchard has joined #tripleo | 14:19 | |
shardy | gfidente: aha! | 14:20 |
shardy | I bet that truncation is happening on the resource that gets stuck on update ;) | 14:21 |
gfidente | shardy, I am trying to understand that | 14:21 |
shardy | gfidente: can you work out what resource type it is from the lines before the error? | 14:21 |
mgould | d0ugal, I don't think so | 14:21 |
openstackgerrit | Jiri Stransky proposed openstack/tripleo-heat-templates: Introduce update/upgrade workflow https://review.openstack.org/271358 | 14:21 |
openstackgerrit | Jiri Stransky proposed openstack/tripleo-heat-templates: Split pacemaker common check_service function out of _restart.sh https://review.openstack.org/260443 | 14:21 |
openstackgerrit | Jiri Stransky proposed openstack/tripleo-heat-templates: Add resources for major upgrade in Pacemaker scenario https://review.openstack.org/253276 | 14:21 |
openstackgerrit | Jiri Stransky proposed openstack/tripleo-heat-templates: Use timeout to check for services status https://review.openstack.org/259573 | 14:21 |
* shardy wonders about a huge manifest in a SoftwareConfig | 14:21 | |
jistr | ^ just rebases | 14:22 |
gfidente | shardy, that is it indeed | 14:22 |
mgould | d0ugal, I think those links are correct | 14:22 |
shardy | gfidente: Ah, I suspect that is the issue | 14:22 |
mgould | I just clicked on them both and they took me where I expected :-) | 14:22 |
shardy | it's a heat bug, we should either handle it or fail on create | 14:22 |
*** absubram has joined #tripleo | 14:23 | |
*** trozet has joined #tripleo | 14:24 | |
mgould | one goes to the build log rather than the patch; the patch is https://review.openstack.org/#/c/265336/ | 14:24 |
*** absubram_ has joined #tripleo | 14:24 | |
*** hjensas has quit IRC | 14:25 | |
*** panda_ has quit IRC | 14:25 | |
*** thrash|g0ne is now known as thrash | 14:26 | |
*** panda_ has joined #tripleo | 14:26 | |
*** absubram has quit IRC | 14:28 | |
*** absubram_ is now known as absubram | 14:28 | |
openstackgerrit | Miles Gould proposed openstack/python-tripleoclient: Use Ironic API v1.11 to support ENROLL state https://review.openstack.org/272206 | 14:29 |
openstackgerrit | Miles Gould proposed openstack/python-tripleoclient: Remove tripleoclient.baremetal wrapper https://review.openstack.org/265336 | 14:29 |
*** larsks has joined #tripleo | 14:30 | |
*** tiswanso has quit IRC | 14:31 | |
*** tiswanso has joined #tripleo | 14:32 | |
* mgould submits a new version in response to review comments, so we'll see what happens to the build this time | 14:33 | |
d0ugal | mgould: oh, sorry - I didn't understand the question. I do now. I don't know how often Jenkins is updated. | 14:33 |
mgould | how many Jenkins workers are there? | 14:33 |
*** pradk has joined #tripleo | 14:34 | |
*** pblaho has quit IRC | 14:35 | |
*** davidlenwell has quit IRC | 14:35 | |
d0ugal | mgould: hundreds I think :) | 14:36 |
*** dmsimard is now known as dmsimard|afk | 14:39 | |
*** pblaho has joined #tripleo | 14:39 | |
d0ugal | Although, I don't know how many are dedicated to TripleO | 14:40 |
*** jhenner has quit IRC | 14:42 | |
*** jhenner has joined #tripleo | 14:42 | |
*** mcornea has quit IRC | 14:42 | |
*** davidlenwell has joined #tripleo | 14:49 | |
*** ChanServ sets mode: +v davidlenwell | 14:49 | |
*** fgimenez has quit IRC | 14:50 | |
*** fgimenez has joined #tripleo | 14:53 | |
*** fgimenez has quit IRC | 14:53 | |
*** fgimenez has joined #tripleo | 14:53 | |
openstackgerrit | Ben Swartzlander proposed openstack/tripleo-heat-templates: Enable Manila integration https://review.openstack.org/188137 | 14:56 |
*** tosky has quit IRC | 14:59 | |
*** tosky has joined #tripleo | 15:01 | |
*** hjensas has joined #tripleo | 15:03 | |
*** pblaho has quit IRC | 15:03 | |
openstackgerrit | Ben Swartzlander proposed openstack/tripleo-heat-templates: Add NetApp integration to Manila https://review.openstack.org/188138 | 15:07 |
openstackgerrit | John Trowbridge proposed openstack-infra/tripleo-ci: [NO MERGY] Test a update in trunk repository version https://review.openstack.org/229789 | 15:07 |
trown | derekh: I put the latest hash to pass RDOCI in there | 15:08 |
*** trellooobot has joined #tripleo | 15:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 15:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 15:10 |
trellooobot | | Title | URL | Members | Last Active | | 15:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 15:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 149 min | | 15:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 15:10 |
*** trellooobot has quit IRC | 15:10 | |
derekh | trown: ack | 15:11 |
*** tiswanso has quit IRC | 15:15 | |
*** rpothier has joined #tripleo | 15:15 | |
*** tiswanso has joined #tripleo | 15:15 | |
*** tiswanso has quit IRC | 15:15 | |
*** mcornea has joined #tripleo | 15:16 | |
*** ukalifon1 has quit IRC | 15:22 | |
*** pblaho has joined #tripleo | 15:24 | |
*** dshulyak has quit IRC | 15:25 | |
*** dshulyak has joined #tripleo | 15:26 | |
*** aufi has joined #tripleo | 15:26 | |
*** yamahata has joined #tripleo | 15:29 | |
*** tiswanso has joined #tripleo | 15:30 | |
*** regebro has quit IRC | 15:31 | |
*** egafford has joined #tripleo | 15:36 | |
egafford | dprince, bnemec, shardy: Any chance I could get a quick review on https://review.openstack.org/#/c/271042/ ? (One-line change to remove the Sahara default password in heat-templates.) | 15:37 |
openstackgerrit | Derek Higgins proposed openstack-infra/tripleo-ci: Nothing to see here https://review.openstack.org/111011 | 15:40 |
*** regebro has joined #tripleo | 15:40 | |
*** rasca has quit IRC | 15:41 | |
egafford | Thanks shardy! | 15:42 |
*** rasca has joined #tripleo | 15:43 | |
openstackgerrit | Merged openstack/tripleo-heat-templates: Removing Sahara password default https://review.openstack.org/271042 | 15:45 |
*** shardy_ has joined #tripleo | 15:53 | |
*** shardy has quit IRC | 15:54 | |
*** kukacz has joined #tripleo | 15:56 | |
*** Goneri has quit IRC | 16:04 | |
*** mbound has quit IRC | 16:09 | |
*** trellooobot has joined #tripleo | 16:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 16:10 |
*** devvesa has quit IRC | 16:10 | |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 16:10 |
trellooobot | | Title | URL | Members | Last Active | | 16:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 16:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 209 min | | 16:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 16:10 |
*** trellooobot has quit IRC | 16:10 | |
*** jcoufal_ has joined #tripleo | 16:11 | |
*** mcornea has quit IRC | 16:12 | |
*** jcoufal has quit IRC | 16:12 | |
mgould | is anyone else still having problems with check-osc-plugins gate failures? | 16:12 |
mgould | and is there a way I can see all the recent runs of a given check script? | 16:14 |
jaosorior | mgould: Isn't that because of the missing python-tuskarclient? | 16:15 |
jaosorior | IIRC it was recently deprecated, yet it's still trying to find it' | 16:15 |
mgould | jaosorior, I think so, but the patch to remove that was merged at 1900 UTC last night | 16:15 |
jaosorior | it's setup.py, and fails there | 16:15 |
mgould | yep | 16:15 |
jaosorior | mgould: funky | 16:15 |
mgould | because python-tuskarclient is now only a README | 16:15 |
* mgould goes to hassle openstack-infra again | 16:16 | |
*** oshvartz has quit IRC | 16:18 | |
gfidente | shardy_, so there is no bug in LP either for the truncate bug? | 16:19 |
*** rbrady has quit IRC | 16:21 | |
*** rbrady has joined #tripleo | 16:21 | |
shardy_ | gfidente: No, AFAIK you're the first to break Heat in this way, congratulations! ;) | 16:24 |
gfidente | :( | 16:24 |
shardy_ | Feel free to raise one with a reproducer | 16:24 |
gfidente | I was looking forward to do some copy/pate | 16:25 |
gfidente | not to be the first! | 16:25 |
*** devvesa has joined #tripleo | 16:26 | |
shardy_ | gfidente: It's probably an easy fix, if you have a reproducer I'm fairly sure we can fix it quickly | 16:26 |
openstackgerrit | Ethan Gafford proposed openstack/python-tripleoclient: Sahara integration https://review.openstack.org/272625 | 16:26 |
shardy_ | gfidente: the main issue is we'll ideally need to avoid a DB migration | 16:26 |
gfidente | which makes it less of an easy fix | 16:27 |
openstackgerrit | Ethan Gafford proposed openstack/tripleo-heat-templates: Removing Sahara password default https://review.openstack.org/272627 | 16:27 |
*** chlong is now known as chlong_zzz | 16:28 | |
shardy_ | gfidente: If you've identified the failing config, you obviously have the option of splitting it so it's not so big as a workaround | 16:30 |
*** Goneri has joined #tripleo | 16:30 | |
*** devvesa has quit IRC | 16:31 | |
*** paramite has quit IRC | 16:31 | |
*** aufi has quit IRC | 16:35 | |
*** dtantsur is now known as dtantsur|afk | 16:39 | |
*** fgimenez has quit IRC | 16:43 | |
*** fgimenez has joined #tripleo | 16:44 | |
*** fgimenez has joined #tripleo | 16:44 | |
*** mbound has joined #tripleo | 16:44 | |
* derekh still hasn't gotten any logs from the timed out overcloud, | 16:44 | |
*** mgould has quit IRC | 16:48 | |
*** alop has joined #tripleo | 16:51 | |
*** pblaho has quit IRC | 16:53 | |
*** mcornea has joined #tripleo | 16:57 | |
*** a2hill has left #tripleo | 17:01 | |
*** julim_ has joined #tripleo | 17:02 | |
*** mgould has joined #tripleo | 17:03 | |
*** trozet_ has joined #tripleo | 17:05 | |
*** fgimenez has quit IRC | 17:06 | |
*** akrivoka has quit IRC | 17:07 | |
*** trozet has quit IRC | 17:07 | |
*** julim has quit IRC | 17:07 | |
*** nijaba has quit IRC | 17:07 | |
*** jtomasek has quit IRC | 17:07 | |
*** akrivoka has joined #tripleo | 17:07 | |
*** nijaba has joined #tripleo | 17:08 | |
*** nijaba has quit IRC | 17:08 | |
*** nijaba has joined #tripleo | 17:08 | |
*** pblaho has joined #tripleo | 17:08 | |
*** jtomasek has joined #tripleo | 17:08 | |
*** trellooobot has joined #tripleo | 17:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 17:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 17:10 |
trellooobot | | Title | URL | Members | Last Active | | 17:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 17:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 269 min | | 17:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 17:10 |
*** trellooobot has quit IRC | 17:10 | |
*** openstackgerrit has quit IRC | 17:17 | |
*** openstackgerrit has joined #tripleo | 17:17 | |
openstackgerrit | Ben Swartzlander proposed openstack/tripleo-heat-templates: Enable Manila integration https://review.openstack.org/188137 | 17:22 |
openstackgerrit | Jiri Stransky proposed openstack/tripleo-heat-templates: Increase galera sync timeout in yum_update.sh https://review.openstack.org/272664 | 17:26 |
*** sthillma has joined #tripleo | 17:29 | |
*** sthillma_ has joined #tripleo | 17:30 | |
*** dmsimard|afk has quit IRC | 17:30 | |
*** EmilienM has quit IRC | 17:30 | |
*** sthillma has quit IRC | 17:33 | |
*** sthillma_ is now known as sthillma | 17:33 | |
*** dmsimard has joined #tripleo | 17:34 | |
*** EmilienM has joined #tripleo | 17:35 | |
*** ayoung_ZZZzzzz is now known as ayoung | 17:35 | |
*** ayoung has quit IRC | 17:36 | |
openstackgerrit | Miles Gould proposed openstack/python-tripleoclient: Use Ironic API v1.11 to support ENROLL state https://review.openstack.org/272206 | 17:38 |
openstackgerrit | Miles Gould proposed openstack/python-tripleoclient: Remove tripleoclient.baremetal wrapper https://review.openstack.org/265336 | 17:38 |
shardy_ | derekh: FYI I just did a clean VM environment install w/tripleo.sh | 17:38 |
*** olap has quit IRC | 17:38 | |
shardy_ | there's something wrong with the RPC, heat-api can't connect to rabbit | 17:38 |
shardy_ | I'm not sure why yet, could possibly be related to the timeout issue, all CLI calls to heat hang then time out | 17:39 |
slagle | hey, i just saw that too | 17:39 |
derekh | shardy_: yup, same here, heat commands arn't responding , I'm thinking its this puppet-heat commit https://review.openstack.org/#/c/249711/ | 17:39 |
slagle | rpc_backend is set to qpid in /usr/share/heat!! | 17:39 |
derekh | gonna try a puppet-heat pin in a minute | 17:39 |
shardy_ | In the heat logs there's ECONNREFUSED, can't connect to rabbit at all | 17:39 |
slagle | shardy_: is it trying to use qpid or rabbit? | 17:39 |
slagle | mine was trying to use qpid | 17:39 |
shardy_ | woah - it's qpid! | 17:40 |
slagle | i was just looking into this | 17:40 |
slagle | the packaging has been fixed, https://github.com/openstack-packages/heat/commit/a2ed21a64ca39596bb94d9af063f02c7dba1cd0f | 17:40 |
slagle | but i guess our delorean is too old | 17:40 |
slagle | but soemthing else must have changed, maybe in puppet-heat | 17:40 |
*** ayoung has joined #tripleo | 17:41 | |
shardy_ | Ah, that probably explains why I didn't hit this earlier when I was testing a locally build master heat package | 17:41 |
derekh | slagle: https://review.openstack.org/#/c/249711/ | 17:42 |
shardy_ | Ok, well thanks both for the clarification | 17:42 |
slagle | doh, failed tripleo ci on that one :( | 17:42 |
slagle | on a side node, config under /usr/share needs to die for all of openstack | 17:44 |
*** cody-somerville has joined #tripleo | 17:45 | |
*** cody-somerville has joined #tripleo | 17:45 | |
openstackgerrit | Derek Higgins proposed openstack/tripleo-common: Pin puppet heat https://review.openstack.org/272671 | 17:47 |
derekh | <slagle> the packaging has been fixed, https://github.com/openstack-packages/heat/commit/a2ed21a64ca39596bb94d9af063f02c7dba1cd0f | 17:47 |
derekh | that explains why the test of the newer repository passed earlier | 17:47 |
*** regebro has quit IRC | 17:50 | |
*** rwsu has joined #tripleo | 17:50 | |
*** electrofelix has quit IRC | 17:53 | |
*** cody-somerville has quit IRC | 17:53 | |
*** kukacz has quit IRC | 17:53 | |
openstackgerrit | Derek Higgins proposed openstack-infra/tripleo-ci: [NO MERGY] Test a update in trunk repository version https://review.openstack.org/229789 | 17:54 |
openstackgerrit | Derek Higgins proposed openstack-infra/tripleo-ci: Switch the overcloud pingtest to use the new heat client https://review.openstack.org/272479 | 17:54 |
*** ifarkas has quit IRC | 17:54 | |
derekh | trown ^^ I had a patch to use the new heat client patch you have proposed, when you updated the hash we wanted to test you dropped the patch | 17:54 |
*** jaosorior has quit IRC | 17:55 | |
derekh | So, I gotta run, we got 2 options to unblock CI, | 17:55 |
derekh | 1. merge this if CI passes https://review.openstack.org/#/c/272671/ | 17:55 |
derekh | 2. merge the 4 patches referenced in this patch https://review.openstack.org/#/c/229789/ and also update this link on the delorean server http://trunk.rdoproject.org/centos7/current-tripleo/ | 17:56 |
*** regebro has joined #tripleo | 17:57 | |
trown | derekh: ah whoops, sorry about that | 17:59 |
derekh | slagle: bnemec dprince ^^ | 17:59 |
derekh | trown: no prob ;-) | 18:00 |
dprince | derekh: +2 on your 272671 patch | 18:00 |
*** julim has joined #tripleo | 18:00 | |
derekh | dprince: ack, will check back in a bit | 18:00 |
*** derekh has quit IRC | 18:01 | |
*** julim_ has quit IRC | 18:03 | |
shardy_ | Interestingly I also get the "inc: command not found" No bootable device error on all nodes with the default undercloud deployed via tripleo.sh | 18:04 |
shardy_ | Upgrading ironic seems to fix it | 18:04 |
* shardy_ wonders why it works in CI | 18:04 | |
*** lucasagomes is now known as lucas-dinner | 18:04 | |
bnemec | shardy_: CI is running on Fedora virt hosts, I believe. They have a newer ipxe rom for the vms. | 18:08 |
shardy_ | bnemec: aha | 18:08 |
bnemec | This has been broken for a long time. :-( | 18:08 |
bnemec | It's one of the biggest reasons we need to update the CI pinned repo, because Ironic fixed it a long time ago too. | 18:09 |
shardy_ | Yeah I recall upgrading ironic last time I rebuilt my undercloud, but that was weeks ago :( | 18:09 |
shardy_ | tripleo.sh --delorean-build ftw, I now have upgraded all-the-things :) | 18:10 |
*** trellooobot has joined #tripleo | 18:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 18:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 18:10 |
trellooobot | | Title | URL | Members | Last Active | | 18:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 18:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 329 min | | 18:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 18:10 |
*** trellooobot has quit IRC | 18:10 | |
*** jistr has quit IRC | 18:11 | |
*** olap has joined #tripleo | 18:17 | |
*** mbound has quit IRC | 18:19 | |
*** tosky has quit IRC | 18:19 | |
*** trozet_ is now known as trozet | 18:20 | |
*** panda_ has quit IRC | 18:25 | |
*** mgould has quit IRC | 18:25 | |
*** eil397 has joined #tripleo | 18:26 | |
*** panda_ has joined #tripleo | 18:26 | |
*** penick has joined #tripleo | 18:28 | |
openstackgerrit | Ben Swartzlander proposed openstack/tripleo-heat-templates: Enable Manila integration https://review.openstack.org/188137 | 18:29 |
*** olap has quit IRC | 18:32 | |
*** Marga_ has quit IRC | 18:33 | |
*** gchamoul is now known as gchamoul|relocat | 18:39 | |
shardy_ | slagle: if you get a moment I'd appreciate feedback on https://bugs.launchpad.net/tripleo/+bug/1538082 | 18:45 |
openstack | Launchpad bug 1538082 in tripleo "Re running undercloud install fails" [Undecided,New] | 18:45 |
shardy_ | The root cause appears to be missing selinux-policy-devel | 18:45 |
shardy_ | I tried removing selinux from the blacklist in centos-7-undercloud-packages.json and installing a new undercloud with an updated instack-undercloud | 18:46 |
shardy_ | I thought including that element would install the package, but it appears not | 18:46 |
slagle | shardy_: can you try deleting /usr/libexec/os-refresh-config and running again | 18:46 |
slagle | we probably need to clean that dir on each run | 18:46 |
*** bvandenh has quit IRC | 18:46 | |
slagle | i thought there was a patch for this up already | 18:46 |
shardy_ | slagle: ah, I'll give that a try, thanks! | 18:47 |
slagle | yes, here, https://review.openstack.org/#/c/270809/ | 18:47 |
*** akuznetsov has joined #tripleo | 18:48 | |
slagle | hmm, it's merged on master already | 18:48 |
slagle | shardy_: anyway, check if you have that patch | 18:48 |
slagle | and if so, then i guess that's not the issue :) | 18:48 |
openstackgerrit | Emilien Macchi proposed openstack/tripleo-heat-templates: Deploy Aodh services, replacing Ceilometer Alarm https://review.openstack.org/241408 | 18:49 |
EmilienM | pradk: ^ let's see how CI works | 18:49 |
pradk | cool | 18:49 |
shardy_ | slagle: ah, yeah I spotted that patch earlier and I do have it | 18:50 |
shardy_ | nvm, I'll dig a bit further, thanks! | 18:51 |
*** trown is now known as trown|lunch | 18:51 | |
*** bvandenh has joined #tripleo | 18:51 | |
pradk | EmilienM, so for ha the tricky part with pacemaker is orchestrating the services? trying to understand whats left for ha | 18:52 |
EmilienM | pradk: yeah | 18:55 |
EmilienM | with apache | 18:55 |
pradk | EmilienM, ah running the api with wsgi? | 18:56 |
EmilienM | yep | 18:56 |
*** shivrao has joined #tripleo | 18:57 | |
*** jaosorior_ has joined #tripleo | 18:57 | |
slagle | shardy_: sorry, i was going to try reinstalling my uc in a minute | 18:58 |
slagle | i'm currently stuck in some weird haproxy/httpd/keystone competing for the same port hell | 18:59 |
*** bswartz has joined #tripleo | 19:00 | |
slagle | EmilienM: oh hi! | 19:00 |
EmilienM | slagle: oh hey james, how is snow? | 19:00 |
*** nico_auv has quit IRC | 19:00 | |
EmilienM | it's snowing as hell here | 19:00 |
slagle | EmilienM: now that keystone is in wsgi on the undercloud, haproxy and httpd are both trying to bind to 35357 | 19:00 |
openstackgerrit | Dan Prince proposed openstack/tripleo-image-elements: Work around leak in dhcp-all-interfaces udev rule https://review.openstack.org/272697 | 19:01 |
EmilienM | slagle: wait, on which port was keystone binded before? | 19:01 |
dprince | athomas ^^ | 19:01 |
dprince | athomas: https://review.openstack.org/272697 | 19:01 |
slagle | EmilienM: the same one | 19:02 |
slagle | EmilienM: ok, haproxy is only binding the vips on that port | 19:02 |
slagle | but httpd has <VirtualHost *:35357> | 19:02 |
EmilienM | we can change it ! | 19:02 |
EmilienM | let me send a patch right now | 19:03 |
slagle | in 10-keystone_wsgi_admin.conf | 19:03 |
slagle | doesnt that mean httpd will also try and bind on the vips? | 19:03 |
EmilienM | it will bind on 0.0.0.0 | 19:03 |
EmilienM | which is iiuc something you don't want | 19:03 |
slagle | i think we probably want the local_ip value there | 19:04 |
EmilienM | wait | 19:04 |
slagle | instead of * | 19:04 |
*** shardy_ has quit IRC | 19:04 | |
EmilienM | slagle: https://github.com/openstack/instack-undercloud/blob/master/elements/puppet-stack-config/puppet-stack-config.yaml.template#L138-L139 | 19:04 |
EmilienM | how do you explain that? It was binded on 0.0.0.0 before anyway | 19:05 |
EmilienM | anyway, I'm updating it so we can bind on local_ip | 19:05 |
openstackgerrit | Ben Swartzlander proposed openstack/tripleo-heat-templates: Add NetApp integration to Manila https://review.openstack.org/188138 | 19:06 |
athomas | dprince, Cool. Thanks. | 19:06 |
slagle | EmilienM: i'm not sure | 19:06 |
EmilienM | slagle: right we override it in the manifest | 19:07 |
EmilienM | I'm cleaning it | 19:07 |
*** gfidente has quit IRC | 19:07 | |
slagle | i updated an existing undercloud to this config | 19:07 |
slagle | maybe some old config is leftover? | 19:07 |
openstackgerrit | Dan Prince proposed openstack/tripleo-image-elements: Work around leak in dhcp-all-interfaces udev rule https://review.openstack.org/272697 | 19:07 |
openstackgerrit | Emilien Macchi proposed openstack/instack-undercloud: keystone/wsgi: bind on local IP https://review.openstack.org/272699 | 19:08 |
EmilienM | slagle: ^ | 19:08 |
EmilienM | oops ^ | 19:08 |
EmilienM | it should do the job | 19:08 |
openstackgerrit | Emilien Macchi proposed openstack/instack-undercloud: keystone/wsgi: bind on local IP https://review.openstack.org/272699 | 19:08 |
*** sthillma has quit IRC | 19:08 | |
EmilienM | cleaning also hiera ^ | 19:08 |
*** athomas has quit IRC | 19:09 | |
*** trellooobot has joined #tripleo | 19:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 19:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 19:10 |
trellooobot | | Title | URL | Members | Last Active | | 19:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 19:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 389 min | | 19:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 19:10 |
*** trellooobot has quit IRC | 19:10 | |
*** akuznetsov has quit IRC | 19:10 | |
*** akrivoka has quit IRC | 19:11 | |
*** Marga_ has joined #tripleo | 19:11 | |
*** mcornea has quit IRC | 19:12 | |
slagle | EmilienM: ok, will try it. just fixing it locally, i had to also add the IP to the Listen directive in /etc/httpd/conf/ports.conf | 19:18 |
*** dprince has quit IRC | 19:21 | |
*** athomas has joined #tripleo | 19:22 | |
*** mkovacik has quit IRC | 19:25 | |
*** gchamoul|relocat is now known as gchamoul | 19:28 | |
*** sthillma has joined #tripleo | 19:31 | |
openstackgerrit | Emilien Macchi proposed openstack/instack-undercloud: keystone/wsgi: bind on local IP https://review.openstack.org/272699 | 19:35 |
*** rlandy has quit IRC | 19:35 | |
EmilienM | slagle: yeah, my patch should take care of this, we'll check though | 19:35 |
*** dshulyak has quit IRC | 19:37 | |
*** mkovacik has joined #tripleo | 19:39 | |
*** rlandy has joined #tripleo | 19:41 | |
*** dprince has joined #tripleo | 19:42 | |
*** eil397 has quit IRC | 19:43 | |
*** eil397 has joined #tripleo | 19:46 | |
*** trown|lunch is now known as trown | 19:56 | |
*** rcernin has quit IRC | 19:58 | |
*** lblanchard1 has joined #tripleo | 20:02 | |
*** lblanchard has quit IRC | 20:03 | |
*** rpothier has quit IRC | 20:09 | |
*** sthillma has quit IRC | 20:10 | |
*** trellooobot has joined #tripleo | 20:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 20:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 20:10 |
trellooobot | | Title | URL | Members | Last Active | | 20:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 20:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 449 min | | 20:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 20:10 |
*** trellooobot has quit IRC | 20:10 | |
*** pradk has quit IRC | 20:14 | |
*** pradk_ has joined #tripleo | 20:14 | |
*** egafford has quit IRC | 20:14 | |
*** pradk_ is now known as pradk | 20:14 | |
*** mcornea has joined #tripleo | 20:18 | |
*** bswartz has left #tripleo | 20:30 | |
openstackgerrit | Derek Higgins proposed openstack/tripleo-common: Pin puppet heat https://review.openstack.org/272671 | 20:31 |
*** derekh has joined #tripleo | 20:31 | |
* derekh screwed up the puppet-heat pin | 20:31 | |
*** rcernin has joined #tripleo | 20:32 | |
derekh | had to start the puppet-heat pin CI again, screwed up the REPOREF variable name | 20:32 |
*** yamahata has quit IRC | 20:37 | |
openstackgerrit | Steve Baker proposed openstack/python-tripleoclient: openstack overcloud software deployment show https://review.openstack.org/271113 | 20:47 |
openstackgerrit | Steve Baker proposed openstack/python-tripleoclient: openstack overcloud failures https://review.openstack.org/271114 | 20:47 |
*** bvandenh has quit IRC | 20:52 | |
*** pradk has quit IRC | 20:53 | |
*** derekh has quit IRC | 20:53 | |
*** pradk has joined #tripleo | 20:54 | |
alop | Random question... in DIB, why is finalize stage misspelled as 'finalise'? | 20:56 |
slagle | we aint all from 'murica | 20:57 |
slagle | there are some british spellings mixed in throughout tripleo | 20:58 |
alop | Took me a while to figure out why my 'finalize.d' was getting ignored | 21:01 |
*** derekh has joined #tripleo | 21:06 | |
*** trellooobot has joined #tripleo | 21:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 21:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 21:10 |
trellooobot | | Title | URL | Members | Last Active | | 21:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 21:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 509 min | | 21:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 21:10 |
*** trellooobot has quit IRC | 21:10 | |
*** ccrouch has joined #tripleo | 21:12 | |
openstackgerrit | Ben Swartzlander proposed openstack/tripleo-heat-templates: Add integration with NetApp Manila driver https://review.openstack.org/188138 | 21:19 |
*** rcernin has quit IRC | 21:28 | |
*** julim has quit IRC | 21:30 | |
*** tiswanso has quit IRC | 21:32 | |
*** jcoufal_ has quit IRC | 21:32 | |
*** tiswanso has joined #tripleo | 21:33 | |
*** sthillma has joined #tripleo | 21:34 | |
*** trown is now known as trown|outttypeww | 21:37 | |
*** jprovazn has quit IRC | 21:38 | |
*** jaosorior_ has quit IRC | 21:39 | |
*** regebro has quit IRC | 21:48 | |
*** ayoung has quit IRC | 21:51 | |
*** lblanchard1 has quit IRC | 21:53 | |
*** lblanchard has joined #tripleo | 21:54 | |
*** ayoung has joined #tripleo | 21:58 | |
*** yamahata has joined #tripleo | 21:58 | |
*** lblanchard has quit IRC | 22:00 | |
*** yamahata has quit IRC | 22:01 | |
*** yamahata has joined #tripleo | 22:01 | |
*** ccrouch has quit IRC | 22:02 | |
*** ccrouch has joined #tripleo | 22:04 | |
*** dprince has quit IRC | 22:09 | |
*** trellooobot has joined #tripleo | 22:10 | |
trellooobot | URGENT TRIPLEO TASKS NEED ATTENTION | 22:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 22:10 |
trellooobot | | Title | URL | Members | Last Active | | 22:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 22:10 |
trellooobot | | overcloud deploys timing out | https://trello.com/c/xm08OSBy | **NEEDS MEMBERS** | 570 min | | 22:10 |
trellooobot | +------------------------------+-------------------------------+-------------------+-------------+ | 22:10 |
*** trellooobot has quit IRC | 22:10 | |
*** penick has quit IRC | 22:14 | |
*** penick has joined #tripleo | 22:18 | |
openstackgerrit | Merged openstack/tripleo-common: Pin puppet heat https://review.openstack.org/272671 | 22:20 |
*** davidlenwell has quit IRC | 22:24 | |
*** penick has quit IRC | 22:24 | |
*** davidlenwell has joined #tripleo | 22:25 | |
*** ChanServ sets mode: +v davidlenwell | 22:25 | |
*** rlandy has quit IRC | 22:26 | |
*** derekh has quit IRC | 22:27 | |
*** rlandy has joined #tripleo | 22:27 | |
*** jayg is now known as jayg|g0n3 | 22:32 | |
*** tiswanso has quit IRC | 22:47 | |
*** ccrouch has quit IRC | 22:49 | |
*** weshay has quit IRC | 22:52 | |
greghaynes | Not sure if you all have seen, but https://bugs.launchpad.net/diskimage-builder/+bug/1538135 | 22:54 |
openstack | Launchpad bug 1538135 in diskimage-builder "disk-image-create deleting system /dev during image create " [Critical,New] | 22:54 |
greghaynes | beware | 22:54 |
*** ccrouch has joined #tripleo | 22:57 | |
*** sthillma_ has joined #tripleo | 23:05 | |
*** sthillma has quit IRC | 23:05 | |
*** sthillma_ is now known as sthillma | 23:05 | |
*** shivrao has quit IRC | 23:06 | |
*** shivrao has joined #tripleo | 23:07 | |
*** mcornea has quit IRC | 23:07 | |
openstackgerrit | Emilien Macchi proposed openstack/instack-undercloud: keystone/wsgi: bind on local IP https://review.openstack.org/272699 | 23:10 |
*** sthillma has quit IRC | 23:11 | |
*** sthillma has joined #tripleo | 23:12 | |
*** derekh has joined #tripleo | 23:29 | |
*** jcoufal has joined #tripleo | 23:33 | |
*** ooolpbot has joined #tripleo | 23:49 | |
ooolpbot | URGENT TRIPLEO TASKS NEED ATTENTION | 23:49 |
ooolpbot | https://bugs.launchpad.net/tripleo/+bug/1538127 | 23:49 |
openstack | Launchpad bug 1538127 in tripleo "overcloud deploys timing out" [Critical,Triaged] | 23:49 |
*** ooolpbot has quit IRC | 23:49 |
Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!