Tuesday, 2015-09-01

*** achanda has quit IRC00:01
*** achanda_ has quit IRC00:02
*** dims has quit IRC00:03
*** sdake_ has joined #kolla00:08
*** britthou_ has joined #kolla00:09
*** jasonsb has joined #kolla00:09
*** sdake has quit IRC00:11
*** britthouser has quit IRC00:12
bmaceyou tagging l3 today sdake?  or just pushing up / tagging some images to dockerhub?00:15
*** diogogmt has quit IRC00:18
*** sdake has joined #kolla00:18
*** jruano has joined #kolla00:19
*** sdake_ has quit IRC00:21
*** weiyu has quit IRC00:22
sdakebmace i will tag l3 when its ready00:32
sdakeatm it isnt' working multinode for me00:32
sdakehttps://bugs.launchpad.net/kolla/+bug/149078000:32
openstackLaunchpad bug 1490780 in kolla "rabbitmq fails during startup" [Critical,Triaged]00:32
sdakebmace more importantly:00:33
sdakehttps://bugs.launchpad.net/kolla/+bug/149077800:33
openstackLaunchpad bug 1490778 in kolla "multinode deploy results in intermittent authentication failures" [Critical,Triaged] - Assigned to Steven Dake (sdake)00:33
sdakehowever the mariadb permission problem is fixed00:33
sdakeonce i can reliably get a multinode deploy l3 will be tagged00:34
sdakebmace do those bugs ring any bells?00:34
*** blahRus has quit IRC00:41
*** zhiwei has joined #kolla00:44
bmacei have seen some failures in various places, but since we are using ndb rather than mariadb.. mileage probably may vary :)00:45
*** sdake_ has joined #kolla00:47
*** sdake__ has joined #kolla00:49
*** sdake has quit IRC00:50
*** sdake_ has quit IRC00:53
sdake__bmace what is nbd?00:54
*** sdake__ is now known as sdake00:54
sdakeopen source database?00:55
*** banix has joined #kolla00:55
*** weiyu has joined #kolla01:07
*** banix has quit IRC01:12
*** Haomeng has joined #kolla01:14
*** Haomeng|2 has quit IRC01:17
*** dims has joined #kolla01:20
*** banix has joined #kolla01:20
*** erkules_ has joined #kolla01:22
*** erkules has quit IRC01:25
*** zhiwei has quit IRC01:37
*** SourabhP has quit IRC01:38
sdakeubuntu gate just merged yay ;)01:57
*** jruano has quit IRC01:58
sdakesamyaple you alive yet02:01
*** rhallisey has quit IRC02:11
*** banix has quit IRC02:13
*** jruano has joined #kolla02:27
*** jruano has quit IRC02:33
*** jruano has joined #kolla02:39
*** klint has joined #kolla02:51
*** britthou_ has quit IRC02:53
*** britthouser has joined #kolla02:53
*** alisonh has quit IRC02:56
*** dayou has joined #kolla03:01
*** achanda has joined #kolla03:07
*** alisonh has joined #kolla03:13
*** jruano has quit IRC03:19
*** alisonh has quit IRC03:22
SamYaplesdake: yea03:23
sdakemultinode is completely busted: https://bugs.launchpad.net/kolla/+bug/149077803:25
openstackLaunchpad bug 1490778 in kolla "multinode deploy results in intermittent authentication failures" [Critical,Triaged] - Assigned to Steven Dake (sdake)03:25
sdakebut your mariadb bug fix did the trick there03:25
sdakeeating bbiaf - can you read that bug log03:25
*** alisonh has joined #kolla03:26
sdakesamyaple here is the diff I'm working with - still busted03:27
sdakehttp://ur1.ca/nmldb -> http://paste.fedoraproject.org/261788/4107801603:27
SamYaplehey before you go digging into that let me submit my keepalive config changes that we need03:29
SamYaplemight solve your issue03:29
SamYaple(again... no issues on ubuntu. centos seems bad)03:29
*** unicell has quit IRC03:31
sdakecool i'll be happy to test03:32
sdakeplease git review it ;)03:33
sdakei think the issue is haproxy tho not keepalived03:33
sdakeprobably ubuntu has some different defaults then centos03:33
SamYapleprobably but the patch affects both03:33
SamYapleno we lay down the haproxy config03:33
sdakei understand that03:33
SamYapleand keepalived for that matter03:33
SamYaplethere are no other options coming into play though03:34
sdakei have been digging itn othe code for the last 8 hurs i understand how it works now :)03:34
sdake+  stick on dst03:34
sdake+  stick-table type ip size 203:34
sdakegoing to give that a go03:34
sdakei htink this is the last thing pending for liberty-303:34
sdakeand i can tag it and we can get on our way to rc103:34
SamYaplewe might as well tag it since we arent fixing that tonight...03:35
SamYaplewere definetely not using those sticky options03:35
sdakewe aren't tagging until its fixed03:36
sdakeits totally busted atm03:36
SamYapleno its not03:36
sdakemultinode is completely unusable03:36
SamYapleits busted in an inactive environment03:36
SamYapleif you actually used the environment it would be fine03:37
sdakeif i sit ther eand spam glance image-list03:37
SamYaplewe are not solving for an idle case03:37
sdakethe first 2 go through03:37
sdakethe 3rd fails03:37
sdakethat is not inactive03:37
sdakethen it never works after that03:37
sdakeagain completely unusable03:37
SamYaplealright well we are not thorwing options at it to fix it03:37
SamYaplewe will need to understand the issue and then fix it and the sticky options are not the fix03:37
sdakei dont even know what sticy options do, just in the red hat bugzilla tracker03:38
SamYapleexactly03:38
sdakebut ya we have until the 3rd to fix it03:38
SamYaplethats my point03:38
sdakeso lets get cracking03:38
sdakeif you need access to my enviornment i am happy to provide it03:38
sdakepeople will actually try out liberty-303:38
SamYapleI may have to since centos is the only one affected03:38
sdakei dont want centos completely busted03:38
sdakeemail me your ssh key03:39
sdakepublic key that is03:39
SamYaplessh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQD02SUqeQSk4sapStfsh8HbBfSxNoipYAeKJHzhhldHHXXuW+csByqWfyFQBXAKGdDa1RBk7JNAz9WpyUpmqtYwGvlDofzY52RYdm3jzw9tZCS5ERuMrYB14xokFA5nMyOBluiEkkZvlkJpTywh9Be9kaMJutx46Li+grcSXCtUo3l8mcbMHF0uzg+JFm8Q6KpW0hmmyiPyvwMd4m1+jXw4rVJiHW8GD5Pkhmfwo6srT0mDaOGnrfdXoucoP0Is0U4uwnBxpPfyvG/MKuekyEeoM+zc7kBhHwXyPVYSnOkqgzO4j2eUIYt8ca+ya/8cVdSC/GsGAxBcx4ICMpvxhGXV null@null03:39
sdakewhat login id do you want03:40
SamYapledoesnt matter03:41
sdaketell me what you want plz03:41
sdakes8m?03:41
SamYapleyea i really dont care03:42
SamYaplethats fine03:42
sdakesticky doesn't seem to fix it03:44
sdakebummer03:44
sdakesec making creds03:44
*** britthou_ has joined #kolla03:46
*** thetrav has joined #kolla03:48
*** britthouser has quit IRC03:48
thetravdo you guys still publish docker containers to docker_hub?03:49
thetravthe one sitting there under fedora-keystone is out of date and broken03:49
SamYaplethetrav: we do not03:50
SamYaplethetrav: we will for stable branches03:50
SamYaplewe cant keep up with publishing for every commit due to how slow docker hub is03:50
thetravare there any stable branches yet?03:50
SamYaplethetrav: now for about 6 weeks03:51
SamYaplenot*03:51
thetravmeaning you are planning on releasing something stable in about 6 weeks?03:51
SamYapleI may push images once we snapshot L3 thetrav, ive had a few requests03:51
SamYaplethetrav: officially, yes03:51
thetravoooooh03:51
*** achanda has quit IRC03:51
thetravI've used working containers published in the past, it's useful03:52
SamYapleyea weve gone through alot of changes recently so the containers need to be built fro mthe same commit id that the playbooks are run from03:52
SamYaplethats part of the issue03:52
SamYaplewe are solving this through a few areas, including having packaged Kolla so it can be pip installed03:53
thetrav?03:53
SamYapleright now the ansible playbooks and docker images are very closely tied together03:53
SamYapleany images we push owuld quickly go out of date03:53
SamYapleso we do not push any03:53
sdakethetrav its pretty painless to build your own images03:54
thetravso you're moving into a model where the containers are service centric, ideally allowing anyone to use them for various orchestrators?03:54
sdakeand that way they match the dev commit your using03:54
SamYaplethetrav: yea that was always the plan, but we are very close to it03:54
thetravsdake: i've built my own in the past, both from scratch and by forking the kolla github03:54
sdakethetrav i'd like to see the api to the containers stabilize to such a point that tripleo could use them equally03:54
SamYaplethetrav: Triple-O uses puppet right now i believe, so we are halfway there03:55
sdakehowever we aren't quite there yet03:55
SamYapleyea03:55
*** britthou_ has quit IRC03:55
thetravso should I build from master or stable/kilo?03:56
SamYaplethetrav: master03:57
SamYaplestable/kilo is not really a valid branch03:57
sdakesam try ssh into 101,102,10303:57
sdakemake sure your ssh keys are working there03:58
sdakeits a branch just not maintained03:58
sdakei expect in liberty the branches will be better maintained thetrav03:58
sdakethe liberty branch that is03:58
SamYaplesdake: im not going to copy my private key to your server if thats what you are asking03:59
sdakeno03:59
sdakei already have you setup with a private key on my machines03:59
SamYapleah yes 101 seems borked03:59
SamYaplethe other two are fine03:59
sdakedefine borked?03:59
sdakeoh you can't ssh you mean03:59
SamYapleyea03:59
SamYaple101 is asking for pass04:00
*** diga has joined #kolla04:00
SamYaplerejecting key04:00
sdaketry 101 again04:00
SamYaplegtg04:00
SamYapleah wrong window04:00
SamYapleyea its working04:00
sdakeyou should be able to sudo04:01
thetravdo you guys have a docker container for building kolla containers?04:01
sdakeholy batman summer storm time04:01
sdakethetrav no04:01
sdakeSamYaple 101-103 are the deployment targets04:02
sdake100:4000 is the registry server04:02
sdakei rebuilt the images from master this morning and no new commits have hit the repo04:02
sdakethis is after your change to fix mariadb04:03
sdakeuse this openrc on YOUR hardware04:03
sdakehttp://ur1.ca/nmlof -> http://paste.fedoraproject.org/261804/4108023304:04
SamYapleare you saying I would be able to sudo? passwordless?04:04
sdakeyes04:05
SamYapleasking for passwords04:05
sdakedo you know how to turn the password request off04:05
sdakeand i'll make that change04:06
sdakeyour in the wheel group04:06
SamYaples8m ALL=(ALL) NOPASSWD: ALL04:06
SamYaplethe NOPASSWD: is key there04:06
sdakeyou will have to relog for tht to take efect04:08
sdakeshould be set on all 4 machines04:08
sdakei deploy from 100 to 101-10304:08
SamYaplenah you dont have to relog for sudo04:09
SamYapleit rereads each time04:09
SamYapleall good04:09
sdakecool04:09
SamYapleyou have to relog for group changes04:09
sdakedont damage my 100 box plz ;)04:09
SamYaplei wont04:09
sdakethe others can be reloaded04:10
sdakeall 4 boxes can be rebooted04:10
sdakefeel free to delete the images or containers on any box04:10
sdakewhat is wierd about this problem is openstack endpoint list in a loop works fine04:11
sdakebut glance image list does not04:11
sdakeopenrc to be used on your laptop http://ur1.ca/nmlqz -> http://paste.fedoraproject.org/261809/1080754104:12
sdakeoh i pasted that already i see above04:13
sdakelet me know when your planning to take it over04:13
sdakebtw AIO works fantasitcally well on 10004:14
SamYaplecan I install tmux on 100?04:15
sdakethetrav i'd highly encourage you to try out master and follow the docs as best as possible04:15
sdakefeel free04:15
SamYaplethanks04:15
sdakejust don't yum upgrade plz04:15
sdakei don tthink yo ucan use that openrc from my hardware, but i am not certain04:15
sdakethetrav and provide feedback - only 3 weeks left until we wrap up dev for this cycle04:16
sdake101-103 are centos 100 is fedora 2104:17
sdakenotice i am a bit tight on disk space04:17
thetravsdake: ok I'll have a look, I assume you're still focussing on single node though?  I see mention of multi-node, but only mentions04:17
sdakethetrav lbirty-3 will be full on multinode04:17
sdake3 control + ~100 compute + storage nodes04:18
thetravsorry, what is lbirty-3?04:19
sdakeliberty-3 is the deadline approaching in the next 3 days04:19
sdakeafter that we have liberty-rc1 on 25th - hard deadline04:19
sdakeafter the 25th no new feateures can hit the code base, only bug fixes04:19
sdakeuntil liberty is releaed04:19
thetravahh, ok, openstack liberty04:19
thetravgetting it ready for the summit?04:20
sdakeright04:20
sdakegetting it ready yes ;)04:20
thetravok, so this might be time for me to try it out then... I've got two environments, one with 6 servers one with 14, if kolla can get me 3 control nodes and 3 compute nodes in the smaller env that's a win for me04:21
thetravis it going to run with the kilo codebase or the indev work for openstack liberty?04:22
sdaketheoretically ha shoudl be working well to but atm its busted04:22
sdakeit runs with master of liberty atm04:22
thetravok04:22
sdakeat end of liberty it will run with liberty stable branches04:22
sdakeand then we will run with master of mitaka04:22
thetravso I'm trying to get Telstra to send me to the summit.  I'll try to find the Kolla guys there and chat about it04:23
sdakewe will have design session space04:23
thetravhmm, I'd be flying out on the Friday if it happens04:23
thetravMy plan is to use Kolla or something similar to manage the on prem cloud for: http://www.crn.com.au/News/407398,telstra-opens-software-skunkworks-in-melbourne.aspx04:24
sdakewe have design summit space tue-thur - time is undecided at this point04:24
thetravok, I'll keep an eye on it04:24
sdakeSamYaple have you taken usage of my machine or shall I keep at it04:25
SamYapleoh yea im messing with this now04:28
sdakecool its almost time for me to be human and sleep :)04:28
sdakei have a bad cold or flu or something04:29
sdakei'd recommend sticking to debugging justglane + keystone04:29
sdakenot quite sure why centos behaves differently then ubuntu04:29
sdakethat said it may be an environmental thing04:30
sdakewehre ubuntu also behaves the same way in my environment04:30
*** dims has quit IRC04:30
sdakebut since you and i are about the only two people who have done multinode deploys..04:30
sdakemaybe harmw04:30
SamYapleso one thing ive noticed is that your internal url is pointed to your external address04:31
sdakelet me look - that would explain it04:32
sdakekolla_internal_address: "192.168.1.148"04:32
sdake148 = free network ip on my network04:32
sdakebroked.selfip.net is my public address04:32
sdakeconfig file looks ok?04:32
SamYaple[s8m@bigiron ~]$ openstack endpoint show f90304592bd04ca88910eea1c3b4e1b504:33
SamYaple+--------------+----------------------------------+04:33
SamYaple| Field        | Value                            |04:33
SamYaple+--------------+----------------------------------+04:33
SamYaple| adminurl     | http://192.168.1.148:9292        |04:33
SamYaple| enabled      | True                             |04:33
SamYaple| id           | f90304592bd04ca88910eea1c3b4e1b5 |04:33
SamYaple| internalurl  | http://broked.selfip.net:9292    |04:33
SamYaple| publicurl    | http://broked.selfip.net:9292    |04:33
SamYaple| region       | RegionOne                        |04:33
SamYaple| service_id   | 24935aaca8e24f03858d69a353447e10 |04:33
SamYaple| service_name | glance                           |04:33
SamYaple| service_type | image                            |04:33
SamYaple+--------------+----------------------------------+04:33
sdake        admin_url='http://{{ kolla_internal_address }}:{{ glance_api_port }}'04:35
sdake        internal_url='http://{{ kolla_internal_address }}:{{ glance_api_port }}'04:35
sdake        public_url='http://{{ kolla_external_address }}:{{ glance_api_port }}'04:35
sdakebug in shade?04:36
SamYaplewe dont really use shade04:36
sdakewhere does kolla_keystone_service come from?04:36
SamYaplethis is keystone v2 nonsense04:36
SamYaplei wrote that04:37
sdake                                                   internalurl=public_url,04:38
sdake                                                   publicurl=public_url,04:38
sdakethere is a bug there04:38
SamYapleyup04:38
SamYaplethats not the issue though04:39
*** weiyu has quit IRC04:39
sdakenice diagnoitics hope thats it04:39
SamYapleits not04:39
SamYapleor i take that back04:39
SamYapleit _could_ be04:39
sdakecan you fix and build that container04:39
SamYaplebut that would be your routers fault04:39
sdakeif it is - win :)04:39
thetravso right away it appears to want to publish to docker hub04:40
*** diga has quit IRC04:40
thetravcan I tell it with .buildconf to push to a private repo?04:40
sdakelike i said you can't use openrc from my hardware ;)04:40
sdakedot fiesl are gone thetrav04:40
sdakeuse -n to set the repo name04:41
thetravthey still exist in the docs04:41
thetravhttps://github.com/stackforge/kolla/blob/master/docs/image-building.rst04:41
sdakee.g -n 192.168.1.100:4000/kollaglue will publish to my local registry04:41
sdakeyes docs are out of date04:41
sdakethat one in particular04:41
*** diga has joined #kolla04:41
sdakewtb peole to improve doccs ;004:41
thetravsure, it's just that you told me to follow the docs and give feedback :P04:41
sdakeoh right04:42
thetravI will use -n04:42
sdakei dont know if that iwll let you push to a privat registry04:42
sdakebut it will let you push to your own registry server04:42
thetravyeah that's what I meant04:42
thetravso tools/build-all-docker-images --release --push -n docker.ipc.telstra.net:5000/ should work? or am I setting a namespace, and require /kollaglue04:43
sdaketools/build.py is what your after04:44
sdakeour imge building docs are completely out of date04:44
sdakeyou ned a namespace as well - such as telestra04:44
sdakeor kolla04:44
sdaketools/build.py -n docker.ipc.telstra.net:5000/kolla04:45
SamYapleidk sdake i updated the endpoints in the db same behaviour04:45
thetravso /tools/build.py --release --push -n docker.ipc.telstra.net:5000/kolla04:45
*** achanda has joined #kolla04:45
thetravdoes --release and --push happen anymore?04:45
sdaketools/build.py --push -n docker.ipc.telstra.net5000/kolla04:45
sdakethere is no --release anymore04:45
SamYaplemy only suggestion right now is to limit the active haporxy servers to one like galera04:45
sdakerun build.py -h to see docs on build04:45
thetravahh, I need to pip install some stuff first04:45
sdakeSamYaple that doesnt really get at the source of the problem though04:47
SamYapleno04:47
SamYaplei think there is something up with delorean04:47
SamYaplethere are no errors04:47
SamYapleand glance appropriately fetchs credentials04:47
sdakeif you read that ask.openstack article they imply that it has to do with haproxy killing connections prematurely04:48
SamYaplewhich just isn't correct04:48
SamYaplewhen shutting down the other 2 glance servers it works 100% of the itme04:48
sdakenot sure what one has to do with the other04:49
SamYapleglance is saying bad token when it clearly fetched a good one?04:50
sdakekeystone says it can't find the token in the registry04:50
sdakerather in the db04:50
SamYaplehavent seen that error anywhere04:52
sdakeif it always is syncornized throug hone glance server there is never a condition where glance wont know what the toekn is04:52
sdakeits in my logs04:52
sdakein the bug tracker04:52
sdake"RBAC token not found"04:52
SamYapleit may be in the bug tracker, but its not in your logs04:52
sdakedocker exec keystone cat /var/log/keysotne/keysotne.log04:52
SamYapleim watching it right now dude04:53
SamYaple2015-08-31 21:54:40.310 11 DEBUG keystone.common.controller [-] RBAC: Authorization granted inner /usr/lib/python2.7/site-packages/keystone/common/controller.py:16104:53
sdakewell it was earlier - i saw it with my own eyes ;)04:53
SamYapleand yet it said authorization failed04:53
sdakewhich node is that on, i want to see the log output04:53
sdakedid yu turn on debug mode or something04:53
SamYapleyup04:55
SamYaplerestarted the container and now its saying "Auth token not in the request header"04:57
sdakekeystone is only running on 101 atm04:57
SamYaplesometimes04:57
SamYapleyes04:57
sdakeglane image-list still fails04:57
SamYaplei know04:58
sdakese if glance -d image-list is helpful to you04:59
SamYaplei always run with debug05:00
sdakei got an error 50005:00
sdakehttp://ur1.ca/nmm5k -> http://paste.fedoraproject.org/261839/8372614405:02
*** unicell has joined #kolla05:02
sdakeseems to imply mariadb is boned in some way05:02
sdakesince that i the only dependency in the call path05:02
SamYaplewhy on earth would oyu think that a 500 client error had anything to do with the database?05:03
SamYapleyou know the client never talks to the database right?05:03
sdakeyes05:03
SamYapleif ther eare no database complaints in the service logs, its not the database05:03
SamYapleand since there arent... its not hte database05:04
sdakeif you run 1 glance and 1 keystone does it work?05:04
sdakein other words killign off the 2 ha keystone and glance services?05:04
SamYapleit works if i run just 1 glance server05:05
sdakedoes the haproxy roundrobin to the other glance servers?05:05
sdakeor does it pick one and stick to it05:07
sdaketry 3 glance 1 keystone - that eliminates glance as the source of the problem05:10
SamYaplethat doesnt work05:10
sdakedoesn't work at all or produces authentication failure?05:11
sdakebasically with 1 glance 3 keystone, you still are not serializing all db access thorugh keystone05:12
sdakeanything that uses glance would be serialized05:12
sdakebut with 1 keystone, db would be serialized05:12
*** inc0 has joined #kolla05:15
sdakeyo inc005:15
inc0good morning05:15
sdakeinc0 you have done multinode deployment haven't you?05:16
inc0I did, but with just one controller and lately we had lots of problems05:16
inc0but shoot05:16
sdakeoh doing multicontroller - no bueno05:16
sdakedid you get authentication failures with services like glance?05:17
inc0yup05:17
sdakeinc0 https://bugs.launchpad.net/kolla/+bug/149077805:17
openstackLaunchpad bug 1490778 in kolla "multinode deploy results in intermittent authentication failures" [Critical,Triaged] - Assigned to Steven Dake (sdake)05:17
inc0keystone is busted05:17
sdakepretty sure its db not keystone05:17
inc0well, yeah05:17
inc0keystone failed to add user05:18
inc0I'd guess because we didn't set up keepalived+haproxy properly05:18
sdakewere you going to wiat for the apocoplyse to report this? :)05:18
inc0no, we were trying to fix it;) it's from Friday05:19
sdakeinc0 your using ubuntu or centos05:19
inc0centos05:19
sdakewhat have you tried05:19
sdakeread that bug log plz05:19
inc0yeah, I didn't go that far, I've just started to suspect ks05:21
inc0keepalived*05:21
inc0I'll work on it today05:22
sdakethis bug is holding up the release05:23
sdakewhat would keepalived have to do with it inc005:24
inc0well, problem is keystone failed to add it's own user05:25
inc0so I05:25
sdakei can run keystone endpoint list in a loop and it never fails to authenticate05:25
sdakerther openstack endpoint list05:25
inc0so you're futher than I was05:25
sdakethis is just using master with a 3 node ha setup05:26
inc0ok05:26
sdakebut glance image-list in a loop fails with authentication problems05:27
*** weiyu has joined #kolla05:27
sdakehow do the two differ from a haproxy + keepalive perspective?05:28
inc0well they shouldn't05:28
inc0in your openrc you point out to keepalived right?05:28
sdakedoes keystone cache tokens?05:28
inc0what do you mean?05:29
inc0by default ks keeps tokens for 2hrs afair05:29
sdakei pointed to my external endpoint which NATs to my VIP address05:29
sdakedoes it re-read the database if it gets a rquest for a  token it doesn't have?05:29
inc0do you have glance user in database?05:29
sdakei.e. is there a cachin problem05:29
inc0ah not like that05:30
sdakeno idea SamYaple has taken over my dev box ;)05:30
inc0it checks db on every request I think05:30
sdakedoes haproxy roundrobin ortarget one specific server?05:30
inc0its rr by default05:31
inc0however, we didn't get haproxy working right?:)05:31
inc0SamYaple, around?05:31
sdakehuh?05:31
inc0this problem with "no key ansible_eth1" is solved?05:31
SamYaplesorry went food05:34
inc0one you ran into last week05:34
SamYaplenot fixed yet inc005:35
SamYaplefixable, just have to implement it05:35
SamYaplewont be a problem05:35
inc0how do you want to fix it?05:35
SamYapleive got a couple of ways, im going to try them out and see which one works best05:35
inc0do share plz;)05:36
sdakeinc0 there is a workaround05:37
sdakeset all nodes to the same for control network etc05:37
sdakefor the moment use that05:37
sdakewe need to stay focused on the fact that multinode deploy is busted atleast on centos05:37
inc0yeah, let me get to office and I'll try to deploy it05:38
inc0I should have fresh images05:38
sdakeyou need to rebuild with master05:38
sdakesome bug fixes hit the repo which fix the mariadb problem05:39
inc0ok, thanks for heads up05:39
inc0I'll focus on this problem today05:39
inc0afk for 1hr05:40
sdakei'd like to tag and release asap05:40
sdakeso lets get this one solved05:40
sdakethe keytone docs say to setup keystone with a memcache server05:41
sdakeand that keystone on ubuntu ships with sqlite as the database of choice05:42
SamYapleignore those05:42
SamYaplethey are tainted with OSAD05:42
SamYapleyou absolutely do not need a memcache server05:42
sdakei dont particuarly care at this point how its done as long as it works ;)05:43
sdakemy point is maybe that is why ubuntu behaves differently then centos05:43
SamYaple....05:43
SamYapleit installs from source05:43
SamYaplewhy do you think openstack installed from source would have a default per idstro05:43
SamYaplethat entire guide is talking about cloud-archive packaging05:44
SamYaplebesides that entire thing is controlled by configuration options05:44
sdakeya05:44
*** inc0 has quit IRC05:44
sdakeinstall from centos uses two configuration files05:45
*** SourabhP has joined #kolla05:46
sdake2015-08-31 20:41:50.317 1 DEBUG glance.common.config [-] config files: ['/usr/share/glance/glance-api-dist.conf', '/etc/glance/glance-api.conf'] log_opt_values /usr/lib/python2.7/site-packages/oslo_config/cfg.py:221905:46
sdakeglance-api-dist.conf -> http://ur1.ca/nmmia -> http://paste.fedoraproject.org/261850/4410864705:48
sdakecould be trigger of the problem05:50
sdakedoes ubuntu read in a defualt api-dist.conf file?05:50
sdakesamyaple try running glance without that glance-api-dist.conf garbage05:51
sdakethat is a huge delta05:51
SamYapleyea im doing that now05:51
sdakeprobably want to do a full clean and deploy on the deployment nodes ;)05:52
SamYaplei rekicked the glance containers05:52
SamYaplethis is a test of upgrades05:54
SamYaplewhat is that05:54
SamYaple'this is a test of upgrades05:54
SamYaple'05:54
sdakei just added an echo05:54
SamYaplethats cool05:55
sdakeduring a demo which failed05:55
SamYapleyou said this was master though05:55
sdakeit was earlier today05:55
SamYapleok05:55
sdakeit was, but i did reuibld glance05:55
sdakeforgot about that05:55
sdakeeverything else is master05:55
sdakethis problem could explain why heat is busted on centos too05:56
SamYaplei said it was delorean packaging a long time ago :)05:56
sdakeyou did05:57
SamYapleim testing that though before laying final blame05:57
sdakeand you would be correct if this fixes it :)05:57
SamYaplei think you should trust me more05:58
*** weiyu has quit IRC05:58
sdakeyou hae root on my box dude05:58
sdakei'm not sure what else you want ;)05:58
SamYaplevalid point05:58
SamYaplecarry on05:58
SamYapleyouve convinced me05:58
*** dims has joined #kolla05:59
sdakewhatever the case is we need to get rid of that dist crap in the delorean packaging06:00
sdakekolla is responsible for config ;)06:00
SamYapleyea there is some delorean crust in the ubuntu-source ontainer too06:01
SamYaplehavent had a chance to remove06:01
*** dims has quit IRC06:04
*** sdake_ has joined #kolla06:05
SamYaplestill nothing but also no errors06:06
SamYaplenone of the bugs you posted share erros wtih what youre seeing on the server side06:06
SamYapleserver side i see " Auth token not in the request header. Will not build auth context. "06:06
SamYaplebut that makes me blame the glance client06:06
SamYapleidk man i gots other stuff to do at the moment06:07
SamYaplelet me check ubuntu and verify its not hte same behaviour06:07
SamYaplealso sdake you need to update teh kolla namespace move patch06:08
sdake_yup waiting for the rest of the reviews to be approved for project-config06:08
*** sdake has quit IRC06:08
SamYaplecool06:08
sdake_the move is the 11th06:09
sdake_so I have time06:09
sdake_i've got one more for removal of the f21 gate06:09
SamYaplewhen will the ubuntu gate be active?06:11
*** Haomeng|2 has joined #kolla06:12
*** sdake has joined #kolla06:13
sdakesamyaple if your done debugging for the moment let me take over for a bit06:13
sdakei have a few things i want to try06:13
SamYapleyea go for it06:13
SamYapleyou may wany to rekick at least glance06:13
sdakei am gong to destory it all06:15
*** unicell has quit IRC06:15
*** sdake_ has quit IRC06:15
sdakekeystone also has a default dist file06:15
*** unicell has joined #kolla06:15
SamYapleyea i saw06:15
SamYaplenot sure it thats whats going on but thats why im going to check ubuntu when i get a chance06:15
SamYapleto ensure that from source pure works06:15
*** Haomeng has quit IRC06:16
SamYaplehey sdake in regards to ansible modules, i dont want to keep having the same arguemnets over and over again06:16
SamYapleansible will always import *06:16
SamYapleit is required06:16
SamYaplelets drop this06:16
sdakehuh?06:17
sdakeI am pretty sure i agreed with you on that point06:17
SamYaplesdake: im saying formally, its coming up again06:17
SamYaplehttps://review.openstack.org/#/c/218570/6/ansible/library/bslurp.py06:17
SamYapleim not pointing the finger at you06:17
*** weiyu has joined #kolla06:22
sdakecan't we just use the openstack client instead of shade for keystone v3 stuff?06:27
SamYapleagain, we dont use shade irght now06:28
SamYaplejust because i import shade doesnt mean im using it06:28
SamYapleif you notice im calling the keystone client _within_ shade06:29
SamYaplethe only thing shade is doing is a bit of authentication abstraction06:29
SamYapleand a few other nice things06:29
SamYaplebut in the end im using the keystone client directly06:29
SamYaplei can do v3 right now, i just hae to do it06:30
sdakethe less dependencies a system has the better06:30
sdakeif we don't need shade no reason to import it ;006:30
SamYaplei just said we do need it, i said we dont _use_ it (as in the keystone part of shade)06:30
SamYapleit still sets stuff up06:31
SamYaplethis reduces the lines of code i eed from 300 to like 5006:31
SamYaplebut im still calling keystone directly06:31
*** SourabhP has quit IRC06:35
*** inc0 has joined #kolla06:35
inc0back06:36
openstackgerritHengqing Hu proposed stackforge/kolla: Expose exception objects via the as keyword  https://review.openstack.org/21875706:37
sdakesamyaple can you tell me if ubuntu has defualt config options it runs06:37
SamYaplehow do you mean?06:38
SamYapleopenstack itself has default options06:38
sdakei mean default config files it loads up from packaging06:41
sdakelike delorean does06:41
sdakeand do you know how they do that in the packaging?06:41
SamYaplei dont know. probably. i havent done a cloud-archive install in about a year06:41
*** tfukushima has joined #kolla06:47
coolsvapSamYaple, the ceph review will fail with pep8 untill you change the license imho06:49
sdakeSamYaple can you paste your diff for fixing glance pleae06:50
sdakei want to see how you fixed the paste deploy problem on centos06:50
SamYaplesdake: i never fixed it06:51
SamYaplesdake: i gave up and said i would check ubuntu to ensure that isnt happening06:52
SamYaplethen we can revisit06:52
SamYaplecoolsvap: yea ill change the licensing06:52
SamYaplethat stuff is copypasta from my yaodu modules06:52
sdakeok well i am fixing now, can you look at what defualt file is loaded on ubuntu06:52
SamYapleagain, I dont know what the ubuntu packaged defaults are06:53
sdakefrom source06:53
sdakelook at the logs06:53
sdakesee the glance-api.conf line06:53
sdakeit tells you which stuff is loaded by default06:53
SamYaplei se what you mean06:53
SamYapleill look at that when i rekick06:53
sdakeclearly it is loading somehting  extra06:53
sdaketo pull in the psate deploy file06:54
SamYapleif it is doing any hackery, it owuld be in /usr/bin/glance-api06:54
SamYapleathat isn't provided by the source packaging06:54
sdakewhere does that come from?06:54
SamYaplethe centos packaging has thier own script i mean06:54
SamYaplei believe06:54
SamYaplei know ubuntu did06:54
sdakeubuntu start.sh still calls glance-api06:55
SamYapleubuntu packaging06:55
SamYaplecloud-archive06:55
sdakeno, from source06:55
sdakeour start.sh calls the same binary06:55
SamYaplethat doesnt mean is the same binary06:55
SamYaplewhere is the disconnect06:55
sdakepossibly not06:55
sdakeit has the same name, i'd like to see what is in it06:56
SamYaplei mean the ubuntu cloud-archive (binary) had a special glance-api06:56
SamYaplecentos might as well06:56
sdakeand what it loads by default06:56
sdakeok, build from ubuntu source creates a glance-api06:56
sdakebeecause start.sh calls it06:56
SamYapleyes06:56
sdakewoul be helpful to see ap aste of that file06:57
sdakeand the logs it generates06:57
sdakebecause it is definately loading in some defaults that are not in the configuration file06:57
sdakebeause if i run glance with --config-file /etc/glance/glance-api.conf06:57
sdakeit looks for a paste deploy section06:57
sdakei assume ubuntu would do precisely the same hting so my change would break ubuntu06:58
SamYapleit wouldn't06:58
SamYaplethat paste file is in /usr/share/glane06:58
SamYapleit (by default) looks for it in /etc/glance/06:58
sdakeok well it is oading the config for it06:58
SamYaplethats where it lives on ubuntu06:58
SamYapleanyway i gotta stay focused for a bit06:59
sdakejust paste the log and glance-api file plz06:59
sdakenext time you deploy06:59
sdakewhih hopefully is soon06:59
openstackgerritMick Thomspon proposed stackforge/kolla: Add ansible support for Murano  https://review.openstack.org/21889407:01
*** slagle has quit IRC07:06
*** achanda has quit IRC07:11
*** achanda has joined #kolla07:17
*** chandankumar has joined #kolla07:24
*** achanda has quit IRC07:25
*** achanda has joined #kolla07:26
*** achanda has quit IRC07:30
*** kjelly has quit IRC07:35
*** shardy has joined #kolla07:35
*** kjelly has joined #kolla07:36
*** thetrav has quit IRC07:40
kjellyHi all, I try to use ansible to deploy openstack. And it gives me the error, unable to connect to database, check login_user and login_password are correct or ~/.my.cnf has the credentials. Exception message: (1130, \"Host '172.22.2.81' is not allowed to connect to this MariaDB server07:40
kjellyWhat's wrong for me?07:40
*** tfukushima has quit IRC07:42
*** tfukushima has joined #kolla07:42
*** tfukushima has quit IRC07:44
*** tfukushima has joined #kolla07:45
*** tfukushima has quit IRC07:47
*** tfukushima has joined #kolla07:48
*** tfukushima has quit IRC07:49
*** tfukushima has joined #kolla07:50
inc0kjelly, check if your mysql binded to correct ip07:52
inc0however, also try to rebuild mariadb images because there were bugs fixed this night07:52
kjellyinc0: Ok. Thank you for suggestion.07:54
sdakeSamYaple so built ubuntu keystone and ubuntu glance api and glance registry08:03
sdakeglance image-list still producing identiy errors08:03
sdakeso pretty sure its not the openstack packaged services that are at fault - although the configs are indeed different08:04
openstackgerritSteven Dake proposed stackforge/kolla: Register internal_url properly  https://review.openstack.org/21917108:10
sdakekjelly the reason that is happeing if yur doing multinode deployment on centos + binary is because mariadb was busted before t-12 hours ago08:13
sdakeif your not doing multinode deployment, I have no idea why that is happening - docker logs mariadb should help you understand whats happening08:13
SamYaplesdake: I am doing multinode08:16
SamYaplebut not right now08:16
SamYaplereal work calls08:16
sdakeok08:16
sdakeif you could confirm glance image list in a loop works that would be great08:17
sdakejust type it in irc08:17
SamYapleno problem08:17
sdakeor if it doesn't work type that in irc too08:17
SamYapleif oyu say its broken it probably is08:17
SamYaplewell figure it out08:17
sdakei am preparing to deploy full ubuntu on centos kernel08:17
sdakehpeofully that works08:17
sdakebuilding now08:17
sdakeinc0 have you thought of using a data container for logging08:19
*** jmccarthy has quit IRC08:19
sdakein other words everything goes to /var/log and the data container exports /var/log to all containers in the system08:19
kjellysdake: I use all-in-one node. Now I try to rebuild images.08:19
sdakeall in one didn't have this problem kjelly08:19
sdakekjelly make sure to use mater08:19
sdakemaster08:19
SamYaplesdake: there isn't a single data container08:19
inc0also, I don't want to multiply containers08:20
SamYaplebut my idea was a loging data container so all hosts can see all logs08:20
inc0later rsyslog will forward logs instead of writing to file08:20
*** jmccarthy has joined #kolla08:20
SamYapleinc0: actually it will do both08:20
SamYapleit will write to disk and forard for logstash08:20
SamYaplelocal copy never goes away08:20
inc0well, as long as we add log rotation there08:21
SamYapleyea08:21
sdakeso my idea is one data contianer per host that goes to one data container per deplooyment08:21
sdakethat way you don't have to hunt down a million different containers to find the log files08:21
openstackgerritMerged stackforge/kolla: Expose exception objects via the as keyword  https://review.openstack.org/21875708:21
inc0sdake, my question is...do we really need yet another container?08:21
kjellysdake: I see. Maybe my envirement have another problem. I will use new envirement to try it.08:21
SamYaplethe perdeployment thing sdake is central logging, and thats mitaka at earliest08:21
inc0it lands on volume, so it's visible by host08:22
sdakethe container contains all /var/log for the entire node08:22
sdakeso yes, we do need another contianer for that08:22
sdakehunting down these log files is a real chore08:22
inc0sdake, new patchset will create /var/log/kolla08:22
sdakeand that contains what?08:23
inc0and when we'll add logging to services, we'll add logging per service as well08:23
inc0so it will be nova-all, neutron-all08:23
sdakeand that goes in what, a data continaer mounted by all the different containers on the host?08:24
inc0each host will have /var/log/kolla/nova-all.log and there where logs from this hosts nova is08:24
sdakewe need logs seprated by service08:24
SamYaplewhy are you combining logs inc008:24
SamYapleyea08:24
sdakenot jamming all nova logs in one file08:24
inc0SamYaple, well, that's just templating we're going to do08:24
sdakecombinging logs is a different services job08:24
SamYapleyea all the logs need to spit out per service08:25
SamYapleany compining parsing is done later08:25
inc0ok08:25
SamYaplethats teh central logging blueprint08:25
sdakeput in a data container08:25
sdakeit will make operators life much easier08:25
SamYaplelies08:25
SamYaplebound to host makes operators livfe easier08:25
inc0I agree with Sam08:25
sdakeyes one data container for the host08:26
inc0it's almost automatic to look at /var/log08:26
sdakenot one per service08:26
inc0not to enter some container08:26
SamYapleno one is doing one logging container per services sdake08:26
sdakewe are not doing a bindmount for logging08:26
SamYaplesdake yes we are08:26
sdake-2 inc08:27
inc0oO08:27
SamYaplejust do it inc0 ill show you how to make it optional08:27
sdakecontainers do not write to the host filesystem in kolla08:27
SamYaplesdake: its happening this was talked about in the midcycle08:27
sdakeand we aer not going to start08:27
SamYapleworst project ever if we dont08:27
sdakeyou can do the work, i can -2 it, big waste of time08:27
sdakeput in a data container08:27
SamYapleget ready to kill the project then sdake08:27
inc0wow that escalated quickly08:28
sdakeya sam likes to escalate :)08:28
sdakeSamYaple i'm pretty sure your overreacitng08:28
inc0sdake, but really I'm kinds in favor of Sam's approach08:28
sdakebut we are not writing data from the container to the host os08:28
inc0ops will look for logs there08:28
SamYaplesdake: we are, as discussed in the midcycle08:28
SamYaplebut as i said it will be optional08:28
sdakewhich part of we are not writing data from the contianer to the host os are you not hearing08:29
inc0if logs won't be there, they'll panic08:29
SamYaplesdake: its happening08:29
sdakelike i said, -2 inc08:29
inc0and only after they stop panicking they'll start thinking and look at containers08:29
sdakewe are not writing data from the container to the host os08:29
sdakeits a huge security problem to do so08:29
sdakenot going to happen08:29
SamYaplesdake: stop. we talked about this. it will be optional. but it _will_ be an option08:29
inc0soo...I'm off to a meeting08:30
sdakeif you want o make bindmounting optional that is fine, but it needs to come from a data container so when the option is turned off by default08:30
inc0in the meantime, please sort it out08:30
inc0and let me know08:30
sdakethere is one place to look for logs08:30
sdakei just did sort it out inc008:30
inc0bb in 1.5h08:30
SamYaplehow many times did i say optional sdake?08:30
SamYapleseriously. read08:30
sdakei want it in one place08:30
sdakewhen its optinoally turned off08:30
SamYaplethats what was discussed08:30
SamYaplelike i said inc0 do it08:30
sdakei can live with an optional bindmount  if its in one place08:31
SamYapleinc0: ill show you how to make it optional08:31
sdakeas long as the data is in a bindmounted data container08:31
SamYaplethats not how bindmounts work, but dont worry because its _optional_08:31
sdakeso people can look in one place for the logs assuming the option is turned off by default08:31
*** inc0 has quit IRC08:35
openstackgerritJames McCarthy proposed stackforge/kolla: Swift WIP - add some missing services, also ansible configuration  https://review.openstack.org/21879508:40
*** dims has joined #kolla08:42
*** dims has quit IRC08:48
kjellyhttps://github.com/stackforge/kolla/blob/master/docs/image-building.rst  The document seems to be out of date. Could I update the document ?08:56
sdakeplesae do update kjelly08:59
sdakeare you familiar with the openstack workflow?08:59
kjellysdake: I know. I contributed code to horizon ever.09:01
sdakence ;)09:01
*** vbel has joined #kolla09:02
vbelgood morning and evening :)09:03
sdakehey vbel09:03
sdakehey quick09:03
sdakeq09:03
sdakei noticed the footer was after CMD09:03
sdakedoes that still work for the proxy building use case?09:04
vbelyes09:04
sdakegood news eveyrone our docs job was merged09:04
sdakeso soon our docs will be published to docs.openstack.org09:04
vbelit also was easy to add to all containers :)09:04
sdakethe bad news is our docs are in bad shape ;(09:05
*** dayou has quit IRC09:05
vbeli am going to fix Dockerfile.j2 for novnc -  debian, ubuntu AND oraclelinux don't have it as binary and should install from git. I wonder if the correct approach would be to get it from git for all distros for source build type09:06
sdakesamyaple heads up the ubuntu source job has merged but has gate failures: https://review.openstack.org/#/c/218757/09:06
kjellysdake: Should I post a bug in launchpad for update document ?09:07
vbelcurrently rpm based distros get novnc from rdo for source builds09:07
sdakevbel we don't implement binary ubuntu or debian09:07
sdakekjelly docs do not require bug ids09:07
sdaketo lower the bar to encourage people to submit inremental updates ;)09:07
vbelI mean the section is for 'source' but rpm dists get it from rdo09:08
kjellysdake: Ok. I see.09:08
vbelhttp://paste.openstack.org/show/437223/09:09
sdakevbel looks ok to me?09:10
vbelone problem is that we don't use rdo for source based oraclelinux as much as we can, another is that we could generalize things and fetch novnc from url/git09:10
vbelthe same way we do for nova/cinder/neutron ..09:10
sdakeoraclelinux binary = rdo + oraclelinux / oraclelinux + source = build from source09:11
vbelyes, I am ok with binary, but source could be really 'source' :)09:12
sdakeoh right line 1309:13
sdakeya that should be a binary conditional09:13
vbelwe have binary above with the same stuff, but here we could just leave cross-distro ADD09:14
*** zhiwei has joined #kolla09:14
sdakefile a bug submit a patch cr's will review it09:15
vbelyep, will do09:15
sdakeSamYaple full source deploy from master yields same problem with glance image-list repeated in a loop09:17
sdakefull ubutnu source deploy09:17
sdakecentos kernel09:17
sdakevbel have you done any multinode deploys?09:18
*** inc0 has joined #kolla09:18
vbelsdake: so far without haproxy09:18
sdakevbel using what in its place?09:18
vbelsdake: but I did check haproxy and keepalived working09:18
inc0sdake, SamYaple soo...I'm making logging container with data container, and if we want bindmount, let's do it in another patch ok?09:19
vbeljust no proxy :)09:19
sdakeinc0 sounds good ;)09:19
sdakei'm +2 on a bindmount patch if it comes from a data container09:19
sdakeand its optional09:19
SamYaplesdake: thats not how bindmoutns work09:19
sdakeeach service uses volume_from to mount data container09:19
sdaketht data container bindmounts /var/log09:20
sdakethen we can expand the dat container to log via logstash or whatever forwarding we deicde on later09:21
inc0sdake, my question....is there anything against rsyslog container being data container as well?09:21
sdakeinc0 nope09:21
inc0so we can just make it single container?09:21
sdakeyes09:22
inc0so if I'll make bindmounting /var/log optional09:22
inc0it works?09:22
inc0for you?09:22
sdakefrom singel container where all the individual volume_froms on a node access yup09:22
inc0no other container even needs volume from09:22
inc0I mean, nova doesn't even need to have access to /var/log09:23
sdakei'd like the files stored on disk09:23
inc0it will be in rsyslog09:23
inc0container09:23
sdakei want rsyslog to read from files on disk not sent by openstack services via syslog09:23
inc0nova will have volume on /dev/log, but /var/log doesn't have to be one09:23
inc0hmmm...why?09:24
sdakesyslog blocks09:24
sdakesyslog in nonblocking mode loses data09:24
sdakei dont want openstack services to block on logging09:24
sdakebeen doing diags for 20 years09:24
inc0rsyslog too?09:24
sdakeyou will just have to trust me this is the best way to proceed09:24
inc0allright09:24
inc0well...hmm09:25
sdakei'm ok on logging blocking on write to disk not wirte to network09:25
inc0currently it's only writting to disk09:25
sdakewhat i mean is dont setup the service to log directly via syslog09:26
inc0yeah, I understand09:26
sdakeservice/services09:26
inc0unless service doens't know how to log to fiel09:26
inc0file09:26
sdakethat is a special circumstance09:26
sdakenothing we can do about that09:27
sdakelets not optimize around the one or two services tht cant log to file09:27
* inc0 thinking if every service will write to log in syslog-readable way09:28
sdakeif not, lagstash ftw ;-)09:28
inc0well, anyway, this will be approached case by case09:28
sdakelets take baby steps get em logged to files first ;)09:28
inc0sdake, it has to be understood by syslog because it's syslog what will forward it to logstash09:28
sdakelogstash-forwarder?09:29
SamYaplesdake why exactly are the services going to be logging to a file again?09:29
SamYapleand if so, why do we need syslog at this time?09:29
SamYapledo you know how this is being setup?09:30
sdakei am not sure we need syslog09:30
SamYaplejeez if you just want the logs to hit the disk why are we doign all of this thne09:30
sdakei'm suffering deja vu because i've had this same conversation in like 10 other programs09:30
SamYaplethis would have been good information to have during all of those logging conversations you particpated in09:30
sdaketook me awhile to process09:31
SamYaple09:27 < sdake> lets not optimize around the one or two services tht cant log to file09:31
SamYapleso the opposite of what you said?09:31
SamYapleeverything now logs to files09:31
sdakei thought inc0 said some services only log to syslog09:32
SamYaplecorrect09:32
SamYapleand some only to files09:32
sdakethose are the one or two services iw as referencing09:32
SamYaplebut you want just files09:32
SamYaplewell you get no keepalived or haproxy logs at all now09:33
SamYaplewhich is bad enough09:33
sdakesyslog blocks - to operate syslog in noblocking mode it has to lose data09:33
openstackgerritVladislav Belogrudov proposed stackforge/kolla: Source based builds should get novnc from tarball (git/url)  https://review.openstack.org/21920409:33
SamYaplebut inc0 migh as well scrap that rsyslog container that would be needed09:33
SamYaplesdake: writing to a file blocks as well :/09:33
sdakeinc0 my apologies on any time you have wasted thus far09:33
sdakeyes writing to files blocks for short periods09:34
sdakewriting to network blocks for long periods09:34
SamYapledude. syslog isnlt talking over the netowkr yet09:34
sdakethere in't much way to get around blocking except to used memory mapping09:34
SamYaplethats central logging09:34
SamYaplei dont think you know whats going on09:34
sdakei really want to tag a release, meaning i need to focus on fixing the fact that multinode deployment just plain doesn't work09:36
sdakeso no, I don't have time to go dig into the work already completed at this time09:36
*** dims has joined #kolla09:36
SamYapleso maybe dont comment on it?09:37
sdakei dont get to talk to inc0 very often09:37
sdakei wanted to sync up with him09:37
SamYapleso confuse the hell out of him when you do09:37
sdakehence the comments09:37
SamYaplebut you dont know whats happening and are trying to redirect him09:37
sdakeinc0 are you confused?09:37
SamYapleyoure kidding right sdake? What do you think the current direction is? do we have a syslog container or not? does each service write out its own file? or write it to the syslog container?09:39
sdakeeach service writes to its own file09:40
vbelgentlemen, we are near release and want to add syslog right now?09:40
SamYaplesdake: thats not what we are doing right now. and thats not what were going to do09:40
SamYaplesyslog isnt talking over the network, its over the /dev/log device.09:40
sdakevbel i agree, I am trying ti simplif y things09:40
sdakethis is one area of simplification09:40
*** dims has quit IRC09:41
SamYaplelies09:41
sdakei would prefer not to have a syslog service running at all tbh09:41
SamYaplesyslog was already identified as a requirement09:41
sdakei would prefer a data container09:41
SamYaplewe need it before central logging09:41
SamYaplethis was teh path to central logging which you said you wanted not 15 minutes09:41
sdakebut we dont need it before rc109:41
SamYaplethis is how all the logging was going to be written to a file09:42
SamYaplethis was teh design09:42
sdakehow does the data get into syslog?09:42
SamYapleas stated by myself and inc0 several times, over /dev/log09:42
SamYapleaka normal syslog way09:42
vbeland you want to consolidate those from all the hosts via rsyslog?09:43
SamYapleno09:44
SamYaplesyslog per host09:44
SamYaplecentral logging is mitaka09:44
SamYaplesyslog is needed for central logging09:44
SamYapleso we do syslog now, no remote stuff09:44
SamYaplethen we add remote stuff in mitaka09:44
sdakeagenda item for wednesday's meeting09:45
vbelI see. Wonder if docker itself does not have something similar already09:45
sdakeproblem solved09:45
SamYaplethe worst part is sdake, you were in the conversatoins were we were discussing this and now you are comming in at the eleventh hour this stuff09:45
sdakevolumes from, make each service log to /var/log, take 4-8 hours to implement09:46
SamYapleno joke. super easy09:47
SamYapleany yet whats in the queue is almost done as is09:47
SamYaplebecause we have been working on it for a while09:47
SamYaplebut you come in and dont understand the situation and start making decisions about things09:47
vbelwhy not to do this - https://docs.docker.com/reference/logging/overview/09:48
sdakemultinode deploy doesn't work, syslog log perfection is not really high on the priority list for the project09:48
SamYapleagreed. and yet you tried to control it for some reason09:48
vbel--log-driver=syslog09:48
vbelimplemented :)09:49
SamYaplevbel: docker logging is... not so good. But it was discussed. loogging in 1.8 got alot better so maybe we can come back to it09:49
SamYaplethe real issue is some thing will not log to stdout/stderr09:49
SamYapleand in those cases we cant use that09:49
sdakelooking for a fast simple solution to any of our blueprints or bugs for rc109:49
vbelyes, like keystone09:49
sdakewe have 25 days to seal rc109:50
sdakenothing like diags to raise poeople's blood pressure ;)09:53
*** coolsvap has quit IRC10:01
*** coolsvap has joined #kolla10:02
sdakevbel you aid you got multinode deploy to work without haproy10:04
sdakewhat did you use instead of haproxy10:04
SamYaplei setup an F5 with Kolla recently10:05
SamYaplenot to bad, but i dont have a script to generate the config for the F5 yet10:05
sdakesamyaple havey ou been running without haproxy in your multinode deployments?10:05
SamYaplesome of them10:06
SamYaplei test about 75% haproxy+keepalived10:06
SamYapleeerrr 25%10:06
*** dwalsh has joined #kolla10:10
sdakeubuntu source:10:11
sdake2015-09-01 03:10:14.191648 2015-09-01 03:10:14.191 20 WARNING keystone.middleware.core [-] RBAC: Invalid token10:11
sdake2015-09-01 03:10:14.191773 2015-09-01 03:10:14.191 20 WARNING keystone.common.wsgi [-] The request you have made requires authentication.10:11
inc0sorry, I had ad-hoc meeting- going through logs10:14
inc0ok...guys10:18
inc0let's talk this through tomorrow10:19
inc0I'm going back to multinode fixes, logging is targeted for rc1 anyway10:19
sdakethis is an interesting workaround10:22
sdakeIf i used mariadb in haproxy conf i would run in the exact same problem. Instead i configured a VIP on all galera nodes and pointed all [database] sections in openstack configs to that VIP.10:22
sdakeSo i would try to remove galera cluster from haproxy config and try the VIP route for mysql.10:22
sdakewith this galera doesn't round robin10:23
sdakei am pretty sure the problem is galera writes and reads are racing10:23
SamYaplegalera doesnt round robin now10:24
SamYaplethere are no deadlocks, how oculd there be10:24
sdakeinc0 said earlir haproxy round robins10:24
SamYapleonly one databsae is ever read/write, the others are backup10:24
*** akwasnie has joined #kolla10:24
SamYaplehaproxy does10:24
SamYaplegalera is set to backup10:24
SamYapleas weve been over10:24
sdakeya haproxy roundrobin reads to galera10:25
sdakebased upon what inc0 stated10:25
SamYapleit does not10:25
inc0haproxy round robins by default10:25
inc0to APIs10:25
SamYaplesomeone look at a config please10:25
inc0let me look at config for galera10:25
inc0ah my mistake10:27
inc0for mysql its for backup10:27
SamYapleyep10:27
inc0but just for mysql10:27
inc0I mean maria10:27
SamYapleexactly as it should be10:27
inc0agree10:27
sdakeok making progrss10:28
sdakeinc0 which one is the roundrobin flg?10:28
inc0https://github.com/stackforge/kolla/blob/master/ansible/roles/haproxy/templates/haproxy.cfg.j2#L2310:29
inc0there is if statement10:30
SamYapleyup10:30
inc0and it's marked as backup unless it's first in the loop10:30
SamYaplecorrect10:30
SamYaplejust like i wrote it 6 months ago10:30
*** dims has joined #kolla10:30
*** dayou has joined #kolla10:31
*** tfukushima has quit IRC10:33
sdakeyes i see10:34
sdakegoing to try makign keystone acive backup backup10:35
*** dims has quit IRC10:36
vbelsdake: I used one of hosts' IPs instead of VIP, will be checking haproxy  soon10:36
vbelmysql should not have roundrobin haproxy setting.10:37
*** diga has quit IRC10:37
sdakeit doesn't10:37
vbelsource based routing is recommended10:41
vbelwe don't have it. It is important for db and dashboard10:41
vbelhttp://docs.openstack.org/high-availability-guide/content/ha-aa-haproxy.html10:41
vbelbasically anything that has 'session' should have "balance  source"10:42
SamYaplevbel: agreed on dashboard10:57
SamYapledoesnt matter for database at this time10:57
*** dwalsh has quit IRC10:57
SamYaplewithout source based routing the dashboard wont carry session info and we would have to use memcache10:57
vbelyes, db is smart enough. If you use client you will see "reconnecting" but it does not fail10:57
SamYaplei have a keepalived patch with some haproxy tweaks for this already10:57
SamYaplevbel: well no the db in this case only has one 'active' connections that can server requests10:58
inc0sdake, about multinode...does your rabbit run?11:00
sdakeinc0 yes11:00
inc0freshly built container - INFO - This container has already been configured; Refusing to copy new configs11:00
SamYapleinc0: its probably restarting11:01
inc0yeah it is11:01
SamYapleinc0: change the container restart policy11:01
SamYaplei set it to 'no'11:01
inc0kk let me check what's actual error11:03
SamYaplemutlinode rabbitmq has a race condition for initial cluster startup11:03
inc0yeah that might be it, change rabbit to just one node in inventory?11:04
inc0that's informative... docker logs rabbitmq -> su: System error11:05
SamYapleoh yea vbel reported a su error with docker its a kernel thing11:06
SamYaple iwas unable to reproduce on ubuntu11:06
SamYaplewhats your system info11:06
vbelinc0 - you have bad kernel :)11:06
SamYaplevbel ^11:06
vbelwe do too :)11:06
inc0it's 14.04 cloud img11:06
vbelI use sudo instead of su for db init commands and  run rabbit via sudo -u rabbitmq, otherwise rabbit will try to use su11:07
inc0vbel, you do it in container?11:08
*** weiyu has quit IRC11:08
SamYapleinc0: kernel?11:08
vbelI have CMD="sudo -u rabbitmq /usr/sbin/rabbitmq-server" in my Dockerfile.j211:08
inc0vbel, mind upstreaming it as a patch?11:09
SamYapleinc0: no thats not the way to do it11:09
*** britthouser has joined #kolla11:09
SamYaplewe need ot be dropping permsiisions in the containers11:09
inc0SamYaple, what is way to do it then?11:09
vbelUSER rabbitmq11:10
SamYaplethat would be the correct way11:10
SamYaplethen a special sudoers file that allows us to change the rabbitmq folder perms since the data contaienr will screw those perms up11:10
SamYaplethats the model from yoadu and i have full permissions dropping there11:10
SamYapleit works well11:11
inc0why aren't we doing that then?11:11
SamYapledont look at me11:11
vbeltime and people11:11
SamYapleits a major change that sdake targed for mitaka11:11
SamYaplei wanted t oget it done in RC1 but im not fighting him on everything11:11
SamYaple;)11:11
inc0ok, can we add this workaround till we do?11:12
inc0rabbitmq isn't working by default now11:12
*** slagle has joined #kolla11:12
SamYaplei suggest we do it correctly now11:12
vbelinc: you will get this problem in all other containers where we have 'su'11:12
SamYaplei can submit a patch for this right now11:12
inc0SamYaple, please do11:13
SamYapleinc0: just keep in mind what vbel said11:13
sdakethings that iprove security i wont generally push back on11:13
SamYaplei know sdake but you did say no to it for RC111:13
inc0maybe as full change11:13
SamYapleanyway looks like we have bugs and reasons to address the issue now11:13
inc0but simple container by container11:14
sdakei dont think rc1 is settled yet11:14
SamYaplei am fully familar with how to do this thanks to yoadu11:14
SamYaplei can lead the conversion11:14
SamYapleill submit something for rabbitmq11:14
SamYaplewe can go form there11:14
inc0yeah, keystone is affected too11:15
SamYapleinc0: all containers will be broke for you11:16
SamYaplewe use su in just about every one11:16
inc0that's encouraging11:16
SamYaplesu is what is broked iwth your kernel11:16
inc0sdake, that's not problem you're facing?11:17
SamYaplei just wrote this udev rule to fix openstack in juno11:19
SamYapleKERNEL=="brq*", ACTION=="add", RUN+="/bin/bash -c 'export dummy=taprpc-$(awk -Fbrq\|- {print\ \$2} <<< %k);(sleep 15; ip l a $dummy type dummy && brctl addif %k $dummy)'"11:19
SamYapleawesome i know11:19
inc0I hate operators...11:19
inc0:P11:19
inc0or rather I hate bash, and ops just happen to use it11:20
SamYaplebecause we can actually make things work :D11:20
SamYapleoh comone that is perfectly readable11:20
SamYapleand i only use bash because the default shell is sh11:21
inc0http://devopsreactions.tumblr.com/post/84407988311/scripts-written-by-developers-versus-scripts11:21
SamYapleso useful everyday versus easily broken when you throw a square into the mix ;)11:22
*** banix has joined #kolla11:23
sdakevbel > 900011:24
sdakehttp://ur1.ca/nmp5j -> http://paste.fedoraproject.org/261924/1106685111:25
*** dims has joined #kolla11:25
sdakeglance image-list in a loop works like a champ with above change11:25
*** jruano has joined #kolla11:26
SamYaplethat is not the root cause of the issue though...11:26
SamYapleand you have limited your environment severly11:27
SamYaplekeystone does a whole bunch of requests11:27
*** dwalsh has joined #kolla11:27
sdakein the oifficla ha docs11:27
SamYapleyep11:28
SamYaplemy point still stands11:28
vbeltrue, official ha docs put 'balance source' everywhere :)11:28
sdakeproably because openstack is bust without it11:28
inc0I just wonder why it's even problem11:29
sdakewht does that balane shit do11:29
SamYapleits not11:29
SamYaplenever used it in yaodu11:29
*** dims has quit IRC11:29
inc0it shouldn't be, I mean...meh http is stateless11:29
SamYapleindeed11:29
SamYaplehe is maksing the issue here11:29
SamYaplehorizon is only because of that csrf token and stuff11:30
SamYapleand even then it can balance _if_ you use memcache backend11:30
vbelif you use keystone with localhost memcache you need it11:30
inc0it might mean that onlu one of 3 ks is working and you happen to reach correct one11:30
inc0for example11:30
vbeland localhost memcache is the only secure option for memcache :)11:30
sdakeperhap yaodu didn't really work ;)11:30
*** pbourke has quit IRC11:30
inc0vbel, yeah, we don't do local memcache11:30
sdakeanother option is there is a keystoon option missing from our ansible code11:30
SamYaplelets not insult the project that was working 4 months before kolla11:31
sdakebut i a mrunning ubuntu userspace11:31
inc0sdake, try to call keystone by actual ip, not floating11:31
sdakei was making no insult11:31
sdakeyou said kolla worked, it did not atleast on my hardware or inc0s or vbels11:31
sdakemaybey ou  never tried same test case as I ran11:32
SamYaplewhat is your test case?11:32
sdakeglance image-list in a loop11:32
SamYaplei mean i used it until i replaced it with kolla11:32
SamYapleyea no i actually _used_ it multinode until about 2 months ago11:32
vbelglance image-list in loop is a good test case!11:32
SamYaplei do all my dev in virtualmachines running on Kolla in ha right now11:33
sdakesimple and obvious ;)11:33
inc0https://blueprints.launchpad.net/kolla/+spec/sanity-check-container11:34
sdakewhat exactly does that option do11:34
inc0that was reason I posted this11:34
SamYapleinc0: that probably wont help whiles its running11:35
SamYaplewe do retry loops on all interactoins with teh database11:35
SamYapleapi*11:35
inc0vbel, so you replaced su in every container?11:35
inc0SamYaple, hence monitoring discussion11:35
vbelinc0: yes11:35
*** pbourke has joined #kolla11:36
sdakesamyaple could you explain how that option limits how openstack behaves11:36
vbelinc0: if you can upgrade your kernel you won't need it11:36
inc0vbel, which version is ok?11:37
sdakehey pbourke11:37
SamYaplesdake: if you have three controllers with source balancing potentially all or a large majority of your servers can end up _only_ going to one backend11:37
inc0 3.13.0-54-generic <- I have this one11:37
SamYapleinc0: which version of docker?11:37
inc01.6.211:38
sdakedoes it look at the source ip to balance to the target?11:38
inc0pretty ancient...11:38
SamYapleinc0: try a newer version11:38
SamYaplei tried 1.71 and up with the 3.13 kernels and it was fine11:38
vbelinc: if you use magic backported RH - 3.1011:38
SamYapleubuntu for me11:38
SamYapleoh yea my ubuntu gate is running!11:39
sdakeya i told lyou that earlier11:39
sdakewe should have docs pubhlsihing soon too11:39
vbelmonth ago I had ubuntu lts server (14) and it had this problem too11:39
inc0yeah I'm running ubuntu 14.04 lts11:40
SamYapleinc0: what backend?11:40
SamYaplei think this may only affect AUFS...11:40
SamYaplei cant remember11:40
inc0devicemapper11:40
vbelubuntu desktop has different kernel and it works fine with su in containers11:40
inc0our centos doesn't build on ubuntu+aufs11:40
SamYapleoh well that has all kinds of iisuse for ubuntu11:40
vbeldevicemapper11:40
*** rhallisey has joined #kolla11:40
SamYapleinc0: centos doesnt work on ubuntu anymore11:41
SamYapleif youre on ubuntu you have to use ubuntu-source11:41
inc0oh...11:41
inc0ok..11:41
SamYaplewe removed the nopam hack sdake had11:41
inc0that's sad11:41
SamYapleremoving hacks is never sad :)11:41
inc0removing "it works" is11:41
SamYapleit broke ceph11:41
SamYapleceph > hack to make centos work for ubuntu11:41
inc0ubuntu source has all bits ready?11:43
inc0for ansiblish stuff11:43
SamYapleinc0: its the only one where they all build :)11:43
SamYapleyep11:43
SamYaplehavent touched centos in almost 3 weeks for full builds11:44
SamYaplewell11:44
SamYapleno i take that back11:44
SamYaplei did a full build ofr heat+horizon testing11:44
vbelhttps://review.openstack.org/#/c/219204/1  - for all bits ready ;)11:45
openstackgerritSam Yaple proposed stackforge/kolla: Setup ubuntu build gate  https://review.openstack.org/21924311:45
openstackgerritSam Yaple proposed stackforge/kolla: Setup ubuntu build gate  https://review.openstack.org/21924311:47
openstackgerritSam Yaple proposed stackforge/kolla: Setup ubuntu build gate  https://review.openstack.org/21924311:48
sdakeinc0 i am running multinode now and rabbitmq appears to be suffering some kind of problem12:00
SamYaplewhich problem12:01
openstackgerritSam Yaple proposed stackforge/kolla: Setup ubuntu build gate  https://review.openstack.org/21924312:01
sdakerunning with this diff:12:01
sdakehttp://ur1.ca/nmpf2 -> http://paste.fedoraproject.org/261935/1088791412:01
sdakeall hypervisors appear down12:01
sdakebut they did register12:01
sdakealthough the api services all got through an init-runonce12:01
sdakewhich was impossible prior12:01
SamYaplealmost like you are masking the issue?12:02
sdakethese are in the offical upstream docs...12:02
sdakeprobably a discussion t obe had on the mailing list12:02
SamYaplewhich were unmaintained until like 2 months ago12:02
sdakewell what other solution would you propose12:03
SamYapleim working on it12:03
SamYaplehavent let you down yet. have a little faith12:03
inc0there is little faith after 20hrs being awake isn't it sdake ?;)12:04
SamYaplei find that lack of faith.... disturbing12:04
sdakepretty much inc0 i am beat ass tired of debugging this particular problem12:04
inc0what kind of problems rabbitmq is showing?12:04
sdakeinc0 not sure12:04
sdakegoing to do some mor emasking12:04
sdakesee if that fixes it12:04
sdakethe ha guide does not ecommend usin haproxy with rabbitmq12:06
SamYapleyea weve already been over this we arent going to be doing that12:06
sdakewe are tagging in t-2 days, if you have a solution that works in that itmeline fine, if not my patch goes in12:07
sdakeif you want to refactor later, feel free12:07
SamYaplewell see12:07
*** weiyu_ has joined #kolla12:07
* inc0 grabs popcorn12:07
* sdake grabs popcorn12:07
openstackgerritSam Yaple proposed stackforge/kolla: Setup ubuntu build gate  https://review.openstack.org/21924312:07
sdakethe alternative is to tag something that totally doesnt work except aio12:08
sdakewhich seems like a nonstarter to me12:08
sdakeso pick your poison12:08
SamYapleor you know, fix it correctly12:08
sdakei have tried everything i see to fix it12:08
sdakenothing12:08
sdakeyou gae up after 1 hour12:09
sdakei have a working solution after 2012:09
SamYapleim at my job12:09
SamYaplei gave up nothing12:09
SamYapleive fixed all kinds of shit today12:09
sdaket-2 days we are tagging12:09
SamYapleits just not kolla for once12:09
sdakei understand eveyrone has other shit to do12:09
sdakei'm not asking you to work miracles12:09
inc0SamYaple, then tell us how it SHOULD be fixed12:09
sdakebut i am at a loss as to a better solution12:09
SamYapleinc0: since no one has produced an error thats kind of hard12:10
SamYaplei need ot look at it12:10
sdakeit got put in those docs for a reason :)12:10
SamYapleand no client errors dont count12:10
*** banix has quit IRC12:10
SamYaplesdake: its said source since those docs initially went up 3 years ago12:10
SamYapleit was not tought out i can assure oyu12:10
SamYapleits only been in the last few months that that doc has gotten any attention12:11
sdakeok well if you want to refactorit and fix it prperly thats fine12:11
sdakethat is how normal software engineering works12:11
sdakebut from what I can tell the proper fix is this integration solution12:11
SamYaplewait i thought it was the database that was the problem12:12
SamYapleyou said that for the first 4 hours12:12
inc0sdake, I'm about to setup brand new cluster of vms with ubuntu 14.0412:12
inc0is that your config?12:12
sdakeyes it is a synrhoniation problem wiht the database in some way12:12
sdakei'm notentirely sure where in the stack12:12
SamYapleit is not12:12
inc0sdake, ubuntu-source?12:12
sdakethe sourc erouting forces lal data through one server12:13
SamYapleotherwise rabbitmq wouldnt be affected would it12:13
sdakeinc0 ubuntu source fails, centos binary fails12:13
SamYaplethats not what source does12:13
sdakei've only asked 3 times how itworks12:13
sdakenobody will explain12:13
inc0ubuntu-binary is working?12:13
sdakeonly that its evil12:13
sdakeubntu-binary doesn't build dude12:13
SamYapleinc0: ubuntu-binary doesnt exist12:13
SamYaplethere is no delorean type repo for ubuntu12:13
inc0ok...my question is12:13
inc0if I build ubuntu-source, on ubuntu 10.0412:14
SamYapleoh good god12:14
inc014.0412:14
sdakefwiw I don't think we need haproxy at all for rabbitmq12:14
SamYapleoh whew12:14
inc0will it reproduce your failing env sdake?12:14
SamYaplesdake: as discussed many times, we dont12:14
sdakeif you deploy to 3 nodes12:14
sdakesamyaple there is haproxy config for rabbitmq in the haproxy file dude12:14
inc03 controllers?12:14
sdake3 ontrolls12:14
sdakeyou ccan see my diff of my inventory file12:15
vbelyes, haproxy is not needed for rabbitmq12:15
SamYaplesdake: im well aware12:15
SamYapledo you know why its there?12:15
sdakewell its in there, not sure if its used12:15
SamYaplebecause we have no means to update the config at the moment12:15
vbelwe should put rabbit hosts into openstack configs instead12:15
SamYapleso it has to stay like that until kolla-ansible can update configs12:15
openstackgerritSam Yaple proposed stackforge/kolla: Setup ubuntu build gate  https://review.openstack.org/21924312:24
sdakeinc0 on the plus side of things today, I've become an expert in how our ha solution is implemented12:25
sdakelooks pretty solid12:25
inc0I'll treat it as a complement12:25
sdakenice simple tidy12:26
SamYapleinc0: i normally just say 'youre welcome' when i dont know if its an insult or compliment :)12:26
sdakeif i dont have anything nice to say i dont say anything at all12:27
sdakewrite it down12:27
SamYapleIRC _is_ writing it down =D12:27
*** weiyu_ has quit IRC12:28
*** dims has joined #kolla12:33
openstackgerritSam Yaple proposed stackforge/kolla: Setup ubuntu build gate  https://review.openstack.org/21924312:34
sdakesomeetimes I get a lockup here12:34
sdakeTASK: [neutron | Ensuring OVS bridge is properly setup] ***********************12:34
sdakechanged: [minime-one]12:34
sdakeonly solution is a reboot12:34
SamYapleit would be modprobing ovs right around that stage12:34
SamYapleactually, it wouldn't12:35
SamYapleit would have tried to do that earlier12:35
SamYapleit may not have successed12:35
SamYaplebut at that stage it would try to start ocmmunicating with ovs-vsctld for the first time12:35
sdakeonly happens on multinode as well12:36
*** dwalsh has quit IRC12:36
sdakenova doesn't work multinode12:51
sdakehttp://paste.openstack.org/show/437638/12:51
*** klint has quit IRC12:57
*** dwalsh has joined #kolla12:58
openstackgerritSteven Dake proposed stackforge/kolla: Make API services work multinode  https://review.openstack.org/21926112:59
sdakethis is our next blocking bug blocking the release of liberty-313:03
sdakehttps://bugs.launchpad.net/kolla/+bug/149097313:03
openstackLaunchpad bug 1490973 in kolla "compute nodes don't show hypervisors as active" [Critical,Triaged]13:03
SamYaplesdake: not seeing your issues on my multinode13:04
SamYapleat all13:04
sdakesam remove your -2 vote13:04
SamYaplemultinode working for me with glane image-list loop13:04
sdakei am relaly not in any mood13:05
SamYapleim not either13:05
sdakelet the other cores judge13:05
SamYaplei am -2 on the source balancing for anything other than horizon _unless_ it is needed to tag13:05
sdakeit is needed to tag13:05
sdakeif you want  to rework it later feel free as i said13:05
SamYapleyou said i have two days, AND i dont see the issue at all13:06
sdakeinc0 confirmed it13:06
SamYapleinc0 had other issues...13:06
SamYaplehe was running centos on ubuntu too13:06
SamYaplewhich would never work without nopam13:06
sdakeit is not the packaging i ran full ubntu source build13:07
SamYaplewhich i am also running with no issues13:07
inc0I'm buiilding sources now13:07
inc0let me check it locally plz13:07
SamYapleso likely it is a problem with centos on the host level13:07
rhalliseywhat can I test that someone is not doing atm?13:07
rhalliseycentos binary?13:08
sdakefix https://bugs.launchpad.net/kolla/+bug/149097313:08
openstackLaunchpad bug 1490973 in kolla "compute nodes don't show hypervisors as active" [Critical,Triaged]13:08
rhalliseyso multinode13:08
rhalliseyk13:08
SamYaplealso dont have that problem sdake13:08
SamYapleworking fine13:08
SamYapleso lets take this time to _fix_ issues rather than kneejerking around13:08
sdakei am at a  loss for further solutions13:09
sdakeif you have a solution i am willing to test13:09
sdakeyou have access to my box13:09
sdakei want that patch approved so i can tag13:09
SamYapleif its alright with you know that my shift is over im am going to jump back into your bbox13:09
sdakeyup its aok13:09
SamYaplefyi, i just rebuilt and rekicked on ubuntu _multienode_ not a problem at all13:10
sdakei understand why it works, it keeps all traffic to one keystone server13:10
*** dwalsh has quit IRC13:10
SamYapleright but why is that needed :)13:10
SamYaplebecause broken networking most likely13:10
sdakeyou said your deploying on virt?13:11
inc0yeah, with RESTful APIs it shouldn't matter13:11
sdakeit may if the performance of the boxes isvery high13:11
sdakewe have already elimited the databae since its master/slave/slave13:12
sdakethat means keystone in nwayactive is bust on bare metal13:12
sdakeinc0 arey u on baremetal?13:12
inc0both simultaneously13:12
inc0akwasnie, is running baremetal aio13:13
sdakecould you expand13:13
inc0right now13:13
inc0we're deploying 2 deployments independently13:13
sdakethat may not work iirc haproxy's mcat addr is hard coded13:13
sdakeor keepalive13:14
sdakesamyaple are you running on virt deployment?13:17
*** sdake has quit IRC13:17
*** sdake has joined #kolla13:21
sdakelaptop died, samyaple are you running on virt or baremetal?13:22
SamYaplesdake: ive just rekicked my two main boxes for baremetal13:24
SamYaplesdake: 5 minutes ill be on your box doing things13:25
SamYaplewhats the current state?13:25
SamYapleare you in the middle of a test13:25
sdakecentos binary and ubuntu source are built form master in the registry13:25
vbelshould we add in inventory/multinode second host to [network] as default? To have 2 keepalived/haproxies ..13:25
sdakei do want to look at the compute node log13:26
sdakeso let me save those13:26
SamYaplevbel: those inventory files are more like docs than anything ,so whatever makes it more clear ot the user in your opinion13:27
vbelI see13:27
*** jtriley has joined #kolla13:28
sdakeive got the logs13:28
sdakeenjoy13:28
SamYaplefor the record, still no issues on ubuntu13:29
SamYaplehad an image-list going this whole time13:29
SamYaplewhile true; do glance image-list || break; done13:30
SamYaplestill hasnt broken13:30
sdakeon virt?13:32
SamYaplebaremetal13:32
SamYapledidnt break on virt either13:33
sdake3 node?13:33
SamYaple2 node baremetal13:33
SamYaple3 node virt13:33
sdakelet me try 2 node baremetal13:33
sdakeyo udont have a 3rd node i assume?13:33
sdakedeploying, one moment13:35
SamYaplenot a baremetal host no13:36
sdakelooks like you chnged glance-api13:42
sdakeand its failing to boot13:42
sdakeso i can't test if two node reproduces13:42
sdakeSamYaple your free to use the system now if you like13:45
*** Haomeng has joined #kolla13:47
SamYapleok hey i saw 1 error!13:47
SamYapleit was a 50013:47
SamYaplei tracked it back to too many db connectiosn ( a common issue)13:47
SamYaplebut only 1 error13:47
*** Haomeng|2 has quit IRC13:49
sdakepbourke are you about13:50
sdakei could live with 1 in 50013:50
sdakei am at 1 in 213:50
*** ftcpops has joined #kolla13:52
SamYaplesdake: ok sorry work cleanup13:53
SamYapleim in your box now13:53
SamYaplelets walk through this13:53
SamYaple100 is a different machine from 101,102,103?13:54
sdakeright13:54
sdake100 is hp z820 xeon13:54
SamYapleany kolla stuff ruinning there?13:54
sdake101/102/103 are dells13:54
SamYaple(not ansible)13:54
sdakenothing running on 10013:54
SamYapleok13:54
sdakeif there ws, it joins the cluster13:54
sdakewhich causes chaos13:54
SamYapleso kolla not deployed atm13:54
sdakeright13:55
sdakemy ansible imae has that change for fixing the internal_url13:55
sdakethe ansible-kolla13:55
sdakeotherwise its mater13:55
*** jruano has quit IRC13:55
SamYapleim going to deploy ubuntu source since thats what i was using to rule that out13:55
sdakefeel free13:56
sdakeits already built13:56
sdakeso no waiting 70 inutes or a build and push13:56
SamYaplei think 103 locked up13:57
sdakeya its busted13:57
sdakei'll go reboot13:57
sdakei rebooted all313:59
SamYapleok13:59
SamYaplethat explains that13:59
SamYapleare these vms?13:59
sdake101/102/103?13:59
SamYapleyea13:59
sdakebare metal cat /proc/cpuinfo13:59
sdake16gb ram13:59
sdake256 gb ssd14:00
sdakemaybe 32gb of ram actually14:00
sdakei don't recall14:00
SamYapleno problem14:00
SamYaplejust checking14:00
*** dayou has quit IRC14:00
sdakemodel name: Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz14:01
vbelcool i7 with ssd :) My dream laptop14:01
sdakeya 32gb ram14:02
*** Kennan has joined #kolla14:03
SamYaplesdake: you are welcome to join my tmux session while i work14:03
SamYaplealso how do i check the version of an installed package in centos?14:03
*** Kennan2 has quit IRC14:03
sdakerpm -qi packagename14:04
sdakehow do i join a tmux session14:05
sdakenever eused tmux before14:05
SamYaplesu to my user (s8m)14:05
SamYapletmux attach14:05
*** zhiwei has quit IRC14:05
*** dayou has joined #kolla14:06
SamYapleyou attached?14:07
*** dwalsh has joined #kolla14:07
SamYapleoh on the 100 server14:07
SamYaplenot the dells14:07
inc0mkdir: cannot create directory '/var/lib/rabbitmq/mnesia' that's a new - error from rabbitmq14:08
sdakehttp://docs.openstack.org/kilo/config-reference/content/section_keystone.conf.html14:08
sdakeinc0 pbourke has a patch up for that14:08
sdakethere are some keystone caching options14:08
sdakeworth exploring turning those off14:08
sdakecaching plus nwayactive = bad news14:08
sdakeimpossible to keep cache coherency14:09
SamYaplewow dude can you resize your window alow14:10
SamYaplealot*14:10
SamYaplei cannot work14:10
sdakelet me star twith a fresh term14:10
SamYapleok i detached you14:10
SamYaplemicro window size there14:10
sdakeso you want it max sized?14:11
SamYapleif possible14:11
SamYapleyour window affects mine14:11
SamYaplethats fine14:11
inc0must love tmux14:12
SamYaplei love it14:12
SamYapleso much14:12
inc0I do too14:12
SamYaple :setw sync14:12
inc0I have it on shortcut14:12
SamYapleyea i do too14:12
inc0ctrl + s14:12
SamYapleon my boxes....14:12
SamYaplesdake: no openstack client14:15
SamYaplei dont want to dirty these boxes alot14:15
SamYaplebut i need build stuff or somethin14:15
SamYapleg14:15
sdakeits fine14:15
sdakei can reload them if i need14:16
SamYaplealrighty14:16
sdakejust tyr to to break them because it does take about 30 mintes to reload14:16
vbelquestion: we don't use keystone token caching at all currently, correct?14:23
SamYaplecorrect14:23
vbelSamYaple: thanks14:23
SamYapletough the configs dont prevent it14:24
*** banix has joined #kolla14:24
*** shardy_ has joined #kolla14:30
sdakeya turn that token caching shit off14:31
*** shardy has quit IRC14:32
*** shardy_ has quit IRC14:35
*** shardy has joined #kolla14:36
vbelI am in glance image-list, 100 times ok. haproxy/keepalived/keystone/glance on 2 virtual nodes (virtualbox with host-only network + router vm)14:43
SamYaplevbel: i think i have a thread on the issue...14:43
SamYaplepulling it now14:43
*** shardy_ has joined #kolla14:47
*** shardy has quit IRC14:48
openstackgerritMerged stackforge/kolla: Register internal_url properly  https://review.openstack.org/21917114:49
*** shardy_ has quit IRC14:52
*** blahRus has joined #kolla14:53
*** shardy has joined #kolla14:53
*** sdake_ has joined #kolla14:53
openstackgerritMerged stackforge/kolla: rsyslog container  https://review.openstack.org/21458914:54
*** sdake has quit IRC14:58
sdake_samyaple i think yo uwant something relate to immidate flushing14:58
inc0I gtg14:58
inc0will come back to this tomorrow14:59
inc0cyas14:59
sdake_you would think turning off caching woudl enable flushing but who nkows :)]14:59
SamYapleyea14:59
SamYaplei do _not_ want ot put memcache as a backing for keystone14:59
*** inc0 has quit IRC14:59
SamYaplebut that would instantly make this work :(14:59
sdake_ya let stry not to do that14:59
SamYaplemajor security hole14:59
SamYaplefernet tokens would also fix this15:00
sdake_redis is a caching backend as well15:02
*** dayou has quit IRC15:02
sdake_seems a bit like overkill15:02
sdake_maybe some day we will want that15:02
sdake_i just want the damn thing to work15:03
SamYaplefernet is the future15:04
SamYapleits pretty damn slick15:04
SamYaplei believe it is default for liberty15:04
*** diogogmt has joined #kolla15:04
*** sdake_ is now known as sdake15:06
*** dayou has joined #kolla15:07
*** diogogmt has quit IRC15:07
vbelI cannot recreate rabbitmq mkdir problem, if one can - please pastebin ansible output15:09
*** diogogmt has joined #kolla15:09
sdakesamyaple asking around various people what the config options are for keystone15:09
SamYaplesdake: for master liberty ;)15:09
SamYaplesure there is lots of that15:09
sdakei cannot recreate rabbitmq mkdir problem either15:10
sdakethe permissions are set properly in the contianers i debugged15:10
*** banix has quit IRC15:10
*** dwalsh has quit IRC15:11
*** diogogmt has quit IRC15:11
* rhallisey is stuck in building limbo15:11
rhalliseybrb15:11
*** dayou has quit IRC15:12
sdakewhat types of tokens are we using samyaple15:14
sdake#openstack-keystone15:14
sdakepls join15:14
*** banix has joined #kolla15:15
*** rhallisey has quit IRC15:15
*** dims has quit IRC15:17
*** diogogmt has joined #kolla15:21
vbeluuid as default15:23
*** jasonsb has quit IRC15:23
*** jasonsb has joined #kolla15:23
*** diogogmt has quit IRC15:24
*** jasonsb has quit IRC15:28
*** dayou has joined #kolla15:39
*** tfukushima has joined #kolla15:44
*** jtriley has quit IRC15:47
*** chandankumar has quit IRC15:49
*** banix has quit IRC15:50
*** dayou has quit IRC15:50
*** banix has joined #kolla15:51
*** juggler has joined #kolla15:57
*** rhallisey has joined #kolla15:59
*** diga has joined #kolla16:03
SamYapleso to confirm, has _ANYONE_ been able to reproduce what sdake was seeing?16:05
*** dims has joined #kolla16:05
*** sdake_ has joined #kolla16:05
SamYapleI know i havent, inc0 never said he was able to16:05
*** banix has quit IRC16:05
SamYaplehe was having the su kernel issue16:05
*** ftcpops has quit IRC16:07
*** sdake has quit IRC16:09
*** jtriley has joined #kolla16:09
*** jasonsb has joined #kolla16:13
*** diga has quit IRC16:18
*** sdake_ is now known as sdake16:21
*** erkules_ is now known as erkules16:33
*** erkules has joined #kolla16:33
openstackgerritSam Yaple proposed stackforge/kolla: Setup ubuntu build gate  https://review.openstack.org/21924316:36
*** jruano has joined #kolla16:39
*** jasonsb has quit IRC16:42
*** jasonsb has joined #kolla16:43
*** SourabhP has joined #kolla16:43
*** SourabhP_ has joined #kolla16:45
*** diogogmt has joined #kolla16:46
*** jasonsb has quit IRC16:47
*** SourabhP has quit IRC16:48
*** SourabhP_ is now known as SourabhP16:48
*** banix has joined #kolla17:02
*** unicell has quit IRC17:03
*** sdake_ has joined #kolla17:06
*** sdake has quit IRC17:06
openstackgerritVladislav Belogrudov proposed stackforge/kolla: Source based builds should get novnc from tarball (git/url)  https://review.openstack.org/21920417:06
*** sdake has joined #kolla17:07
*** jasonsb has joined #kolla17:08
*** SourabhP has quit IRC17:09
openstackgerritMick Thomspon proposed stackforge/kolla: Add ansible support for Murano  https://review.openstack.org/21889417:10
*** sdake_ has quit IRC17:10
*** jasonsb has quit IRC17:12
SamYaplelooks like keystone is borked in centos-binary bad pacakges17:17
SamYaplesame with cinder17:17
SamYaple2015-09-01 17:16:08.741 | DEBUG:kolla.cmd.build:cinder-api:[91mError: Package: 1:python-keystone-8.0.0.0b3-dev209.el7.centos.noarch (delorean)17:18
SamYaple2015-09-01 17:16:08.741 | DEBUG:kolla.cmd.build:cinder-api:           Requires: python-oslo-config >= 2:1.11.017:18
SamYaple2015-09-01 17:16:08.741 | DEBUG:kolla.cmd.build:cinder-api:           Installed: 2:python-oslo-config-1.9.3-1.el7.noarch (@openstack-kilo)17:18
SamYaple2015-09-01 17:16:08.741 | DEBUG:kolla.cmd.build:cinder-api:               python-oslo-config = 2:1.9.3-1.el717:18
SamYaple2015-09-01 17:16:08.742 | DEBUG:kolla.cmd.build:cinder-api:           Available: 2:python2-oslo-config-2.3.1-dev2.el7.centos.noarch (delorean)17:18
SamYaple2015-09-01 17:16:08.742 | DEBUG:kolla.cmd.build:cinder-api:               python-oslo-config = 2.3.1.dev217:18
*** jasonsb has joined #kolla17:19
sdakeNS RHW PEOVLWM IA QHr?17:20
sdakeand the problem is what?17:20
SamYaple?17:21
*** jasonsb has quit IRC17:21
rhalliseystill building17:21
SamYapleidk looks like the kilo repo is taking priority over delorean17:21
SamYaplei dont know how that stuff works17:21
SamYaplewhich package wins from which repo17:21
sdakethat is correct that is wat should happen17:21
SamYaplewell thats bad17:22
SamYaplethe delorean repo has newer packages17:22
SamYaplein this case newer ones that are required17:22
SamYapleanyway just a heads up since its centos binary i just saw the gate fail17:22
SamYaplesomeone will need to patch17:22
sdakehttp://ur1.ca/nms9w -> http://paste.fedoraproject.org/262087/1441128117:22
sdakesam what cuaes this problem17:23
SamYaplebad ram17:23
sdakeinventory file http://ur1.ca/nmsa7 -> http://paste.fedoraproject.org/262088/1441128117:23
SamYaplei legit dont know sdake but its that problem node again17:23
sdakei am tyring to deploy to single nodes17:23
sdakeis that possibly caused by a borked install of centos?17:24
SamYaplepython would have to be shot17:24
SamYaplebut i guess maybe17:24
SamYaplei really cant tell you17:24
SamYapleif you changed nothing else it should be fine17:24
sdakeold kernel on node-0317:25
sdakei didn't yum update it after installing17:25
sdakebut i yum updated the other nodes...17:25
*** achanda has joined #kolla17:25
SamYaplethat would explain ovs bombing out17:25
*** loth has joined #kolla17:27
sdakeso interesting fact17:27
sdakenode 1 was yum updated17:27
sdakenode 2 and node 3 were not17:27
openstackgerritSam Yaple proposed stackforge/kolla: Setup ubuntu build gate  https://review.openstack.org/21924317:28
SamYaplesdake: that makes sense17:28
SamYaplewhen i yum installed packages node 1 completed with way fewer instaleld packages than node 2 and 317:29
SamYaplei mean im not saying thats the issue but there are some inconstencies to work through17:29
sdakenode 2 doesn't install17:29
sdakesame erro about keystone dict object17:29
*** unicell has joined #kolla17:30
sdakedid you install all the same software on all 3 nodes?17:31
SamYaplefor openstackclient yea17:32
sdakeis there anything you didn't install on node1?17:32
SamYapleno17:32
sdakethat you installed on node 2 and node 317:32
sdakeso my router somehow asigned minime-one to 192.168.1.14817:44
sdakeso who knows what tht would do17:45
sdakeprobably nothing good17:45
sdakeafter fixing that I get fatal: [minime-01] => {'msg': "AnsibleUndefinedVariable: One or more undefined variables: 'dict object' has no attribute 'keystone'", 'failed': True}17:48
*** jasonsb has joined #kolla17:53
*** SourabhP has joined #kolla17:58
*** banix has quit IRC18:00
*** chandankumar has joined #kolla18:00
*** banix has joined #kolla18:02
*** SourabhP has quit IRC18:16
*** SourabhP has joined #kolla18:17
*** akwasnie has quit IRC18:19
*** dperaza has joined #kolla18:22
*** dperaza has left #kolla18:26
*** jruano has quit IRC18:30
*** dims has quit IRC18:40
*** dims has joined #kolla18:40
*** dims has quit IRC18:45
*** diogogmt has quit IRC18:48
*** diogogmt has joined #kolla18:54
*** SourabhP has quit IRC18:58
*** dims has joined #kolla19:09
*** sdake_ has joined #kolla19:12
*** sdake has quit IRC19:13
*** dims has quit IRC19:13
*** sdake has joined #kolla19:13
*** sdake_ has quit IRC19:18
*** dims has joined #kolla19:18
*** banix has quit IRC19:21
*** dims has quit IRC19:25
*** whydidyoustealmy has quit IRC19:34
*** whydidyoustealmy has joined #kolla19:34
*** whydidyoustealmy has quit IRC19:35
*** dims has joined #kolla19:35
*** whydidyoustealmy has joined #kolla19:36
*** unicell has quit IRC19:38
*** sdake_ has joined #kolla19:45
*** sdake has quit IRC19:49
*** sdake has joined #kolla19:49
*** sdake_ has quit IRC19:53
*** chandankumar has quit IRC19:57
*** unicell has joined #kolla20:05
*** SourabhP has joined #kolla20:31
*** absubram has joined #kolla20:43
*** sdake has quit IRC20:53
*** achanda has quit IRC20:54
*** tfukushima has quit IRC21:05
*** achanda has joined #kolla21:07
*** jtriley has quit IRC21:10
*** achanda has quit IRC21:30
*** SourabhP has quit IRC21:42
*** shardy has quit IRC22:01
*** tfukushima has joined #kolla22:05
*** tfukushima has quit IRC22:10
*** SourabhP has joined #kolla22:11
*** achanda has joined #kolla22:54
*** achanda has quit IRC22:59
*** dims has quit IRC23:03
*** blahRus has quit IRC23:05
*** tfukushima has joined #kolla23:06
*** tfukushima has quit IRC23:12
*** absubram has quit IRC23:24
*** dims__ has joined #kolla23:24
*** diogogmt has quit IRC23:40
*** diogogmt has joined #kolla23:43
*** diogogmt has quit IRC23:45

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!