Wednesday, 2016-10-05

*** haplo37_ has joined #openstack-kolla00:00
sdake_kfox1111 sam tried ceph in the gatte and couldn't get it to work00:03
sdake_however i dont recall if he was done trying or not00:03
sdake_kfox1111 may ask him for adice00:03
sdake_kfox1111 he seems open to answering questions atleast on irc :)00:04
kfox1111ok. cool.00:09
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104100:11
*** diogogmt has joined #openstack-kolla00:12
*** sdake_ has quit IRC00:16
kfox1111arg... need both ps's combined now...00:20
kfox1111I think the remaining issues in the Start testing may be ceph related, which the other ps tests.00:20
kfox1111may be time to merge the Start testing review.00:21
*** schwicht has joined #openstack-kolla00:22
kfox1111or, I guess I can pull off the wip to get it ready, and then rease the ceph one on top as a follow on. will have to do that anyway.00:22
kfox1111heh... this one worked sort of: http://logs.openstack.org/41/381041/31/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/5f94252/console.html00:28
kfox1111I can't really see much between the working and not working ones. :/00:28
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: Split endpoint jobs and start testing the deployments  https://review.openstack.org/38086800:30
kfox1111sbezverk_: https://review.openstack.org/#/c/380868 Is ready I think. it does create all the endpoints fine.00:31
kfox1111arg...00:34
kfox1111ceph-mon image has xfs mkfs but not ext4... weird.00:34
*** huikang has joined #openstack-kolla00:36
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104100:38
*** schwicht has quit IRC00:39
*** tonanhngo_ has quit IRC00:40
*** HyperJohnGraham has joined #openstack-kolla00:42
*** tonanhngo has joined #openstack-kolla00:42
*** inc0 has quit IRC00:45
*** schwicht has joined #openstack-kolla00:46
*** tonanhngo has quit IRC00:47
*** senk__ has joined #openstack-kolla00:49
*** senk_ has quit IRC00:49
*** tonanhngo has joined #openstack-kolla00:49
*** v1k0d3n has quit IRC00:51
*** v1k0d3n has joined #openstack-kolla00:52
*** tonanhngo_ has joined #openstack-kolla00:52
*** david-lyle has quit IRC00:52
*** sdake has joined #openstack-kolla00:53
*** daneyon has joined #openstack-kolla00:54
*** tonanhngo has quit IRC00:54
openstackgerritDuong Ha-Quang proposed openstack/kolla: Improve VIP existence check  https://review.openstack.org/38158900:56
*** tonanhngo_ has quit IRC00:56
*** daneyon has quit IRC00:58
*** g3ek has quit IRC00:59
*** phuongnh has joined #openstack-kolla01:00
*** haplo37 has quit IRC01:02
*** schwicht has quit IRC01:03
*** karlamrhein has quit IRC01:03
*** haplo37 has joined #openstack-kolla01:05
*** duonghq has joined #openstack-kolla01:07
duonghqmorning01:07
openstackgerritDuong Ha-Quang proposed openstack/kolla: Specify 'become' to neccesary tasks (general roles)  https://review.openstack.org/35853901:08
openstackgerritDuong Ha-Quang proposed openstack/kolla: Specify 'become' for only neccesary tasks (default roles)  https://review.openstack.org/35903101:08
*** g3ek has joined #openstack-kolla01:09
HyperJohnGrahamhi all01:09
*** eaguilar has joined #openstack-kolla01:11
*** karlamrhein has joined #openstack-kolla01:13
*** otavio has joined #openstack-kolla01:14
MarMatsdake fyi, magnum is broken for me https://bugs.launchpad.net/magnum/+bug/163041801:16
openstackLaunchpad bug 1630418 in Magnum "Minions are not registering because of hostname translation failure" [Undecided,New]01:16
sdakeMarMat what happened to "looksgood" :)01:16
sdakeMarMat are ou talking about the kube case or the ansible case?01:17
sdakethat bug above looks like it has nothign to do with magnum?01:17
MarMatsdake well, ya know, that's the way forward to brigher tomorrows01:17
MarMatsdake it's a magnum thing01:17
sdakeoh, you mean inside magnum the minions can't register01:18
MarMatsdake yes, they made a change recently and it broke things, at least for me01:18
sdakeMarMat follow me to #openstack-containers please01:19
sdakeMarMat what we have done in the past is apply a revert patch on top01:24
sdakeas a short term workaround01:24
sdakemarmat this is slightly harder then it sounds unforutnately :)01:24
sdakemarmat I have faith in you :)01:24
*** v1k0d3n has quit IRC01:25
openstackgerritDuong Ha-Quang proposed openstack/kolla: Specify 'become' for only neccesary tasks (all other roles)  https://review.openstack.org/35909601:25
MarMatsdake luckily I have some guys behind me who are helping me01:26
*** duonghq has quit IRC01:26
sdakeMarMat sweet - its not super hard01:26
sdakejust annoying to make the patch ;)01:26
sdakei'd point you at an example that exists today but we have none01:26
sdakeif you check the history of the horizon j2 file, yo uwill find an example01:27
*** v1k0d3n has joined #openstack-kolla01:27
sdakei know for certain i added work there to hack around this problem01:27
MarMatsdake well first let's see what they reply to the report, right?01:27
sdakethere are also other files that have reverts in them01:27
sdakemarmat we work faster then upstream in some cases, my recommendation is to put in the revert now, and revert it later once magnum fixes it (if they think its a bug)01:28
MarMatsdake good, i have to teleport home now and will take a look on it later in the evening01:29
openstackgerritOpenStack Proposal Bot proposed openstack/kolla: Updated from global requirements  https://review.openstack.org/37598901:30
sdakeMarMat ship me a teleporter pls :)01:30
sdakei'll even pay for shipping :)01:30
sdakeif you want to get fancy teleport it :)01:30
*** schwicht has joined #openstack-kolla01:32
*** MarMat has quit IRC01:37
*** duonghq has joined #openstack-kolla01:45
*** huikang has quit IRC01:48
*** huikang has joined #openstack-kolla01:48
*** v1k0d3n has quit IRC01:49
*** huikang has quit IRC01:52
*** v1k0d3n has joined #openstack-kolla01:55
*** v1k0d3n has quit IRC02:02
*** v1k0d3n has joined #openstack-kolla02:05
*** severion has joined #openstack-kolla02:07
*** v1k0d3n has quit IRC02:07
*** MarMat has joined #openstack-kolla02:10
*** unicell1 has quit IRC02:17
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104102:18
*** sdake has quit IRC02:18
*** sdake has joined #openstack-kolla02:20
*** schwicht_at_work has joined #openstack-kolla02:25
MarMatsdake not sure what you mean by history of horizon j2 file, I cannot see anything what would look like a revert patch... anyway now we ate talking about application of a revert patch on magnum, right?02:25
*** schwicht has quit IRC02:25
sdakeMarMat right02:28
sdakemake the revert patch02:28
sdakethen apply it to the dockerile in kolla02:28
sdakethis is normal operational behavior for us to unblock others02:29
sdakethe reality is this revert patch will be reverted once magnum fixes the issue upstream ;)02:29
sdakeMarMat can you ping the mailing list with your problem as well and tag it [kolla][magnum] Patch causes regression in magnum02:29
sdakeand then link the patch and give a brief explination along with a link to the bug and ask for bug triage on it.02:30
sdakepoint out the bug triage is needed from a magnum-driver not a kolla-driver member02:30
sdakeotherwise people reading it might think its SEP's ;)02:30
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104102:31
*** HyperJohnGraham has quit IRC02:32
*** HyperJohnGraham has joined #openstack-kolla02:34
MarMatsdake looks like a plan for a nice evening :-)02:38
*** yuanying_ has quit IRC02:39
*** haplo37 has quit IRC02:39
*** haplo37 has joined #openstack-kolla02:44
*** eaguilar has quit IRC02:46
*** diogogmt has quit IRC02:54
sdakerbergeron ping03:00
otavioIs someone working on magnum-ui integration?03:12
duonghqanybody got MariaDB error like this: http://pastebin.com/0cxkUzGe03:22
duonghqafter 3 controller form a HA cluster of mariadb is restarted03:22
sdakeduonghq master?03:30
duonghqsdake: mitaka03:31
sdakeotavio that has not been integrated and noboddy to my knowledge is owrking on it03:31
sdakeduonghq how did you install mitaka?03:31
sdakevia git or pip03:31
sdakerun pip show kolla -> this will give me answer03:31
sdakeotavio if you wnat to take a crack - feel free - also note our third party plugins work should enable a customization for htis quite easily03:32
sdakeduonghq mitaka mariadb worked from registry when pushed03:33
sdake(and then pulled on a fresh 3 node setup)03:33
duonghqsdake: I'm supporting a operator, he is using pip03:34
sdakeso he did pip install kolla?03:34
sdakeor pip install .03:34
duonghqpip install kolla03:34
sdakeneed output of pip show kolla03:34
sdakeok - do you ahve any other data?03:34
sdakesuch as globals.yml03:34
sdakeor are we playing super telephone here :)03:34
sdakeprechecks worked?03:34
duonghqIt failed after system reboot03:35
duonghqWorked previously03:35
sdakeok so it worked prior03:35
sdakethere is this little playbook called "maraidb_recovery"03:35
sdakerun that pls :)03:35
sdakemariadb03:35
sdakeits an action in kolla-ansible03:36
duonghqyup, seen that before but I'm not sure about this,03:36
sdakelights out requires special recovery circumstances03:36
sdakea reboot = lights out03:36
*** daneyon has joined #openstack-kolla03:37
sdakewere all 3 nodes rebooted at same time?03:37
duonghqya, same time03:37
sdakeya you need mariadb_recovery03:37
sdakeit exists for that exact situation03:37
sdakeand its in mitaka03:37
*** severion has quit IRC03:38
sdakekolla-ansible -i /path/to/inventory mariadb_recovery should get it g oing03:38
duonghqunderstood03:38
sdakeif there is something else i'm missing let me know ;)03:38
sdakethe info on the reboot was ninja slipped in there - that should have been first ;)03:38
duonghqroger03:39
sdakeif that doesn't fix it03:39
sdakeiptables could be busted in some way03:39
sdakelets try mariadb_recovery and get back to us03:39
sdakeyou shouldn't need mariadb recovery on a single node reboot03:40
sdakejust on a full lights out03:40
sdakethe odds of 3 servers failing at the same time are astronomically low03:40
sdakeusually this failure is triggered by power loss in the data center03:41
*** daneyon has quit IRC03:41
duonghqsdake: he is testing Kolla and he restarted all servers 'cause of some mystic reasons03:41
*** sdake has quit IRC03:43
*** sdake has joined #openstack-kolla03:43
duonghqsdake: everything is fine now, thanks03:44
sdakeduonghq mariadb_recovery worked?03:45
duonghqyup, the cluster is up and running03:45
duonghqbut why we need the specific task?03:45
duonghq*action03:45
sdakemariadb has specific recovery mechanism for lights out03:46
sdakeno other service does03:46
sdakei debated a long time for a general "kolla-ansible lights-out-recover" action03:46
duonghqand we must trigger the maechanism by hand?03:46
sdakebut lost the debate once it became clear only mariaadb was needed03:46
sdakeya that part - the by hand part - is annoying03:46
sdakea proper cluster infrasturcture would just work without a special recovery mechanism03:47
sdakei dont know why mariadb is designed the way it is03:47
sdakethis is not a kolla bug tho :)03:47
duonghqsorry but it's mariadb's fault or Galera one?03:47
duonghqjust for clarify03:47
sdakegalera03:47
duonghqroger03:47
sdakegoogle galera power outage recovery or something like that03:48
sdakeyou will probably find the docs that were used to construct that playbook :)03:48
duonghqseen03:48
sdakeduonghq we can't *make* upstreams of ours do anything03:49
sdakethey have to want to do it, and i'm not sure if they care to fix the lights out recovery problem or not03:49
*** mdnadeem_ has joined #openstack-kolla03:49
sdakeuntil that time, the best we can do that I know of atm is make a playbook to handle it - which is what happend 9+ months ago :)03:49
duonghqhmm, it's a bug or a feature?03:50
sdakelgihts out recovery of mariadb is a feature for us03:52
sdakei think its a general design defect of mariadb+galera03:52
sdakeanswer to your q is - it depends on your pov :)03:52
duonghqI interested in Galera POV,03:52
sdakeno idea- ask them :)03:53
duonghqya03:53
duonghqwhy we need xtradb backup from percona?03:53
sdakei would find it hard to make an argument for rationalizing it as a feature03:53
sdakeduonghq no idea why that is needed03:53
sdakeduonghq if you can figure out how to get rid of it, more power to ya03:54
sdakei dont want it in there03:54
sdakethats the only thing we use from percona03:54
sdakethe last i heard from sean I htink is that is enables replication in some special way03:54
sdakei dont recall the details03:54
sdakeit sounded good03:54
sdakeand it is mandatory at present03:54
sdakei think it doesn't need to be03:55
duonghqokay03:55
sdaketo be not mandatory requires R&D time tho04:02
sdakeits not like "remove it and thingss still work"04:02
sdake(I've tried that..:)04:02
duonghqya04:02
*** yuanying has joined #openstack-kolla04:02
*** jmccarthy has quit IRC04:04
*** jmccarthy has joined #openstack-kolla04:05
duonghqsdake: do you deploy latest master code?04:08
sdakedaily04:08
sdakebut not today - internet was out04:08
duonghqI just stuck at "Waiting for virtual IP to appear"04:08
duonghqtimetou04:08
sdakestill is -at my parents leaching off their internet atm04:08
duonghqI'm not sure why it's use DB port as a sign04:10
duonghqand the DB task hasn't run yet04:10
*** yuanying has quit IRC04:12
*** yuanying has joined #openstack-kolla04:13
sdakeduonghq not sure - can't tell from here if master has problem or not04:16
sdakeduonghq as i am not able to access my gear04:16
duonghqya,04:16
sdakeduonghq my network at home is down04:16
sdakeso can't just ssh into my boxes04:17
sdakeand its still down - just tried again04:17
duonghqya, I think that in US, Internet is very good04:17
sdakeit is pretty good in my neighboorhood04:19
sdakegige04:19
sdakebut its broken atm04:19
sdakei am going to jet home and work on getting a tech out to fix it or repair it myself04:20
sdakettyl :) wish me well04:20
duonghqya04:21
duonghqblame the ISP is good enough?04:22
duonghqhow long is it fixed last time?04:22
*** sdake has quit IRC04:25
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104104:28
*** g3ek has quit IRC04:34
*** haplo37 has quit IRC04:35
*** yuanying has quit IRC04:36
*** g3ek has joined #openstack-kolla04:40
*** haplo37 has joined #openstack-kolla04:40
*** unicell has joined #openstack-kolla04:40
*** salv-orlando has joined #openstack-kolla04:41
*** yuanying has joined #openstack-kolla04:42
*** coolsvap has joined #openstack-kolla04:45
*** unicell has quit IRC04:45
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104104:46
*** unicell has joined #openstack-kolla04:49
*** sdake has joined #openstack-kolla04:51
*** unicell1 has joined #openstack-kolla04:53
*** senk__ has quit IRC04:53
*** unicell has quit IRC04:54
sdakesweet internet working :)04:55
sdakeping rbergeron04:55
*** bjolo_ has joined #openstack-kolla04:58
openstackgerritOpenStack Proposal Bot proposed openstack/kolla: Updated from global requirements  https://review.openstack.org/37598904:58
*** skramaja has joined #openstack-kolla05:02
bjolo_sdake, right channel this time05:15
bjolo_triage please :)05:15
bjolo_https://bugs.launchpad.net/kolla/+bug/162695805:15
openstackLaunchpad bug 1626958 in kolla "neutron_server not working when lbaas enabled" [Undecided,New]05:15
bjolo_https://bugs.launchpad.net/kolla/+bug/162924605:15
openstackLaunchpad bug 1629246 in kolla "keystone fernet deploy fail" [Undecided,New]05:15
sdakebjolo_ the second one is marked critical although it feels more like high05:16
sdakethe second one (fernet fails to deploy) super critical05:16
sdakebjolo_ that said, we are tagging on the 12th come hell or highwater ;)05:17
sdakeand that is a short 8 days away05:17
sdakeJeffrey4l_ if you could turn your considerable talent on this fernet issue05:17
sdakeJeffrey4l_ it would be appreciated05:17
bjolo_i know, tag is soon05:17
sdakei hear rumblings fernet is not ready to go05:17
sdakeJeffrey4l_ it absolutely needs to be ready to go :)05:17
bjolo_will do what i can to help out05:17
sdakebjolo_ not sure unless you want to pick up a keyboard - moment wife pingin gme05:18
Jeffrey4l_will check it .05:18
sdakeJeffrey4l_ cool - the other bug - with the vpn and lbaas not building - we need someone to take a look05:26
sdakeJeffrey4l_ if someone else (other then you) could look that would be great05:26
sdakewe got a whole slew of high/critical bugs and I don't think you can solve them all alone :)05:26
sdakei was mia today jerking around with getting my internet working05:27
bjolo_sdake, i have picked up the keyboard and submitted a few PS (simple ones, but still) :)05:27
coolsvapsdake: i have ubuntu-source build in progress i can triage https://bugs.launchpad.net/kolla/+bug/162695805:28
openstackLaunchpad bug 1626958 in kolla "neutron_server not working when lbaas enabled" [Critical,Triaged]05:28
sdakewell its been triaged05:28
sdakea confirmation is next ;)05:28
sdakeif you want to tackle that wfm - although i htink bjolo has pretty much provided evidence it is confirmed05:29
Jeffrey4l_coolsvap, cool. we need add neutron-service-plugin section in kolla-build.conf file.05:29
sdakecoolsvap if by triageyou mean fix, that would rock :)05:29
coolsvapJeffrey4l_: alright i will pick that up05:30
Jeffrey4l_hmm. i know the root cause and solution. i will leave some comments in the bug page.05:30
sdakeJeffrey4l_ root cause of which?05:30
Jeffrey4l_neutron-server + lbaas05:30
sdakefernet?05:30
Jeffrey4l_https://bugs.launchpad.net/kolla/+bug/1626958 this one.05:30
openstackLaunchpad bug 1626958 in kolla "neutron_server not working when lbaas enabled" [Critical,Triaged] - Assigned to Swapnil Kulkarni (coolsvap)05:30
sdakeoh - ya that looks pretty straightforward05:31
sdakethe fernet bug is the one I'd ask you to look at - since its complex :)05:31
Jeffrey4l_yep.05:31
sdakeand fernet is a highlight feature05:31
Jeffrey4l_checking it.05:32
bjolo_Jeffrey4l_, please let me know if i should test anything or provide more logs05:33
Jeffrey4l_bjolo_, ok.05:33
*** msimonin has joined #openstack-kolla05:35
Jeffrey4l_bjolo_, could u run the playbook again with `-vvv` parameter.05:38
*** v1k0d3n has joined #openstack-kolla05:38
bjolo_sure can05:38
bjolo_just give me a few, my daily build is not completed yet05:38
Jeffrey4l_OK. np05:38
bjolo_ubuntu or centos?05:38
bjolo_the bug was reported on ubuntu source05:39
Jeffrey4l_either05:40
Jeffrey4l_so ubuntu + source.05:40
bjolo_ok05:40
sdakeelrog 845 tubes = winning05:40
Jeffrey4l_sdake, what's that means?  elrog?05:41
sdakeelrog is a german brand05:41
sdakethey make vacuum tubes05:41
sdakefor my hobby :)05:41
bjolo_sdake, if i want to submit some BP, is there deadline for ocata cycle or how does it work?05:42
sdakebjolo_ any deadlines for ocata are at this point undefined05:42
Jeffrey4l_cool. got.05:42
sdakebjolo_ what Id recommend is to submit the bp05:42
*** unicell1 has quit IRC05:42
sdakewe are going to move to a specs model this cycle I suspect05:42
*** unicell has joined #openstack-kolla05:43
sdakereally up to what the core team wants to do05:43
bjolo_i will, but it is all about priorities05:43
sdakethe goal of that is to flatten out workload over the cycle05:43
sdakeatm, our workload is very spikey around milestones05:43
*** v1k0d3n has quit IRC05:43
sdakeand everyone is in chill mode the rest of the time05:43
bjolo_specs modell as in specs.openstack.org?05:43
sdakei'd like to see our community hae a flat (high) workload05:43
sdakebjolo_ yes like that, but lighter weight05:44
bjolo_hehe05:44
sdakeas LIGHT WEIGHT as possible05:44
sdakethe purpose of specs isn't to slow down dev (as used in other projects)05:44
*** senk_ has joined #openstack-kolla05:44
sdakeits to help keep priorities straight05:44
sdakeand even out workload as well05:44
sdakethose two merge together in my mind05:45
wznoinsksdake: +105:45
sdakemost projects use specs to slow work down05:45
sdakeroadblocks = bad05:45
sdakeroadblocks = hassle or everyone05:45
sdakeso lets not do that ;)05:45
sdakei suspect a whole bunch of ocata will be spent jerking around with the repo split05:46
wznoinsksdake: what's the view on new features in ocata in kolla?05:46
wznoinsk(given the shorter release timeframe and "less focus" on features this cycle)05:47
sdakewznoinsk who said less focus on features?05:52
sdakeyou mean inc0's statements?05:53
sdakeI think our focus on features or lack thereof will come out of the specs process we use05:53
wznoinsknope, I can try to dig out that I think it was on ML, Ocata suppose to be a less/non-feature release05:53
sdakenot of any dictate anyone makes05:53
sdakewznoinsk openstack wide?05:53
wznoinskI better find out the message I'm reffering to first05:54
sdakewznoinsk please dig for subject05:54
wznoinskwhile I'm trying to find it, lack of spec/feature freeze dates and other struck me the other day: https://releases.openstack.org/ocata/schedule.html05:55
sdakewznoinsk i dont think its mandatory for us to set that right now05:55
sdakewznoinsk lets tackle specs first05:55
sdakethen when to freeze them if/when the core team gets comfortable with specs in the first place ;)05:55
*** msimonin has quit IRC05:58
*** salv-orlando has quit IRC06:01
*** egonzalez90 has joined #openstack-kolla06:04
*** tonanhngo has joined #openstack-kolla06:06
wznoinskwhen I think about it now it might have been my wrong interpretation of the shortest release cycle and Ocata being the transision release while we introduce PTGs06:07
*** tonanhngo has quit IRC06:07
*** david-lyle has joined #openstack-kolla06:08
bjolo_Jeffrey4l_, http://paste.openstack.org/show/58436706:14
bjolo_ticket updated as well06:14
*** yuanying has quit IRC06:16
bjolo_wznoinsk, PTGs?06:18
*** bjolo_ has quit IRC06:24
duonghqanybody stuck at waiting for virtual IP appear?06:33
*** shardy has joined #openstack-kolla06:33
*** unicell has quit IRC06:33
*** unicell has joined #openstack-kolla06:34
*** tonanhngo has joined #openstack-kolla06:34
Jeffrey4l_roger06:35
*** tonanhngo has quit IRC06:35
Jeffrey4l_bjolo, it is not full of log.06:35
*** salv-orlando has joined #openstack-kolla06:36
*** Serlex has joined #openstack-kolla06:39
*** salv-orlando has quit IRC06:40
*** mnasiadka has joined #openstack-kolla06:45
*** tonanhngo has joined #openstack-kolla06:54
*** MarMat has quit IRC06:55
*** hieulq has quit IRC06:55
*** tonanhngo has quit IRC06:56
*** salv-orlando has joined #openstack-kolla06:56
*** hieulq has joined #openstack-kolla06:58
*** hieulq has quit IRC06:59
*** gfidente has joined #openstack-kolla06:59
*** david-lyle has quit IRC07:02
openstackgerritMerged openstack/kolla: Create /var/log/kolla/rally before running rally-manage db create/upgrade  https://review.openstack.org/38207407:03
*** hieulq has joined #openstack-kolla07:05
*** hieulq has quit IRC07:06
*** tonanhngo has joined #openstack-kolla07:08
*** haplo37 has quit IRC07:09
duonghqI got dead keepalived container, anybody see that?07:09
*** g3ek has quit IRC07:09
*** tonanhngo has quit IRC07:09
*** matrohon has joined #openstack-kolla07:09
*** hieulq has joined #openstack-kolla07:11
*** daneyon has joined #openstack-kolla07:13
*** athomas has joined #openstack-kolla07:15
bjoloJeffrey4l_, no i cut away first 3000 lines that had nothing todo with keystone07:15
*** g3ek has joined #openstack-kolla07:15
bjoloJeffrey4l_, http://paste.openstack.org/show/58437407:16
*** haplo37 has joined #openstack-kolla07:16
bjoloouch07:17
*** daneyon has quit IRC07:18
*** b_bezak has joined #openstack-kolla07:20
*** hogepodge has quit IRC07:21
bjolowhats the char limit on paste.openstack?07:23
bjoloJeffrey4l_, here is the grep fernet version of the log http://paste.openstack.org/show/584376/07:24
*** msimonin has joined #openstack-kolla07:30
*** msimonin has quit IRC07:34
*** shardy_ has joined #openstack-kolla07:34
*** shardy has quit IRC07:36
*** tonanhngo has joined #openstack-kolla07:44
*** tonanhngo has quit IRC07:46
*** salv-orl_ has joined #openstack-kolla07:54
*** salv-orlando has quit IRC07:57
*** shardy_ is now known as shardy07:57
*** egonzalez90 has quit IRC07:59
*** msimonin has joined #openstack-kolla08:00
*** yuanying has joined #openstack-kolla08:01
*** tonanhngo has joined #openstack-kolla08:02
*** tonanhngo has quit IRC08:04
duonghqsometime I got kolla_start not found when deploy, after some restart, destroy... it's run again, no clue.08:08
*** bmace has joined #openstack-kolla08:09
duonghqsame images, same revision08:09
*** hogepodge has joined #openstack-kolla08:11
*** mgoddard has joined #openstack-kolla08:16
*** HyperJohnGraham has quit IRC08:18
wznoinskbjolo: http://lists.openstack.org/pipermail/openstack-dev/2016-September/102981.html08:24
bjolowznoinsk, tnx08:26
*** egonzalez90 has joined #openstack-kolla08:29
sdakeJeffrey4l_ did you see that rtnetlink bug is not a regression08:35
sdakeit has always been around08:35
*** vincent_vdk has left #openstack-kolla08:39
*** tonanhngo has joined #openstack-kolla08:43
*** strigazi_AFK is now known as strigazi08:44
*** tonanhngo has quit IRC08:44
*** salv-orl_ has quit IRC08:48
*** mkoderer has joined #openstack-kolla08:54
*** sdake has quit IRC08:59
*** tonanhngo has joined #openstack-kolla09:03
*** awiddersheim has quit IRC09:06
*** tonanhngo has quit IRC09:08
*** egonzalez90 has quit IRC09:11
*** awiddersheim has joined #openstack-kolla09:11
denaitrehello09:11
denaitreI would like to work on kolla based on the newton version of OS09:11
denaitreare the docker images available somewhere?09:12
pbourkedenaitre: it's recommend you build them manually09:15
denaitrepbourke: thanks, from master would be fine?09:17
pbourkedenaitre: yes09:17
denaitreok thanks09:17
*** berendt has joined #openstack-kolla09:20
*** berendt has quit IRC09:20
*** berendt has joined #openstack-kolla09:20
*** huikang has joined #openstack-kolla09:27
*** tonanhngo has joined #openstack-kolla09:32
*** huikang has quit IRC09:33
*** tonanhngo has quit IRC09:33
*** v1k0d3n has joined #openstack-kolla09:40
*** v1k0d3n has quit IRC09:45
*** haplo37 has quit IRC09:50
*** egonzalez90 has joined #openstack-kolla09:51
*** g3ek has quit IRC09:52
*** tonanhngo has joined #openstack-kolla09:53
*** tonanhngo has quit IRC09:54
*** daneyon has joined #openstack-kolla09:55
*** haplo37 has joined #openstack-kolla09:56
*** g3ek has joined #openstack-kolla09:57
*** daneyon has quit IRC10:00
*** egonzalez90 has quit IRC10:02
openstackgerritPaul Bourke (pbourke) proposed openstack/kolla: Fix horizon to use cache  https://review.openstack.org/38224410:10
*** hieulq has quit IRC10:13
*** rstarmer has joined #openstack-kolla10:17
rstarmermorning.10:17
rstarmeris it expected that the 2.0 stable/mitaka branch support config merge? Or is that a 3.0 capability?10:18
*** salv-orlando has joined #openstack-kolla10:19
*** mgoddard_ has joined #openstack-kolla10:22
pbourkerstarmer: config merge is in 2.010:23
openstackgerritOpenStack Proposal Bot proposed openstack/kolla: Updated from global requirements  https://review.openstack.org/37598910:23
rstarmerpbourke:   ok, then I must be doing somethign wrong.  I've added an ini file in /etc/kolla/config/cinder-api/cinder.conf on the system running kolla-ansible, and none of the parameters are showing up on my target system.  thoughts on where I can look to see what might be breaking?10:25
*** mgoddard has quit IRC10:25
*** salv-orlando has quit IRC10:25
pbourkerstarmer: try putting it in /etc/kolla/config/cinder.conf10:26
rstarmerok, will give that a run. thanks!10:26
pbourkerstarmer: I'm going to file a bug to make this more obvious as you're not the first to trip over this ;)10:27
rstarmerah, ok, thanks!10:27
rstarmeralll good now...10:31
*** egonzalez90 has joined #openstack-kolla10:32
*** duonghq has quit IRC10:38
*** athomas has quit IRC10:40
*** eaguilar has joined #openstack-kolla10:42
*** zhurong has joined #openstack-kolla10:56
rstarmerpbourke: if you let me know the bug number, perhaps I can fix the docs10:57
*** awiddersheim has quit IRC10:57
*** rstarmer has quit IRC11:01
*** rstarmer has joined #openstack-kolla11:02
*** athomas has joined #openstack-kolla11:03
berendtwhen trying to bootstrap neutron with current rc packages i got the error "ImportError: No module named setup" inside the bootstrap container11:06
openstackgerritPaul Bourke (pbourke) proposed openstack/kolla: Fix horizon to use cache  https://review.openstack.org/38224411:13
*** DanyC has joined #openstack-kolla11:14
*** DanyC has left #openstack-kolla11:15
pbourkerstarmer: Im not sure if its so much a doc thing or we need to improve the mechanism. Here's the bug anyway feel free to pitch in https://bugs.launchpad.net/kolla/+bug/163051911:18
openstackLaunchpad bug 1630519 in kolla "Improvements required for config merging" [Wishlist,New]11:19
rstarmerthanks, I'll see if I can contribute meaningfully :)11:19
pbourkethanks!11:19
*** salv-orlando has joined #openstack-kolla11:21
*** salv-orlando has quit IRC11:26
*** eaguilar has quit IRC11:26
*** ccesario has joined #openstack-kolla11:28
*** hrito has joined #openstack-kolla11:29
*** salv-orlando has joined #openstack-kolla11:30
*** coolsvap is now known as coolsvap_11:30
*** hrito has quit IRC11:32
*** jtriley has joined #openstack-kolla11:32
mliimamorning guys11:34
*** b_bezak_ has joined #openstack-kolla11:35
*** shardy is now known as shardy_lunch11:35
*** b_bezak has quit IRC11:37
*** mgoddard_ has quit IRC11:44
*** v1k0d3n has joined #openstack-kolla11:45
*** jtriley has quit IRC11:48
*** v1k0d3n has quit IRC11:51
openstackgerritChristian Berendt proposed openstack/kolla: Use keystone_internal_url to access keystone from horizon  https://review.openstack.org/38235612:00
*** b_bezak has joined #openstack-kolla12:02
*** msimonin has quit IRC12:04
*** b_bezak_ has quit IRC12:04
openstackgerritMerged openstack/kolla: Fix horizon to use cache  https://review.openstack.org/38224412:05
*** dwalsh has joined #openstack-kolla12:06
openstackgerritMerged openstack/kolla: fixed kestone fernet prechecks for multinode deployments  https://review.openstack.org/38001412:09
*** msimonin has joined #openstack-kolla12:11
*** msimonin has quit IRC12:11
*** coolsvap_ is now known as coolsvap12:13
*** jtriley has joined #openstack-kolla12:17
*** phuongnh has quit IRC12:17
*** sdake has joined #openstack-kolla12:19
*** eaguilar has joined #openstack-kolla12:20
*** fguillot has joined #openstack-kolla12:22
*** sean-k-mooneyAFK has quit IRC12:22
*** jtriley has quit IRC12:23
*** tonanhngo has joined #openstack-kolla12:26
kfox1111darn... this just missed 1.4: https://github.com/kubernetes/kubernetes/pull/3125112:27
kfox1111that woudl have been awesome to use.12:27
*** tonanhngo has quit IRC12:27
*** shardy_lunch is now known as shardy12:28
*** haplo37 has quit IRC12:28
sdakemorning peeps12:28
sdakekfox1111 do you ever sleep ;)12:28
kfox1111my body doesn't believe in it. :/12:28
kfox1111morning. :)12:29
kfox1111seems like you don't sleep either.12:29
*** g3ek has quit IRC12:29
*** haplo37 has joined #openstack-kolla12:30
*** schwicht_at_work has quit IRC12:30
*** mgoddard has joined #openstack-kolla12:30
*** g3ek has joined #openstack-kolla12:30
sdakekfox1111 ptl's job is to be sleep deprived :)12:39
kfox1111:)12:39
sdakemexican coke coming in today12:40
sdake24$ a box12:40
* sdake liking amazon primenow12:40
kfox1111man, this ceph thing is so weird... everything loks to be setup right, no errors in the logs, it works in minikube fine, and it works randomly in the gate occationally.12:40
sdakethey also had a 10$ first timer coupuon12:40
kfox1111:)12:41
sdake24$ a box is a good deal12:41
sdakeand i don't have to lug it around12:41
sdakeright to my doorstep12:41
sdake$1 a coke12:41
sdakethe lugging part is what stops me from getting the magic of hecho en mexico coca-cola rolling in our house :)12:42
kfox1111hehe12:42
sdakeso lets talk about failures of ceph12:43
sdakewhy is it failing - and why is it succeeding12:43
sdakeunderstanding this will solve your problem for you12:43
sdakeyou may just be looking now at "why is it failing"12:43
kfox1111yup.12:44
sdakelooking at when it goes right is also important12:44
kfox1111I've so far not been able to spot any differences.12:44
sdakethe fact that it works tells me your running into some environmental issue of some sort12:44
kfox1111doesn't seem to make a difference which node types.12:44
sdakehow are you doing disk labeling?12:44
kfox1111seen failures on each.12:44
sdakeor using some other ceph12:44
kfox1111loopback device and just telling it to bootstrap it.12:45
sdaketelling which to bootstrap what in which way?12:45
kfox1111the block device is zero'ed.12:45
sdakethe block device comes from a losetup?12:46
kfox1111yeah.12:47
kfox1111https://github.com/openstack/kolla-kubernetes/blob/master/tools/setup_gate.sh is the code12:47
kfox1111losetup bits here: https://github.com/openstack/kolla-kubernetes/blob/master/tools/setup_gate.sh#L15912:48
otaviosdake: 3rd-party plugins? where do I think information about it?12:48
*** jtriley has joined #openstack-kolla12:49
sdakekfox1111 in that example your using a bootstrap method which is documented but I don't test12:49
sdakekfox1111 havey ou tried simplifying by using the other parted method?12:50
sdakeotavio undocumented atm I htink12:50
sdakekfox1111 re your diskspace, are you sure you are not running out of it12:50
otaviosdake: but in the source, where I can find it?12:50
sdakeotavio the docker directory contains j2 files12:51
sdakeif you want to customize them, you can read the customization commands by looking at the j2 files12:51
sdakethey are in a file called "macros.j2"12:51
sdakeotavio this may be documented at htis point - I know pbourke did some work to write some docs in this area recently (iirc)12:51
kfox1111sdake: the other method requires it to do parted bits itself. I ran into issues with that a while back when I tried it. Cant remember the details though.12:52
sdakekfox1111 "requires it" requires who?12:52
sdakekfox1111 are you saying tha tthe bootstrap 0 -1 method with parted ends up looking like lines 162-164?12:52
*** jtriley has quit IRC12:53
sdakekfox1111 specifically on line 159, bs=1, seek=3g12:53
*** rstarmer has quit IRC12:54
sdakethis is wierd usage of dd from my pov12:54
sdakeI'd go with something like bs=1M count=300012:54
*** schwicht has joined #openstack-kolla12:54
sdakeor bs=1m12:54
sdakei don't know what that seek thing does12:54
sdakeif it creates a sparse file, that may nto work well with your use case12:54
*** tonanhngo has joined #openstack-kolla12:54
kfox1111when letting the bootstrap container do the partioning, it wanted to add partitions and labels and things I think that caused issues when loopback. cant remember the exact details.12:55
kfox1111sdake: the seek dd does a sparse file.12:55
kfox1111garanteed to be all 0's.12:56
*** tonanhngo has quit IRC12:56
openstackgerritChristian Berendt proposed openstack/kolla: Install MySQL-python with pip in horizon container (type source)  https://review.openstack.org/38239812:56
sdakesparse files aren't guaranteed to be anything :)12:56
sdaketry taking out the sparse file part12:56
sdakesee if that fixes it12:56
sdakewhile your about it, put a df in there to check disk space12:56
sdakeyou could be running out of disk space later as that sparse file fills up12:57
sdakethis would cause all kinds of wierd crater behavior12:57
sdakeor sparse files on their own could be the cause12:57
kfox1111kk12:58
sdakewhen debugging software, when we find something busted, we fix that - then on to the next possible problem ;)12:58
sdakenot that seek=3g is busted12:58
sdakeit seems elegant12:58
sdakebut to me, it also seems fragile12:58
kfox1111certainly couildn't hurt to try.12:58
sdakeright12:58
sdakedo you have a gate log of where it fails?12:58
sdakevs where it succeeds?12:59
sdakeif so, does it always fail in the same way ?12:59
*** jheroux has joined #openstack-kolla12:59
kfox1111I've got logs up the wazoo. :)12:59
kfox1111sec12:59
kfox1111http://logs.openstack.org/41/381041/31/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/5f94252/console.html   ceph comes up ok13:00
sdakeosic provider13:00
sdakewhat about fails?13:00
kfox1111http://logs.openstack.org/41/381041/36/experimental/gate-kolla-kubernetes-deploy-centos-binary-ceph-nv/1b20e62/console.html13:00
kfox1111http://logs.openstack.org/41/381041/36/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/52c3137/console.html13:00
sdakeso works and fails on same provider?13:00
kfox1111http://logs.openstack.org/41/381041/36/experimental/gate-kolla-kubernetes-ansible-workflow-ceph-nv/42ec450/console.html13:01
sdakeok another problem: http://logs.openstack.org/41/381041/36/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/52c3137/console.html#_2016-10-05_04_57_58_66566913:01
sdakeopenstackclient is required on gates13:01
*** huikang has joined #openstack-kolla13:01
kfox1111thats fine. I just added a huge amount of logging and some of the commands are not there yet when the log dumpper runs on error.13:02
sdakekfox1111 what do you make of this line: http://logs.openstack.org/41/381041/36/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/52c3137/console.html#_2016-10-05_04_57_54_47151813:02
kfox1111same f or that one.13:02
kfox1111let me show you...13:02
kfox1111https://github.com/openstack/kolla-kubernetes/blob/master/tools/setup_gate.sh#L80 and13:03
*** schwicht has quit IRC13:03
kfox1111https://github.com/openstack/kolla-kubernetes/blob/master/tools/setup_gate.sh#L3913:03
kfox1111so if anything goes wrong, it graps a metric crap ton of logs. :)13:04
sdakeright - what i'm getting at is what could possibly cause cat /var/log/messages to fail13:04
kfox1111ubuntu.13:04
kfox1111they call it /var/log/syslog.13:04
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104113:06
kfox1111so in the logs, the real error happens right before trap_error gets called.13:08
kfox1111the rest is log collection.13:09
kfox1111if you slice off the console.html off the url, you c an see all the rest of the logs collected.13:09
*** schwicht has joined #openstack-kolla13:14
*** tonanhngo has joined #openstack-kolla13:14
*** jtriley has joined #openstack-kolla13:15
*** jtriley has quit IRC13:23
kfox1111sdake: http://logs.openstack.org/41/381041/37/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/946d70e/console.html13:23
kfox1111unsparse13:23
*** rhallisey has joined #openstack-kolla13:23
kfox1111sdake: http://logs.openstack.org/41/381041/37/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/946d70e/console.html#_2016-10-05_13_12_58_98386213:25
kfox1111looks like there is pleanty of space.13:25
openstackgerritOpenStack Proposal Bot proposed openstack/kolla: Updated from global requirements  https://review.openstack.org/37598913:30
*** HyperJohnGraham has joined #openstack-kolla13:30
sdakekfox1111 https://www.youtube.com/watch?v=4gyeixJLabo13:31
sdakekfox1111 when you have time - might check this video13:31
kfox1111cool. thanks.13:31
*** schwicht has quit IRC13:32
sdakekfox1111 re http://logs.openstack.org/41/381041/37/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/946d70e/console.html#_2016-10-05_13_12_58_98386213:32
*** daneyon has joined #openstack-kolla13:32
sdakerun a recheck on that13:32
*** zhenguo has joined #openstack-kolla13:32
sdakekfox1111 we want to eliminate variance in the cloud environment13:32
sdakewe are most likely to be scheduled on osic13:32
sdakeso get the job scheduled to osic (to find the df)13:33
sdakethat df you gave was for internap13:33
*** rstarmer has joined #openstack-kolla13:34
kfox1111http://logs.openstack.org/41/381041/37/experimental/gate-kolla-kubernetes-ansible-workflow-ceph-nv/8c0caa9/console.html13:35
*** rstarmer has quit IRC13:36
*** daneyon has quit IRC13:37
*** rstarmer has joined #openstack-kolla13:37
sdakekfox1111 are you setting /etc/kolla/config/ceph.conf?13:38
kfox1111should look like: http://logs.openstack.org/41/381041/36/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/52c3137/console.html#_2016-10-05_04_53_54_25786013:39
*** rstarmer has quit IRC13:39
*** eaguilar has quit IRC13:44
*** mgoddard_ has joined #openstack-kolla13:45
*** schwicht has joined #openstack-kolla13:46
*** ayoung has quit IRC13:47
*** mgoddard has quit IRC13:48
kfox1111sdake: cool video.13:50
*** jtriley has joined #openstack-kolla13:50
*** dave-mccowan has joined #openstack-kolla13:50
*** huikang has quit IRC13:50
*** MarMat has joined #openstack-kolla13:51
*** huikang has joined #openstack-kolla13:51
*** mgoddard has joined #openstack-kolla13:52
*** caowei has joined #openstack-kolla13:53
*** mgoddard_ has quit IRC13:53
*** salv-orl_ has joined #openstack-kolla13:54
*** huikang has quit IRC13:55
*** salv-orlando has quit IRC13:57
*** mnasiadka has quit IRC13:57
sdakekfox1111 where is the df in that log13:58
sdakekfox1111 i have looked for it for about 15 mins, and still dont see it13:58
*** inc0 has joined #openstack-kolla13:59
sdakekfox1111 ok, so your working directly with /etc/ceph/ceph.conf then?13:59
inc0good morning13:59
sdakeor generating that from genconfig13:59
sdakesup inc013:59
sdakeif your generating that from genconfig, it is unclear to me where exactly you specify that you are using only one disk13:59
sdakethe merge configs stuff is an ansible work, not part of genconfig14:00
*** pbourke has quit IRC14:03
*** pbourke has joined #openstack-kolla14:03
*** lrensing has joined #openstack-kolla14:05
sdakekfox1111 - can you link me the df line from tha tlast log - i dont see it14:05
Jeffrey4l_sdake, could u review this https://review.openstack.org/37273714:05
kfox1111sdake: both. genconfig, then tweaking it a bit.14:05
sdakeJeffrey4l_ yes14:06
Jeffrey4l_thanks.14:06
kfox1111https://github.com/openstack/kolla-kubernetes/blob/master/tools/setup_gate.sh#L12614:06
kfox1111I tweak it a bit more in the PS, but didn't seem to help.14:06
*** huikang has joined #openstack-kolla14:08
*** absubram has quit IRC14:08
*** LamT__ has joined #openstack-kolla14:10
*** huikang has quit IRC14:10
*** huikang has joined #openstack-kolla14:11
*** dims has quit IRC14:11
*** dwalsh has quit IRC14:14
sdakekfox1111 this magic number: http://logs.openstack.org/41/381041/36/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/52c3137/console.html#_2016-10-05_04_53_54_35414414:15
sdakekfox1111 you said infra told you you could use that address14:15
sdakekfox1111 who specifically?14:15
sdakekfox1111 that magic number looks totally suspect to me14:15
sdakeand I still dont see df after looking at  the logs for 30 minutes14:15
*** huikang has quit IRC14:15
*** jtriley has quit IRC14:16
*** dims has joined #openstack-kolla14:17
kfox1111that one's dockers default address.14:19
kfox1111you can see ip stuff here: http://logs.openstack.org/41/381041/36/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/52c3137/logs/ip.txt14:19
kfox1111and14:19
kfox1111here: http://logs.openstack.org/41/381041/36/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/52c3137/logs/routes.txt14:20
*** lamt has joined #openstack-kolla14:20
kfox1111looks like there shouldn't be any conflict there.14:20
sdakekfox1111 what I want to see is df at the same point as the mons report the 64 stuck states14:20
kfox1111k. I'll add one to the trace_error hook14:23
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104114:24
Jeffrey4l_sdake, what's wrong here? https://review.openstack.org/#/c/372737/9/docker/ceilometer/ceilometer-base/Dockerfile.j214:24
sdakeJeffrey4l_ nothing14:24
Jeffrey4l_ /var/log/ceilometer is useless and /var/lib/ceilometer is needed14:24
sdakeJeffrey4l_ just commenting on the fact that /var/log/ceilometer was an error14:25
Jeffrey4l_got.14:25
sdakeit should be ignored - nothing for you to be concerned with14:25
openstackgerritJeffrey Zhang proposed openstack/kolla: integrate gnocchi with ceilometer  https://review.openstack.org/37273714:26
*** dwalsh has joined #openstack-kolla14:27
sdakepbourke whats the story on the -2 on https://review.openstack.org/#/c/375989/14:32
openstackgerritPaul Bourke (pbourke) proposed openstack/kolla: Heka template missing optional params  https://review.openstack.org/38246314:32
pbourkesdake: its not critical to newton being released14:33
sdakepbourke it is indeed critical14:33
sdakeall openstack should have same requirements14:33
sdakeotherwise deps dont install properly14:33
pbourkesdake: hmm ok fair point14:33
kfox1111sdake: konw when 3.x containers will land in the hub? it would be good to test with jewel too.. maybe this is a hammer related bug.14:33
pbourke-2 removed14:33
pbourkesdake: thanks14:34
sdakekfox1111 after the 12th14:34
sdakepbourke roger14:34
Jeffrey4l_sdake, this one https://review.openstack.org/38082414:34
sdakeJeffrey4l_ thanks for fixin that14:36
sdakei noticed that the other day and was like "GROAN"14:36
Jeffrey4l_np ;)14:36
openstackgerritMerged openstack/kolla: Updated from global requirements  https://review.openstack.org/37598914:37
*** senk_ has quit IRC14:38
Jeffrey4l_sdake, pbourke shouldn't we stop merge requirements PS ^^   the requirements is now O branch, and kolla is still on N branch. ^^14:38
*** LamT__ has quit IRC14:38
*** mkoderer has quit IRC14:38
*** hogepodge has quit IRC14:38
*** bmace has quit IRC14:38
*** imcsk8 has quit IRC14:38
*** imcsk8 has joined #openstack-kolla14:38
pbourke:/14:39
*** bmace has joined #openstack-kolla14:39
sdakeoh christ14:39
*** hogepodge has joined #openstack-kolla14:39
pbourkerevert?14:39
openstackgerritMerged openstack/kolla: Handle the KeyboardInterrunpt properly for build.py script  https://review.openstack.org/38082414:39
sdakei thought that was the fix for docker-py14:39
sdakenot all requirements14:39
sdakeyes revert14:39
sdakeand then cherry-pick the docker-py requirements change line14:40
sdakei can do if you like14:40
*** mkoderer has joined #openstack-kolla14:40
sdakepbourke the basic deal was a week or so ago, magnum needed a requirements change for docker-py14:40
sdakeI thought that change was this change14:40
sdakewithout looking at the actual change seeing it changed all kinds of deps14:40
sdakewe don't want that obviously :)14:41
*** DuncanT has quit IRC14:41
*** LamT__ has joined #openstack-kolla14:43
openstackgerritSteven Dake proposed openstack/kolla: Revert "Updated from global requirements"  https://review.openstack.org/38247014:43
sdakepbourke Jeffrey4l_ can you ack that real quick plz14:43
Jeffrey4l_np14:43
*** MagnumBonum has joined #openstack-kolla14:43
MagnumBonumHi! I need some help. I have set up a two-node14:44
MagnumBonum Hi! I need some help. I have set up a two-node OpenStack system using Kolla. However, I cannot launch any instances on the second node. When the first node is full of instances, additional launches fail with Host Not Found.14:45
*** david-lyle has joined #openstack-kolla14:45
MagnumBonumIt passes all filters except for the last one, "Filter ComputeFilter returned 0 hosts"14:45
MagnumBonumMy hunch is that it is related to networking...14:46
*** zhurong has quit IRC14:46
*** DuncanT has joined #openstack-kolla14:47
*** caowei has quit IRC14:47
sdakeMagnumBonum interesting hunch14:47
sdakeMagnumBonum run nova hypervisor-list14:48
MagnumBonumsdake: yup14:48
sdakepaste to a paste service14:48
sdakeMagnumBonum can you ssh from one host to another?14:49
sdakeand visa-versa14:49
MagnumBonumâžœ  ~ openstack hypervisor list -f value 3 control01.beans.local 5 control02.beans.local 7 host04 9 host0514:49
MagnumBonumsdake, yes14:49
MagnumBonumsdake, kolla-ansible deploy went well14:50
sdakeok14:50
sdakeso just run openstack hypervisor list without any options and paste to a paste service14:50
MagnumBonumOK.14:50
MagnumBonumhttp://pastebin.com/kx6wa5jT14:51
sdakeyou ahve a total of 4 nodes there14:51
MagnumBonumI have now disabled control01 to force instances to control0214:52
sdakeis it that you have 2 control 2 compute?14:52
sdakeok well dont do that :)14:52
sdakewe are debugging your environment14:52
MagnumBonumsdake, yes there  are Windows Hyper-V hosts (host4 and host5)14:52
MagnumBonumOK I will enable control0214:52
MagnumBonumsry control0114:52
sdakewell I know nothing of hyperv :)14:53
sdakeso run nova hypervisor-show 3 and 5 in pastes14:53
*** afranc has quit IRC14:53
*** afranc has joined #openstack-kolla14:53
*** david-lyle has quit IRC14:54
MagnumBonumhttp://pastebin.com/UNzhZa9V14:54
sdaket-7 days :)14:54
MagnumBonumsdake: me neither :-D14:54
MagnumBonumThey may be a culprit...14:55
sdakeMagnumBonum in this case, you have no vms running14:55
sdakewith the hypervisor show you showed me14:55
sdakevcpu=0 on both nodes14:55
MagnumBonumyup. should I deploy?14:55
sdakecan you load it up to where yo uthink it breaks14:55
sdakedeploy which?14:56
*** Serlex has quit IRC14:56
sdakeyour application? or openstack14:56
MagnumBonumthe app.14:56
sdakeyup deploy the app14:56
MagnumBonumI am deploying a stack to see that control01 fills up.14:56
sdakeyour assertion that the problem is networking related does not seem correct - nova hypervisor shows both hypervisors as active and ready to rock14:57
MagnumBonumhttp://pastebin.com/qZsvrt1T14:59
*** david-lyle has joined #openstack-kolla15:00
MagnumBonumsry pasted hypervisor 5 twice, hypervisor 3 http://pastebin.com/mVLpjsHg15:00
sdakeMagnumBonum hypervisor show 5 and 3 pls, you did 5 twice15:01
MagnumBonumsdake, yes sry check second paste: sry pasted hypervisor 5 twice, hypervisor 3 http://pastebin.com/mVLpjsHg15:01
*** eaguilar has joined #openstack-kolla15:01
MagnumBonumas can be seen, control01 does not have sufficient disk, the flavor requires 20 GB. But control02 does not seem to be valid...15:02
*** HyperJohnGraham has quit IRC15:02
sdakeline 3515:02
sdakeyour running out of disk space15:02
sdakenot vcpus15:02
sdakeMagnumBonum ^15:02
sdakethat "host can't schedule" thing means some kind of capacity constraint has been hit in nova15:02
sdakeyes - the error isn't helpful at all15:03
sdakeI think its a huge problem for nova15:03
sdakeeveryone complains nobody fixes15:03
MagnumBonumerror message: http://pastebin.com/7wAgFmGF15:03
MagnumBonumyup, sdake. there is not enough disk. so it should launch on control02, no?15:03
sdakeMagnumBonum how about a simple test before deploying your application15:04
*** afranc has quit IRC15:04
MagnumBonumyes, please!15:04
sdakeMagnumBonum possibly it should - but it isn't15:04
sdakeMagnumBonum that really isn't kolla's fault, its novas :)15:04
MagnumBonum:)15:04
*** dwalsh has quit IRC15:04
sdakethe scheduler in nova - if it can't find resources sometimes it sort of just "gives up"15:04
*** g3ek has quit IRC15:04
MagnumBonumso what test should we do15:04
sdakecreate a flavor with small disk space requirements15:04
sdake1 vcpu15:05
sdake256mb of ram15:05
*** schwicht has quit IRC15:05
openstackgerritJeffrey Zhang proposed openstack/kolla: Fix the fail when using keystone fernet  https://review.openstack.org/38249215:05
sdakethen launch 10 vms one at a time15:05
sdakewith this new flavor15:05
MagnumBonumwhat about m1.tiny?15:06
sdakeMagnumBonum run flavor-show m1.tiny15:06
sdakethe m1.* was removed from nova15:06
sdakeso unless you added it, not sure where thats coming from15:06
sdakenova flavor-show m1.tiny15:06
sdakeI'm not sure what m1.tiny is in your environment15:06
*** afranc has joined #openstack-kolla15:07
MagnumBonumI am on mitaka :-/15:07
MagnumBonumdownloading cirros...15:08
sdakeright - it was removed in mitaka15:08
sdakeMagnumBonum can you do the nova flavor-show m1.tiny15:08
sdakei want t o see what m1.tiny is deifned as15:08
*** senk_ has joined #openstack-kolla15:08
sdakewe need something with small ram and small disk and 1 vcpu15:08
sdakeenough to fit 10 of em on that 50gb hard disk you have :)15:09
sdakeDiskFilter: (start: 4, end: 2)15:11
MagnumBonumhttp://pastebin.com/3L32cAPt15:11
sdakewhat this tells me is "hey I tried to schedule 4, but could only end up getting 2 going"15:11
Jeffrey4l_bjolo, around?15:11
Jeffrey4l_bjolo, could u try this fix PS for keystone fernet https://review.openstack.org/38249215:11
MagnumBonumsdake so m1.tiny is 1 GB each which should comfortably fit15:12
sdakeMagnumBonum yup m1.tiny is good15:12
sdakehow did you get kolla to deploy hyperv?15:12
*** schwicht has joined #openstack-kolla15:13
MagnumBonumsdake, they are installed via Cobbler + the CloudBase stuff. May use Ironic in future15:13
sdakeMagnumBonum they are speaking to kolla's control nodes, which means they are authenticating with rabbitmq15:14
sdakehow did you get that part to work?15:14
*** salv-orl_ has quit IRC15:15
MagnumBonumjust entered the rabbitmq user "openstack" + password from /etc/kolla/passwords.yml15:15
MagnumBonumhad to specify a raw IP, not VIP for connection though15:15
MagnumBonumhttp://pastebin.com/AfXbMLZY15:15
*** huikang has joined #openstack-kolla15:15
MagnumBonum10 instances of cirros running!15:15
*** g3ek has joined #openstack-kolla15:16
MagnumBonumand they are spread between control01 and control02 ---- weirdness15:16
sdakethere ya go15:16
sdakeroot caused :)15:16
sdakeMagnumBonum we recommend ceph for storage for your nodes15:17
sdakeor atleast I do :)15:17
MagnumBonumme too, we haven't gotten around to it yet.15:17
sdakeyou can use external ceph (your own deploy) or we have a ceph that is containerized15:17
sdakeboth work well15:17
*** dwalsh has joined #openstack-kolla15:18
sdakewould this scenario happen in the real world (the nova problem)? not sure - i think quota management comes into play here15:18
sdakerecommend filing a bug against nova15:18
sdakesaying that when nova runs out of disk space, other machines are not used to schedule15:18
MagnumBonumok so there is one weirdness, instance 1,2,3 and 6 have two Floating IP.15:18
sdakeand say you are using local disk storage15:18
MagnumBonumOK. so we may be better off using Ceph, this problem would not occur?15:19
sdakeif you dont use ceph your in for a world of pain :)15:19
MagnumBonumOK. sounds like a plan for tomorrow then...15:19
sdakeceph centralizes storage15:19
MagnumBonumyup we want to go there.15:20
sdakerather then decentralizing it as is done with the default config15:20
sdakeMagnumBonum file a nova bug - and then link to me when done and then we can tackle your next issue15:20
sdakeMagnumBonum give them enough logs to work with :)15:21
MagnumBonumI will. thank you for your help.15:21
sdakei'd file bug myself but you have all the data (and the particular problem)15:21
sdakeMagnumBonum roger - thats what we are here for (besides implementing all this stuff in addition:)15:22
MagnumBonumsdake: should I follow this: https://wiki.openstack.org/wiki/Bugs ?15:22
sdakeMagnumBonum here is the link you use:15:22
*** diogogmt has joined #openstack-kolla15:22
sdakehttps://bugs.launchpad.net/nova/+filebug15:22
sdakejust roll with what yo uthink is right - try to give them more then enough information to prove the case15:23
sdakethen once done, I'll confirm that I see same behavior in your environment15:23
*** david-lyle has quit IRC15:26
*** david-lyle has joined #openstack-kolla15:32
kfox1111back15:32
kfox1111sdake: http://logs.openstack.org/41/381041/38/experimental/gate-kolla-kubernetes-deploy-centos-binary-ceph-nv/dd875d4/console.html15:33
kfox1111df at time of error http://logs.openstack.org/41/381041/38/experimental/gate-kolla-kubernetes-deploy-centos-binary-ceph-nv/dd875d4/logs/df.txt15:33
kfox1111looks fine15:33
*** duonghq has joined #openstack-kolla15:34
*** vhosakot has joined #openstack-kolla15:37
sbezverk_sdake: any links for flannel troubleshooting guides?15:38
sdakesbezverk_ none that i know of15:38
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104115:39
sdakekfox1111 that is 90gb, the other time you ran df, it was 150gb15:39
sdakedid this run in osic?15:39
sdakeor somewhere  else15:39
kfox1111sbezverk_: what issue are you having with flannel?15:39
kfox1111oh. here's an osc one: http://logs.openstack.org/41/381041/38/experimental/gate-kolla-kubernetes-ansible-workflow-ceph-nv/6c04d05/console.html15:40
kfox1111http://logs.openstack.org/41/381041/38/experimental/gate-kolla-kubernetes-ansible-workflow-ceph-nv/6c04d05/logs/df.txt15:40
kfox1111each check experimental is producing 3 runs right now.15:41
*** zhenguo has quit IRC15:43
sbezverk_kfox1111: no connectivity between remove nodes, no packets going through flannel interface15:43
sbezverk_kfox1111: remote nodes15:43
kfox1111iptables in the way? (sorry, have to ask)15:43
kfox1111flannel.1 ip addresses non overlapping and in the right space?15:44
kfox1111are there multiple nics in the machines?15:44
sbezverk_kfox1111: nope, nothing no firewalld I have15:44
sbezverk_the same setup and it is working perfectly15:45
sbezverk_but on John's it does not15:45
kfox1111(did you stand it up or did you use a deamonset from the new repo's?)15:45
*** salv-orlando has joined #openstack-kolla15:45
sbezverk_kofox1111: it is the one that came with kubeadm installation15:46
kfox1111etcdctl get /coreos.com/network/config  (or whichever prefix you used15:46
kfox1111ah.15:46
kfox1111there isn't a pure flannel one last I looked. which version did you use?15:46
kfox1111cannal?15:46
inc0kfox1111, why do we need storage_ceph.key if we already have secret defined?15:46
sbezverk_kfox1111: actually they call it canal, combination of flannel with calico15:47
sdakepbourke re https://review.openstack.org/#/c/382463/115:47
sbezverk_and it is version 0.6115:47
sdakepbourke is that needed for everything else too?15:47
kfox1111sbezverk_: ok. was just curious.15:47
inc0https://github.com/openstack/kolla-kubernetes/blob/master/services/common/common-pv.yml.j2#L3515:47
kfox1111it has a hard coded addres space. does that conflict with anything he has?15:47
sbezverk_kfox1111: it does not have hardcoded space, I figured out how to customize it15:48
kfox1111inc0: that stuff is a bit in flux. I never really liked that way of doing things and am working on gutting that code.15:48
sbezverk_and I made sure it is not overlapping anywhere15:48
openstackgerritWaldemar Znoinski (wznoinsk) proposed openstack/kolla: use ironic_conductor volume for conductor's /var/lib/ironic  https://review.openstack.org/37211815:48
kfox1111sbezverk_: yeah, you can easily sed it to something else before importing it. was just double checking you saw that.15:48
inc0kfox1111, so we should be good to just remove this conditional right?15:48
kfox1111inc0: I'm leaving it for now, as if you just delete the entry in the config, it ignores it. I've configuring the gate job to do it differently, using the secret path: https://review.openstack.org/#/c/381041/15:49
sdakeMagnumBonum did you wrap up that nova bug - i was dc'ed briefly15:49
kfox1111inc0: once I get that worklfow working relyiably, then I'll push to have the torage_ceph.key thing totaly removed. its pretty dangerious I think.15:50
kfox1111too corse grained.15:50
MagnumBonumsdake: still working on it...15:50
sdakeMagnumBonum roger welcome to my world :)15:50
kfox1111sbezverk_: can you find the service ip for the cannal etcd?15:50
kfox1111then do something like: etcdctl --endpoint http://... get /coreos.com/network/config15:51
inc0also..fsType does anything in tpls?15:52
kfox1111lets see if averything in there looks ok and the nodes are all registering themselves.15:52
kfox1111tpls?15:52
rhalliseyinc0, what's on the schedule for the meeting?15:52
inc0we hardcode ext4 in templates15:52
kfox1111inc0: the ps fixes that.15:53
*** schwicht has quit IRC15:53
inc0rhallisey, rc2 and summit schedule are my topics15:53
*** david-lyle has quit IRC15:53
rhalliseyinc0, ok15:53
*** schwicht has joined #openstack-kolla15:54
inc0wanna wanna add k8s state-of-union to the bunch?15:54
inc0;)15:54
*** matrohon has quit IRC15:54
rhalliseythe 2 kolla-kubernetes sessions are 1) kolla-kubernetes architecture 2) kolla-kubernetes road map15:54
rhalliseyroad map will be the state-of-the-union15:55
sdakekfox1111 my speculation that out of disk space was the problem is wrong: http://logs.openstack.org/41/381041/38/experimental/gate-kolla-kubernetes-ansible-workflow-ceph-nv/6c04d05/console.html#_2016-10-05_14_30_31_74179715:55
sdakekfox1111 add dmesg to the end of the setup_gate.sh15:55
inc0I was thinking of todays meeting15:55
rhalliseyinc0, have a meeting conflict15:55
rhalliseyoh for today15:55
kfox1111sdake: think that will have anything syslog doesnt have?15:55
sdakekfox1111 also recommend checking your not ooming15:55
sdakekfox1111 YES15:55
rhalliseyinc0, if we do it first I can make it15:55
sdakekfox1111 because your not actually saving syslog :)15:55
*** hrito has joined #openstack-kolla15:55
kfox1111sdake: yeah I am.: http://logs.openstack.org/41/381041/38/experimental/gate-kolla-kubernetes-ansible-workflow-ceph-nv/6c04d05/logs/messages15:56
kfox1111its syslog on ubuntu machines, and messages on redhat machines.15:56
sbezverk_kfox1111: http://paste.openstack.org/show/584495/15:57
sdakekfox1111 i see15:57
kfox1111sbezverk_: that looks ok...15:58
sdakekfox1111 well what we need is just dmesg without all the other stuff :)15:58
sdakekfox1111 and a memory check15:58
kfox1111sbezverk_: are there multiple nics on the boxes, or some of the boxes?15:58
sbezverk_kfox1111: yes15:58
kfox1111sdake: k. I'll gather dmesg. what do you want for the memory check?15:58
sdakekfox1111 not quite sure15:59
sdakekfox1111 if there is a way to get top to print out memory without updating - that would be ideal15:59
sdakethere is probably a one-shot cli operation15:59
sdakebut i am connected to vpn atm15:59
sdakeand can't actually login to my machines to tell you and we have team meeting now ;)15:59
kfox1111probably a way to get ps to do that...15:59
kfox1111sbezverk_: one issue I had with flannel on multinic machines was which ip it used for tunneling.16:00
inc0meeting time folks16:00
kfox1111if the nodes have different nics and each one can't talk to all the others, it might cause issues.16:00
kfox1111I've seen it by picking two hosts, A and B,16:00
kfox1111running ping on A to the ip of flannel.1 on B16:01
kfox1111then doing a tcpdump on B for the vlxan traffic.16:01
kfox1111I noticed the ip on the response from B was going to a not desired ip.16:01
MagnumBonumsdame16:02
sbezverk_kfox1111: well it is not the case here, stats on flannel interface are all 016:02
MagnumBonumsdake so how do I tag you16:02
sbezverk_so the traffic from pod is not even making to the interface to be encapsulated16:02
MagnumBonumsdake here is the bug https://bugs.launchpad.net/nova/+bug/163065816:03
openstackLaunchpad bug 1630658 in OpenStack Compute (nova) "nova-scheduler fails when running out of disk space" [Undecided,New]16:03
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104116:05
sdakeMagnumBonum ok we have team meeting now but i did leave a comment in your support :)16:07
*** huikang has quit IRC16:11
*** huikang has joined #openstack-kolla16:11
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: Split endpoint jobs and start testing the deployments  https://review.openstack.org/38086816:14
*** eaguilar has quit IRC16:14
*** huikang has quit IRC16:16
sbezverk_kfox1111: please check my comments in PS 30 38086816:17
*** haplo37_ has quit IRC16:18
openstackgerritMerged openstack/kolla: Heka template missing optional params  https://review.openstack.org/38246316:18
*** haplo37_ has joined #openstack-kolla16:21
*** DanyC has joined #openstack-kolla16:22
sdakeshort meeting16:25
duonghqvery short indeed, huh?16:25
inc0yeah, everyone knows what to do:)16:25
duonghqtoday, I hit my keyboard many times because keepalived container failed to grap the VIP16:26
duonghqanybody got this kind of error?16:26
kfox1111rhallisey: thanks for the note about the blueprints... I havent looked at them in a while. I just closed a bunch of done stuff and left some status comments.16:26
duonghqthe keepalived doesn't get MASTER role, indeed16:26
*** DanyC has quit IRC16:26
*** egonzalez90 has quit IRC16:27
kfox1111sbezverk_: you see no tcpdump traffic at all over the vxlan ports?16:28
kfox1111sbezverk_: k16:28
*** ayoung has joined #openstack-kolla16:28
kfox1111sbezverk_: we need to go through the code base and just do a ws cleanup. theres a lot of that in there. :/16:29
*** DanyC has joined #openstack-kolla16:29
sdakeMagnumBonum ok meeting over- you have a second problem ?16:29
berendtsorry, i missed the meeting :/16:31
sdakeberendt there are logs16:31
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: Split endpoint jobs and start testing the deployments  https://review.openstack.org/38086816:31
sdakeberendt along with a general finish the job 4 step process16:32
sdakeberendt worth reading i think - atleast the steps ;)16:32
*** DanyC has left #openstack-kolla16:32
*** unicell1 has joined #openstack-kolla16:32
duonghqsdake: did you got above keepalived problem?16:33
*** unicell has quit IRC16:33
kfox1111sbezverk_: I fixed the ws issue.16:33
sdakeduonghq i haven't - can you file a bug16:33
sbezverk_kfox1111: ok16:34
duonghqsure, but I'm not sure it's really a bug, today it hit me many time, tomorrow maybe it run away16:35
hritosdake: hi, i wrote a bp and its status is discussion now. what is the next step?16:36
sdakehrito in flux - inc0 has talked about going to a specs process and I think we think its a good idea16:36
sdakeduonghq i see - so is inconsistent from day to day, but comes and goes?16:37
inc0hrito, basically our code is frozen now, for week more16:37
inc0but please link the bp so we can discuss16:37
inc0until we agree on new spec process old process keeps working16:38
sdakeinc0 wfm16:38
hritohttps://blueprints.launchpad.net/kolla/+spec/graceful-shutdown16:38
duonghqsdake: it comes today, for about 80%16:38
*** huikang has joined #openstack-kolla16:38
inc0hrito, it's accepted:)16:38
inc0and imho very important, so as soon as we branch newton (next week approx) you're free to push code16:39
hrito:)16:39
kfox1111hrito: awesome. :)   do you know which processes support it? I implemented it a different way for kolla-k8s and would like to use the native solution if possible.16:39
inc0you still can publish reviews now, just will not be reviewed with high priority (bugs comes first) or merged (feature freeze)16:40
sdakehrito ya - just lookingat it already it was marked for ocata16:40
inc0for next week16:40
sdakehrito you can work on it now, but wont be able to commit it to repo for about a week16:40
sdakeits priority was also essential16:40
hritograceful shutdown is implemented in oslo.service16:40
kfox1111hrito: kubernetes has a preStop hook that we can use to ask it to gracefully shutdown.16:41
hritoso if process uses it, we can use graceful shutdown16:41
kfox1111just need to know what the command is, and which services support it.16:41
duonghqsdake: ah, remembered, if it's failed and I leave it alone for awhile and deploy again, it's ok until I destroy, reboot the node and deploy16:41
*** dwalsh has quit IRC16:44
hritosend SIGTERM to processes hooks graceful shutdown, and its implemented nova, cinder and newtron processes as far as i know16:45
*** unicell1 has quit IRC16:46
qwangrhallisey: Hi Ryan. I have some questions about the workflows in kolla-k8s, do you have a moment now?16:47
inc0qwang, he mentioned that he had another meeting around now, so you might want to ask agian later:)16:48
qwanginc0: sure. thanks16:49
*** david-lyle has joined #openstack-kolla16:52
*** huikang has quit IRC16:56
*** huikang has joined #openstack-kolla16:56
sdakeduonghq wierd16:58
kfox1111hrito: was it added in neuton or is it supported in mitaka too?16:58
sdakeduonghq is it reproducibel?16:58
sdakeduonghq sounds like a bug, but I don't see it16:58
sdakeduonghq but I don't reboot running nodes often16:58
duonghqsdake: tommorow I think I can reproduce it quite easy16:59
*** senk_ has quit IRC16:59
duonghqsdake it fails even without reboot16:59
duonghqjust after the cluster is destroy16:59
sdakedefinately neverseen that16:59
sdakeand I do that all the time16:59
duonghqthe keepalived container take very long time to got MASTER state16:59
duonghqsdake: yup, today is the 1st time I got it17:00
sdakeduonghq please file a bug :)17:00
sdakeduonghq atleast we can get a place to track this kind of conversation17:00
duonghqon both 14.04 and 16.04 host (VM in VirtualBox)17:01
sbezverk_kfox1111: I kind of pinpoint the failure point, but not sure where to go after. have second to discuss?17:01
kfox1111yeah17:01
*** huikang has quit IRC17:01
sbezverk_kfox1111: the issue is veth is not plugged or plugged incorrectly into docker17:01
kfox1111I don't think they plug into docker0 with canal17:02
sbezverk_I tested connectivity and between flannels node I can ping their interfaces17:02
hritokfox1111: i think it is supported in mitaka, but i have not confirmed it17:02
kfox1111from the flannel interfaces?17:02
sbezverk_right17:02
kfox1111k.17:03
kfox1111hrito: k. thanks.17:03
sbezverk_when I ping from container I do see ping traffic on veth cali414aa30554e17:03
sbezverk_but then nothing gets bridged to flannel17:03
sbezverk_so whatever ties these to together is either missing17:04
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104117:04
sbezverk_or misconfigured17:04
kfox1111hmm...17:04
kfox1111can you ping the host its on from the container?17:04
sbezverk_kfox1111: if it is not docker then what bridge canal would use to plug veth interfaces and flannel?17:04
kfox1111this may be a calico issue instead of a flannel one. unfortunately I've never debugged calico yet. but do want to learn... :)17:05
sbezverk_kfox1111: nope when I try to ping from container local's host flannel interface it fails17:05
kfox1111what does a 'brctl show' show?17:05
sbezverk_but in my setup it works17:05
*** dwalsh has joined #openstack-kolla17:05
sbezverk_there is no linux bridge installed17:06
kfox1111no linux bridges or no tools?17:06
kfox1111cause it can be setting up the bridges without tools.17:06
sbezverk_neither I think17:06
sbezverk_will install17:07
kfox1111lets double check. on both the working and non working systems.17:07
kfox1111k.17:07
sbezverk_kfox1111: on both working and not working see the same thing, one liner for docker017:08
kfox1111and no members?17:08
*** daneyon has joined #openstack-kolla17:08
sbezverk_it seems calico is not using bridging I remember it is pure L3 solutiob17:09
sbezverk_nada17:09
kfox1111thats what I thought.17:09
kfox1111so the calico interfaces get traffic to the flannel.1 interface somehow else.17:09
*** mkoderer has quit IRC17:09
kfox1111I dont think the calico interrfaces have ip's...17:09
kfox1111http://logs.openstack.org/41/381041/40/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/22c3b34/logs/ip.txt17:10
kfox1111hmm.. not ipv4 addressses anyway.17:10
sbezverk_kfox1111: nope but I do see traffic coming from container17:10
*** strigazi is now known as strigazi_AFK17:10
kfox1111http://docs.projectcalico.org/en/1.3.0/architecture.html17:11
kfox1111it might mess with iptables a bunch?17:12
kfox1111ah, here we go: http://logs.openstack.org/41/381041/40/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/22c3b34/logs/routes.txt17:12
kfox1111the calico interfaces get routes on the host.17:13
kfox1111do you see that?17:13
*** daneyon has quit IRC17:13
sbezverk_default         169.254.1.1     0.0.0.0         UG    0      0        0 eth017:14
sbezverk_169.254.1.1     *               255.255.255.255 UH    0      0        0 eth017:14
sbezverk_same thing on both sides, working and not working17:15
kfox1111thats it?17:15
kfox1111and canal on both?17:15
sbezverk_this is from the container perspective17:16
kfox1111oh. sry. do the route on the host.17:16
sbezverk_yes they are identical17:16
kfox1111should see calico interfaces and flannel17:17
sbezverk_http://paste.openstack.org/show/584512/17:17
kfox1111looks like enp2s0f1 already has a 10 space address.17:19
kfox1111looks non overlapping....17:19
sbezverk_right I calculated all subnet masks :-)17:20
rhalliseyqwang, hey17:20
kfox1111and on the host you can ping flannel.1 but not the ip of the container?17:20
qwangrhallisey: hi Ryan17:21
*** hrito has quit IRC17:21
kfox1111hmm.... is that eth0 route from above from the container on the non working host? it seems like it only has a link local address?17:21
rhalliseyqwang, what questions did you have?17:21
kfox1111that seems maybe wrong.17:21
*** unicell has joined #openstack-kolla17:21
sbezverk_eth0 side is working17:22
qwangrhallisey: I am reading your patch set about ansible support for a few openstack services: mariadb/rabbimq/glance/keystone17:23
*** mgoddard has quit IRC17:23
qwangrhallisey: and i am trying to help with it.17:23
kfox1111sbezverk_: but it doesn't have the ip the route lists?17:23
rhalliseyqwang, excellent17:23
rhalliseywe can divide up some of the other services17:23
qwangrhallisey: I don't know how you do the tests, tho17:24
openstackgerritMartin Matyas proposed openstack/kolla: Patch for magnum minion register issue  https://review.openstack.org/38257917:24
rhalliseyqwang, ok. I'll write a doc as part of that patch chain that will describe it17:24
sbezverk_kfox1111: are you refering to paste here above or paste.openstack.org17:24
sbezverk_?17:24
kfox1111sbezverk_: the one in the irc logs here.17:25
*** mark-casey1 has joined #openstack-kolla17:25
qwangrhallisey: great. thank you so much17:25
MarMatsdake something like this? https://review.openstack.org/#/c/382579/17:25
sbezverk_kfox1111: there are identical on both sides working and not working17:25
*** senk_ has joined #openstack-kolla17:25
kfox1111rhallisey: have a look at: https://review.openstack.org/#/c/380868/   it will cause the ansible workflow to need to change a little big17:25
kfox1111bit17:25
kfox1111sbezverk_: yeah, but I'd expect different ip addresses, so near identicle? I mean 'ip a', not 'route'17:26
sbezverk_k17:26
rhalliseykfox1111, so difference names for the bootstrap jobs17:27
rhalliseygotcha17:27
kfox1111rhallisey: yeah. its split out, and there's a new ordering constraint to work around ansible doing too much in one call.17:27
rhalliseyand ordering constraint17:28
sdakemarmat what about binary distros?17:28
sbezverk_kfox1111: http://paste.openstack.org/show/584518/17:28
MarMatsdake hm...17:28
kfox1111there's a race in the ansible jobs. it does 2 things in one cli call. 'a, look up service_id. if it doesn't exist, create service_d. b, create endpoint associated with service_id'17:29
sdakeMarMat the work looks good - but arey ou saying we should say binary distros dont work for magnum?17:29
*** mark-casey1 has quit IRC17:29
*** DanyC has joined #openstack-kolla17:29
kfox1111if you launch multiple of them at the same time, the a part can race, and cuase wo different service_id's to be created for the same service. causing later issues.17:29
sbezverk_kfox1111: btw that ps 380868, deployment logs even though shows success, does not really deploy anyhitng17:29
sdakeMarMat lets go to #openstack-containers again plz17:30
sbezverk_kfox1111: so before I workflow it I will have to test it, unless Ryan can test it earlier17:30
MarMatsdake any example how are the binary packages patched usually?17:30
MarMatsdake I'm there already, no reaction on my question17:30
kfox1111sbezverk_: that looks good.17:30
rhalliseykfox1111, ok cool.  Ya we need everything to be fully granular17:30
kfox1111sbezverk_: it tests things only in the ceph case. the iscsi stuff doesn't work until there's a release of kolla.17:31
kfox1111sbezverk_: so long as it gets past the part where it deployes the endoints, and checks to see if they are all properly created, then it works.17:31
*** shardy is now known as shardy_afk17:32
kfox1111rhallisey: longer term,  I think we want to break that up into the two different pieces, and at that point, we can just use the openstack cli rather then ansible. will make people hapier anyway.17:32
sbezverk_kfox1111: totaly agree but I could not find any occurences of success, please point me out17:32
kfox1111sbezverk_: sec17:32
kfox1111http://logs.openstack.org/68/380868/32/experimental/gate-kolla-kubernetes-deploy-centos-binary-ceph-nv/67e577b/console.html17:33
kfox1111ceph is still not working in the gate, so it got all the way up to where it tried to upload the glance image to ceph and hang.17:33
*** DanyC has quit IRC17:34
kfox1111but it got to the point where all services were running, and passed the endpoint test here:17:34
kfox1111http://logs.openstack.org/68/380868/32/experimental/gate-kolla-kubernetes-deploy-centos-binary-ceph-nv/67e577b/console.html#_2016-10-05_16_58_56_28002417:34
kfox1111all endpoints have 3 entries.17:34
kfox1111it was failing that check before the gate test.17:35
kfox1111it was failing that check before the job split/order.17:35
rhalliseyqwang, to test the existing ansible patches use: ansible-playbook -i workflows/ansible/inventory/all-in-one workflows/ansible/site.yml -e @/etc/kolla-kubernetes/kolla-kubernetes.yml17:35
rhalliseyqwang, that should be able to get you started17:35
qwangrhallisey: thank you. will test with it. I'll start with the horizon role17:36
*** huikang has joined #openstack-kolla17:36
kfox1111sdake: http://logs.openstack.org/41/381041/41/experimental/gate-kolla-kubernetes-deploy-centos-binary-ceph-nv/4ef0df5/logs/dmesg  not much interesting. :/17:37
rhalliseyqwang, awesome :)17:37
sbezverk_kfox1111: ok17:38
sdakekfox1111 try without selinux17:38
sdakekfox1111 and try with btrfs17:39
kfox1111sdake: selinux is disabled.17:39
kfox1111the setup script disables it.17:39
kfox1111so its enabled for a bit, then turned off.17:39
sdakekfox1111 permissive or disabled?17:39
kfox1111https://github.com/openstack/kolla-kubernetes/blob/master/tools/setup_gate.sh#L13417:40
sdakekfox1111 what about btrfs?17:40
kfox1111not sure why it would matter? ceph is using the loopback device.17:41
kfox1111hmm...17:41
sdakekfox1111 i mean for the graph driver17:41
sdakethere is an ioctl error in there17:41
sdakein devicemapper17:41
kfox1111well, I ugess its using aufs on ubuntu and loopback on centos.17:41
*** bjolo_ has joined #openstack-kolla17:41
kfox1111and both fail the same way17:41
sdakedm is a pile of groan17:41
*** MagnumBonum has quit IRC17:42
*** haplo37 has quit IRC17:43
kfox1111hre's one for an ubuntu box: http://logs.openstack.org/41/381041/41/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/997569e/logs/dmesg17:44
*** haplo37 has joined #openstack-kolla17:44
*** athomas has quit IRC17:45
kfox1111sdake: here's a slightly different one, where ceph seems to be quite happy: http://logs.openstack.org/68/380868/32/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/a2bcfb0/logs/17:47
kfox1111don't see these too often.17:48
kfox1111unfortunatly, its not in the ceph branch, so not much ceph logging. :/17:48
kfox1111but you might be able to spot some node differences.17:48
*** dwalsh has quit IRC17:54
*** tonanhngo has quit IRC17:57
*** tonanhngo has joined #openstack-kolla18:01
*** SamYaple_ has joined #openstack-kolla18:04
*** SamYaple_ has quit IRC18:04
*** SamYaple has quit IRC18:04
*** SamYaple has joined #openstack-kolla18:04
kfox1111SamYaple: alive?18:05
SamYaplekfox1111: always18:05
SamYapleunless this has to do with ceph18:05
kfox1111rumber has it you ....18:05
kfox1111hehe :)18:05
kfox1111saw me coming. :)18:05
*** berendt has quit IRC18:05
SamYapleheh. is this ceph in the gate question?18:05
kfox1111ceph/gate=bad, ceph/anywhere else, good.18:06
kfox1111totally weirded out by this.18:06
*** mark-casey1 has joined #openstack-kolla18:06
SamYaplenah its not that. its a space issue *mostly*18:06
SamYapleso there is limited and varying amount of space throughout all the gates18:07
kfox1111runs with the same params in minikube. :/18:07
kfox1111I near doubled the space in the gate just to make sure. and validated there is pleanty of space on the gate vm's.18:07
SamYaplekfox1111: oh i doubt it!18:07
SamYapleyou forget, there are like 8 different cloud providers18:08
*** huikang has quit IRC18:08
SamYaple1 of them has like 20GB root disk or something18:08
SamYaplei forget the exact details to be honest18:08
SamYapleand its possible it changed18:08
*** huikang has joined #openstack-kolla18:08
kfox1111I've set up logging up the wazoo and verify on failure that df shows pleanty of free space.18:08
SamYaplebut it was not possible to run on all the gates with enough space for ceph18:08
kfox1111its not out of free space.18:08
SamYapleoh wait, lets take a step back18:08
kfox1111I'm doing it on minikube with 3gigs of loopback for ceph.18:09
SamYapleyou are saying ceph doesnt _work_ at all?18:09
SamYaplei could have implemetned ceph, but i didnt have space too18:09
kfox1111The issue I see is,18:09
SamYapleare you saying you did implemnet and its not working?18:09
kfox1111most of the time, ceph gets stuck allocating the pg's, for example:18:09
kfox1111http://logs.openstack.org/41/381041/41/experimental/gate-kolla-kubernetes-ansible-workflow-ceph-nv/ddb4997/console.html18:09
kfox1111after 4 min there, it still hadn't allocated any pg's.18:10
*** duonghq has quit IRC18:10
kfox1111but the same test occationally works, and the pg's schedule.18:10
kfox1111and then all is well with ceph from then on out.18:10
SamYapleim sure you just didnt update the crush map ruleset, right?18:10
kfox1111I didn't. but did tweak the config file a bit. right before the rbd create there, you can see the ceph.conf used.18:11
*** Satya_ has joined #openstack-kolla18:11
Satya_Hi Everyone18:11
kfox1111i did tweak the pool sizes and crush leaf type.18:11
*** LamT__ has quit IRC18:11
Satya_Want to check if anyone configured the designate with kolla18:11
Satya_i saw the config and templates are there already with kolla18:12
kfox1111with minikube with the same setup, I can laucnh vm's with glance images in ceph, and cinder volumes out of ceph attached.18:12
*** huikang has quit IRC18:13
kfox1111somethings weird about the gate vm's though that differ from minikube in some way. (a lot of ways, but I've thrown out a lot of them. not iptables, not multiple nics, not out of space, not distro)18:13
kfox1111the logs from ceph look basically identicle between working and nonworking.18:13
kfox1111the pg's just don't schedule.18:13
SamYaplekfox1111: just looked at the logs, so based on the proceedure of what you did, you need to restart your ceph-osd container after you create the kollavolumes pool and deleted the rbd pool18:15
SamYaplesometimes teh pgs get stuck when dropping to 1 replica18:16
SamYaple*technically* you can do all this stuff before creating the first ceph-osd in teh ceph-mon container and then you wont have this issue18:16
kfox1111oh, really? interesting...18:16
kfox1111ok. let me reorder the script. :)18:16
SamYaplethis is really only a problem because of the single osd18:17
SamYapleyou might be better off keeping the default of 3 (or dropping it to 2) and changing the rush map so its ok with single host18:17
SamYapleso two osds on a single host satifies18:17
kfox1111yeah. it was just more work doing 2 then 1. (I thought)18:18
SamYapleyou arent wrong. but ceph does get stuck like this when you drop below 2 replicas18:19
SamYaplerestarting the osd is normally enough to fix it18:19
SamYaplethe behaviour is pretty undefined though, consider it a Wishlist bug for ceph18:19
kfox1111yeah. probably not well tested...18:20
*** salv-orlando has quit IRC18:22
kfox1111hmm.... gota redo a little bit of logic to get ceph-admin going... if osd is later...18:22
SamYaplei would personally jsut restart the ceph-osd18:23
SamYapleseems easiest18:23
SamYapleyou guys need to have a ceph health check parser anyway18:23
SamYaplefor upgrades18:23
kfox1111k. will try that.18:23
kfox1111could just kubectl exec -it ceph-osd 'killall ceph-osd' i guess.18:24
SamYapleok well im going back to Rust! let me know if you have other questions i might be able to help with. because that kube thing is beyond me18:24
kfox1111thanks for the help. :)18:24
*** david-lyle has quit IRC18:25
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104118:25
*** tonanhngo has quit IRC18:27
*** DanyC has joined #openstack-kolla18:28
Satya_@SamYaple just want to check the deployment of designate with kolla18:31
*** senk_ has quit IRC18:32
rhalliseySamYaple, rust :)18:32
openstackgerritMauricio Lima proposed openstack/kolla: [WIP] Add HNAS as Manila backend  https://review.openstack.org/38205418:32
*** DanyC has quit IRC18:33
SamYapleSatya_: i dont think its up to date for mitaka18:38
Satya_i am looking at the master branch18:39
SamYapleSatya_: but someone will have to answer that for sure. there was major changes in Mitaka that im 95% sure never made it (or were worked on since)18:39
SamYaplethere are even more changes in Newton18:39
SamYaplei would be shocked if designate is up to speed in Kolla18:39
SamYapleit has radically changed in the way it is setup18:40
SamYaplerhallisey: ive been using rust! i like it18:40
Satya_i want to deploy that but not sure what changes need to go to globals and multinode18:40
SamYapleSatya_: designate has no ansible playbooks in the master branch of the kolla repo18:42
SamYapleSatya_: so its certianly not there18:42
Satya_i saw something here "https://github.com/openstack/kolla/tree/master/docker/designate"18:43
SamYapleSatya_: thats an image, not the deployment of said image18:43
Satya_ok18:43
SamYapleits confusing because currently Kolla has images in the same repo as deployment, but my understanding is that is changing?18:43
Satya_ansible playbook is not yet created?18:43
SamYapleansible playbook was removed because it wasnt maintained IIRC18:44
rhalliseySamYaple, neat :).  I'll have to explore :)18:44
kfox1111SamYaple: yeah. going to break out the ansible stuff to its own repo.18:44
rhalliseySamYaple, yes that is changing18:44
kfox1111a lot of folks are using the kolla containers.18:44
kfox1111some folks are scared off by ansible.18:44
SamYaplerhallisey: im using it to interface with my waterrower! ive built a small app that I will eventually be using to in-real-time inform me if my launch-to-recovery time ratio is good18:45
SamYaplerhallisey: https://libraries.io/cargo/waterrower18:45
SamYaplei dont know which dev community is worse, Ansible or Docker18:45
SamYapleprobably Docker, but Ansible gives them a run for thier money!18:46
*** HyperJohnGraham has joined #openstack-kolla18:47
kfox1111arg.. the kill didn't seem to work...18:47
kfox1111big guns time...18:48
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104118:49
rhalliseySamYaple, oh nice!18:49
rhalliseySamYaple, on the rower there is a display that graphs you drive18:50
rhalliseycould also be a useful indicator18:50
SamYaplethe rower displays distance, stroke rate, (heartrate if attached), and time i believe18:50
SamYapleno graph18:50
SamYapleat least on mine18:50
rhalliseySamYaple, that's right you don't have a concept218:51
rhalliseygenerally those have it18:51
SamYaplei have a waterrower, with s4 monitor18:51
SamYaplebut thats why im writting this! its been a fun project for the past few days18:51
rhalliseynice :)18:51
rhalliseywill you be in Barcelona?18:52
SamYaplei will not be. im not sure how many summits I will be going to in the future. ive really settled into this working form home thing18:52
rhalliseygotcha18:52
*** DanyC has joined #openstack-kolla18:53
SamYaplewho am i kidding? im just going to rewrite openstack in rust18:53
SamYaplewithout all the openstack pieces18:53
rhallisey:)18:53
*** haplo37_ has quit IRC18:53
*** DanyC has quit IRC18:53
rhalliseyperfect18:53
*** DanyC has joined #openstack-kolla18:54
kfox1111I'd settle for just nova. ;)18:54
SamYaplekfox1111: yesh. i know right?18:55
kfox1111I want someone to take a stab at implementing a nova api wrapper that just launches k8s pods with qemu in them to launch the vm's.18:55
*** haplo37_ has joined #openstack-kolla18:56
kfox1111I think all the api I can think of has coresponding stuff in k8s that it can be mapped to.18:57
kfox1111flavors/az/hostaggregates = nodelabels/podlables. it already has a scheduler and restapi.18:58
kfox1111there would be a 1/1 mapping between vm and k8s pod/container.18:58
openstackgerritMauricio Lima proposed openstack/kolla: [WIP] Add HNAS as Manila backend  https://review.openstack.org/38205418:58
openstackgerritQin Wang (qwang) proposed openstack/kolla-kubernetes: Add ansible workflow for Horizon  https://review.openstack.org/38262018:58
openstackgerritMauricio Lima proposed openstack/kolla: [WIP] Add HNAS as Manila backend  https://review.openstack.org/38205419:00
kfox1111SamYaple: same error.19:02
*** matrohon has joined #openstack-kolla19:02
kfox1111I flat out killed the osd container and rebuilt it. no joy. :/19:02
openstackgerritRyan Hallisey proposed openstack/kolla-kubernetes: Workflow Gate Testing  https://review.openstack.org/38202119:05
openstackgerritRyan Hallisey proposed openstack/kolla-kubernetes: Ansible workflow for Glance  https://review.openstack.org/38080119:05
*** lamt has quit IRC19:08
*** tonanhngo has joined #openstack-kolla19:09
SamYaplekfox1111: im not sure what rebuilt means here, it should just be restarted19:12
kfox1111I deleted the container and k8s rebuilt it. all hte data's kept though.19:13
kfox1111I'm redoing everything to launch a second osd now. :/19:14
SamYapleyou could also try not deleting rbd and simply changing the size of it to 1 copy19:15
kfox1111didn't help. tried that already. :/19:15
SamYaplethere is a combination of little things like that which will work for sure19:15
SamYapleits a bit annoying, but ceph is designed to eb distributed and redundant19:15
kfox1111yeah. thats why I just gave up and am adding a second.19:15
SamYapleremoving both of those like you are doing is bad :P19:15
*** HyperJohnGraham has quit IRC19:16
*** DanyC_ has joined #openstack-kolla19:22
*** david-lyle has joined #openstack-kolla19:23
*** DanyC has quit IRC19:23
*** lrensing has quit IRC19:24
kfox1111oh... I see one thing off in the template... hostname is set to minikube...19:24
kfox1111I wonder if that will make a difference... shouldn't... but still...19:24
*** DanyC_ has quit IRC19:24
sdakeMarMat for some reason i thought we were in this channel19:25
MarMatsdake back from other room, it's https://bugs.launchpad.net/kolla/+bug/1630248 I need to double check, the fact is that I did not test it in reconfiigure scenario19:25
openstackLaunchpad bug 1630248 in kolla "magnum genconfig fails" [Undecided,New]19:25
kfox1111oh. nm... just on my machine....19:26
kfox1111darn.19:26
sdakeMarMat been triaged19:28
*** DanyC has joined #openstack-kolla19:29
*** dwalsh has joined #openstack-kolla19:31
*** bjolo_ has quit IRC19:33
*** DanyC has quit IRC19:34
*** HyperJohnGraham has joined #openstack-kolla19:34
openstackgerritRyan Hallisey proposed openstack/kolla-kubernetes: Workflow Gate Testing  https://review.openstack.org/38202119:35
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104119:35
openstackgerritMauricio Lima proposed openstack/kolla: [WIP] Add HNAS as Manila backend  https://review.openstack.org/38205419:42
*** sdake has quit IRC19:45
*** sdake has joined #openstack-kolla19:46
sdakerbergeron hecho en mexico coca cola has arrived - 4 24 packs straight to my door at store pricing19:48
kfox1111I wonder if this is the orcestrator deleting the bootstrap pods before they are done...19:49
kfox1111no other service does it that way, as the kolla bootstrap scripts are a little sketchy in the ceph containers...19:50
*** daneyon has joined #openstack-kolla19:51
kfox1111hmm... first try worked... so either lucky, 2 osd's helped, or it was the extra sleep time on bootstrap...19:51
*** HyperJohnGraham has quit IRC19:52
kfox11112 in a row...19:52
kfox1111so either extremely lucky, 2 osd's helped or it was the extra sleep time on bootstrap. :)19:52
kfox11113 in a row! :)19:53
*** salv-orlando has joined #openstack-kolla19:53
kfox1111I don't buy I'm that lucky. not with the week I've had so far. :)19:53
kfox1111this is a good sign. :)19:54
*** salv-orl_ has joined #openstack-kolla19:54
*** daneyon has quit IRC19:55
*** HyperJohnGraham has joined #openstack-kolla19:57
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104119:58
*** salv-orlando has quit IRC19:58
*** dwalsh has quit IRC19:59
*** jtriley has joined #openstack-kolla20:06
*** schwicht has quit IRC20:07
*** matrohon has quit IRC20:11
kfox1111consistently healty ceph now. :)20:14
*** mark-casey1 has left #openstack-kolla20:17
*** Pavo has joined #openstack-kolla20:17
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104120:21
*** tonanhngo has quit IRC20:24
*** schwicht has joined #openstack-kolla20:24
openstackgerritRyan Hallisey proposed openstack/kolla-kubernetes: Workflow Gate Testing  https://review.openstack.org/38202120:24
sdakekfox1111 what fixed teh gatefor you?20:24
*** Pavo has quit IRC20:27
*** Pavo has joined #openstack-kolla20:27
kfox1111sdake: one of two things... I added a second osd, and I found a potential case where the bootstrap doesn't finish before the container is killed. I added a 10 second sleep.20:29
kfox1111haven't narrowed it down to which of the two it is.20:29
kfox1111I suspect the second case is the real cause.20:29
sdakewhy wuld the bootstrap container get killed?20:29
kfox1111as it works with one osd with minikube, but its manually orchestrated, so slower to delete the bootstrap.20:30
kfox1111so...20:30
kfox1111heres the deal..... :)20:30
sdakebootstrap doesn't exit automatically?20:30
kfox1111the bootstrap script in the docker container does some... interesting things.20:31
sdakei know i wrote alot of it20:31
*** HyperJohnGraham has quit IRC20:31
sdakei understand it may not be ottally compatibl with k8s atm20:31
kfox1111it allocates a ceph id, then formats the disks, dadada da da, then registers it all in the ceph and alls well.20:31
kfox1111the problem was, when I mapped it to a kubernetes job, if it failed after the first step,20:32
sdakeoh that part20:32
kfox1111kubernetes restarts the job, and it spins creating random ceph id's.20:32
kfox1111:/20:32
kfox1111so I didn't do it as a job, but a pod with restart policy none.20:32
*** schwicht has quit IRC20:32
kfox1111but my orchestration script doesn't really handle a pod entering ready state, but isn't done running. as its kind of half way between a job and a pod.20:33
MarMatJeffrey4l_ ping20:34
kfox1111I just need to make a different check for those pods.20:34
sdakekfox1111 cool sounds good20:35
sdakekfox1111 sleep 10s not ideal :)20:35
kfox1111+1.20:35
kfox1111horibly hackish. but identified a potential problem.20:36
kfox1111if I pull the multiost thing and it still works, then its the sleep for sure.20:36
*** schwicht has joined #openstack-kolla20:36
kfox1111and that should be easy to get rid of.20:36
kfox1111so weird...20:40
kfox1111somehow docker exec and kubectl exec are very different when it comes to /sys read/write...20:40
*** jtriley has quit IRC20:40
sdakejust guessing, kubectl exec would be slower20:41
sdakeperhaps far far slower20:41
kfox1111rbd map works when running via kubectl exec but not via docker exec into the same type of container.20:42
sdakehmm20:43
sdakemakes sense if the speed thing is accurate20:43
sdakedocker exec is faster20:43
sdakedocker exec may not be synchronized in our scripts where it is needed20:43
sdaketypically we rely on ansible for that as well20:44
kfox1111thits a20:48
kfox1111the weird thing is,20:49
kfox1111the error is "/sys is read only" on the docker exec case, but read write on the kubectl exec case.20:49
kfox1111I think thats the only thing stopping this idea from working.20:49
sdakewhy not retry the docker exec casse20:50
kfox1111what do you mean?20:50
sdakei think what is happening is your docker execing before the container is setup20:50
sdakeso delay the docker exec20:50
kfox1111it can.t let me show you...20:50
kfox1111https://review.openstack.org/#/c/381041/46/services/ceph/ceph-rbd-pod.yml.j220:50
sdakehow about a line # :)20:51
kfox1111the container comes up, writes out a /usr/bin/rbd shell script on the host, then sleeps.20:51
kfox1111kubelet on the host then calls /usr/bin/rbd whenever it needs to mount a ceph rbd volume.20:51
kfox1111the script should docker exec back into the same container and run the rbd command there.20:52
kfox1111which it does, but its erroring because the /sys volume is read only.20:52
sdakein the docker case?20:52
kfox1111yeah.20:52
*** schwicht has quit IRC20:52
kfox1111but, in the setup_gate.sh script,20:52
sdakehow fast after container startup until the ceph rbd volume is created?20:53
kfox1111I kubectl exec into the ceph-admin script, rbd map volume, format it, and unmount it just fine.20:53
sdakeor mounted20:53
kfox1111minutes. and kubelet retries a few times with a minute or two between.20:53
kfox1111so the container has to be there and ready even to write out the script, and its calling the script ok, so its got to be running.20:54
sdakehow is /sys accessed?20:54
kfox1111by rbd.20:54
sdakewhich file in partcilar20:55
*** lamt has joined #openstack-kolla20:55
sdakedid you try docker exec --privileged?20:55
kfox1111that and -u 0 just to be extra sure.20:56
sdakeok well permissions may be not working sa expected20:56
kfox1111I tried with a docker run -i --rm -v /sys:/sys -v /dev:/dev <image> rbd ... too. same issue.20:56
sdakecan you do that with a df?20:57
sdakewould love to see how the filesystems are mounted20:57
kfox1111yeah. let me add that.20:57
*** huikang has joined #openstack-kolla20:57
sdakekfox1111 and mnttab\20:58
sdakeiirc its in /etc dir20:58
sdakejust cat it :)20:59
kfox1111k20:59
sdakei dont have accesss to my env atm or i'd tell yu for sure wehre it is20:59
kfox1111no mnttab...20:59
kfox1111 /proc/mounts?21:00
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104121:01
sdakekfox1111 that file would rock thanks21:02
sdakekfox1111 sorry was stuffing some food in face21:02
sdakeneed to eat more often21:03
sdakei've lost 15 pounds in the last 6 months21:03
sdakehappens a bit in the summer here21:03
sdake5-8 pounds = ok21:03
sdakemore then that not good - need more minerals :)21:03
kfox1111np.21:10
kfox1111heh. I must be a geek when starcraft comes to mind when you say that. :)21:11
*** ayoung has quit IRC21:12
rhalliseykfox1111, http://logs.openstack.org/21/382021/10/experimental/gate-kolla-kubernetes-ansible-workflow-ceph-nv/6f541a9/console.html21:12
rhalliseyprogressing a bit21:12
rhalliseywow21:12
rhalliseymariadb made it through on it's last try21:13
rhalliseyO.o21:13
rhalliseybbiab21:13
*** rhallisey has quit IRC21:13
*** fguillot has quit IRC21:15
kfox1111sdake: http://logs.openstack.org/41/381041/47/experimental/gate-kolla-kubernetes-deploy-centos-binary-ceph-nv/7a4e15a/logs/pods/kolla-ceph-rbd-qf7b6-main.txt21:16
sdakekfox1111 sysfs is read/write21:17
sdakekfox1111 was that via docker exec?21:17
kfox1111http://logs.openstack.org/41/381041/47/experimental/gate-kolla-kubernetes-deploy-centos-binary-ceph-nv/7a4e15a/logs/pods/kolla-mariadb-bootstrap-zntqq.txt21:17
kfox1111maybe its a false error... there are a bit of other error messages there... maybe I should just dump the whole thing to a log file...21:18
*** schwicht has joined #openstack-kolla21:18
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104121:20
inc0ehh k8s doesn't have superb logs :/21:22
*** haplo37_ has quit IRC21:24
*** tonanhngo has joined #openstack-kolla21:24
kfox1111they kind of assume you buidl out a logging infrastructure of your choice.21:25
kfox1111as that seems to be a deeply personal thing. :/21:26
*** haplo37_ has joined #openstack-kolla21:26
*** tonanhngo has quit IRC21:26
*** tonanhngo has joined #openstack-kolla21:26
kfox1111it is one of their unwritten rules, but seems to be followed, that they don't want to get into the middle of religious wars.21:26
kfox1111they do one thing, container orchestration and leave the rest to be built on top.21:27
kfox1111sdake: thats kind of weird....21:35
kfox1111http://logs.openstack.org/41/381041/48/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/7995d17/logs/rbd.log21:35
kfox1111the commands kubelets issueing.21:35
kfox1111I wouldn't have expected it to write the temp keyring out to etc...21:35
kfox1111I'll have to mount that in. maybe thats the problem.21:35
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104121:36
kfox1111sdake: know why this wont merge? https://review.openstack.org/#/c/380868/21:43
kfox1111sbezverk_: alive?21:43
sdakekfox1111 lacking a workflow?21:44
kfox1111sdake: I'm thinking maybe it unworkflowed it when the experimental results came back?21:44
kfox1111the last action on it was a workflow though.21:44
kfox1111haven't seen a ps in this state before.21:45
kfox1111should we juts reworkflow it?21:45
sdakekfox1111 i can merge it without reviewing it if you like21:46
sdakesince its already been erviewed by others21:46
*** b_bezak has quit IRC21:47
sdakedont have time right at this moment to review it - but trust your udgement  and if its broken we can fix it later ;)21:47
kfox1111k. I think it just needs the workflow put back, as the last action in the list was a workflow.21:47
kfox1111thanks.21:47
kfox1111once that patch, and the ceph one I've got going, we should have a pretty close to solid gate I think.21:47
kfox1111(don't want to jinks it by saying solid. :)21:48
*** b_bezak_ has joined #openstack-kolla21:48
openstackgerritMerged openstack/kolla-kubernetes: Split endpoint jobs and start testing the deployments  https://review.openstack.org/38086821:51
kfox1111OH... net=host.... arg21:51
*** jheroux has quit IRC21:52
kfox1111shouldn't matter, but maybe...21:52
*** b_bezak_ has quit IRC21:53
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104121:59
*** lamt has quit IRC21:59
*** huikang has quit IRC22:00
*** huikang has joined #openstack-kolla22:01
SamYaplekfox1111: ceph is going to require net=host btw22:04
kfox1111yup.22:04
*** lamt has joined #openstack-kolla22:04
SamYapleits a hard requirement if you have multiple osds on the same host22:04
kfox1111and pid namespace too.22:04
kfox1111that one's a really subtile bug we ran into here.22:04
kfox1111otherwise containers kill each others connections.22:04
kfox1111if yo udon't have much traffic you might not notice it.22:04
SamYaplewell they mark each other down22:04
kfox1111yeah.22:05
SamYapleafaik i was the first one to get ceph running in docker with ipv4 where each osd was there own container22:05
SamYaplei filed a bug about this, but ceph closed it as a docker bug22:05
SamYapleits not22:05
kfox1111nice. :)22:05
*** HyperJohnGraham has joined #openstack-kolla22:05
kfox1111yeah. I'd say not too.22:05
SamYaplethe ceph osds know they are on the same host and do healthchecks against pids22:05
kfox1111right.22:06
*** huikang has quit IRC22:06
kfox1111the clients have the same issue we've found.22:06
SamYapleif you did ipv6 this wouldn't be an issue fyi22:06
kfox1111the client connects, has the same pid as in another container, and the osd's kill all the connectsions, as a new client came in.22:06
kfox1111then the two containers fight with each other over.22:06
kfox1111:)22:06
kfox1111ipv6 is the answer to a lot of problems.22:07
SamYaplekfox1111: related http://tracker.ceph.com/issues/10763 https://github.com/ceph/ceph-docker/issues/1922:07
kfox1111its also still way to far out. :/22:07
* kfox1111 nods22:07
*** HyperJohnGraham has quit IRC22:11
*** HyperJohnGraham has joined #openstack-kolla22:14
kfox1111wha...22:16
kfox1111oh... $@ includes $0?22:16
kfox1111that may be the problem....22:16
bmacefor bug https://bugs.launchpad.net/kolla/+bug/1629024  feel free to nuke it.  i'm going to abandon my change in this area and handle things in a different way.22:21
openstackLaunchpad bug 1629024 in kolla "Destroy needs to have the option to be slightly less destructive" [Undecided,In progress] - Assigned to Borne Mace (borne-mace)22:21
kfox1111hmm... no. seems to not. everything's passed through docker exec proper.22:22
kfox1111 bash test.sh foo bar baz "ark barm" baz22:23
kfox1111foo|bar|baz|ark barm|baz22:23
sdakebmace ok let me have a look22:24
sdakebmace oh right this work22:24
sdakebmace so - curious how you are going to address it22:25
sdakebmace but i guess we can find that out later22:25
bmacesome of the changes i am just going to make internally.  especially the nuke of the files under /etc/kolla.  for some i am also going to probably add a simple stop playbook.22:25
sdakebmace yup makes sense22:26
bmacei appreciate the feeling that the containers are just cattle so the systems can be wiped aggressively, but for our environment we have the need at times to not nuke everything under /etc/kolla for example, since our kollacli config is under there.22:26
sdakebmace oh ya that makes sense22:26
sdakebmace our destroy atm is not ideal22:27
sdakei wish it destroyed only things relevant to kolla22:27
sdakenot other htings the operator adds22:27
bmacewell, in most cases /etc/kolla should be pretty safe.. we just happen to put some of our own stuff in there :(22:27
sdakean implementation of such a thing is difficult22:27
sdakebmace your not alone22:27
*** Pavo has quit IRC22:27
sdakewe have reverse filters for certain items22:27
sdakemay be able to make use of that22:27
*** Pavo has joined #openstack-kolla22:28
sdakethat would probably work with less work then maintaining some variance internally :)22:28
sdakee.g. globals.yml and passwords.yml and config are filtered from removal22:29
sdakecould probably be configurable22:29
bmaceyup.  for our internal stuff i can probably just add kollacli into that list.22:29
sdakeyup22:29
bmacetrue, but configurable how? feels odd to pass in as an extra arg or env_var22:29
sdakethe stop playbook22:29
kfox1111oh... keyring's the default.... maybe its not fetching the secret right....22:29
sdakewhat is that for22:29
sdakebmace ya - i know configuration how here is hard22:30
sdakebmace there is a kolla-build.conf22:30
sdakeor globals.yml would work22:30
sdakebut its kind of clunky22:30
bmacesdake mostly i think small environment / AIO type stuff on stop.. you may want to bring down your services but not actually need to nuke all your containers.  also if you want to mess with some container contents, whatever..22:31
sdakebmace that sounds good for master to me22:31
bmacealso, internally i think we will still support not deleting the images as an option to the destroy playbooks, because not everyone is on a great internet connection or has local caches registries22:32
sdake(the part that sounds good for master is a stop action)22:32
bmacesdale just sort of simple stuff that maybe not everyone wants but some people do22:32
bmacesdake got it22:32
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104122:34
sdakebmace i thin kthe not deleting containers as an option would make a ifne config value22:36
*** Satya_ has quit IRC22:38
*** awiddersheim has joined #openstack-kolla22:39
kfox1111yeah... the code path changes if its a secret or a keyfile. and the command line indicates its using a keyfile.22:39
kfox1111thats the issue.22:39
sdakebmace no real reason to do these thigns internally unless externally just doesn't want it :)22:41
sdakebmace i like the stop idea - i have wanted this for a long time22:41
sdakebmace but been too busy to do it on my own :)22:41
bmacesdake happy to toss it upstream when it is ready, which shouldn't take very long and agreed that we like to keep as little internal as possible.  better for everyone.22:42
sdakebmace sweet - now with that said master is blocked for about 1 more week :)22:43
sdakeinc0 ping22:43
bmacesdake yeah, no sweat, i don't mind it lingering up there and feedback is great and if there is some stuff you guys don't want that is fine too :)22:44
*** lamt has quit IRC22:44
kfox1111oh... wait.. wat is storage_ceph.key for.... hmm...22:45
sdakebmace you guys = us guys :and gals ;)22:45
bmacesdake sure.. you folks :)22:45
sdakebmace just because you left the drivers team doesns't mean your not one of us :)22:46
sdakethere is no escape ;-)22:46
bmacesdake lol, fair enough.  now i'm a back seat driver ;)22:46
kfox1111there's the bug I think....22:46
sdakebmace me too, me too22:46
sdakebmace hope to get out of that shortly :)22:46
openstackgerritQin Wang (qwang) proposed openstack/kolla-kubernetes: Add ansible workflow for Horizon  https://review.openstack.org/38262022:47
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104122:47
sdakekfox1111 sbezverk_ can you have a look at that workflow patch above22:48
*** Pavo has quit IRC22:49
sdakekfox1111 sbezverk_ is that the direction we are headed re workflows?22:49
*** vhosakot has quit IRC22:50
kfox1111I think so, at least until kolla-ansible is moved out. then I think the plan is to store the kolla-kubernetes ansible workflow with the other ansible stuff in kolla-ansible.22:50
sdakeintereseting22:51
sdakecool thanks22:51
sdakei'd review that patch but don't want to break anything :)22:51
*** Pavo has joined #openstack-kolla22:51
kfox1111there isn't anything to break yet. all the ansible stuff is under review right now. nothing has merged yet.22:52
sdakegot it22:52
sdakei'd rather you and sbezverk_ review it22:52
kfox1111the ansible stuff is like, a couple days old so far. :)22:52
sdakemake sure it suits your tastes22:52
kfox1111k. ryan's doing all the work so far, so he's probably most suited to it.22:52
kfox1111it looks inline with what he's been doing though.22:53
sdakei think as far as where hte workflow goes, probably not kolla-ansible22:53
sdakeprobably a new repo22:53
kfox1111it doens't really belong in kolla-kubernetes either.22:53
kfox1111yeah. I suggested a kolla-kubernetes-ansible but inc0's didn't like the idea.22:54
sdakewe are going to be spending a lot of time on this repo split in ocata :(22:54
kfox1111yeah.22:54
kfox1111but  Ithink it will be very good for kolla's future.22:54
sdakeagree22:54
sdakenow is the time to do it22:54
kfox1111I'd really rather start the kolla-kubernetes ansible stuff out of repo.22:55
kfox1111for similar reasons.22:55
kfox1111keeping it in, then moving it out has all the same issues keeping ansible in the main kolla repo has had I think.22:55
kfox1111I'd rather just start seperate.22:55
sdakecome up with a name people can agree on22:55
sdakeand i'll get you a fresh repo to start with22:56
kfox1111so, there's two ways to go I think.22:56
kfox1111kolla-kubernetes is the repo name, and kollakube is the cli name.22:56
kfox1111I'd kind of like for the ansible workflwo cli name to be kollakube-ansible to match up with kolla-ansible if they are kept seperate.22:56
kfox1111so the repo name could be ither kolla-kubernetes-ansible or if thats too long, kollakube-ansible22:57
sdakeinstead of calling it kolla-kubernetes should have called it kollakube22:58
sdakethat would have made that choice alot easier22:58
sdakewhat you just proposed in a is too long and in b is inconsistent :(22:59
kfox1111yeah. hindsite and all. :)22:59
sdakeright22:59
sdakeif i could predict the future i'd be a billionaire ;)22:59
kfox1111or c, kolla-kube-ansible22:59
sdakethats good22:59
sdakebounce it off the ml for a vote22:59
sdakeany core can request a vote23:00
kfox1111inc0 already said he prefers sticking it in kolla-ansible... I don't want to step on the ptl's shoes.23:00
sdakeactuallly your not corei n kolla itself so i think yo ucan't propose a vote23:00
sbezverk_kfox1111: why do we need ansible in kube repo name? or I missed something?23:00
sdakewell i dont really want it in kolla-ansible ;)23:01
kfox1111should we discuss it more first to see how strongly he feels?23:01
*** ccesario has quit IRC23:01
sdakeyes needs more discussion over some indian food in barcelona23:01
kfox1111sbezverk_: the topic is, seperating the ansible code from the kolla-kubernetes repo. what name would the ansible repo have.23:01
sbezverk_kfox1111: frankly I do not see reason to mention ansible in the name of kubernetes repo, it will send wrong message23:01
sdakebut that means it blocks workflow dev23:01
*** eaguilar has joined #openstack-kolla23:01
kfox1111sdake: the issue is, the code's starting to land soon. its all in review now, so could be pushed off to another repo. but how long to we keep the stuff in review?23:02
kfox1111yeah.23:02
sdakekfox1111 right - either need decision now23:02
sbezverk_kfox1111: got it, it is not about naming the main kube repo23:02
sdakeor punt and make a new repo later23:02
sdakesbezverk_ we should have named that differently I think ;-)23:02
sbezverk_sdake: I really hoe we will..23:03
kfox1111sbezverk_: yeah.23:03
sdakeanyway i guess we need to circle aorund with inc023:03
kfox1111+123:03
kfox1111oh... goodie...23:03
sdakesbezverk_ hope we will what23:03
kfox1111it looks like the ceph backed mariadb finally cleared! :)23:03
sdakesweet23:03
sbezverk_sdake: name that differently23:03
sdakesbezverk_ you mean kolla-kubernetes?23:03
sdakesbezverk_ it is basically impossible to do a rename i think23:04
sbezverk_sdake: no I am all for kolla-kubernetes name :-) by mistake I thought you want to rename it into kolla-kube-ansible23:04
*** diogogmt has quit IRC23:05
sdakeno kollakube woudl have been better23:05
sdakeso we could call the ansible workflow kollakube-ansible23:05
kfox1111repos and commands are two different things. the cli's much more imporatnt I think.23:06
kfox1111as users type it in every day.23:06
kfox1111devs don't clone as often.23:07
kfox1111(usually)23:07
sdakewhat i mean about the repo rename23:07
sdakeis infra i dont think does that anymore23:07
sdakewe could create a new mirror repo with "upstream"23:07
*** salv-orl_ has quit IRC23:07
sdakebut woudl have to retire the old name23:07
kfox1111soo close now: http://logs.openstack.org/41/381041/52/experimental/gate-kolla-kubernetes-deploy-ubuntu-binary-ceph-nv/62580c4/console.html23:07
kfox1111I think that was because I didn't create the glance volume in rbd... now to add that back....23:08
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104123:08
kfox1111sdake: finally got that rbd trick to work. so now you don't have to install any ceph stuff on the hosts. :)23:10
kfox1111it uses a daemonset for the rbd binary. :)23:10
*** ccesario has joined #openstack-kolla23:13
kfox1111sdake: I guess it wouldn't hurt to ask infra if it was still possible.23:20
kfox1111worst they could say is no.23:21
kfox1111I guess the ohter thing we could do....23:21
kfox1111is make a new repo, import the existing one into it,23:21
sdakewell if we decide on that option - lets decide and then ask :)23:21
sdakekfox1111 we can do that one easily23:21
kfox1111and add a final commit to the old one delting all the stuff and leaving a pointer.23:21
sdakebut it results in a repo that has to be deprecated23:21
kfox1111yeah.23:22
kfox1111so whats infra want more in that case, to manage a depericated repo, or to rename one.23:22
*** lamt has joined #openstack-kolla23:25
sdakekfox1111 let me ask23:25
kfox1111kk23:25
sdakekfox1111 join #openstack-infra23:26
sdakejust asked23:26
kfox1111k23:26
*** logan- has quit IRC23:28
sdakekfox1111 - clarkb said renames are possible23:28
kfox1111:)23:28
sdakeclarkb further indicated they do them about once a month23:28
sdakeit has to be scheduled23:28
sdakethey prefer that to what you proposed (a new proejct and deprecate old repo)23:29
kfox1111k. so lets decide on if we're doing anything, and what, and then get it scheduled if we are.23:29
sdakewould probbly make agood mailing list discussion ;)23:29
sdakevs 2 guys kibitzing in an irc channel :)23:29
sdakekolla23:30
sdakekollakube23:30
sdakekolla-ansible23:30
sdakekollakube-ansible23:30
sdakeseems reasonble to me23:30
kfox1111+123:30
sdakeconsistent and short23:31
kfox1111nice.... vm lauched! :)23:31
sbezverk_sdake kfox1111: one thing that kills me in all virtualization networking is difficulty and sometimes impossibility to test the complete path packet takes:-(23:31
kfox1111sbezverk_: yeah. it can be quite twisty. :)23:32
sbezverk_like the problem I am facing now, it freaking crazy23:32
kfox1111debugging neutron dvr was a lot of fun until I figured it out. :)23:32
*** HyperJohnGraham has quit IRC23:32
kfox1111sbezverk_: still fighting the canal thing?23:32
sbezverk_kfox1111: yep and it is getting more wierd23:33
sdakekfox1111 that aha moment23:33
sdakekfox1111 then it passes23:33
sdakekfox1111 i like that too :)23:33
kfox1111sdake: have a look: http://logs.openstack.org/41/381041/53/experimental/gate-kolla-kubernetes-deploy-centos-binary-ceph-nv/3a1637c/console.html23:33
kfox1111all thats left is testing cinder volume attachment and floating ips... just a few more bugs to work out. :)23:34
sdakekfox1111 only 17 min to run23:34
kfox1111yup. :)23:35
sdakekfox1111 looking good :)23:35
kfox1111I might be able to shave a bit more back off.23:35
kfox1111sbezverk_: whats it doing?23:35
sdakekfox1111 bindeps.txt will shave 1-2 minutes23:35
kfox1111already did bindeps. :/23:36
sbezverk_kfox1111: from host where pod is running I can ping container by ip addressed provided to it by flannel, but from the container to the local host or anything beyon does not work..23:36
kfox1111sbezverk_: hmm... that really feels like a firewall issue...23:37
*** logan- has joined #openstack-kolla23:37
sbezverk_kfox1111: or calico policy23:37
kfox1111yeah. but in theory, there shouldn't be any of those.23:37
sbezverk_also firewall usually protect inbound traffic so I would expect in reverese behavior23:38
kfox1111I'm deploying canal in the gate, and haven't run into that yet.23:38
kfox1111sbezverk_: yeah, but, indbound and outbound get reversed at odd types in virtualization.23:38
sbezverk_kfox1111: well as I said it is working perfectly fine in my cluster23:38
kfox1111the host firewall might consider the container poking twards it, incoming.23:38
kfox1111have you tried two containers on the same host pinging eachother?23:39
sbezverk_so it is somehitng special to John setup and it escapes me :-(23:39
kfox1111as that would be a forward rather then an input.23:39
kfox1111that might rule out a calico policy.23:39
kfox1111OH.. I know why cinder isn't working...23:40
kfox1111there are 2 sets of endpoints for it...23:41
kfox1111one for v1 and one for v2...23:41
sdakekfox1111 ca nyou start a discussion on the ml re the irc discussion we just had23:41
kfox1111I can probably start it tomrorow. if you can't get to it sooner. I have to go in a couple minutes.23:41
sbezverk_kfox1111: yeah, end points reg job was registering both v1 and v223:43
sdakekfox1111 its your idea not mine23:44
sdakekfox1111 it is viable and i'd support it if it makes sense23:44
sdakekfox1111 rely on the cores to make that decision - rather then operate in a vacuum23:44
sdakerather I rely23:44
sdakerename before summit not possible23:45
sdakerename after possible23:45
sdakemight as well start with correct name for workflow engine23:45
sdakeor rather the workflow bits23:45
kfox1111hmm... yup. the cinder entries have 2 versions...23:45
sdakekfox1111 that is normal - v2 and v323:45
sdakeiirc23:45
sdakeor v1/v223:45
kfox1111k. I'll add them back.23:46
sdakedc with sbezverk_23:46
sbezverk_yeah v1/v223:46
sdakei'm pretty sure they are there for a reason :)23:46
sdakeok gotta jet to folks house23:46
sdakebbl23:47
kfox1111yup. the opentsack cinder client fails. its not autofailing back to v123:47
*** david-lyle has quit IRC23:47
kfox1111k. have a good one.23:47
sbezverk_take care23:47
kfox1111looks like its the same with just v2 suffixed.23:47
*** sdake has quit IRC23:48
openstackgerritKevin Fox proposed openstack/kolla-kubernetes: WIP: External Ceph Tools and Docs  https://review.openstack.org/38104123:49
kfox1111heading out. bbl23:50
sbezverk_kfox1111: I probably mentioned it to you, if not sorry I have not done it earlier, please check this bug I filed for cinder I hit on kube. https://bugs.launchpad.net/cinder/+bug/161970123:51
openstackLaunchpad bug 1619701 in Cinder newton "in k8s environment vgs return extra line in output" [Medium,Fix released] - Assigned to Gorka Eguileor (gorka)23:51
sbezverk_kfox1111: the fix they released does not fix it :-(23:52
sbezverk_kfox1111: hopefully it does not impact ceph..23:59

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!