Tuesday, 2017-10-17

*** salv-orlando has joined #openstack-kuryr00:17
*** salv-orlando has quit IRC00:22
*** yamamoto has joined #openstack-kuryr00:24
*** yamamoto has quit IRC00:29
*** aojea has joined #openstack-kuryr00:47
*** limao has joined #openstack-kuryr00:48
*** aojea has quit IRC00:52
*** caowei has joined #openstack-kuryr00:53
*** salv-orlando has joined #openstack-kuryr01:18
*** kiennt26 has joined #openstack-kuryr01:21
*** salv-orlando has quit IRC01:22
*** yamamoto has joined #openstack-kuryr01:24
*** aojea has joined #openstack-kuryr01:48
*** hongbin has joined #openstack-kuryr01:49
*** aojea has quit IRC01:52
*** wangbo has joined #openstack-kuryr02:01
*** hongbin_ has joined #openstack-kuryr02:04
*** salv-orlando has joined #openstack-kuryr02:18
*** salv-orlando has quit IRC02:23
*** aojea has joined #openstack-kuryr02:49
*** hongbin_ has quit IRC02:51
*** aojea has quit IRC02:53
*** wangbo has quit IRC03:11
*** wangbo has joined #openstack-kuryr03:18
*** salv-orlando has joined #openstack-kuryr03:19
*** vikasc has quit IRC03:23
*** salv-orlando has quit IRC03:24
*** limao has quit IRC03:36
*** vikasc has joined #openstack-kuryr03:36
*** hongbin has quit IRC03:37
*** kiennt26 has quit IRC03:45
*** kiennt26 has joined #openstack-kuryr03:46
*** limao has joined #openstack-kuryr03:46
*** limao has quit IRC03:46
*** limao has joined #openstack-kuryr03:47
*** aojea has joined #openstack-kuryr03:49
*** limao has quit IRC03:51
*** aojea has quit IRC03:54
*** gouthamr has quit IRC03:59
*** salv-orlando has joined #openstack-kuryr04:20
*** salv-orlando has quit IRC04:25
*** kiennt26 has quit IRC04:41
*** wangbo has quit IRC04:42
*** vikasc has quit IRC04:45
*** vikasc has joined #openstack-kuryr04:48
*** aojea has joined #openstack-kuryr04:50
*** aojea has quit IRC04:55
*** yboaron has joined #openstack-kuryr05:01
*** limao has joined #openstack-kuryr05:07
*** salv-orlando has joined #openstack-kuryr05:21
*** salv-orlando has quit IRC05:25
*** wangbo has joined #openstack-kuryr05:33
*** salv-orlando has joined #openstack-kuryr05:34
*** wangbo has quit IRC05:34
*** garyloug has quit IRC05:38
*** janki has joined #openstack-kuryr05:40
*** wangbo has joined #openstack-kuryr05:45
*** aojea has joined #openstack-kuryr05:51
*** aojea has quit IRC05:56
*** janki has quit IRC05:57
*** janki has joined #openstack-kuryr05:57
*** kiennt26 has joined #openstack-kuryr06:04
openstackgerritLuis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Fix ref to ports pool at nested-vlan documentation  https://review.openstack.org/51249506:25
openstackgerritLuis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Fix ports pool documentation  https://review.openstack.org/51249506:35
*** aojea has joined #openstack-kuryr06:38
*** pcaruana has joined #openstack-kuryr06:44
*** aojea has quit IRC06:47
*** wangbo has quit IRC06:54
*** wangbo has joined #openstack-kuryr06:57
*** yboaron has quit IRC07:02
*** kiennt26 has quit IRC07:12
*** kiennt26 has joined #openstack-kuryr07:13
*** vikasc has quit IRC07:31
*** danil has joined #openstack-kuryr07:34
*** aojea has joined #openstack-kuryr07:44
*** vikasc has joined #openstack-kuryr07:44
*** egonzalez has joined #openstack-kuryr07:45
*** aojea has quit IRC07:48
*** wangbo has quit IRC07:51
*** karimb has joined #openstack-kuryr07:51
*** karimb has quit IRC07:52
*** wangbo has joined #openstack-kuryr07:52
*** wangbo has quit IRC07:56
*** wangbo has joined #openstack-kuryr08:02
*** salv-orlando has quit IRC08:05
*** salv-orlando has joined #openstack-kuryr08:05
*** wangbo has quit IRC08:07
*** yboaron has joined #openstack-kuryr08:08
*** wangbo has joined #openstack-kuryr08:10
*** wangbo has quit IRC08:10
*** wangbo has joined #openstack-kuryr08:13
*** karimb has joined #openstack-kuryr08:19
*** phuoc_ has quit IRC08:24
*** wangbo has quit IRC08:34
irenabdulek, hi08:35
irenabI was trying https://review.openstack.org/#/c/480028/, pods stuck at creating status08:36
irenabeventually one pod is running, second is reporting error08:39
*** wangbo has joined #openstack-kuryr08:40
*** garyloug has joined #openstack-kuryr08:43
*** aojea has joined #openstack-kuryr08:44
irenabdulek, janonymous another issue I see is when I deleted pod and then new pod was started, daemon gets netlink error on binding commit08:47
*** yamamoto has quit IRC08:47
irenabNetlinkError (17, 'File exists')08:47
*** garyloug has quit IRC08:48
*** aojea has quit IRC08:49
dulekirenab: I'll try to reproduce it.08:50
irenabdulek, I tried apuimedo's demo pod. Created one and then scaled to 2 instances08:50
irenabwithiut waiting for the first to become running08:51
dulekirenab: Okay.08:51
janonymousirenab: i thought that was unrelated to that08:52
*** garyloug has joined #openstack-kuryr08:52
irenabjanonymous, I see exception in the kuryr-daemon08:52
irenabsame worked for me with non-daemon CNI08:53
janonymousirenab: ohh..sometime back there was a patch related to this change, i dont recall exactly i will check08:54
janonymousirenab: that's why i mentioned it in commit msg of https://review.openstack.org/#/c/480028/ about netlink error .. strange08:55
*** yamamoto has joined #openstack-kuryr08:55
*** yamamoto has quit IRC08:55
irenabjanonymous, it didn't fix itself. After seversl times I removed pod that failed, I got one running08:58
janonymousirenab: oh..09:02
janonymousdulek: another way might be to use --replias=2/more with `kubectl run <image>` command , i will be checking too09:04
irenabjanonymous, exactly what I did09:06
janonymous:)09:06
janonymousdulek: exact command `kubectl run hello-node --image=gcr.io/google-samples/hello-app:1.0 --port=8080`09:09
janonymous--replicas =209:09
dulekjanonymous: I'm stacking a fresh env to take a look.09:09
dulekirenab, janonymous: Okay, reproduced, although I'm getting "KeyError: u'tap77ccc560-59'"09:14
*** yamamoto has joined #openstack-kuryr09:15
*** karimb has quit IRC09:28
*** jchhatbar has joined #openstack-kuryr09:34
*** janki has quit IRC09:37
*** aojea has joined #openstack-kuryr09:45
dulekirenab, janonymous: Oh, I think I know what's wrong… The fix is on patch that adds logging. Let me move it to the base patch while fixing irenab's comments.09:47
*** aojea has quit IRC09:50
*** salv-orlando has quit IRC09:50
*** salv-orlando has joined #openstack-kuryr09:50
dulekOh, it's more complicated than I thought. Back to the code…09:54
*** karimb has joined #openstack-kuryr09:55
*** salv-orlando has quit IRC09:55
*** c00281451 has joined #openstack-kuryr09:57
dulekapuimedo: Hi, are you able to tell what https://github.com/openstack/kuryr-kubernetes/blob/6d9e564251853885ba54868fefb09f6741de96dc/kuryr_kubernetes/cni/binding/bridge.py#L35-L37 is doing?10:15
janonymousdulek: is it the sys. logger which i used earlier?10:15
dulekjanonymous: No, it shouldn't be related to logger.10:15
janonymousok10:16
dulekjanonymous: I've briefly thought it's fault of not doing config.init_config(), but now I doubt it.10:16
dulekDo containers on the same pod go into a single netns?10:18
*** yamamoto has quit IRC10:18
*** yamamoto has joined #openstack-kuryr10:25
*** wangbo has quit IRC10:28
*** yamamoto has quit IRC10:30
*** c00281451 has quit IRC10:31
*** salv-orlando has joined #openstack-kuryr10:32
*** c00281451 has joined #openstack-kuryr10:32
*** limao has quit IRC10:32
*** openstackgerrit has quit IRC10:33
*** limao has joined #openstack-kuryr10:33
*** limao_ has joined #openstack-kuryr10:36
*** limao_ has quit IRC10:36
*** limao_ has joined #openstack-kuryr10:37
*** limao has quit IRC10:37
*** caowei has quit IRC10:39
janonymouscan you do these changes in your env : http://textuploader.com/d4uq610:40
janonymousand check using grep -inr "kuryr_kubernetes.controller.drivers.additional_subnets" in kuryr-kubernetes package10:41
janonymousif any , try to change to `kuryr_kubernetes.controller.drivers.additional_subnet` and restart controller, i think it should work10:41
janonymous@danil : ^^10:42
daniljanonymous, yeah, thanks, one min10:43
dulekltomasbo, irenab: Maybe you have an idea why we're setting netns of an interface to os.getpid() here:10:44
dulekltomasbo, irenab: https://github.com/openstack/kuryr-kubernetes/blob/6d9e564251853885ba54868fefb09f6741de96dc/kuryr_kubernetes/cni/binding/bridge.py#L35-L3710:44
*** aojea has joined #openstack-kuryr10:46
irenabdulek, I do not remember. ivc_ are you around?10:49
ltomasbodulek, I don't know10:50
*** aojea has quit IRC10:50
irenabdulek, does this method given netns or it is None?10:57
dulekirenab: According to traceback it's given netns.10:57
* dulek baked his env, needs to restack to continue testing…10:58
irenabso this is provided through params. Maybe this is the way kubelet gives the netns11:01
*** wangbo has joined #openstack-kuryr11:09
*** wangbo has quit IRC11:10
*** atoth has joined #openstack-kuryr11:14
*** wangbo has joined #openstack-kuryr11:14
*** yamamoto has joined #openstack-kuryr11:26
*** yamamoto has quit IRC11:32
*** c00281451 is now known as zengchen11:36
*** karimb has quit IRC11:38
*** yamamoto has joined #openstack-kuryr11:41
*** yamamoto_ has joined #openstack-kuryr11:42
*** yamamoto has quit IRC11:46
yboaronping irenab11:51
zengchenapuimedo & irenab: Sorry to interrupt you. Could you have time to review the patches of fuxi-k8s. The patches are ready for review for a long time. From my perspective, I hope the left patches could be merged. Thanks very much!11:54
zengchenhttps://review.openstack.org/#/q/project:openstack/fuxi-kubernetes+status:open11:54
*** salv-orlando has quit IRC11:54
*** yamamoto_ has quit IRC11:56
*** c00281451 has joined #openstack-kuryr12:03
*** zengchen has quit IRC12:06
*** c00281451 is now known as zengchen12:16
*** yamamoto has joined #openstack-kuryr12:19
apuimedozengchen: please, remember to add us as reviewers, as we usually mostly check the patches in which we are listed as reviewers12:21
apuimedoI'm sorry that I didn't notice or that they went out of my mind12:21
irenabzengchen, same appology from me. Will take a look asap12:25
irenabyboaron, hi12:25
yboaronHi, I checked the service access in case devstack env HA-PROXY12:25
irenabzengchen, can you please resolve merge conflict?12:26
irenabyboaron, any idea how it works with ref. implementation?12:26
yboaronit appears that for HA-PROXY The load balancer port will be assigned to the projects default security group12:26
irenabso why curl to FIP works?12:27
yboaronand in devstack default security group enable all IPV4 ingress traffic12:27
yboaronI assume that in your env - the default security doesnt allow all IP V4 right ?12:27
irenabyboaron, I do not see it in my environment . I see that VIP port has same SG as pods12:27
yboarontake a look at this one : https://github.com/kubernetes/kubernetes/issues/2974512:28
irenabyboaron, I just use kuryr-kubernetes local.conf and deploy devstack, this should not be different12:29
yboaronhte only difference is the ml2 plugin ?12:29
irenabml2 drvier12:30
irenabI thnk we see the same issue as reported in the link you posted12:30
apuimedoirenab: maybe this can interest you for DF https://github.com/alibaba/ApsaraCache12:30
irenabapuimedo, thanks. Will take a look.12:31
apuimedo;-)12:31
irenabyboaron, we assign pods' SG to LB port in kuryr-kubernetes12:32
zengchenapuimedo & irenab: thanks for your response. I will add you as the reviewer for each patches. thanks very much!12:33
yboaronright , in my devstack all IPV4 ingress are allowed at this SG , see https://pastebin.com/4qdJTLfi12:33
apuimedo;-)12:33
yboaronport_security_enabled is True for VIP port , and SG is the default one - right ?12:35
zengchenapuimedo:btw, will the vPTG be held this week?12:36
irenabyboaron, but ingress ipv4 is enabled only for ones with same SG12:36
irenabnot anyone12:36
irenabyboaron, and for your question, the answer is that port security enabled and SG is the one provided by the kuryr-kubernetes SG driver (the same as set to pod ports)12:39
irenabapuimedo, please verify that I am correct12:39
*** openstackgerrit has joined #openstack-kuryr12:39
openstackgerritDaniel Mellado proposed openstack/kuryr-tempest-plugin master: [WIP] Add scenario test manager  https://review.openstack.org/51089612:39
*** salv-orlando has joined #openstack-kuryr12:40
apuimedozengchen: I still did not hear from limao12:41
apuimedoand I wanted to keep the last two sessions together12:41
apuimedoI guess we can schedule for next week12:41
irenabapuimedo, dulek , janonymous : I think we must have scenario tests before having cni-deamon version as default choice for deployment12:41
apuimedoirenab: I can agree with that12:41
dulekirenab: Me too, I'm still trying to figure out why the heck is this failing.12:42
zengchenapuimedo:got. if you have a schedule, please send an email. thanks!12:42
apuimedozengchen: w\\12:42
apuimedo-12:42
apuimedozengchen: I will12:42
dulekapuimedo: Maybe you have an idea what's happening in https://github.com/openstack/kuryr-kubernetes/blob/6d9e564251853885ba54868fefb09f6741de96dc/kuryr_kubernetes/cni/binding/bridge.py#L35-L3712:42
dulekapuimedo: I've tried suppressing the exception I get there, but it looks like then pod gets no network access.12:43
apuimedoirenab: what should I verify the correctness on?12:44
irenabmy answer to yboaron regarding load balancer12:44
apuimedodulek: that's not good for you12:45
irenabyboaron, apuimedo I think we will have to add explicit SG crreation/addition for VIP port to enable service specific traffic12:45
apuimedo:-)12:45
dulekapuimedo: I've figured that already. :D12:45
apuimedodulek: does your daemon run with hostnetworking?12:46
dulekapuimedo: Yes, we're talking about non-containerized case now.12:46
apuimedooh12:46
dulekapuimedo: daemon is running with sudo.12:46
apuimedothen it should be okay12:46
dulekapuimedo: We've figured out that scaling deployment e.g. from 1 to 3 causes NetlinError: File already exists.12:47
duleks/NetlinErorr/NetlinkError12:47
apuimedodulek: very interesting indeed12:47
apuimedo:-)12:47
*** aojea has joined #openstack-kuryr12:47
apuimedomaybe that's related to the errors we saw in the scale lab :-)12:47
apuimedodulek: you just made me happy12:48
dulekapuimedo: I'm guessing that for normal CNI os.getpid() returns different value every time.12:48
dulekWell, not the reaction I've expected? :D12:48
dulekapuimedo: And when running as daemon - we're getting same value of course.12:48
dulekSo my question is - why the heck os.getpid()?12:48
dulekWhy CNI binary process pid matters?12:49
apuimedodulek: did you try what happens if you comment that out?12:49
dulekapuimedo: Wrapping it in "except: pass" resulted in some containers getting IP but being unpingable.12:50
dulekapuimedo: I guess all of them will get unpingable, but let me try.12:51
apuimedodulek: that's what I'd expect as well12:51
*** aojea has quit IRC12:52
*** garyloug has quit IRC12:53
dulekapuimedo: KeyError: u'tap764255bb-40' - look like next lines start to fail.12:54
apuimedowell, of course12:54
*** yamamoto has quit IRC12:54
dulekapuimedo: Which starts to make sense - the lines I've commented out move iface into our namespace.12:54
apuimedolook closely at line 3612:54
dulekSo we can modify it.12:54
apuimedoit is creating the veth in the container netns12:55
apuimedoline 37 moves the host side to the host networking12:55
apuimedo(putting the netns of the current pid)12:55
*** rwallner has joined #openstack-kuryr12:55
apuimedoif you comment that out12:55
apuimedoline 3912:55
apuimedowhich uses the host ipdb12:55
apuimedowon't find the host side veth12:55
apuimedosince it was not moved there12:56
apuimedodulek: did I explain it well?12:56
dulekapuimedo: Yup!12:56
dulekapuimedo: So… Why do we get error when moving more than one iface?12:57
apuimedohowever, the reason why itgets the problem of already existing file...12:57
apuimedolet me check the pyroute2 code12:57
irenabdulek, apuimedo can you add docstring in the code to help kuryr followers12:58
dulekirenab: Where exactly you mean?12:58
apuimedoirenab: where?12:58
irenabon the binding code12:59
irenabonce it works12:59
apuimedoirenab: I'll quote G&G: "The code is the documentation" xD12:59
irenabto solve the mystery of moving netns/pids/...12:59
apuimedono, now seriously, we should put a proper docstring12:59
dulekirenab: Okay, I can add comment, that will help me learn what happens in there. :)12:59
apuimedo:-)12:59
irenab:-)12:59
*** yboaron_ has joined #openstack-kuryr13:01
apuimedodulek: are you sure that the host side veth name didn't collide?13:02
dulekapuimedo: I'll need to check, but then same should happen for non-daemonized CNI plugin.13:02
dulekAnd it isn't.13:03
dulekapuimedo: Let's investigate after the meeting?13:03
apuimedook13:03
*** yboaron has quit IRC13:04
*** wangbo has quit IRC13:08
*** karimb has joined #openstack-kuryr13:13
*** garyloug has joined #openstack-kuryr13:29
*** gouthamr has joined #openstack-kuryr13:43
dulekapuimedo: Okay, so you want me to check if veth names are not colliding when doing scaling. Let's see…13:45
apuimedoin the mean time I check the kernel13:47
*** aojea has joined #openstack-kuryr13:48
dulekapuimedo: I've got tap86d5e73e-1c and tap7ddf14d1-96, second one fails. :(13:49
*** aojea has quit IRC13:53
apuimedookey dokey13:53
dulekHm?13:55
*** yamamoto has joined #openstack-kuryr13:55
yboaron_irenab, are u sure that kuryr sets the VIP port SG ? from source code , it seems that SG value not used in lbaas driver13:55
yboaron_https://github.com/openstack/kuryr-kubernetes/blob/master/kuryr_kubernetes/controller/drivers/lbaasv2.py#L5113:55
yboaron_irenab, maybe I'm missing something ...13:56
apuimedodulek: I meant...13:57
apuimedogood. So I continue checking the kernel side13:57
apuimedodulek: I'm checking get_net_ns_by_pid13:58
dulekapuimedo: Hm. This fails more on setting than getting. Want the traceback?13:58
apuimedodulek: no need13:58
apuimedothe get is used in do_setlink13:59
*** yboaron__ has joined #openstack-kuryr13:59
apuimedoin net/core/rtnetlink.c13:59
apuimedowhich is what pyroute2 uses13:59
apuimedolike when you do ip link set13:59
dulekMhm, okay.14:00
*** yboaron_ has quit IRC14:02
*** jchhatbar has quit IRC14:03
*** danil has quit IRC14:03
*** yamamoto has quit IRC14:05
*** limao_ has quit IRC14:05
apuimedodulek: you got an EEXIST right?14:05
dulekNetlinkError: (17, 'File exists')14:05
dulekapuimedo: Looks like it matches - EEXIST is 17.14:06
apuimedoI only see it in the dev_change_net_namespace14:08
apuimedodulek: and it is name related14:10
dulekapuimedo: So when it's fired?14:11
apuimedodulek: https://github.com/torvalds/linux/blob/master/net/core/dev.c#L8268-L827514:11
apuimedoI wonder what dev->name is in that moment14:12
apuimedohey14:13
apuimedoit's a race14:13
openstackgerritYossi Boaron proposed openstack/kuryr-kubernetes master: Closes-Bug: ####### 1714204 - Delete service/deployment causes exception  https://review.openstack.org/51263614:14
dulekapuimedo: Hm? We have 2 parallel requests, but with different names and namespaces.14:15
apuimedodulek: https://github.com/svinota/pyroute2/blob/master/pyroute2/ipdb/interfaces.py#L584-L60514:15
dulekapuimedo: Oooow.14:15
dulekapuimedo: But I *am* getting the exception, it's not suppressed.14:16
apuimedodulek: how consistently do you get it?14:17
dulekapuimedo: Okay, got to admin, it seems a bit random.14:18
duleks/admin/admit14:18
apuimedodulek: I ask because I'm curious is changing things around14:18
apuimedoand creating the veth pair in h_ipdb and then moving would behave better14:19
*** karimb has quit IRC14:21
dulekI can try that, although I don't think this tells us why the error happens.14:21
apuimedoyeah14:22
dulekI can pretty consistently hit it when scaling a deployment with +2. Doing one-by-one seem fine most of the time.14:22
dulekIt could be a race, but why there's a conflict.14:23
dulekIf name is different.14:23
apuimedoit is a race between the different c_ipdbs14:23
apuimedowhy... I don't know14:23
apuimedolet's ask svinota14:23
dulekActually…! A difference here is that those will get run in the same process.14:24
dulekWhile in case on non-daemonized CNI we're guaranteed to run in different processes.14:24
apuimedodulek: are you sure that ipdb doesn't spawn it's own process?14:24
* apuimedo does not remember14:24
dulekIs it that pyroute2 is non threadsafe?14:24
dulekI'm not sure, haven't checked.14:25
apuimedolet's think14:25
apuimedodulek: I suppose you saw https://github.com/svinota/pyroute2/issues/30614:27
dulekapuimedo: I've noticed, but ignored it since it's a different error.14:28
apuimedoyeha14:32
apuimedommm14:32
*** phuoc_ has joined #openstack-kuryr14:33
*** hongbin has joined #openstack-kuryr14:34
apuimedodulek: are we sure that we don't have a recreation of a pod?14:38
apuimedoi.e., when it fails to go running and it tries to create it again?14:38
apuimedocouldn't in that case there be the old host side veth with the same tapxxx name?14:38
dulekapuimedo: Hm, interesting idea, I sometimes see CNI timeouts and reruns.14:40
dulekapuimedo: It would be pretty cool if that's it, as it would solve 2 problems at once.14:40
apuimedothat's my worry14:40
dulekapuimedo: So scenario would be:14:41
apuimedobut why do we take so long time that we get to a rerun14:41
apuimedothat I don't know14:41
*** tonygunk has joined #openstack-kuryr14:41
dulekapuimedo: That would need to be related to file socket, the HTTPServer daemon is spawning and the way CNI is doing the request.14:41
dulek1. CNI sends the request.14:41
dulek2. CNI daemon gets requests, plugs vif, returns.14:42
dulek3. CNI doesn't get reply for some reason - so it retries.14:42
dulek4. CNI daemon gets request again and fails.14:43
dulekBTW - it shouldn't really fail, we should probably be idempotent.14:43
apuimedodulek: that's true14:44
apuimedodulek: can you do something fast14:44
apuimedoin case it fails, use h_ipdb to check if it exists and log it14:44
apuimedoif it does, then we can see on which namespace its pair is and move it to where it needs to be14:44
dulekapuimedo: Okay, let me see if I can code it.14:45
apuimedodulek: forget about the moving part14:46
apuimedofor now the logging of the h_ipdb.interfaces[host_ifname]14:46
apuimedois sufficient14:47
dulekapuimedo: Just logging?14:47
apuimedoyeah14:47
apuimedoto see if my hypothesis is correct14:47
*** yamamoto has joined #openstack-kuryr14:47
*** yamamoto has quit IRC14:47
dulekapuimedo: http://paste.openstack.org/show/623848/14:55
dulekapuimedo: 'link_netnsid': 2614:55
apuimedoso it did exist, eh?14:56
apuimedoso now it's about to check where the pair is14:58
apuimedos/to check/checking/14:58
dulekapuimedo: What do you mean by "pair" in this context?14:58
dulekBridged interfaces?14:58
apuimedothese are veth pairs14:58
*** janki has joined #openstack-kuryr14:58
apuimedoit means it is a pair of linux virtual device14:59
apuimedoone end on the host side14:59
apuimedoone on the container side14:59
apuimedoso it's about looking for the container side of this already existing host side14:59
apuimedodevice14:59
dulekUhm.15:00
*** kiennt26_ has joined #openstack-kuryr15:03
apuimedodulek: ?15:04
dulekapuimedo: I'm trying to figure it out. ;) I guess I shouldn't kill the pod after getting the error.15:04
apuimedoprobably not :-)15:08
*** egonzalez has quit IRC15:14
dulekapuimedo: http://paste.openstack.org/show/623852/15:20
* apuimedo reconnecting15:21
dulekapuimedo: Not sure if that's what I should look for…15:21
openstackgerritYossi Boaron proposed openstack/kuryr-kubernetes master: Eliminate wrong ERROR report (in kuryr log file) when service of type LoadBalancer type is deleted  https://review.openstack.org/51267015:26
*** yboaron__ has quit IRC15:31
apuimedodulek sudo nsenter -t 2799 -n ip -o -d link show15:40
dulekapuimedo: http://paste.openstack.org/show/623857/15:41
apuimedodulek: which is this namespace?15:43
apuimedothe one of the failed pod?15:43
apuimedowhat about in the host namespace?15:43
apuimedoIs the tapf7a... device tehre?15:43
apuimedowhich is its index?15:43
apuimedoand @15:43
dulekapuimedo: "CNI_NETNS": "/proc/2799/ns/net" - that's failed CNI request.15:44
dulekI have the tap interface on the host.15:44
dulekapuimedo: 423: tapf7afbf1c-ba@if315:44
apuimedoif3...15:44
apuimedoI don't suppose there is an if3 in the host namespace, is there?15:45
dulekapuimedo: Nope. But all the tap interfaces have @if3.15:45
dulekEven those that work correctly.15:45
apuimedodulek: yeah... I think it just means "no fucking clue where the pair is"15:45
dulekapuimedo: Should it be paired with eth0?15:46
apuimedoof some container namespace15:46
apuimedohow many infra containers you have on `docker ps`?15:46
apuimedoyou have to try to find an eth0 inside a container15:47
apuimedowith an interface index number 424 (or 422 now I don't remember)15:47
dulekapuimedo: infra containers?15:47
*** yamamoto has joined #openstack-kuryr15:48
apuimedodulek: when you do `docker ps`15:48
apuimedofor each pod15:48
apuimedoyou have an infra container15:48
apuimedoand a container with the actual image that is being run15:49
dulekAh, okay. 4 containers.15:49
apuimedoI'm telling you to look at the infra containers only, so that you don't need to look 2x15:49
apuimedo:-)15:49
apuimedoso do15:49
apuimedodocker exec name_of_each_container ip link show15:49
*** aojea has joined #openstack-kuryr15:50
dulekapuimedo: http://paste.openstack.org/show/623859/15:51
dulekapuimedo: First one is from infra container.15:51
apuimedodulek: right15:52
apuimedois that pod running?15:52
apuimedois it terminating15:52
apuimedois it a troll pod?15:52
dulekThe one with CNI failure is in "cannot join network of a non running container", previously ContainerCreating.15:53
dulekTo catch the container namespace I needed to stop kubelet - otherwise kubelet retried constantly while changing it.15:54
apuimedodulek: let me rephrase15:54
*** aojea has quit IRC15:54
apuimedoif you find the pod for this infra container15:54
apuimedowhat does it say15:54
apuimedoin describe?15:54
apuimedothe 'cannot join'?15:54
*** gouthamr has quit IRC15:54
apuimedobut is it a pod that was created before teh one in which we now fail to move the device to the host namespace?15:55
*** yamamoto has quit IRC15:55
*** gouthamr_ has joined #openstack-kuryr15:55
dulekJust a second…15:57
*** limao has joined #openstack-kuryr15:57
*** rwallner has quit IRC15:58
dulekapuimedo: Okay, finally - cannot join.15:58
dulekapuimedo: Let's take a step back. This isn't very productive, as I'm having troubles following what we're doing…15:59
*** limao_ has joined #openstack-kuryr15:59
apuimedodulek: I've got an important meeting now16:00
*** rwallner_ has joined #openstack-kuryr16:00
apuimedoI'll try to answer as much as possible16:00
dulekapuimedo: Okay, what I'll try to first is to get rid of those timeouts on CNI.16:00
*** pcaruana has quit IRC16:01
dulekapuimedo: Maybe it'll kill 2 birds with one stone.16:01
*** rwallne__ has joined #openstack-kuryr16:02
*** limao has quit IRC16:02
*** rwallner_ has quit IRC16:04
*** rwallne__ has quit IRC16:04
*** rwallner has joined #openstack-kuryr16:05
openstackgerritHongbin Lu proposed openstack/kuryr master: Allow multiple binding drivers  https://review.openstack.org/50877816:07
apuimedo:-)16:07
*** kiennt26_ has quit IRC16:07
openstackgerritHongbin Lu proposed openstack/kuryr-libnetwork master: Support searching existing port with macaddress  https://review.openstack.org/50544316:14
*** salv-orl_ has joined #openstack-kuryr16:29
*** salv-orlando has quit IRC16:32
*** tonygunk has quit IRC16:40
*** jchhatbar has joined #openstack-kuryr16:44
*** jchhatbar has quit IRC16:45
*** jchhatbar has joined #openstack-kuryr16:45
*** rwallner has quit IRC16:46
*** janki has quit IRC16:47
*** aojea has joined #openstack-kuryr16:50
*** aojea has quit IRC16:55
*** jchhatbar has quit IRC16:56
*** garyloug has quit IRC16:57
*** karimb has joined #openstack-kuryr17:04
*** limao_ has quit IRC17:08
*** rwallner has joined #openstack-kuryr17:09
*** leyal has quit IRC17:17
*** leyal has joined #openstack-kuryr17:19
*** rwallner_ has joined #openstack-kuryr17:25
*** salv-orlando has joined #openstack-kuryr17:28
*** rwallner has quit IRC17:28
*** salv-orl_ has quit IRC17:32
*** salv-orlando has quit IRC17:32
*** aojea has joined #openstack-kuryr17:51
*** gouthamr_ is now known as gouthamr17:52
*** aojea has quit IRC17:56
*** aojea has joined #openstack-kuryr18:38
*** aojea has quit IRC18:47
*** aojea has joined #openstack-kuryr19:43
*** aojea has quit IRC19:48
*** rwallner_ has quit IRC19:55
*** tonygunk has joined #openstack-kuryr20:27
*** aojea has joined #openstack-kuryr20:44
*** aojea has quit IRC20:49
*** tonygunk has quit IRC21:00
*** gouthamr has quit IRC21:08
*** salv-orlando has joined #openstack-kuryr21:42
*** salv-orlando has quit IRC21:44
*** salv-orlando has joined #openstack-kuryr21:44
*** aojea has joined #openstack-kuryr21:45
*** aojea has quit IRC21:49
*** atoth has quit IRC22:43
*** salv-orlando has quit IRC22:45
*** salv-orlando has joined #openstack-kuryr22:46
*** oanson has quit IRC22:50
*** salv-orlando has quit IRC22:51
*** oanson has joined #openstack-kuryr22:52
*** hongbin has quit IRC23:32
*** salv-orlando has joined #openstack-kuryr23:46
*** salv-orlando has quit IRC23:52

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!