Wednesday, 2017-10-18

*** karimb has quit IRC00:18
*** zengchen has quit IRC00:22
*** c00281451 has joined #openstack-kuryr00:34
*** aojea has joined #openstack-kuryr00:47
*** salv-orlando has joined #openstack-kuryr00:48
*** limao has joined #openstack-kuryr00:48
*** aojea has quit IRC00:51
*** salv-orlando has quit IRC00:52
*** aojea has joined #openstack-kuryr01:48
*** salv-orlando has joined #openstack-kuryr01:48
*** c00281451 has quit IRC01:52
*** aojea has quit IRC01:52
*** salv-orlando has quit IRC01:53
*** hongbin has joined #openstack-kuryr01:54
*** yamamoto has joined #openstack-kuryr01:56
*** limao has quit IRC02:14
*** limao_ has joined #openstack-kuryr02:14
*** wangbo has joined #openstack-kuryr02:28
*** gouthamr has joined #openstack-kuryr02:32
*** gouthamr has quit IRC02:42
*** gouthamr has joined #openstack-kuryr02:42
*** salv-orlando has joined #openstack-kuryr02:50
*** salv-orlando has quit IRC02:54
*** limao_ has quit IRC03:03
*** limao has joined #openstack-kuryr03:05
*** limao has quit IRC03:06
*** limao_ has joined #openstack-kuryr03:06
*** limao has joined #openstack-kuryr03:09
*** limao has quit IRC03:09
*** limao has joined #openstack-kuryr03:10
*** limao_ has quit IRC03:11
*** limao has quit IRC03:12
*** limao has joined #openstack-kuryr03:12
*** limao has quit IRC03:16
*** limao has joined #openstack-kuryr03:17
*** limao has quit IRC03:21
*** openstackgerrit has quit IRC03:22
*** limao has joined #openstack-kuryr03:22
*** limao has quit IRC03:22
*** goncalo has quit IRC03:25
*** phuoc_ has quit IRC03:31
*** goncalo has joined #openstack-kuryr03:32
*** yamamoto_ has joined #openstack-kuryr03:39
*** hongbin has quit IRC03:40
*** yamamoto has quit IRC03:42
*** aojea has joined #openstack-kuryr03:49
*** salv-orlando has joined #openstack-kuryr03:50
*** aojea has quit IRC03:54
*** salv-orlando has quit IRC03:55
*** kiennt26 has quit IRC03:57
*** kiennt26 has joined #openstack-kuryr03:57
*** janki has joined #openstack-kuryr04:36
*** vikasc has quit IRC04:46
*** kiennt26 has quit IRC04:47
*** limao has joined #openstack-kuryr04:47
*** aojea has joined #openstack-kuryr04:50
*** salv-orlando has joined #openstack-kuryr04:51
*** aojea has quit IRC04:54
*** salv-orlando has quit IRC04:57
*** vikasc has joined #openstack-kuryr04:59
*** wangbo has quit IRC05:09
*** yboaron__ has joined #openstack-kuryr05:10
*** salv-orlando has joined #openstack-kuryr05:15
*** pmannidi has quit IRC05:15
*** limao has quit IRC05:23
*** limao has joined #openstack-kuryr05:24
*** janki has quit IRC05:25
*** limao has quit IRC05:29
*** gouthamr has quit IRC05:34
*** aojea has joined #openstack-kuryr05:51
*** wangbo has joined #openstack-kuryr05:51
*** aojea has quit IRC05:55
*** kiennt26 has joined #openstack-kuryr06:17
*** janki has joined #openstack-kuryr06:24
*** salv-orlando has quit IRC06:33
*** salv-orlando has joined #openstack-kuryr06:35
*** openstackgerrit has joined #openstack-kuryr06:40
openstackgerritYossi Boaron proposed openstack/kuryr-kubernetes master: Eliminate wrong ERROR report (in kuryr log file) when service of type LoadBalancer type is deleted  https://review.openstack.org/51267006:40
*** pcaruana has joined #openstack-kuryr06:45
*** yboaron__ has quit IRC07:19
*** danil has joined #openstack-kuryr07:26
*** caowei has joined #openstack-kuryr07:29
*** aojea has joined #openstack-kuryr07:38
*** karimb has joined #openstack-kuryr07:41
*** egonzalez has joined #openstack-kuryr07:46
*** aojea has quit IRC07:48
*** wangbo has quit IRC07:49
*** wangbo has joined #openstack-kuryr07:50
*** caowei has quit IRC08:04
*** caowei has joined #openstack-kuryr08:05
*** wangbo has quit IRC08:06
*** jerms has quit IRC08:07
*** jerms has joined #openstack-kuryr08:07
*** wangbo has joined #openstack-kuryr08:08
*** phuoc_ has joined #openstack-kuryr08:17
*** wangbo has quit IRC08:23
*** wangbo has joined #openstack-kuryr08:27
*** yboaron__ has joined #openstack-kuryr08:30
*** wangbo has quit IRC08:31
*** yamamoto_ has quit IRC08:37
*** yboaron_ has joined #openstack-kuryr08:38
*** leyal has quit IRC08:38
*** yboaron__ has quit IRC08:39
*** yamamoto has joined #openstack-kuryr08:40
*** yamamoto has quit IRC08:40
*** aojea has joined #openstack-kuryr08:44
*** leyal has joined #openstack-kuryr08:45
*** garyloug has joined #openstack-kuryr08:45
*** wangbo has joined #openstack-kuryr08:46
*** aojea has quit IRC08:48
openstackgerritYossi Boaron proposed openstack/kuryr-kubernetes master: Closes-Bug: #1714204   https://review.openstack.org/51263608:57
openstackbug 1714204 in kuryr-kubernetes "Delete service/deployment causes exception" [Medium,Triaged] https://launchpad.net/bugs/1714204 - Assigned to yossib (yossi-boaron-1234)08:57
openstackgerritYossi Boaron proposed openstack/kuryr-kubernetes master: Eliminate wrong ERROR report  when service of type LoadBalancer type is deleted  https://review.openstack.org/51267009:02
*** yamamoto has joined #openstack-kuryr09:07
*** pmannidi has joined #openstack-kuryr09:09
*** yamamoto has quit IRC09:11
*** yamamoto has joined #openstack-kuryr09:17
*** wangbo has quit IRC09:17
*** yamamoto has quit IRC09:21
*** wangbo has joined #openstack-kuryr09:26
*** wangbo has quit IRC09:28
*** yamamoto has joined #openstack-kuryr09:28
*** caowei has quit IRC09:28
*** egonzalez has quit IRC09:31
*** wangbo has joined #openstack-kuryr09:33
apuimedodulek: how's the cni eexist investigation going?09:33
dulekapuimedo: Awful, as I'm just finding new bugs.09:34
*** yamamoto_ has joined #openstack-kuryr09:34
dulekapuimedo: To get things tested after a failed run I sometimes need to restack, as there seems to be one more bug:09:34
dulekIf a vif never got plugged correctly, when deleting that pod kubelet will call CNI, CNI will spawn a watcher thread and this thread never gets its WATCH notification.09:35
dulekSo everything fails with timeouts constantly.09:35
apuimedodulek: yeah09:36
apuimedoand I learned of another bug today thinking in the shower09:36
apuimedoif you are waiting for the port to go active and in the meantime the pod is terminated09:37
apuimedobooom09:37
apuimedowe don't cancel that thread09:37
apuimedoso we just timeout09:37
dulekapuimedo: Yup, very similar thing, isn't it?09:37
apuimedosimilar09:37
dulekapuimedo: I'd paraphrase a saying - there are 2 things that are hard in computer programming and distributed systems. ;)09:37
*** yamamoto has quit IRC09:37
dulek(and off-by-one errors)09:37
apuimedothe good thing is that with daemonized cni it is easier to fix it :-)09:37
apuimedodulek: and async09:38
apuimedofucking callbacks09:38
apuimedoxD09:38
apuimedodulek: so, the game plan should be09:38
apuimedoin the cni daemon we should probably keep in a data structure the events we've seen09:39
apuimedothe key should maybe be the pod id09:39
apuimedoand the value a list with the last events seen09:39
apuimedoso if when add to network is called we already saw the damn vif09:40
apuimedowe don't need to wait again09:40
dulekapuimedo: That's feasible once the daemon is watching /pods?host=host09:40
apuimedoif a delete action comes09:40
apuimedodulek: that should happen ASAP, yes09:40
apuimedoit is of the utmost importance to change to that09:40
apuimedooh, the value should be a tuple of the list of events and active thread09:41
apuimedoso it can be cancelled09:41
apuimedoso we should have a handler that on cancelling the thread it cleans up the resources09:41
dulekapuimedo: I think I've drafted something like that, just haven't saved it into a dict (event was saved in the handler).09:42
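The registry apuimedo outlines above (pod id as key, recent events plus something cancellable as value) could be sketched roughly as follows. This is only an illustration under stated assumptions: `PodRegistry`, `record`, `cancel` and `wait_for` are hypothetical names, not kuryr-kubernetes code, and because plain Python threads cannot be killed from outside, the tuple holds a cancel flag (`threading.Event`) instead of the thread itself.

```python
import threading
from collections import deque


class PodRegistry:
    """Hypothetical cache: pod id -> (recent events, cancel flag)."""

    def __init__(self, max_events=10):
        self._cond = threading.Condition()
        self._max = max_events
        self._pods = {}

    def _entry(self, pod_id):
        # Caller must hold self._cond.
        return self._pods.setdefault(
            pod_id, (deque(maxlen=self._max), threading.Event()))

    def record(self, pod_id, event):
        """Store an event seen by the watcher and wake any waiters."""
        with self._cond:
            self._entry(pod_id)[0].append(event)
            self._cond.notify_all()

    def cancel(self, pod_id):
        """On delete: flag waiters as cancelled and drop the cached state."""
        with self._cond:
            entry = self._pods.pop(pod_id, None)
            if entry is not None:
                entry[1].set()
            self._cond.notify_all()

    def wait_for(self, pod_id, predicate, timeout):
        """Return the first matching event, or None on cancel or timeout."""
        with self._cond:
            events, cancelled = self._entry(pod_id)

            def find():
                return next((e for e in events if predicate(e)), None)

            self._cond.wait_for(
                lambda: cancelled.is_set() or find() is not None,
                timeout=timeout)
            return None if cancelled.is_set() else find()
```

With something like this, an add-to-network call that arrives after the vif event was already recorded returns immediately, and a delete wakes the waiter instead of letting it run into its timeout.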
dulekapuimedo: So here's the issue: Watcher expects oslo.service.ThreadGroup instance. This means eventlet.09:42
apuimedodulek: I know09:42
dulekapuimedo: Question is - do we fear eventlet so much that it's worth reimplementing Watcher to be less eventlet-dependent?09:42
dulekI've tried running with eventlet.monkey_patch() and I see that pyroute2 fails.09:43
dulekBut there's this: https://github.com/craneworks/python-pyroute2/blob/master/pyroute2/config/eventlet.py09:43
dulek(I should probably find it in Neutron as author states it's from there)09:44
apuimedodulek: maybe you can write to svinota and put me in CC09:44
*** aojea has joined #openstack-kuryr09:45
apuimedodulek: I wonder if we really gain much by using eventlet compared to just having pthreads09:45
dulekapuimedo: I doubt it, but I'm not sure if some standard lib has the same interface as oslo.service?09:46
apuimedovery doubtful09:46
dulekapuimedo: Good news - I think I've overcome the issue with IPDB.09:47
apuimedoyou mean the eexist? Or with eventlet?09:47
dulekeexist.09:47
apuimedohow?09:48
dulekOkay, I didn't. :P I don't really know what's happening here, but it started to manifest again.09:48
apuimedo:-)09:48
dulekI thought changing https://github.com/openstack/kuryr-kubernetes/blob/master/kuryr_kubernetes/cni/binding/base.py#L31-L40 to return a new IPDB instance for each thread would help.09:48
*** aojea has quit IRC09:49
apuimedoshouldn't if it is due to timeouts09:49
*** egonzalez has joined #openstack-kuryr09:50
dulekapuimedo: You're right. But I'm seeing 2 types of errors. Second one is KeyError: u'tapdc9c0043-a2' - and for this IPDB per thread helps.09:50
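The "IPDB per thread" idea dulek mentions can be illustrated with `threading.local` from the standard library. This is a generic sketch, not the actual change to `base.py`: `get_per_thread` is a hypothetical helper, and the factory passed in is a stand-in for whatever builds the real object (presumably `pyroute2.IPDB` in the driver, which is an assumption here and is not imported so the example runs anywhere).

```python
import threading

# Each thread sees its own attributes on this object.
_local = threading.local()


def get_per_thread(factory):
    """Return this thread's instance, creating it on first use.

    `factory` is any zero-argument callable; it is called at most once
    per thread, so threads never share the object it returns.
    """
    instance = getattr(_local, "instance", None)
    if instance is None:
        instance = _local.instance = factory()
    return instance
```

Repeated calls from one thread return the same object, while a second thread gets a separate one, which is the property that would avoid two handlers touching the same interface cache at once.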
dulekI'll dig a bit more… I'm really getting sick of BaseHTTPServer, I blame it for the timeouts.09:51
apuimedodulek: wanna do gRPC?09:52
apuimedoit's in openstack/requirements09:52
apuimedoor you want to use the same wsgi as neutron and keystone?09:53
dulekapuimedo: Hard to decide. gRPC should do it. And if k8s is moving to that for CNI communication…09:54
apuimedodulek: that's my thought09:55
*** yamamoto_ has quit IRC09:56
*** yboaron__ has joined #openstack-kuryr10:01
*** pmannidi has quit IRC10:02
*** yboaron_ has quit IRC10:03
*** kiennt26 has quit IRC10:05
*** wangbo has quit IRC10:22
*** yamamoto has joined #openstack-kuryr10:26
*** yboaron_ has joined #openstack-kuryr10:34
*** yboaron__ has quit IRC10:36
*** egonzalez has quit IRC10:39
*** aojea has joined #openstack-kuryr10:45
*** aojea has quit IRC10:50
*** caowei has joined #openstack-kuryr10:52
*** atoth has joined #openstack-kuryr11:12
openstackgerritYossi Boaron proposed openstack/kuryr-kubernetes master: Fix KeyError exception upon  service/deployment deletion  https://review.openstack.org/51263611:19
*** rwallner has joined #openstack-kuryr11:27
*** caowei has quit IRC11:31
*** wangbo has joined #openstack-kuryr11:34
*** wangbo has quit IRC11:43
*** aojea has joined #openstack-kuryr11:46
*** aojea has quit IRC11:50
*** wangbo has joined #openstack-kuryr11:55
*** phuoc_ has quit IRC11:58
*** yamamoto has quit IRC11:59
*** salv-orlando has quit IRC12:26
*** salv-orlando has joined #openstack-kuryr12:28
*** aojea has joined #openstack-kuryr12:47
*** aojea has quit IRC12:51
*** yamamoto has joined #openstack-kuryr12:59
*** ajo has quit IRC13:03
*** yamamoto has quit IRC13:07
dulekapuimedo: Okay, simple question… How do you think a single file socket behaves with multiple incoming TCP connections?13:08
dulekapuimedo: Because I'm now starting to think that this is the main issue - I don't get any timeouts where there aren't multiple requests from kubelet coming simultaneously.13:09
*** wangbo has quit IRC13:20
openstackgerritMerged openstack/kuryr master: Allow multiple binding drivers  https://review.openstack.org/50877813:23
*** gouthamr has joined #openstack-kuryr13:26
*** danil has quit IRC13:34
*** limao has joined #openstack-kuryr13:41
*** limao_ has joined #openstack-kuryr13:48
*** limao has quit IRC13:49
*** yamamoto has joined #openstack-kuryr13:54
*** yamamoto has quit IRC13:55
openstackgerritMerged openstack/kuryr-libnetwork master: Add installation guide for kuryr-libnetwork  https://review.openstack.org/50965513:58
*** salv-orlando has quit IRC14:11
*** salv-orlando has joined #openstack-kuryr14:11
*** garyloug has quit IRC14:15
*** garyloug has joined #openstack-kuryr14:15
*** salv-orlando has quit IRC14:16
*** janki has quit IRC14:25
*** limao_ has quit IRC14:25
*** limao has joined #openstack-kuryr14:25
*** tonygunk has joined #openstack-kuryr14:43
apuimedodulek: afaik it should work14:46
apuimedolet me check14:46
dulekapuimedo: Mhm, I think so too after some research. I'm now looking if it isn't client's fault.14:47
apuimedook14:47
*** aojea has joined #openstack-kuryr14:48
dulekapuimedo: One of the funnier bugs I'm getting is kubelet/CNI driver starting to use all CPU. Restart doesn't help…14:49
*** aojea has quit IRC14:53
*** yamamoto has joined #openstack-kuryr14:55
*** ajo has joined #openstack-kuryr15:00
apuimedo:/15:01
apuimedodulek: maybe you can upgrade to latest hyperkube15:02
apuimedogotta go pick up the kid from school15:02
apuimedowill check on things later15:02
dulekapuimedo: I'll try switching to normal INET socket first… If that doesn't help I'm puzzled.15:02
dulekapuimedo: Because that would mean the problem is… elsewhere?15:03
*** yamamoto has quit IRC15:05
*** limao has quit IRC15:09
*** pcaruana has quit IRC15:19
*** hongbin has joined #openstack-kuryr15:27
dulekapuimedo: Sweet mother of TCP/IP stacks - it works now!15:42
dulekapuimedo: I'll just switch the server to bind to 127.0.0.1:CONF.port_number and client to call that. Looks like it'll help.15:44
dulekapuimedo: That will be the easiest way for now, then we can evaluate grpc or whatever.15:44
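A rough picture of the loopback setup dulek describes, with the server bound to 127.0.0.1 on a configurable port and the client calling that address, is sketched below. It uses Python 3's `http.server` (the log refers to the older `BaseHTTPServer` module), and `Handler`, `start_server`, `call` and the `/addNetwork` path are illustrative assumptions, not the kuryr-kubernetes daemon's real API.

```python
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


class Handler(BaseHTTPRequestHandler):
    """Echo handler standing in for the CNI daemon's request handling."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        body = self.rfile.read(length)
        self.send_response(200)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        # Keep the example quiet.
        pass


def start_server(port=0):
    """Bind to loopback only; port 0 lets the OS pick a free port."""
    server = ThreadingHTTPServer(("127.0.0.1", port), Handler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def call(port, payload):
    """POST payload to the local server, bypassing any configured proxy."""
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    request = urllib.request.Request(
        "http://127.0.0.1:%d/addNetwork" % port, data=payload, method="POST")
    with opener.open(request, timeout=5) as response:
        return response.read()
```

`ThreadingHTTPServer` handles each request in its own thread, so several simultaneous calls (as kubelet makes when many pods start at once) do not queue behind one another in the handler.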
*** yboaron_ has quit IRC15:44
apuimedodulek: how much did that help?15:45
*** aojea has joined #openstack-kuryr15:49
*** aojea has quit IRC15:54
*** salv-orlando has joined #openstack-kuryr16:12
*** salv-orlando has quit IRC16:17
*** salv-orlando has joined #openstack-kuryr16:25
*** aojea has joined #openstack-kuryr16:50
*** aojea has quit IRC16:54
*** leyal has quit IRC17:04
*** leyal has joined #openstack-kuryr17:05
*** garyloug has quit IRC17:14
*** aojea has joined #openstack-kuryr17:22
*** phuoc_ has joined #openstack-kuryr18:02
*** tonygunk has quit IRC18:24
*** aojea has quit IRC18:30
*** aojea has joined #openstack-kuryr19:16
*** irenab has left #openstack-kuryr19:17
*** aojea has quit IRC19:21
*** jdavis has joined #openstack-kuryr19:28
*** atoth has quit IRC19:28
*** salv-orlando has quit IRC19:31
*** salv-orlando has joined #openstack-kuryr19:35
openstackgerritHongbin Lu proposed openstack/kuryr-libnetwork master: [WIP] Introduce sriov port driver  https://review.openstack.org/50043619:40
*** salv-orlando has quit IRC20:02
*** salv-orlando has joined #openstack-kuryr20:02
*** salv-orlando has quit IRC20:07
*** openstackgerrit has quit IRC20:17
*** salv-orlando has joined #openstack-kuryr20:18
*** rwallner has quit IRC20:31
*** rwallner has joined #openstack-kuryr20:33
*** rwallner_ has joined #openstack-kuryr20:34
*** rwallner has quit IRC20:37
*** rwallner_ has quit IRC20:38
*** gouthamr has quit IRC20:43
*** jdavis has quit IRC21:11
*** salv-orlando has quit IRC21:16
*** salv-orlando has joined #openstack-kuryr21:18
*** gouthamr has joined #openstack-kuryr21:20
*** salv-orlando has quit IRC21:21
*** salv-orlando has joined #openstack-kuryr21:23
*** rwallner has joined #openstack-kuryr22:05
*** rwallner has quit IRC22:09
*** salv-orlando has quit IRC22:33
*** gouthamr has quit IRC22:41
*** rwallner has joined #openstack-kuryr22:44
*** rwallner has quit IRC22:44
*** rwallner has joined #openstack-kuryr22:45
*** rwallner has quit IRC22:48
*** jdavis has joined #openstack-kuryr23:12
*** jdavis has quit IRC23:17
*** hongbin has quit IRC23:21
*** karimb has quit IRC23:21
*** salv-orlando has joined #openstack-kuryr23:34
*** salv-orlando has quit IRC23:39