Wednesday, 2018-12-12

*** rh-jelabarre has quit IRC01:24
*** garyloug has quit IRC01:45
*** JJ-Club has joined #openstack-kuryr02:09
*** hongbin has joined #openstack-kuryr02:44
*** hongbin has quit IRC04:55
*** irenab has joined #openstack-kuryr05:43
*** gcheresh_ has joined #openstack-kuryr06:15
*** yboaron_ has joined #openstack-kuryr06:29
*** ccamposr has joined #openstack-kuryr07:22
*** openstackgerrit has quit IRC07:29
*** gcheresh_ has quit IRC07:50
*** openstackgerrit has joined #openstack-kuryr08:08
openstackgerritDaniel Mellado proposed openstack/kuryr-kubernetes master: Add needed driver and handler for policy gate  https://review.openstack.org/62460408:08
openstackgerritDaniel Mellado proposed openstack/kuryr-tempest-plugin master: Add network policy security group rules tests  https://review.openstack.org/60361808:12
openstackgerritDaniel Mellado proposed openstack/kuryr-tempest-plugin master: Add network policy security group rules tests  https://review.openstack.org/60361808:15
*** gcheresh_ has joined #openstack-kuryr08:16
*** yboaron_ has quit IRC08:18
*** yboaron_ has joined #openstack-kuryr09:07
*** garyloug has joined #openstack-kuryr09:43
openstackgerritDaniel Mellado proposed openstack/kuryr-kubernetes master: Add needed driver and handler for policy gate  https://review.openstack.org/62460409:45
*** gkadam has joined #openstack-kuryr10:05
*** gkadam has quit IRC10:05
openstackgerritGenadi Chereshnya proposed openstack/kuryr-tempest-plugin master: Working with noop for vif_pool_driver  https://review.openstack.org/62141110:25
openstackgerritMaysa de Macedo Souza proposed openstack/kuryr-kubernetes master: Add support to match expressions in Network Policies  https://review.openstack.org/62400910:51
*** maysams has joined #openstack-kuryr10:51
dulekyboaron_: Ping!12:23
dulekyboaron_: I'm trying to tackle python 3.6 gate errors. I think timeouts on that gate are more often.12:24
dulekyboaron_: When you take a look at this log: http://logs.openstack.org/84/623184/1/gate/kuryr-kubernetes-tempest-daemon-octavia-py36/346910f/controller/logs/screen-kuryr-kubernetes.txt.gz#_Dec_11_14_17_39_64853312:24
dulekAnd look for kuryr-service-471643875 - it seems that LBaaS spec is getting annotated, but no state ever is.12:24
dulekAlso I can't find any try to create an LB with IP of 10.1.0.143, which corresponds to that svc.12:25
dulekLooks exactly as if it's being ignored. Any idea why can that happen?12:25
dulekOoooh, maybe _has_pods is failing somehow…? This might be it…12:28
yboaron_dulek, pong12:31
dulekyboaron_: Some info above. :)12:32
yboaron_dulek, yep, checking12:32
dulekOooh, this code is awful: https://github.com/openstack/kuryr-kubernetes/blob/300dc36b062aaa2c5eb1c3ee2f8c041e1e33a99d/kuryr_kubernetes/controller/handlers/lbaas.py#L323-L32612:32
dulekBut _has_pods should be fine.12:33
dulekyboaron_: port['name']?12:36
dulekyboaron_: https://github.com/openstack/kuryr-kubernetes/blob/300dc36b062aaa2c5eb1c3ee2f8c041e1e33a99d/kuryr_kubernetes/controller/handlers/lbaas.py#L31212:36
dulekyboaron_: I cannot see 'name' in ports inside subsets, inside endpoints: http://logs.openstack.org/84/623184/1/gate/kuryr-kubernetes-tempest-daemon-octavia-py36/346910f/controller/logs/kubernetes/endpoints.txt.gz12:36
yboaron_dulek, IIRC, name is optional12:37
dulekOkay. So looking at endpoint definition this check should pass as well…12:38
yboaron_dulek, Does all service creation fail in py36 gate?12:39
yboaron_dulek, I meant all services12:39
dulekyboaron_: I think that some succeed.12:39
dulekyboaron_:12:40
dulek2018-12-11 14:24:12.320 26470 INFO kuryr_tempest_plugin.tests.scenario.base [-] Look for openstack.org/kuryr-lbaas-state string in ep=kuryr-service-168770576 annotation12:40
dulek2018-12-11 14:24:16.555 26472 INFO kuryr_tempest_plugin.tests.scenario.base [-] We received 2 replies from prot=TCP;10.1.0.132:80 - connectivity was veified!12:40
dulekE.g.12:40
dulekOh, it may be only a timeout waiting for LB or member to be created?12:41
dulekyboaron_: So lbaas-spec is annotated on endpoint on 14:17:39.12:48
dulekyboaron_: But no LB creation happens after that date in log of Octavia API.12:48
yboaron_dulek, we should see the LB creation POST request at OCTAVIA12:49
*** rh-jelabarre has joined #openstack-kuryr12:50
dulekyboaron_: That's what I'm looking for. Just a sec, I'll try to match all the POST's I see to a K8s svc.12:50
yboaron_dulek, first service handler annotates the lbaas spec at endpoint, and after that the endpoint handler should do rest of the work12:50
yboaron_dulek, by rest of the work I mean LB,listener, bla bla  creation12:51
dulekyboaron_: Yes, so I only see 3 POST's on LB's in octavia-api logs.12:53
dulekyboaron_: One is the K8s API LB, two others are related to different services that passed connectivity test.12:53
yboaron_dulek, do we have the endpoints spec anywhere? k8s logs?12:54
dulekyboaron_: The one for kuryr-service-471643875 is missing.12:54
dulekyboaron_: Sure: http://logs.openstack.org/84/623184/1/gate/kuryr-kubernetes-tempest-daemon-octavia-py36/346910f/controller/logs/kubernetes/endpoints.txt.gz12:54
dulekIt's still there as it timed out on it.12:54
dulekSo this one endpoint never got deleted.12:54
yboaron_dulek, need to leave for 10 minsm will continue to look at this later12:57
dulekyboaron_: I've walked through the code and besides _should_ignore() there is nothing that could prevent POST from being set.13:06
dulekOther than endpoint notification not coming from K8s at all, of course. :P13:06
dulekyboaron_: _should_ignore() does some Python magic with any() and generators. Maybe Python 3 changes some behavior of those.13:07
dulekyboaron_: First I'll try to find such a failure on non-Python 3 gate.13:08
dulekIf I won't find it, I'll add additional logs there and debug further.13:08
openstackgerritGenadi Chereshnya proposed openstack/kuryr-tempest-plugin master: Testing port_pool_min and port_pool_max  https://review.openstack.org/62006013:08
dulekyboaron_: Okay, this is not tied to Python 3, we can see something similar here: http://logs.openstack.org/84/623184/1/check/kuryr-kubernetes-tempest-daemon-octavia/eac0c8c/controller/logs/kubernetes/endpoints.txt.gz13:26
yboaron_dulek, this one is kubernetes endpoint, we create the LB  for this service in deployment/plugin.sh13:30
dulekyboaron_: kuryr-service-1704367057 is not.13:31
dulekThis run is pretty similar, but… we're missing a ton of Octavia API logs for some reason…13:31
yboaron_dulek, that;s correct- but maybe it's related to py36 and the should_ignore code13:31
dulekyboaron_: Symptoms are the same…13:32
yboaron_dulek, not sure that I followed all the details, do u see the same behavior also in non py3.6 gates?13:33
dulekyboaron_: Yes, on the same patch I've found kuryr-kubernetes-tempest-daemon-octavia run that has similar issue.13:33
dulekyboaron_: Aaaand… There's one pod that never got wired nor it's name appear in kuryr-controller log.13:34
dulekDaaamn, something's really screwed up.13:35
yboaron_dulek, can u share the patch link?13:35
dulekYeah, that pod never gets annotated.13:35
dulekyboaron_: You can always get the commit id from logs link: 62318413:36
dulekyboaron_: So https://review.openstack.org/#/c/62318413:36
yboaron_dulek, OK!, 10x13:36
dulekMaybe watcher on controller dies on some point…13:37
dulekHm, nope, services are watched fine.13:39
dulekIt seems like it's loosing some of the notifications from K8s API. I wonder if there are etcd errors?13:39
dulekNah, etcd looks pretty much okay.13:40
yboaron_dulek, IIRc, K8S updates endpoints subset details only after relevant pods were wired, and we can see that subsets are updated at endpoints13:40
*** garyloug_ has joined #openstack-kuryr13:40
dulekyboaron_: But we see this stuff updated.13:40
dulekyboaron_: http://logs.openstack.org/84/623184/1/check/kuryr-kubernetes-tempest-daemon-octavia/eac0c8c/controller/logs/kubernetes/endpoints.txt.gz - it's pretty much updated.13:41
dulekBoth pods are up and running.13:41
yboaron_dulek, that's exactly what I meant, seems that pod handlers are fine13:42
dulekIt's a different one that never got noticed by controller.13:42
dulekWaaaait, this one's pending… It's something with Kubernetes itself…13:43
*** garyloug has quit IRC13:44
*** gcheresh_ has quit IRC14:00
dulekAwwww, this seems to be etcd issue again.14:00
*** garyloug_ has quit IRC14:01
yboaron_dulek, How do you know?14:01
*** garyloug_ has joined #openstack-kuryr14:02
dulekyboaron_: All the failed runs have higher number of timeouts in etcd and kubernetes-api logs.14:02
dulekyboaron_: I'm trying to confirm that and see if it's related to the cloud the gate is running on.14:02
*** irclogbot_1 has quit IRC14:32
*** gcheresh_ has joined #openstack-kuryr14:38
*** irclogbot_1 has joined #openstack-kuryr14:42
*** irclogbot_1 has quit IRC14:55
openstackgerritLuis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Add support for podSelector  https://review.openstack.org/62355414:58
*** garyloug_ has quit IRC15:03
*** garyloug_ has joined #openstack-kuryr15:04
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Testing etcd with highest IO priority  https://review.openstack.org/62473115:13
*** maysams has quit IRC15:20
*** irclogbot_1 has joined #openstack-kuryr15:21
*** gcheresh_ has quit IRC15:41
*** maysams has joined #openstack-kuryr15:43
openstackgerritYossi Boaron proposed openstack/kuryr-kubernetes master: Log k8s endpoints being ignored by Kuryr  https://review.openstack.org/62474916:15
openstackgerritMerged openstack/kuryr-tempest-plugin master: Use kuryr-kubernetes-tempest-jobs project template  https://review.openstack.org/62318416:23
*** yboaron_ has quit IRC16:55
*** ccamposr has quit IRC17:06
*** garyloug__ has joined #openstack-kuryr17:21
*** garyloug_ has quit IRC17:24
openstackgerritLuis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Add support for podSelector  https://review.openstack.org/62355417:51
*** garyloug__ has quit IRC18:05
*** celebdor_ has left #openstack-kuryr19:55
*** yboaron_ has joined #openstack-kuryr20:56
*** yboaron_ has quit IRC21:24
*** s1061123 has joined #openstack-kuryr22:23
*** rh-jelabarre has quit IRC22:29
*** s1061123_ has quit IRC22:29
*** rh-jelabarre has joined #openstack-kuryr22:29

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!