Wednesday, 2019-04-03

*** spotz has quit IRC01:12
*** hongbin has joined #openstack-kuryr01:41
*** spsurya has joined #openstack-kuryr02:12
*** gkadam has joined #openstack-kuryr02:37
*** hongbin has quit IRC03:00
*** ccamposr has joined #openstack-kuryr06:13
*** janki has joined #openstack-kuryr06:17
*** gcheresh has joined #openstack-kuryr06:25
*** pcaruana has joined #openstack-kuryr06:36
*** pcaruana has quit IRC06:38
*** pcaruana has joined #openstack-kuryr06:38
*** maysams has joined #openstack-kuryr06:52
dulekltomasbo: I could use some of your judgment here, you were working closer to that stuff.07:10
dulekltomasbo: So the root issue of the gate breakage is the fact that inside Amphora's amphora-haproxy netns, no default route is set anymore.07:11
dulekltomasbo: Previously it was set to our Neutron router 10.1.0.190.07:11
dulekltomasbo: So life was good, even though Amp was only connected to services subnet (and kubelet OVS bind is on pod subnet).07:12
dulekltomasbo: I don't yet know why that is happening.07:12
dulekltomasbo: Anyway my fix was to add --subnet-id to member creation, which made the API's amphora connect to the pod subnet.07:13
ltomasbodulek, umm07:13
dulekltomasbo: But now the octavia_pod_access SG, which only opens ingress from services subnet is not enough.07:13
dulekltomasbo: Should I just make it open to pod subnet as well?07:13
ltomasbodulek, I think if you add subnet-id, then you are enforcing L2 mode instead of L3 mode for amphora07:13
dulekltomasbo: Crap, it looked like L2 mode is enforced even without that.07:14
dulekltomasbo: Like no default route in amphora's internal netns routes.07:14
ltomasboand that was actually not properly working, so I think we actually made k8s amphora L3, even if the rest of LBs (svc lbs) are L207:14
dulekltomasbo: Hm… So what do you think we should do?07:16
dulekltomasbo: What's triggering L2 or L3 modes of amps?07:16
ltomasbodulek, adding or not the subnet-id07:22
ltomasbowhen attaching the members07:22
ltomasboso, in your patch, by adding the subnet id, you are enforcing it is L207:22
*** jistr is now known as jistr|afk07:22
dulekltomasbo: OH MY. This might be an openstack-client/openstacksdk regression.07:23
dulekThe former, we don't use openstacksdk on devstack plugin.07:23
* dulek digs.07:23
ltomasboohh, that could be, yes07:24
ltomasboit is an easy way to check if you are using L2 or L307:25
ltomasboyou can check if a port on the subnet id is created for the amphora07:25
ltomasbodulek, ^07:25
dulekltomasbo: Yeah, yeah, but if it implicitly set --subnet-id=service-subnet, I won't see it…07:26
dulekltomasbo: Because Amphora is bound there by default.07:26
ltomasboahh, ok07:27
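
[Editor's sketch] The check ltomasbo suggests above (does the amphora own a port on the member subnet?) could be expressed roughly as follows. This is a minimal sketch, not anything from the Kuryr or Octavia code: the function name, the sample IDs and the sample port list are hypothetical, and the only thing assumed about Neutron is that a port record carries `device_id` and a `fixed_ips` list whose entries include `subnet_id`.

```python
def amphora_has_port_on_subnet(ports, amphora_compute_id, member_subnet_id):
    """Return True if the amphora VM owns a port with a fixed IP on the given subnet.

    `ports` is a list of dicts shaped like Neutron port records
    (only `device_id` and `fixed_ips[*].subnet_id` are read).
    """
    for port in ports:
        if port.get("device_id") != amphora_compute_id:
            continue  # port belongs to some other device
        for fixed_ip in port.get("fixed_ips", []):
            if fixed_ip.get("subnet_id") == member_subnet_id:
                return True
    return False


# Hypothetical sample data, invented for illustration only.
SAMPLE_PORTS = [
    {"device_id": "amp-vm-1", "fixed_ips": [{"subnet_id": "service-subnet"}]},
    {"device_id": "amp-vm-1", "fixed_ips": [{"subnet_id": "pod-subnet"}]},
    {"device_id": "other-vm", "fixed_ips": [{"subnet_id": "pod-subnet"}]},
]

if __name__ == "__main__":
    # A port on the pod (member) subnet suggests L2 mode for this amphora.
    print(amphora_has_port_on_subnet(SAMPLE_PORTS, "amp-vm-1", "pod-subnet"))
```

As dulek points out, this check cannot tell the two modes apart when the member subnet is the services subnet, because the amphora already has a port there by default.
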
dulekDamn, nothing striking in either python-openstackclient or python-octaviaclient.07:27
dulekltomasbo: This seems to be bigger, as we seem to have same issue with service tests - i.e. no connectivity. So I assume amps are forced into L2 mode as well.07:29
dulekIf that's true, the L2 job should succeed on my patch.07:29
ltomasboyou don't have that set in your local.conf, right?07:29
*** shachar has quit IRC07:30
ltomasbodulek, not sure, I think we made K8s API L3 for a reason... (but I don't remember what it was)07:30
dulekltomasbo: Double checking, but I don't think I have.07:31
dulekNope.07:31
ltomasbodulek, if you use an old amphora, is it working?07:43
ltomasbodulek, I remember there were some problems (at some point) with the centos base amphora image and L2 routing, perhaps something similar broke07:44
dulekltomasbo: I *think* so. I'm not sure here, it might be something else not being updated on my older env.07:44
dulekltomasbo: It's the Ubuntu, but I get the point.07:44
*** snapiri has joined #openstack-kuryr07:46
ltomasbodulek, sorry for the slow replies.... did the lbaas guys hit the same problem?08:10
dulekltomasbo: cgoncalves was debugging my env and pointed out that Amp is missing a default route that would direct the traffic to the router.08:12
ltomasbodulek, ok!08:16
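
[Editor's sketch] The symptom described above (no default route inside the amphora's network namespace) can be detected by scanning the routing table text. The sketch below assumes the text was collected with something like `ip netns exec amphora-haproxy ip route` on the amphora; the function name and both sample tables are hypothetical, and the addresses other than the router 10.1.0.190 mentioned in the log are made up.

```python
def has_default_route(ip_route_output):
    """Return True if the `ip route` text contains a line starting with 'default'."""
    for line in ip_route_output.splitlines():
        fields = line.split()
        if fields and fields[0] == "default":
            return True
    return False


# Hypothetical routing tables, invented for illustration only.
WORKING_TABLE = """\
default via 10.1.0.190 dev eth1
10.1.0.128/26 dev eth1 proto kernel scope link src 10.1.0.150
"""

BROKEN_TABLE = """\
10.1.0.128/26 dev eth1 proto kernel scope link src 10.1.0.150
"""

if __name__ == "__main__":
    for name, table in (("working", WORKING_TABLE), ("broken", BROKEN_TABLE)):
        print(name, has_default_route(table))
```

The parsing is kept separate from the command that collects the routes, so the same function can be used on output pasted from a broken environment.
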
*** ccamposr has quit IRC09:03
*** ccamposr has joined #openstack-kuryr09:04
*** celebdor has joined #openstack-kuryr09:07
*** ltomasbo has quit IRC10:12
*** ltomasbo has joined #openstack-kuryr10:39
dulekltomasbo: Hey, so why do we even have L3 mode in the first place?11:00
dulekltomasbo: Octavia folks are rather surprised that this worked before (well, as always).11:00
dulekltomasbo: And they advise to just always specify subnet_id.11:00
ltomasbowell, you are wasting an extra port per loadbalancer on the member subnet with L211:00
dulekltomasbo: I still don't know what change triggered the error, I'm now testing with one less neutron commit, but that's probably last resort, I don't have any more ideas.11:01
ltomasboplus, it was not working with some sdns at that time, like odl11:01
dulekltomasbo: Suuuure. But looks like we'll need to do it. Did I tell you that my patch + L2 mode enabled works fine?11:01
dulekAh crap.11:01
ltomasboif you did, I missed it (had some problems with IRC today)11:02
*** jistr|afk is now known as jistr11:02
ltomasbodulek, also, to be honest... not sure if amphora L2 mode works with network policies....11:02
ltomasboor better say, the other way around...11:02
dulekltomasbo: Wonderful. <311:03
ltomasbo:/11:08
ltomasboI know...11:09
ltomasbodulek, anyway, if that was working before and not anymore, they have a regression, right?11:09
dulekltomasbo: Oh come on, you must know how this works? :P11:10
ltomasboxD11:10
ltomasboI know I know...11:10
dulek- "You were abusing some bug that's now fixed."; - "Sure, but what bug?!"; - "Dunno"11:10
ltomasboI hope we have ovn-octavia soooooon11:11
dulekltomasbo: ubuntu-minimal was updated on March 20th. This is a long shot but maybe the infra image cache wasn't updated until the 29th and that's the issue…11:13
dulekIt definitely has a new version of cloud-init…11:13
dulekltomasbo: Let's see what happens with centos Amphora. :D11:13
ltomasbook!11:15
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: DNM: Testing with centos amphora  https://review.openstack.org/64958211:19
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: DNM: Testing with centos amphora  https://review.openstack.org/64958211:22
*** rh-jelabarre has quit IRC12:06
*** rh-jelabarre has joined #openstack-kuryr12:06
*** celebdor has quit IRC12:06
dulekltomasbo, dmellado: Okay, centos amp works fine on my local env. Do we switch our gates now and I'll continue to check if it's cloud-init update or something else?12:15
dulekgcheresh: Want the workaround for the DevStack issue?12:16
dulek(finally)12:16
gchereshdulek: of course12:16
dulekgcheresh: Just set those in local.conf while making sure to do `rm -rf /opt/stack/octavia/diskimage-create/amphora*`12:17
dulekOCTAVIA_AMP_BASE_OS=centos12:17
dulekOCTAVIA_AMP_DISTRIBUTION_RELEASE_ID=712:17
dulekOCTAVIA_AMP_IMAGE_SIZE=312:17
dulekgcheresh: Please note that you need to disable using downloaded amphora, so OCTAVIA_AMP_IMAGE_FILE must be unset.12:18
gchereshdulek: and restart controller after the change?12:18
dulekgcheresh: More like restack whole DevStack.12:18
ltomasbodulek, \o/12:18
dulekgcheresh: Or build the CentOS image yourself and reconfigure Octavia, but I don't know how to do that.12:18
dulekgcheresh: I mean it's certainly possible, but I wanted DevStack to do that for me. :P12:19
gchereshdulek: ok, will give the first option a try12:19
dulekgcheresh: Also note that centos image in https://tarballs.openstack.org/octavia/test-images/ is broken.12:19
dulekgcheresh: So you need to make sure your devstack builds a new one from latest octavia code (fix got in around 8 AM).12:19
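
[Editor's sketch] The workaround dulek describes above comes down to three settings in local.conf plus leaving OCTAVIA_AMP_IMAGE_FILE unset. A small checker for those conditions might look like the sketch below. The function names and the sample file contents are hypothetical; the variable names and values are the ones quoted in the log, and the parser is deliberately naive (plain KEY=VALUE lines only, no handling of trailing comments or shell expansion).

```python
REQUIRED_SETTINGS = {
    "OCTAVIA_AMP_BASE_OS": "centos",
    "OCTAVIA_AMP_DISTRIBUTION_RELEASE_ID": "7",
    "OCTAVIA_AMP_IMAGE_SIZE": "3",
}
MUST_BE_UNSET = "OCTAVIA_AMP_IMAGE_FILE"


def parse_settings(text):
    """Collect KEY=VALUE pairs, skipping blanks, comments and [[section]] headers."""
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or line.startswith("[["):
            continue
        key, sep, value = line.partition("=")
        if sep:
            settings[key.strip()] = value.strip()
    return settings


def check_workaround(text):
    """Return a list of human-readable problems; an empty list means the text passes."""
    settings = parse_settings(text)
    problems = []
    for key, expected in REQUIRED_SETTINGS.items():
        if settings.get(key) != expected:
            problems.append(f"{key} should be {expected}")
    if MUST_BE_UNSET in settings:
        problems.append(f"{MUST_BE_UNSET} must be unset")
    return problems


# Hypothetical local.conf contents, invented for illustration only.
GOOD_CONF = """\
[[local|localrc]]
OCTAVIA_AMP_BASE_OS=centos
OCTAVIA_AMP_DISTRIBUTION_RELEASE_ID=7
OCTAVIA_AMP_IMAGE_SIZE=3
"""

BAD_CONF = GOOD_CONF + "OCTAVIA_AMP_IMAGE_FILE=/tmp/some-amphora.qcow2\n"

if __name__ == "__main__":
    print(check_workaround(GOOD_CONF))
    print(check_workaround(BAD_CONF))
```

The checker only covers the local.conf part; removing the cached images under /opt/stack/octavia/diskimage-create/ and restacking DevStack still have to be done by hand, as described in the conversation.
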
dulekltomasbo: Question is - what do we do? :P12:20
ltomasbolet's move the default to centos (at least until ubuntu image is fixed)12:22
ltomasboI think we used to have it on centos until it broke and we moved it to ubuntu12:22
dulekltomasbo: Hah, nice. :D12:24
ltomasbo:)12:24
dulekltomasbo: Okay, so let's see if my commit will work. Current centos tarball is broken, so it won't work with predownloaded amphora until tomorrow morning.12:24
ltomasbodulek, great! thanks for unlocking the gates!!!12:26
*** janki has quit IRC12:38
*** gaoyan has joined #openstack-kuryr13:00
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Switch to CentOS Amphora  https://review.openstack.org/64958213:29
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Restore using infra build of amphora  https://review.openstack.org/64961413:29
dulekltomasbo, dmellado: Okay, my test commit worked in the gate so here is merge-worthy version of it. ^13:30
dulekThe second patch will most likely fail and is to be merged tomorrow, when nightly build includes the fix by cgoncalves.13:31
*** shachar has joined #openstack-kuryr13:53
*** oanson has quit IRC13:55
*** snapiri has quit IRC13:56
openstackgerritMaysa de Macedo Souza proposed openstack/kuryr-kubernetes master: Fix LBaaS SG rules update  https://review.openstack.org/64963614:44
openstackgerritMaysa de Macedo Souza proposed openstack/kuryr-kubernetes master: Fix LBaaS SG rules update  https://review.openstack.org/64963614:45
*** celebdor has joined #openstack-kuryr14:52
*** gcheresh has quit IRC14:57
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Restore using infra build of amphora  https://review.openstack.org/64961415:06
*** celebdor has quit IRC15:06
*** ccamposr has quit IRC15:17
*** gcheresh has joined #openstack-kuryr15:24
*** gaoyan has quit IRC15:25
dmelladohi dulek, the feverish me is slightly better15:39
dmelladoso what happened, did switching to centos-amp sort anything out?15:39
dulekdmellado: Yes, it seems to be working.15:42
dulekdmellado: So +2 here is advisable: https://review.openstack.org/#/c/649582/ :)15:42
*** gcheresh has quit IRC15:44
openstackgerritMaysa de Macedo Souza proposed openstack/kuryr-kubernetes master: Add support for text ports on Network Policy Spec  https://review.openstack.org/64890515:46
dmelladodulek: cloud-init? damn...15:46
dulekdmellado: That's the only thing that changed between releases and matches.15:48
dulekMaybe libdns, but I doubt it.15:48
dmelladoin any case I wonder how that could affect it, is there any route that is not getting to the amphora?15:48
dulekEither way - there's some regression. My only problem is that until we know exactly what it is, we can't ever guarantee it'll get fixed.15:48
dulekAnd CentOS might get updated package one day too. ;)15:49
dmelladoI'll wait for CI in order to get merged but yeah15:49
dmelladowhat did the octavia folks say?15:49
dulekdmellado: Yeah, exactly. I linked that a while ago - new Ubuntu Amphora doesn't get the default route in amphora-haproxy netns.15:49
dulek<ltomasbo> dulek, anyway, if that was working before and not anymore, they have a regression, right?15:50
dulek<dulek> ltomasbo: Oh come on, you must know how this works? :P15:50
dulek<ltomasbo> xD15:50
dulek<ltomasbo> I know I know...15:50
dulek<dulek> - "You were abusing some bug that's now fixed."; - "Sure, but what bug?!"; - "Dunno"15:50
dmelladowill they take care of this issue on their side?15:51
dulekdmellado: I strongly doubt it, they assume everything is working correctly now.15:58
dulekAnd the fact that we had that default route was just by chance.15:58
dmellado'by chance'15:58
* dmellado sighs15:58
dmelladothis is getting me sick again15:58
dmelladodulek: in any case please open a bug on octavia if you have the details there15:58
dmelladoand let me try handling this with carlos15:59
dulekdmellado: Well, we can sigh or we can do our job. ;) If we pinpoint the root cause, it's quite easy to convince whoever's fault it is to fix the regression.15:59
dulekAt this point of my investigation it isn't Octavia's fault, it's something on a lower layer.15:59
dmelladoif it's related to cloud-init16:02
dmelladowe can fetch larsks16:02
dmelladodo you know him?16:02
*** gkadam has quit IRC16:42
*** rh-jelabarre has quit IRC16:44
*** spsurya has quit IRC16:46
*** rh-jelabarre has joined #openstack-kuryr16:49
*** spsurya has joined #openstack-kuryr17:27
*** gcheresh has joined #openstack-kuryr20:37
*** spsurya has quit IRC20:59
*** gcheresh has quit IRC21:04
*** pcaruana has quit IRC21:06
