Monday, 2019-01-28

*** jistr has quit IRC 01:00
*** jistr has joined #openstack-kuryr 01:01
*** hongbin has joined #openstack-kuryr 01:03
*** hongbin has quit IRC 01:37
*** hongbin has joined #openstack-kuryr 01:56
*** hongbin has quit IRC 02:25
*** hongbin has joined #openstack-kuryr 03:46
*** spsurya has joined #openstack-kuryr 04:28
*** hongbin has quit IRC 04:55
*** gkadam has joined #openstack-kuryr 07:22
*** maysams has joined #openstack-kuryr 07:24
*** gcheresh has joined #openstack-kuryr 07:53
*** pcaruana has joined #openstack-kuryr 08:01
<dulek> ltomasbo: I have another hit of that SG issue when subnet namespace driver is enabled: http://logs.openstack.org/99/632999/3/check/kuryr-kubernetes-tempest-daemon-openshift-octavia/b0b6707/controller/logs/screen-kuryr-kubernetes.txt.gz 08:05
<ltomasbo> dulek, checking 08:13
<dulek> ltomasbo: Thanks! I'll be available for a while from the airport, my flight's delayed. 08:14
<dulek> ltomasbo: Are you on a train? ;) 08:14
<ltomasbo> dulek, yep! 08:15
<ltomasbo> dulek, I just switched trains, heading to Barcelona in 15 mins or so 08:16
<dulek> ltomasbo: Regarding the issue - to me it seems like some race condition. You can look up the missing SG IDs in q-svc logs. 08:16
<ltomasbo> dulek, umm, strange 08:17
<dulek> ltomasbo: That's why I wasn't able to debug it myself. :) 08:18
<ltomasbo> dulek, it happens on openshift gates, right? so related to namespace isolation probably 08:18
<dulek> ltomasbo: Yes, yes, I'm pretty sure it's due to namespace subnet driver. 08:18
<ltomasbo> dulek, I'm wondering if there was some issue (neutron timing issue) and then the rollback is not fully working... leading to a broken env... 08:19
<dulek> ltomasbo: I thought about that SG getting removed in rollback, but if I remember correctly there was no SG deletion on q-svc. Let me double check. 08:20
<ltomasbo> ahh, wait 08:21
<ltomasbo> dulek, I see there are 2 subsequent calls 08:21
<ltomasbo> create namespace, create security group rule 08:21
<dulek> Oh, okay, I only see the SG rule creation and that's failing. 08:21
<ltomasbo> and the error seems to come from the second one 08:21
<ltomasbo> creating the security group rule, while the create_security_group seems to not have finished 08:22
<dulek> ltomasbo: Yeah, from creation of SG rule. 08:22
<ltomasbo> perhaps some race on the neutron side? 08:22
<dulek> ltomasbo: Ooooh. It's 201 that's returned from SG creation. So it's only ACCEPTED. 08:22
* dulek checks if Neutron changed something there recently. 08:23
<ltomasbo> dulek, yep, but I think it is only accepted because they cannot ensure it is applied on the hypervisors 08:23
<ltomasbo> dulek, but it should be created on the database at least! 08:23
<ltomasbo> it will be really dumb to have to poll in there... 08:23
<dulek> ltomasbo: Yup, I agree here! 08:23
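
For reference on the status codes mentioned above: in standard HTTP semantics 201 means Created, while Accepted is 202. A minimal check, using only Python's standard library and nothing Neutron-specific:

```python
from http import HTTPStatus

# Look up the reason phrases for the two status codes in question.
created = HTTPStatus(201)
accepted = HTTPStatus(202)

print(created.value, created.phrase)
print(accepted.value, accepted.phrase)
```

By those semantics, a 201 on security group creation says the resource was created rather than merely queued, which fits the expectation that the SG should at least exist in the database.
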
<ltomasbo> perhaps we can add a retry on NotFound exception for the second call... 08:24
<ltomasbo> dulek, ^^ 08:24
<ltomasbo> to be on the safe side... 08:24
<dulek> ltomasbo: That would work, but I think we both find it nasty? :D 08:24
<ltomasbo> yes yes! I don't think that should be the way 08:25
<ltomasbo> that should be ensured on the neutron side 08:25
<ltomasbo> it will be just 'defensive' programming :/ 08:25
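
The retry-on-NotFound idea floated above could look roughly like the sketch below. It is only an illustration: `NotFound`, `retry_on_not_found`, and `create_rule` are hypothetical names, not kuryr-kubernetes or Neutron client APIs, and the failing call is simulated.

```python
import time


class NotFound(Exception):
    """Hypothetical stand-in for the client's 'resource not found' error."""


def retry_on_not_found(func, attempts=3, delay=0.1):
    """Call func(), retrying while it raises NotFound.

    Re-raises the last NotFound once the attempts are used up.
    """
    for attempt in range(1, attempts + 1):
        try:
            return func()
        except NotFound:
            if attempt == attempts:
                raise  # out of attempts: propagate the original error
            time.sleep(delay)


# Simulated rule creation that fails until the SG becomes visible.
calls = {"count": 0}


def create_rule():
    calls["count"] += 1
    if calls["count"] < 3:
        raise NotFound("security group not found yet")
    return {"id": "rule-1"}


result = retry_on_not_found(create_rule, attempts=5, delay=0)
print(result, calls["count"])
```

As noted in the discussion, this only papers over the race on the caller's side; it does not address why the freshly created SG is not found.
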
<dulek> ltomasbo: https://review.openstack.org/#/c/628691/ - might be related? 08:25
<dulek> ltomasbo: Got merged 4 days ago. That should be our cause. 08:26
<ltomasbo> dulek, yep, makes sense 08:26
<ltomasbo> should we ping our neutron folks? 08:27
<openstackgerrit> Danil Golov proposed openstack/kuryr-kubernetes master: Fix a misprint in SR-IOV binding driver  https://review.openstack.org/633453 08:28
<dulek> ltomasbo: Well, there are 2 RH folks that accepted it. :D 08:29
<dulek> ltomasbo: I'll ping slaweq, he'll be willing to help me. :) 08:29
<ltomasbo> dulek, great! thanks! 08:29
<ltomasbo> dulek, btw, if you are bored waiting at the airport... https://review.openstack.org/#/c/631587/ 08:31
<dulek> ltomasbo: Sure, in a moment. 08:32
<ltomasbo> dulek, no hurry! 08:32
<maysams> dulek: I am having the same issue. Did you try with the default driver? 08:45
<maysams> dulek, ltomasbo: I just tried and the problem remains 08:46
<ltomasbo> maysams, default as in without namespace nor network policy? 08:47
<dulek> maysams: With the default SG driver? 08:47
<maysams> dulek, ltomasbo: I was trying to create a NP and it was not able to create the sg 08:47
<ltomasbo> maysams, I assume it will happen every time we create SGs + SG rules 08:47
<maysams> dulek: yes 08:47
<dulek> ltomasbo: +1 08:47
<maysams> yup, I think so 08:47
<dulek> maysams, ltomasbo: So slaweq told me that nobody else is complaining. 08:47
<ltomasbo> maysams, ahh, then, when creating a np, the action is creating a sg + sg_rules, so it will happen too 08:48
<maysams> I saw that you guys thought it was only related to namespace subnet driver 08:48
<ltomasbo> dulek, if people are creating them manually perhaps there is no problem... 08:48
<dulek> And create-SG-then-rules is a pretty common pattern, so either we do it differently or everyone's broken. 08:48
<maysams> so, I thought it may be good to point out that this happens with default as well 08:48
<dulek> ltomasbo: Don't we create SG and rules in the DevStack plugin as well? 08:48
<ltomasbo> dulek, I would be amazed if neutron folks don't have a gate creating security group and then rules on top of it... 08:49
<ltomasbo> dulek, perhaps our tests are creating a few more in parallel than they do 08:49
<ltomasbo> dulek, because we will create a few when the kuryr-controller is started and handles all the base openshift namespaces 08:50
<maysams> will be heading to the office see you guys later 08:50
<ltomasbo> dulek, so, probably other people are not triggering that many sg+sg_rules creations concurrently as we do 08:50
<dulek> ltomasbo: Good point. Are you able to point me to the code in namespace subnet driver that the error comes from? 08:50
<ltomasbo> dulek, sure! 08:50
<ltomasbo> one sec 08:50
<dulek> ltomasbo: You can probably tell that my "don't do `raise ex`" patch is due to traceback being lost on exceptions in those logs. :P 08:51
<ltomasbo> dulek, https://github.com/openstack/kuryr-kubernetes/blob/master/kuryr_kubernetes/controller/drivers/namespace_security_groups.py#L99-L121 08:51
<ltomasbo> dulek, yep, your patch will be helpful! xD 08:52
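
To illustrate the "don't do `raise ex`" point: a bare `raise` inside an `except` block re-raises the active exception as-is, so the frame where the error originated stays in the traceback. On Python 2, `raise ex` restarted the traceback at the re-raise line. The sketch below is illustrative only; `fail` and `log_and_reraise` are made-up names, not code from the patch.

```python
import traceback


def fail():
    raise ValueError("security group not found")


def log_and_reraise():
    try:
        fail()
    except ValueError:
        print("logging the error")
        raise  # bare raise: re-raises the active exception unchanged


try:
    log_and_reraise()
except ValueError as exc:
    # Collect the function names recorded in the traceback, outermost first.
    frames = [f.name for f in traceback.extract_tb(exc.__traceback__)]
    print(frames)
```

The innermost recorded frame is `fail`, which is the information needed when reading the controller logs.
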
*** maysams has quit IRC 08:55
<dulek> Ah, and just in case everyone - due to issue discussed above merges will fail, so you can abstain from rechecking until we figure it out. 09:02
<dulek> ltomasbo: Okay, I have a hypothesis. 09:08
<dulek> ltomasbo:             self._check_security_group(context, remote_group_id, 09:08
<dulek>                                       project_id=rule['tenant_id']) 09:08
*** ccamposr has joined #openstack-kuryr 09:08
<dulek> ltomasbo: That is probably failing. That tenant_id is probably None as we don't specify it. 09:09
*** maysams has joined #openstack-kuryr 09:09
*** maysams has joined #openstack-kuryr 09:12
<ltomasbo> dulek, tenant_id? 09:12
<ltomasbo> dulek, wasn't that deprecated in favor of project_id? 09:13
<dulek> ltomasbo: Whatever, it's DB code, Neutron still names the field tenant_id internally. 09:13
<dulek> ltomasbo: Doesn't matter too much. But I'm pretty sure it fails because SG in DB has tenant_id and that code doesn't fill it automatically with context. 09:14
<dulek> ltomasbo: I'm trying to confirm the latter. 09:14
<ltomasbo> dulek, I see part of the neutron ps you linked is actually moving from tenant_id to project_id 09:15
<dulek> ltomasbo: Uh, oh? 09:15
<dulek> ltomasbo: Oh crap, it does. xD 09:16
<dulek> ltomasbo: Good thinking! 09:16
* dulek checks DB schemas. 09:16
<ltomasbo> dulek, I'm wondering if the rule should have project_id too instead of tenant_id... 09:16
<dulek> ltomasbo: Uh, they define it as synonym in DB model. Now it's SQLAlchemy magic… 09:18
<ltomasbo> ufff... 09:18
<openstackgerrit> Merged openstack/kuryr-kubernetes master: devstack: Create LB objects only if Octavia is enabled  https://review.openstack.org/632999 09:30
<openstackgerrit> Michał Dulko proposed openstack/kuryr-kubernetes master: DNM, just testing now  https://review.openstack.org/633461 09:31
<dulek> Hm, a patch merged? 09:32
<dulek> I don't understand a thing now. :D 09:32
<dulek> Oh, OpenShift's non-voting? 09:32
<dulek> Okay, gotta board my flight! 09:32
*** garyloug has joined #openstack-kuryr 09:40
*** garyloug has quit IRC 09:41
*** mrostecki has quit IRC 10:02
<openstackgerrit> Merged openstack/kuryr-kubernetes master: Ensure host to pod connectivity for NP  https://review.openstack.org/632503 10:04
*** mrostecki has joined #openstack-kuryr 10:08
*** gcheresh has quit IRC 11:20
<openstackgerrit> Merged openstack/kuryr-kubernetes master: Avoid doing `raise ex` when only logging  https://review.openstack.org/633034 11:30
*** aperevalov has joined #openstack-kuryr 11:55
*** pcaruana has quit IRC 12:36
*** pcaruana has joined #openstack-kuryr 12:37
*** danil has joined #openstack-kuryr 12:43
*** rh-jelabarre has joined #openstack-kuryr 12:48
*** pcaruana has quit IRC 13:32
*** gcheresh has joined #openstack-kuryr 13:37
*** pcaruana has joined #openstack-kuryr 13:50
<openstackgerrit> Merged openstack/kuryr-kubernetes master: Ensure lb sg rules are deleted when no longer allowed  https://review.openstack.org/631587 14:35
*** zul has joined #openstack-kuryr 15:05
<openstackgerrit> Merged openstack/kuryr-kubernetes master: Ensure NP changes are applied to services  https://review.openstack.org/629856 15:19
<openstackgerrit> Luis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Add gate for Octavia provider OVN  https://review.openstack.org/604036 15:27
*** openstackgerrit has quit IRC 15:51
*** gkadam has quit IRC 16:00
*** openstackgerrit has joined #openstack-kuryr 16:40
<openstackgerrit> Paul Belanger proposed openstack/kuryr-kubernetes master: Remove non-voting job from gate  https://review.openstack.org/633551 16:40
*** pcaruana has quit IRC 16:47
*** gcheresh has quit IRC 17:00
*** dims has quit IRC 17:08
<openstackgerrit> Merged openstack/kuryr-tempest-plugin master: Service cleanup should be optional  https://review.openstack.org/631459 17:17
<dulek> maysams: config.CONF.neutron_defaults.project 17:27
*** maysams has quit IRC 17:39
*** ccamposr has quit IRC 17:51
*** aperevalov has quit IRC 19:13
*** spsurya has quit IRC 19:14
*** dims has joined #openstack-kuryr 19:15
*** aojea has joined #openstack-kuryr 19:59
*** yboaron has quit IRC 20:09
*** yboaron has joined #openstack-kuryr 20:09
*** aojea has quit IRC 20:12
*** yboaron has quit IRC 20:15
*** aojea has joined #openstack-kuryr 20:20
*** aojea has quit IRC 20:20
*** aojea has joined #openstack-kuryr 20:20
*** aojea has quit IRC 22:03
*** premsankar has joined #openstack-kuryr 22:08
*** aojea has joined #openstack-kuryr 22:20
*** aojea has quit IRC 22:24
*** aojea has joined #openstack-kuryr 22:25
*** aojea has quit IRC 22:29
