Thursday, 2020-11-19

*** armax has quit IRC00:05
*** yamamoto has joined #openvswitch00:12
*** armax has joined #openvswitch00:16
*** yamamoto has quit IRC00:23
*** yamamoto has joined #openvswitch00:39
*** aconole has quit IRC00:44
*** zhouhan_ has quit IRC00:47
*** zhouhan has joined #openvswitch00:47
*** yamamoto has quit IRC00:47
*** zhouhan_ has joined #openvswitch00:48
*** zhouhan has quit IRC00:52
*** dmellado has quit IRC01:03
*** dmellado has joined #openvswitch01:04
*** spatel has joined #openvswitch01:19
*** spatel has quit IRC01:23
*** yamamoto has joined #openvswitch01:27
*** spatel has joined #openvswitch01:33
*** spatel has quit IRC01:36
*** yamamoto has quit IRC01:38
*** spatel has joined #openvswitch01:50
*** yamamoto has joined #openvswitch01:50
*** tamas_erdei has joined #openvswitch02:14
*** terdei has quit IRC02:16
*** rcernin has quit IRC02:22
*** rcernin has joined #openvswitch02:26
*** jobewan has joined #openvswitch02:29
*** zhouhan has joined #openvswitch02:31
*** zhouhan_ has quit IRC02:31
*** acidfoo_ has quit IRC02:45
*** armax has quit IRC03:10
*** spatel has quit IRC04:36
*** jaicaa has quit IRC04:56
*** jaicaa has joined #openvswitch04:56
*** mmichelson_ has joined #openvswitch05:04
*** mmichelson has quit IRC05:05
*** yamamoto has quit IRC05:24
*** yamamoto has joined #openvswitch05:32
*** yamamoto has quit IRC05:34
*** JamesBenson has quit IRC06:03
*** JamesBenson has joined #openvswitch06:04
*** JamesBenson has quit IRC06:09
*** yamamoto has joined #openvswitch06:11
*** darkemon has quit IRC06:13
*** darkemon has joined #openvswitch06:14
*** JamesBenson has joined #openvswitch06:15
*** yamamoto has quit IRC06:20
*** donhw has quit IRC06:24
*** dholler has joined #openvswitch06:29
*** jaicaa has quit IRC06:37
*** jaicaa has joined #openvswitch06:39
*** yamamoto has joined #openvswitch06:51
*** yamamoto has quit IRC07:00
*** slaweq has joined #openvswitch07:01
*** yamamoto has joined #openvswitch07:30
*** yamamoto has quit IRC07:38
*** ralonsoh has joined #openvswitch07:43
*** yamamoto has joined #openvswitch08:01
*** eelco has joined #openvswitch08:04
*** yamamoto has quit IRC08:05
*** mdgray has joined #openvswitch08:22
*** rcernin has quit IRC08:37
*** yamamoto has joined #openvswitch09:16
*** yamamoto has quit IRC09:23
*** darkemon has quit IRC09:34
*** darkemon has joined #openvswitch09:34
*** yamamoto has joined #openvswitch10:03
*** darkemon has quit IRC10:09
*** darkemon has joined #openvswitch10:11
*** yamamoto has quit IRC10:16
*** yamamoto has joined #openvswitch10:47
*** yamamoto has quit IRC10:57
*** yamamoto has joined #openvswitch10:57
*** thaller has quit IRC12:13
*** thaller has joined #openvswitch12:14
*** camelCaser has quit IRC12:15
*** thaller has quit IRC12:24
*** thaller has joined #openvswitch12:24
*** thaller has quit IRC12:30
*** thaller has joined #openvswitch12:31
*** acidfoo_ has joined #openvswitch12:40
*** thaller has quit IRC12:51
*** thaller has joined #openvswitch12:52
*** spatel has joined #openvswitch13:50
*** spatel has quit IRC13:50
*** bostondriver has joined #openvswitch13:55
*** spatel has joined #openvswitch13:57
*** liuyulong has joined #openvswitch14:00
*** tbachman has joined #openvswitch14:30
*** liuyulong has quit IRC14:33
*** liuyulong has joined #openvswitch14:33
*** liuyulong has quit IRC15:04
*** armax has joined #openvswitch15:44
*** eelco has quit IRC16:35
*** acidfoo_ has quit IRC16:44
*** acidfoo_ has joined #openvswitch16:46
*** zhouhan has quit IRC16:47
*** zhouhan has joined #openvswitch16:47
*** dceara has joined #openvswitch16:57
*** dholler has quit IRC17:08
*** elvira has quit IRC17:19
*** ralonsoh_ has joined #openvswitch17:27
*** ralonsoh has quit IRC17:28
*** JamesBenson has quit IRC17:33
*** rnurgaliyev has joined #openvswitch17:59
*** blp has joined #openvswitch18:02
blpGood morning, I'm early for the meeting today because otherwise I'll forget: I'm in a parallel video chat meeting.18:03
*** mmichelson_ has quit IRC18:13
*** mmichelson has joined #openvswitch18:13
blpmmichelson: hi18:13
mmichelsonblp, hi18:13
numansHello18:14
mmichelsonI'll go ahead and start the meeting now. It's slightly early but eh, it's fine I guess.18:14
_lore_hi all18:14
blpstarting the meeting really just means starting the logging, which seems fine.18:15
mmichelson#startmeeting ovn_community_development_discussion18:15
openstackMeeting started Thu Nov 19 18:15:21 2020 UTC and is due to finish in 60 minutes.  The chair is mmichelson. Information about MeetBot at http://wiki.debian.org/MeetBot.18:15
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.18:15
openstackThe meeting name has been set to 'ovn_community_development_discussion'18:15
dcearahi18:16
mmichelsonHopefully y'all saw my email about delaying branching until 4 December.18:16
blphi dceara18:16
mmichelsonAnd it turns out this is pretty timely. We've been seeing some test failures based on some recent commits.18:17
mmichelsonI'm personally working on fixing a test failure I caused with the SNAT CT zone selection patch.18:17
numansyeah, there are few tests which fail for me when run in parallel18:17
blpI usually run with RECHECK=yes, which helps.18:18
mmichelsonIn short, the test is failing because of the change to no longer associate LOG_TO_PHY flows with a port binding18:18
blpBut it's better to make the tests reliable.18:18
mmichelsonThe result is that a gateway router that is bound to one chassis (hv3) still has flows installed on another chassis sometimes (hv1) and the test fails because it explicitly tries to ensure that hv1 is not seeing the traffic that should be handled by the gateway chassis18:19
mmichelsonI have a plan for how to fix it. The problem is it's making the code really ugly.18:19
blpmmichelson: Why isn't this a matter of making sure that all the chasssis are caught up with nb?18:20
mmichelsonIn addition to the LOG_TO_PHY flows though, I found that even without my patch, there are gateway router flows on the non-gateway chassis, which seems wrong. It's like the gateway router is being set as a local datapath even though it shouldn't be.18:20
numansmmichelson, does having these flows in another chassis a problem ?18:20
mmichelsonblp, they are, but the incremental processing logic in ovn-controller got updated due to my CT zone patch. And the logic can cause a failure18:20
numansif not is it ok to change the test case ?18:20
blpmmichelson: incremental processing is so difficult18:20
mmichelsonnumans, Well, if ALL gateway router flows were removed from the non-gateway chassis then the problem I'm seeing wouldn't happen.18:21
mmichelsonblp, yes it is18:21
mmichelsonnumans, the test case is valid18:21
mmichelsonnumans, it's perfectly reasonable for someone to set up the same scenario and expect it to work properly18:22
numansmmichelson, ok. I'm just thinking is it ok to have those flows vs making the code more ugly to fix it :)18:22
numansBut if its a wrong behavior, sure we should fix it.18:22
mmichelsonnumans, so my proposed fix is some ugly code that makes it so that when CT zones are changed, we need to iterate over all port bindings and find if the datapath that those port bindings belong to was updated, then erase all physical flows having to do with both those bindings and their peers18:22
mmichelsonnumans, but another possible fix might be to detect the gateway router as not being local and remove all its flows. That might be slightly less ugly.18:23
mmichelsonmaybe18:23
numansok18:23
mmichelsonAnyways, getting the CT zone fix in and getting this test fixed has taken up way too much of my time this week. I haven't had proper time to do reviews, which is what I wanted to spend the majority of the week doing18:24
blpmmichelson: anything else?18:25
mmichelsonAt this moment, I've read through imaximets' dp groups patch series, and on the surface it seems good. I haven't put an ack on the series yet because I wanted to do some testing of my own18:25
mmichelsonI think that's all from me.18:25
blpmay i go next?18:25
numanssure18:26
blpnumans: Thanks for the test script. I'll try it. My other priority for v8 is to try to use ovsdb-idl, or important parts of it.18:26
blpnumans: Also, I posted a tentative schedule for ovscon: https://ovscon.site/18:26
numanswelcome18:26
*** zhouhan_ has joined #openvswitch18:26
blpI tried to make the page amenable to adjusting to anyone's local time zone. If there are bugs in that, please let me know.18:27
numansthat's cool.18:27
numansI'll check it out.18:27
blpI didn't put in a time zone selector because I'm really confused about DST in the southern hemisphere (especially NZ and AU)18:27
blpso you just have to type your GMT offset.18:27
blpDoes anyone want to comment on my ddlog patches? That's about all I've got for today.18:28
numansno additional comments from me other than what I reported earlier today.18:28
blpYes, thanks again for those numans18:29
numansthanks for fixing the probe intervals.18:29
numanswelcome18:29
blpThere was more going wrong with probe intervals than I thought! I did fix them though.18:29
blpOK then.18:29
blpWho's next?18:29
numansI can go next18:30
numansI worked on new version of load balancer hairpin patches.18:30
*** zhouhan has quit IRC18:30
numansthanks dceara for the review and the suggestions.18:30
*** zhouhan_ has quit IRC18:30
numansmmichelson, If you could take a look at the hairpin patches that would be great. dceara has acked patches 3-7.18:30
*** zhouhan has joined #openvswitch18:31
mmichelsonnumans, will do18:31
numansI submitted a patch to pin ovn-controller to specific version of ovn-northd to minimize downtime during upgrades.18:31
numansApart from that I did code some reviews.18:31
numansI'm reviewing Anton's parallel patches and testing them out. Reading and learning the ovs's hmap implementation on the go.18:32
numansThat's it from me.18:32
_lore_can I go next?18:32
_lore_this week I worked on bfd implementation in ovn, the solution is stable now18:33
_lore_I discussed with openshift folks about the scale18:33
_lore_they are targetting ~30 connections for gw node18:33
_lore_I tested it at this scale and the solution seems to work18:34
blpnumans: If you have any questions about hmap, that's all me.18:34
_lore_I think we should test it in a real deployment18:34
_lore_I will post a rfc v2 soon18:34
_lore_the I addressed dceara's comments on my raft debug patch, posted v318:35
_lore_and I added the capability to log commands on ovn-scale-test18:35
_lore_that's all from me18:35
numansblp, sure. I'll ask you. Thanks.18:36
numansIn case you've bandwidth to review  - https://patchwork.ozlabs.org/project/ovn/list/?series=21271018:36
numanscertainly I'm not qualified much in that area to do a proper review and give comments.18:36
blpThere's already cmap for highly parallel access to a hash table, is there a discussion of why another data structure?18:37
numansI don't think we had any discussion about it in the mail thread.18:38
numansI'll check out cmap too during the review.18:39
numansthanks.18:39
mmichelsonI have to admit I didn't know about cmap until you just mentioned it18:39
mmichelsonYeah, it's not used anywhere in OVN. That's probably why18:40
zhouhanmmichelson: what's the plan after ddlog is merged? Do we update both C northd and ddlog northd for same feature changes?18:41
blpThat's the hope.18:41
mmichelsonzhouhan, yes18:41
zhouhanWhen ddlog is proved stable/production ready we can then abandon C version?18:42
blpThat's *my* hope, at least.18:42
mmichelsonzhouhan, yes, though determining stability and production readiness is going to be tough.18:43
zhouhanWould ddlog benefit from the parallel northd change?18:43
blpI haven't read the parallel northd patches. How do they work?18:43
numansI don't think they would.18:43
zhouhanOr I guess we wouldn't need parallel processing for ddlog, right? (since there won't be any recompute at all?)18:44
imaximetsblp, he needs a separate hmap implementation since he wants to slice data, i.e. process slices of hmap in diferent threads.18:44
mmichelsonzhouhan, right, parallelization isn't as useful with incremental processing. It could be useful for full recomputes I guess?18:44
blpddlog has built-in support for multithreading, although we haven't enabled it. It is not clear whether it would be beneficial in this case.18:45
numansfrom what I understand till now, it builds the flows in parallel by creating a thread for each datapath hmap bucket18:45
mmichelsonblp, the implementation divides an hmap up between threads. So thread 1 processes items 0, 0+N, 0+2N, etc. Thread 2 processes items 1, 1+N, 1+2N, etc.18:45
mmichelsonWhere N is the number of threads.18:45
blpOK.18:46
mmichelsonAnd then the results are merged together. That's the general idea18:46
blpimaximets: cmap can do that, I think, although the details are important.18:47
imaximetsblp, cmap is mostly about working on the same data from different threads, but he wants to work on different data and wants this data to have no intersections.18:48
imaximetsblp, it's just a matter of API extension, I think.18:49
blpOK.18:49
imaximetsto have access to some internals.18:49
mmichelsonOk, I've lost track of who's turn it is. Who'd like to go next?18:50
dcearaI can go next if that's ok.18:51
blpdceara: i think so18:51
dcearablp: Thanks for the discussion on the nb_cfg ovn-controller patch and for the conditional ack.  I'll wait a bit though before sending a v3 to think more about potential other solutions.18:51
dcearablp: This also means that the patch in your ddlog series that removes "sleep" in most tests might have to wait.  I'd say it's better to make tests (more) reliable before merging that one.18:52
blpdceara: OK.18:53
dcearablp: Except for that I've been doing some reviews here and there and quite a lot of debugging.18:53
dcearablp: That's it from me I guess, thanks.18:53
*** ralonsoh_ has quit IRC18:53
mmichelsonOK, anybody else?18:54
imaximetsI have a quick update.18:54
imaximetsLast week I prepared a first version of "logical datapath group" implementation.  This is to reduce number of lflows by combining them.18:55
imaximetsFirst results are good.  On my test it reduced size of db in 6 times.18:56
imaximetsHowever, there are few issues uncovered during unit testing.18:56
imaximetsovsdb-idl seems to not handle row deletions correctly causing memory leaks and use-after-free conditions.18:56
blpI wonder if that got introduced in the last year or two. It was correct once.18:57
blpI wrote pretty thorough tests for it, I thought.18:57
imaximetswe debugged that a lot with dceara and seems to have one solution, but will test it further.18:57
blpThe mapping from uuids to pointers in the idl code is complex.18:58
blpI don't like that code, but it makes using the database from C bearable.18:58
imaximetsblp, it's hard to say. SB db has a lot of dependencies between rows and my patch-set seems to add some more.18:58
imaximetsone more issue is that for some reason ovn-controller operates with freed port_binding rows.18:59
blpIf you find a general problem in the idl code, then it might be wise to add a new test to ovs to find it.18:59
blpThere are idl tests there for row dependencies already, so it might be easy to add another.19:00
imaximetsblp, yep.  that's a good point.19:00
blp(I liked building the Python IDL better, it does not build row dependency graphs.)19:00
imaximetsovn-controller issue could be workarounded by receiving all updates on port_binding table, but that is not a good solution.  Need to figure out what happens when conditional monitoring works as needed.19:01
zhouhanimaximets: 6 times sounds great. How many datapaths and lflows-per-DP in this testing?19:02
blpTo get the ddlog code to use the idl code, instead of its ad hoc code, I might end up refactoring the idl into a couple of layers. That might make it easier to understand.19:02
imaximetsblp: we have some feature-set difference between C and python idl, unfortunately.19:02
imaximetszhouhan, it was 1.1 million flows in total.  reduced to 170 thousands.19:03
imaximetszhouhan, 100 logical switches in this test.19:03
zhouhanimaximets: Great! thx19:03
imaximetsblp, layers might be good to have.  Dependency tracking is hard to understand in current idl.19:04
blpimaximets: We'll see. I haven't read enough code yet to figure it out.19:05
imaximetsSo, once we'll figure out what is going on with idl, I'll prepare v2 with fixes and some performance optimizations for the code.19:05
imaximetsthat's it form my side.19:05
mmichelsonAnyone else?19:06
mmichelsonOK, I guess that's all then. Thanks everyone19:08
mmichelson#endmeeting19:09
openstackMeeting ended Thu Nov 19 19:09:00 2020 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:09
openstackMinutes:        http://eavesdrop.openstack.org/meetings/ovn_community_development_discussion/2020/ovn_community_development_discussion.2020-11-19-18.15.html19:09
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/ovn_community_development_discussion/2020/ovn_community_development_discussion.2020-11-19-18.15.txt19:09
openstackLog:            http://eavesdrop.openstack.org/meetings/ovn_community_development_discussion/2020/ovn_community_development_discussion.2020-11-19-18.15.log.html19:09
blpThanks everyone!19:09
*** blp has quit IRC19:09
imaximetsBye!19:09
dcearaBye!19:09
*** dceara has quit IRC19:10
*** rnurgaliyev has quit IRC19:10
*** dcbw has quit IRC20:05
*** dcbw has joined #openvswitch20:10
*** rcernin has joined #openvswitch20:33
*** rcernin has quit IRC20:37
*** rcernin has joined #openvswitch21:02
*** mdgray has quit IRC21:33
*** dcbw has quit IRC22:15
*** spatel has quit IRC22:16
*** rcernin has quit IRC23:35
*** rcernin has joined #openvswitch23:35
*** bostondriver has quit IRC23:46

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!