kota | looking at https://wiki.openstack.org/wiki/Meetings/Swift . it's still pointing at 3rd Aug | 12:02 |
---|---|---|
timburke | 😬 sorry -- i really need to update that! | 17:28 |
kota | good morning | 20:59 |
timburke | sorry (again) -- my lunch ran long | 21:09 |
timburke | #startmeeting swift | 21:09 |
opendevmeet | Meeting started Wed Aug 31 21:09:38 2022 UTC and is due to finish in 60 minutes. The chair is timburke. Information about MeetBot at http://wiki.debian.org/MeetBot. | 21:09 |
opendevmeet | Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. | 21:09 |
opendevmeet | The meeting name has been set to 'swift' | 21:09 |
timburke | who's here for the swift meeting? | 21:09 |
kota | o/ | 21:09 |
mattoliver | o/ | 21:10 |
timburke | just want to go through a few patches this week, first up | 21:11 |
timburke | #topic get info memcache skipping | 21:11 |
timburke | mattoliver, thanks again for pushing up some fixes to https://review.opendev.org/c/openstack/swift/+/850954 | 21:11 |
mattoliver | Nps, I like it :) | 21:12 |
timburke | the idea there is to randomly add a memcache miss to get_account_info and get_container_info calls, similar to what we're already doing for shard ranges | 21:13 |
timburke | the updates looked good to me, too -- do we want to go ahead and merge it, or wait until we've run it in prod for a bit first? | 21:14 |
mattoliver | I guess it doesn't hurt to wait a week. But on the other hand it is disabled by default. | 21:15 |
timburke | we can wait -- i'll make sure there's a ticket for our ops to try it out post-upgrade, and plan on letting y'all know how it goes next week | 21:17 |
mattoliver | Seeing as we are carrying in prod from this week, maybe we take advantage of that and see if it works :) | 21:17 |
mattoliver | Kk | 21:17 |
timburke | next up | 21:17 |
timburke | #topic object metadata validation | 21:17 |
timburke | one of our recent hires at nvidia took a look at a bug we were seeing where we had a healthcheck that talked directly to object-servers to verify that we can PUT/GET/DELETE on every disk in the cluster | 21:19 |
timburke | unfortunately, the healthcheck would write the bare minimum to get a 201, resulting in the reconstructor blowing up if the DELETE didn't go through (or if there was a race) | 21:20 |
timburke | end result was a patch i'm liking | 21:20 |
timburke | #link https://review.opendev.org/c/openstack/swift/+/853321 | 21:20 |
timburke | though i kind of want to go a little farther and add some sanity checks for replicated policies, too, as well as using the new validation hook in the auditor | 21:21 |
timburke | #link https://review.opendev.org/c/openstack/swift/+/855296 | 21:22 |
zaitcev | Interesting. | 21:22 |
timburke | (fwiw, the specific bug we were seeing stemmed from us not including a X-Object-Sysmeta-Ec-Etag header in the PUT part of the healthcheck -- we'd include frag index, but not the client-facing etag) | 21:23 |
timburke | just wanted to call attention to them -- i don't think there's much discussion that needs to happen around them (except maybe thinking of further checks we'd like to make) | 21:25 |
timburke | next up | 21:26 |
timburke | #topic ring v2 | 21:26 |
timburke | #link https://review.opendev.org/c/openstack/swift/+/834261 | 21:27 |
timburke | i've had a chance to play more with the patch -- found a few more rough edges, but the core of it still seems solid | 21:27 |
timburke | i also started putting together some benchmarking using some pretty big rings from prod (20k+ devices, part power like 20 or something like that -- 5MB or so in size) | 21:29 |
mattoliver | K, I haven't looked at it for q few weeks, so I'll revisit it today. | 21:29 |
mattoliver | Oh good idea | 21:29 |
timburke | long and short of it is that v1 and v2 seem to be within a few percentage points of each other, which doesn't seem too surprising given that the formats are largely related | 21:30 |
timburke | interestingly, i haven't seen the performance improvement i was expecting in going from v0 to v1 -- i remember that seemed to be the driving force during my format-history research | 21:31 |
timburke | but then, i also discovered that we didn't specify the pickle protocol version when serializing v0 rings in the new patch, so maybe there were some protocol-level improvements | 21:32 |
timburke | i'll dig into it a little more | 21:32 |
mattoliver | Nice work. Yeah performance testing is kinda what we need on this. How well does it work. | 21:33 |
timburke | https://bugs.launchpad.net/swift/+bug/1031954 for the old performance bug | 21:33 |
mattoliver | And memory performance.. ie we can load only what we want out of the format too. | 21:34 |
timburke | for sure -- though it becomes more noticeable if/when we merge the ring and builder files | 21:35 |
mattoliver | Yeah +1 | 21:36 |
mattoliver | But metadata only won't include devices which will help :) | 21:36 |
timburke | #topic sync_batches_per_revert | 21:39 |
timburke | i also did some quasi-benchmarks for this patch -- basically doing an A/B test in my home cluster while rebalancing a few terabytes | 21:39 |
timburke | #link https://review.opendev.org/c/openstack/swift/+/839649 | 21:39 |
mattoliver | Oh another good patch.. I kinda forgot about | 21:39 |
mattoliver | This also come into its own when wanting to make progress on bad disks right? Or am I thinking of another patch? | 21:40 |
timburke | yes -- but i'm definitely seeing it being a good/useful thing in healthy (but fairly full) clusters | 21:41 |
mattoliver | Oh nice | 21:42 |
timburke | so i've got three object nodes -- i drained one completely so i could simulate a large expansion. the other two nodes, i ran with handoffs_first -- and one tried the old behavior of a single big rsync, while the other tried batches of 20 | 21:43 |
timburke | then i watched the disk usage rate while they both rebalanced to the "new" guy. the single rsync per partition would see long periods where disk usage wouldn't move much, with periodic spikes that would go up to like -200, -400, -600MB/s | 21:45 |
timburke | the node that broke it up into batches would fairly consistently be going something like -50MB/s, occasionally jumping up to -100MB/s -- which which matched the rate of rsync transfers *much* better | 21:47 |
timburke | i don't have log aggregation set up at home (yet), but i also got the sense that there were a lot more rsync errors with the single-rsync transfers | 21:48 |
mattoliver | Oh cool interesting. Less of a list to build and compare at both ends before starting, with things that might change? | 21:49 |
mattoliver | Per rsync | 21:49 |
timburke | less that -- i've got a fairly static cluster; now and then some writes, but mostly pretty read heavy. the bigger win was that we didn't need to wait for a whole partition to transfer before we could start deleting | 21:50 |
mattoliver | Oh yeah, that's huge actually. | 21:51 |
timburke | so my proposition is that this would help a cluster-full situation: you might be able to get to 90% full, finally get your expansion in, and not need to wait a day or two for the initially-failed rsyncs to get retried a few times to start bringing down usage on the old disks | 21:53 |
mattoliver | Yeah, +1, that's pretty great | 21:54 |
kota | +1 | 21:54 |
timburke | bonus: the batched node finished its rebalance several hours earlier (again, likely due to the lowered likelihood of rsync errors) | 21:55 |
timburke | all right, sorry -- i took a bunch of time with that. it was kinda fun, though it took a bit :-) | 21:55 |
timburke | #topic open discussion | 21:55 |
timburke | what else should we talk about this week? | 21:56 |
zaitcev | Well, it's been almost a full hour. | 21:56 |
mattoliver | Sounds awesome! Let's get it in | 21:56 |
timburke | all right, i'll call it | 21:58 |
timburke | thank you all for coming, and thank you for working on swift! | 21:58 |
timburke | #endmeeting | 21:58 |
opendevmeet | Meeting ended Wed Aug 31 21:58:45 2022 UTC. Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4) | 21:58 |
opendevmeet | Minutes: https://meetings.opendev.org/meetings/swift/2022/swift.2022-08-31-21.09.html | 21:58 |
opendevmeet | Minutes (text): https://meetings.opendev.org/meetings/swift/2022/swift.2022-08-31-21.09.txt | 21:58 |
opendevmeet | Log: https://meetings.opendev.org/meetings/swift/2022/swift.2022-08-31-21.09.log.html | 21:58 |
Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!