*** tqtran has quit IRC | 00:04 | |
clayg | notmyname: it's a common problem - most people are not in australia | 00:04 |
---|---|---|
mattoliverau | well with Trump, I suspect that to change somewhat, at least with regards to Americans :P | 00:11 |
*** vint_bra has quit IRC | 00:12 | |
*** kei_yama has quit IRC | 00:30 | |
*** kei_yama has joined #openstack-swift | 00:32 | |
notmyname | http://docs.aws.amazon.com/AmazonS3/latest/dev/object-tagging.html | 00:34 |
timburke | notmyname: idk, indexing doesn't sound so "simple" to me :P | 00:48 |
notmyname | well, like most of their API, it's pretty complicated (ie hard to type in curl) | 00:50 |
notmyname | I'm curious about the implementation | 00:50 |
notmyname | is it simply the first bytes that return on a key read, and they filter once they have those? | 00:51 |
notmyname | are they building a per-user index of stuff that they're querying before the read? | 00:51 |
notmyname | the eventual consistency callout implies the second | 00:51 |
openstackgerrit | Clay Gerrard proposed openstack/swift: Remove ring md5 integration check from recon unittests https://review.openstack.org/404500 | 00:51 |
notmyname | or maybe they've got some per-server or per-drive fast cache where they store the tags for the keys on that drive. but it might have some async component, thus giving some possible consistency windows | 00:52 |
notmyname | like our async pendings | 00:52 |
notmyname | implying that extra value in s3 isn't as much clever programming as it is clever hardware selection (+ volume purchasing) allowing them more time budget per request to do interesting things like this | 00:53 |
notmyname | extra value == new exciting features | 00:54 |
openstackgerrit | Matthew Oliver proposed openstack/swift: Quarantine malformed database schema SQLite errors https://review.openstack.org/405031 | 00:54 |
openstackgerrit | Clay Gerrard proposed openstack/swift: Remove ring md5 integration check from recon unittests https://review.openstack.org/404500 | 00:55 |
*** nikivi has joined #openstack-swift | 00:55 | |
notmyname | mattoliverau: thanks for picking up that bug! | 00:55 |
clayg | i hate changes that are only refactoring :'( | 00:56 |
mattoliverau | notmyname: nps. Am debugging and watching containers shard.. and so while I was in the db broker anyway thought I might as well fix it. | 00:57 |
notmyname | perfect :-) | 00:57 |
*** dmorita has quit IRC | 00:58 | |
*** catintheroof has joined #openstack-swift | 01:05 | |
*** kei_yama has quit IRC | 01:05 | |
*** kei_yama has joined #openstack-swift | 01:06 | |
clayg | notmyname: why would you assume they're doing anything different than you would do? something in process and consistency troller to put metadata in an index (yeah probably shared per user) | 01:07 |
clayg | the integration is sweet tho -> That is, you cannot create a policy to grant or deny a user permissions to delete or override an existing object based on its existing tags. | 01:07 |
*** catintheroof has quit IRC | 01:08 | |
notmyname | clayg: well, I'm not exactly sure how *I'd* do it :-) | 01:11 |
notmyname | probably one of those ways | 01:11 |
* mattoliverau is going to lunch | 01:13 | |
mattoliverau | well really Bec and Lucy are at the beach playing in the sand.. so I'm going to take a break and have a beach walk, play with the daughter and find something to eat. be back in about an hour | 01:13 |
mattoliverau | its a hard life :P | 01:13 |
clayg | notmyname: i probably don't understand some nuanced part of what's confusing you - to me it looks like a glossed up and scaled out version of exactly what timur is doing with elastic search (like add in something that *attempts* to do it on ingest i.e. middleware + some api integration with index queries vs. direct access to the indexes) | 01:15 |
clayg | oh.. and they probably use some cool internal Lucene based index that's all geo'd up in their internal business and multi-tenant so they can have separate indexes for each account (or maybe per bucket, and then you make at most 100 queries per account if you need to do cross bucket stuff?) | 01:16 |
*** dmorita has joined #openstack-swift | 01:18 | |
notmyname | yeah, that seems like what timur is doing and what we've talked about overall in the community. it's a reasonable way to do it | 01:18 |
notmyname | but a different way, only reasonable if you have control over the hardware, would be to have something more directly integrated into the actual servers themselves instead of a different external service | 01:19 |
*** nikivi has quit IRC | 01:19 | |
notmyname | like if their DCs have Awesome(tm) networks and/or protocols --meaning new requests aren't expensive -- then putting an NVMe card in each storage server to index the tags on the keys on that server might be simpler | 01:20 |
*** dmorita has quit IRC | 01:21 | |
notmyname | eg if it's not an expensive operation to have an api server make a request to a storage server and check the tags against some policy | 01:21 |
*** dmorita has joined #openstack-swift | 01:21 | |
notmyname | alternatively, i could have just been playing with hardware too much lately, and that's my current hammer ;-) | 01:22 |
*** dmorita has quit IRC | 01:24 | |
*** dmorita has joined #openstack-swift | 01:24 | |
notmyname | oh, and timur's stuff is and will be totally awesome | 01:25 |
clayg | notmyname: but the tags are an aggregation - you can split that up across every storage server in s3? well maybe i don't know the science behind distributed indexes - seems like there'd be a diminishing returns spreading it around if you'd expect only 1/1K servers to have a hit for given key | 01:25 |
*** dmorita has quit IRC | 01:25 | |
*** dmorita has joined #openstack-swift | 01:26 | |
notmyname | what do you mean by "tags are an aggregation"? they're stored per key right? | 01:29 |
notmyname | (AWS key == swift object) | 01:29 |
notmyname | oh, they use "object" in their doc. i'll keep using that | 01:29 |
notmyname | what do you mean by "tags are an aggregation"? they're stored per object right? | 01:29 |
notmyname | hmm..it does look like the bucket listings could be filtered by tag. well, they imply that but don't give much info | 01:31 |
notmyname | otherwise it seems like a very object-centric design | 01:32 |
notmyname | I don't see anything that is "show me all my objects that are color=blue". just modifications on requests that also include the actual object name itself | 01:32 |
notmyname | oh, wait. not even the lifecycle stuff is aggregate. "lifecycle management in which you can specify tag-based filter, in addition to key name prefix" | 01:34 |
*** jamielennox is now known as jamielennox|away | 01:34 | |
notmyname | oh, but the next line is "list objects in your bucket by tag" | 01:34 |
notmyname | hmm | 01:34 |
*** isotope has quit IRC | 01:35 | |
openstackgerrit | Merged openstack/python-swiftclient: Fix 'url' to 'URL' https://review.openstack.org/379379 | 01:38 |
openstackgerrit | Merged openstack/swift: Multi Swift - Multiple Swift clusters on same h/w https://review.openstack.org/393794 | 01:38 |
openstackgerrit | Merged openstack/swift: Show team and repo badges on README https://review.openstack.org/402856 | 01:39 |
clayg | mattoliverau: you know I've never really noticed the corrupted and malformed db's in our test suite before - and right there under test/unit/common/ too!? | 01:40 |
kota_ | yeah, exactly, http://docs.aws.amazon.com/AmazonS3/latest/dev/object-tagging.html says that we can filter but... i cannot find the api in http://docs.aws.amazon.com/AmazonS3/latest/API/v2-RESTBucketGET.html | 01:40 |
kota_ | notmyname:^^ | 01:40 |
kota_ | that *will* be available in the future (like v3) api? or sort of hidden parameter? | 01:41 |
clayg | yeah I'm sure they store the keys on the object the same way they do any object metadata | 01:42 |
clayg | also note that to change a tag on an object you have to do a server side copy ;) | 01:42 |
*** m_kazuhiro has joined #openstack-swift | 01:43 | |
*** dmorita has quit IRC | 01:50 | |
openstackgerrit | Merged openstack/python-swiftclient: Show team and repo badges on README https://review.openstack.org/402853 | 01:56 |
*** adu has joined #openstack-swift | 01:59 | |
*** kei_yama has quit IRC | 02:03 | |
*** m_kazuhiro has quit IRC | 02:06 | |
*** kei_yama has joined #openstack-swift | 02:10 | |
*** klrmn has quit IRC | 02:11 | |
*** adu has quit IRC | 02:12 | |
*** wanghua has joined #openstack-swift | 02:22 | |
*** catintheroof has joined #openstack-swift | 02:24 | |
*** supamatt has joined #openstack-swift | 02:36 | |
supamatt | question on swift | 02:37 |
*** bkopilov has quit IRC | 02:38 | |
*** clu_ has quit IRC | 02:38 | |
mattoliverau | clayg: I built a db with a malformed schema.. secret is to build one, then turn on the editable schema PRAGMA, then exit the schema via an SQL update to the sqlite_master table. Will attach it to the patch, but look at adding it to a test in my patch as well. | 02:50 |
mattoliverau | s/exit/edit/ | 02:50 |
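
The recipe mattoliverau describes can be sketched with Python's standard `sqlite3` module. This is only a minimal illustration, not the database attached to the patch: the table name `demo`, the file name, and the replacement schema text are all hypothetical, and it assumes the local SQLite build honours `PRAGMA writable_schema` (builds running in defensive mode refuse the update).

```python
import os
import sqlite3
import tempfile


def build_malformed_db(path):
    """Create a small SQLite file, then corrupt its stored schema text."""
    conn = sqlite3.connect(path)
    conn.execute("CREATE TABLE demo (id INTEGER PRIMARY KEY, name TEXT)")
    conn.commit()
    # Allow direct writes to sqlite_master (assumption: not in defensive mode).
    conn.execute("PRAGMA writable_schema = ON")
    # Overwrite the stored CREATE TABLE statement with text SQLite cannot parse.
    conn.execute(
        "UPDATE sqlite_master SET sql = 'this is not valid sql' "
        "WHERE name = 'demo'"
    )
    conn.commit()
    conn.close()


def schema_error(path):
    """Return the error raised when the schema is loaded, or None."""
    conn = sqlite3.connect(path)
    try:
        conn.execute("SELECT * FROM demo")
    except sqlite3.DatabaseError as err:
        return str(err)
    finally:
        conn.close()
    return None


# Hypothetical file name, chosen to echo the one mentioned in the channel.
db_path = os.path.join(tempfile.mkdtemp(), "malformed_schema.db")
build_malformed_db(db_path)
error = schema_error(db_path)
print(error)
```

A fresh connection has to re-parse the schema, which is why the error only shows up after the file is reopened.
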
mattoliverau | whats up supamatt | 02:50 |
*** jamielennox|away is now known as jamielennox | 02:54 | |
supamatt | mattoliverau: If a failure to insert is complete (all replicas failed) and a 503 is returned, will the key be present or absent in the container listing? | 02:54 |
supamatt | swift object key that is | 03:03 |
mattoliverau | supamatt: if they all failed then the update to the container server wouldn't have been fired off. However swift won't return success unless it gets quorum from the storage nodes. So there is a chance that 1 succeeded but still caused a failure; if one makes it, it would trigger a container update so the object might appear in the list.. and if nothing happens to the < quorum replicas that made it, they could get | 03:06 |
mattoliverau | replicated around and actually exist in the cluster. | 03:06 |
supamatt | So yes, it's possible then. | 03:07 |
mattoliverau | yes, when thinking the logic through in my head ;) | 03:08 |
supamatt | That's baddd maybe? | 03:09 |
supamatt | depends on the data you have in there | 03:09 |
*** kei_yama has quit IRC | 03:11 | |
*** kei_yama has joined #openstack-swift | 03:12 | |
mattoliverau | well its good if the data you wanted to put in, actually goes in :P. Tho I might have to go look at the code to see if what I say is actually true ;) | 03:13 |
*** jamielennox is now known as jamielennox|away | 03:17 | |
openstackgerrit | Matthew Oliver proposed openstack/swift: Quarantine malformed database schema SQLite errors https://review.openstack.org/405031 | 03:17 |
mattoliverau | clayg: ^^ now with a malformed schema db for your testing pleasure | 03:17 |
*** jamielennox|away is now known as jamielennox | 03:26 | |
*** catintheroof has quit IRC | 03:28 | |
*** catintheroof has joined #openstack-swift | 03:28 | |
*** catintheroof has quit IRC | 03:28 | |
*** catintheroof has joined #openstack-swift | 03:30 | |
*** catintheroof has quit IRC | 03:30 | |
*** catintheroof has joined #openstack-swift | 03:31 | |
*** tqtran has joined #openstack-swift | 03:31 | |
*** catintheroof has quit IRC | 03:31 | |
*** tqtran has quit IRC | 03:37 | |
*** jamielennox is now known as jamielennox|away | 03:47 | |
*** psachin has joined #openstack-swift | 03:47 | |
notmyname | supamatt: are you asking about the object write? if that fails on all replicas and 503 is given to the client, what happens? | 03:50 |
notmyname | or what? where's the failure? | 03:50 |
mattoliverau | turns out I was wrong, it won't happen because we (swift) are smart and check for minimum puts (i remember the reorganising patch now that I'm looking at the code) at each chunk and before the final commit. | 03:53 |
mattoliverau | notmyname: turns out it was a customer question, and supamatt's a racker. They were worried about whether a container update could happen if the object servers are too busy to put the obj. | 03:55 |
notmyname | ah | 03:55 |
notmyname | nope | 03:55 |
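
The "minimum puts" check mattoliverau mentions can be pictured roughly as below. This is a hedged sketch, not Swift's proxy code: the function names are hypothetical, and the `(n + 1) // 2` majority-style threshold is an assumption for illustration (the real threshold depends on the storage policy).

```python
def quorum_size(replicas):
    """Assumed majority-style quorum: 2 of 3, 2 of 4, 3 of 5."""
    return (replicas + 1) // 2


def have_enough_puts(live_connections, replicas):
    """True while enough backend connections survive to keep writing.

    A proxy-like writer would call this after every chunk and once more
    before the final commit, aborting with an error if it returns False.
    """
    return live_connections >= quorum_size(replicas)


# Hypothetical 3-replica write where backends drop out one at a time.
checks = [have_enough_puts(alive, 3) for alive in (3, 2, 1, 0)]
print(checks)
```

Under these assumptions the write is abandoned as soon as fewer than two of the three backend connections remain.
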
*** bkopilov has joined #openstack-swift | 04:02 | |
*** thurloat has quit IRC | 04:04 | |
*** PsionTheory has joined #openstack-swift | 04:14 | |
*** thurloat has joined #openstack-swift | 04:15 | |
*** deep_1 has joined #openstack-swift | 04:22 | |
*** jamielennox|away is now known as jamielennox | 04:33 | |
*** PsionTheory has quit IRC | 04:49 | |
*** xionchen_ has quit IRC | 04:54 | |
*** links has joined #openstack-swift | 05:00 | |
*** SkyRocknRoll has joined #openstack-swift | 05:07 | |
*** tqtran has joined #openstack-swift | 05:34 | |
*** qwertyco has joined #openstack-swift | 05:38 | |
*** qwertyco has quit IRC | 05:38 | |
*** qwertyco has joined #openstack-swift | 05:39 | |
*** tqtran has quit IRC | 05:39 | |
*** m_kazuhiro has joined #openstack-swift | 06:03 | |
*** ppai has joined #openstack-swift | 06:12 | |
*** dmorita has joined #openstack-swift | 06:18 | |
*** dmorita has quit IRC | 06:22 | |
*** m_kazuhiro has quit IRC | 06:34 | |
*** ChubYann has quit IRC | 06:36 | |
*** tqtran has joined #openstack-swift | 06:36 | |
*** tqtran has quit IRC | 06:41 | |
*** _JZ_ has quit IRC | 06:44 | |
cschwede | just stumbled upon missing recon metrics in http://docs.openstack.org/developer/swift/admin_guide.html - and remembered i was already looking at them | 06:52 |
cschwede | and indeed, there’s a fix for that! patch 343321 is an easy review ;) | 06:52 |
patchbot | https://review.openstack.org/#/c/343321/ - swift - Add missing recon metrics to admin_guide | 06:52 |
*** sams-gleb has joined #openstack-swift | 07:12 | |
*** sams-gleb has quit IRC | 07:12 | |
*** sams-gleb has joined #openstack-swift | 07:12 | |
openstackgerrit | Clay Gerrard proposed openstack/swift: Safe cleanup listdir for db replicator https://review.openstack.org/405134 | 07:14 |
clayg | mahatic_: in patch https://review.openstack.org/#/c/331601/7/swift/common/db_replicator.py@116 - os.rmdir(suff_dir) changes the state of the filesystem but not the string values in the list referred to by the variable suffixes | 07:17 |
patchbot | patch 331601 - swift - remove empty db hash and suffix directories | 07:17 |
clayg | mattoliverau: that is sick | 07:22 |
clayg | re: malformed_schema.db | 07:22 |
clayg | mattoliverau: well replicated objects are auto committed - it only takes one successful 200 from the backend to rebuild the object - so that one server that returned the 200 can do the container update | 07:28 |
clayg | mattoliverau: on the EC side once the proxy thinks an object is durable ("enough" frags responded) then it sends commit and anyone still alive can send their container updates | 07:29 |
mahatic_ | clayg: hmm if there are no more suffixes (and it's an empty partition), that's what it's supposed to do right? delete the part dir if it's empty? | 07:29 |
clayg | ... either way container is always updated after successful object PUT - so the higher risk would be an object PUT successfully and then die before the container update - which is why the container update is distributed | 07:29 |
mahatic_ | empty part dir* | 07:30 |
clayg | mahatic_: IME empty part dirs *are* cleaned up... patch 138524 | 07:34 |
patchbot | https://review.openstack.org/#/c/138524/ | 07:34 |
*** hseipp has joined #openstack-swift | 07:37 | |
*** ppai has quit IRC | 07:39 | |
mahatic_ | clayg: right, and with this change https://review.openstack.org/#/c/331601/7/swift/common/db_replicator.py@99 we might end up again with an empty part dir which is not cleaned up | 07:44 |
patchbot | patch 331601 - swift - remove empty db hash and suffix directories | 07:44 |
mahatic_ | clayg: Not sure how possible it is in reality, but in a recursive clean up, we might end up there | 07:45 |
mahatic_ | clayg: http://paste.openstack.org/show/591062/ -> line#99 is gonna clean up afb and afc and then move onto 460, leaving empty 450 | 07:55 |
*** ppai has joined #openstack-swift | 07:55 | |
mahatic_ | that is what I think would happen | 07:56 |
*** cschwede has quit IRC | 08:05 | |
*** tmoreira has quit IRC | 08:07 | |
*** ppai has quit IRC | 08:07 | |
clayg | mahatic_: put an empty part dir in one of your container datadirs and run the replicator a couple of times - I think it's cleaned up by line 91 | 08:09 |
clayg | ... but that's not really my point - I get what you're saying about trying to clean it up in the same pass - but I don't feel that you're acknowledging to me that you recognize the values of the variable suffix doesn't change with line 99 - the code is not right - maybe that is obvious and the conversation moved on from implementation to behavior - unclear | 08:11 |
mahatic_ | line 91 will clean it up, I think I was trying to get it cleaned up in one replicator pass | 08:11 |
mahatic_ | clayg: right! | 08:11 |
clayg | ok, i don't think it important to catch it in the same pass - but if you wanted to do that you'd need another listdir - value held in suffixes would not reflect cleared out empty suffix dirs at line 115 otherwise | 08:12 |
mahatic_ | clayg: and correct, my bad, I did not acknowledge that suffix variable doesn't change and that line of code is wrong! | 08:13 |
mahatic_ | clayg: right, thanks. If it's not that important, one more listdir is not worth it I think | 08:15 |
*** oshritf has joined #openstack-swift | 08:19 | |
*** ppai has joined #openstack-swift | 08:19 | |
*** rledisez has joined #openstack-swift | 08:21 | |
clayg | yeah i'm fine with it cleaning up as it goes, I think the listdir would mostly come back with a bunch of suffixes - i'd rather save the listdir for the next pass when we would expect to do something with all the suffixes we found | 08:24 |
clayg | ... or something | 08:24 |
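
clayg's point about the stale `suffixes` list can be reproduced outside of Swift. The snippet below is a standalone sketch, not the `db_replicator.py` code under review: it uses a temporary directory in place of a partition dir, and the suffix names are borrowed from mahatic_'s example purely for illustration.

```python
import os
import tempfile

# Hypothetical partition directory holding two empty suffix directories.
part_dir = tempfile.mkdtemp()
for name in ("afb", "afc"):
    os.mkdir(os.path.join(part_dir, name))

# Snapshot of the directory contents, taken before any cleanup.
suffixes = os.listdir(part_dir)

# Removing the directories changes the filesystem...
for suffix in suffixes:
    os.rmdir(os.path.join(part_dir, suffix))

# ...but the list of strings captured earlier is unchanged.
stale = sorted(suffixes)

# Only a second listdir reflects the new state, so a same-pass cleanup of
# the partition dir has to pay for that extra call.
fresh = os.listdir(part_dir)
if not fresh:
    os.rmdir(part_dir)

print(stale, fresh)
```

Skipping the second `listdir`, as agreed in the channel, simply defers removal of the now-empty partition dir to the next replicator pass.
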
*** Jeffrey4l_ has quit IRC | 08:26 | |
*** jcook has quit IRC | 08:28 | |
mahatic_ | okay | 08:28 |
*** Jeffrey4l_ has joined #openstack-swift | 08:30 | |
*** csmart has quit IRC | 08:33 | |
*** jcook has joined #openstack-swift | 08:34 | |
*** rcernin has joined #openstack-swift | 08:34 | |
clayg | mahatic_: please feel free to squash any code from patch 405134 as well if you do another rev | 08:35 |
*** csmart has joined #openstack-swift | 08:35 | |
patchbot | https://review.openstack.org/#/c/405134/ - swift - Safe cleanup listdir for db replicator | 08:35 |
*** Guest66666 has quit IRC | 08:36 | |
clayg | mahatic_: i dropped the implementation of test_empty_suffix_and_hash_dirs_get_cleanedup that was based on the arcane test_roundrobin_datadirs | 08:36 |
*** zaitcev has quit IRC | 08:36 | |
clayg | i just use the file system and assert the behaviors rather than the implementation | 08:37 |
*** rcernin has quit IRC | 08:37 | |
*** rcernin has joined #openstack-swift | 08:37 | |
*** Guest66666 has joined #openstack-swift | 08:38 | |
*** geaaru has joined #openstack-swift | 08:38 | |
*** sams-gleb has quit IRC | 08:40 | |
mahatic_ | clayg: sure. from a glance I think it has neat refactoring, plus it also fixes https://bugs.launchpad.net/swift/+bug/1646362 | 08:40 |
openstack | Launchpad bug 1646362 in OpenStack Object Storage (swift) "spurious files next to hashdir don't get cleaned up" [Undecided,New] | 08:40 |
clayg | mahatic_: also feel free to *not* do another rev - I put a +2 on it because I think it fixes the bug and should merge - if there's any cleanup that needs to happen *before* it can merge and fix the bug - I didn't see it | 08:40 |
*** sams-gleb has joined #openstack-swift | 08:40 | |
mahatic_ | clayg: I will do that, maybe you could swap the author/co-author on it (I think I'm sure I'll mess it up) | 08:41 |
clayg | mahatic_: I screwed up on that bug - i should have bugged the db-replicator behavior separately from the object-replicator behavior | 08:41 |
mahatic_ | clayg: ah okay, well that sounds fine too | 08:41 |
clayg | mahatic_: hell, i probably should have *just* refactored/cleaned up the test and spurious rmdir calls from your change in the follow up then gone after the hashdir is file fix in another change :'( | 08:42 |
clayg | so many fucking bugs | 08:42 |
clayg | mahatic_: as long as we close lp bug #1583719 i'm happy | 08:43 |
openstack | Launchpad bug 1583719 in OpenStack Object Storage (swift) "empty db hash/suffix directories can pile up" [Medium,In progress] https://launchpad.net/bugs/1583719 - Assigned to Mahati Chamarthy (mahati-chamarthy) | 08:43 |
clayg | i have some long since rebalanced off parts lying on these old disks in this one cluster - but the damn part dirs won't die because they have some empty suffixes in them because who knows how long ago some db's got quarantined | 08:44 |
*** amoralej|off is now known as amoralej | 08:44 | |
mahatic_ | clayg: I'm thinking of a way that maybe could make it easy for you. I could split your follow up if you'd wish that, just that it will be more clear on what it's doing | 08:45 |
*** sams-gleb has quit IRC | 08:45 | |
clayg | mahatic_: sounds *amazing* - i'll take it! | 08:45 |
mahatic_ | clayg: great! | 08:45 |
clayg | mahatic_: just ping me in irc what you need me to review or add me in gerrit - i'll probably catch it in emails | 08:46 |
mahatic_ | yup sure | 08:46 |
clayg | thanks | 08:46 |
mahatic_ | np, thanks too! | 08:46 |
clayg | acahcaphcaph cschwede is not in channel!? | 08:47 |
clayg | wasn't there some critical replication bug fix related to suffix listdirs we still needed to land? | 08:48 |
clayg | lp bug #1644807 | 08:51 |
openstack | Launchpad bug 1644807 in OpenStack Object Storage (swift) "hashes.pkl suffix update is not deterministic and might be never triggered" [High,In progress] https://launchpad.net/bugs/1644807 - Assigned to Christian Schwede (cschwede) | 08:51 |
clayg | s/be never/never be/ - it even *sounds* like cschwede ;) | 08:53 |
*** sams-gleb has joined #openstack-swift | 08:55 | |
*** asettle has joined #openstack-swift | 08:55 | |
*** Guest36 has joined #openstack-swift | 08:59 | |
*** acoles_ is now known as acoles | 09:00 | |
*** hseipp has quit IRC | 09:02 | |
*** ppai has quit IRC | 09:06 | |
acoles | good morning | 09:06 |
*** sams-gleb has quit IRC | 09:07 | |
*** sams-gleb has joined #openstack-swift | 09:08 | |
*** wer has quit IRC | 09:08 | |
*** hseipp has joined #openstack-swift | 09:10 | |
*** sams-gleb has quit IRC | 09:10 | |
*** sams-gleb has joined #openstack-swift | 09:10 | |
*** hogepodge has quit IRC | 09:11 | |
*** sams-gleb has quit IRC | 09:12 | |
*** sams-gleb has joined #openstack-swift | 09:12 | |
*** hogepodge has joined #openstack-swift | 09:12 | |
mahatic_ | acoles: o/ | 09:12 |
*** wer has joined #openstack-swift | 09:13 | |
*** sams-gleb has quit IRC | 09:17 | |
*** ppai has joined #openstack-swift | 09:23 | |
*** tmoreira has joined #openstack-swift | 09:26 | |
*** rcernin has quit IRC | 09:29 | |
*** rcernin has joined #openstack-swift | 09:30 | |
*** hogepodge has quit IRC | 09:31 | |
*** hogepodge has joined #openstack-swift | 09:32 | |
*** wanghua has quit IRC | 09:38 | |
*** Guest36 has quit IRC | 09:40 | |
*** gabor_antal has joined #openstack-swift | 09:45 | |
*** SkyRocknRoll has quit IRC | 09:46 | |
*** davideagle_ has joined #openstack-swift | 09:53 | |
*** openstack has joined #openstack-swift | 10:04 | |
*** rcernin has quit IRC | 10:06 | |
*** rcernin has joined #openstack-swift | 10:07 | |
*** ppai has joined #openstack-swift | 10:09 | |
*** SkyRocknRoll has joined #openstack-swift | 10:16 | |
*** SkyRocknRoll has quit IRC | 10:20 | |
*** SkyRocknRoll has joined #openstack-swift | 10:21 | |
*** psachin has quit IRC | 10:22 | |
*** SkyRocknRoll has quit IRC | 10:29 | |
openstackgerrit | Alistair Coles proposed openstack/swift: Add missing expirer recon metric to admin_guide https://review.openstack.org/405265 | 10:33 |
*** mvk has quit IRC | 10:33 | |
*** tmoreira has quit IRC | 10:33 | |
*** sams-gleb has joined #openstack-swift | 10:33 | |
*** tmoreira has joined #openstack-swift | 10:34 | |
*** dmorita has joined #openstack-swift | 10:34 | |
*** sams-gleb has quit IRC | 10:35 | |
*** sams-gleb has joined #openstack-swift | 10:35 | |
*** psachin has joined #openstack-swift | 10:36 | |
*** aj701_ has joined #openstack-swift | 10:36 | |
openstackgerrit | Mahati Chamarthy proposed openstack/swift: Safe cleanup listdir for db replicator https://review.openstack.org/405134 | 10:38 |
*** patchbot has quit IRC | 10:39 | |
*** tonyb has quit IRC | 10:39 | |
*** aj701 has quit IRC | 10:39 | |
*** dmorita has quit IRC | 10:39 | |
*** sams-gleb has quit IRC | 10:39 | |
*** tmoreira has quit IRC | 10:42 | |
*** patchbot has joined #openstack-swift | 10:43 | |
*** tonyb has joined #openstack-swift | 10:43 | |
*** SkyRocknRoll has joined #openstack-swift | 10:43 | |
*** tmoreira has joined #openstack-swift | 10:44 | |
*** SkyRocknRoll has quit IRC | 10:44 | |
*** SkyRocknRoll has joined #openstack-swift | 10:45 | |
*** cschwede has joined #openstack-swift | 10:45 | |
openstackgerrit | Kazuhiro MIYAHARA proposed openstack/swift: Remove redundant codes from unit test of object updater https://review.openstack.org/405271 | 10:48 |
*** ChanServ sets mode: +v cschwede | 10:51 | |
openstackgerrit | Mahati Chamarthy proposed openstack/swift: Cleanup spurious files next to hashdir https://review.openstack.org/405274 | 10:58 |
mahatic_ | clayg: ^^ there you go 405134 and 405274 | 11:00 |
*** mvk has joined #openstack-swift | 11:09 | |
openstackgerrit | Kazuhiro MIYAHARA proposed openstack/swift: Rename ObjectUpdater variables in unit test to appropriate names https://review.openstack.org/405281 | 11:10 |
*** bapalm has quit IRC | 11:10 | |
*** bapalm has joined #openstack-swift | 11:15 | |
*** deep_1 has quit IRC | 11:43 | |
openstackgerrit | Clay Gerrard proposed openstack/swift: Cleanup tests from deterministic suffix updates https://review.openstack.org/405295 | 11:56 |
*** bkopilov has quit IRC | 12:09 | |
*** catintheroof has joined #openstack-swift | 12:25 | |
*** deep_1 has joined #openstack-swift | 12:34 | |
*** SkyRocknRoll has quit IRC | 12:41 | |
*** catintheroof has quit IRC | 12:48 | |
openstackgerrit | Clay Gerrard proposed openstack/swift: Move documented reclaim_age option to correct location https://review.openstack.org/374419 | 12:56 |
*** vint_bra has joined #openstack-swift | 13:00 | |
*** jamielennox is now known as jamielennox|away | 13:00 | |
*** catintheroof has joined #openstack-swift | 13:13 | |
*** catinthe_ has joined #openstack-swift | 13:18 | |
*** catintheroof has quit IRC | 13:19 | |
*** amoralej is now known as amoralej|lunch | 13:21 | |
*** kei_yama has quit IRC | 13:29 | |
*** ppai has quit IRC | 13:29 | |
*** qwertyco has quit IRC | 13:47 | |
*** dmorita has joined #openstack-swift | 13:52 | |
*** bkopilov has joined #openstack-swift | 13:52 | |
*** dmorita has quit IRC | 13:56 | |
*** jpich has joined #openstack-swift | 14:01 | |
*** rcernin has quit IRC | 14:10 | |
*** rcernin has joined #openstack-swift | 14:11 | |
*** StraubTW has joined #openstack-swift | 14:15 | |
openstackgerrit | Mahati Chamarthy proposed openstack/swift: Confirm receipt of SLO PUT with etag https://review.openstack.org/390901 | 14:16 |
mahatic_ | ^ needs some eyes before I run into merge conflict with tests.py again ;) | 14:19 |
* mahatic_ is taking off mon and tue. will be on/off IRC tomm | 14:19 | |
*** amoralej|lunch is now known as amoralej | 14:21 | |
openstackgerrit | Clay Gerrard proposed openstack/swift: Cleanup tests from empty suffix quarantined db fix https://review.openstack.org/405134 | 14:30 |
clayg | mahatic_: I think patch 331601 is ready for a +A - the follow in patch 405134 is pretty clean at this point I think | 14:32 |
patchbot | https://review.openstack.org/#/c/331601/ - swift - remove empty db hash and suffix directories | 14:32 |
patchbot | https://review.openstack.org/#/c/405134/ - swift - Cleanup tests from empty suffix quarantined db fix | 14:32 |
clayg | mahatic_: i split up the bugs so patch 405274 can close the spurious file issue - I think it'll take a little more review to agree on all the right behaviors | 14:32 |
patchbot | https://review.openstack.org/#/c/405274/ - swift - Cleanup spurious files next to hashdir | 14:32 |
*** manous has joined #openstack-swift | 14:33 | |
clayg | mahatic_: note that lp bug #1646502 is totally wide open for a confirmation or failing unittest or even a fixing patch! I think it's unrelated to your work in the container db walker | 14:34 |
openstack | Launchpad bug 1646502 in OpenStack Object Storage (swift) "spurious files next to object hashdir trigger bad quarantine" [Undecided,New] https://launchpad.net/bugs/1646502 | 14:34 |
*** psachin has quit IRC | 14:34 | |
mahatic_ | clayg: all-nighter? | 14:36 |
cschwede | clayg: are you sleeping sometimes? | 14:36 |
cschwede | mahatic_: heh, same thought :) | 14:36 |
mahatic_ | clayg: ah, you've moved the unlink part as well while splitting up. I thought since it's put a function in place for rmdir calls, it should go with patch 405134. But either way is fine | 14:37 |
patchbot | https://review.openstack.org/#/c/405134/ - swift - Cleanup tests from empty suffix quarantined db fix | 14:37 |
mahatic_ | maybe it more belongs in patch 405274 as it's handling the unexpected files in suffixdir as well | 14:38 |
patchbot | https://review.openstack.org/#/c/405274/ - swift - Cleanup spurious files next to hashdir | 14:38 |
clayg | mahatic_: ah, that might could have been added there i see - but i'm pretty sure that's closer to the spurious files fix - not the empty dir fix | 14:38 |
clayg | and regardless it wasn't "straightforward cleanup over patch 331601" - which is really what i was going for with patch 405134 | 14:39 |
patchbot | https://review.openstack.org/#/c/331601/ - swift - remove empty db hash and suffix directories | 14:39 |
patchbot | https://review.openstack.org/#/c/405134/ - swift - Cleanup tests from empty suffix quarantined db fix | 14:39 |
clayg | cschwede: what do you wanna do about the deterministic hashdir fix's tests? | 14:39 |
mahatic_ | clayg: ack on lp bug #1646502. I'd like to take a better look at it. will only get to do it next (i'm off work until wed) | 14:39 |
openstack | Launchpad bug 1646502 in OpenStack Object Storage (swift) "spurious files next to object hashdir trigger bad quarantine" [Undecided,New] https://launchpad.net/bugs/1646502 | 14:39 |
mahatic_ | clayg: yeah, I see that | 14:39 |
jrichli | mahatic_, clayg: I'll retest and check-out patch 331601 today | 14:40 |
patchbot | https://review.openstack.org/#/c/331601/ - swift - remove empty db hash and suffix directories | 14:40 |
mahatic_ | jrichli: thanks | 14:40 |
cschwede | clayg: what do you mean? i see you already added a cleanup? i was thinking to add sth to probetests | 14:43 |
cschwede | clayg: and probably your cleanup should be squashed into the patch itself? | 14:45 |
clayg | "sth"? | 14:50 |
clayg | yeah we can squash it down - we need to get someone else to review it - onovy maybe? | 14:50 |
jrichli | mahatic_: np! Maybe clayg decided to go to sleep finally ... nope, still awake. Crazy man! | 14:50 |
mahatic_ | clayg: gotcha! I more often than not end looking up the acronyms you use. I think "sth" = something | 14:52 |
mahatic_ | s/end/end up | 14:52 |
mahatic_ | ;) | 14:52 |
cschwede | clayg: i mean i am looking into modifying the test_replication_servers_working.py test, and ensuring that a hashes.pkl with one missing suffix gets replicated finally | 14:52 |
clayg | ah, wow - ok | 14:53 |
cschwede | clayg: i think this bug (together with the random fix of the missing suffix in hashes.pkl) is quite serious, and i want to make sure there is no regression of this in the future | 14:54 |
*** sams-gleb has joined #openstack-swift | 14:55 | |
*** daemontool has joined #openstack-swift | 15:00 | |
*** links has quit IRC | 15:01 | |
*** stevemar has joined #openstack-swift | 15:21 | |
*** stevemar_znc has joined #openstack-swift | 15:29 | |
openstackgerrit | Clay Gerrard proposed openstack/swift: Fixups for ghost listing fix https://review.openstack.org/405450 | 15:32 |
clayg | cschwede: everyone was so jazzed up about those invalid files being faster somehow we forgot that we're messing with a critically important component? probetests are always good | 15:34 |
openstackgerrit | Pavel Kvasnička proposed openstack/swift: WIP: optimize hash calculation when suffix hash invalidated https://review.openstack.org/402043 | 15:34 |
*** silor has joined #openstack-swift | 15:37 | |
openstackgerrit | Alistair Coles proposed openstack/swift: Remove duplicate code in test_diskfile.py https://review.openstack.org/405455 | 15:39 |
openstackgerrit | Pavel Kvasnička proposed openstack/swift: WIP: optimize hash calculation when suffix hash invalidated https://review.openstack.org/402043 | 15:41 |
acoles | clayg: cschwede I have been thinking that it would be good to have a unit test that mocks HASH_PATH_*FIX and does a series of PUTs, POSTs, DELETEs checking the absolute result of a REPLICATE request as it goes in an attempt to prevent any variation in suffix hashes calculations | 15:42 |
*** links has joined #openstack-swift | 15:48 | |
*** silor1 has joined #openstack-swift | 15:49 | |
*** silor has quit IRC | 15:51 | |
*** silor1 is now known as silor | 15:51 | |
*** oshritf has quit IRC | 15:53 | |
*** isotope has joined #openstack-swift | 15:53 | |
*** isotope has quit IRC | 15:56 | |
*** krypto has joined #openstack-swift | 16:00 | |
krypto | hello all, i want to remove a swift node from my cluster permanently for another purpose; right now there are 5 servers total. Can i simply power off the server and will it rebalance automatically? | 16:02 |
*** geaaru has quit IRC | 16:07 | |
*** stevemar_znc is now known as topol_ | 16:12 | |
*** rcernin has quit IRC | 16:13 | |
*** _JZ_ has joined #openstack-swift | 16:15 | |
*** topol_ is now known as topol | 16:20 | |
*** silor has quit IRC | 16:23 | |
*** sams-gleb has quit IRC | 16:23 | |
*** sams-gleb has joined #openstack-swift | 16:24 | |
*** sams-gleb has quit IRC | 16:28 | |
notmyname | good morning | 16:34 |
notmyname | krypto: no, you need to remove its drives from the ring, rebalance, and push the new ring | 16:35 |
notmyname | krypto: swift will treat a powered-down node as unavailable (as opposed to a durability risk) and it won't automatically rebalance that | 16:36 |
*** siva_krish has joined #openstack-swift | 16:40 | |
*** rcernin has joined #openstack-swift | 16:50 | |
*** links has quit IRC | 16:53 | |
*** links has joined #openstack-swift | 16:53 | |
*** dmorita has joined #openstack-swift | 16:54 | |
*** oshritf has joined #openstack-swift | 16:54 | |
notmyname | FYI https://github.com/blog/2288-git-2-11-has-been-released (and it's now available for macos via brew). I like the indent-heuristic features | 17:01 |
*** davideagle_ has quit IRC | 17:03 | |
notmyname | not as useful for day-to-day in the way openstack works with branches, but the `git log 53d4d2130086c656f47e3a7cb5d6c411e8306546^-1` syntax is interesting too | 17:04 |
notmyname | (that commit is the merge that brought in storage policies, and the ^-1 tells git log to only show the commits that the referenced merge commit brought in) | 17:04 |
*** jpich has quit IRC | 17:07 | |
*** krypto has quit IRC | 17:13 | |
*** asettle has quit IRC | 17:24 | |
*** deep_1 has quit IRC | 17:26 | |
*** oshritf has quit IRC | 17:28 | |
*** klrmn has joined #openstack-swift | 17:29 | |
*** isotope has joined #openstack-swift | 17:31 | |
*** diogogmt has joined #openstack-swift | 17:34 | |
*** chlong has joined #openstack-swift | 17:45 | |
openstackgerrit | Merged openstack/swift: Add missing expirer recon metric to admin_guide https://review.openstack.org/405265 | 17:52 |
*** bkopilov has quit IRC | 17:54 | |
openstackgerrit | Merged openstack/swift: Ensure update of the container by object-updater https://review.openstack.org/99598 | 17:54 |
*** manous has quit IRC | 17:57 | |
*** bkopilov has joined #openstack-swift | 17:58 | |
*** siva_krish has quit IRC | 17:59 | |
*** siva_krish has joined #openstack-swift | 18:01 | |
*** dmorita has quit IRC | 18:07 | |
*** dmorita has joined #openstack-swift | 18:11 | |
*** mvk has quit IRC | 18:17 | |
*** henryn has joined #openstack-swift | 18:17 | |
henryn | good morning all | 18:18 |
openstackgerrit | Alistair Coles proposed openstack/swift: Further improvement to deterministic suffix re-hashing tests https://review.openstack.org/405559 | 18:19 |
*** amoralej is now known as amoralej|off | 18:20 | |
*** sams-gleb has joined #openstack-swift | 18:21 | |
acoles | cschwede: clayg: left my comments on patch 402324, would be +2 IFF Clay's follow up for tests is squashed in. | 18:23 |
patchbot | https://review.openstack.org/#/c/402324/ - swift - Fix non-deterministic suffix updates in hashes.pkl | 18:23 |
*** silor has joined #openstack-swift | 18:37 | |
*** acoles is now known as acoles_ | 18:38 | |
*** tqtran has joined #openstack-swift | 18:39 | |
*** ChubYann has joined #openstack-swift | 18:40 | |
*** silor1 has joined #openstack-swift | 18:46 | |
*** silor has quit IRC | 18:47 | |
*** silor1 is now known as silor | 18:47 | |
*** henryn has quit IRC | 18:48 | |
*** daemontool has quit IRC | 18:55 | |
*** mvk has joined #openstack-swift | 19:14 | |
*** sams-gleb has quit IRC | 19:22 | |
*** sams-gleb has joined #openstack-swift | 19:22 | |
*** sams-gleb has quit IRC | 19:27 | |
*** david-lyle_ has joined #openstack-swift | 19:28 | |
*** davideagle_ has joined #openstack-swift | 19:28 | |
*** henryn has joined #openstack-swift | 19:29 | |
*** david-lyle has quit IRC | 19:31 | |
*** openstackgerrit has quit IRC | 19:32 | |
*** openstackgerrit has joined #openstack-swift | 19:33 | |
*** openstackgerrit has quit IRC | 19:36 | |
*** davideagle_ has quit IRC | 19:38 | |
*** hseipp has quit IRC | 19:43 | |
*** jamielennox|away is now known as jamielennox | 19:53 | |
*** david-lyle_ is now known as david-lyle | 20:01 | |
*** henryn has quit IRC | 20:06 | |
*** silor has quit IRC | 20:08 | |
*** dmorita has quit IRC | 20:09 | |
*** links has quit IRC | 20:09 | |
*** dmorita has joined #openstack-swift | 20:09 | |
*** dmorita has quit IRC | 20:10 | |
*** dmorita has joined #openstack-swift | 20:13 | |
*** dmorita has quit IRC | 20:17 | |
*** dmorita has joined #openstack-swift | 20:19 | |
*** dmorita has quit IRC | 20:26 | |
*** henryn has joined #openstack-swift | 20:27 | |
*** isotope has quit IRC | 20:30 | |
*** catinthe_ has quit IRC | 20:34 | |
*** isotope has joined #openstack-swift | 20:42 | |
*** openstack has joined #openstack-swift | 20:45 | |
*** siva_krish has quit IRC | 20:51 | |
*** henryn has quit IRC | 20:58 | |
*** siva_krish has joined #openstack-swift | 20:58 | |
*** dmorita has joined #openstack-swift | 20:58 | |
*** openstackgerrit has joined #openstack-swift | 21:04 | |
openstackgerrit | Merged openstack/swift: remove empty db hash and suffix directories https://review.openstack.org/331601 | 21:04 |
*** rcernin has quit IRC | 21:10 | |
*** nexusz99 has joined #openstack-swift | 21:11 | |
openstackgerrit | OpenStack Proposal Bot proposed openstack/python-swiftclient: Updated from global requirements https://review.openstack.org/89250 | 21:17 |
openstackgerrit | OpenStack Proposal Bot proposed openstack/swift: Updated from global requirements https://review.openstack.org/88736 | 21:18 |
pdardeau | this is probably a dumb question, is use of jumbo frames the norm for swift deployments? | 21:21 |
notmyname | pdardeau: I've heard that question before. let me ask around | 21:24 |
pdardeau | notmyname: thanks! | 21:26 |
notmyname | pdardeau: ok, I got an answer from some of our support people :-) | 21:26 |
notmyname | short answer: good technical idea, generally bad operationally | 21:27 |
notmyname | so it's probably not worth it | 21:28 |
pdardeau | notmyname: thanks! i've heard similar things in the past, but wanted to see if that's still the case | 21:29 |
notmyname | operationally it's not great because it requires the whole networking infrastructure to be set up with it. and, to quote our support person, "they complain swift isn't working and we ask if the MTUs are the same everywhere and then two weeks later it's 'oh yeah, someone changed a switch config' or something" | 21:30 |
*** siva_krish has quit IRC | 21:30 | |
notmyname | so unless you've got decent controls over that sort of thing and are tuning for a specific workload, it's not worth the operational overhead | 21:31 |
ahale | yeah, im pretty sure when i was with rackspace we didn't do any of that sort of thing as it was too much hassle | 21:33 |
notmyname | oh hi ahale! | 21:33 |
ahale | o/ | 21:34 |
*** Jeffrey4l_ has quit IRC | 21:34 | |
*** siva_krish has joined #openstack-swift | 21:35 | |
*** Jeffrey4l_ has joined #openstack-swift | 21:37 | |
pdardeau | ahale: thx for sharing | 21:44 |
*** dmorita has quit IRC | 21:44 | |
openstackgerrit | Merged openstack/pyeclib: Cleanup lint errors in test suite https://review.openstack.org/404457 | 21:45 |
*** dmorita has joined #openstack-swift | 21:45 | |
*** diogogmt has quit IRC | 21:45 | |
clayg | timburke: i'm not so sure the rmdir is not a big deal - it might be? maybe a script that will put a roundrobin_datadirs into a tight loop; while another script does some direct container PUTs? | 21:47 |
notmyname | clayg: good morning! | 21:48 |
clayg | looking at lock_path it does look like maybe the mkdir/open(lockpath) could blow up | 21:48 |
timburke | clayg: yeah, our directory locking seems like it could be problematic :-( | 21:49 |
clayg | the ENOTEMPTY in roundrobin (should such a thing occur) would be addressed in patch 405274 if we ever make it that far | 21:49 |
patchbot | https://review.openstack.org/#/c/405274/ - swift - Cleanup spurious files next to hashdir | 21:49 |
clayg | timburke: i'm not aware of ever having seen that race in a traceback - it seems like other things (container db reclamation) are cleaning suffix dirs currently | 21:51 |
clayg | so if it *is* a problem it'd just be making it more frequent | 21:51 |
clayg | ... it's possible there's some other interaction with container PUT that resolves the race | 21:51 |
tdasilva | is clayg in japan or australia? | 21:52 |
clayg | jrichli: missed that about the error case test in all the mocks - mocks are definitely the way to handle the error case - a targeted test to catch the OSError line would be the ideal! | 21:53 |
jrichli | yes, that would be better | 21:54 |
timburke | clayg: that reminds me: why are we so gung-ho to delete things when we don't know wtf they are? am i the only one that gets nervous doing that? | 21:54 |
jrichli | timburke: I was nervous too, but the rmdir doesn't succeed if it's non-empty. And a random empty dir seems ok to remove. | 21:55 |
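A minimal Python sketch of the property jrichli describes, assuming throwaway paths under a temp directory (the `abc` suffix dir and `stray-file` names are made up for illustration, and `try_rmdir` is a hypothetical helper, not Swift code): `os.rmdir` refuses a non-empty directory with an OSError and only removes it once it is empty.

```python
import errno
import os
import tempfile


def try_rmdir(path):
    """Try to remove a directory; return True if removed, False if it was not empty."""
    try:
        os.rmdir(path)
        return True
    except OSError as err:
        # ENOTEMPTY on Linux; some platforms report EEXIST instead.
        if err.errno in (errno.ENOTEMPTY, errno.EEXIST):
            return False
        raise


with tempfile.TemporaryDirectory() as root:
    suffix_dir = os.path.join(root, "abc")  # hypothetical suffix dir
    os.mkdir(suffix_dir)
    stray = os.path.join(suffix_dir, "stray-file")  # hypothetical spurious file
    with open(stray, "w") as f:
        f.write("x")

    # The directory still holds a file, so rmdir is refused.
    removed_while_full = try_rmdir(suffix_dir)

    # Once the file is gone, the same call removes the directory.
    os.unlink(stray)
    removed_when_empty = try_rmdir(suffix_dir)
    still_exists = os.path.exists(suffix_dir)
```

Nothing here decides what to do with the stray file; it only shows that a plain rmdir cannot silently take one with it.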
clayg | timburke: we have to make progress, i'm not that *gung-ho* about deleting spurious files next to suffix dirs - but I'd rather not stop the partition from being balanced off the node | 21:55 |
clayg | so i'm looking to the object-replicator for inspiration | 21:56 |
clayg | it seems to quarantine spurious hashdir-files and while it ignores spurious suffixdir-files it *will* recursively destroy them (and the hashes.pkl they sit next to) when it deletes the handoff partdir after sync (n.b. sync only moves suffixdirs, never spurious suffixdir-files) | 21:57 |
clayg | it's a crap system - but it works | 21:57 |
jrichli | maybe I need to look again: I was only focused on recent changes made, and I dont see where we are removing more files than we had before | 21:57 |
timburke | jrichli: yeah, the empty dirs are fine. i'm looking at the patch clayg just linked | 21:58 |
jrichli | timburke: oh right, thx! | 21:58 |
clayg | timburke: i'm very much hoping someone has better ideas ;) | 21:58 |
clayg | timburke: if there are parts you like but are less sure on others we can try to get mahatic_ to split it up some more | 21:59 |
clayg | i'm mostly just glad to have lp bug #1583719 closed - hopefully we didn't wreck anything else | 21:59 |
openstack | Launchpad bug 1583719 in OpenStack Object Storage (swift) "empty db hash/suffix directories can pile up" [Medium,In progress] https://launchpad.net/bugs/1583719 - Assigned to Mahati Chamarthy (mahati-chamarthy) | 21:59 |
timburke | idk, emit a warning? quarantine it? do something so the operator knows that these things we expected to all be directories had a file in 'em | 22:00 |
mattoliverau | morning | 22:00 |
clayg | no why what happened! patch 393276 | 22:00 |
patchbot | https://review.openstack.org/#/c/393276/ - pyeclib - ISA-L Cauchy support | 22:00 |
timburke | ahahaha must've been me. but it's also you :P | 22:01 |
clayg | timburke: object-replicator just ignores them and then deletes them when it removes the partition - what's wrong with that pattern? I don't want parts to not balance until operator "does something" - I'd be ok if we could trick the quarantine machinery into eating them | 22:01 |
*** sams-gleb has joined #openstack-swift | 22:02 | |
clayg | os.path.join(suffixdir_path_file, 'some-pretend-hashdir', 'some-pretend-filename') might work? did you see lp bug #1646502 | 22:02 |
openstack | Launchpad bug 1646502 in OpenStack Object Storage (swift) "spurious files next to object hashdir trigger bad quarantine" [Undecided,New] https://launchpad.net/bugs/1646502 | 22:02 |
clayg | timburke: so... are you going to do that roundrobin datadir and container PUT race script test? | 22:03 |
*** openstackgerrit has quit IRC | 22:03 | |
clayg | timburke: or do you think we can wait and see if the tracebacks happen in the wild? in either race I feel like a) roundrobin datadirs will throw and try again next pass b) container server will throw and 500 (but other container replicas probably won't) | 22:04 |
*** clu_ has joined #openstack-swift | 22:05 | |
*** sams-gleb has quit IRC | 22:06 | |
*** sams-gleb has joined #openstack-swift | 22:07 | |
jrichli | timburke: do we create the .lock in such a way that it will create sub-dirs if they don't exist? | 22:10 |
jrichli | and if we do, it seems like an OS thing at that point for the race | 22:10 |
*** sams-gleb has quit IRC | 22:11 | |
clayg | omg the stuff people get away with when there is no linting!? 3 spaces, srsly!? ;) | 22:12 |
clayg | jrichli: there is no system call for open this file and create the dirs if they don't exist | 22:13 |
jrichli | ok :-) | 22:13 |
clayg | jrichli: timburke: gah the more I look at common/db.py the more I see stuff that will get mad when the container-replicator removes empty dirs from under it :'( | 22:17 |
jrichli | which rmdirs are risky? some already existed IIRC | 22:19 |
jrichli | like the one we just merged? | 22:20 |
*** vint_bra has quit IRC | 22:20 | |
clayg | jrichli: yeah timburke pointed out that I should probably be more scared about the rmdir race than I was (since we were going to be doing more rmdirs!) | 22:24 |
clayg | but... i wasn't scared... and now I'm thinking he was probably right and I should be scared | 22:25 |
jrichli | so he said just a traceback. is that because there are retries? | 22:25 |
jrichli | or would something fail and not get retried? | 22:25 |
clayg | ... but i'm not writing the script to test if the new roundrobin_datadirs + heavy container PUT load will race - so maybe I'm not *that* scared | 22:25 |
clayg | jrichli: unclear how it will fail w/o testing | 22:25 |
clayg | jrichli: might not fail at all in practice - could be entirely theoretical | 22:26 |
clayg | but it's a *good theory*! | 22:26 |
jrichli | agreed | 22:27 |
jrichli | i have wondered before about having a heavy load test in the gate. i guess it would be too resource intensive. | 22:29 |
jrichli | i know, in this case, you'd have to have those empty dirs too | 22:29 |
jrichli | but it just made me think of it | 22:29 |
jrichli | well, i guess in theory they'd be created under this high load | 22:30 |
jrichli | just run replication more than usual | 22:31 |
*** chlong has quit IRC | 22:50 | |
timburke | clayg: jrichli: ok, that's about as tight a loop as i think i can manage http://paste.openstack.org/show/591185/ | 22:51 |
timburke | i'm not gonna worry about it. can't for the life of me get the race to trigger; not with threads, not with processes | 22:51 |
clayg | timburke: is the best! | 22:54 |
jrichli | timburke: glad you aren't worried. I am still running a test of my own :-) | 22:55 |
*** isotope has quit IRC | 22:59 | |
clayg | timburke: that script is great - for me the tipping point is ~35 microseconds of sleep between lock_path's mkdir and open; sleep(0.000035) will cause it to trip in the for loop | 23:07 |
jrichli | so, i am looking at storage logs more than I usually do. I am seeing a bunch of HEADs and GETs to .misplaced_objects. is that normal when you replicate? | 23:07 |
jrichli | like HEAD /sdb2/701/.misplaced_objects | 23:08 |
clayg | that's just the reconciler doing its thing | 23:09 |
jrichli | clayg: whew :-) | 23:09 |
clayg | it's sort of like the object expirer | 23:09 |
clayg | maybe its frequency is less or something idk | 23:09 |
timburke | clayg: by "trip", do you mean to say that the loop didn't run? or that it would actually hit an OSError during open()? | 23:09 |
timburke | 'cause that last one is what i was worried about, but i just didn't see it | 23:10 |
*** isotope has joined #openstack-swift | 23:12 | |
clayg | yeah with some sleep in there you can hit the OSError - that's how I knew the code was exercising the race | 23:12 |
clayg | it's in there - just NBD in practice | 23:12 |
timburke | oh, i see what you were saying now! got it | 23:12 |
clayg | if this code can't even hit it, a replicator vs a container server has only a snowball's chance in hell (meaning we'll totally get that traceback some day - but it won't be as soon as this sha makes it out to prod) | 23:13 |
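The race discussed above can be sketched in Python under some loud assumptions: `lock_dir_sketch` is a simplified, hypothetical stand-in for a mkdir-then-open lock helper and is not Swift's actual `lock_path`, and the `between_mkdir_and_open` hook replaces the tiny sleep clayg mentions so that the interleaving is deterministic rather than timing-dependent. If a cleanup pass removes the freshly created (still empty) directory inside that window, the open of the lock file fails with an OSError.

```python
import errno
import os
import tempfile


def lock_dir_sketch(directory, between_mkdir_and_open=None):
    """Simplified stand-in for a mkdir-then-open lock helper (not Swift's code)."""
    os.makedirs(directory, exist_ok=True)
    if between_mkdir_and_open is not None:
        # Hook simulating another process running inside the race window.
        between_mkdir_and_open()
    lockpath = os.path.join(directory, ".lock")
    fd = os.open(lockpath, os.O_WRONLY | os.O_CREAT)
    os.close(fd)
    return lockpath


with tempfile.TemporaryDirectory() as root:
    target = os.path.join(root, "suffix", "hash")  # hypothetical db dir layout

    # No interference: the lock file is created as expected.
    ok_path = lock_dir_sketch(target)
    ok_created = os.path.exists(ok_path)
    os.unlink(ok_path)
    os.rmdir(target)

    # Interference: a cleanup pass rmdirs the empty dir inside the window,
    # so the open of the lock file fails because its parent is gone.
    race_errno = None
    try:
        lock_dir_sketch(target, between_mkdir_and_open=lambda: os.rmdir(target))
    except OSError as err:
        race_errno = err.errno
```

In a real deployment the window is only as wide as the gap between the two system calls, which fits the observation that the failure is reachable but hard to trigger without an artificial delay.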
*** StraubTW has quit IRC | 23:13 | |
*** isotope has quit IRC | 23:16 | |
jrichli | I still have some ideas for testing i'll do later for fun. but i'm goin to eat dinner now | 23:17 |
clayg | do eet! | 23:17 |
clayg | nom nom | 23:17 |
*** kei_yama has joined #openstack-swift | 23:30 | |
*** siva_krish has quit IRC | 23:40 |