*** gyee has quit IRC | 00:27 | |
*** MooingLe1ur has quit IRC | 01:24 | |
*** MooingLemur has joined #openstack-swift | 01:25 | |
openstackgerrit | Merged openstack/swift stable/victoria: Update .gitreview for stable/victoria https://review.opendev.org/752750 | 02:22 |
---|---|---|
*** rcernin has quit IRC | 02:38 | |
*** rcernin has joined #openstack-swift | 02:42 | |
*** psachin has joined #openstack-swift | 03:27 | |
openstackgerrit | Tim Burke proposed openstack/pyeclib master: Fix liberasurecode-src tox target https://review.opendev.org/753936 | 03:58 |
openstackgerrit | Tim Burke proposed openstack/pyeclib master: Use liberasurecode-src tox target in gate https://review.opendev.org/753938 | 04:03 |
openstackgerrit | Tim Burke proposed openstack/pyeclib master: Use liberasurecode-git tox target in gate https://review.opendev.org/753938 | 04:32 |
*** evrardjp has quit IRC | 04:33 | |
*** evrardjp has joined #openstack-swift | 04:33 | |
openstackgerrit | Merged openstack/swift stable/victoria: Fixing S3 credential caching region support https://review.opendev.org/753839 | 04:40 |
*** psachin has quit IRC | 04:45 | |
openstackgerrit | Tim Burke proposed openstack/pyeclib master: Fix liberasurecode-git tox target https://review.opendev.org/753936 | 04:55 |
openstackgerrit | Tim Burke proposed openstack/pyeclib master: Use liberasurecode-git tox target in gate https://review.opendev.org/753938 | 04:55 |
*** m75abrams has joined #openstack-swift | 05:06 | |
openstackgerrit | Tim Burke proposed openstack/pyeclib master: Use liberasurecode-git tox target in gate https://review.opendev.org/753938 | 05:37 |
openstackgerrit | Merged openstack/pyeclib master: Fix liberasurecode-git tox target https://review.opendev.org/753936 | 05:51 |
*** baojg has quit IRC | 07:29 | |
*** baojg has joined #openstack-swift | 07:30 | |
*** djhankb has quit IRC | 07:31 | |
*** djhankb has joined #openstack-swift | 07:31 | |
*** rcernin has quit IRC | 08:35 | |
*** rcernin has joined #openstack-swift | 08:41 | |
*** rcernin has quit IRC | 08:57 | |
*** baojg has quit IRC | 09:04 | |
*** baojg has joined #openstack-swift | 09:41 | |
*** adriant has quit IRC | 09:52 | |
*** adriant has joined #openstack-swift | 09:53 | |
*** baojg has quit IRC | 10:22 | |
*** baojg has joined #openstack-swift | 10:23 | |
*** rcernin has joined #openstack-swift | 10:35 | |
*** rcernin has quit IRC | 10:56 | |
*** tdasilva has quit IRC | 13:15 | |
*** tdasilva has joined #openstack-swift | 13:17 | |
*** ChanServ sets mode: +v tdasilva | 13:17 | |
*** baojg has quit IRC | 13:21 | |
*** rcernin has joined #openstack-swift | 13:36 | |
*** jv has quit IRC | 13:50 | |
*** rcernin has quit IRC | 13:51 | |
*** jv has joined #openstack-swift | 13:54 | |
*** baojg has joined #openstack-swift | 14:21 | |
*** baojg has quit IRC | 14:46 | |
*** baojg has joined #openstack-swift | 14:47 | |
openstackgerrit | Tim Burke proposed openstack/swift master: Add a new URL parameter to allow for async cleanup of SLO segments https://review.opendev.org/733026 | 15:05 |
*** tdasilva_ has joined #openstack-swift | 15:12 | |
*** ChanServ sets mode: +v tdasilva_ | 15:12 | |
*** tdasilva has quit IRC | 15:14 | |
*** m75abrams has quit IRC | 15:31 | |
*** baojg has quit IRC | 15:45 | |
*** gyee has joined #openstack-swift | 15:59 | |
openstackgerrit | Tim Burke proposed openstack/pyeclib master: Use liberasurecode-git tox target in gate https://review.opendev.org/753938 | 16:54 |
timburke | clayg, hmm.... https://github.com/openstack/swift/blob/2.26.0/swift/common/middleware/staticweb.py#L431-L435 | 17:25 |
clayg | and we normally miss it because of env.get('REMOTE_USER') in __call__ ? | 17:27 |
timburke | yup -- https://github.com/openstack/swift/blob/2.26.0/swift/common/middleware/staticweb.py#L544-L546 | 17:27 |
clayg | well spotted - curious about your methodology - i poked at this for awhile and got stuck | 17:28 |
timburke | heh. wrote up a little middleware like http://paste.openstack.org/show/798350/ and put it between most every middleware in my pipeline :-) | 17:29 |
timburke | grepping for asdf and running the unauthed req gave me logs like http://paste.openstack.org/show/798351/ | 17:31 |
*** baojg has joined #openstack-swift | 17:32 | |
timburke | seems weird that we copy the env just to update user agent and source :-/ | 17:35 |
timburke | looks like that `tmp_env = dict(env)` behavior goes *way* back: https://github.com/openstack/swift/commit/eac721b003ae16931f0deb1f1457d5cbca263d74 | 17:38 |
clayg | fucking *brilliant* 🤩 | 17:43 |
timburke | lol i was feeling like "this is so stupid and brutish" | 17:50 |
zaitcev | https://blocksandfiles.com/2020/09/24/seagate-cortx-object-storage-software/ Kynetic debacle was not enough for Seagate | 18:33 |
openstackgerrit | Tim Burke proposed openstack/pyeclib master: Move liberasurecode-git behavior into all tox envs with compilelibs marker https://review.opendev.org/753938 | 18:47 |
openstackgerrit | Tim Burke proposed openstack/liberasurecode master: Be willing to write fragments with legacy crc https://review.opendev.org/738959 | 19:05 |
ormandj | when a HEAD requests fails (hits conn_timeout) on a proxy, due to a busy backend disk, should we not see a retry to another location (3x replication)? | 19:09 |
*** openstackgerrit has quit IRC | 19:14 | |
timburke | ormandj, i would definitely expect HEADs to try alternate locations on a timeout | 19:25 |
ormandj | timburke: can you think of any reason they wouldn't? i grep the path in the logs and definitely do not see a retry | 19:26 |
ormandj | this may be a part of a composite operation like a mass delete or something because i don't see the HEAD in the lb logs | 19:30 |
timburke | were you looking at proxy or object logs? or both? | 19:32 |
ormandj | proxy | 19:32 |
ormandj | see the failed HEAD to the ip/port of one of the storage node drives (we're on servers_per_port now) | 19:33 |
ormandj | conn_timeout (at 20s currently) then no access attempts again for the same object | 19:33 |
ormandj | looking at LB logs (in front of the proxies) i don't see any actual HEAD requests for that object so i'm guessing it's part of another operation the proxy is expanding into multiple calls | 19:35 |
ormandj | then i look at the storage node logs, and it gets really crazy. i just looked for the object in question, and found zillions of deletes. (storage node that one of the timeouts was for) for that same object, all with the same txn | 19:41 |
ormandj | scratch that, was different objects, it's definitely part of a mass delete | 19:43 |
timburke | might be related to s3api's multi-delete behavior -- i know we do a HEAD before delete to check if it's an MPU (in which case we need to add a flag so SLO sill clean up the segments) | 19:46 |
ormandj | yeah, i guess that HEAD isn't retried then | 19:46 |
timburke | i take it the proxy logged a 503 for the HEAD? | 19:47 |
ormandj | looking now | 19:48 |
ormandj | Sep 24 19:37:21 BLAH proxy-server: ERROR with Object server x.x.x.x:7021/sdaa re: Trying to DELETE /AUTH_idstuff/veeam/Veeam/Archive/extendedretention/lotsofstuff/00000000-0000-0000-0000-000000000000/blocks/anotherid/lotsofobjectgunk.blk: ConnectionTimeout (20.0s) (client_ip: x.x.x.x) | 19:50 |
ormandj | doesn't really log much of anything, unlike normal client requests | 19:50 |
ormandj | but if you grep for that object.blk file you'll not see access again after the timeout | 19:51 |
ormandj | i had to go onto the storage node to even find the txn by grepping for the object name | 20:02 |
ormandj | and once i had the txn i was able to see it was part of a mass delete | 20:02 |
timburke | wacky -- i would've expected a normal access line like `Sep 24 19:54:07 saio proxy-server: 127.0.0.1 127.0.0.1 24/Sep/2020/19/54/07 HEAD /v1/AUTH_test/bucket/not-really-there%3Fsymlink%3Dget HTTP/1.0 404 - aws-cli/1.18.140%20Python/3.6.9%20Linux/4.15.0-46-generic%20botocore/1.17.63%20StaticWeb - - - - txa04ff5bb91d44b0691143-005f6cf95f - 0.0193 SW - 1600977247.688429356 1600977247.707708120 0` | 20:02 |
ormandj | timburke: yeah, exactly, that would have made it a lot easier to hunt down for sure ;) | 20:02 |
ormandj | it seems like all of these re: mass delete look like this | 20:03 |
timburke | is force_swift_request_proxy_logturned on, or off/not present? | 20:03 |
ormandj | looking | 20:03 |
ormandj | no, defaulted to false | 20:03 |
ormandj | should that be on? | 20:03 |
timburke | depends on what you're interested in i suppose. though if having it off means we don't log proper subrequests, that sure seems like a bug... | 20:04 |
ormandj | https://opendev.org/openstack/swift/src/branch/master/etc/proxy-server.conf-sample#L593 <-- we basically mirror these on every update, then just apply our changes to override things we need to specific to our environment | 20:05 |
ormandj | well, we definitely want enough logging to be useful hah, so we can turn that one. the account info would be very helpful in addition to 'fixing' this so we'll flip the bits | 20:06 |
timburke | might want to look at log_msg_template -- recently we merged some fixes so that {account} works as you'd hope for s3 requests | 20:07 |
ormandj | recently as in ussuri? | 20:08 |
timburke | victoria -- https://opendev.org/openstack/swift/commit/b2efd185c | 20:10 |
ormandj | well, guess when we do that upgrade we'll reap the benefits, until then, we can do this other one | 20:11 |
*** zamba has quit IRC | 20:41 | |
*** zamba has joined #openstack-swift | 20:54 | |
*** baojg has quit IRC | 21:53 | |
*** baojg has joined #openstack-swift | 21:54 | |
*** baojg has quit IRC | 21:58 | |
*** baojg has joined #openstack-swift | 21:59 | |
*** openstackgerrit has joined #openstack-swift | 22:42 | |
openstackgerrit | Tim Burke proposed openstack/swift master: Add a new URL parameter to allow for async cleanup of SLO segments https://review.opendev.org/733026 | 22:42 |
*** rcernin has joined #openstack-swift | 22:58 | |
*** rcernin has quit IRC | 23:14 | |
*** rcernin has joined #openstack-swift | 23:14 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!