Monday, 2023-11-06

opendevreview: Yian Zong proposed openstack/cinder master: Add Cinder active-active support for Dell PowerFlex driver  https://review.opendev.org/c/openstack/cinder/+/899795 [05:23]
opendevreview: Cuiye Liu proposed openstack/cinder master: Dell PowerMax : Add the function of Dell PowerMax live migration without a pool name  https://review.opendev.org/c/openstack/cinder/+/898188 [07:50]
*** geguileo is now known as Guest5939 [10:24]
*** elodilles_pto is now known as elodilles [11:39]
opendevreview: Yian Zong proposed openstack/cinder master: Add Cinder active-active support for Dell PowerFlex driver  https://review.opendev.org/c/openstack/cinder/+/899795 [12:38]
opendevreview: Bryan Neumann proposed openstack/cinder master: Dell EMC: PowerMax - Configurable SRDF snapshots  https://review.opendev.org/c/openstack/cinder/+/899051 [15:06]
noonedeadpunk: Hey folks. Quick question - should cinder-volumes have access to the DB in the cinder.conf? As I've spotted we do have that, but have concerns if that should be the case or cinder volume should communicate with DB only through scheduler/api alike to nova-computes? [15:14]
noonedeadpunk: It looks like they do communicate with DB through RPC to me, but want to double-check to be sure :) [15:17]
greatgatsby_: Hello.  I'm trying to use the cinder CLI version 9.4.0 against our yoga openstack environment.  I keep getting "ERROR: Version 3.70 is not supported by the API.  Minimum is 3.0 and maximum is 3.68".  However, even if I pass --os-volume-api-version 3.68 I still get the same error. [15:40]
greatgatsby_: thanks for any suggestions you might have [15:41]
greatgatsby_: I've also tried the OS_VOLUME_API_VERSION env var, but no joy [16:00]
*** gouthamr_ is now known as gouthamr [16:09]
rosmaita: greatgatsby_: i think you are getting that error because the API service you are contacting (not the client) only supports up to 3.68 [16:21]
rosmaita: greatgatsby_: 3.68 is the max available in yoga: https://docs.openstack.org/api-ref/block-storage/api_microversion_history.html#maximum-in-yoga [16:22]
rosmaita: greatgatsby_: and i misread your question ... you already know you only want 3.68 [16:23]
noonedeadpunk: basically why I've started asking about cinder-volumes accessing DB, is that I'm trying to find out reasons for cinder catching deadlocks and not re-trying afterwards, ie: https://paste.openstack.org/show/boTkGGE7HZAMxXO8kywD/ [16:26]
noonedeadpunk: I catch quite some of these things after upgrade to 2023.1 [16:28]
greatgatsby_: rosmaita: really appreciate the response.  So am I misunderstanding what --os-volume-api-version does?  Does that not make the client talk to the api at that specified version? [16:28]
rosmaita: greatgatsby_: no, you are understanding, i misread your question ... the --os-volume-api-version is supposed to specify the mv you want the client to use [16:29]
greatgatsby_: maybe I'll try downgrading the client and using that option, perhaps it's something not working in a recent release [16:30]
rosmaita: noonedeadpunk: cinder-volume server absolutely needs to talk to the db [16:30]
noonedeadpunk: mhm, ok, I see. It just felt it talks through rpc according to these stack traces... [16:32]
greatgatsby_: rosmaita: I'd still like to get the standalone cinder client working, but in case this is an XY problem, is there an OSC equivalent for `cinder quota-class-update`? [16:33]
rosmaita: greatgatsby_: https://docs.openstack.org/python-openstackclient/2023.1/cli/decoder.html#cinder-cli [16:37]
rosmaita: looks like there is, let me know what happens [16:37]
greatgatsby_: oh wow, thanks for that link! [16:38]
noonedeadpunk: shouldn't there be some attempts to retry transaction in case of catching deadlocks? [16:38]
noonedeadpunk: Because I feel a bit confused right now on where to dig to be frank... [16:39]
noonedeadpunk: Like disabling deadlock detection in innodb doesn't sound very right, as then transactions may really start piling up [16:39]
noonedeadpunk: and I had some hope that with coordination deadlocks shouldn't really happen.... [16:40]
greatgatsby_: rosmaita: seems this might be an XY problem anyway, appears the whole "class" thing is deprecated anyway, I'll have to figure out what we want to do here.  Again, *really* appreciate the help. [16:49]
rosmaita: greatgatsby_: if you have a few minutes, please file a bug against the cinderclient [16:49]
rosmaita: https://launchpad.net/python-cinderclient [16:50]
greatgatsby_: rosmaita: yes, I'm downgrading the client until (hopefully) the original command with the api version specified works, and I'll include that in the ticket [16:50]
rosmaita: thanks! [16:50]
greatgatsby_: with the mapped command, I'm not sure what we want to do yet [16:50]
rosmaita: noonedeadpunk: what's in the cinder-volume log for that API call?  you should be able to use the request-id to find the relevant stuff [17:02]
rosmaita: noonedeadpunk: i have to go afk for about an hour, will check back later [17:02]
noonedeadpunk: rosmaita: on cinder-volume it's pretty much same: https://paste.openstack.org/show/bJFQ91WBLW3Zbjshep6p/ [17:05]
rosmaita: dang, i was hoping we'd get more context [17:11]
noonedeadpunk: well [17:13]
noonedeadpunk: I've found an interesting thing now. [17:13]
noonedeadpunk: same request ID but different host: https://paste.openstack.org/show/bIkyBnC577JrtFaNl4iL/ [17:13]
noonedeadpunk: So... they're really racing to execute the request [17:13]
noonedeadpunk: and that's basically what's causing the deadlock I assume [17:14]
noonedeadpunk: it's active/active setup with ceph backend fwiw [17:15]
noonedeadpunk: and using zookeeper as coordination driver [17:15]
noonedeadpunk: it's really confusing... As from trace I see it goes through @synchronized.... [17:20]
noonedeadpunk: And in Zookeeper I do have an empty object /cinder/locks/cinder-attachment_update-de2d8ab8-e743-4837-885f-eab22a5f8e31-compute13 [17:24]
noonedeadpunk: fwiw, I have soooooooo much of them - feels like they're never being cleaned up [17:25]
noonedeadpunk: and in zookeeper logs I don't see anything really fishy. actually, it seems normal to have that much data there, since locks are created inside each such object according to the log [17:30]
noonedeadpunk: /cinder/locks/cinder-attachment_update-de2d8ab8-e743-4837-885f-eab22a5f8e31-compute13/0f4064ff7fb54ab680f2262585926c00__lock__0000000006+cinder-19ecf8fb-a517-4382-8ef9-8b513ec2ed37worldanyone [17:31]
noonedeadpunk: I kinda wonder if I have active/active setup misconfigured, as that has happened with me before and I got confused with how to properly set it up in past.... [17:47]
noonedeadpunk: huh, but according to trace from cinder-volume - it doesn't go through coordination driver, while I guess it should? [17:50]
greatgatsby_: rosmaita: https://bugs.launchpad.net/python-cinderclient/+bug/2042863  sorry if it's not the greatest bug report, I'm pretty under the weather right now [17:51]
noonedeadpunk: and another issue I've just spotted is related to cinder-manage db purge.: https://paste.openstack.org/show/bE0lNjzEdZu0CzZz15Xx/ [18:11]
noonedeadpunk: that looks a bit like commit is somehow needed to proceed... But it could be just smth obsolete having old schema.... [18:11]
noonedeadpunk: I guess I'd need to check that volume_admin_metadata is having correct info in it and wasn't messed up manually.... [18:12]
noonedeadpunk: So disregard that for now :) [18:13]
noonedeadpunk: Race condition for cinder-volume with coordination is waaaay more interesting :p [18:13]
*** thelounge5514 is now known as thelounge551 [19:17]
*** JayF is now known as Guest6061 [19:55]
*** JasonF is now known as JayF [19:55]
*** zigo_ is now known as zigo [19:57]
opendevreview: Bryan Neumann proposed openstack/cinder master: Dell EMC: PowerMax - Configurable SRDF snapshots  https://review.opendev.org/c/openstack/cinder/+/899051 [21:53]
