Tuesday, 2020-11-24

openstackgerrit Hervé Beraud proposed openstack/oslo.messaging master: fix variable name  https://review.opendev.org/c/openstack/oslo.messaging/+/763975 13:49
hberaud kgiusti: o/ do you think it makes sense to also `enable_cancel_on_failover` on `_drivers.impl_rabbit.Connection._publish_and_creates_default_queue`? https://github.com/openstack/oslo.messaging/blob/master/oslo_messaging/_drivers/impl_rabbit.py#L1276 13:53
* kgiusti looks... 13:54
hberaud as it was added on the consumer https://github.com/openstack/oslo.messaging/blob/master/oslo_messaging/_drivers/impl_rabbit.py#L285 13:56
gmann hberaud: thanks 14:04
hberaud kgiusti: I wonder if it can help to solve https://bugzilla.redhat.com/show_bug.cgi?id=1898578 by notifying the server that the binding is deleted. 14:05
openstack bugzilla.redhat.com bug 1898578 in openstack-nova "[OSP 16.1] n-cpu raising MessageUndeliverable when replying to RPC call" [High,New] - Assigned to nova-maint 14:05
gmann stephenfin: if you are around https://review.opendev.org/c/openstack/oslo.upgradecheck/+/763484 14:06
kgiusti hberaud: I really don't know for sure.  According to the rabbitmq docs, the x-cancel-on-ha-failover needs to be set by the _consumer_.  That publish_and_creates_default_queue is for _publishing_ clients (notify-send). 14:08
kgiusti hberaud: https://www.rabbitmq.com/ha.html#cancellation 14:09
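Editor's sketch of the point kgiusti makes above: the `x-cancel-on-ha-failover` flag travels with the consumer's `basic.consume` arguments, so a publish-only path has nothing to attach it to. The helper name `build_consumer_arguments` is hypothetical and is not taken from oslo.messaging; only the flag name and the `enable_cancel_on_failover` option name come from the discussion.

```python
# Hypothetical helper, not the oslo.messaging implementation.
def build_consumer_arguments(enable_cancel_on_failover: bool) -> dict:
    """Return the extra arguments to send with basic.consume for one consumer."""
    if enable_cancel_on_failover:
        # Asks RabbitMQ to cancel this consumer when its mirrored queue
        # fails over, instead of silently continuing.
        return {"x-cancel-on-ha-failover": True}
    # A consumer that does not opt in sends no extra arguments.
    return {}


if __name__ == "__main__":
    print(build_consumer_arguments(True))
    print(build_consumer_arguments(False))
```

With kombu, a dict like this would usually be handed to the queue's `consumer_arguments` parameter (an assumption about the client library in use; check the kombu documentation for your version).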
*** zzzeek has joined #openstack-oslo 14:09
hberaud I see thanks 14:09
kgiusti hberaud: we (oslo project) really need an in-house RabbitMQ expert.  It's a shame we've lost such talented folks. 14:12
hberaud kgiusti: also can you take a look at https://bugzilla.redhat.com/show_bug.cgi?id=1898578 , I submitted a reply from our oslo.messaging point of view however I'm not sure if we can do something from our side. 14:13
openstack bugzilla.redhat.com bug 1898578 in openstack-nova "[OSP 16.1] n-cpu raising MessageUndeliverable when replying to RPC call" [High,New] - Assigned to nova-maint 14:13
hberaud kgiusti: I agree 14:13
kgiusti hberaud: that response is fine - you've covered everything I could think of (and more). 14:33
kgiusti hberaud: given the CI history of failures we've seen in the rabbitmq failover tests there must be something the rabbitmq driver is doing wrong. 14:34
kgiusti hberaud: I've spent time trying to get that failover test to pass consistently and have gotten nowhere with it. 14:34
hberaud concerning the failover tests I suppose you mean https://review.opendev.org/c/openstack/oslo.messaging/+/763975  , right? 14:37
hberaud kgiusti: ^ 14:37
kgiusti hberaud: was a variable name issue causing the Rabbitmq failover tests (functional test) to fail frequently? 14:39
hberaud kgiusti: ah oops sorry wrong link 14:39
kgiusti hberaud: man *that* would've been excellent! :) 14:39
hberaud I was thinking about this one https://review.opendev.org/c/openstack/oslo.messaging/+/747892 14:40
hberaud kgiusti: ^ . Yes it would've been so straightforward :) 14:40
kgiusti hberaud: yeah {2} oslo_messaging.tests.functional.test_rabbitmq.RabbitMQFailoverTests.test_failover_scenario_enable_cancel_on_failover [0.368216s] ... FAILED 14:42
kgiusti hberaud: not sure if that was/is a Pifpaf issue 14:42
kgiusti hberaud: but that RabbitMQFailoverTests has been unstable for a while 14:43
* hberaud hates zuul's output on the latest gerrit version 14:44
hberaud tempest 25.0.1.dev81 has requirement jsonschema>=3.2.0, but you'll have jsonschema 3.0.2 which is incompatible. 14:48
hberaud ERROR: tempest 25.0.1.dev81 has requirement paramiko>=2.7.0, but you'll have paramiko 2.6.0 which is incompatible. 14:48
hberaud kgiusti: normally failover is now a non-voting job, however the full rabbit test is still a voting test and it blocks us, but none of the errors found in its logs look related to oslo.messaging (oslo.messaging-src-dsvm-full-rabbit) (https://zuul.opendev.org/t/openstack/build/ff791791893144ae9dac63c3919268cf/log/job-output.txt?severity=0) 14:53
hberaud kgiusti: concerning the failover test it looks like the pifpaf issue previously discussed in the patch's comments "WARNING [pifpaf.drivers] `psutil.Popen(pid=3322, status='terminated')` is already gone, sending SIGKILL to its process group" 14:55
hberaud kgiusti: however this functional test is now non-voting 14:56
kgiusti hberaud: ah, so the tempest test failure is unrelated to oslo.messaging.  Is there a fix for the pifpaf issue that's causing the functional test to fail?  Ideally we'd want that to be a voting job. 14:59
hberaud kgiusti: yes a fix exists (https://github.com/jd/pifpaf/pull/124), I already tried to introduce the fix for stein with => https://review.opendev.org/c/openstack/requirements/+/753239 I need to check which version of pifpaf is used on train, but the pifpaf errors seem similar 15:07
kgiusti hberaud: that's great news - we can enable voting for the functional test and simply skip the failover test if the version of pifpaf is broken. 15:09
hberaud kgiusti: on train pifpaf's upper-constraint is 2.2.0 and the fix landed after 2.4.0 so we are in the same situation as with stein 15:40
hberaud kgiusti: https://github.com/openstack/requirements/blob/stable/train/upper-constraints.txt#L404 15:40
hberaud kgiusti: https://review.opendev.org/c/openstack/requirements/+/753239 15:40
hberaud kgiusti: however some people aren't inclined to raise the pifpaf version on these stable branches 15:42
hberaud and yes we can simply skip the failover test 15:43
kgiusti hberaud: skipping that one test will at least allow the functional test to flag other breaking changes for those stable branches.  Best we can do if they cannot bump the upper constraint. 15:44
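Editor's sketch of the "skip the failover test when pifpaf is broken" idea discussed above. It is not the patch that was proposed. The names `parse_version`, `pifpaf_is_broken`, `skip_if_broken_pifpaf` and `FailoverTestsSketch` are hypothetical, and the cutoff assumes that "the fix landed after 2.4.0" means every release up to and including 2.4.0 lacks it.

```python
import unittest

# Assumption: releases <= 2.4.0 lack the fix (per "landed after 2.4.0").
LAST_BROKEN_PIFPAF = (2, 4, 0)


def parse_version(text):
    """Turn '2.2.0' into (2, 2, 0); stops at the first non-numeric piece."""
    parts = []
    for piece in text.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def pifpaf_is_broken(version_text):
    """True when the given pifpaf version is assumed to lack the fix."""
    return parse_version(version_text) <= LAST_BROKEN_PIFPAF


def skip_if_broken_pifpaf(version_text):
    """Decorator that skips a test when pifpaf is assumed to be broken."""
    return unittest.skipIf(
        pifpaf_is_broken(version_text),
        "pifpaf %s is assumed to lack the failover fix" % version_text,
    )


class FailoverTestsSketch(unittest.TestCase):
    # '2.2.0' is the train upper-constraint quoted in the log. A real test
    # could read the installed version with importlib.metadata.version().
    @skip_if_broken_pifpaf("2.2.0")
    def test_failover_scenario(self):
        self.fail("not reached: skipped with pifpaf 2.2.0")
```

Skipping only this one test keeps the rest of the functional job able to vote, which is the trade-off kgiusti describes.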
openstackgerrit Hervé Beraud proposed openstack/oslo.messaging stable/train: Cancel consumer if queue down  https://review.opendev.org/c/openstack/oslo.messaging/+/747892 15:52
hberaud kgiusti: ^ this new PS could be a good consensus 15:52
openstackgerrit Merged openstack/oslo.upgradecheck master: Add common checks for json to yaml policy file migration  https://review.opendev.org/c/openstack/oslo.upgradecheck/+/763484 15:53
kgiusti hberaud: indeed. Hopefully Zuul will agree :D 15:57
hberaud crossing my fingers 16:01
gmann hberaud: oslo.upgradecheck and oslo.policy patches are merged now. can we do a release for them ?  https://review.opendev.org/c/openstack/oslo.upgradecheck/+/763484  https://review.opendev.org/c/openstack/oslo.policy/+/763261 16:40
gmann so that these new things can be used on the project side 16:40
-openstackstatus- NOTICE: The Gerrit service on review.opendev.org is being restarted quickly to troubleshoot an SMTP queuing backlog, downtime should be less than 5 minutes 16:42
gmann hberaud: let me know if you want me to push the release patch and you can +2 16:42
hberaud gmann: sure feel free to submit the new release 17:17
hberaud gmann: however I hope you are not too much in a hurry because we are stuck by some SSH/gerrit issues that block our release tagging jobs http://lists.openstack.org/pipermail/openstack-discuss/2020-November/018976.html 17:20
hberaud gmann: hopefully all the pieces of the solution will be merged soon ( https://review.opendev.org/c/openstack/project-config/+/763797/ , https://review.opendev.org/c/openstack/project-config/+/763830/ , https://review.opendev.org/c/zuul/zuul-jobs/+/763834/ ) 17:22
gmann hberaud: ohk. let me propose and once release tagging fix those can merge 17:32
hberaud gmann: ack thanks 17:32
*** iurygregory has joined #openstack-oslo 18:27
openstackgerrit David Vallee Delisle proposed openstack/oslo.config master: Adding --check-defaults to validator  https://review.opendev.org/c/openstack/oslo.config/+/759338 18:55
openstackgerrit Ghanshyam proposed openstack/oslo.policy master: Fix oslopolicy-j2y-convertor tool for RuleDefault  https://review.opendev.org/c/openstack/oslo.policy/+/764056 19:30
gmann hberaud: lbragstad one more fix in oslo.policy which i would like to include in the new release  - https://review.opendev.org/c/openstack/oslo.policy/+/764056 20:58
hberaud gmann: ack, I'll take a look later 20:58
gmann this was found while generating the yaml file for glance which is still using RuleDefault instead of DocumentedRuleDefault 20:59
gmann hberaud: thanks 20:59
lbragstad gmann that makes sense - looks good to me but i'll let hberaud kick it through after he takes a look 21:01
gmann sure, lbragstad  thanks 21:04
hberaud gmann: done 21:17
gmann hberaud: thanks 21:17
openstackgerrit Merged openstack/oslo.policy master: Fix oslopolicy-j2y-convertor tool for RuleDefault  https://review.opendev.org/c/openstack/oslo.policy/+/764056 22:25
