Tuesday, 2024-11-12

*** f0o_ is now known as f0o06:21
gokhan_Good morning folks, first member of the rabbitmq hosts is failed and can not be added to cluster again. I am trying to solve this with rabbitmq-upgrade=true. is it possible to change bootstrap host from member 1 to member 2 ? 07:07
noonedeadpunkgokhan_: yes, sure that's possible08:48
noonedeadpunkthere's a variable rabbitmq_primary_cluster_node https://opendev.org/openstack/openstack-ansible-rabbitmq_server/src/branch/master/defaults/main.yml#L4308:49
noonedeadpunkbut I guess I'd try to recover manually there, by running `rabbitmqctl forget_cluster_node`08:51
noonedeadpunkand then on failed node `rabbitmqctl join_cluster`08:51
noonedeadpunkpotentially, you'd need to drop mnesia db on failed node before joining the cluster08:51
gokhan_thanks noonedeadpunk  I tried manually to solve issues but it seems all nodes are crashed. I rebuild rabbitmq containers again and solved issues.  10:55
noonedeadpunk#startmeeting openstack_ansible_meeting15:00
opendevmeetMeeting started Tue Nov 12 15:00:38 2024 UTC and is due to finish in 60 minutes.  The chair is noonedeadpunk. Information about MeetBot at http://wiki.debian.org/MeetBot.15:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.15:00
opendevmeetThe meeting name has been set to 'openstack_ansible_meeting'15:00
noonedeadpunk#topic rollcall15:00
noonedeadpunko/15:00
mgariepyhey15:01
mgariepybeen a while 15:01
noonedeadpunkindeed! hope all is well15:01
mgariepyyep everything is going well :)15:02
mgariepyhow about you?15:02
noonedeadpunka bit /o\ with everything going on15:03
noonedeadpunkbut overall things are moving at least15:03
jrossero/ hello15:04
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-plugins master: Move healthcheck playbooks to collection  https://review.opendev.org/c/openstack/openstack-ansible-plugins/+/93361015:04
noonedeadpunk#topic office hours15:04
noonedeadpunkSo, mariadb 11.4 is pretty much ready15:05
noonedeadpunkI still had to disable TLS verification, but it affects only mariabackup15:05
noonedeadpunk#link https://review.opendev.org/c/openstack/openstack-ansible-galera_server/+/922377/12/templates/cluster.cnf.j215:05
noonedeadpunkregarding rabbitmq 4.0 - I somehow thought that we're using 3.13 for 2024.1 :(15:06
noonedeadpunk#link https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93406015:07
noonedeadpunkbut it's indeed 3.12 :(15:07
noonedeadpunkI was thinking it was part of 2024.1 but it's not https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/92237815:08
noonedeadpunkso I wonder what are we gonna do with that...15:08
noonedeadpunkas this means - we can't have 4.0 for 2025.1 then as well15:08
andrewbonneyI hadn't thought about SLURP impact on RMQ version jumps. That's not ideal15:11
mgariepy3.13 is EOL as of now for community release15:12
noonedeadpunkwell, community is supporting always only current release nowdays15:12
noonedeadpunkso we can't always be on latest for stable branches anyway15:13
noonedeadpunkSo eventually we either need to wait for 2025.2 for 4.0, or backport 3.13 to 2024.1 and write a release note with requirement of minor upgrade before proceeding15:13
noonedeadpunkneither are ideal to say the least15:13
andrewbonneyIf we wait for 4.0 it feels like there's a risk of getting way behind, subject to RMQ's release cadence15:14
mgariepyA pre-step to upgrade rabbit on 2024.1 for SLURP to 2025.1 isn't too bad IMO.15:18
noonedeadpunkprobably it's a lower risk indeed...15:18
noonedeadpunkbad we realized this after getting 29.1.0 out though15:19
noonedeadpunkbut we pushed it out not too far ago15:19
andrewbonneyCould we add any automation to step through major versions from current to target, or does that feel awkward? Feels like this may come up again in future15:19
noonedeadpunkI'd say it's indeed quite awkward... While it could be possible, I'm not sure we want to have such complexity in code15:20
andrewbonneyFair enough15:21
noonedeadpunkas we have a check for current rabbitmq version. so we can add another upgrade loop somehow... somewhere...15:21
noonedeadpunkbut it should be on playbook level anyway I assume15:21
noonedeadpunknot on a role level15:22
noonedeadpunkso it will be not neat at all15:22
noonedeadpunkbut I think indeed that bumping rabbitmq version for 2024.1 might be best option now.15:26
noonedeadpunkbut then there will be 2 concenring things with 4.015:27
noonedeadpunkfirst - quorum queues requirement, and second 3.13 requirement.15:27
noonedeadpunknone of them are critical to have that said15:27
noonedeadpunkI will push patch to 2024.1 I guess and some release notes supporting this requirement15:28
noonedeadpunkI also wrote some doc around how I was doing pretty endpoint names 15:28
noonedeadpunk#link https://review.opendev.org/c/openstack/openstack-ansible/+/93453615:28
andrewbonneynoonedeadpunk: I think if backporting 3.13 we'll also need Idb5c02109458771853e0fdbc7f6bb27beaa731b4, otherwise broken experimental feature flags will get enabled15:29
andrewbonneySee 'khepri_db' in https://www.rabbitmq.com/docs/upgrade15:30
NeilHanlono/ way late, sry. triple double booked today because of timezone fun 🙃15:32
noonedeadpunkandrewbonney: I thought you wanted to link another patch, but I see what you mean15:33
noonedeadpunkI thought that's also in 2024.1 though :D15:33
mgariepywe also need to make sure that the rabbitmq validate the version before upgrade in case older branch of 2024.1 was deployed and not upgraded before the SLURP to 2025.115:36
noonedeadpunkthat is a very good point15:37
noonedeadpunkand we can fail with meaningfull error if we can't15:44
noonedeadpunkupgrade15:44
mgariepybefore the cluster is screwed :D hahha15:48
noonedeadpunkpreferably "D15:49
noonedeadpunkand we're less then 1 month until final release15:49
noonedeadpunkwe also have quite some outstanding reviews15:50
noonedeadpunk#link http://bit.ly/osa-review-board-v516:00
noonedeadpunk#endmeeting16:00
opendevmeetMeeting ended Tue Nov 12 16:00:22 2024 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)16:00
opendevmeetMinutes:        https://meetings.opendev.org/meetings/openstack_ansible_meeting/2024/openstack_ansible_meeting.2024-11-12-15.00.html16:00
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/openstack_ansible_meeting/2024/openstack_ansible_meeting.2024-11-12-15.00.txt16:00
opendevmeetLog:            https://meetings.opendev.org/meetings/openstack_ansible_meeting/2024/openstack_ansible_meeting.2024-11-12-15.00.log.html16:00
opendevreviewMerged openstack/openstack-ansible-os_rally master: Add retries for UC fetching over HTTP  https://review.opendev.org/c/openstack/openstack-ansible-os_rally/+/93414417:20
opendevreviewMerged openstack/openstack-ansible-haproxy_server master: Do not add vip['address'] so SAN if haproxy is binded to interface  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/93454917:37
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_rally stable/2024.1: Add retries for UC fetching over HTTP  https://review.opendev.org/c/openstack/openstack-ansible-os_rally/+/93481517:44
mgariepywhy so much post_failure ? 17:54
fricklerinfra issues17:54
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Get exact version of installed rabbitmq  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93482218:22
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Bump RabbitMQ version to 4.0  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93406018:40
noonedeadpunkandrewbonney: I'm actually a bit surprised that feater flags are not enabled on post-upgrade. As I would expect they should be according to my understanding18:41
noonedeadpunkas in case of cluster is healthy - this should execute anyway? https://opendev.org/openstack/openstack-ansible-rabbitmq_server/src/branch/master/tasks/main.yml#L85-L11118:41
noonedeadpunkmaybe some kind of race condition....18:42
andrewbonneyI'll take another look tomorrow. After a run in an AIO that was my finding, but I did have to back out of the 4.0 upgrade, so possible something was in an unusual state18:51
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server stable/2024.1: Upgrade RabbitMQ to 3.13 for 2024.1  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93482619:19
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible master: [doc] Document requirement for RabbitMQ upgrade  https://review.opendev.org/c/openstack/openstack-ansible/+/93482819:28
opendevreviewMerged openstack/openstack-ansible-haproxy_server master: Generate PKI SAN as a list  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/93455020:00
opendevreviewMerged openstack/openstack-ansible-ceph_client stable/2024.1: Fix a list of packages allowed from epel  https://review.opendev.org/c/openstack/openstack-ansible-ceph_client/+/93453222:38

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!