Thursday, 2025-02-06

opendevreviewMichal Arbet proposed openstack/kolla master: Fix permissions for ironic metrics  https://review.opendev.org/c/openstack/kolla/+/94051406:16
opendevreviewMichal Arbet proposed openstack/kolla master: Fix permissions for ironic metrics  https://review.opendev.org/c/openstack/kolla/+/94051406:24
opendevreviewMichal Arbet proposed openstack/kolla master: Install pycadf from pypi package  https://review.opendev.org/c/openstack/kolla/+/94060506:31
opendevreviewMichal Arbet proposed openstack/kolla master: Fix permissions for ironic metrics  https://review.opendev.org/c/openstack/kolla/+/94051406:31
kevkofrickler: replied to your comments 06:44
opendevreviewRoman Krcek proposed openstack/kolla-ansible master: Merge of container_facts modules  https://review.opendev.org/c/openstack/kolla-ansible/+/91246008:47
opendevreviewRoman Krcek proposed openstack/kolla-ansible master: Add action for getting container names list  https://review.opendev.org/c/openstack/kolla-ansible/+/92438908:47
opendevreviewRoman Krcek proposed openstack/kolla-ansible master: Add container engine migration scenario  https://review.opendev.org/c/openstack/kolla-ansible/+/83694108:47
opendevreviewGrzegorz Koper proposed openstack/kolla-ansible stable/2024.2: Fix Grafana datasource update  https://review.opendev.org/c/openstack/kolla-ansible/+/94085608:52
opendevreviewGrzegorz Koper proposed openstack/kolla-ansible stable/2024.1: Fix Grafana datasource update  https://review.opendev.org/c/openstack/kolla-ansible/+/94085708:52
opendevreviewGrzegorz Koper proposed openstack/kolla-ansible stable/2023.2: Fix Grafana datasource update  https://review.opendev.org/c/openstack/kolla-ansible/+/94085808:52
opendevreviewPierre Riteau proposed openstack/kayobe stable/2024.2: Support forcing time synchronisation  https://review.opendev.org/c/openstack/kayobe/+/94086910:05
opendevreviewMatt Crees proposed openstack/kayobe master: Drop kolla-tags and kolla-limit  https://review.opendev.org/c/openstack/kayobe/+/93566911:06
kevkofrickler: can u please review comments so we can continue reviewing/merging rabbitmq queue manager ? 11:52
opendevreviewVerification of a change to openstack/kayobe master failed: Replace pause with chronyc waitsync in ntp sync  https://review.opendev.org/c/openstack/kayobe/+/94064612:05
opendevreviewMerged openstack/kolla-ansible master: Fix Grafana datasource update  https://review.opendev.org/c/openstack/kolla-ansible/+/94013312:06
andreykurilinHi folks! Can someone suggest the direction to dig? One of CI jobs failed with `NoneType: None` message on `kolla-ansible reconfigure -i /etc/kolla/inventory -vvv` step.12:11
opendevreviewGrzegorz Koper proposed openstack/kolla-ansible stable/2024.2: Fix Grafana datasource update  https://review.opendev.org/c/openstack/kolla-ansible/+/94085612:36
opendevreviewGrzegorz Koper proposed openstack/kolla-ansible stable/2024.1: Fix Grafana datasource update  https://review.opendev.org/c/openstack/kolla-ansible/+/94085712:36
opendevreviewGrzegorz Koper proposed openstack/kolla-ansible stable/2023.2: Fix Grafana datasource update  https://review.opendev.org/c/openstack/kolla-ansible/+/94085812:37
opendevreviewMichal Arbet proposed openstack/kolla master: Fix permissions for ironic metrics  https://review.opendev.org/c/openstack/kolla/+/94051412:42
opendevreviewMichal Arbet proposed openstack/kolla master: Install pycadf from pypi package  https://review.opendev.org/c/openstack/kolla/+/94060512:43
opendevreviewMichal Arbet proposed openstack/kolla master: Fix permissions for ironic metrics  https://review.opendev.org/c/openstack/kolla/+/94051412:43
opendevreviewJakub Darmach proposed openstack/kolla stable/2024.1: Add support for Ubuntu 24.04 LTS  https://review.opendev.org/c/openstack/kolla/+/93238614:35
dcapone2004when upgrading rabbitmq between 2023.1 and 2023.2, the docs reference om_enable_rabbitmq_quorum_queues  and om_enable_rabbitmq_high_availability ... neither of these options are in my current globals.yml file on 2023.1.... is this correct?15:11
dcapone2004I am also not seeing that variable in etc_examples folder of the 2023.2 release, so I am tring to follow the docs but I am unsure where these variables are set and how to safely upgrade rabbitmq for a transition to 2023.215:22
opendevreviewAndrey Kurilin proposed openstack/kolla-ansible master: Makes Grafana database ssl options configurable  https://review.opendev.org/c/openstack/kolla-ansible/+/94088615:23
priteaudcapone2004: The quorum queue option was added as a backport after release: https://review.opendev.org/c/openstack/kolla-ansible/+/90238715:25
priteauYou can probably diff the changes to bring your own globals.yml up to date.15:26
dcapone2004priteau: I assume I need that variable in my globals.yml to safely upgrade?15:26
priteauThere is a precheck that will complain if you are not using either om_enable_rabbitmq_quorum_queues or om_enable_rabbitmq_high_availability15:27
priteauYou need to pick one15:27
dcapone2004it is a bit confusing as the "upgrade" page of the 2023.2 docs do not make mention of separately upgrading rabbitmq unless you are performing a SLURP update, but if you did into the rabbitmq configuration page, it has a separate specific procedure for upgrading rabbitmq, so I am a little confused if the rabbitmq separate upgrade is needed15:28
dcapone2004dig*15:28
kevkoandreykurilin: link ? 15:36
andreykurilinkevko: https://zuul.opendev.org/t/openstack/build/1616a7134fe84526a50fdf185329255c from https://review.opendev.org/c/openstack/kolla-ansible/+/94082515:37
dcapone2004priteau:  also, fyi it appears that the inclusion of the variable itself into the globals.yml was left off the merge and updates as I reviewed the last version of globals.yml in the KA repo and it also does not include the variables, so diff I do not think would help here15:37
dcapone2004it actually isn't included in any of the globals.yml files up to and including the current master repo .... so does that variable get defined in globals.yaml or elsewhere?15:41
kevkoandreykurilin: non-related i think 15:42
kevkoandreykurilin: mariadb didn't create a cluster 15:43
kevkoandreykurilin: 2025-02-05 19:46:13 2 [ERROR] WSREP: ./gcs/src/gcs.cpp:s_join():990: Sending JOIN failed: -103 (Connection was closed).15:43
kevko2025-02-05 19:46:13 2 [Warning] WSREP: Failed to JOIN the cluster after SST gcs_join(01e127e1-e3f9-11ef-9365-b799a7089f5d:21) failed15:43
kevkoandreykurilin: https://ee4b9eb34e1fd5d099ea-4540e2fcfebb146227dc28f41e3112d0.ssl.cf5.rackcdn.com/940825/1/check/kolla-ansible-ubuntu-mariadb/1616a71/primary/logs/kolla/mariadb/mariadb.txt15:43
kevkoandreykurilin: so monitor user was not created ..and playbook failed 15:43
kevkoandreykurilin: just recheck 15:43
bbezakfrickler: got issue with that change - https://review.opendev.org/c/openstack/kolla-ansible/+/910503. not sure why it can't workflow - as related changed got merged - however somehow on the different patchset15:44
dcapone2004ok, in digging further, I think I understand....on 2023.1 by default transient queues are enabled.... ONLY IF you want to switch to durable queues, does the extra steps need to be followed15:48
dcapone2004however, if upgrading to 2023.2, to maintain transient queues, I need to add om_enable_rabbitmq_quorum_queues : false and om_enable_rabbitmq_high_availability: true to the globals.yml file before upgrading15:49
dcapone2004am I also correct that switching to durable queues requires downtime???  The procedure indicates a need to stop services that rely on rabbitmq to make the change which afaik includes neutron and also afaik if neutron containers are stopped, network traffic will stop15:50
andreykurilinkevko: thank you for looking!15:53
priteaudcapone2004: that's correct, it requires a full stop of all services using RMQ16:00
dcapone2004priteau:  thanks....am I correct that I am safely upgrade without downtime, by simply using KA upgrade IF I set om_enable_rabbitmq_quorum_queues : false and om_enable_rabbitmq_high_availability: true in globals.yml with no other special steps?16:02
fricklerbbezak: depends-on isn't merged https://review.opendev.org/c/openstack/kolla-ansible/+/89961516:05
bbezakRight. I didn't notice in this long commit msg. Thank you16:06
-opendevstatus- NOTICE: nominations for the OpenStack PTL and TC positions are now open, for details see https://governance.openstack.org/election/16:08
opendevreviewMassimiliano Favaro-Bedford proposed openstack/kayobe master: Add variable to allow image builds of different platforms.  https://review.opendev.org/c/openstack/kayobe/+/94089416:13
kevkodcapone2004: well, we can't stop services ..we will migrate it on the fly 16:14
kevkodcapone2004: but we need to patch oslo.messaging i think ...and still not done 16:14
priteaudcapone2004: If you are not already using ha queues, I think you will need full stop?16:15
kevkobbezak: https://review.opendev.org/c/openstack/kolla-ansible/+/940496 ? 16:15
kevkocan u please ? 16:15
dcapone2004kevko:  the docs as of now say a full stop is needed to go from transient to durable queues....I think your comment is saying that you intend to change this requirement in the future, but it isn't completed yet16:16
kevkodcapone2004: well, switch to quorum queues can be fatal ...we have +/- 150 computes and  thousands of k8s running on top .... if there  is some service down ...it's really fatal for us ...16:17
kevkodcapone2004: so, we will probably patch oslo.messaging and provide migration path somehow ...from a code it can be done when I've checked firstly ...16:18
kevkodcapone2004: but yeah, official opendev or kolla way don't exist ...just stop ..reset ..restart yeagh 16:18
dcapone2004kevko:  so safest way to upgrade from 2023.1 to 2023.2 and beyond for the time being is staying on transient queues16:19
dcapone2004kevko:  which I can do like any othe rupgrade in the past, clone 2023.2 repo, merge passwords / globals.yml variables16:19
dcapone2004kevko: add om_enable_rabbitmq_quorum_queues : false and om_enable_rabbitmq_high_availability: true to the globals.yml file16:20
kevkodcapone2004: correct 16:20
dcapone2004kevko: run kolla-ansible upgrade...16:20
kevkodcapone2004: exactly 16:20
kevkodcapone2004: you can run it until rabbitmq 4.0 will be here in kolla 16:21
dcapone2004kevko: awesome, thank you very much....all the rabbitmq noise was stressing me out lol16:21
kevkodcapone2004: I like this noise; after all, if everything worked as it should, many of us wouldn't have a job , right ? :D 16:22
dcapone2004kevko: valid point16:27
dcapone2004kevko:  one last question, does this mean that openstack and KA is going to remain on RMQ 3.13 for the foreseeable future until that issue is fixed as the docs mentioned support for transient queues was dropped in RMQ 4.0?16:56
dcapone2004kevko: I am essentially trying to make sure I do not upgrade to an openstack release that requires the transition to quorum queues and want to know where my eol may be16:57
opendevreviewJakub Darmach proposed openstack/kolla stable/2024.1: Add support for Ubuntu 24.04 LTS  https://review.opendev.org/c/openstack/kolla/+/93238616:58
priteaudcapone2004: the move to rmq 4.0 is planned for 2025.1 (Epoxy)17:01
dcapone2004ok, so do not upgrade to 2025.1 without having a solution to switch to quorum queues17:01
dcapone2004got it17:01
dcapone2004ok.....last thing I am noticing merging the configs is that the default ceph keyrings seem to have dropped "ceph." from the start of them, my existing keyring files are all ceph.client.<service>.keyring ... do the playbooksnow automatically prepend "ceph." to the value specified in the globals.yml or do I need to ensure my globals.yml include the ceph.17:38
dcapone2004trying to look through the playbooks to check, but hoping I could get a good answer here17:41
priteaudcapone2004: I believe it is transparent. See https://review.opendev.org/c/openstack/kolla-ansible/+/87741317:44
priteauIn particular, see cinder_ceph_backends in ansible/roles/cinder/defaults/main.yml and how its "cluster" key is used in e.g. https://review.opendev.org/c/openstack/kolla-ansible/+/877413/17/ansible/roles/cinder/tasks/external_ceph.yml17:45
dcapone2004priteau: thx for the references, saved me some digging17:46
opendevreviewPierre Riteau proposed openstack/kayobe master: Replace pause with chronyc waitsync in ntp sync  https://review.opendev.org/c/openstack/kayobe/+/94064617:57
dcapone2004is the gerrit system the best place to make the suggestion that the raabitmq queue variables to be added into the globals.yml example file and commented for the impacts?18:02
priteauSure, you can submit a patch on Gerrit, core reviewers will give their opinion18:16
priteauhttps://docs.openstack.org/kolla-ansible/latest/contributor/contributing.html18:17
-opendevstatus- NOTICE: The Gerrit service on review.opendev.org will be offline momentarily while we upgrade for a new jeepyb feature and switch our database container image source repository18:52
shermanmhey, I had a (possibly silly) question about dns resolution when using ovs / neutron-dhcp-agent20:18
shermanmit seems like dnsmasq defaults to trying to act as a dns resolver from each dhcp instance, but if a subnet has no router attached, then dnsmasq can't reach any upstream resolvers, not even one on the host outside its network namespace20:18
shermanmis there a sane way to provide access to external dns resolution for subnets without a router?20:18
shermanmthe context being that I'm trying to get internal dns resolution to work, and I can't seem to get dnsmasq to handle that case if it can't also reach an upstream resolver20:22
fricklershermanm: you can tell dnsmasq to use the host networking, see https://docs.openstack.org/neutron/latest/admin/config-dns-res.html#case-2b-queries-are-forwarded-to-dns-resolver-s-configured-on-the-host20:42
shermanmfrickler: so, we tried that, in which case it attempts to use the contents of /etc/resolv.conf (removes --no-resolv flag)20:43
shermanmbut, resolvers on the host, even e.g. 127.0.0.53, aren't accessible from the network namespace where the dnsmasq instance is running20:43
shermanmunless a router is attached to the relevant subnet20:44
shermanmthis is only an issue for us because, in the case that you create 2 subnets with default settings, only one with a router, and then attach an instance to both of them, dhcp will advertise a dns server for both21:09
shermanmand dns for the non-routable subnet will take 10s before rejecting any query, causing lots of "hangs" on the guest os querying it21:09
fricklershermanm: ah. you can do "openstack subnet set --dns-nameserver 0.0.0.0" in order to make dhcp not send an dns server21:31
shermanmyep! it's not an issue for our more expert users, but very non-intuitive for more novice ones. tbh if we could just set a default nameserver  for subnets to use, that would fix the user experience21:33
fricklershermanm: yes, I would have liked to switch the meaning of 0.0.0.0 and --no-dns-nameserver, but that wasn't possible in order to stay backwards compatible, so it had to be this kind of twisted operation21:36
fricklerhow would having a default nameserver help in the isolated subnet case?21:37
shermanmit would help because we could then disable resolution in dnsmasq entirely, without breaking the case where a subnet is routable, but didn't have a custom nameserver set via --dns-nameserver21:39
shermanmroutable subnets just use the default, and non-routable ones fail cleanly, because you get "no route available" to the upstream resolver, instead of a timeout before failing21:39
fricklershermanm: I see, it might make sense to add such an option to neutron, do you want to propose an RFE? but also, mostly everyone seems to be going the OVN path and that messes up DNS even more badly21:44

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!