Wednesday, 2024-02-14

opendevreviewDmitriy Rabotyagov proposed openstack/ansible-hardening master: Use valid value for CREATE_HOME  https://review.opendev.org/c/openstack/ansible-hardening/+/90897708:44
jrossergood morning08:50
noonedeadpunko/08:54
noonedeadpunkit's so nice being just 2 pages of bugs :)09:00
noonedeadpunkor well, 3 bugs short of that...09:01
opendevreviewAndrew Bonney proposed openstack/openstack-ansible-plugins master: Add override for gluster host used for bootstrap operations  https://review.opendev.org/c/openstack/openstack-ansible-plugins/+/90898110:07
opendevreviewAndrew Bonney proposed openstack/openstack-ansible master: [doc] Use bootstrap node override for gluster primary upgrade  https://review.opendev.org/c/openstack/openstack-ansible/+/90898210:09
opendevreviewJonathan Rosser proposed openstack/openstack-ansible master: Replace common-tasks with lxc_container_setup role from plugins collection  https://review.opendev.org/c/openstack/openstack-ansible/+/90898410:22
noonedeadpunkjrosser: maybe instead of ^ it makes sense to finally just move playbooks as a whole?11:36
noonedeadpunkI can check later today on that11:36
noonedeadpunkor well, if we wanna go this route ofc11:37
jrosserso 908984 is needed for the capi patches anyway (or indeed any extension of osa that is coming from a collection)11:44
jrosserand it's a bit independant if we move the other playbooks or not, but yes thats totally an option11:44
noonedeadpunkah, ok, got what you mean11:45
noonedeadpunkyeah, makes sense11:45
jrosserELK would be in a similar position if we converted that to a proper collection in the ops repo11:45
noonedeadpunkI guess one note though...11:45
jrosseroh hold on 908984 is not actually needed to merge the capi patches as we already put the role in the plugins repo11:46
jrosserit's more like a tidy up to remove duplicate code11:46
noonedeadpunkyeah, but it make sense before/after moving anyway11:46
noonedeadpunkthe only thing we need to update a-c-r as well11:46
jrossermaybe that should be `master` rather than a SHA?11:47
noonedeadpunkwell, might be. we do test master in CI anyway....11:48
nixbuilderI have my new deployment installed with three infra nodes and using the haproxy.  To ease troubleshooting I shutdown infra02 and infra03.  However when I do that my neutron-metadata is throwing connection errors (https://paste.openstack.org/show/bcZv6gDc5l2nshpWwH2Z/) and a test instance can no longer ping 169.254.169.254.  So I assume I missed something on the install that causes problems with high11:48
nixbuildera service.  What could I have missed?11:48
noonedeadpunknixbuilder: so, services must reconnect to currently active rabbitmq server11:49
noonedeadpunkit's up to client to find one, as long as it has all 3 configured11:49
noonedeadpunkone thing about neutron-metadata-agent - is that its logging not great,,,,11:50
jrossernixbuilder: there might also be more subtle ways to do troubleshooting, shutdown 2 of 3 infra nodes is pretty invasive11:50
opendevreviewJonathan Rosser proposed openstack/openstack-ansible master: Replace common-tasks with lxc_container_setup role from plugins collection  https://review.opendev.org/c/openstack/openstack-ansible/+/90898411:53
nixbuilderjrosser: Fortunately this new deployment is not in production yet so it's no big deal.11:55
jrosserif you want to make all requests go to one particular host then you can put backends in maintainance in haproxy11:55
nixbuildernoonedeadpunk: My rabbitmq server on my remaining infra01 node is up and running... https://paste.openstack.org/show/b8e1NguiEZjILj8WvIbu/11:57
noonedeadpunkso, eventually it might be neutron-metadata who just does not log successfull reconnection, for instance11:58
noonedeadpunkbut also what sucks utterly, is that there;s no dst logged which it tried to connect to...11:58
noonedeadpunkusually it would include host and port it tried to connect11:59
jrosserisn't there some oslo logging syntax you can use12:03
noonedeadpunknot sure? as that's basically exception text I assume12:04
noonedeadpunkbut maybe I just don't know about that :)12:05
opendevreviewMerged openstack/openstack-ansible-os_nova stable/2023.2: Fix nova device_spec to support multiple values  https://review.opendev.org/c/openstack/openstack-ansible-os_nova/+/90881112:06
nixbuildernoonedeadpunk: Still doing some testing... brought infra02 back up.  Still no joy.  Brought infra03 back up... still no joy.  Rebooted my test instance... everything is back to normal.  So what I am wondering is if one of my infra nodes does crash, does that mean that new instances cannot be created do to failure of high availability???  Like I said... still testing this out.12:20
noonedeadpunkum, no, I don't think it's a case in fact. or wel, it should not be at very least by multiple reasons12:26
noonedeadpunkSo apparently, services got disconnected from rabbitmq and need some time to re-connect to it12:26
noonedeadpunkleft rabbitmq node, in case of enabled ha queues or quorum queues should contain messages left for services in such cases12:27
noonedeadpunkso ideally, once they re-connect, they should consume messages and execute required RPC12:27
noonedeadpunkBut then it can be some service didn't for $reason send an rpc, which can legitimaly result in metadata not providing metadata as it doesn't know anything about instance12:29
opendevreviewMerged openstack/openstack-ansible-ceph_client stable/2023.2: Don't load systemd parent service for object cache  https://review.opendev.org/c/openstack/openstack-ansible-ceph_client/+/90880912:30
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_ironic stable/2023.2: Fix a typo in pxe_redfish definition  https://review.opendev.org/c/openstack/openstack-ansible-os_ironic/+/90893212:34
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_ironic stable/2023.1: Fix a typo in pxe_redfish definition  https://review.opendev.org/c/openstack/openstack-ansible-os_ironic/+/90893312:35
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_ironic stable/zed: Fix a typo in pxe_redfish definition  https://review.opendev.org/c/openstack/openstack-ansible-os_ironic/+/90893412:35
opendevreviewMerged openstack/openstack-ansible-ops master: Add collection to deploy magnum cluster-api with vexxhost driver  https://review.opendev.org/c/openstack/openstack-ansible-ops/+/90145012:58
opendevreviewMerged openstack/openstack-ansible-ops master: Cluster API Bootstrapping playbook  https://review.opendev.org/c/openstack/openstack-ansible-ops/+/90217813:14
opendevreviewMerged openstack/openstack-ansible-ops master: Add role to install and run sonobouy k8s validation tests  https://review.opendev.org/c/openstack/openstack-ansible-ops/+/90605413:54
mgariepynoonedeadpunk, for the bridge stuff, i gueess it depends on a few factor like lxb vs ovs and ovs openflow vs iptables shim stuff no ?14:05
opendevreviewMerged openstack/openstack-ansible stable/2023.2: [doc] Update dist upgrade guide for 2023.1 / Ubuntu Jammy  https://review.opendev.org/c/openstack/openstack-ansible/+/90880414:06
noonedeadpunkmgariepy: I think not? As even for LXB, for br-vxlan can stand any interface - it just need to have an IP from tunnel14:12
noonedeadpunksame pretty much for br-storage on computes - they just need l3 access?14:13
noonedeadpunkand then br-vlan will be added to another bridge - so bridge in a bridge situation14:13
mgariepyhmm ok right14:16
noonedeadpunkand then on computes/net nodes even br-mgmt can be just interface ...14:17
noonedeadpunkwe jsut have bridges requirement in doc for consistency of naming14:17
mgariepyhmm yeah ok ;) haha14:18
noonedeadpunkeventually, if you have sr-iov on controllers - you can do without bridges even there kinda.... 14:21
noonedeadpunkat least I had a use-case of passing IB devices to containers directly, withoug br-stor14:21
mgariepyyeah but well i like simple networking on the controllers. lol14:21
noonedeadpunksure, totally.14:22
noonedeadpunkI gues point was - would be nice to be explicit that it's written like that just for naming consistency14:22
mgariepyi only do bridges on the controllers theses days. for lxc containers.14:22
noonedeadpunkyeah, fair, same here14:22
mgariepyyeah, it does also simplify the config.14:22
noonedeadpunktried OVS bridges, but haven't found any difference more or less14:23
mgariepyno need to overrides the neutron agent config on some esoteric named interfaces..  that changes on distro upgrade.14:23
noonedeadpunkwell, you can name interface to any name with systemd-networkd or netplan?14:24
mgariepythat's what i do.14:24
noonedeadpunklike nothing stops naming a regular vlan as br-mgmt lol14:24
mgariepyyes something stops me. future-self will kill me if i do that 14:25
noonedeadpunklol14:25
noonedeadpunktrue14:25
noonedeadpunkwould be sweet 1st april joke to convert some environment to jsut intrafeces but name them as bridges...14:25
mgariepymy point being more that we need eitehr do document how to do that, or warn that you will need to do a bunch of overrides.14:25
mgariepywho doesn't like interface named like that: enp45s0u1u3u3 haha14:27
noonedeadpunkbut yeah, I kinda agree here, that this really huge doc refactoring would be needed with examples and drawings14:27
opendevreviewMerged openstack/openstack-ansible-ceph_client stable/2023.1: Don't load systemd parent service for object cache  https://review.opendev.org/c/openstack/openstack-ansible-ceph_client/+/90881015:39
opendevreviewMerged openstack/openstack-ansible master: Allow virtualisation type to be defined in a test scenario  https://review.opendev.org/c/openstack/openstack-ansible/+/90732716:16
noonedeadpunknixbuilder: actually, I've seen `Connection failed: [Errno 111] ECONNREFUSED (retrying in 11.0 seconds): ConnectionRefusedError: [Errno 111] ECONNREFUSED` but that was when I did not defined transport_url in neutron.conf at all...17:14
nixbuildernoonedeadpunk: Thanks for the tip... let me check that.17:15
noonedeadpunkbut I doubt it will work on it's own again...17:17
noonedeadpunkbut otherwise I see where it tries to connect at very least...17:17
noonedeadpunkfound quite /o\ thing in couple of roles: https://opendev.org/openstack/openstack-ansible-os_neutron/src/branch/stable/yoga/tasks/neutron_post_install.yml#L136-L14217:58
noonedeadpunkalso present in cinder https://opendev.org/openstack/openstack-ansible-os_cinder/src/branch/master/tasks/cinder_post_install.yml#L131-L13717:58
noonedeadpunkin first case it makes rootwrap.d mode 0640, in second all files in rootwrap.d 75017:58
noonedeadpunkI'm also not convinced at all why this task exist at the first place....17:59
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_neutron master: Fix permissions for rootwrap files  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/90903418:03
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_neutron master: Add VPNaaS OVN support  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/90834118:06
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/2023.1: [doc] Update documentation for galera cluster recovery  https://review.opendev.org/c/openstack/openstack-ansible/+/90757621:14
opendevreviewMerged openstack/openstack-ansible-os_nova stable/2023.2: Evaluate my_ip address once  https://review.opendev.org/c/openstack/openstack-ansible-os_nova/+/90869822:55
opendevreviewMerged openstack/openstack-ansible-os_masakari stable/2023.2: Updated from OpenStack Ansible Tests  https://review.opendev.org/c/openstack/openstack-ansible-os_masakari/+/90307322:55
opendevreviewMerged openstack/openstack-ansible-os_nova stable/2023.1: Fix nova device_spec to support multiple values  https://review.opendev.org/c/openstack/openstack-ansible-os_nova/+/90881223:01
opendevreviewMerged openstack/openstack-ansible stable/2023.1: [doc] Update dist upgrade guide for 2023.1 / Ubuntu Jammy  https://review.opendev.org/c/openstack/openstack-ansible/+/90880523:52
opendevreviewMerged openstack/openstack-ansible stable/2023.1: [doc] Update documentation for galera cluster recovery  https://review.opendev.org/c/openstack/openstack-ansible/+/90757623:52

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!