Wednesday, 2023-04-05

opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/zed: Remove nova-scheduler from ironic compute container_skel  https://review.opendev.org/c/openstack/openstack-ansible/+/87948205:42
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/zed: Add documentation on refreshing hosts file  https://review.opendev.org/c/openstack/openstack-ansible/+/87948305:43
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/yoga: Add documentation on refreshing hosts file  https://review.opendev.org/c/openstack/openstack-ansible/+/87948405:43
kleiniGood morning, if you setup bonded network interfaces, which bond type do you use? OpenStack docs have active-backup in their examples. I used LACP and I am stumbling now across a larger amount of packet loss and try to find the cause for that.07:26
jrosserkleini: it kind of depends on how you have your switches set up really08:01
kleinitwo redundant switches08:02
kleiniso if one fails, the other one should be able to take over all traffic08:02
jrossera single switch with bonds to it, or a pair of switches with MLAG/VPC you can do active/active, if it's just a pair of switches with no fancy multichassis features then you need to do active/backup08:02
jrosseri have nexus9k and use cisco VPC to make portchannels across both switches, then can use active/active mode 4 bond between the two, for example08:04
kleinia pair of Lenovo switches doing MLAG08:04
jrosserhaving said that, it has been very vulnerable to bugs in both the switch firmware and NIC firmware08:05
jrosserand if i was starting again from scratch would probably look at trying to do L3 routing from each host instead08:05
kleiniwe're doing here the same, too. but I see a lot of packet loss and bad network throughput. multiple documents, howtos and guides now claim, that 802.3ad is not fully compliant implemented everywhere. Especially OVS should have problems with it.08:05
kleiniAre there any guides/introduction into L3 routing?08:07
jrosserwe also have a set of steps which must be followed *exactly* to upgrade switch firmware otherwise things go pretty bad if you don't do it exactly right08:07
jrosseri don't think i've seen a pure L3 implementation08:09
jrosserthat would be super interesting to work on08:09
noonedeadpunkmornings08:34
jrossergood morning08:35
jrosserneed to try to get some of the haproxy stuff moving this morning08:35
jrosserif damiandabrowski is around this is the bottom of the dependancy tree i think https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/87577908:37
noonedeadpunkI was just looking at this patch08:38
jrosseri see on a bunch of others there are unaddressed comments08:39
jrosseri also saw some repo gpg key errors on stable branch backports of the ovs fixes08:40
noonedeadpunkI assume that was mariadb?08:40
noonedeadpunkAs they've updated for centos, but should affect only new releases...08:40
jrosserit was for the ovs pacakge actually08:41
noonedeadpunkhuh08:41
jrosseralso TIL you can `bind blah.example.com:8000-9000` in a haproxy frontend config to handle an entire port range in one statement08:46
noonedeadpunkoh, that's interesting, didn't know that09:16
noonedeadpunkwtf. So you wanna say that gpg is wrong on Zed but not in Yoga while jobs finished at exact same time?10:11
noonedeadpunkAnd well. Zed was passing at some point as well...10:11
noonedeadpunkand it doesn't affect rocky on top10:12
hamidlotfiHello,10:17
hamidlotfiI'm back again with a duplicate error message but this time in adding a Compute node to my environment.10:17
hamidlotfiRun the following command, I want to add the `compute03`10:17
hamidlotfi`openstack-ansible playbooks/setup-openstack.yml --limit localhost,compute03`10:17
hamidlotfishow this error message:10:17
hamidlotfi`10:17
hamidlotfiTASK [Gather software version list] ********************************************10:17
hamidlotfifatal: [compute03 -> localhost]: FAILED! => {"msg": "The task includes an option with an undefined variable. The error was: 'ansible.vars.hostvars.HostVarsVars object' has no attribute 'ansible_local'\n\nThe error appears to be in '/opt/openstack-ansible/playbooks/os-nova-install.yml': line 52, column 7, but may\nbe elsewhere in the file depending on the exact syntax problem.\n\nThe offending line appears to be:\n\n    10:17
hamidlotfi# venv tag for all hosts in the 'cinder_all' host group.\n    - name: Gather software version list\n      ^ here\n"}10:17
hamidlotfi`10:17
hamidlotfi@jrosser 10:17
noonedeadpunkyeah, I think your nodes are deployed already.10:29
noonedeadpunkas this failure happens only after al lrequired steps are done.10:30
hamidlotfilet me check10:30
noonedeadpunkSo unless you have zun or masakari - you should be good enough now10:31
hamidlotfiYes, right, the new compute node is added successfully.10:32
noonedeadpunkbut it's actualy exact same bug you had with controllers10:33
hamidlotfiYes, but in the controller add the `cinder_all` in the `limit` section and skiped 10:35
hamidlotfiHere also I added the same order but it also shows me the error.10:37
jrosserwell, it's a different error really10:37
jrosserthis time is it missing information about the nova_all group, not cinder_all10:38
jrosserroot cause is different i mean10:38
noonedeadpunkwell... it's different place but same issue10:38
noonedeadpunkbut yeah10:39
hamidlotfiohh, but the end of the error message says ` venv tag for all hosts in the 'cinder_all' host group`10:39
noonedeadpunkoh wait, indeed10:40
noonedeadpunkbrr, how cinder_all ended up being asked in os-nova-install.yml10:40
hamidlotfiI don't know10:42
jrosserisnt that a bug that i already patched?10:44
jrosserhttps://github.com/openstack/openstack-ansible/commit/7ddc02946dfebf6fd39540b131ac5137f711fd0010:45
hamidlotfiI work with ZED version.10:45
jrosserit's backported to Zed as well https://review.opendev.org/c/openstack/openstack-ansible/+/87667810:46
jrosserhamidlotfi: it's probably that you're using a point release of Zed that does not contain my fix to that comment10:47
hamidlotfiYes, in my file don't have your patch.10:49
hamidlotfihttps://www.irccloud.com/pastebin/F0bblMAs/10:49
hamidlotfi@jrosser  Thank you for your hint.10:51
jrosseri don't think that anything has actually failed, even though its bad that the playbook ends in error10:51
jrosserunfortunately there is not a trivial fix to this and it's going to need some consideration of what the correct approach is10:52
hamidlotfiYes, the mission is completed successfully 🫡10:54
noonedeadpunkhamidlotfi: it's not functional patch just in case11:22
hamidlotfiI understood.11:24
noonedeadpunkIt looks like we indeed have some gpg issue for centos....11:35
noonedeadpunkwrt ovn/ovs which is provided by SIG11:36
noonedeadpunkthat is soooooooooo annoying11:36
noonedeadpunkand why only centos and not rocky....11:36
noonedeadpunkit basically means, that `centos-release-nfv-openvswitch` has issue in packaging on centos, while rocky pack it correctly11:40
noonedeadpunkwell, obviously I can't reproduce a thing locally11:55
noonedeadpunkBUT, what I see in /etc/pki/rpm-gpg/RPM-GPG-KEY-CentOS-SIG-NFV in VM and https://www.centos.org/keys/RPM-GPG-KEY-CentOS-SIG-NFV are 2 completely different GPGs11:56
noonedeadpunkAND fingerprint is kinda same https://paste.openstack.org/show/b9TiKYNc44AmldSXZU5X/12:04
noonedeadpunkwith one provided here https://www.centos.org/keys/12:05
noonedeadpunkwell, fingerprint is exactly the same in both, so likely just gpg format is different12:18
NeilHanlonI am meeting with the NFV sig this morning.. will talk to them about this noonedeadpunk12:52
noonedeadpunkI've already bothered amoralej just in case... This could potentialy be infra mirror issue, but weird...12:52
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/yoga: Drop `echo` from add-compute.sh script  https://review.opendev.org/c/openstack/openstack-ansible/+/87947613:56
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/zed: Drop `echo` from add-compute.sh script  https://review.opendev.org/c/openstack/openstack-ansible/+/87947513:57
noonedeadpunkNeilHanlon: I _think_ that centos-release-nfv-openvswitch simply ships gpg key in SHA1 that's not supported anymore by gpg https://paste.openstack.org/show/bXtPkgJ96bEzleXjcAW6/14:33
noonedeadpunkProbably it's time to switch jobs to rocky, as it's some nightmare with CentOS going on recently and I'm quite fed up with it14:40
noonedeadpunkhttps://bugzilla.redhat.com/show_bug.cgi?id=218464014:52
NeilHanlonugh14:59
noonedeadpunkso. all current container/cloud images of centos are completely borked14:59
NeilHanlonsick15:00
NeilHanlonwhat can I do to help? I am happy to go through and submit reviews on projects switching centos to NV and replacing them with Rocky15:01
NeilHanloni mean not "happy" in that it brings me joy, since I hate to really have to do it, but.15:01
noonedeadpunkNeilHanlon: most helpful would be to ensure rocky is not backporting that as well atm :)15:02
noonedeadpunkchange is brought by gnupg2-2.3.3-315:04
NeilHanlongnupg2-2.3.3-2.el9_0.x86_64.rpm                    20-Sep-2022 20:42             259935915:06
NeilHanlonrhel 9.2 beta shipping -2, as well15:07
NeilHanlonhttps://skiprocky.linuxdn.org/tmp/rockygitcompare/rhel9beta_vs_rocky9_full.html15:07
noonedeadpunkI wonder if they've released 2.3.3-3 after this page being generated...15:26
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible master: Replace CentOS 9 Jobs with Rockylinux  https://review.opendev.org/c/openstack/openstack-ansible/+/87967116:16
noonedeadpunkI wonder how far we are from marking CentOS 9 S as experimental16:16
noonedeadpunkBut I kinda hate where this ends up16:17
noonedeadpunkAnd clueless what else we can do 16:17
noonedeadpunkexcept wait for couple of weeks to all repos updating their gpg keys...16:18
noonedeadpunkWell NFV has already pushed their and it should be in mirrors really soon16:18
noonedeadpunkSo we will be technically unblocked, until zuul images will be updated so it will hit Base repo16:19
damiandabrowskido you think it's reasonable to (temporarily) set c9s to non-voting for now?16:20
noonedeadpunkYeah, I've pushed 879671 actually for that as well.16:23
noonedeadpunkBut I'd expect things being fixed in the morning to be frank even without it. 16:23
noonedeadpunkUnless they will break again 16:23
damiandabrowskiahhh sorry, I missed it. Great then16:25
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible master: Add support for TLS backends  https://review.opendev.org/c/openstack/openstack-ansible/+/87908517:24
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible master: [DNM] Enable openstack_backend_service_https  https://review.opendev.org/c/openstack/openstack-ansible/+/87950117:24
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible-os_horizon master: Change default value for horizon_enable_ssl  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/87951517:25
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible-os_horizon master: Fix horizon_enable_ssl logic  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/87951417:25
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible-os_horizon master: Rename horizon_enable_ssl to horizon_backend_https  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/87951617:25
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible-os_horizon master: Add PKI support to horizon backends  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/87951717:25
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible-os_horizon master: Add PKI support to horizon backends  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/87951717:29
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible-os_horizon master: Add PKI support to horizon backends  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/87951718:04
spateljamesdenton noonedeadpunk i have booked my flight for Vancouver :) 19:01
noonedeadpunksweet )19:01
spatelI will see you there with some action plan!! 19:01
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-galera_server master: Remove old GPG key in advance  https://review.opendev.org/c/openstack/openstack-ansible-galera_server/+/87968019:19
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible master: Replace CentOS 9 Jobs with Rockylinux  https://review.opendev.org/c/openstack/openstack-ansible/+/87967119:19
noonedeadpunkoh, seems like NFV repo is already fixed19:21
noonedeadpunkas centos metal is passing now19:21
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/zed: Remove nova-scheduler from ironic compute container_skel  https://review.opendev.org/c/openstack/openstack-ansible/+/87948219:25
noonedeadpunkso hopefully I will be able to fix stable branches tomorrow...19:32
* NeilHanlon still is thinking about sneaking away to Vancouver...20:02
NeilHanlonhere is my dilemma... i need to be in Charlotte, NC on June 9-10... and then Infra summit is 13-15... and we're trying to buy a house this summer 😅20:03
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible master: [DNM] Enable openstack_backend_service_https  https://review.opendev.org/c/openstack/openstack-ansible/+/87950122:47
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible-repo_server master: Add TLS support to repo_server backends  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/87642922:59
opendevreviewMerged openstack/openstack-ansible-haproxy_server master: Provide custom handler name to PKI role  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/87577923:19
opendevreviewMerged openstack/openstack-ansible-haproxy_server master: Fix haproxy_service_configs format conversion  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/87877123:19

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!