Wednesday, 2024-01-10

noonedeadpunkmornings08:52
jrossergood morning08:53
jrosserupgrade bug question in #openstack fwiw08:53
* noonedeadpunk just joined the channel08:54
noonedeadpunkWas it same as a bug report? https://bugs.launchpad.net/openstack-ansible/+bug/2048842 ?08:54
* noonedeadpunk checking eavesdrop08:55
jrosserblues11> I was trying to upgrade from 'Zed' to 'Antelope' using OpenStack ansible upgrade script and get the below error   https://paste.openstack.org/show/bMGGLkzALkRFYvvGbmab/  When I traced the error I noticed that the monitoring service was down, we noticed that a variable in ceph conf(located in monitor lxc container) is not expanded properly and I reckon this caused ceph monitor service down   Has anyone had similar issues 08:55
jrosseror any clue about this? 08:55
jrosseri think thats the same as the bug you linked08:57
noonedeadpunkyeah08:57
noonedeadpunkit feels like we've landed most of bugfixes?08:59
jrosseri think pretty much09:01
jrossermaybe we are close to some point releases?09:02
noonedeadpunkyeah exactly what I was thinking about09:07
noonedeadpunkWell, this one is potentially good to have - it can break Ironic CI (or, better to say, Swift)09:08
noonedeadpunkhttps://review.opendev.org/c/openstack/openstack-ansible/+/90494109:08
noonedeadpunkThis is Magnum Octavia backport: https://review.opendev.org/c/openstack/openstack-ansible-os_magnum/+/90451009:10
noonedeadpunkOther than that it looks like we're good to go09:11
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-ops master: WIP - Bootstrapping playbook  https://review.opendev.org/c/openstack/openstack-ansible-ops/+/90217809:27
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_magnum master: Add job to test Vexxhost cluster API driver  https://review.opendev.org/c/openstack/openstack-ansible-os_magnum/+/90519909:28
jrosser^ i had some better idea about how to put this together in CI09:31
noonedeadpunkI wonder if it's worth adding `/` to the end of src? As I guess you want to copy only some files from there?09:39
noonedeadpunkrather than the directory altogether?09:39
noonedeadpunkAs at this point /etc/openstack_deploy will already exist (it's also done at pre-stage, but will be earlier anyway)09:40
jrossersomehow it runs no zuul job at all09:40
jrosseropenstack_deploy is made with pre playbook? i did not spot that09:42
andrewbonneyjrosser: looks like there's a mis-quoting on L23 of the playbook09:42
noonedeadpunkjrosser: jobs.yaml09:42
noonedeadpunknot .yml09:42
noonedeadpunkIIRC zuul was picky about that09:42
jrosserargh :)09:42
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_magnum master: Add job to test Vexxhost cluster API driver  https://review.opendev.org/c/openstack/openstack-ansible-os_magnum/+/90519909:43
noonedeadpunkyeah, quotes are wrong as well:)09:43
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_magnum master: Add job to test Vexxhost cluster API driver  https://review.opendev.org/c/openstack/openstack-ansible-os_magnum/+/90519909:44
* jrosser goes to get coffee09:44
jrosserbrain not engaged yet today09:44
noonedeadpunkstill nothing, huh09:47
noonedeadpunkI would expect at least something to be scheduled or error out as a gerrit comment tbh09:47
jrosseryeah usually you get some description of whats wrong09:48
noonedeadpunknot saying I don't see anything wrong....09:48
noonedeadpunkbut I definitely do like the idea :)09:49
opendevreviewMerged openstack/openstack-ansible-os_magnum stable/zed: Add missing magnum octavia client configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_magnum/+/90451011:23
noonedeadpunkI in fact can't reproduce magnum upgrade issue with tls scenario :(11:25
noonedeadpunkBut indeed `No image found with ID fedora-coreos-latest _image_get ` is in glance logs11:28
noonedeadpunkThough it might be fine when client tries to resolve name to ID....11:29
jrosseroh well you might need to use uuid there11:30
jrosseri see similar for the cluster api stuff and have not looked actually where that is problematic11:31
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-ops master: WIP - Bootstrapping playbook  https://review.opendev.org/c/openstack/openstack-ansible-ops/+/90217811:35
Tadioso/, Hasn't this been fixed already? After a fresh deployment, specifically after the os-neutron-install playbook, the CPU usage on the infra nodes spikes and I get a "ValueError: non-zero flags not allowed in calls to send() on <class 'eventlet.green.ssl.GreenSSLSocket'>" error on my neutron-server service https://paste.openstack.org/show/822819/11:52
Tadiosbasically performing the following will resolve the issue, ansible neutron_server -m shell -a "systemctl restart neutron-server", i am on 2023.111:52
noonedeadpunkTadios: frankly speaking - was not aware of this issue11:56
Tadiosnoonedeadpunk: oh i thought it was somehow related to this bug https://bugs.launchpad.net/openstack-ansible/+bug/202785411:59
noonedeadpunkahhhhhhhhhhhhhhhhhhhhhhhhhhhhh11:59
* noonedeadpunk short on memory :D11:59
jrossernoonedeadpunk: you are right about problems with my os_magnum patch, openstack-ansible pre-osa-aio.yml does all the bootstrapping before the os_magnum pre.yaml playbook gets run /o\12:00
noonedeadpunkTadios: IIRC this was actually related to the OVS bug itself, that has been fixed in 2.17.312:01
noonedeadpunkjrosser: it's easy to fix though?12:01
noonedeadpunkLike you need not to copy directory, but just content of directory?12:01
jrosserbut it is too late12:01
jrosserbootstrap-ansible and bootstrap-aio have already happened12:01
noonedeadpunkah, you mean that12:01
noonedeadpunkfor u-c-r12:02
noonedeadpunkAh!12:02
noonedeadpunkI think you can pass some Zuul var to prevent this12:02
Tadiosnoonedeadpunk: anything i need to do on my end when deploying? i pretty much cloned and start deployment from 27.3.0 12:03
noonedeadpunkAnd that is Rocky/Centos? 12:04
noonedeadpunkNah, unlikely....12:04
Tadiosnoonedeadpunk: ubuntu 22.0412:04
noonedeadpunkor well. I meant that probably it does not depend on OS then12:05
noonedeadpunkso this fix should have been included in 27.3.012:05
Tadiosya that's why I am confused12:05
noonedeadpunkBut, the thing is that upper constraints were not updated. So inside the neutron venv an old ovs python package is likely still installed12:07
noonedeadpunkSo this can be the root cause of the issue12:07
noonedeadpunkOSA follows generic upper-constraints, and I'm not sure about a really good way to work around this, except forking the repo and editing the version of ovs in u-c12:08
noonedeadpunkAs our proposals to update ovs version there were rejected12:08
jrosseri do wonder if we should implement some constraints override mechanism12:11
noonedeadpunkTadios: so that is the point of your interest I believe: https://opendev.org/openstack/requirements/src/branch/stable/2023.1/upper-constraints.txt#L18512:12
noonedeadpunkand then you can override repo of requirements here: https://opendev.org/openstack/openstack-ansible/src/branch/master/inventory/group_vars/all/source_git.yml#L19-L2012:13
Tadiosnoonedeadpunk: what do i do here exactly, https://opendev.org/openstack/openstack-ansible/src/branch/master/inventory/group_vars/all/source_git.yml#L19-L20 am confused can i set ovs to a specific version here?12:15
jrosserTadios: you should fork the openstack/requirements repo on github12:15
jrosserpatch it as you need12:16
jrosserand then override those variables to point to your fork12:16
Tadiosjrosser: ohh, got it, Thank you.12:16
jrosserthis is truly terrible user experience unfortunately, but as noonedeadpunk says our attempts to fix this directly in the requirements repo have not been successful12:17
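The "patch it as you need" step in a fork of openstack/requirements amounts to changing a single pin in upper-constraints.txt. A minimal sketch of that edit, assuming the file uses `name===version` lines; the `pin_constraint` helper and all version numbers below are hypothetical placeholders, not the versions discussed above:

```python
import re


def pin_constraint(constraints_text: str, package: str, version: str) -> str:
    """Return constraints_text with the `package===...` pin set to `version`.

    Hypothetical helper for illustration only; it assumes exactly one
    `package===<version>` line exists in the text.
    """
    # Anchor at line start and require `===` right after the name, so that
    # `ovs` does not also match `ovsdbapp`.
    pattern = re.compile(rf"^{re.escape(package)}===[^;\s]+", re.MULTILINE)
    new_text, count = pattern.subn(f"{package}==={version}", constraints_text)
    if count != 1:
        raise ValueError(f"expected exactly one pin for {package}, found {count}")
    return new_text


# Placeholder content; real pins live in upper-constraints.txt of the fork.
sample = "ovsdbapp===2.0.0\novs===1.0.0\nos-vif===3.0.0\n"
patched = pin_constraint(sample, "ovs", "9.9.9")
```

The patched file would then be committed to the fork, and the requirements repo variables linked above overridden to point the deployment at that fork and commit.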
Tadioswhy does restarting neutron-server fix the issue though?12:17
jrosserthats an interesting question12:18
jrosserTadios: by any chance were you able to reproduce this in an all-in-one build?12:19
Tadiosjrosser: no i faced this issue twice, and they are both on multinode deployment12:19
jrosserso you've not tried, or it's OK in AIO?12:20
Tadiosno i haven't tried12:20
jrosserok cool12:20
noonedeadpunkjrosser: looking into current code, I don't think there's a variable available to skip bootstrap at pre-stage12:25
jrosserits just unlucky the order zuul runs the playbooks12:26
jrosserparent first then child job12:26
noonedeadpunkthere's one in scripts/gate-check-commit.sh12:26
noonedeadpunkbut not in zuul12:26
jrosseractually maybe thats the answer - make a new parent job that doesnt have the pre-playbooks12:27
noonedeadpunkI think it should be easy to add another condition here: https://opendev.org/openstack/openstack-ansible/src/branch/master/zuul.d/playbooks/pre-osa-aio.yml#L4312:27
jrosseryeah but then i actually do want to run it12:27
noonedeadpunkyou will?12:28
* jrosser confused12:28
noonedeadpunkhttps://opendev.org/openstack/openstack-ansible/src/branch/master/zuul.d/playbooks/run.yml#L2312:28
noonedeadpunkor well...12:28
noonedeadpunkthe problem of new job is that you'd need to have huuuge list of required-projects12:31
jrossernaah they aggregate12:32
jrosseroh sorry yes i see12:32
jrosserso - maybe another question is why do we have pre-osa-aio.yml as a zuul pre playbook12:35
noonedeadpunkso it was made initially to reduce our timeout limit in zuul12:36
jrosserdo we break anything by moving that from pre-run: to run: in the base job config12:36
noonedeadpunkas timeout is calculated for run step separately from all pre-post steps iirc12:36
jrosserthen a child job pre: playbook would go first12:36
noonedeadpunkand a failure of the pre step also triggers a retry of the job, whereas a failure in run is just a failure12:37
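The ordering problem discussed above can be modelled in a few lines. This is a simplified sketch, not Zuul code: it assumes pre-run playbooks run parent first, a child's run list replaces the parent's when set, and post-run playbooks run child first. The job names are hypothetical; the playbook names are the ones mentioned in the conversation.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Job:
    name: str
    parent: Optional["Job"] = None
    pre_run: List[str] = field(default_factory=list)
    run: List[str] = field(default_factory=list)
    post_run: List[str] = field(default_factory=list)


def lineage(job: Optional[Job]) -> List[Job]:
    """Return the inheritance chain, root ancestor first."""
    chain = []
    while job is not None:
        chain.append(job)
        job = job.parent
    return list(reversed(chain))


def playbook_order(job: Job) -> List[str]:
    """Order playbooks the way the simplified model above describes."""
    chain = lineage(job)
    pre = [p for j in chain for p in j.pre_run]            # parent first
    run = next((j.run for j in reversed(chain) if j.run), [])  # nearest override
    post = [p for j in reversed(chain) for p in j.post_run]    # child first
    return pre + list(run) + post


def outcome(failed_phase: Optional[str]) -> str:
    """A pre-run failure is retried; a run failure is reported as a failure."""
    return {"pre-run": "RETRY", "run": "FAILURE"}.get(failed_phase, "SUCCESS")


# Current layout: bootstrapping happens in the parent's pre-run.
base = Job("osa-base", pre_run=["pre-osa-aio.yml"], run=["run.yml"])
child = Job("magnum-capi", parent=base, pre_run=["pre.yaml"])
current = playbook_order(child)

# Proposed layout: bootstrapping moved from pre-run to run in the base job.
base_moved = Job("osa-base", run=["pre-osa-aio.yml", "run.yml"])
child_moved = Job("magnum-capi", parent=base_moved, pre_run=["pre.yaml"])
moved = playbook_order(child_moved)
```

In the current layout the child's pre.yaml lands after the bootstrap, which is the problem jrosser hit; in the moved layout it runs first, at the cost of bootstrap failures no longer being retried.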
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible master: Add support for extra Python packages inside OSA venv  https://review.opendev.org/c/openstack/openstack-ansible/+/90522114:17
opendevreviewMerged openstack/openstack-ansible stable/2023.2: Skip installing curl for EL  https://review.opendev.org/c/openstack/openstack-ansible/+/90484514:23
opendevreviewJonathan Rosser proposed openstack/openstack-ansible master: Add zuul job which does not run pre- playbooks.  https://review.opendev.org/c/openstack/openstack-ansible/+/90525016:43
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_magnum master: Add job to test Vexxhost cluster API driver  https://review.opendev.org/c/openstack/openstack-ansible-os_magnum/+/90519916:45
opendevreviewJonathan Rosser proposed openstack/openstack-ansible master: Add zuul job which does not run pre- playbooks.  https://review.opendev.org/c/openstack/openstack-ansible/+/90525016:47
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_magnum master: Add job to test Vexxhost cluster API driver  https://review.opendev.org/c/openstack/openstack-ansible-os_magnum/+/90519917:14
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_magnum master: Add job to test Vexxhost cluster API driver  https://review.opendev.org/c/openstack/openstack-ansible-os_magnum/+/90519918:01
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible master: Return back /healtcheck URI verification  https://review.opendev.org/c/openstack/openstack-ansible/+/90494119:09
