Monday, 2023-07-31

opendevreviewAndrew Bonney proposed openstack/openstack-ansible-haproxy_server stable/2023.1: Correct default Content-Type for security.txt  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/89000707:04
jrossergood morning08:23
noonedeadpunko/08:40
noonedeadpunkfwiw I'm mostly away during this week08:40
noonedeadpunkI think most nasty things right now that we have - this keystone thing08:41
noonedeadpunkI tried to reach keystone folks but had no reply08:41
jrossernoonedeadpunk: yeah - we are just looking at Z->2023.1 upgrade here and wondering what to do about this09:08
noonedeadpunkI'm really not sure other then bug keystone folks or keep old keystone (or have a fork with revert)09:09
* noonedeadpunk facepalms that we don't use <service>_galera_port in any <service>.conf09:27
noonedeadpunkwell, we do _only_ in nova09:32
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_octavia master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_octavia/+/89008812:15
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_neutron master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/89008912:25
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_cinder master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_cinder/+/89009112:28
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_adjutant master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_adjutant/+/89009212:30
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_aodh master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_aodh/+/89009312:32
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_barbican master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_barbican/+/89009513:00
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_blazar master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_blazar/+/89009613:01
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_cloudkitty master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_cloudkitty/+/89009713:03
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_octavia master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_octavia/+/89008813:03
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_designate master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_designate/+/89009813:04
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_glance master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_glance/+/89009913:07
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_gnocchi master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_gnocchi/+/89010013:08
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_heat master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_heat/+/89012213:09
jrosserHas anyone attempted a Z -> 2023.1 upgrade?13:12
jrosserwe see total failure as soon as keystone is upgraded https://paste.opendev.org/show/bGfPvKRVJbHXS6I08QMT/13:13
jrosserwhich is a result of this https://opendev.org/openstack/keystone/commit/f6a0cce4409232d8ade69b7773dbabcf4c53ec0f13:13
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_magnum master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_magnum/+/89012313:13
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_masakari master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_masakari/+/89012413:15
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_mistral master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_mistral/+/89012513:17
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_manila master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_manila/+/89012613:18
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_murano master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_murano/+/89012713:20
* noonedeadpunk don't use oauth as of today13:20
jrosserit doesnt matter13:20
jrosserthis blew up our test lab which doesnt either13:20
noonedeadpunkhuh13:21
jrosserit's all over our upgrade CI jobs too13:21
andrewbonneyhttps://zuul.opendev.org/t/openstack/build/0c561f4e655241e09e4517f2f9407da4/log/logs/host/keystone-wsgi-public.service.journal-09-24-38.log.txt#1360713:21
jrosserlike all the existing issued tokens suddenly don't work any more13:21
jrosserthe upgrade jobs only "work" because they update/restart all the services which then get new tokens13:22
jrosserbut from the point keystone is upgraded to the point that all the services are restarted, things are in a pretty bad state13:23
noonedeadpunkBut upgrade job worked only until we had resetted passwords?13:23
jrosserthis is different13:23
jrosserin our lab we have reverted the password length keystone change13:23
noonedeadpunkBut I'm not sure that we have configured all services to have caching?13:23
noonedeadpunkit's likely limited to nova/neutron I believe13:24
noonedeadpunkI wonder if flushing memcacached should work?13:24
jrosserwe lost placement / designate / radosgw / glance / etc etc13:24
jrosserbasically as soon as we updated keystone the whole monitoring went red13:25
noonedeadpunkugh...13:25
noonedeadpunkhalali: should be working on testing upgrades right now13:26
jrosserthat whole log that andrewbonney posted is full of WTF exceptions, like missing users / roles / project13:27
noonedeadpunkyeah, saw that....13:28
jrosserafter ~1hour it came good after the tokens expired and were reissued13:29
noonedeadpunkI think that doing smth like `ansible -m shell -a "echo 'flush_all' | nc localhost 11211" memcached_all` should also help?13:30
jrossertried to understand the keystone code a bit but it's pretty difficult13:32
jrossertheres the token as a string, seems to be then decoded to a dict (?) and then theres also the TokenModel class13:32
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_placement master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_placement/+/89013013:37
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_senlin master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_senlin/+/89013113:39
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_sahara master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_sahara/+/89013213:40
noonedeadpunkum, decoding string to dict sounds... weird...13:40
andrewbonneyIt may have been a string ID which is used to look up a token, but it's a little tricky to follow13:42
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_tacker master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_tacker/+/89013313:43
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_zun master: Use proper galera port in configuration  https://review.opendev.org/c/openstack/openstack-ansible-os_zun/+/89013413:46
halalinoonedeadpunk in progress...16:45
jamesdenton_any known issues with 27.0.1 and first_found?17:07
jrosserjamesdenton_: do you have something breaking?18:19
halaliupgrade from Yoga 25.4.0 -> stable/2023.1 I lost neutron API https://paste.openstack.org/show/bsx1QDpjKmPmUY0otWK1/19:14
jamesdenton_Yeah, I've done 2 deploys now using a dedicated deploy node, 27.0.0/27.0.1 on jammy, and the unbound plays are failing when trying to gather vars: https://paste.opendev.org/show/bH6D8p8BZzS35dDkzc8U/ 19:51
jamesdenton_doesn't happen with 26.1.219:51
jamesdenton_not sure what the trigger is, so i'd like to narrow that down before anyone wastes time19:52
mgariepyisn't it a stalled/expired fact issue ?19:55
jamesdenton_not sure - i've had this happen in two different envs20:04
jamesdenton_i did t-shoot some last week i've just forgotten the details, i'll try it again20:05
jamesdenton_halali have you tried restarting haproxy?20:11
jrosser^ would going from yoga -> 2023.1 switch the networking backend to OVN unless $something is done to prevent that?20:12
jamesdenton_there should be logic there to prevent that20:13
halalijamesdenton_ restarted haproxy did not help, 20:20
halalijrosser does this suppose to keep OVS as is if I did not change it to OVN instead 20:20
jamesdenton_if you had defined ml2.ovs previous that should have stayed20:21
jamesdenton_"journalctl -xe -f -u neutron-server" might show whats going on20:21
halalijamesdenton_ it's AIO using default user_variables.yml, I moved from Xena -> Yoga (was good and smooth with no issue) then 2023.1 that got failed 20:22
halaliand if the upgrade process suppose to transfer the neutron plugin from OVS -> OVN then the neutron container and inventory it suppose to change which is not currently 20:24
jamesdenton_well, even then using linuxbridge there was supposed to be something there, but if you went Y->A then maybe something in Z was missed20:24
jamesdenton_there is no mechanism to migrate anyone to OVN automatically (on purpose, anyway)20:25
jamesdenton_what does the journal show?20:25
halaliyou are right, it complaining on ml2 plugin https://paste.openstack.org/show/bkann3WINk1diMHLq0Yz/20:27
jrosseroh, isnt that something we fixed?20:28
jamesdenton_neutron-24.6.1? isnt that still xena?20:28
halaliand the content of ml2_conf.ini is https://paste.openstack.org/show/b30brqUCvD3xjE3Ghelz/ 20:28
jrosser`Failed to find some config files: /etc/neutron/neutron.conf,/etc/neutron/plugins/ml2/ml2_conf.ini`20:29
jamesdenton_hum, well that's not what we have for ml220:29
jamesdenton_if you were upgrading20:29
halaliO_o 24.6.1 how it come 20:29
halalineutron-api venv/neutron-27.0.1 and 25.4.0 and 24.6.1 is exist20:32
jrosserthey don't get deleted during an upgrade20:33
jamesdenton_https://github.com/openstack/openstack-ansible/blob/stable/zed/scripts/upgrade-utilities/define-neutron-plugin.yml20:34
jamesdenton_https://docs.openstack.org/openstack-ansible/zed/admin/upgrades/major-upgrades.html20:34
jamesdenton_that's a play that needs to run from Y->Z to ensure that the plugin doesn't change20:34
jamesdenton_it's also listed here:20:35
jamesdenton_https://docs.openstack.org/openstack-ansible/2023.1/admin/upgrades/major-upgrades.html20:35
jamesdenton_but is missing from here:20:35
jamesdenton_https://docs.openstack.org/openstack-ansible/latest/admin/upgrades/major-upgrades.html20:35
jamesdenton_jrosser i was just observing that the service/binary still seemed to be running from that older venv20:36
jrosseryes, that could be a failed previous upgrade20:36
jrosserfrom X20:36
jamesdenton_i haven't seen this particular issue, though.. the file not found20:36
halaliinteresting, but I wonder how is Y upgrade is pass the test and success, tbh did not check directly on neutron-api but the api was response and no haproxy complain on backend either, 20:39
jamesdenton_Must just be enough compatibility there, hard to say20:40
halaliI see, so the process as follow 20:41
halalifrom X -> Y then the define-neutron-plugin.yml script to make sure neutron plugin NOT change then I able to upgrade to 2023.1 20:42
jamesdenton_Well, from Y->Z or Y->2023.1 you need(ed) to run define-neutron-plugin.yml20:43
jamesdenton_which uses magic to determine that linuxbridge was the default and hardsets it moving forward20:43
jamesdenton_if you were already using ml2.ovs or had hardcoded ml2.lxb then it shouldn't have been an issue20:43
halaliI see, and on the way from X -> 2023.1 we must go through Y ?20:44
halalior at least X -> Z -> 2023.1 20:45
jamesdenton_X->Y->Z->2023.1 for sure. Not sure if a skip is supported anywhere in there, yet20:46
halaliOK thanks 20:53

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!