Friday, 2022-06-10

damiandabrowski[m]most of the people are on their way home from openinfra summit, but normally yes, we are active here every day ;)06:36
admin1hi07:12
Mouaahi07:14
admin1quick checking .. if anyone knows when/how limits added on existing flavors are activated 07:15
jrosser_admin1: I’m not sure what you mean - do you have an example?07:16
admin1found my answer here: https://docs.openstack.org/nova/xena/admin/flavors.html ..   adding extra specs like limits need a cold migration 07:19
MouaaHi. When upgrading from ubuntu 18.04 to 20.04 under ussuri, I noticed a lot of problems with the keepalived check scripts (vrrp_script haproxy_check_script and vrrp_script pingable_check_script). Script pingable_check_script now returning 2 and VRRP_Script(haproxy_check_script) failed (exited with status 1). 07:51
MouaaThe only related bug found is https://bugs.launchpad.net/ubuntu/+source/keepalived/+bug/1806004 related to "script_user root" option in the keepalived config, present in our config. When I run the scripts manually as root in a shell, I don't reproduce these errors.07:51
MouaaThe only solution I found was to modify these scripts, script "/bin/kill -0 `cat /var/run/haproxy.pid`" changed to script "/usr/bin/killall -0 haproxy" and remove 1>&2 redirect from vrrp_script pingable_check_script. 07:52
MouaaHas anyone encountered a similar problem? I prefer to ask before raising a bug...07:52
*** ysandeep|out is now known as ysandeep08:05
andrewbonneyMouaa: do you override the keepalived_scripts variable at all? This sounds familiar and I can see changes such as https://github.com/openstack/openstack-ansible/commit/f91b88078a71525608664df6b4559cd7908117b2 which are important when moving from keepalived v1.x to v2.x08:31
Mouaa@andrewbonney This problem appeared during the OS upgrade of the nodes, not while redeploying by replaying the ussuri playbooks, so my fix was made directly in the keepalived conf on the controllers, not by overriding the keepalived vars. Indeed, the OS upgrade moved Keepalived from v1.3.9 to v2.0.19.08:58
andrewbonneyHad you set the keepalived_scripts variable previously though? So that it differs from the variable settings provided in the Ussuri OSA release?08:59
*** dviroel_ is now known as dviroel11:30
Mouaa@andrewbonney, it seems you're right, I asked the initial committer why he overrode the behavior of check_scripts11:39
mgariepyhttps://github.com/openstack/openstack-ansible/commit/d6ca5af79ec10d1a6c2cd98b27ca7e755abfc9b512:23
mgariepyMouaa, andrewbonney i think it's missing for U ?12:23
mgariepyor not..12:24
mgariepyare you on the latest Ussuri ?12:24
mgariepyMouaa, ^^12:25
MouaaThis platform has been in place since the stein version; git is currently on "ussuri-em" for the migration from train to ussuri, and I'm currently upgrading the ubuntu OS. Install method : distro12:32
mgariepywhat does the global_defs in the keepalived.conf file look like ?12:34
MouaaThanks for the help ! global_defs { enable_script_security   script_user root }12:36
mgariepy`which kill` on the controller? 12:41
mgariepyi think the path is not quite correct.. 12:41
mgariepyhmm. both should exist :/12:42
mgariepycan you try to add the full path for cat in the script ?12:45
mgariepydoing killall if there are other haproxy process on the server might cause some other issues.12:47
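For reference, the difference between the two checks discussed above can be sketched in Python. This is a minimal illustration and not the check script that OSA ships: the function name `pid_from_file_is_alive` is hypothetical, and it only looks at the first PID in the file. It sends signal 0 to the PID recorded in a pidfile, which is the idea behind `kill -0` on the contents of /var/run/haproxy.pid, rather than matching every process named haproxy the way `killall -0 haproxy` does.

```python
import os


def pid_from_file_is_alive(pidfile):
    """Return True if the first PID recorded in pidfile refers to a live process.

    Hypothetical helper for illustration only.
    """
    try:
        with open(pidfile) as handle:
            pid = int(handle.read().split()[0])
    except (OSError, ValueError, IndexError):
        # Missing, unreadable, empty, or malformed pidfile: treat as a failed check.
        return False
    if pid <= 0:
        # PIDs of 0 or below address process groups in kill(2), not one process.
        return False
    try:
        # Signal 0 runs the existence and permission checks without delivering a signal.
        os.kill(pid, 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        # The process exists but is owned by another user.
        return True
    return True
```

A wrapper script could call this with the pidfile path and exit with status 0 on True and 1 on False, since the keepalived log lines quoted above report the exit status of the check script.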
Mouaa@mgariepy, Good idea, I will try to put full path for cat12:49
MouaaI am thinking of reproducing https://github.com/openstack/openstack-ansible/commit/f91b88078a71525608664df6b4559cd7908117b2 manually on the controllers (without playbook); then, after upgrading the OS and LXC containers, when I run the deployments to upgrade from ussuri to victoria, everything should fall into place...12:49
Mouaafull path for cat didn't change anything : Script `haproxy_check_script` now returning 112:52
mgariepy:/ that's weird.12:52
mgariepyhow did you do the upgrade ? os upgrade on one controller?12:53
mgariepythen rerun the playbook for it?12:53
mgariepydoes /var/run/haproxy.pid exist ?12:55
MouaaI upgraded ubuntu 18.04 to 20.04 sequentially (1 by 1) each controller, looking at the status of each service (keepalived, haproxy, ...) between each and restarting each service if out of sync12:59
mgariepycan you run the haproxy playbook ?13:00
MouaaI haven't rerun the playbooks yet, I was thinking of upgrading all the nodes, destroying the LXCs and then replaying the playbooks13:01
mgariepyusually i do controllers one by one. and redeploy services on the reinstalled node.13:02
*** ysandeep is now known as ysandeep|break13:02
MouaaOK, i'll rerun the haproxy-install.yml for testing13:04
mgariepysomething like that : https://etherpad.opendev.org/p/osa-newton-xenial-upgrade or https://etherpad.opendev.org/p/osa-rocky-bionic-upgrade13:04
MouaaThe only reference doc for the upgrade is https://docs.openstack.org/openstack-ansible/victoria/admin/upgrades/distribution-upgrades.html, which doesn't fit my case because of the distro install method, so i'm trying on the DEV (nested openstack in openstack PF)13:07
mgariepywhy did you go with distro install, out of curiosity ?13:08
Mouaano idea, I get the project set up in 2018, I don't have all the info...13:10
Mouaa@mgariepy I spent a lot of time making our dev PF functional (Openstack DEV nested on openstack PROD: vxlan in vxlan, physical mtu 1500, tenant network and instance ports in DEV to 1400). I took some time to configure everything to get something functional. And since the keepalived heartbeats use vxlan, I directed my research on that13:27
mgariepyyeah it's kinda a mess when you need to mess with mtu and such.13:28
*** ysandeep|break is now known as ysandeep13:44
goldenfriHi, is anyone around that could give some advice? I've been running a rocky install of OSA for years, but recently something happened and the openstack-ansible command no longer works, I get an Unexpected Exception Error14:21
goldenfriTypeError: expected string or buffer14:22
jrosser_goldenfri: you can copy the error to paste.opendev.org perhaps?14:52
goldenfrihere it is, thanks! https://paste.opendev.org/show/bFmFQarBe89z8tQ76DEL/14:55
*** dviroel is now known as dviroel|lunch15:02
*** ysandeep is now known as ysandeep|out15:21
jrosser_goldenfri: i'm not sure really - it looks like something invalid in your inventory15:22
jrosser_i think there is a tar archive kept in /etc/openstack_deploy of the inventory history so you could check if anything has changed there15:22
jrosser_or hopefully you have that directory in git to track things :)15:23
jrosser_also you could try to see if `ansible-inventory` will parse the inventory15:23
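One way to start looking for "something invalid" in an inventory is to scan it for values that are null where a string would be expected. The sketch below is only an illustration under stated assumptions: it assumes the inventory is stored as JSON, the helper names `find_null_values` and `check_inventory` are hypothetical, and nothing in the discussion confirms that a null value is what triggers the TypeError. It is written for Python 3 and meant to be run on any workstation against a copy of the file.

```python
import json


def find_null_values(node, path="$"):
    """Yield a JSON-path-like string for every null value in a parsed document.

    Hypothetical helper for illustration only.
    """
    if node is None:
        yield path
    elif isinstance(node, dict):
        for key, value in node.items():
            yield from find_null_values(value, f"{path}.{key}")
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from find_null_values(value, f"{path}[{index}]")


def check_inventory(text):
    """Parse inventory text as JSON and return the paths of any null values."""
    return list(find_null_values(json.loads(text)))
```

Passing the contents of a copy of the inventory file to `check_inventory` returns a list of paths to inspect. A file that is not valid JSON raises `json.JSONDecodeError`, which is a useful finding in itself.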
goldenfriThanks @jrosser_ that at least gives me some place to start looking15:28
Mouaamgariepy FYI, your advice fixed my keepalived/haproxy problems15:29
mgariepycool15:32
*** dviroel|lunch is now known as dviroel16:13
*** dviroel is now known as dviroel|afk19:20
