Friday, 2021-11-19

noonedeadpunkI wonder if we try to restart mariadb now _before_ upgrade is finished07:59
noonedeadpunkFeels like we need to find the way to check for upgrade to finish with debian-start....08:00
*** akahat|rover is now known as akahat|lunch08:36
*** akahat|lunch is now known as akahat|rover09:59
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-lxc_hosts master: Add CentOS 8.5 support  https://review.opendev.org/c/openstack/openstack-ansible-lxc_hosts/+/81853210:46
opendevreviewJames Gibson proposed openstack/openstack-ansible master: Add security headers to HAProxy Horizon service  https://review.opendev.org/c/openstack/openstack-ansible/+/81853310:50
rohit02hi tem,deploying OSA victoria on centos 8 facing issue in ansiblehost: FAILED - RETRYING: Ensure that the LXC cache has been prepared (58 retries left).what could be the reason to be failed at this tasks11:24
noonedeadpunkrohit02: I believe you use centos, not centos stream?11:52
rohit02noonedeadpunk:yes centos 8 11:58
noonedeadpunkyou need this https://review.opendev.org/c/openstack/openstack-ansible-lxc_hosts/+/81853211:59
noonedeadpunkeventually centos doesn't reliable differentiated between stream and non-stream so we did some assumptions that are broken now with 8.5 release11:59
opendevreviewJames Gibson proposed openstack/openstack-ansible master: Add security headers to HAProxy Horizon service  https://review.opendev.org/c/openstack/openstack-ansible/+/81853312:01
rohit02noonedeadpunk:  https://review.opendev.org/c/openstack/openstack-ansible-lxc_hosts/+/818532 not getting this part could plzz send me the steps12:02
noonedeadpunkcd /etc/ansible/roles/lxc_hosts/vars; ln -s centos-8.3.yml centos-8.5.yml12:03
rohit02noonedeadpunk: centos-8.3.yml not present does centos-8.4 works for me for the ln command12:15
rohit02noonedeadpunk: and that file[centos-8.4.yml] is empty12:17
noonedeadpunkhm... I'm looking at the repo, and it feels like it should have been a symlink that is broken....12:18
noonedeadpunkwhat if you try to drop it?12:18
noonedeadpunkwhat releaser are you running btw?12:18
noonedeadpunk* release12:18
rohit02noonedeadpunk: [root@controller ~]# cat /etc/redhat-release12:20
rohit02 CentOS Linux release 8.5.211112:20
noonedeadpunkosa release, sorry :)12:20
rohit02noonedeadpunk: any alternative solution to proceed further12:22
noonedeadpunkare you running victoria or ussuri?12:22
rohit02victoria12:22
noonedeadpunkhm... 12:24
noonedeadpunkand what actually error you get after all are failed?12:24
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-lxc_hosts stable/ussuri: Revert "Add CentOS 8.4 support"  https://review.opendev.org/c/openstack/openstack-ansible-lxc_hosts/+/81848512:25
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-lxc_hosts stable/victoria: Revert "Add CentOS 8.4 support"  https://review.opendev.org/c/openstack/openstack-ansible-lxc_hosts/+/81848612:25
noonedeadpunkdamn. I cherry-picked wrong thing I beleive12:28
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-openstack_hosts stable/victoria: Add CentOS 8.4 support  https://review.opendev.org/c/openstack/openstack-ansible-openstack_hosts/+/81848712:28
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-openstack_hosts stable/ussuri: Add CentOS 8.4 support  https://review.opendev.org/c/openstack/openstack-ansible-openstack_hosts/+/81848812:28
noonedeadpunkrohit02: can you pls try out https://review.opendev.org/c/openstack/openstack-ansible-openstack_hosts/+/818487 ?12:29
rohit02noonedeadpunk: yes will try and let you know......thanx for your help12:35
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-galera_server master: Update mariadb to 10.6.5  https://review.opendev.org/c/openstack/openstack-ansible-galera_server/+/81738412:35
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_keystone master: Drop Nginx webserver support  https://review.opendev.org/c/openstack/openstack-ansible-os_keystone/+/81739013:04
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_keystone master: Drop Nginx webserver support  https://review.opendev.org/c/openstack/openstack-ansible-os_keystone/+/81739013:10
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_keystone master: Drop Nginx webserver support  https://review.opendev.org/c/openstack/openstack-ansible-os_keystone/+/81739013:13
mgariepynoonedeadpunk, <monty__> mysql_upgrade should normally only be done when one updates the MariaDB package. As part of the process MariadB server should probably be used with another socket and then restarted with the new socket. That would solve the problem as no one would be able to connect until upgrade is done13:32
mgariepyi'm at around 200 reinstallation of mariadb without any issue on my testbed.13:32
mgariepyhttps://paste.opendev.org/show/811193/13:38
noonedeadpunkwell, Michael is great actually...13:56
noonedeadpunkhe's always super helpful and solves stuff :)13:57
mgariepyyep :D13:57
noonedeadpunkI was kind of surprised first time that he's still hanging around and do stuff on his own 13:58
noonedeadpunkin the meanwhile - upgrade does create tables, so we can check if they do exists and assume that migration is done if they are...13:59
noonedeadpunkI know it's not great way to do that...13:59
mgariepyor we divert the debian-start not to run upgradE?14:00
mgariepyhmm.14:00
mgariepyit's annoying, for greenfield deploy it doesnt really matter. but if you upgarde a prod system you might end with a DB that is dead locked.14:01
noonedeadpunkso you mean to set in /etc/default/mariadb empty MYUPGRADE?14:01
noonedeadpunkthen upgrade jsut won't run?14:01
mgariepyyeah it would be run by the task later in the playnook14:02
mgariepyplaybook.14:02
noonedeadpunkand eventually we already ship defaults file...14:02
noonedeadpunklooking on https://paste.opendev.org/show/811194/ you can't really make it empty... but can provide smth meaningless14:05
mgariepyotherwise we can edit debian-start not to call it.14:08
noonedeadpunknah, it's more nasty hack I guess :)14:08
mgariepycan we dpkg-divert the file in /etc ?14:08
mgariepywe divert the file, restart the db, upgrade it then revert-back the file ?14:08
mgariepyor we can also wait to see how the debian pkg will be fixed14:09
mgariepyunless your really want to ship mariadb 10.6.5 with X ?14:10
mgariepyi'll tell infra to release the vm now.14:11
mgariepyunless you think i should keep it a bit longer? 14:11
noonedeadpunknah, I think you can release now...14:12
noonedeadpunkEventually for several minutes I thought that you can just define `MYUPGRADE` in /etc/default/mariadb but just realized that it won't be respected14:13
noonedeadpunklet's at least upgrade rabbitmq) https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/81738014:14
opendevreviewJames Gibson proposed openstack/openstack-ansible master: Add security headers to HAProxy Horizon service  https://review.opendev.org/c/openstack/openstack-ansible/+/81853314:14
noonedeadpunkand we also do need https://review.opendev.org/c/openstack/openstack-ansible-lxc_hosts/+/81853214:15
mgariepynoonedeadpunk, i guess for the debian var issue you will need to create a new issue.14:23
noonedeadpunkwonder wtf is going here https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_9f9/817390/5/check/openstack-ansible-deploy-aio_metal-debian-bullseye/9f977d0/logs/ara-report/results/1201.html14:30
noonedeadpunko_O [Fri Nov 19 13:59:04.059426 2021] [mpm_event:error] [pid 65708:tid 139901775236416] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.14:30
noonedeadpunkfeels like we have weird defaults set for apache14:48
noonedeadpunkin terms of ThreadsPerChild/MaxRequestWorkers/ThreadLimit14:48
rohit02noonedeadpunk: the fix you have provided for centos 8.4 support not works for me....centos-8.yml is there but its empty14:52
noonedeadpunkit's supposed to be redhat-8, not centos-814:52
noonedeadpunkand it should not be empty14:53
rohit02and redhat-8.yml file is not there14:53
noonedeadpunkare we talking about openstack_hosts role now?14:53
rohit02my deployment failed at https://paste.opendev.org/show/811198/14:55
rohit02 redhat-8.yml file should be in this directory  cd /etc/ansible/roles/lxc_hosts/vars right?14:57
noonedeadpunkno14:57
noonedeadpunkplease check https://review.opendev.org/c/openstack/openstack-ansible-openstack_hosts/+/81848714:57
rohit02noonedeadpunk: have u chk this  https://paste.opendev.org/show/811198/15:00
opendevreviewMerged openstack/ansible-role-python_venv_build stable/train: Revert "Set centos-7 jobs to non voting"  https://review.opendev.org/c/openstack/ansible-role-python_venv_build/+/81722115:01
noonedeadpunkit doesn't say anything about failure reason15:01
noonedeadpunkimage build log should be stored in /var/log/lxc-cache-prep-commands.log15:02
rohit02fix already in stable/victoria branch so i dont need to add fix locally right?15:04
rohit02so i can rebuild the complete openstack15:04
noonedeadpunkI'm not sure since I don't see what error do you hit15:10
noonedeadpunkBecause reall error are mentioned only in log file15:10
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_keystone master: Drop Nginx webserver support  https://review.opendev.org/c/openstack/openstack-ansible-os_keystone/+/81739015:52
noonedeadpunkrohit02: also, I guess you know that CentOS 8 will enter EOL on 1st Januray 2022?15:54
opendevreviewMerged openstack/openstack-ansible-lxc_hosts master: Add CentOS 8.5 support  https://review.opendev.org/c/openstack/openstack-ansible-lxc_hosts/+/81853215:56
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_keystone master: Drop Nginx webserver support  https://review.opendev.org/c/openstack/openstack-ansible-os_keystone/+/81739015:58
rohit02noonedeadpunk: yes but for our product testing will need that.....what i observed now in lxc-cache-prep-commands log dnf command itself in hang state for a while16:01
rohit02[root@controller log]# tail -f lxc-cache-prep-commands.log16:01
rohit02(195/197): zlib-1.2.11-17.el8.x86_64.rpm        2.8 MB/s | 102 kB     00:0016:01
rohit02(196/197): yum-4.7.0-4.el8.noarch.rpm           2.3 MB/s | 205 kB     00:0016:01
rohit02(197/197): util-linux-2.32.1-28.el8.x86_64.rpm  6.4 MB/s | 2.5 MB     00:0016:01
rohit02--------------------------------------------------------------------------------16:01
rohit02Total                                            12 MB/s | 116 MB     00:0916:01
rohit02Running transaction check16:01
rohit02Transaction check succeeded.16:01
rohit02Running transaction test16:02
rohit02Transaction test succeeded.16:02
rohit02Running transaction16:02
noonedeadpunkhm16:10
noonedeadpunkwhat if you jsut increase timeout?16:10
noonedeadpunkhttps://opendev.org/openstack/openstack-ansible-lxc_hosts/src/branch/master/defaults/main.yml#L190-L19416:11
noonedeadpunkmaybe you have jsut slow machine or smth like that...16:11
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-lxc_hosts stable/wallaby: Add CentOS 8.5 support  https://review.opendev.org/c/openstack/openstack-ansible-lxc_hosts/+/81857716:15
noonedeadpunkmgariepy: oh, that _is_ interesting https://zuul.opendev.org/t/openstack/build/15e624950e2743cd95f2a6e460605b18/logs16:19
noonedeadpunkif you look at output, it seems that `/var/lib/mysql/sys/waits_global_by_latency.frm` is present as `Wait for MySQL Upgrade to finish` passes as OK16:19
noonedeadpunkBUT16:19
noonedeadpunkin https://zuul.opendev.org/t/openstack/build/15e624950e2743cd95f2a6e460605b18/log/logs/openstack/aio1_galera_container-3c8aeb78/mariadb.service.journal-15-47-19.log.txt this is not reflected...16:20
noonedeadpunkbut I haven't checked actually if this table present even without that upgrade, damn....16:20
noonedeadpunkso disregard...16:20
mgariepylol16:21
mgariepythe mysql_upgrade should be completed tho.16:21
mgariepybrb . need to reboot my laptop.16:24
mgariepynoonedeadpunk, could we have a anto-hold on timeout on this job ?16:41
mgariepyor.. can we add mysql show processlist to the logs before we exit ?16:42
spateljamesdenton around?17:11
jamesdentonyessssss17:11
spatelI am setting up SSL for OVN and having some issue, my issue is NB trying to connect to SB on same server using SSL 17:15
spateli believe that should be Unix socket correct? on same box we don't need them to use SSL to communicate 17:16
spateldoes OSA default use TCP port for NB/SB communication? 17:16
jamesdentoni believe tcp is used, yes17:18
spatelhmm.. 17:18
jamesdentonbut it's overridable if you want to try a new value17:18
jamesdentonneutron_ovn_sb_connection i think17:18
spatellets say i want to change it to unix socket right now just test my SSL issue.. what is the command?17:19
spatelhttps://opendev.org/openstack/openstack-ansible-os_neutron/src/branch/master/tasks/providers/setup_ovs_ovn.yml#L2917:21
jamesdentoni don't know what the socket file name would be17:21
spateloh that that config is for ovn-controller 17:21
jrosserisnt that a list?17:22
jamesdentonyou could try unix:/var/run/ovn/ovnsb_db.sock but i'm not sure that is the preferred approach17:23
jamesdentonand yeah, neutron_ovn_sb_connection is a list17:24
jrosserspatel: what actual issue do you have? seems unusal to treat the local connection specially when it's a cluster anyway17:24
jamesdentonthough these docs do mention a socket: https://docs.openstack.org/neutron/latest/configuration/ovn.html17:24
spateljamesdenton jrosser as soon as i turn on SSL for OVN i started seeing error in nb logs that not able to communicate with sb - https://paste.opendev.org/show/811202/17:26
jamesdentonhow did you enable it?17:26
spatelhttps://github.com/ovn-org/ovn-kubernetes/blob/master/docs/INSTALL.SSL.md17:26
spatelcurrently i am following OVN doc for PKI just to understand process. later we will deal with osa/pki17:27
jamesdenton6641:10.62.7.252: bind: Address already in use17:27
jamesdentonso maybe try a different port? or clear out the existing entires17:27
jamesdenton*entries17:27
spatelhmm 17:28
jrosseralso "received JSON-RPC data on SSL channel" <- one end talks not-ssl, the other expects ssl17:28
spatellet me first fix port and then see.. 17:31
jamesdentonjrosser have you looked at ASAP^2 recently?18:13
jamesdenton(and is it still a thing)18:13
jrossernot for a long time18:13
jrosserthe things we need off the mellanox hardware currently only work for SRIOV18:14
jrosseryou can't combine them with e-switch18:14
jamesdentoni see18:14
jrosserfocus for us is currently on bluefield18:15
jrosseri think that asap^2 is still certainly a thing on the datasheets for cx-618:16
jamesdentoni found some recent OSP 16.1 docs that mention it, too18:17
jamesdentonwondering if ovs firewall (security groups) is supposed or not. It wasn't at one time18:17
jrossermight be worth revisiting because seems the nova/neutron understanding of smartnic in general seems really much better than when we talked about this before18:19
jamesdentonindeed18:19
jamesdentongiven spatel's foray into sriov/dpdk, could be worthwhile for him, too18:19
jrosserhave you seen these too? https://review.opendev.org/q/topic:%22integration-with-off-path-network-backends%22+(status:open%20OR%20status:merged)18:19
jrosserthats for where the NIC is it's own linux system running effectively OVS in between the SFP and PCIe slot18:20
jamesdenton*mind blown*18:20
jrosserso ironic++ for example18:20
jrosserthere was also a talk at nvidia GTC last week where some university had implemented per-tenant-network crypto on a bluefield smartnic18:22
jamesdentontime to dust off the lab18:24
spateljamesdenton sure i can take that :)18:39
spateljamesdenton jrosser as soon as i remove OVN clustering and just use single node everything works with SSL (i believe i need to debug cluster SSL part)18:40
jamesdentongood to know18:41
jrosserare you sure the connection string is specifying ssl for each cluster member?18:41
spatelfirst i thought cluster doesn't need SSL communication because they have totally different ports to communicate 6643 and 6644. but somehow it doesn't like when i turn on SSL. (at this point i don't have much data but i will keep poking to find a workaround) 18:43
spateljamesdenton i am talking to one of INTEL developer to find issue related DPDK performance. He claimed that it should work and give you good boost in performance even with virtio.18:46
spatelI have zoomed with him and he was able to generate 7million packet using dpdk on single CPU core. now next step is to try with OVS switch and see where is the bottleneck 18:47
jamesdentonoh, awesome18:49
spatelbut truly speaking i don't like DPDK, lots of piece and hard to troubleshoot  :(18:50
spateljamesdenton i tried this but getting that error not sure from where its coming - https://paste.opendev.org/show/811203/20:49
spatelhttps://github.com/openshift/ovn-kubernetes/blob/master/docs/OVN-NORTHD.SSL.md20:51
spateljamesdenton damn it figure it out :)21:05
spateltrust me OVN document is all over the place and not very clear... 21:05
spatelLet me put this on my blog, next i am going to see how we can fit this lego pieces together using OSA/PKI 21:08

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!