Tuesday, 2021-08-10

opendevreviewSatish Patel proposed openstack/openstack-ansible-os_neutron master: Set ovn hostname using nodename facts  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80398804:18
opendevreviewSatish Patel proposed openstack/openstack-ansible-os_neutron master: Fix OVN metadata protocol to point https  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80398904:18
opendevreviewSatish Patel proposed openstack/openstack-ansible-os_neutron master: Add support of OVN for CentOS-8-Stream  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80398704:18
opendevreviewSatish Patel proposed openstack/openstack-ansible-os_neutron master: Use list of cluster member for ovn ml2 agent to directly connect  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80399004:18
opendevreviewSatish Patel proposed openstack/openstack-ansible-os_neutron master: Set ovn hostname using nodename facts  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80398804:34
opendevreviewSatish Patel proposed openstack/openstack-ansible-os_neutron master: Fix OVN metadata protocol to point https  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80398904:34
opendevreviewSatish Patel proposed openstack/openstack-ansible-os_neutron master: Add support of OVN for CentOS-8-Stream  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80398704:34
opendevreviewSatish Patel proposed openstack/openstack-ansible-os_neutron master: Use list of cluster member for ovn ml2 agent to directly connect  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80399004:34
opendevreviewSatish Patel proposed openstack/openstack-ansible-os_neutron master: Replace centos-8.x jobs with centos-8-stream  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80403604:34
*** rpittau|afk is now known as rpittau06:15
opendevreviewMarcus Klein proposed openstack/openstack-ansible stable/ussuri: Backported fixes for healthcheck playbooks  https://review.opendev.org/c/openstack/openstack-ansible/+/80404407:22
opendevreviewMarcus Klein proposed openstack/openstack-ansible stable/ussuri: Backported fixes for healthcheck playbooks  https://review.opendev.org/c/openstack/openstack-ansible/+/80392607:25
opendevreviewMarcus Klein proposed openstack/openstack-ansible stable/ussuri: Backported fixes for healthcheck playbooks  https://review.opendev.org/c/openstack/openstack-ansible/+/80392607:28
opendevreviewMerged openstack/openstack-ansible-os_keystone stable/wallaby: Fix shibboleth compatibility for ubuntu 18.04  https://review.opendev.org/c/openstack/openstack-ansible-os_keystone/+/80355211:24
kleiniansible-hardening started to fail for me in production with U release. configuration for auditd.conf is somehow missing although it is defined in the vars directory of the role. Do you have any hints how to debug/resolve this?11:34
spateljrosser morning! 12:12
spatelThis is what i did to stack them, hope this is what you were talking about yesterday - https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80403612:12
opendevreviewAndrew Bonney proposed openstack/ansible-role-python_venv_build master: Add distro/arch to requirements file path  https://review.opendev.org/c/openstack/ansible-role-python_venv_build/+/80173812:22
opendevreviewAndrew Bonney proposed openstack/ansible-role-python_venv_build master: Add variables to set permissions for created files  https://review.opendev.org/c/openstack/ansible-role-python_venv_build/+/80407312:22
opendevreviewAndrew Bonney proposed openstack/openstack-ansible master: Fix permissions for files created on repo server  https://review.opendev.org/c/openstack/openstack-ansible/+/80407712:28
opendevreviewMerged openstack/openstack-ansible stable/victoria: haproxy: decrease check interval for letsencrypt back ends  https://review.opendev.org/c/openstack/openstack-ansible/+/80395612:54
opendevreviewMerged openstack/openstack-ansible stable/wallaby: haproxy: decrease check interval for letsencrypt back ends  https://review.opendev.org/c/openstack/openstack-ansible/+/80395512:54
spatelHas anyone upgraded from V -> W? I am seeing the git clone repo task failing here - failed: [localhost] (item={'name': 'ceph-ansible', 'scm': 'git', 'src': 'https://github.com/ceph/ceph-ansible'13:10
spatelI am assuming we don't need this task because it always failed - TASK [Clone git repos (parallel)]13:10
jrosserspatel: https://github.com/openstack/openstack-ansible/commit/82592edf6ff3b8c7e2825aa2e5977791f4cb283213:21
spateljrosser beauty! 13:30
spatelso this is pending to merge ?13:30
spatelanything red on screen freaks me out during upgrade :)13:31
spateljrosser got an error here and i don't have the pki file structure in my /etc/openstack_deploy/pki 13:42
spatelfailed: [os-lab-infra-1] (item={'name': 'ExampleCorpRoot'}) => {"ansible_loop_var": "item", "changed": false, "item": {"name": "ExampleCorpRoot"}, "msg": "file not found: /etc/openstack_deploy/pki/roots/ExampleCorpRoot/certs/ExampleCorpRoot.crt"}13:42
spatelThis is during setup-host playbook TASK [pki : Slurp up CA certificates from pki setup host (localhost)]13:44
jrosserspatel: those dirs should be on the deploy host13:49
spateli am on deploy host and nothing there 13:50
opendevreviewAndrew Bonney proposed openstack/openstack-ansible-rabbitmq_server master: Fix erlang upgrades for rabbitmq external_repo installs  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/80371113:50
spatel'/etc/openstack_deploy/pki': No such file or directory13:51
spatelI am following this document for my V -> W upgrade - https://docs.openstack.org/openstack-ansible/victoria/admin/upgrades/major-upgrades.html 13:52
jrosserthis has run? https://github.com/openstack/openstack-ansible/blob/master/playbooks/setup-hosts.yml#L1613:55
spateli have executed setup-hosts.yml and encountered that error, so i am assuming it ran, but let me run the individual playbook 13:57
spatelvery interesting.. that playbook created directory 13:58
spatelwhy didn't this run it? - openstack-ansible setup-hosts.yml --limit '!galera_all:!rabbitmq_all'13:58
spatellooks like using --limit does not allow that playbook to run 14:02
spatelin that case we should add an extra step to the V -> W upgrade to run just that playbook to set up pki 14:03
jrosseryou have excluded localhost with that limit14:19
jrosseroh14:20
jrosserspatel: https://zuul.opendev.org/t/openstack/build/10b59ed57bd14b848b11f94eded6805f/log/job-output.txt#14362-1438114:43
jrosseryou know you linked to the victoria upgrade instructions?14:44
spatelhold on in meeting 14:54
spatelsorry i am back now15:01
noonedeadpunk#startmeeting openstack_ansible_meeting15:01
opendevmeetMeeting started Tue Aug 10 15:01:51 2021 UTC and is due to finish in 60 minutes.  The chair is noonedeadpunk. Information about MeetBot at http://wiki.debian.org/MeetBot.15:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.15:01
opendevmeetThe meeting name has been set to 'openstack_ansible_meeting'15:01
noonedeadpunk#topic rollcall15:01
noonedeadpunk\o/15:02
spatelnoonedeadpunk is back that is big update :) 15:02
noonedeadpunkwell, unfortunately I'm not back for long :(15:02
noonedeadpunkhave tons of family business this week as well. I really hoped I would be online more these days, but this didn't work out :(15:03
noonedeadpunkhttps://review.opendev.org/c/openstack/openstack-ansible/+/803097 looks really bad. I think it's because we haven't backported neutron patch...15:04
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/wallaby: Bump OpenStack-Ansible Wallaby  https://review.opendev.org/c/openstack/openstack-ansible/+/80309715:04
noonedeadpunk#topic office hours15:05
noonedeadpunkso, what's happening forum?15:07
noonedeadpunkI'll be fully back only next week(15:07
jrossero/ hello15:07
jrosseri think we need this and its backport https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/79896015:08
noonedeadpunkbut for this I guess we need neutron patch spatel working on?15:09
jrosserfor master yes15:09
jrosserwhatever unblocks os_neutron needs to go first15:09
spateljrosser i have tried to stack my all neutron patches here, may need your input https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/804036/115:10
noonedeadpunkwell, its cherry-pick is not in a good shape either... https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/79888115:10
jrosserno, there is so much to fix in os_neutron sadly15:10
noonedeadpunkhttps://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/803987 looks like a light in the end 15:11
opendevreviewAndrew Bonney proposed openstack/openstack-ansible-rabbitmq_server master: Fix erlang upgrades for rabbitmq external_repo installs  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/80371115:12
noonedeadpunkSo I'd say we can set jobs as NV in https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/804036/1 ?15:12
noonedeadpunkand do bunch of rebases?15:12
spatelnoonedeadpunk yes that patch is part of a stack of multiple patches and i need to understand from you guys why the other patches failed and how to solve their inter-dependency when they are in a chain 15:13
jrosserspatel: the first patch is tested on its own15:13
jrosserand is not enough to make the tests pass15:13
noonedeadpunkyeah ^15:13
jrosserthe first and second then get tested together15:13
jrosserthen first / second / third, and so on15:14
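[Editor's note] The way jrosser describes testing of a chain of dependent changes can be sketched as follows. This is only an illustration of the "first, then first + second, then first + second + third" pattern from the messages above; the function name and the placeholder change labels are hypothetical and are not taken from Zuul or Gerrit.

```python
from typing import List


def cumulative_test_sets(stack: List[str]) -> List[List[str]]:
    """For each change in a stack, list the changes that get tested together.

    The change at position i is tested on top of every change before it,
    so a failure that only a later change fixes will still fail earlier ones.
    """
    return [stack[: i + 1] for i in range(len(stack))]


# Placeholder labels, oldest change first (hypothetical, for illustration only).
stack = ["first", "second", "third"]
for tested_together in cumulative_test_sets(stack):
    print(" + ".join(tested_together))
```

Read this way, the first change in the chain has to pass CI without help from the changes stacked on top of it, which is why marking the failing jobs as non-voting was suggested before merging the chain.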
jrosserlets just do this now?15:14
noonedeadpunkso let's set failing jobs as NV, merge all that and revert NV patch at the end of the day15:14
noonedeadpunkBut backporting that would be meh...15:14
jrosserthen work out what to do with W too15:14
jrosseryeah, as there is centos and centos-stream there15:15
jrosserand we kind of need to decide what we do/dont support15:15
noonedeadpunkW is upgrade for Centos to Stream, right?15:16
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_neutron master: Replace centos-8.x jobs with centos-8-stream  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80403615:16
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_neutron master: Set ovn hostname using nodename facts  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80398815:17
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_neutron master: Fix OVN metadata protocol to point https  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80398915:17
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_neutron master: Add support of OVN for CentOS-8-Stream  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80398715:17
spatelcentos-8.4 has broken OVS job, question is should we fix that or drop CI job and continue working with stream ? 15:17
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_neutron master: Use list of cluster member for ovn ml2 agent to directly connect  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80399015:17
noonedeadpunkOn W I'd say it would be great to fix that, but actually I kind of missed the reason in the first place. Smth related to some repos?15:18
jrosserits several things, we need to fix ubuntu OVN on W regardless15:18
noonedeadpunkyes, totally15:19
jrosserwe can decide if we add centos-8-stream OVN to W or not - or we say that is a feature from X onward15:19
spatelYes OVN repo is broken and this patch can fix it so i cut one patch to fix it but question is should we keep going with this kind of hack or drop it - https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/80240215:19
jrosserand then there is the old centos-8 OVS jobs15:19
jrosserit feels like this kind of delays 23.1.015:20
noonedeadpunkspatel: I'd say this patch would be relevant only for W and V?15:20
noonedeadpunkNo reason to merge that to master anyway15:21
jrosseri think this was from before we dropped the old centos jobs from master15:21
spateli believe on W because i got issue with that, i didn't try on V yet 15:21
noonedeadpunkyeah15:21
spatelCentOS community moved their repo to stream and that is why its failing to find latest OVS version for centos-8.4 15:22
jrosserlets deal with one thing at a time :/15:23
jrossertoo many moving parts15:23
spatel+115:23
noonedeadpunkOk, another thing. I was wondering if you have any thoughts regarding pain point, that could be selected as next community goals? 15:24
noonedeadpunk#link https://etherpad.opendev.org/p/pain-point-elimination15:24
spatelCould you guys review and bump this patch, it will fix broken centos-8-stream distro deployment - https://review.opendev.org/c/openstack/openstack-ansible-openstack_hosts/+/80347515:27
jrosserspatel: did you see i left a comment on the depends-on?15:29
spatelJust replied - the vars directory doesn't exist in the chroot environment, so when i tried /etc/yum/vars it failed to create the vars directory. maybe we need to create the /etc/yum/vars directory first and then copy the contents 15:29
spatelthe chroot doesn't have the /etc/yum directory either 15:30
spatelso it looks like we first need to create empty /etc/yum and /etc/yum/vars 15:30
*** owalsh_ is now known as owalsh15:31
noonedeadpunkI commented mentioned patch as well15:31
noonedeadpunkAlso I'd love to ask for some reviews of https://review.opendev.org/q/topic:%22bump_osa%22+(status:open)15:31
noonedeadpunkAs we didn't have a release for quite a while15:32
spatelnoonedeadpunk - nova has dependency requirement of some foo packages and that is part of this repo, without this repo it won't let you install nova components 15:34
spatelthat is why distro installation failing 15:34
jrosseroh yes this is all nasty15:34
jrosserit's actually errors on the compute nodes that this fixes15:34
noonedeadpunkdoh15:34
noonedeadpunkthat is soooooo stupid....15:34
spatelYes.. 15:34
noonedeadpunkgood way to push ppl out of lxb...15:35
spatelRPM stuff, someone may have declared foo as required in the spec file and that is why we end up here 15:35
noonedeadpunkok, thanks for explanation15:35
spateli think it's totally safe to add the repo, it's not going to hurt anything so far15:36
noonedeadpunkconsidering it's for distro setup only - I don't care _that_ much15:37
spateltotally! that patch only adds the repo, no matter source or distro 15:39
noonedeadpunkuh, that's worse.15:39
noonedeadpunkas eventually we're trying to control packages that are to be installed from repos that are not present by default15:40
spatelwe can put condition if that is better way to handle it 15:40
noonedeadpunkthat's why we drop epel-release there iirc15:40
noonedeadpunkas well as centos-release-storage-common15:41
spatelhmm, previously i created this patch to solve the distro issue but then abandoned it - https://review.opendev.org/c/openstack/openstack-ansible-os_nova/+/80336815:41
jrosserwell the nova role doesn't run on network nodes though? so we end up with weird differences in OVS repo between hosts15:42
spatelif we are concerned then let's put a condition == distro and push that patch out after review 15:44
noonedeadpunkyeah15:45
noonedeadpunkwe can really do like (service_install_method | default('source') == 'distro') | ternary('present', 'absent')15:45
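[Editor's note] The expression noonedeadpunk proposes chains the `default` and `ternary` filters. A plain-Python model of its logic is sketched below, assuming the result is used as a package or repo `state`; that use, and the function name `repo_state`, are assumptions for illustration. `None` stands in for an undefined variable here, which is a simplification of how the `default` filter treats undefined values.

```python
from typing import Optional


def repo_state(service_install_method: Optional[str] = None) -> str:
    """Model of: (service_install_method | default('source') == 'distro')
                 | ternary('present', 'absent')
    """
    # default('source'): fall back to 'source' when the variable is not set.
    method = "source" if service_install_method is None else service_install_method
    # ternary('present', 'absent'): first value when the condition is true.
    return "present" if method == "distro" else "absent"


for value in (None, "source", "distro"):
    print(value, "->", repo_state(value))
```

Under this reading the extra repo would only be kept on distro installs, which matches the concern raised above about not adding non-default repos to source deployments.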
spatelok 15:46
opendevreviewDmitriy Rabotyagov proposed openstack/ansible-role-vault master: Initial commit to Vault role  https://review.opendev.org/c/openstack/ansible-role-vault/+/80079215:48
noonedeadpunkbtw, have any folks been coming with questions regarding ^ ?15:53
noonedeadpunkor they just PMing me regardless....15:53
noonedeadpunkok, whatever actually:)15:58
noonedeadpunk#endmeeting15:58
opendevmeetMeeting ended Tue Aug 10 15:58:15 2021 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)15:58
opendevmeetMinutes:        https://meetings.opendev.org/meetings/openstack_ansible_meeting/2021/openstack_ansible_meeting.2021-08-10-15.01.html15:58
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/openstack_ansible_meeting/2021/openstack_ansible_meeting.2021-08-10-15.01.txt15:58
opendevmeetLog:            https://meetings.opendev.org/meetings/openstack_ansible_meeting/2021/openstack_ansible_meeting.2021-08-10-15.01.log.html15:58
opendevreviewDamian Dąbrowski proposed openstack/openstack-ansible master: Implement RabbitMQ cluster rolling restart feature  https://review.opendev.org/c/openstack/openstack-ansible/+/80410916:19
*** rpittau is now known as rpittau|afk16:31
spatelquick question: what disk cache mode are you guys using for good disk performance? i am seeing this in my virsh dumpxml <instance> 16:51
spatel<driver name='qemu' type='qcow2' cache='none' discard='ignore'/>16:51
spateljrosser i got this error  - https://paste.opendev.org/show/807985/16:52
spatelduring this playbook run - openstack-ansible setup-infrastructure.yml -e 'galera_upgrade=true' -e 'rabbitmq_upgrade=true'16:53
jrosserrestart galera by hand in the container16:53
spatelhmm16:54
spatelthen run the playbook?16:55
jrossercould be this https://jira.mariadb.org/browse/MDEV-2482916:55
spatelwhat if i don't want to upgrade galera if this version is buggy?16:56
spatelnow it failed here - https://paste.opendev.org/show/807986/ should i use -e galera_ignore_cluster_state=true ?16:57
spatelwhy don't we have default instead of saying you don't have?16:58
jrosser?16:59
spatelI am saying i never set wsrep_cluster_name as far as i know then why its saying its different now ?17:01
spateli didn't check play just asking question, may be this is recent change which i am not aware17:01
jrosserno it's always been like this17:03
jrossereach database instance has a cluster name which is defined in the defaults17:03
spatelthen why did i encounter this error in upgrade but not first installation 17:04
jrosserthe error is that the db you just installed doesnt have the right cluster name, which is normally set during bootstrapping the db cluster17:04
jrosseris the galera instance you just upgraded a member of the cluster?17:05
spatelno its my LAB which has single controller or infra node but multiple compute nodes 17:06
spatelin short single infra node17:07
spateli have added wsrep_cluster_name = "openstack_galera_cluster" in user_variables.yml and re-running same play..17:11
jrosseri don't know - there are upgrade jobs running for all CI, so this does work17:14
spatelstill getting same error let me understand what is going on and debug 17:16
spateljrosser issue was - SQL Error 1040: Too many connections 17:23
spatelmy.cnf has max_connections = 200 17:24
spateljrosser this is very odd now i don't have data in mysql - https://paste.opendev.org/show/807988/17:31
spatelnevermind.. look like root account can't see other databases17:32
spatelroot is powerless account now 17:32
spateli can see data after i use mysql -u nova -p 17:33
spatelnow i stuck here - https://paste.opendev.org/show/807989/ 17:40
spateldebugging.. 17:40
jrosserspatel: this https://jira.mariadb.org/browse/MDEV-2503017:56
jrosserthere is now an 'admin' user instead of root17:57
opendevreviewMerged openstack/openstack-ansible stable/ussuri: Bump OpenStack-Ansible Ussuri  https://review.opendev.org/c/openstack/openstack-ansible/+/80309917:58
spateljrosser what is the point to create admin instead or root? security?18:00
jrosserwe previously made changes to the built-in root user in the db, which is kind of unsupported18:01
jrosseryou're supposed to make another user and give that the rights you want18:01
spateloh! 18:01
jrosserfrom the release notes `Galera privileged username has changed from root to admin. Old ‘root’@’%’ user can be removed after upgrade process.`18:02
spatelany idea about this play -  https://paste.opendev.org/show/807989/  going to turn on logging to see what is going on 18:02
jrosseri don't know18:03
spateljrosser thank you! didn't read release notes yet :) 18:03
jrosserif your database was in trouble before the upgrade with too many connections, then i'm not sure whats happening now18:03
spateli have bumped connections to 1000 and now i am not seeing the too many connections issue18:04
opendevreviewMerged openstack/openstack-ansible stable/victoria: Bump OpenStack-Ansible Victoria  https://review.opendev.org/c/openstack/openstack-ansible/+/80309818:05
jrosserspatel: right, but your upgrade did go strange?18:06
spatelonly has issue with mysql but after restarting i didn't see issue 18:06
spateli am thinking of downgrading the mysql version to the original, this one might have some unknown bugs 18:07
jrosseri'll be sticking with the W released version18:09
jrosserthese point upgrades of galera have happened on every OSA major release18:09
jrosserwe've just been very unlucky with a bug in both 10.5.8 and 10.5.918:10
jrosserif you stick with 10.5.8 you will stick with the "does not start up every so often" mutex error which is worse than the broken grant stuff, which there is a workaround for in the code18:10
spatelopenstack doesn't care about mysql version right? no dependency at all 18:11
spatelbefore upgrade i was running 10.5.6 and it was super stable 18:12
spateldo you think 10.5.6 also has issue ?18:15
spateljrosser very strange error here - https://paste.opendev.org/show/807991/18:18
spateldid you see that before?18:18
jrosseri guess /openstack/venvs/utility-23.0.0/bin/python: not found18:20
jrosser^ is that there?18:21
spatel'/openstack/venvs/utility-23.0.0/bin/python': No such file or directory18:21
spateleven i don't have this dir also /openstack/venvs/utility-23.0.018:22
jrosserso thats the problem, you're missing the utility venv, or the path is wrong..... i just read the errors :)18:22
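[Editor's note] The check jrosser is walking through (is there a utility venv for the release being deployed?) can be sketched in Python. The `/openstack/venvs/utility-<version>/bin/python` layout is taken from the error message above; the helper names are hypothetical and this is not part of openstack-ansible.

```python
from pathlib import Path
from typing import List

PREFIX = "utility-"


def utility_venv_python(version: str, venvs_root: str = "/openstack/venvs") -> Path:
    """Path of the interpreter the failing task was looking for."""
    return Path(venvs_root) / f"{PREFIX}{version}" / "bin" / "python"


def venv_is_missing(version: str, venvs_root: str = "/openstack/venvs") -> bool:
    """True when the utility venv interpreter for this version does not exist."""
    return not utility_venv_python(version, venvs_root).exists()


def installed_utility_versions(venvs_root: str = "/openstack/venvs") -> List[str]:
    """Versions that have a utility venv directory under venvs_root."""
    root = Path(venvs_root)
    if not root.is_dir():
        return []
    return sorted(p.name[len(PREFIX):] for p in root.glob(f"{PREFIX}*") if p.is_dir())


if __name__ == "__main__":
    print("installed:", installed_utility_versions())
    print("23.0.0 missing:", venv_is_missing("23.0.0"))
```

Run on the host named in the error, a list of installed versions that does not include the version in the failing path would point at the venv never having been built for the new release.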
spateldid i miss some steps? 18:23
spateli am just following our official doc for upgrade18:23
jrosserdo you have anything in /openstack/venvs/ on 10.65.6.17 ?18:23
spatelhttps://paste.opendev.org/show/807992/18:24
spatellet me check on that specific host 18:25
spateljrosser i have this - https://paste.opendev.org/show/807993/18:25
spateli don't have 23.0.0 18:26
jrosserright, so you have some kind of mixture of checking out 23.0.0 tag and then checking out stable/wallaby (or some other point) and not doing bootstrap-ansible18:26
jrosseror something like that18:27
jrosseryou've done part of the deployment from 23.0.018:27
jrosseroh not 23.x18:28
spatelLet me tell you what i did 18:28
jrosserbut you know what i mean18:28
spatelfirst i upgrade my V with minor version using this checkout ( git checkout tags/22.2.0 -b stable/victoria ) 18:28
spatelonce i upgraded to 22.2.0 then i did checkout 18:28
jrosserright, and we see the utility venv for that utility-22.2.018:29
spatelgit checkout tags/23.0.0 -b stable/wallaby18:29
spatelbased on this doc i did run bootstrap - https://docs.openstack.org/openstack-ansible/victoria/admin/upgrades/major-upgrades.html18:29
spatel${SCRIPTS_PATH}/bootstrap-ansible.sh18:29
spatelI am doing manual upgrade method instead of script18:30
jrosserone of the first things in setup-infrastructure.yml should have been to build the utility container and it's venv18:31
jrosserwhats happened to that?18:31
spatelhmm 18:31
jrosseroh18:31
jrosserwell it's not first is it https://github.com/openstack/openstack-ansible/blob/master/playbooks/setup-infrastructure.yml#L2318:31
spatellet me see.. i think i know what went wrong, when my mysql failed i did restart but then didn't run that playbook again 18:32
jrosserok, like i say the same steps are done in the upgrade CI jobs18:32
spatelre-running - openstack-ansible setup-infrastructure.yml -e 'galera_upgrade=true' -e 'rabbitmq_upgrade=true'18:33
spatelso there is no other solution to fix this issue? TASK [galera_server : Create galera users]18:42
spatelplaybook always throwing error here and as you said restart mysql 18:43
spateldon't we need that play to create users etc..? 18:45
spateljust curious why its not failing in CI job18:55
admin1spatel you can do like vvvv and also edit the playbook to unhide the exact message 18:56
spatelall i can see that play trying to create users using root@localhost account and that failing because we don't use root account18:57
admin1it does set the root password to what is in the user_secrets 18:58
spatelhttps://paste.opendev.org/show/807994/18:58
spateljrosser mentioned earlier osa now using admin account for mysql instead of root 18:59
spateljrosser should i leave this failure here and just move to next playbook, if this play doesn't matter for upgrade19:00
admin1it looks like its trying to login as root and doing something, ( as in a newly installed mysql ) but failing due to that 19:02
admin1if its new, dropping galera containers and redoing it might help 19:03
spatelThis is V -> W Upgrade i am testing so it has data 19:03
admin1oh .. 19:03
admin1in that case, what i would do is check the secrets and add those accounts manually in the db 19:04
admin1so that when the playbooks try to run, they don't get an issue due to access not being proper 19:04
b1tsh1ft3rHey all, I removed a compute node from the cluster because the hardware failed. After running the playbook to remove the node and then re-running the play to re-provision the hardware and bring it back into the cluster, launching instances on it or migrating to it doesn't work at all. I can't get any instances to launch on the new node. I'm also not seeing any sort of errors in the logs or ui!19:05
admin1b1tsh1ft3r, check the nova and libvirt logs19:06
admin1they will tell why it fails 19:06
admin1or the scheduler if you are not even getting it to schedule to that node19:06
spateli can tell you problem is part of placement :) 19:07
spatelb1tsh1ft3r ^19:07
spatelyou may need to delete the node from the resource providers and restart nova-compute, which will re-generate a new UUID19:07
spateli had that kind of issue many times when i removed and added compute nodes 19:08
b1tsh1ft3rso, i've gone through the process and shut down nova-compute and neutron-linuxbridge-agent, then run the node removal playbook (appears to remove it from the hypervisors list in the ui and the back end service list). Is that not the same as what you just mentioned?!19:11
spatelb1tsh1ft3r check this - openstack resource provider list19:13
spatelif you see your compute there, delete it from the resource providers 19:14
spatelthen go back to the compute node and restart the service, which will re-register itself with a new UUID 19:14
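[Editor's note] spatel's suggestion boils down to: list the resource providers, look for an entry left over for the re-added compute node, delete it, then restart nova-compute. A minimal Python sketch of the lookup step is below. It assumes the provider list was exported as JSON (for example with `openstack resource provider list -f json`) and that each record has `uuid` and `name` keys; the sample hostnames and UUIDs are made up for illustration.

```python
import json
from typing import Dict, List


def providers_for_host(providers_json: str, hostname: str) -> List[Dict]:
    """Return the resource provider records whose name matches the hostname."""
    providers = json.loads(providers_json)
    return [p for p in providers if p.get("name") == hostname]


# Hypothetical sample data, not taken from the log.
sample = json.dumps([
    {"uuid": "11111111-1111-1111-1111-111111111111", "name": "compute-1", "generation": 4},
    {"uuid": "22222222-2222-2222-2222-222222222222", "name": "compute-2", "generation": 9},
])

for provider in providers_for_host(sample, "compute-2"):
    print("leftover provider:", provider["uuid"])
```

Any provider this returns for the re-provisioned host would be the candidate to delete before restarting the compute service.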
b1tsh1ft3rgotcha. let me give it a try19:16
admin1is deleting the lxc containers enough to start fresh ? 19:17
admin1do i need to delete anything else 19:17
admin1things like clearing out dnsmasq etc entries  ..  19:19
spatelcleaning up lxc is enough 19:21
b1tsh1ft3rspatel: Looks like removing the resource provider and re-starting nova resulted in the same uuid being generated19:21
admin1it will be in a new id i think 19:21
spatelgeneration vs uuid 19:22
spatelboth are same?19:22
b1tsh1ft3rid is new it looks like. uuid is the same19:22
spatelthat is good 19:23
spateljust build your vm and you should be good 19:23
admin1does osa support ironic ? 19:29
admin1how is it checked in ci ? 19:29
spateli don't think we have CI job for ironic for OSA19:34
spatelit does have a role but i never tested it because it requires special things like a PXE vlan and ilo access etc.. 19:35
spatelb1tsh1ft3r does that solve your problem?19:47
admin1my lxc containers are not getting ip address . not on eth0 ( dnsmasq is running ) and also not on the eth1 19:49
admin1the setup-hosts just passes all steps and fails when trying to download packages, coz there is no ip address 19:49
admin1if i go inside any container and do ip link, i dont see eth0 or eth1, though I see that on  the lxc configs 19:50
b1tsh1ft3rspatel: no, i think it might just be easier to give the compute node a new name and just run the plays again and bring it in as a new node tbh. Is there an easy way to regenerate the inventory ?!19:50
admin1a reboot in one fixed it . not sure what had happened 19:52
spatelb1tsh1ft3r what is your log saying 19:53
spatelcompute logs and then check placement logs 19:53
spatelalso check openstack compute service list and see if any duplication there 19:55
spatelThis mysql 10.5.9 is a disaster :(20:02
spatelit is not letting me set permissions, so i am not able to move forward20:02
admin1spatel, you can put it in safe mode and play with permissions 20:19
spateli am going to upgrade to the 10.6.4 stable release.. i don't want hand fixes in production 20:21
admin1i thought you were in staging 20:23
admin1where some stuff like this is allowed20:23
spatelmy lab is close to production 20:23
admin1ok20:23
spatelgood luck 10.6.4 also throwing same error :)20:27
spateli gotta go, i will look into it later but it looks like the upgrade isn't that easy (especially with mysql)20:27
opendevreviewMerged openstack/openstack-ansible stable/wallaby: Bump OpenStack-Ansible Wallaby  https://review.opendev.org/c/openstack/openstack-ansible/+/80309720:39
jrosserspatel: your original galera playbook failed because you had the too many connections error21:23
jrosserthat means it did not run the handlers at the end of the play which fix up the 10.5.9 problems21:24
jrosserthis https://github.com/openstack/openstack-ansible-galera_server/blob/master/handlers/galera_upgrade_hook.yml21:24
opendevreviewMerged openstack/openstack-ansible-os_murano stable/train: Remove Debian jobs  https://review.opendev.org/c/openstack/openstack-ansible-os_murano/+/80399523:00
opendevreviewMerged openstack/openstack-ansible-os_murano stable/ussuri: Remove Debian jobs  https://review.opendev.org/c/openstack/openstack-ansible-os_murano/+/80399623:00
opendevreviewMerged openstack/openstack-ansible-os_murano stable/victoria: Remove Debian jobs  https://review.opendev.org/c/openstack/openstack-ansible-os_murano/+/80399723:01
opendevreviewMerged openstack/openstack-ansible-os_murano stable/wallaby: Remove Debian jobs  https://review.opendev.org/c/openstack/openstack-ansible-os_murano/+/80399823:01
opendevreviewMerged openstack/openstack-ansible-os_murano master: Remove Debian jobs  https://review.opendev.org/c/openstack/openstack-ansible-os_murano/+/80399923:02
