Friday, 2024-03-22

opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_horizon stable/2023.2: Deploy default policy files  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/91381504:31
farbodHello, I get an error for deployments others than stable/2023.2 on bootstraping deployment host on task Install collection requirements with ansible galaxy. Is it normal?  07:26
jrosserfarbod: can you paste the output you get, maybe at paste.opendev.org so we can see what goes wrong?07:45
noonedeadpunkI don't think we have any issues with 2023.1 bootstrap - our CI's doing that look healthy09:17
opendevreviewMerged openstack/openstack-ansible-os_horizon master: Do not change mode of files recursively  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/91142009:28
jeroheya09:38
jeroiam trying to upgrade from 2023.1 to 2023.2 via ./scripts/run-upgrade.sh09:39
jerogiving me following fatal09:39
jerohttps://paste.opendev.org/show/baKix3QrmtEdV8V6a6JR/09:39
jeroany idea to further debug this?09:39
noonedeadpunkjero: hey09:45
noonedeadpunkI think I know the reason for it09:45
noonedeadpunkwhich is basically a gnocchi issue 09:45
noonedeadpunkbut huh, I thought it should be affecting only master...09:46
noonedeadpunkbasically talking about https://github.com/gnocchixyz/gnocchi/pull/137909:46
noonedeadpunkjero: but about debug - executing /openstack/venvs/ceilometer-28.0.2.dev7/bin/ceilometer-upgrade does write output to journalctl09:47
jerothanks for pointing in the journalctl direction - not thought about that :x09:52
jeroseems like the upgrade is not the only problem09:52
jerohttps://paste.opendev.org/show/bbczJvN3t5OyJfOj98ru/09:52
noonedeadpunkjero: yeah, exactly why I told about gnocchi09:53
noonedeadpunkI faced same issue like previous week, but the PR for gnocchi was solving issue iirc.09:54
noonedeadpunkbut sandbox was on master09:54
noonedeadpunkso worth checking gnocchi-api logs and I assume it's in a restart loop or smth09:54
noonedeadpunkah... and another issue were policies09:55
noonedeadpunkie https://review.opendev.org/c/openstack/openstack-ansible-os_gnocchi/+/91324409:56
noonedeadpunkbut that was also only for master I guess09:56
noonedeadpunkjero: so checking gnocchi journal can be helpful to see what's the exact issue10:01
jeroMar 22 10:00:50 infra1-gnocchi-container-4abde65d gnocchi-api[450494]: 2024-03-22 10:00:50,987 [450494] ERROR    oslo_policy._cache_handler: Config file not found /etc/gnocchi/policy.yaml: No such file or directory10:01
noonedeadpunkok, yeah10:01
* noonedeadpunk guessed10:01
jeroyes,you was right10:02
noonedeadpunkjero: out of interest - do smth like rm /etc/gnocchi/policy.*10:02
noonedeadpunkand restart service10:02
noonedeadpunkas I've seen this issue but failed to reliably replicate it10:03
noonedeadpunklike services should not need presence of policy.yaml but they might not like it just vanishing during runtime10:03
noonedeadpunkas we can potentially just miss just handler trigger here: https://opendev.org/openstack/openstack-ansible-os_gnocchi/src/branch/master/tasks/gnocchi_post_install.yml#L52-L5910:05
jerorm /etc/gnocchi/policy.* and systemctl restart gnocchi-api.service - no more errors10:14
jerobut still, when running /openstack/venvs/ceilometer-28.0.2.dev7/bin/ceilometer-upgrade throws the same rror10:15
noonedeadpunkhuh?10:15
noonedeadpunkhav you done that on all containers?10:16
noonedeadpunkand another thing - can you interact with gnocchi through cli? 10:18
jeroi did now10:24
jerono more errors10:24
jero:)10:24
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/xena: Switch SHAs to EOM  https://review.opendev.org/c/openstack/openstack-ansible/+/91341310:27
noonedeadpunkjero: I will push a patch now to restart service on policy file removal - I find it as most likely issue10:28
noonedeadpunkunless you wanna test it more and push the patch as well :)10:28
jerothanks for pointing me in the right direction10:29
jeroill rerun the upgrade-script10:29
noonedeadpunkjero: um, I'm not sure you can re-run it10:30
noonedeadpunkor it's worth doing10:30
noonedeadpunk(maybe you can though...)10:30
noonedeadpunkbut best would be to run remaining playbooks or just setup-openstack.yml10:30
noonedeadpunkas ceilometer is really pretty much close to the end of list10:31
jeroi see, oke lets see :)10:31
noonedeadpunkmeaning most services are already upgraded10:31
noonedeadpunkwhen run-upgrade.sh fails it throws out "left" things10:31
jerooh, yes i see10:33
jerogood to know10:33
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/wallaby: Switch SHAs to EOM  https://review.opendev.org/c/openstack/openstack-ansible/+/91341410:45
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/victoria: Switch SHAs to EOM  https://review.opendev.org/c/openstack/openstack-ansible/+/91341810:46
jerothere was a second error when running the upgrade-script: https://paste.opendev.org/show/bIwDsFo2gM6iPLJ6PKT5/10:56
jerofatal: [compute1]: FAILED! => {"msg": "Unexpected failure during module execution: 'mtime'", "stdout": ""}10:56
opendevreviewJonathan Herlin proposed openstack/openstack-ansible master: Fix inventory defenition for Cloudkitty  https://review.opendev.org/c/openstack/openstack-ansible/+/91226910:57
noonedeadpunkjero: ok, this one is slightly less trivial...11:08
noonedeadpunksounds like some file that is expected to be in repo is missing...11:08
gokhan__Hello folks, I have problems on ceph cluster. ceph -s gives no response. I checked mons everthing seems ok and quorum is correct. Cluster status is warning for some osds.on ceph dashboard I can monitor ceph cluster. Do you have any ideas? How can we troubleshoot this problem? 11:32
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-haproxy_server master: Do not resolve all host_vars when haproxy_backend_node is a mapping  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/91392711:36
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/xena: Switch SHAs to EOM  https://review.opendev.org/c/openstack/openstack-ansible/+/91341311:38
noonedeadpunkgokhan__: ceph -s from ... compute?11:44
noonedeadpunkthis can be smth like mtu issue 11:44
noonedeadpunkor firewall11:45
noonedeadpunkwe had that couple of times, but always that was smth related to networking11:45
gokhan__noonedeadpunk, it has worked before but after changing mon daemons it can't work again. I checked monmap it is ok 11:46
noonedeadpunkand new daemons are in ceph.conf I assume?11:48
gokhan__I am trying on both compute and infra nodes. 11:48
noonedeadpunkI guess I'm not sure what is meant under changing daemons11:49
noonedeadpunkas then it's kinda same questions - mtu and firewall...11:49
noonedeadpunkok, crap, seems we need to merge unmaintained patches without any real testing11:50
noonedeadpunkas branches are dropped now, so they're being checked-out to master....11:50
gokhan_noonedeadpunk, I am trying distribution upgrade. I had to change nodes which mons are running on.   11:50
noonedeadpunkwell, again - check that ceph.conf is updated if mon IPs changed. then that you generally can reach monitors through telnet or smth, and that storage interfaces have same mtus11:52
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/xena: Switch SHAs to EOM  https://review.opendev.org/c/openstack/openstack-ansible/+/91341312:03
opendevreviewMerged openstack/openstack-ansible-os_horizon stable/2023.2: Deploy default policy files  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/91381512:08
opendevreviewMerged openstack/openstack-ansible stable/2023.2: Determine if upgrade source branch is stable/ or unmaintained/  https://review.opendev.org/c/openstack/openstack-ansible/+/91243414:38
opendevreviewMerged openstack/openstack-ansible-os_neutron master: Restart OVN on certificate changes  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/91276214:42
opendevreviewMerged openstack/openstack-ansible-haproxy_server master: Imporove Jinja indentation for service templates  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/91392515:00

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!