Monday, 2022-05-16

*** ysandeep|out is now known as ysandeep|rover05:08
jrossergood morning07:40
*** ysandeep|rover is now known as ysandeep|rover|lunch08:02
noonedeadpunkmornings!08:20
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Add upgrade path from lsyncd to shared filesystem.  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/83941109:31
damiandabrowski[m]hey!09:48
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_nova master: Do not adjust libvirtd sysconfig for centos-9  https://review.opendev.org/c/openstack/openstack-ansible-os_nova/+/84186209:59
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_nova master: Do not adjust libvirtd sysconfig for centos-9  https://review.opendev.org/c/openstack/openstack-ansible-os_nova/+/84186210:01
*** ysandeep|rover|lunch is now known as ysandeep|rover10:01
jrosserok so i get a tempest pass on centos-9 metal locally here10:03
jrossernoonedeadpunk: ^^ next thing would be how we get an LXC install10:08
noonedeadpunkoh10:22
noonedeadpunkI'm trying to figure out wtf happened with centos-nfv-openvswitch package.... 10:23
noonedeadpunkalse seems like infra scenario is broken?10:23
noonedeadpunk `aodhclient 2.4.1 depends on pbr>=1.4  The user requested (constraint) pbr===5.8.1` ugh10:25
jrosserhrrm10:28
jrossernoonedeadpunk: do you have a link to a patch which breaks like that?10:28
jrosserhttps://opendev.org/openstack/python-aodhclient/src/branch/master/requirements.txt#L510:29
jrossera lot of infra jobs were broken because of this https://review.opendev.org/c/openstack/openstack-ansible/+/84159510:34
jrosseris there something else too?10:34
noonedeadpunkI'm not really sure why it's happened now. As requirement was there for a while and they're not contraversary10:35
noonedeadpunk* contradictory10:35
jrosserwhere is that breaking?10:36
noonedeadpunkIN your centos 9 patches, is https://zuul.opendev.org/t/openstack/build/487c4b5f64e043c8916a29b0d660bb64/log/job-output.txt#847010:37
jrosseroh wow10:41
jrosserlooking back through the #opendev logs, looks like there was a bunch of pypi CDN trouble on thursday11:01
jrosseralso mentioning trouble with pbr11:01
opendevreviewMerged openstack/openstack-ansible-os_nova master: Pass valid cert regen variable to pki role  https://review.opendev.org/c/openstack/openstack-ansible-os_nova/+/84161711:01
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Remove all code for lsync, rsync and ssh  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/83758811:03
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Clean up legacy lsycnd, rsync and ssh key config  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/83785911:03
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Use the same vars file for all versions of centos  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/84161811:03
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Use distro packages for nginx on centos.  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/84161911:03
*** dviroel|out is now known as dviroel11:21
jrosserhow do i debug which version of config_template we are using11:47
jrosserwell, which actual config_template gets called, i mean11:48
jrosserconfusion between ceph_ansible config_template vs plain config_template vs openstack.config_tempalate.config_template11:48
*** spotz_ is now known as spotz11:55
noonedeadpunkI can recall debugging this one... On master it's not ours for sure. And ours plugin/collection is exactly same thing12:41
noonedeadpunk(in terms of codebase)12:41
noonedeadpunkregarding ceph_ansible it's tougher. and it's likely the one that being used by default atm.12:42
noonedeadpunkI merged PR to ceph-ansible master (before deprecation) that replaces their config_template with our collection12:43
noonedeadpunkbut that will never be released now...12:43
*** ysandeep|rover is now known as ysandeep|rover|afk12:45
*** ysandeep|rover|afk is now known as ysandeep|rover13:11
jrossernoonedeadpunk: i looked at making config template understand properties files https://paste.opendev.org/show/beItOWLGjuuCJUmOt88e/13:32
jrossernot sure if that is a giant hack or not......13:32
jrosserit treats the properties as if they were already in the default section of an ini file13:33
noonedeadpunkjrosser: I wonder if calling them sysctl would make sense to anybody except me...13:38
noonedeadpunkbut having "default" section hack I saw somewhere on stackoverflow as a solution :D13:39
jrosserindeed - thats exactly what i copied13:39
jrosseri was also wondering if the large code we have in there for --diff actually works at all13:40
jrosserit doesnt appear to report any changes at all from what i see in diff mode13:43
noonedeadpunkI can imagine it being borked. I can recall spotting that when ara was missing diffs for it.13:44
noonedeadpunkthere was some tricky thing to fix it...13:46
*** ysandeep|rover is now known as ysandeep|out15:02
*** dviroel is now known as dviroel|lunch15:25
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Add upgrade path from lsyncd to shared filesystem.  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/83941115:30
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Remove all code for lsync, rsync and ssh  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/83758815:30
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Clean up legacy lsycnd, rsync and ssh key config  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/83785915:30
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Use the same vars file for all versions of centos  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/84161815:30
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Use distro packages for nginx on centos.  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/84161915:31
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-repo_server master: Use distro packages for nginx on centos.  https://review.opendev.org/c/openstack/openstack-ansible-repo_server/+/84161915:32
*** dviroel|lunch is now known as dviroel16:24
spateljrosser do you know what is going here? - https://paste.opendev.org/show/bmRI8tFkmylUMiAXptYN/16:28
spatelI reboot one of infra node and noticed that error16:29
jrosserbad superblock means that something is not formatted16:32
spatelhmm16:32
jrosseri have /var/lib/machines.raw on /var/lib/machines type ext4 (rw,noatime,nodiratime)16:33
spatelmount| grep machines 16:33
spatelnothing mounted16:33
spatelI have installed 3 infra node last week and today reboot 2 and noticed failed service on 2 rebooted node 16:34
spatel1 node which i didn't reboot yet showing mounted /var/lib/machines16:35
spatelI am 100% it will failed if i reboot last node16:35
jrosser`file /var/lib/machines.raw` ?16:36
spatel  /var/lib/machines.raw: Linux rev 1.0 ext4 filesystem data, UUID=b2c7accf-c3bb-4ab0-9301-371a099de544 (extents) (64bit) (large files) (huge files)16:36
spatelType=btrfs  in /usr/lib/systemd/system/var-lib-machines.mount ? I am running ubuntu 20.04  16:38
spatelwhy its saying ext4 then?16:38
jrosseri dont know16:38
spatelI just confirmed this on other production deployment and found service failed over there too 16:41
jrossersorry i'm just on the phone16:43
spatelnp, i am trying to figure out what is wrong 17:03
spatelhow critical its if that filesystem not mounted17:04
spateljrosser found issue and it was related Type=  https://paste.opendev.org/show/b1XI4qKoDks3u7A6XKNS/17:07
spatelwe need to patch it 17:07
jrosserdo you see a bug anywhere? https://github.com/openstack/openstack-ansible-lxc_hosts/blob/master/tasks/lxc_cache_rootfs.yml#L3617:08
jrosserthat looks hardwired to use ext417:09
spatelhmm very interesting 17:10
spatellook like bug somewhere.. may be we have override somewhere.. just guessing 17:11
jrosserthere is no variable there that i see to override17:14
spatelI am seeing same behavior on all my production deployment running ubuntu 17:15
spatelif you are not seeing in your deployment then i would be shocked :)17:17
jrosserit is the same on one here17:17
jrosseri don't yet see how anything except ext4 should be in that file though17:17
jrosseroh hold on17:20
spatelhttps://paste.opendev.org/show/by0761bd7XSyaIactsJx/17:21
spatelchasing this now...17:21
jrosserspatel: wait17:21
jrosseryou are looking at code for master branch there17:21
spateli am sure you found something :)17:21
jrosseryou're running Xena?17:21
spatelI am running 23.3.0 17:22
spatelhttps://paste.opendev.org/show/bfsoT63IuYAjM2E0W9SD/17:22
prometheanfiretwice now, I've gone from 24.0.0 to 24.2.0 and running setup-everything (specifically galera) has caused the galera cluster to fall on it's face17:23
spatelI am seeing this issue in every single tags of wallaby17:23
jrosserfor wallabty/xena systemd_mount role is not used17:23
spateloh ok17:23
jrosserspatel: it is super trivial https://github.com/openstack/openstack-ansible-lxc_hosts/blob/stable/wallaby/tasks/lxc_cache_rootfs.yml#L35-L3917:24
jrosserand the file is not even a template, so there is even less way for btrfs to end up in there https://github.com/openstack/openstack-ansible-lxc_hosts/blame/stable/wallaby/files/var-lib-machines.mount17:24
spatelyou are correct.. 17:25
spatellook at full content of file and look like its coming from somewhere else.. https://paste.opendev.org/show/bJabhynjHMXd5wTt6Y3o/17:27
jrosserspatel: and this is a fresh install of wallaby?17:27
spatelYes last week i deployed this environment 17:27
spatelfirst i use 23.2.2 and then you suggested to go with 23.3.0 to fix FD leak for oslo so i did minor upgrade 17:28
jrosserthis is unrelated to that17:29
jrosserwe have accidentally used a file that is shipped with systemd https://packages.ubuntu.com/search?suite=focal&arch=any&mode=exactfilename&searchon=contents&keywords=var-lib-machines.mount17:29
jrossernoonedeadpunk: ^17:29
spateljrosser you are correct, its not part of OSA deployment17:30
jrosserprometheanfire: you may have to be a bit more specific with some debugging info......17:30
jrosserspatel: maybe best to make a bug on launchpad for this, i don't have a quick idea of what the best fix is17:33
noonedeadpunkI wonder how accidentally it was17:34
noonedeadpunkconsidering it's part of systemd-container that I assume is nspawn 17:34
jrossernot sure, i think the btrfs stuff all has roots in nspawn17:34
jrosseryes17:34
jrosserbut we can't do btrfs any more anyway17:34
jrosseri wonder if it's possible to make a systemd drop-in for a mount to change type=17:35
noonedeadpunkI wonder why it's not in overrides though...  as like we intentionally override that17:35
spatelwhy OSA didn't override that file with this file. or may be it order issue - https://github.com/openstack/openstack-ansible-lxc_hosts/blob/stable/wallaby/tasks/lxc_cache_rootfs.yml#L35-L39 17:35
jrosserif you update systemd then this file could get overwitten17:36
jrosseror if systemd-container is installed later, perhaps17:36
noonedeadpunkHmm. I wonder if https://github.com/openstack/openstack-ansible-lxc_hosts/commit/337ddf87802c7597b7c51a071eb9829a4b87defa might be wrong then17:36
noonedeadpunkif we just need an override....17:36
jrosserwell, read the file though17:37
jrosser# This unit is required for pre-240 versions of systemd......17:37
jrosserso the behaviour is not always the same, depending on systemd version17:37
prometheanfirejrosser: just recovered, running the playbook again, will provide logs if it occurs again17:38
prometheanfireworked this time17:41
* prometheanfire shrugs17:41
*** dviroel is now known as dviroel|out20:51

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!