Thursday, 2023-02-16

jrossermorning09:24
noonedeadpunk\o/09:31
damiandabrowskihi!10:06
damiandabrowskiyesterday I noticed that it's not possible to use "item is defined" conditional for a loop: https://paste.openstack.org/show/bQkDFF9AFI3a4jktMH4M/10:09
damiandabrowskidid you know about it? do you have any alternative in mind?10:09
damiandabrowskii tried this trick with setting default value but it also doesn't work10:10
damiandabrowskihowever, everything works fine when using "with_items"10:10
jrosserdamiandabrowski: well with_items and loop are not directly equivalent https://docs.ansible.com/ansible/latest/playbook_guide/playbooks_loops.html#with-items10:15
damiandabrowskiyeah, i tried using flatten but it didn't work10:16
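Editor's note: the difference jrosser points at is that with_items implicitly flattens its input by one level, while loop takes the list as given. The Ansible documentation expresses the equivalent as loop combined with flatten(1). The sketch below only models that one-level flattening in plain Python; flatten_one_level is a hypothetical helper name, not part of Ansible.

```python
def flatten_one_level(items):
    """Model the single-level flattening that with_items applies to its input."""
    result = []
    for item in items:
        if isinstance(item, (list, tuple)):
            # A nested list is expanded once; deeper nesting is left alone.
            result.extend(item)
        else:
            result.append(item)
    return result


if __name__ == "__main__":
    nested = ["a", ["b", "c"], [["d"]]]
    print(flatten_one_level(nested))  # ['a', 'b', 'c', ['d']]
```

As the rest of the conversation shows, this difference turned out not to be the cause of the failure here.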
jrosserinteresting10:16
jrosserwhich ansible version are you using there?10:17
damiandabrowskis/didn't work/didn't solve the issue/10:18
damiandabrowski2.13.410:18
jrosserhmmm https://paste.opendev.org/show/bIO3iaQM6wMRiZ6J2uSB/10:19
damiandabrowskiO.o that's weird10:21
jrosserthats just regular ansible in a fresh venv, nothing to do with OSA install at all10:22
damiandabrowskisomething was wrong in my /root/.ansible/plugins (it's my AIO environment)10:22
damiandabrowskii removed this folder and now everything works as expected10:22
jrosserwow :) surreal result that gave10:22
damiandabrowskibut the interesting thing is that I didn't do anything special on that VM (except deploying openstack and executing ./run_tests.sh on several repositories)10:25
damiandabrowskianyway, thanks for your help! yesterday i spent a few hours trying to figure out what is going on :D 10:25
jrosseroh doesnt run_tests.sh on things like roles use tox to setup venvs and ansible?10:26
jrosseri can totally see that putting things in ~root/.ansible10:26
damiandabrowskiyeah, exactly10:27
damiandabrowskiI'm trying to find what part of plugins directory broke my playbook10:28
damiandabrowskiso I noticed that removing /root/.ansible/plugins fixed issue only for ansible-playbook command. openstack-ansible still didn't work10:46
damiandabrowskito fix openstack-ansible I had to comment out: `export ANSIBLE_STRATEGY="${ANSIBLE_STRATEGY:-openstack.osa.linear}"` in /usr/local/bin/openstack-ansible.rc10:46
damiandabrowskithat's getting really weird10:46
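Editor's note: the quoted rc line uses the shell default expansion ${VAR:-default}, which falls back to the default when the variable is unset or empty. A minimal Python sketch of that rule follows; resolve_strategy is a hypothetical name, and the assumption is that the rc file is evaluated as ordinary POSIX shell with the caller's environment visible.

```python
DEFAULT_STRATEGY = "openstack.osa.linear"  # default taken from the quoted rc line


def resolve_strategy(env):
    """Mimic ${ANSIBLE_STRATEGY:-default}: use the default when unset OR empty."""
    value = env.get("ANSIBLE_STRATEGY", "")
    return value if value else DEFAULT_STRATEGY


if __name__ == "__main__":
    print(resolve_strategy({}))                               # openstack.osa.linear
    print(resolve_strategy({"ANSIBLE_STRATEGY": ""}))         # openstack.osa.linear
    print(resolve_strategy({"ANSIBLE_STRATEGY": "linear"}))   # linear
```

Under that assumption, exporting ANSIBLE_STRATEGY=linear (Ansible's built-in strategy) before running the wrapper should select the stock strategy without editing the rc file, which may be a less invasive way to narrow down the problem than commenting the line out.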
opendevreviewDamian Dąbrowski proposed openstack/openstack-ansible-haproxy_server master: Prepare haproxy role for separated haproxy config  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/87118813:50
opendevreviewDamian Dąbrowski proposed openstack/openstack-ansible master: Prepare service roles for separated haproxy config  https://review.opendev.org/c/openstack/openstack-ansible/+/87118913:52
opendevreviewDamian Dąbrowski proposed openstack/openstack-ansible master: Prepare service roles for separated haproxy config  https://review.opendev.org/c/openstack/openstack-ansible/+/87118913:54
jrosserNeilHanlon: do you know if there will be cloud-init v22.2+ in Rocky9 ?16:01
NeilHanlonunlikely, seems centos stream 9 is still on 22.116:03
NeilHanlonare there any specific patches? I can try and get them into c9s16:04
NeilHanlonI have this one open for c8s which is also valid for c9s; https://bugzilla.redhat.com/show_bug.cgi?id=2163657 16:04
spatelQuestion on rescue instance, first time trying it and it's asking me to select an image. should i be selecting the same image which i used originally? 16:08
spatelHow does rescues stuff work?16:08
noonedeadpunkspatel: it depends :)16:11
noonedeadpunkdepends if you're booted from ephemeral or from volume16:12
spatel?16:12
spatelI booted from ephemeral from ceph backend 16:12
noonedeadpunkas for BFV you might need to have a specific tags set16:12
noonedeadpunkyeah, then you can use same image16:12
jrosserNeilHanlon: we run across this sort of thing https://bugzilla.redhat.com/show_bug.cgi?id=203606016:12
jrosser-ish16:12
spatelnoonedeadpunk my ceph storage went down because of power and after that my VMs started complaining - blk_update_request: I/O error, dev sda, sector 233000 op 0x1: (WRITE) flags 0x800 phys_seg 8 prio class 0  16:16
spatelTrying to run fsck using rescue disk.. 16:16
noonedeadpunkbasically what it will do - it will add image as first drive to the VM and will boot it16:16
spatelI did boot from rescue but getting same error I/O error..16:16
noonedeadpunkwell, you can try using different image then....16:18
noonedeadpunkBut is ceph cluster reporting as healthy?16:18
spatelCeph is showing all good. 16:18
spatelI can spin up new VMs etc.. all good but only old vms impacted with filesystem errors16:19
noonedeadpunkBut recovery is eventually what you need16:22
spatelAm i missing something in ceph config.. to prevent data corruption with power outage?16:23
noonedeadpunkspatel: you can try a hacky way, like rbd map on the hypervisor, which will create a device16:23
spatelYou guys are running ceph so you must have more knowledge compared to me :)16:24
spatelwhat are the basic things i should care about to ensure data integrity 16:24
noonedeadpunkum... I think you can't do anything as long as any caching is around16:26
noonedeadpunklike writeback for libvirt16:26
spateldisable writeback in libvirt correct?16:27
noonedeadpunkwell, it would degrade performance a lot16:27
spateldamn!! 16:28
spatelDoes the rescue image need to be password based? 16:35
spatelIt won't let you in if a password is not set on the image, correct? 16:35
jrossertry system rescue cd or something like that16:36
spatelLet me understand rescue disk is just like CD rom and boot from CD and mount disk and run fsck.. am i correct?16:36
spatelrescue cd  ? is this a special image?16:37
jrosserjust don’t try to rescue using the same image as you are rescuing, in my experience that ends badly16:37
spatelOhhh! so use other image correct? 16:37
jrosserhttps://www.system-rescue.org/16:37
spatelLet me upload new image and try16:37
spatelcan i import in openstack? 16:38
spateli meant glance16:38
jrosseryou can16:38
spateldoing it..16:38
jrosserregardless, if you rescue with another regular image it should get the same metadata as the original, so I believe your ssh key will be resent if you had one on the original vm at boot16:39
jrossersystem-rescue image is useful for when you know the ssh key is lost and there’s no password either16:40
spatel+1 16:40
spatel i should be using --disk-format iso correct to import image?16:40
jrosserspatel: i can't remember :) it was a long time ago16:46
spateljrosser for testing i boot with same ubuntu image in rescue mode and ssh in using same sshkey. How do i find disk which i should run fsck? - https://paste.opendev.org/show/bnzK3ft9GPebNpVmOh6b/16:46
spatelHow do i know that i am really in rescue mode :) 16:46
jrosserwell that is one of the fun parts with using the same image16:47
spateljrosser hmm! 16:47
spatellet me try systemrescue ISO and see how it goes 16:48
jrosseror debian or some other OS16:48
jrosserit's massively confusing about which system you are in, if you are or are not in the right system if you chroot to the "other disk"16:49
jrosserand the FSID is the same too just to make it even worse16:49
jrossermy experience previously with the FSID matching was that it is a factor in whether you get the right partition mounted at / once rescued16:50
spateltotally agreed with you. 16:52
spateli would prefer using centos for ubuntu and ubuntu for centos :)16:53
spateleasy to distinguish 16:53
spateljrosser FYI - https://help.switch.ch/engines/documentation/rescue-vm/16:57
spatelHow to import ISO to glance 16:57
spatel$ openstack image create --file systemrescuecd-6.0.1.iso --disk-format iso --container-format bare --min-ram 800 --property hw_cdrom_bus=scsi systemrescuecd16:57
jrossernot sure how old that doc is but from what it says it's from before rescue supported boot-from-volume16:59
jrosserbut anyway - if its working for you.... all good :)16:59
spatelhope.. I will keep it posted here.. 17:00
opendevreviewDamian Dąbrowski proposed openstack/openstack-ansible master: Prepare service roles for separated haproxy config  https://review.opendev.org/c/openstack/openstack-ansible/+/87118917:18
spateljrosser so rescue disk works - https://ibb.co/y84BLGY18:10
spatelwhat should i understand of it? 18:10
spatelfilesystem not recoverable? 18:11
noonedeadpunkare you really sure ceph is healthy?18:14
jrosser^ this18:14
noonedeadpunkas that does look to me as some PG is inactive18:14
noonedeadpunk`ceph health detail`?18:15
spatelnoonedeadpunk - https://paste.opendev.org/show/bnG8rvXJydADTZknd2QD/18:21
spatelhttps://paste.opendev.org/show/bRZFgPiNem03oknFJDbi/18:21
spateli am not seeing anything wrong in ceph 18:22
spatelmgr crash is acceptable because everything crashed 18:22
spatelpgs:     258 active+clean18:22
spatelIs there any command to validate ceph data or run scrubbing? 18:24
spatelAm i missing anything here?18:59
NeilHanlonjrosser: i'm going to validate some stuff but overall, i will be requesting Red Hat rebase to at least cloud-init 22.2 in RHEL 919:17
NeilHanlonif you want, i can make a copr repo with it for now..19:18
NeilHanloni might do this anyways..19:18
jrosserwe could certainly test it to see if it deals with the trouble we've had19:18
jrosserwe're looking at this sort of thing right now so it's a good time to dig a bit19:19
jrosserspatel: you don't have some trouble like MTU error when your hosts rebooted?19:36
spatelNoo19:40
spatelI did test MTU using ping -M do 192.168.1.11 -s 9000 19:41
spateljrosser does it looks ok to you? - https://paste.opendev.org/show/bWlGk4iA1uSCJ3KCwwSj/19:42
spatel9000 not pinging, but 8972 works19:42
spatelI have setup MTU 9000 on all my interfaces 19:42
NeilHanlonjrosser: https://copr.fedorainfracloud.org/coprs/neil/cloud-init/ - can `dnf copr enable neil/cloud-init`19:46
NeilHanlonsource: https://git.shrug.pw/rpms/cloud-init19:47
jrosserawesome thankyou - will give that a spin tomorrow19:47
spatelI can ping 8972 that means MTU setting is good19:52
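Editor's note: the 8972 figure follows from header arithmetic. The -s value given to ping is the ICMP payload size only, so with the don't-fragment flag set (-M do) the payload plus the IPv4 and ICMP headers has to fit within the MTU. A minimal sketch, assuming IPv4 with no IP options; the function names are hypothetical.

```python
IPV4_HEADER_BYTES = 20  # IPv4 header without options
ICMP_HEADER_BYTES = 8   # ICMP echo request header


def max_ping_payload(mtu):
    """Largest ping -s value that fits in one unfragmented IPv4 packet."""
    return mtu - IPV4_HEADER_BYTES - ICMP_HEADER_BYTES


def fits_without_fragmentation(payload, mtu):
    """True if an ICMP echo with this payload fits within the MTU."""
    return payload + IPV4_HEADER_BYTES + ICMP_HEADER_BYTES <= mtu


if __name__ == "__main__":
    print(max_ping_payload(9000))                  # 8972
    print(fits_without_fragmentation(9000, 9000))  # False
    print(fits_without_fragmentation(8972, 9000))  # True
```

So a failing ping at -s 9000 and a working one at -s 8972 is the expected result for a path whose MTU is 9000.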
