Friday, 2022-12-09

opendevreviewMerged openstack/openstack-ansible-os_designate stable/yoga: Fix race condition during designate setup  https://review.opendev.org/c/openstack/openstack-ansible-os_designate/+/86701300:05
opendevreviewMerged openstack/openstack-ansible-os_neutron master: Separate OVN gateway functions from ovn-controllers  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/76064700:26
*** chandankumar is now known as chkumar|ruck04:41
noonedeadpunkhm, what's going to be installed on northd? 09:02
noonedeadpunkAs I think we don't have neutron to be installed there?09:03
jrosseri was just looking at my remianing ironic patches and something looks broken09:03
noonedeadpunkyup because of https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/86354609:04
noonedeadpunkAs we still copy neutron configs when they're not needed09:04
noonedeadpunkand smart_sources rely on it being around09:04
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_neutron master: Do not provision neutron config when not needed  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/86708709:11
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible master: Update AIO to use OVN-style provider network for Octavia  https://review.opendev.org/c/openstack/openstack-ansible/+/86705209:11
noonedeadpunkHope this will do it09:12
jrosseroh i think i saw that the image module now supports tags09:24
jrosserwe should be able to get rid of a command: here https://github.com/openstack/openstack-ansible-os_octavia/blob/master/tasks/octavia_amp_image.yml09:24
noonedeadpunkYup they should now :)09:43
noonedeadpunkI wonder wtf is OPNFV CI09:43
noonedeadpunkjrosser: TBH I still think we should create some openstack_resources role in plugins repo that can be called from roles to setup stuff, like images/flavors/networks/aggregates/etc09:46
noonedeadpunkand maybe move service_setup in there09:46
jrosseri was also thinking about trying to write a module for wheel build09:47
jrosserbut i first also wanted to look at how many tasks we run for each role in total09:48
jrosserto see where it might be worth putting work in to reduce the number of tasks09:48
noonedeadpunkUm, it's a bit different I guess? As it was unification vs reducing. As we have different code to upload images for magnum/octavia/trove/smth else09:49
noonedeadpunkOr well, I have more narrow use-case, which is represented by this patch: https://review.opendev.org/c/openstack/openstack-ansible/+/85423509:51
jrosserthe trouble with a generic resources role would be maybe introducing many many skipped tasks for all the things that are not wanted in some other role09:52
jrosseri think we suffer from this with systemd_service that has so many special things it does09:52
noonedeadpunkThe problem with systemd_service is that we don't have proper structure for it...09:52
noonedeadpunkAs we should likely be splitting these cases into different task sets09:53
noonedeadpunkso that include could be skipped if not defined09:53
noonedeadpunkSkipping inlcude is fast (I have impression it's faster then skipping task even, but haven't tested that)09:54
noonedeadpunkAlso we could do tasks_from09:54
jrosseri think we have a special case for skipping in the connection plugin too09:54
jrosserlike early bail-out in some situations09:55
noonedeadpunkSo my usecase with host aggregates is to automatically add compute from az1 to it's aggregate as part of it's setup.09:58
noonedeadpunkAnd I struggle to see good approach for that....09:58
noonedeadpunkLike add to end of os-nova-install playbook...09:59
opendevreviewMerged openstack/openstack-ansible-os_neutron master: Remove support for calico ml2 driver.  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/86612311:02
*** dviroel|afk is now known as dviroel|rover11:06
noonedeadpunkSo https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/867087 looks good according to https://review.opendev.org/c/openstack/openstack-ansible/+/867052 results11:33
noonedeadpunkI think we should figure out wtf happening in CI with keystone during tempest tests...11:39
noonedeadpunkThat's super annoying and cause a lot of CI resources waste11:39
*** frenzy_friday is now known as frenzy_friday|food12:38
opendevreviewMerged openstack/openstack-ansible-os_ironic master: Tidy definition of http dir for inspector  https://review.opendev.org/c/openstack/openstack-ansible-os_ironic/+/86613212:44
mgariepyhttps://zuul.opendev.org/t/openstack/build/d143a14c71564135bd6adfdef8af9f4a/log/logs/host/keystone-wsgi-public.service.journal-10-38-14.log.txt#15059-1508413:27
mgariepythat's probably not too good.13:27
jamesdentonmornin13:29
mgariepyhttps://zuul.opendev.org/t/openstack/build/d143a14c71564135bd6adfdef8af9f4a/log/logs/host/keystone-wsgi-public.service.journal-10-38-14.log.txt#25670-2572113:29
mgariepymorning jamesdenton 13:33
noonedeadpunko/13:38
jamesdentonhi hi hi13:39
mgariepykeystone it's not 100% clear why it went down :/13:39
mgariepyL7 down, means that the app didn't respond in time i guess.13:42
mgariepymaybe it got saturated here ? https://zuul.opendev.org/t/openstack/build/d143a14c71564135bd6adfdef8af9f4a/log/logs/etc/host/uwsgi/keystone-wsgi-public.ini.txt#1014:02
opendevreviewMarc Gariépy proposed openstack/openstack-ansible master: Increase thread/process to 2 for keystone  https://review.opendev.org/c/openstack/openstack-ansible/+/86711314:14
noonedeadpunkYeah, might be14:15
opendevreviewMarc Gariépy proposed openstack/openstack-ansible master: Update AIO to use OVN-style provider network for Octavia  https://review.opendev.org/c/openstack/openstack-ansible/+/86705214:17
noonedeadpunkI thought for some reason it doesn't matter much as Apache is main headache  but worth trying14:17
mgariepyoops.14:17
mgariepylol should have rebased lol14:17
noonedeadpunkdoesn't matter much 14:18
noonedeadpunkshould work anyway14:18
mgariepyyeah i know14:18
mgariepyit's kinda obscure why haproxy disconnect the keystone backend.14:19
mgariepybut if apache is waiting on the data from uswgi. it might explain the timeout haproxy sees.14:20
noonedeadpunkhaproxy is supposed to recheck kind of....14:20
noonedeadpunkand it's down just for a second according to log14:21
noonedeadpunkbut yeah14:21
mgariepymaybe we could set fall `2 or 3` and rise 1 ? https://zuul.opendev.org/t/openstack/build/d143a14c71564135bd6adfdef8af9f4a/log/logs/etc/host/haproxy/conf.d/keystone_service.txt#3214:23
mgariepyonce haproxy thinks the backend is dead. it disconnect it and no one can connect back14:23
noonedeadpunkoh, why in the world it's like that14:31
noonedeadpunkI haven't spotted that14:31
noonedeadpunkIMO it shouldn't be like fall 114:31
mgariepyit should fall either.14:32
mgariepyshouldn't**14:32
noonedeadpunkoh, we have fall 1 raise 1 for everything?14:33
noonedeadpunkhm14:33
noonedeadpunkWe override that in CI?14:33
noonedeadpunkas default is 314:34
noonedeadpunkhm, these 2 variables are never used https://opendev.org/openstack/openstack-ansible-haproxy_server/src/branch/master/defaults/main.yml#L25-L2614:35
noonedeadpunkwtf the logic is that https://opendev.org/openstack/openstack-ansible-haproxy_server/src/branch/master/templates/service.j2#L14214:35
noonedeadpunko_O14:36
*** dviroel|rover is now known as dviroel|rover|afk14:40
mgariepywell.14:40
mgariepydid infra job failed less on that issue ?14:40
mgariepyhrm.. https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/82989914:42
mgariepylol14:42
mgariepydo we need to clean openstack-ansible repo from calico as well ?14:42
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-haproxy_server master: Make use of haproxy_rise and haproxy_fall variables  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/86711614:44
noonedeadpunkmgariepy: yup. And there was patch for that14:45
noonedeadpunkI jsut think that we need to wait for neutron fix to land first14:45
mgariepywhere is the patch ?14:45
mgariepyi did look but didn't found it.14:46
mgariepyonly saw the os_neutron on.14:46
mgariepyone*14:46
noonedeadpunksec14:46
noonedeadpunkhttps://review.opendev.org/c/openstack/openstack-ansible/+/86611914:46
noonedeadpunkWell, it would still require recheck...14:46
*** frenzy_friday|food is now known as frenzy_friday14:50
noonedeadpunkdamn, https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/867087 is failing in gates :(14:51
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-os_neutron master: Do not provision neutron config when not needed  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/86708714:52
noonedeadpunkwith `ERROR neutron OSError: write error` huh14:53
mgariepyneutron went down on haproxy also.14:55
noonedeadpunkYeah....14:56
noonedeadpunkwait?14:56
noonedeadpunkfor what patch?14:56
mgariepythe one that fails gates.14:57
noonedeadpunkyeah, but it sounds more like underlying FS issue then haproxy or smth...14:58
noonedeadpunkas neutron-server died with write error14:58
spatelmgariepy quick question, are you using snap for container for deployment host or lxc ?15:40
noonedeadpunkhm, it would be fun to pack osa in snap package and do bootstrap in install....15:49
noonedeadpunkalso fun for auto-updates15:49
spatelI wish we can put small doc related how to spin up target node in LXD 15:51
noonedeadpunkI wish there would be option for LXD that is not snap....15:52
spatelyou can install lxc instead of lxd without snap15:52
jrosseri think that debian bookworm will have lxd5.0 as apt packages rather than snap15:52
noonedeadpunkthat is exactly what we're doing for osa? 15:53
noonedeadpunkyes, I just found that15:53
jrosserhopefully that will make its way back into ubuntu :)15:53
noonedeadpunkLike someone jsut re-build it and publish in ppa...15:53
spatelyes.. but why not snap/lxd ? are they not good?15:53
jrossersnap is really (imho) meant for client / desktoppy things15:54
jrosserand as such it auto-updates in a way you cannot prevent unless you do horrible hacks15:54
spatelouch! 15:55
noonedeadpunkI've installed thunderbird with snap jsut to get version with Matrix client inside and still regret that15:55
jrosserso to have what is essentially a complete hypervisor randomly able to update / restart itself outside of any planned maintainance is pretty gross15:55
noonedeadpunkIt asks to restart Thunderbird like weekly. But the problem is that restart and even laptop reboot didn't help to get rid of the message that restart is required :D15:56
jrosserwe do run a bunch of stuff in snap/lxd here15:56
noonedeadpunkWell, I've heard that LXD snap has merged some workaround to prevent LXC restart on snap update or smth15:56
jrosserbut we have to do a horrible hack to prevent the updates15:56
noonedeadpunkIt's actually not that horrible last time you've shown it :D15:57
noonedeadpunkbut yes, totally not the approach that should be mainstreamed 15:57
jrosserone time it updated to a version with a bug and took down a bunch of supposedly H/A things because they all updated mostly at the same time15:57
jrosserlike nameservers :(15:58
noonedeadpunkFUN15:58
noonedeadpunk(Fucked Up Nameservers)15:58
jrosseri would totally use a ppa instead of a snap if someone rebuilt the debian one15:58
noonedeadpunkI wonder how hard that would be15:59
noonedeadpunkFor centos there's already CORP that exists15:59
spatelon ubuntu i am trying to install lxc like OSA use then what package i need to install16:00
spatelby default when i did apt install lxd (it install snap :( )16:00
noonedeadpunkLol, we have "create snap package" button on our OSA launchpad page....16:01
jrosserspatel: for a deploy host its really fine16:02
noonedeadpunkdamn, installing osa with snap sounds fun in my head16:02
spatelit would be much quicker 16:02
spatelI am playing with kolla-ansible and i found kolla is very fast because image is already and all you need drop config and start container16:03
spatelwish osa has something like that where container is ready and we can drop config and start service16:03
ElDuderinosame here. kolla has been pretty cool.16:03
noonedeadpunkyou loose a lot of flexability with that approach16:04
spatelI did upgrade kolla in 10 minute Vs OSA took few hours :(16:04
noonedeadpunkalso fwiw kolla does install only released tags of openstack from pypi16:05
ElDuderinowe run OSA in prod, and I'm a huge fan of it, but kolla was pretty cool.16:05
noonedeadpunkso good luck with local forks16:05
noonedeadpunkor some extra code that you want in16:05
spatelIn that case we can create own images right?16:05
noonedeadpunkLikely?16:06
spatelYes, that is what we doing. when we need to patch neutron we do create image in docker with patch and rollout 16:06
mgariepyspatel, i use snap lxd16:07
noonedeadpunkWhy you need kolla then if you build images on your own...16:07
spatelkolla create image with software re-compiled in it so all you need spin up docker and ansible just drop neutron.conf file and done 16:07
noonedeadpunkto drop neutron.conf?16:08
spatelImage building process is one time 16:08
spatelit use overlay filesystem so kind of quick 16:08
noonedeadpunkI was thinking about building in code in lxc images, but I kind of failed to see how that would speed things up16:09
spatelI am not saying kolla is best and osa not but just saying this is what i noticed. whenever i want to rollout in kolla is super quick and fast 16:09
noonedeadpunkexcept storing them somewhere on repo_container16:09
noonedeadpunkbut that covers only 3 hosts, and osa spends most time on deployment of computes and net nodes anyway16:09
noonedeadpunkthat are still bare metal16:09
noonedeadpunklike deploying heat or magnum takes couple of minutes as well16:10
noonedeadpunkand upgrading them same time16:11
noonedeadpunkalso install from pypi is super fast....16:12
noonedeadpunkMaybe we should also get an option to jsut install from pypi....16:13
noonedeadpunkand get our releases to tags instead of SHAs16:13
spatelwhatever we can do in advance and pre-build so next time just rolling out code..16:14
spatelmgariepy how do you copy file in lxd container? 16:21
spatellike in OSA lxc we can browse folder inside /var/lib/lxc/<container/rootfs 16:21
mgariepyyou can do a lxd snapshot and send it over.16:22
mgariepybut it's not live. it doesn't update automatically either.16:23
spatelall i want copy bunch of files inside container 16:23
spateli can use scp but wonder if simple way to copy file in container with browse filesystem from host machine16:24
mgariepyyou can use the lxd client to copy files around16:26
mgariepyit won't be easier i don't think16:26
spatelthere is a way - ls -l /var/snap/lxd/common/lxd/storage-pools/lxd/containers/mycontainer/rootfs/16:27
spatelI am trying to create osa deployment node from existing on..16:35
spatelJust created lxd container and checked out repo and my production tag git checkout 23.3.016:35
spatelnow copying /etc/openstack_deploy inside my lxd container16:36
spatelthen going to run scripts/bootstrap-ansible.sh16:36
spateland copy /etc/hosts + ssh keys16:36
jrosserspatel: `lxc file push <blah>`16:38
spateldoes that work with directory also?16:38
jrosser--recursive --create-dirs apparently16:39
spatelnice16:41
jrosseryou can use cloud-init with lxd so you can use that to insert your ssh key16:45
jrosserthen just treat it like a regular server16:45
spatelI am playing with lxd to get more familiar 16:47
spateljrosser should i run scripts/bootstrap-ansible.sh or better copy all files from original deployment host like /etc/ansible directory etc.16:48
jrosserwell, if everything is done right only two things matter, the sha you check out the openstack-ansible repo to16:49
jrosserand the contents of /etc/openstack-deploy16:50
mgariepyssh-keys and maybe ansible-vault stuff.16:50
spatelvault ?16:50
jrosserlike if you patch any roles then hopefully you've already got those as forks on github or something16:51
spatelin my case i didn't do any patching 16:51
spatellet me run scripts/bootstrap-ansible.sh and see, technically it should download all roles etc.. 16:52
mgariepydepending on how you encrypt your secret with ansible-vault16:52
jrosseri think also in the past there were some files in ~root - maybe octavia CA or something16:52
jrosserbut that was an oversight16:52
mgariepyon this i need to go. i got a sick kid here..16:53
mgariepyhave a nice weekend16:53
spatelhave a good weekend!! thanks you.. its flu time so make sure nothing serious 16:53
noonedeadpunkwait, I have recalled - we have distro install for those who prefer deployment speed !16:54
noonedeadpunkas that is really fast as well as with upgrades16:55
noonedeadpunkspatel: the thing is that we pre-build wheels and rolling them out net time is way faster then building them16:57
spatelnoonedeadpunk you are correct but in kolla there is a option for source and it works as fast as distro 16:57
noonedeadpunkbut building wheels itself take quite some time - installing from pypi like kolla do is way faster16:57
noonedeadpunkit's not source - it's pypi16:57
noonedeadpunkit's huge difference in process and result16:57
spatelyes.. its pypi but they called it method: source 16:58
noonedeadpunkWhile you can do re-install anything with pre-built wheels without external connectivity - in kolla you would need to mirror pypi which is tough thing to do16:58
noonedeadpunkInstallation from pypi whould be much faster I believe....16:59
noonedeadpunkespecially if we could cache things16:59
spatelyeppp 16:59
spateljrosser i did run scripts/bootstrap-ansible.sh which download /etc/ansible/role etc.. 17:00
spatelBut when i compare size of folder its different then original. Old deployment node has 91MB size of role folder and in new deployment has 63MB 17:01
spateldoes that indicating something wrong?17:01
jrosseri don't know17:11
spatelI think its just filesystem block size issue.. 17:11
jrosserperhaps over time we deprecate some roles too17:11
spateli have compared files side by side and didn't see any difference 17:11
spatelnow running playbook to see if its going to change anything on one of my compute node or not :)17:12
noonedeadpunkI think new shallow_since does reduce amount of content that's being fetched from git17:18
spatelwhat is the tags to distribute nova keys ?17:18
noonedeadpunkon which version? as on Y it's not needed anymore17:18
spatelWallaby 17:18
noonedeadpunknova-key17:19
spatelcool but in newer version it should do itself correct17:20
noonedeadpunkyup17:21
noonedeadpunkit's ssh certs instead of rsa keys17:21
noonedeadpunkwhich are used only for offline migration....17:22
spatelyes.. we don't have ceph so using block migration 17:23
spatelis this correct command openstack-ansible os-nova-install.yml --tags nova-key --limit compute_hosts17:23
*** dviroel|rover|afk is now known as dviroel|rover18:05
noonedeadpunklooks fair18:53
noonedeadpunkspatel: there's proof it's good :D https://opendev.org/openstack/openstack-ansible/src/branch/stable/xena/scripts/add-compute.sh#L3718:54
spatelThank you!! so we have script to add compute nodes :)19:02
spateli didn't know that19:02
opendevreviewMerged openstack/openstack-ansible-os_neutron master: Do not provision neutron config when not needed  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/86708719:10
*** dviroel|rover is now known as dviroel|out21:20
*** tosky is now known as Guest152722:34
*** tosky_ is now known as tosky22:34

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!