Tuesday, 2022-11-01

*** dviroel|rover|dinner is now known as dviroel|rover00:33
*** dviroel|rover is now known as dviroel|out00:36
*** ysandeep is now known as ysandeep|afk06:42
-opendevstatus- NOTICE: review.opendev.org (Gerrit) is currently down, we are working to restore service as soon as possible07:30
*** ysandeep|afk is now known as ysandeep07:51
derekokeeffe85Morning all, ran into an issue you guys helped me with already but for the life of me I can't remember what I did to fix it. https://paste.openstack.org/show/bEzHCNsWXPjByvlYIKsf/ I was sure it was log into the container and umount /var/www/repo but that's not working for me now09:26
noonedeadpunkI'm not sure about unmount. It should be mounted as otherwise you will get other repo containers broken09:33
jrosserderekokeeffe85: you are getting connection refused from the loadbalancer?09:48
jrosserderekokeeffe85: if you don't even get a 5xx code back then that suggest some fundamental networking problem09:49
jrossertry the same URL with curl to double check09:49
derekokeeffe85Maybe the umount was for a different issue so noonedeadpunk. Ok a networking error jrosser, the curl doesn't work either10:13
jrosserthe error code is important10:14
jrosserconnection refused from haproxy means one thing10:14
jrosserconnection but 5xx code means something else10:14
*** dviroel|out is now known as dviroel|rover11:28
*** ysandeep is now known as ysandeep|afk11:36
derekokeeffe85jrosser thanks. Had the wrong config on an interface :(12:12
*** ysandeep|afk is now known as ysandeep12:15
*** ysandeep is now known as ysandeep|dinner14:22
-opendevstatus- NOTICE: review.opendev.org (Gerrit) is back online14:25
noonedeadpunk#startmeeting openstack_ansible_meeting15:00
opendevmeetMeeting started Tue Nov  1 15:00:18 2022 UTC and is due to finish in 60 minutes.  The chair is noonedeadpunk. Information about MeetBot at http://wiki.debian.org/MeetBot.15:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.15:00
opendevmeetThe meeting name has been set to 'openstack_ansible_meeting'15:00
noonedeadpunk#topic rollcall15:00
noonedeadpunko/15:00
jrosserhello15:00
mgariepyhello15:00
noonedeadpunkyay, gerrit finally back -jsut in time :D15:01
noonedeadpunkwith regards time change I guess it makes sense to raise question if current meeting time is ok for you?15:02
jrosserthis is ok for me15:02
noonedeadpunkOr should I make a poll and pick up new one ?15:02
noonedeadpunkok, if everyone is fine with 15UTC - let's leave it as is15:04
mgariepyworks for me15:04
noonedeadpunkIf not - let me know and we will arrange a poll15:04
noonedeadpunk#topic office hours15:05
noonedeadpunkSo I have good progress on zookeeper role15:05
noonedeadpunkI still feel quite confused about how to have 2 repos for same purpose under opendev umbrella... 15:05
noonedeadpunkI tired to fit in what's already there and it's quite far from beeing usable by us I'd say15:06
jrosserthere is more than one deployment project so maybe we don't worry too much15:06
noonedeadpunkyeah....15:07
noonedeadpunkI planned to push patches for repo creation today after the meeting15:07
noonedeadpunkAnd also create skyline repo with that15:07
noonedeadpunkWe have quite a few patches for review15:08
*** dviroel|rover is now known as dviroel|rover|lunch15:09
* noonedeadpunk checking PTG etherpad15:10
mgariepyi'm still working on ovn ssl stuff, i'm getting closer (i think)15:11
noonedeadpunkI think we have quite good progress for things we want for Z15:12
jrosseri am doing another ironic deployment so hopefully we find any last bits there too15:13
noonedeadpunkdamiandabrowski is not around today and I'm not sure about conclusions what we should do with glance. As we've landed (or about to land) changes that will disable show_multiple_locations by default15:13
noonedeadpunkand if we still need to run 2 api servers just for show_directl_url as it was considered as lower risk from what I got15:14
jrosseryes i think that was the result15:16
jrosserjust need a tidy way to do it15:16
noonedeadpunkYeah, will try to check that once done with zookeeper15:19
noonedeadpunkoh, btw, one frustrating thing has happened during PTG. Regarding u-c and our way of filtering15:21
noonedeadpunkSo basically I was told that u-c as of today should be never used for stable branches as basically we deploy outdated software - requirements team does not manage security issues in packages that are in u-c15:22
jrosserthats always been the case i think15:22
jamesdentonhey, sorry i'm latye15:23
noonedeadpunkAnd when I asked how then openstack should be installed, as it will hardly pass without u-c they said that system packages is the only way to do that15:23
noonedeadpunkbecause we all trust distributions to maintain python bindings...15:23
noonedeadpunkOn top - filtering projects (like neutron or ceilometer) should be considered as a bug and fixed from our side15:24
jrosserbut - you cant?15:24
jrosserlike pip blows up?15:24
noonedeadpunkWhile having these project in u-c is okey.15:24
noonedeadpunkSo what they proposed - install just using u-c, and then update package on top of installed one15:25
jrosseri thought neutron is only there becasue work on neutron-lib is not complete15:25
jrosserthats the actual bug15:25
noonedeadpunkie - do installation 2 times separately, first install all requirements with constraints and then install whatever needed 15:25
noonedeadpunkThere was quite harsh argue for this topic15:25
noonedeadpunkAnd because they didn't want to change anything it all ended up in - u-c for CI only, never use on prod, use system packaged15:26
noonedeadpunkwhich got me very frustrated and confused 15:27
jrosseri can imagine15:27
jrosseranyone from kolla perspective there?15:27
noonedeadpunkby that time I guess not. But infra folks were on releases team side15:27
jrosserno comment15:27
noonedeadpunkWell, when things a bit calmed down I got suggestion to install requirements+u-c and then package from source independently15:29
noonedeadpunkwhile this can work - there's one tricky thing (at least) - if package is older then from u-c and we're building wheels - it can still be troublesome15:30
* noonedeadpunk got frustrated again after raising this topic....15:31
jrosserperhaps it's worth talking to mgoddard or someone from kolla as the problem i guess is identical for them?15:32
noonedeadpunkThey install things jsut from pip though15:33
noonedeadpunkWe quite recently got issue when running cinder-api deployed at beginning on Xena, but we added a bunch of cinder-volume from top of stable/xena (due to some bugfixes in code) and they were ignoring detach commands - nova was detaching, but cinder-volumes just ignored that. Until we had to upgrade cinder-api to same version15:34
noonedeadpunkSo I wonder if I will be suggested to enable unattended-upgrades as well to cover that issue with system packages...15:35
noonedeadpunkanyway15:35
noonedeadpunkwell, I've checked devstack and it's also filtering the same way we do. But argument was - devstack is CI only, so we can do nasty things there while osa/kolla should not do that15:36
jamesdentonwouldn't you want prod to mirror what you're doing in testing? <insert kermit meme>15:37
noonedeadpunkyeah, sure, you're right.15:37
noonedeadpunkbtw, we can drop that filtering for tempest, as tempest in not in u-c for a while now15:39
noonedeadpunkeventually, neutron also is not for Zed as of today. But not sure if it's intended or not15:39
*** ysandeep|dinner is now known as ysandeep15:40
noonedeadpunkso if it's only ceilometer that left.... ugh15:40
noonedeadpunkfix bug by dropping telemetry roles ? :D15:40
noonedeadpunkanyway15:40
jamesdenton:D15:41
noonedeadpunkI don't have anything else on agenda15:41
jamesdentoni hope to revisit the default ml2 plugin drama today or tomorrow15:41
noonedeadpunkaha, yes, good15:42
mgariepyi hope being able to fix the ovn ssl stuff today :/15:42
noonedeadpunkMy personal opinion is that we should provide some default....15:42
noonedeadpunk(and we still do in neutron role)15:42
jamesdentonthe haproxy templates need to be adjusted to account for neutron_plugin_type not being global, which in one case is for calico but i thought about adding an 'is defined' check15:42
jamesdentonyes, a default is fine. just trying to avoid stepping on the toes of lxb default deployments15:43
noonedeadpunkyeah, let's add this check and will clean up later with calico itself15:43
jamesdentonthat works15:43
noonedeadpunkI tried to take a look on calico yestarday actually and get done quite fast15:43
noonedeadpunkI think current issue at very least is that etcdgw driver can't work with modern etcd properly15:44
noonedeadpunkas it tries to check URLs for health that are not valid with latest stable etcd15:44
jamesdentonI found this, not sure how relevant it is today (from 2020): https://github.com/projectcalico/calico/issues/3015#issuecomment-57309499715:45
noonedeadpunkAnd I didn't want to dig much deeper there15:45
jamesdenton" In particular, we used to advise core_plugin = ml2 and configuring Calico as an ML2 mechanism driver, but now we prefer core_plugin = calico"15:45
noonedeadpunkSo `calico-dhcp-agent.service` fails now with not able to connect to etcd15:46
noonedeadpunkNot sure if it's related to ml2 overall...15:46
noonedeadpunkAnd we don't install neutron-metadata-agent... But yes, we still have core_plugin=ml2 for this scenario15:48
noonedeadpunkBut I'm not sure how it will solve that calico services rely on https://opendev.org/openstack/etcd3gw/commits/branch/master/etcd3gw/client.py that seems not to work with modern etcd15:49
noonedeadpunkThey even don't override DEFAULT_API_PATH.. Also I kind of hardly understand how it's supposed to be overriden when its set as constant...15:50
noonedeadpunkanyway15:50
noonedeadpunkI gave up15:50
jamesdentonwell, to be fair, even the upstream docs stop at 18.0415:51
jamesdentonso who knows how well it's been maintained15:51
jamesdentoni don't know how much it will be missed if it were deprecated and removed15:52
noonedeadpunkyeah, no idea. I know logan- was using it, but I haven't seen him for a while now...15:53
noonedeadpunkor heard15:53
noonedeadpunkso no idea15:54
jrosseri think that we should remove it15:57
jrosserthere is a big mess with calico metadata service and internal SSL as well that there is no good solution to15:58
jrosserit uses an iptables rule to forward traffic to the metadata service, rather than haproxy like neutron would normally do15:58
jamesdentonthe iptables method was the old way, even for neutron15:59
jrosserso there is no opportunity to resolve (metadata http request) -> (internal endpoint being https)15:59
noonedeadpunk#endmeeting16:02
opendevmeetMeeting ended Tue Nov  1 16:02:00 2022 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)16:02
opendevmeetMinutes:        https://meetings.opendev.org/meetings/openstack_ansible_meeting/2022/openstack_ansible_meeting.2022-11-01-15.00.html16:02
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/openstack_ansible_meeting/2022/openstack_ansible_meeting.2022-11-01-15.00.txt16:02
opendevmeetLog:            https://meetings.opendev.org/meetings/openstack_ansible_meeting/2022/openstack_ansible_meeting.2022-11-01-15.00.log.html16:02
noonedeadpunkyeah, I agree with that. Except someone really wants to step in and maintain that16:02
jamesdentonThese diagrams will need to be updated/expanded to support the LXB, OVS/OVN deployments: https://docs.openstack.org/openstack-ansible/latest/reference/architecture/container-networking.html. but i can try to highlight OVN as the prferred way forward16:03
noonedeadpunkwe should communicate though this decision first16:03
noonedeadpunkyeah.... I think we actually should also document/draw on how to use ovs bridges for lxc connectivity...16:03
jamesdentonuhhhh yeah, i've been avoiding that16:04
mgariepylol16:04
noonedeadpunkDoesn't that much easier for things like octavia?16:04
noonedeadpunkBut I think I will take that part later on16:04
noonedeadpunkHopefully we will have some deployment soon where will try this out16:05
jamesdentoni don't know, really. Our Octavia lbaas-mgmt network is routed, so it's not really a problem16:05
jamesdentoni noticed for OSP, that their lbaas-mgmt network is vxlan and there's some cute plumbing done to make that accessible from the control plane16:05
noonedeadpunkis it also routed between computes? octavia uses unicast for vrrp?16:05
johnsomYes, unicast only16:06
jamesdentonrouted, as in a vlan provider network that hangs off a firewall/router somewhee16:06
noonedeadpunkjamesdenton: and you have single or active/passive setup?16:09
jamesdentonthe amphora?16:09
noonedeadpunkyeah16:09
jamesdentonactive/passive i believe16:09
noonedeadpunkjohnsom: yeah, unicast indeed makes more sense as there're 2 amphoras tops16:09
noonedeadpunkhm... why I thought that routed setup won't work then...16:10
johnsomYeah, it was a design point to allow the lb-mgmt-net to be fully routed.16:10
noonedeadpunkmaybe I thought it's multicast...16:10
johnsomMulticast was also to unreliable in some of the ML2's when we started.16:10
jamesdentoni think for OSA CI it was problematic, which resulted in some convoluted bridging and veths16:11
johnsomYeah, that old design in OSA was... interesting.16:11
jamesdentonthat was a case of test -> prod16:11
*** dviroel|rover|lunch is now known as dviroel|rover16:12
noonedeadpunkah, you also run things on bare metal... I wonder what can go wrong with containers and fully routed setup...16:12
noonedeadpunkAs if switch our keepalived to unicast as well - everything should jsut work I guess....16:12
johnsomIt simplifies your keepalived instance IDs too. grin16:13
jamesdentonyes, we do baremetal now, but i'm not sure it matters, since the octavia worker has egress networking (even if it's SNAT'd), the routing just needs to be there16:14
jamesdenton*octavia worker in LXC16:14
noonedeadpunkjohnsom: I just not sure how unicast is good when having 3 or 5 keepalived servers16:15
noonedeadpunkas then it likely it makes less sense 16:16
johnsomIt's one packet, per instance, per heartbeat interval. I guess it just depends on how fast you want convergence. We don't do sub-second failover configs usually.16:18
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_ironic master: Allow ironic bmaas network gateway and dns servers to be undefined  https://review.opendev.org/c/openstack/openstack-ansible-os_ironic/+/86316016:42
*** ysandeep is now known as ysandeep|out17:07
mgariepyjamesdenton, do you think it would be better to deploy northd before the compute nodes  ?17:42
noonedeadpunkjrosser: do you have zuul internally? if not - can you kindly drop zuul.d directory from https://github.com/jrosser/openstack-ansible-os_skyline/blob/master/zuul.d/ as it will fail to get cloned - https://zuul.opendev.org/t/openstack/build/e2085d34f3dd447a94d58855fb36609e17:47
noonedeadpunkor well, I can push PR I guess as well17:49
jrosserah yes I probably just search/replaced one of the existing roles to make that17:52
jrossermake a PR and I can merge it when I get home17:52
jrossernoonedeadpunk: it’s merged18:21
jamesdentonmgariepy i didn't really consider the order. are you seeing issues?18:41
mgariepyif you have a lot of computes it might take quite a while for the controller to get up.18:48
mgariepyand we usualy setup the control plane before the compute service18:49
jamesdentonany approach in mind?18:50
mgariepynot 100% sure18:52
mgariepynot 100% sure it would cause issue tbh18:53
mgariepyhmm `ansible neutron_ovn_northd -m command -a "ovs-appctl -t /var/run/ovn/ovnnb_db.ctl cluster/status OVN_Northbound"`19:07
mgariepyjamesdenton, what does it give you ?19:07
jamesdentonlet's see19:09
mgariepythe bind is on port 6643 ?19:10
jamesdentonhttps://paste.opendev.org/show/baK0RyP1TjrrfqQyuEbU/19:10
jamesdentonyes, 664319:10
mgariepysomething is listening on 6641 also ?19:11
jamesdentonovsdb-server?19:11
mgariepywhat are the args ?19:13
mgariepyhttps://paste.openstack.org/show/b2zWK0WMxIrzdwqyd1cF/19:13
mgariepymine doesn't listen to the port anymore :( lol19:13
mgariepywhat a mess19:13
jamesdentonyou borked it good19:14
jamesdentonhttps://paste.opendev.org/show/b95vmTwjtmVyKCxWX3IC/19:14
mgariepyyou know when a app has 200 options with defaults that are not compatible.19:15
mgariepyit does create some edgecase lol19:15
jamesdenton:D19:15
jamesdentoni don't trust my /etc/default/ovn-central looks correct19:15
mgariepy` /usr/share/ovn/scripts/ovn-ctl --help`19:16
mgariepyi wonder if i need to set `--db-nb-sync-from-proto`19:17
mgariepyhttps://paste.openstack.org/show/bRb0pyBP48HTYttPNkcB/19:17
mgariepyha. nop ovn-sbctl set-connection pssl:664219:26
*** dviroel|rover is now known as dviroel|rover|bbl21:32

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!