Monday, 2023-02-20

soltanedareHi07:34
noonedeadpunko/07:48
damiandabrowskihey folks09:22
damiandabrowskiFYI: my separated haproxy config changes passed gating over the weekend09:22
damiandabrowskihttps://review.opendev.org/c/openstack/openstack-ansible/+/871189/09:22
damiandabrowskihttps://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/87118809:22
damiandabrowskii fixed a lot of your suggestions there and added a summary of recent changes in a comment09:23
noonedeadpunkwill try to have a look during the day or tomorrow morning at worst09:24
admin1\o10:48
*** dviroel_ is now known as dviroel12:15
damiandabrowskiso our 'linear' strategy plugin really breaks loops, i was able to reproduce it on a fresh AIO environment12:30
damiandabrowskihttps://bugs.launchpad.net/openstack-ansible/+bug/200784912:30
noonedeadpunkwell, that's quite sad12:33
jrosserhttps://github.com/openstack/openstack-ansible-plugins/blob/master/plugins/strategy/linear.py#L9712:33
noonedeadpunkIt's actually good question if it's still needed. It used to fix flaws in default linear plugin when talking about containets12:34
noonedeadpunkas it was somehow helping out with running against hosts and containers at the same time to prevent race conditions or smth like that...12:34
noonedeadpunkBut that was soooo long ago12:34
noonedeadpunkcloudnull: maybe you recall details about it?12:35
jrosserit probably needs to detect with* and loop12:37
jrosseror just be deleted entirely12:37
noonedeadpunkthat;s the commit that added strategy https://github.com/openstack/openstack-ansible-plugins/commit/cb01efef6657fce5003f099e5209b7086a0cd46912:41
noonedeadpunkSo yeah, it was added with SSH plugin and still imports it from what I see. Or well - attempts to import it, but I'm not sure if import works...12:41
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible-plugins master: [DNM] Check if everything works fine without linear plugin  https://review.opendev.org/c/openstack/openstack-ansible-plugins/+/87442514:21
noonedeadpunkdamiandabrowski: ssh connection plugin is needed for sure...14:45
noonedeadpunkas we don't start ssh inside containers with some exceptions, like keystone14:45
noonedeadpunkwell....14:45
noonedeadpunkmaybe with ssl certs we can start ssh inside lxc, but then jrosser for example will need to pass extra network to containers14:45
noonedeadpunkWhich is smth I bet he don't want to do14:46
noonedeadpunkAs containers are isolation layer and software inside lxc don't really have access to SSH14:46
noonedeadpunk*ssh network14:46
noonedeadpunkor do they?14:46
admin1so you mean we cannot do ssh util  from deploy .. but always have to login to c* and then lxc-attach ? 14:58
admin1any thoughts on moving to lxd as well ? 14:58
noonedeadpunkadmin1: as soon as it will be possible to avoid using snap with lxd....15:09
noonedeadpunkand yes, that's exactly how it works now15:09
admin1got it .. 15:23
Mohaa7Hi15:28
Mohaa7If I'm not going to use the modules which are listed after the Horizon in the setup-openstack.yml, should I comment them in the yml file?15:29
noonedeadpunkMohaa7: um, I would not change playbooks to be frank. If you want to - you can use your own playbooks of course. But the thing is - that out of all playbooks only roles that have some hosts in inventory will be ran16:00
noonedeadpunkIn other words - if you have not defined any host for trove in openstack_user_config - trove role will be ignored even playbook is included16:01
Mohaa7+1, If they are fully skipped, while not having hosts in the openstack_user_config, yeah, it's ok16:19
Mohaa7https://usercontent.irccloud-cdn.com/file/Ob7fmy4D/error-but-working.png16:20
Mohaa7My recent deployments, all are results to a working environment, but on all of them I get a failure on compute01, as you see in the above image. It's weird!16:21
Mohaa7resulted*16:21
noonedeadpunkMohaa7: um, well, you'll need to find task that's failing so that we could help you16:23
Mohaa7both on VMS and bare metals with different networks.16:23
Mohaa7It's a huge log file! It's not saying on which part there's an issue!16:24
Mohaa7Ah, I found the position of error, here: https://paste.opendev.org/show/bMLvOPRo628mi2RZ6J6c/16:27
noonedeadpunkhm, can you check if there's some error also a bit above?16:30
noonedeadpunkAs it basically says that neutron-ovn-metadata-agent service does not exist on compute01 for some reason16:31
Mohaa7it's happened twice: https://usercontent.irccloud-cdn.com/file/3C5bI5fi/image.png16:31
Mohaa7yes, I'll check now16:31
noonedeadpunkwell, it's both in handlers, and smth could happen also before handlers 16:31
noonedeadpunkhandlers will be triggered regardless16:32
Mohaa7There's another error before them: https://usercontent.irccloud-cdn.com/file/bLi8kSEG/image.png16:33
noonedeadpunkthat is interesting16:34
noonedeadpunkI can recall seing that some time ago, but can't recall about what caused that16:35
noonedeadpunkMohaa7: you have cutted screenshot in a bit unfortunate way :) Can you kindly also include output of the previous task?16:37
Mohaa7I'm checking this error on two different deployed environment, and same error on Compute01 in the logs of bothe env!16:37
noonedeadpunk` Fetch override files`?16:37
Mohaa7I didn't get it!16:38
Mohaa7what do you mean by "Fetch override files"?16:39
noonedeadpunkok, and what the content do you have in /openstack/venvs/neutron-26.0.1/etc/neutron?16:39
noonedeadpunkI kind of wonder if for some reason you don't have neutron installed on compute01 for some reason16:41
Mohaa7noonedeadpunk  /openstack/venvs/neutron-26.0.1/etc/neutron: dnsmasq-neutron.conf with these two options: `dhcp-ignore=tag:!known` and `user=neutron`16:42
noonedeadpunkhm... it should contain more....16:43
noonedeadpunkAnd is content on another compute is same?16:43
Mohaa7noonedeadpunk: lins above/under that screenshot lines: http://sprunge.us/tFeADB --> ctrl+f f or `failed`16:52
noonedeadpunkwhat a weird set of hosts btw, why compute01 is executed along with infra but not compute02...16:54
noonedeadpunkit's really weird output16:54
Mohaa7Oh, it was from infra01; Here is the list of files in `/openstack/venvs/neutron-26.0.1/etc/neutron`:16:54
Mohaa7https://usercontent.irccloud-cdn.com/file/tJAnvyX6/image.png16:54
Mohaa7Compute02: https://usercontent.irccloud-cdn.com/file/4Omkh3Bw/image.png16:55
Mohaa7some items are missed on compute01 in that path, but why?16:55
noonedeadpunkthat is really good question16:55
Mohaa7indeed, on file is missed: neutron_ovn_metadata_agent.ini16:57
Mohaa7one*16:57
noonedeadpunkI'd say it's due to https://opendev.org/openstack/openstack-ansible-os_neutron/src/branch/master/tasks/neutron_post_install.yml#L108-L115 not being executed for compute01 for some reason16:58
noonedeadpunkThat kind of smells like a bug... OR smth is off with groups definition in your envs16:59
Mohaa7I have metadata hosts set on infra nodes: https://usercontent.irccloud-cdn.com/file/a7kd0YDi/image.png17:01
noonedeadpunkSo neutron-ovn-metadata-agent should be launched only on neutron_ovn_controller group17:02
noonedeadpunkand not on computes17:02
noonedeadpunkWhich is network-gateway_hosts17:03
noonedeadpunkWell, depending on what you've defined as network-gateway_hosts ofc17:03
noonedeadpunkneutron_metadata_agent shouldn't be defined at all for OVN scenario iirc17:04
noonedeadpunkcan you share your definitions from openstack_user_config?17:04
Mohaa7has been set in this way: `network-gateway_hosts: *compute_hosts`17:04
Mohaa7yes, w8 please17:04
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible master: Update documentation for LXC/metal and LXB/OVS/OVN  https://review.opendev.org/c/openstack/openstack-ansible/+/86757717:06
Mohaa7noonedeadpunk: the config file is here: https://paste.ubuntu.com/p/Bbj7sMkXjg/17:09
noonedeadpunkMohaa7: I don't think you should define network_hosts though I'm also not sure it should cause issues like that17:15
noonedeadpunkas network_hosts is valid for ovs/lxb scenarios but not ovn17:15
Mohaa7Mohaa7: I added t because you are using it in the AIO17:16
Mohaa7it*17:16
noonedeadpunkso you should have neutron_ovn_metadata_agent on computes but should not have neutron_metadata_agent on infra17:16
noonedeadpunkyeah, but network-infra_hosts should be more the enough17:18
noonedeadpunkstill I don't think it's the root cause...17:18
noonedeadpunkI'm quite suspicios about https://opendev.org/openstack/openstack-ansible-os_neutron/src/branch/master/tasks/neutron_post_install.yml#L108-L115 as it should have ran but it's not in fact17:19
noonedeadpunkAnd likely run_once is the reason why....17:19
Mohaa7What if I run the os-neutron-install.yml again after removing network_hosts from the config file?17:20
noonedeadpunkWell, they're already in openstack_inventory.json, which then should also be carefully cleaned up.17:20
Mohaa7Oops!17:20
noonedeadpunkMohaa7: to be frank - I assume if you will jsut run `openstack-ansible playbooks/os-neutron-isntall.yml --limit compute01` - it will finish without errors17:21
noonedeadpunkIf the problem is in the place I'm thinking about17:21
Mohaa7Then, I keep `network_hosts` there and run yours to see if it creates the metadata file in `/openstack/venvs/neutron-26.0.1/etc/neutron` or not17:22
noonedeadpunkAs this part from your output looks like a bug to me https://paste.openstack.org/show/bGn3zicwfTGsES4MyHDg/17:22
noonedeadpunkyeah, let's try this out17:23
Mohaa7I also try the whole process on a new lab, this time without `network_hosts`, to know if it's the cause17:24
noonedeadpunkThat would be great if you have a place for multi-node lab as it would take time for me to spawn one17:28
Mohaa7noonedeadpunk: I run it on comute01; result: success; And now there's a `neutron_ovn_metadata_agent.ini` in /openstack/venvs/neutron-26.0.1/etc/neutron/ on compute0117:30
noonedeadpunkSo, would be great if you could submit a bug so that we won't loose it17:31
Mohaa7Sure, but let me try for a new env without network_hosts that takes ~4 hours17:32
noonedeadpunkoh, yes, totally17:32
noonedeadpunkthat is super interesting to check17:35
Mohaa7(:17:35
Mohaa7Thanks for your help17:35
Mohaa7I finally deployed OSA on bare metals (our acceptance env) successfully.17:37
Mohaa7Thank you so much to all of you in this channel17:37
noonedeadpunkwell, there's at least 1 bug you're affected with...17:43
Mohaa7yeah17:53
Mohaa7Here: https://docs.openstack.org/openstack-ansible-haproxy_server/latest/configure-haproxy.html it's said that `haproxy_keepalived_external_vip_cidr: 192.168.0.4/25` as a sample. I'm confused as someone here mentioned that it definitely should be set as `<ip>/32` regardless of the subnet. But in the link it's using /25!17:56
noonedeadpunkyes thta must be /3217:58
noonedeadpunkMohaa7: do you want to push a change?:)17:59
Mohaa7Yes, now I have some space to work on the docs.  Let me read https://docs.openstack.org/doc-contrib-guide/ at first18:02
noonedeadpunkMohaa7: In short - you need to have an account in Ubuntu One which will allow you to configure gerrit account. There you will need to fill in your nickname and upload ssh key18:05
noonedeadpunkthen ensure you have git-review plugin for git.18:05
Mohaa7+118:06
noonedeadpunkThen it's like that - clone repo, make commit, do `git review`. If you need to make a change - amend existing commit rather then adding new one (that's biggest difference with github/gitlab)18:06
noonedeadpunkAs gerrit identifies changes not based on branch, but pased on `Change-Id` that is part of the commit message18:07
noonedeadpunkIf you want to donwload some patch locally to adjust it - do `git review -d <id>`, do changes, `git add .`, `git commit --amend`, `git review`18:09
noonedeadpunkyou can add `-f` to last command if you want to delete branch that was created by downloading patch18:09
opendevreviewDamian DÄ…browski proposed openstack/openstack-ansible master: [DNM] Check if everything works fine without linear plugin  https://review.opendev.org/c/openstack/openstack-ansible/+/87448218:13
Mohaa7I do; If something in this regard is unclear to me, I will raise it here (:18:15
*** gmann is now known as gmann_afk18:37
*** gmann_afk is now known as gmann18:51
admin1the VIP can be set to anything. i have /32 in some, i have /24 in some, /22 in some 19:06
admin1i have not seen an issue on it so far 19:06
admin1but it also depends on your specific use case 19:06
jrwr_doing my first setup, putting all containers on one host, I'm getting a lockup without error at [lxc_container_create : Write default container config] for setup-hosts.yml, running with -vvv shows a OK for lxc.apparmor.profile=unconfined on manila_container and stops here every time. there are no SSH sessions active when this locks up22:03
jrwr_for debug, I've ran openstack-ansible lxc-containers-destroy.yml to try and clear out that host22:07
jrwr_removed manila and metering-compute out of user config and moved on, (I didn't want those services /anyway/)22:15
jrosserjrwr_: it's always useful to build an all-in-one as a reference to compare first attempts with https://docs.openstack.org/openstack-ansible/zed/user/aio/quickstart.html22:21
jrosserthats auto-configured with a pretty minimal set of services22:22
Mohaa7admin1: then it's meaningless to define the subnet alongside the IP in `haproxy_keepalived_external_vip_cidr: 192.168.0.4/25`. It would be only enough to set an IP itself: `haproxy_keepalived_external_vip_cidr: 192.168.0.4`, wouldn't it?22:34
*** jrwr_ is now known as jrwr22:35
Mohaa7In my case I'm choosing an IP, for example 172.20.21.22, from a range of `/24`. But I set it as `172.20.21.22/32` in the user_variables.yml file; And it works well. If it's also going to work by `172.20.21.22/24`, so what's the purpose of defining its subnet there?22:40
admin1Mohaa7, the subnet there defines how you route . if you just use 10.0.0.1 for example and it added /8 by itself, then the interface where this is added will hold the route for 10.0.0.x and your other interfaces where any 10x might not work22:48
admin1so netmask has its place there 22:48
admin1as you are adding an ip, the netmask defines the arp/broadcast domain and how linux will add that to its route 22:49

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!