Thursday, 2022-12-22

*** chandankumar is now known as chkumar|rover05:27
jrossermorning09:16
noonedeadpunko/09:23
noonedeadpunkso... mirrors are still desynced...09:57
jrosserodd that noone else has trouble09:59
noonedeadpunkand I know why10:05
noonedeadpunkcheck this devstack job, that succeeded https://zuul.opendev.org/t/openstack/build/1b37b0e31dd14aa398a71377ee27eb4b/log/job-output.txt#226410:06
noonedeadpunkAnd I'm not sure how much projects event test against centos10:07
noonedeadpunks/event/even10:07
*** dviroel|out is now known as dviroel10:48
admin1if i create a port-binding ( sr-iov) ..  and then add a new sr-iov host, can that port be used.. or i have to create a new port when a new host is added ? 10:48
jrossernoonedeadpunk: at some point maybe we make Rocky the first class RHEL-alike distro and make centos experimental10:57
noonedeadpunkwell, I think we even said that during PTG10:57
jrosserbut without running the jobs we will break conditionals for sure10:58
jrosserperhaps this would be a good thing to look at early this cycle10:58
jrossertrying to get rid of anything centos specific so that test on Rocky cover mostly both10:58
noonedeadpunkJust check these 2 https://opendev.org/opendev/system-config/commit/e76e0089d1c4a7fe781027feb7977ae874899443 and https://opendev.org/opendev/system-config/commit/cc2dd16d3a7194a4185ad6e1da854cb4fde01b1c10:59
jrossergrrr10:59
noonedeadpunkI think, that for Rocky there're no infra mirrors btw11:00
noonedeadpunkyeah, https://zuul.opendev.org/t/openstack/build/1cc5dd9cb91c4d7eb17512207cce15c7/log/logs/etc/host/yum.repos.d/rocky.repo.txt11:00
noonedeadpunkso not sure how good is that ^11:01
noonedeadpunkshould we propose yet another revert on the latest revert? :D11:02
admin1in case of using sriov agent , when does privsep daemon starts ?  .. is it after a vm has initiated, or always running  ? .. i have 1 node where its running and works fine, in another node, vms fail to deploy and the diff is in working one, privsep process is also running.. in non working one, only sr-iov agent is running .... 12:31
admin1just checking if anyone else is also running sr-iov 12:31
opendevreviewJames Denton proposed openstack/openstack-ansible-os_neutron master: Create separate lock path for neutron-ovn-metadata-agent  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/86841512:53
opendevreviewJames Denton proposed openstack/openstack-ansible master: Start 2023.1 Antelope development  https://review.opendev.org/c/openstack/openstack-ansible/+/86795412:58
noonedeadpunkjamesdenton: fwiw I've narrowed down the patch that breaks things13:50
noonedeadpunkah. But I see you found how to fix that13:50
noonedeadpunkAs things become failing on https://review.opendev.org/c/openstack/neutron/+/86164913:51
noonedeadpunkBut I'm not sure if that patch actually helps....13:57
noonedeadpunkjamesdenton: yeah, it looks like you're right14:07
spatelOVN driver for octavia is so awesome! :)14:17
spatelatleast for basic use14:17
spatelI am using it with kolla but i going to try it out with OSA and blog it - https://satishdotpatel.github.io/openstack-kolla-deploy-octavia-ovn/14:17
noonedeadpunkyeah. we should add ovn support to octavia14:22
noonedeadpunkshouldn't be too tricky tbh14:22
spatelNo.. it just work out of box if you have OVN deployment 14:29
spatelI will get that going in my charisma break.. 14:29
noonedeadpunkwell, I think you only need to install octavia "plugin" for that14:30
spatelyep14:40
spatelnoonedeadpunk what do you think about multi-region deployment of OSA? 14:47
spatelHow do you guys deploying multiple openstack and linking them with single keystone? 14:47
noonedeadpunkI'm currently not. But I think it should be quite posisble. Though would suggest some keystone federation. And likely not keystone to keystone...14:48
noonedeadpunkBut maybe some ldap or smth14:49
spatelcurrently i am using LDAP for all my deployment (single identity) 14:49
noonedeadpunkAs if you have a keystone just in one region and others do connect to to this keystone - it's bad idea14:49
spatelI was thinking how about single horizon/keystone style deployment where end user select region from GUI14:50
noonedeadpunkThen I don't think there's any issue with that. Well, except you'de need to mess up with env.d a bit14:50
spatelI am just exploring all option that how public cloud company manager this kind of deployment. 14:50
noonedeadpunkEventually for that usecase - you don't even need to have multiregion14:50
noonedeadpunkYou can have 2 standalone regions14:50
noonedeadpunkAnd just configure horizon accordingly14:51
spatelhmm.. without shared keystone? 14:51
noonedeadpunkI did that couple of times14:51
noonedeadpunkyup14:51
spatelhow.. how?14:51
noonedeadpunkFor horizon it doesn't matter14:51
spatelbut horizon get endpoint from keystone right?14:51
noonedeadpunkUm, kind of, but you can explicitly provide independant keystones as a regions14:52
noonedeadpunkhttps://opendev.org/openstack/openstack-ansible-os_horizon/src/branch/master/defaults/main.yml#L259-L26114:52
noonedeadpunkSo you still will have a drop-down there with selection of regions, but it will connect to 2 independant keystones based on the selection14:53
noonedeadpunkNasty part, is that user credentials will be different if there's no federation14:53
noonedeadpunkBut if it's ldap - meh, do that14:53
spatelvery interesting.. 14:54
spatelHow about federation? how its different compare to shared keystone? 14:55
noonedeadpunkFranqly, once folks also did some nasty hooks to horizon, to make switching between regions even nicer, as with such setup when you switch regions you will be redirected to the login page (as you've not auth to another keystone).14:55
noonedeadpunkWell, I think shared keystone means that if link between regions goes down - you have problems14:55
spatelBut you can have multi-master replication to avoid that :) but anyway upgrade etc would be tricky.. 15:00
spatelshared is terrible in anyway. 15:01
noonedeadpunkwell, if you're going to have 3 regions - then might be. But then maybe you want AZs, not regions?15:14
spatelAZ is just logical grouping of servers/racks etc.. correct15:16
noonedeadpunkwell, depends15:16
noonedeadpunkbut regions are as well, kind of?15:16
noonedeadpunkbut yes, you also have services per region15:16
noonedeadpunkand AZs share control plane15:17
jamesdenton_o/15:17
jamesdenton_good day, all15:17
noonedeadpunko/15:17
jamesdenton_you may not believe it, but when the children are home off school, the days tend to start... 2 hours earlier15:17
noonedeadpunkspatel: but if you bother yourself with stretching keystone, why not to stretch glance api and re-use same images15:18
noonedeadpunkjamesdenton_: oh, well, I do15:18
noonedeadpunkdespite mine is too young, but when it's weekend he tends to wake up even earlier then during working days15:19
jamesdenton_there's no sleeping in around here15:19
spatelnoonedeadpunk we only use single image (single OS i meant) in glance :) 15:20
spatelWe don't give option to end users to pick images.. haha.. 15:21
spateljamesdenton_ did you use HP virtual connect or any experience?15:21
jamesdenton_no experience15:22
spatelnp15:24
jamesdenton_oh spatel, you had a question yesterday that i missed. but i don't have the scrollback. do you recall?15:24
spatelI have OVN lab and it has eth1:physnet1 which i am using for flat networking, Now i want VLAN base provider so trying to use eth2 nic so how do i tell ovn use eth2:physnet2 for vlans ?15:25
jamesdenton_ok15:26
spatelwhere should i add eth2 and map it in ovn for physnet2 ?15:26
jamesdenton_how are you defining provider networks, in openstack_user_config.yml?15:26
spateli don't have access of lab at present but you are saying just add in openstack_user_config and OSA will do all magic? 15:27
spatelhow does OSA map physnetX ?15:27
spateli know in legacy way you can map entry in ml2 like eth1:physnet1,eth2:physnet2 etc..15:28
jamesdenton_yes15:28
jamesdenton_one sec15:28
spateldoes same way it apply to OVN :)15:28
jamesdenton_https://paste.openstack.org/show/bj0tyuNGfbtDvwnhUKas/15:29
jamesdenton_playbooks will create the ovs bridges and connect network_interface15:29
jamesdenton_and create the mapping and apply to ml2 config and ovn15:29
spatelNiceee! 15:34
spatelDo we have this in OSA networking example :) 15:35
jamesdenton_i think so15:35
spatelwhat is this group for ? neutron-ovn-gateway ?15:35
jamesdenton_definitely here: https://docs.openstack.org/openstack-ansible-os_neutron/latest/app-openvswitch.html, for OVS15:36
jamesdenton_but it applies to LXB and OVN15:36
spatelit will make chassis gateway but which node? assuming network node but in ovn there is no network node correct? 15:36
jamesdenton_so, there's a patch in master/zed that splits out gateway from controller15:36
jamesdenton_so you can have ovn-controllers (computes) and ovn-controller+ovn-gateway (dedicated network nodes)15:37
jamesdenton_for example15:37
jamesdenton_since the initial OVN implementation assumed all computes would also be gateways15:37
spatelyes that is what i am thinking.. all computes are network node..15:38
jamesdenton_they still can be15:38
jamesdenton_but now they don't have to be15:38
spatelIn short if you want dedicated network node then just install ovn-controller and set gateway flag15:38
jamesdenton_so your grouping would be neutron-ovn-controller instead15:38
jamesdenton_yeah, i don't recall how to trigger it, i think there's a new group15:38
spateldamn.. we need more docs :)15:39
jamesdenton_some liberties were taken when we pulled OVN out of experimental status. And the consensus was... spatel is the only one using this15:39
jamesdenton_so, let us know how it goes.15:39
jamesdenton_:D15:39
spatelI am using ovn in small production environment until i get some expertise. so far i didn't hit any issue.. 15:40
jamesdenton_awesome15:41
spatelI have deployed OVN in some of my customer end with 30 to 40 node deployment and still no issue.. 15:41
jamesdenton_how's metadata?15:41
jamesdenton_i think i saw your name on a bug15:41
spatelYes.. i had and issue where meta service required restart and i didn't see any solution so i am doing cron restart (daily) 15:43
spatelBut in newer deployment i haven't see any issue related metadata.. 15:44
spatelI believe they fixed issue after wallaby release15:44
jamesdenton_oh cool. i have a deployment building now and need to check15:44
jamesdenton_it was not working in my AIO earlier15:44
spatelHmm! really.. i had no issue recently with meta15:45
*** dviroel is now known as dviroel|afk15:46
spatelI am building my home lab on Intel NUCs 15:48
spatelI have 10 Intel NUC so it would be good for any kind of testing15:49
jamesdenton_nice, i've put together some stuff to do a quick multi-node, but it does require an openstack undercloud15:50
spatelnoonedeadpunk  what do you mean openstack undercloud16:35
jamesdenton_meaning my process deploys multi-node openstack cluster using VMs16:36
jamesdenton_so i use my regular openstack cloud (lab) to deploy like... 8-9 instances (2x haproxy, 3x controller, 3x compute, 1x deploy)16:37
jamesdenton_terraform+ansible16:37
spatelopenstack on openstack :)16:41
spatelHow many nodes you have in home lab (physical nodes ) running openstack?16:42
jamesdenton_3x controllers on ESX and 3x metal computes16:43
spatelI have 2 HP gen9 servers which and they are running VMware Host to create any kind of lab 16:43
jamesdenton_these are Gen9 as well, E5-2680 i think16:43
jamesdenton_v416:43
jamesdenton_they're just loaded ewith ram16:44
spatelHow much electricity bills you are paying for those hardware? 16:44
jamesdenton_https://github.com/busterswt/mnaiov2, and inspired by https://github.com/openstack/openstack-ansible-ops/tree/master/multi-node-aio16:44
jamesdenton_oh, i dunno... napkin math a while back was like, 50-70/mo if i had to guess, but likely lower16:45
spatelNice!! MNAIO :)16:45
spatelI am waiting for Tesla solar panel after that my home will be all using green energy then i will run datacenter in my basement 24x7 16:46
jamesdenton_so, i have them hooked up to some Emporia plugs, and it's showing about $30 as of 22-Dec, so maybe 40 by the EOM16:46
spatelEmporia plugs .. i need to look into that16:47
spatelWhat do you use for cooling in summer?16:48
jamesdenton_i think they're called emporia smart plugs16:49
jamesdenton_i don't do anything in particular... the gear is sitting in a large upstairs room in the cab, and isn't super loud and doesn't run too hot. it gets warm in that room but not too hot. ambient is maybe 85 in the summer w/ the servers on16:50
jamesdenton_running that way for a couple of years now, no problems16:50
spatelHmm! 16:52
*** dviroel|afk is now known as dviroel17:09
moha7Hi there; During deploying `repo-install.yml`, I get 3 mounting errors  like this: 17:25
moha7fatal: [infra1_repo_container-b253b706]: FAILED! => {"changed": false, "cmd": "systemctl reload-or-restart $(systemd-escape -p --suffix=\"mount\" \"/var/www/repo\")"17:25
moha7but it filly is finished witout any failed task! --> ok=66   changed=3    unreachable=0    failed=0    skipped=24   rescued=1 17:26
moha7The playbook output: https://p.teknik.io/aIJDc17:26
moha7finally*17:28
moha7I think it leades to: infra1_galera_container-c57c8a02 : ok=10   changed=0    unreachable=0    failed=117:33
moha7Running `setup-infrastructure.yml` exits with this error: https://p.teknik.io/xuK1H17:35
moha7`openstack-ansible galera-install.yml -vvv` --> https://p.teknik.io/AIlqO17:39
jrossermoha7: see it says "rescued=1"17:41
jrosseryou can read the explanation of that here https://opendev.org/openstack/ansible-role-systemd_mount/src/branch/master/tasks/systemd_mounts.yml#L75-L10417:42
moha7Ah, so it's expected and not the root cause of that Galera error17:44
jrosseryes its completely expected17:55
jrosserit is an unfortunate way for ansible to report the tasks as both failed and rescued17:55
jrosserwhich means the ultimate outcome was success17:55
jrossermoha7: i think that the galera playbook will be the first one that tries to adjust the loadbalancer state17:57
jrossermoha7: i would guess that haproxy is not running and the socket that the `haproxy` ansible module tries to connect to is not present18:00
jrossercheck your haproxy service and the log to see what is happening18:00
moha7I had som error in haproxy, so I run it manually first to fix it; here is my user_variables.yml file: http://ix.io/4ji5hmm 18:03
moha7I'm going to revert the VMs to fresh state and run `setup-infrastructure.yml` again.I18:03
moha7some errors*18:03
jrosserwell, you need to get it to a state where haproxy comes up cleanly with the playbookes18:04
jrosserand you can also run all of the playbooks individually like you did with the galera one18:05
moha7The correct link to `user_variables.yml`: https://p.teknik.io/ps60L18:05
jrosserit would be just as good to re-run the haproxy role and fix up whatever variables you need to18:05
jrosserreverting the VM many times is not especially helpful18:05
moha7As I use an official signed certificate, haproxy now starts well after adding `haproxy_ssl_self_signed_regen: true` to the user_variables file18:06
jrosserhaproxy_keepalived_external_interface / haproxy_keepalived_internal_interface look unusual18:06
jrosserthis is "internal and external virtual IP for the loadbalancer"18:07
jrosserinternal is usually on the mgmt network18:07
jrosserexternal is usually on some outward facing network, or also another IP on the mgmt network depending if you want that to be accessible by your end users18:07
jrosser(you most likley dont)18:08
jrosserthe mgmt network i mean, thats ideally private if you care about security of your deployment18:08
moha7Oops, I'll move it to mgmt range18:08
jrosserhaproxy_ssl_self_signed_regen is not really related to using a official certificate18:09
jrosserit might be a bug you found though if you started with an OSA self-signed one and wanted to change to an externally provided one, we may have some error there in switching?18:10
jrosserfor a production deployment you might have some extra vlan that is on your controllers that the external endpoint is in, one that is accessible by your users18:11
jrosserbut one that is different from the mgmt network18:11
jrosserconsider how you would build a deployment where the external IP was on the internet, you would not want this to be the mgmt network18:11
moha7you mean admin and interl endpoints on vlan1, and the external endpoint on the other on, vlan2? Then Where I should define this setting?18:17
moha7internal*18:17
jrosserthe host networking is entirely up to you18:18
jrosseropenstack-ansible does not do this18:18
jrossersingle interface / bond / whatever......18:19
moha7for a provided certificate,  Does it need to any option in the user_variable file, other than these: haproxy_user_ssl_cert, haproxy_user_ssl_key, haproxy_user_ssl_ca_cert ?18:19
jrosserno that should be fine18:20
jrosserit is needed to do some thinking about how you want the network to be18:20
jrosser"my users are in vlan x / subnet y", "my openstack mgmt network is firewalled from that and on vlan z"18:21
moha7You're right18:21
moha7Going to deploy again with above updates; Thanks18:21
jrosseror "i don't care about security and everything can talk to everything else"18:21
jrosseronly you can decide that and setup the host networking as you need18:21
moha7I'm working on a Lab; But it will be deployed into a production env. in 3 next months18:22
jrosserright - so i would recomment making your lab as structurally representative of your production environment as you can18:22
jrossersame separation of trust / access even if you have to make that artifical in the lab18:23
jrosserthen there are less surprises later18:24
moha7+118:24
jrosserbut i would always recommend starting with the "all-in-one" deployment just so that you have something simple and easy as a reference point18:24
jrosserthen if your multinode lab deployment breaks you can go poke around the AIO to see how things should have turned out18:25
*** dviroel is now known as dviroel|out20:15
spatelis ceilometer still use to collect metrics from openstack components?20:36
*** ianw is now known as ianw_pto23:43

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!