Friday, 2022-01-21

*** mac189_ is now known as mac18905:49
*** elenalindq_ is now known as elenalindq05:52
noonedeadpunkI guess t->w should work though as well. Never tested that, but t->v worked and v->w was also quite straightforward06:46
noonedeadpunkwell, not sure if nova will like that though06:46
noonedeadpunk(can't recall if it was W or X when they force fail because of rpc version missmatch06:57
*** frenzy_friday is now known as frenzyfriday07:05
*** johnsom_ is now known as johnsom09:03
jrosserthe bump_osa patches fail on old centos-8 jobs09:06
jrosseri'm not around until tuesday next week to look at it but this topic needs to merge https://review.opendev.org/q/topic:%22osa%252Fremove-centos8%22+(status:open%20OR%20status:merged)09:07
*** melwitt is now known as Guest131209:32
opendevreviewMerged openstack/openstack-ansible master: Collect the contents of /etc/ssh during zuul job log gathering  https://review.opendev.org/c/openstack/openstack-ansible/+/82547510:33
*** melwitt is now known as Guest132010:50
opendevreviewMerged openstack/openstack-ansible master: Drop hosts file distribution tasks  https://review.opendev.org/c/openstack/openstack-ansible/+/80952110:57
*** dviroel|out is now known as dviroel11:25
*** anbanerj is now known as frenzyfriday14:41
*** dviroel is now known as dviroel|lunch14:58
*** promethe- is now known as prometheanfire16:03
*** dviroel|lunch is now known as dviroel16:15
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-tests master: Remove unused centos-8 functional job definitions  https://review.opendev.org/c/openstack/openstack-ansible-tests/+/82067416:32
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-tests master: Remove unused centos-8 functional job definitions  https://review.opendev.org/c/openstack/openstack-ansible-tests/+/82067416:35
MrClayPoleHi, I've recently moved cinder-volume to my control nodes from my compute nodes. They previously ran on metal. I've removed them from openstack_user_variables.yml but they're still in the inventory. I've attempted to use the "inventory-manage.py" but this only remove hosts but I'm looking to remove a service from a host? What's the best way to do this?16:37
noonedeadpunkMrClayPole: I don't think it's implemented now in inventory-manage.py. We've recently added only fully removing group, but not specific host from group. It's good thing to implement though...16:39
noonedeadpunkIn the meanwhile I think the only way is to manually edit /etc/openstack_deploy/openstack_inventory.json :(16:40
MrClayPoleOK thanks, I though that would be the case but just wanted to check first16:40
spatelnoonedeadpunk i have question for you. I have mounted 800T glusterfs in /var/lib/nova for shared storage for VMs but interesting thing nova placement still think you have local disk storage so showing disk size 70GB (how does nova report 800TB to placement?) 17:10
noonedeadpunkspatel: are you sure that nova user has access to that storage?17:28
spatelyes.. and i can create vm also everything works fine.. only my placement showing low disk 17:28
spatelmay be it doesn't know we have bigger storage may be... 17:28
noonedeadpunkhm, I was absolutely sure it does smth like getting size of mountpoint for  /var/lib/nova... But I think I need to find code to verify that17:29
spatelnoonedeadpunk damn it i know what is going on again :)17:30
spatelmy bad.. 17:30
noonedeadpunkhuh?17:31
spatelmy mount point issue its not mounted on couple of servers and by mistake i pick that server to verify :) 17:32
spateli can see proper reporting on other compute which has proper mounted filesystem 17:32
noonedeadpunkoh, well :)17:33
spateli think i need to write script to verify if mount point failed to mount then don't start nova service because otherwise it will use local disk to start VM17:33
noonedeadpunkwell, if you'd use systemd-mount, then you can apply nova init overrides and make it dependant on mount17:34
noonedeadpunkat least we did that to mount cephfs to add space for huge images that are being converted17:36
spateli am using /etc/fstab but i think you are correct i should move it to systemd-mount and make it dependent for nova17:36
noonedeadpunkand we have https://opendev.org/openstack/ansible-role-systemd_mount/ ;)17:37
spatelsweeeeet! i will use same 17:38
spatelnoonedeadpunk other question, i am building two new cloud in remote datacenter so is Xena is ready for production?17:39
spateli want to upgrade wallaby -> xena in other datacenter soon 17:39
noonedeadpunkI'd wait for 24.0.1 for sure which was should have been already released with https://review.opendev.org/c/openstack/openstack-ansible/+/82539117:40
noonedeadpunkbut we need to drop ci for centos-8 now to get this merged17:41
spatelsweet! i have 1 month in hand because currently we are buying servers and racking up so it will take little time so hope by the time its ready to rollout17:41
noonedeadpunkwe already upgraded w->x and it was straigforward, but we pulled some patches from 24.0.1 manually17:41
noonedeadpunkwait, we upgraded v->x :P17:41
spatellol17:42
spatelnow i have 100G nic and trying to run iperf3 to test but only able to hit 40G.. damn it. 17:46
spatelnoonedeadpunk did you work on infiniband ?17:50
spateli need some help to understand 17:50
noonedeadpunkI did one day17:57
noonedeadpunkIt brought really tons of troubles though... So using same mellanox cards in ethernet mode would be so much easier17:58
noonedeadpunkunless you need rdma ofc17:58
noonedeadpunkWhat I can recall that switching to connected mode improved IPoIB performance a lot comparing to default datagram18:00
noonedeadpunkI believe mostly because of MTU that is 4k tops in datagram18:01
noonedeadpunkbut well, we had only 60g back then I believe18:02
noonedeadpunk(via ConnectX-3 Pro)18:02
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/victoria: Remove CI jobs for centos-8  https://review.opendev.org/c/openstack/openstack-ansible/+/82457018:05
spatelnoonedeadpunk This cloud is for HPC use and RDMA will give better performance compare to IPoIB 18:15
spatelTell me if i am wrong here, my plan is to configure SRIOV on infiniband nic and attach vf to vm (vm can see rdma nic) 18:16
spateli didn't find any good document related how to build HPC on openstack (all i saw big high level concept but not good config based document) 18:17
spatelmost of people running HPC using ironic but in my case i want to use VMs 18:18
noonedeadpunkI never had experience with HPC. Regarding rdma - yes, but it kind of require software that able to work with it18:19
noonedeadpunksriov works nicely there though18:19
noonedeadpunkBut what you can't do with ib interfaces - make bridge of them. it's smth to keep in mind. You won't do bridges though if looking for rdma18:20
noonedeadpunkbut we used infiniband for storage and on computes only18:21
noonedeadpunkso sriov was applicable only on controllers when we needed to pass storage to containers (as sriov is  the simplest way considering bridges or mac-vlan not an option).18:23
mgariepythe last ones that i knew were using IB on openstack did switch all the network to ethernet because at some point melanox was not providing patches for the ib in neutron.18:23
noonedeadpunkand ceph with rdma was in experimental state only, so it was all a mess18:23
noonedeadpunkfor neutron we were using built-in 10G :p18:24
mgariepylol nice :D haha18:24
noonedeadpunkbut yes, IB was kind of pita if aim to use IPoIB18:24
mgariepythey were using the ib network directly but they deployed with some patches on top and then support dropped ..18:25
noonedeadpunkdrivers were always failing to build against new kernels, this and that on every step, some weird lags from time to time...18:25
noonedeadpunkcards could get lost after reboot so power off/on was required for compute to find that... Likely it was partially because of rubbish hardware we used...18:27
spatelIn my case we want to run MPI job on infiniband (no IP stuff) 18:29
spatelfor MPI job i would like to expose raw nic to vm via SRIOV. 18:30
noonedeadpunkthen I'd say it likely proper solution, as performance there should be great indeed18:30
spatelI have configured mallanox switch for SM (subnet manager) 18:30
spatelnow trying to understand how to map pkay (partition) to make with vlan etc.. that part i am having hard time to understand 18:31
noonedeadpunkoh, SM was other PITA lol18:31
spatelpkey*18:31
noonedeadpunkwell, it's jsut hex number18:32
spatelyes and how to map them with vlan etc18:33
spatelfor multi-tenancy 18:33
noonedeadpunkthis pkey _is_ vlan18:33
noonedeadpunkbut iirc everybody can use it kind of18:34
noonedeadpunkor well, on SM you define auth key. And every VM with that auth key can get all pkeys managed by SM18:34
spatelhmm18:35
spatelwhat is the use of pkey ?18:35
noonedeadpunkalso you must somehow ensure that SM is not ran on any instance that has IB connected as they might conflict18:35
noonedeadpunkI think logical traffic separation?18:35
spatelI am running SM on Mallanox switch 18:35
spatelib sm enabled 18:36
noonedeadpunkBut nobody can stop from starting SM anywhere where OFED is installed :p18:36
noonedeadpunkthey kind of run in HA then, but it's all messy a bit...18:36
spatelI have started standalone SM so i don't think it will allow anyone to join cluster18:37
spateli am not running in HA mode18:37
noonedeadpunkAnd you have generated a uniqe key for it?18:38
spatelall i did on switch is > ib sm enabled 18:38
spateli have no idea about generating key etc..18:39
spatelthere was other command also called > ib sm-ha  (i have only single switch so i thought i don't need HA )18:39
noonedeadpunkthen I guess it;'s default which means if somebody do `systemctl start opensm` it will likely join cluster with switch18:39
spateli will give it a try and see if its impacting anything or not (This is not in production so i can mess with it)18:40
spatelwhere this pkey will get use? 18:41
noonedeadpunkso pkey is a replacement for vlans. So what we did we were defining partition `Vxlan=0x87d2,ipoib: ALL=full;` and we had an isolated network and interface ib1.87d218:42
spatelohhhh18:43
noonedeadpunkmgariepy: so for neturon vxlans we kind of used IB18:43
noonedeadpunkbut to spawn interface I think either some service restart or echo to /sys/class/net/ib1/create_child was required18:44
spatelin my case if i create vm foo then how do vm know which pkey or partition i should be ?18:44
mgariepyhuh ok so it was all supported well and integrated in neutron ?18:44
spatelor its part of neutron-agent job?18:44
mgariepy;p18:45
noonedeadpunknah... for vxlan all you need is IP on compute, and net node right?:)18:45
noonedeadpunkas for ml2 you jsut define IP thorugh which traffic would be incapsulated18:45
noonedeadpunkand we assigned IP to IB 18:46
mgariepyif using ipoverib maybe ;) not sure mpi would work great on that tho.18:46
spatelYes in IPoIB case that will work but in my case i don't want to configure IP on ib0 interface. i want to use RDMA 18:46
noonedeadpunkspatel: have no idea. can't imagine how to do that18:46
noonedeadpunkall sr-iov devices would come with default pkey18:47
spatellet me mess with config and see. if solution is IPoIB then i should go with that 18:47
spatelhmm! do i need to create pkey on SM or it will get auto generate if i don't do anything 18:48
noonedeadpunkyou need to define pkeys on SM18:48
spatelok... let me do it.. (if all work out then i will blog that shit out because i didn't find any good document on internet :( )18:49
noonedeadpunkbut you for sure should define sm_key somewhere....18:50
noonedeadpunkand use connected mode :)18:50
spatelI will take a look at also about security status 18:50
spateldon't worry i will keep poking you until it all work out :)18:51
noonedeadpunk`This option specifies the SM's SM_Key (64 bits). This will effect SM authentication. Note that OpenSM version 3.2.1 and below used the default value '1' in a host byte order, it is fixed now but you may need this option to interoperate with old OpenSM running on a little endian machine.`18:51
noonedeadpunkso maybe not needed now, dunno....18:52
spatelIn my case i have Mallanox switch running SM so assuming they took care of security 18:52
noonedeadpunkbut I have some feeling, that pkey configuration would need to be done inside VM then...18:52
noonedeadpunkas you jsut passthrough device, which at that point knows nothing about pkeys18:53
noonedeadpunkI really have doubts they had any multi-tenancy in mind18:53
spatelthat is where i am confused where should i put pkey ? 18:53
spateli think i should ask this question to mallanox mailing list or support 18:54
noonedeadpunkin /etc/opensm/partitions.conf18:54
mgariepyyou also need a network manager for IB i think.18:54
noonedeadpunkifupdown handles IB nicely18:55
noonedeadpunkdoesn't work at all with netplan though18:55
spatelthere is a command on switch > ib sm virt enable18:55
spatelmay be it will run SM per partition 18:55
mgariepynot network/ subnet manager 18:55
mgariepyho. sm (subnet manager)18:56
spatel+118:56
noonedeadpunkyeah, so why I'm a bit confused now as I realized that we ran SM on ceph mons at first for HA. And only afterwards moved to switch18:57
spatelone more question currently i am using CentOS 8s based driver for all nic and they works fine. do i need to install MLNX OFED based driver? 18:57
noonedeadpunkSo maybe, jsut creating child is enough18:57
noonedeadpunkWell it contains tons of software you will likely need to have18:57
spatelonly problem is i have to compile etc.. (it doesn't have RPM )18:58
noonedeadpunkbut watch out - on ubuntu there's cross dependency for IB drivers and ceph. So each time we were upgrading ofed - it was dropping all ceph packages18:58
spateli was thinking i can try with OFED and if it work then i don't need to install compile lots of stuff18:58
noonedeadpunkmgariepy: btw, can you please review https://review.opendev.org/q/topic:"osa%252Fremove-centos8"+(status:open) ?18:59
spatelBRB19:01
mgariepynoonedeadpunk, looking19:01
mgariepythe link doens't work tho for ${REASON}19:02
noonedeadpunkbecause of quotes I guess... copy/paste should work though19:13
*** Guest1320 is now known as melwitt19:18
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/xena: Remove CI jobs for centos-8  https://review.opendev.org/c/openstack/openstack-ansible/+/82456719:58
*** dviroel is now known as dviroel|out20:53
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-tests master: Remove unused centos-8 functional job definitions  https://review.opendev.org/c/openstack/openstack-ansible-tests/+/82067420:58
noonedeadpunkseems https://review.opendev.org/c/openstack/ansible-role-qdrouterd/+/824537 is going to pass now :)21:06
mgariepyhow comes it's not updated here via opendevreview when you push a new patch ?21:10
krumelmonsterjrosser: Should we move the conversation here?21:53
krumelmonsterI asked in #openstack how I'd configure letsencrypt/certbot for horizon21:54
jrosserkrumelmonster: if i understand correctly, you’d set this to your fqdn rather than the ip https://github.com/openstack/openstack-ansible/blob/master/etc/openstack_deploy/openstack_user_config.yml.example#L11521:57
jrosserultimately it ends up here https://github.com/openstack/openstack-ansible-haproxy_server/blob/master/tasks/haproxy_ssl_letsencrypt.yml#L7822:00
jrosserthose two bits of config are tied together here https://opendev.org/openstack/openstack-ansible-haproxy_server/src/branch/master/defaults/main.yml#L22422:03
jrosserto deploy that, re-run the haproxy playbook22:04
jrosseronce that is done and working, the other thing that will be wrong is the service catalog entries which will still point to your old ip rather than the fqdn22:08
opendevreviewMerged openstack/openstack-ansible-tests master: Remove unused centos-8 functional job definitions  https://review.opendev.org/c/openstack/openstack-ansible-tests/+/82067422:25
-opendevstatus- NOTICE: The Gerrit service on review.opendev.org is being restarted briefly to apply a bugfix23:01
krumelmonsterjrosser: It didn't work on the first tries, I'll look into it further tomorrow. Thank you for your help.23:50

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!