Wednesday, 2025-04-23

opendevreviewYu Zou proposed openstack/ironic master: feat: add verify ca conf support for drivers  https://review.opendev.org/c/openstack/ironic/+/94754402:49
JayFI got what I needed, thanks03:00
opendevreviewAmarachi Ordor proposed openstack/bifrost master: docs: updated test guide, improve how-to clarity, add troubleshooting tips  https://review.opendev.org/c/openstack/bifrost/+/94695303:35
arne_wiebalckGood morning, Ironic!06:36
rpittaugood morning ironic! o/07:13
abongaleGood morning ironic!08:15
freemanboss[m]Good morning abongale: 08:17
abongaleGood morning freemanboss[m] 08:18
queensly[m]Good morning 08:36
opendevreviewMerged openstack/ironic master: Update some docs  https://review.opendev.org/c/openstack/ironic/+/94788011:12
iurygregoryomg arne_wiebalck is around =P 11:52
iurygregorygood morning ironic o/11:52
arne_wiebalckGood morning iurygregory o/12:02
dtantsurcid: https://review.opendev.org/c/openstack/ironic-python-agent/+/946091 passed most integration jobs already, yay!12:44
dtantsurfor the next step, please update https://review.opendev.org/c/openstack/requirements/+/947727 per Sean's request (include answers to questions in the commit message)12:45
dtantsurthen https://review.opendev.org/c/openstack/ironic-python-agent/+/946091 needs a release note to warn operators that something significant has changed + we can merge https://review.opendev.org/c/openstack/ironic-python-agent/+/947744/ into it12:46
ciddtantsur: And it has passed all \o/13:09
dtantsur\o/13:23
TheJuliagood morning13:30
JayFcid: dtantsur: One other thing we need to remember to do is to check and see if we need to do anything explicit to pick an oslo.$anything back end that's not enventlet based. Even if that's happening automatically, we need to raise the requirements to only require Oslo versions that don't use eventlet13:32
dtantsurfair point13:37
cid++13:43
opendevreviewVerification of a change to openstack/ironic master failed: Add several NV GPU types  https://review.opendev.org/c/openstack/ironic/+/94721315:22
opendevreviewKaifeng Wang proposed openstack/ironic-specs master: Flow control on PXE related port requests  https://review.opendev.org/c/openstack/ironic-specs/+/94796115:46
opendevreviewMerged openstack/ironic master: Trivial: Fix spelling issue in configuration desc  https://review.opendev.org/c/openstack/ironic/+/94788117:36
opendevreviewJay Faulkner proposed openstack/ironic-python-agent master: Remove eventlet from Ironic Python Agent  https://review.opendev.org/c/openstack/ironic-python-agent/+/94609118:03
JayFthis should be ready for review once the requirements change merges \o/ go cid go18:04
JayFShould https://opendev.org/openstack/molteniron still exist?18:07
JayFcd ..18:08
JayFwhoops18:08
TheJulialikely not at this point18:09
JayFSo I did an audit of all Ironic managed projects; IPA migration pending review, only remaining ones that need eventlet gone are NGS, NBM, and Ironic-proper18:10
TheJuliacool, that matches my upstream and downstream notes18:16
opendevreviewMerged openstack/ironic master: Add several NV GPU types  https://review.opendev.org/c/openstack/ironic/+/94721318:34
opendevreviewJay Faulkner proposed openstack/networking-baremetal master: WIP: Remove eventlet  https://review.opendev.org/c/openstack/networking-baremetal/+/94798518:43
JayFCID and I did some research, it looks like oslo.service does not have a working backend for anything but eventlet right now, and the WIP one didn't work (assuming I used it right). It will not be a good use of our time to pursue further eventlet migrations until this is in a better state.18:46
dtantsurI'm curious how much of oslo.service we'll need once we get rid of its wsgi bits18:51
JayFLoopingCall is a big user18:51
JayFanother one too18:51
dtantsurLoopingCall should be reasonably easy to replace18:52
JayFbut they have the threading backends for those in progress, I see no need to reimplement it on top of them instead of just waiting18:52
dtantsurcid, JayF, https://review.opendev.org/c/openstack/requirements/+/947727 needs more love still, they won't merge it until we do the needfull18:52
dtantsuryeah, I agree, maybe we just wait at least for some time18:53
cidYeah, on my list of things to do today. Though, the last time I looked at it, it wasn't very clear. Things always make sense to me the second time, so, I should have addressed any concerns on that today, tomorrow18:53
TheJuliaI'd be *super* hesitant to wait beyond the end of this cycle18:53
dtantsurcid: the wiki is a bit vague, but check any previous patch that added a new requirements18:53
dtantsure.g. https://review.opendev.org/c/openstack/requirements/+/93422018:54
* dtantsur disappears again since it's 9pm18:54
cid++18:55
JayFTheJulia: I already put a comment linking to the NBM change on that oslo.service cchange, I really think it's close19:00
JayFTheJulia: and I suspect with a waiting user it might go even faster19:00
TheJuliaack19:01
JayFyeah literally that threading patch updated this morning and is moving fast19:01
JayFHerve and Daniel on the case :D 19:01
TheJuliacool cool19:37
BrodeyHey everyone! I hope it's okay to ask questions in here... If I'm doing this improperly I apologize (first time user).20:32
BrodeyWe are struggling with an Ironic problem right now where we are trying to plug tenant network vlans on all baremetal node ports (we have 12 ports). 1 is pxe_enabled which works through the cumulus genericswitch plugin, but when booting the machine into the tenant network, we're noticing the ports are never sent to the ml2 driver for plugging. Is this supported somehow?20:34
JayFYou're in the (well, a) right place. Although for a question like that you might have better luck on the list. Just hang out a bit and maybe someone will know how to point you20:36
BrodeyThanks Jay, will post in mailing list if there's no luck in here.20:37
* TheJulia sees 12 ports and blinks20:37
TheJuliaBrodey: what is the network_interface set to?20:38
TheJuliafor the ironic baremetal node20:38
Brodeynetwork_interface is neutron20:39
TheJuliacould you provide insight into the vifs your attempting to plug, your saying vlans and I guess we need to try an dunderstand structure and if it matches out or not20:40
BrodeyThe VIFs we’re attempting to plug are on VLAN-backed provider networks. Each baremetal port right now is mapped to a physical switch/interface using local_link_connection, and we assign VLAN segments to the networks manually. We then boot through nova using --network flag.20:43
BrodeySorry, hope that is helpful20:43
TheJuliano, that is good20:43
TheJuliadoes `openstack baremetal node vif list <node_uuid>` work for the deployed node indicating the vifs which are expected ?20:44
TheJuliathose should all have segmentation ids already set20:44
BrodeyIf I list VIFs on the baremetal node, I only see one, which I assume is the VIF for the pxe_enabled port20:45
BrodeyYeah, confirmed. That's the pxe_enabled port.20:46
TheJuliaif you ask neutron to show that vif (port) (openstack port show <vif_id>), what do you see?20:46
BrodeyIt's the active port with the mac_address that maps to the pxe_enabled port on the baremetal node. bound_drivers.0='genericswitch'20:47
Brodeybinding_vnic_type baremetal20:47
TheJuliaany change you can share the output, redacting mac address and ip addreses ?20:48
Brodeysure, paste the entire json in here? 20:48
TheJuliause paste.openstack.org :)20:48
TheJuliaand share the link20:48
Brodeyroger, one sec20:48
Brodeyhttps://paste.openstack.org/show/btDcoC83X0MpGGue1lqc/20:50
TheJuliaI guess that is a port which is being requested with openstack server create command as well?20:51
TheJuliasince it has a name test-port ?20:51
BrodeyYep!20:52
TheJuliaokay, so all signs point to only one vif actually being requested20:52
TheJuliaAnd everything there looks good, but your missing everything else your seeking20:52
BrodeyIndeed. Should we be attaching vifs manually? 20:53
TheJuliawhich I guess might be a gap to do it based upon networks. (Long story short, there are some fun issues and challenges around how networking can be requested for nova instances and how that can get translated out to nodes, so I guess we need to understand the commands used in a bug and if you can provide reasonably redacted details into a bug in launchpad20:53
TheJuliayou can, I'd think nova should be doin git, I'm wondering if it is some case which has a bug or if everyone has just done explict ports20:54
TheJulia(there are literally like 5 ways to get to a nova instance to have networking as well, which doesn't help the overall challenge and result)20:54
BrodeyHmmm, we should be able to create the ports and then attach VIFs for now if that would work?20:55
TheJulia(and turns out, if you request ports with deferred addressing, bad things also happen... which happens to be a bug I'm working on20:55
TheJuliayeah, that *should* work and I'd expect that to work20:56
TheJuliaI'm not sure it is really expected by nova for one to mix models of requesting networking either, which might not be helping the overall issue20:56
TheJuliabecause your obviously getting a deployed instance20:56
TheJuliaironic is only being told 1 vif20:56
TheJuliawhen it sounds like there should be *12*20:57
TheJulia(btw, impressive!)20:57
BrodeyOk so at nova create time, go through each non-pxe_enabled port on the baremetal node, create a port in nova and then do a baremetal node vif attach --port-uuid20:57
BrodeyHeh, yeah we're a GPU cloud provider and hoping to use Ironic, we're at POC stage. 12 ports include gpu fabric etc :) 20:57
TheJuliayou can just request the instance from nova with all the specific ports you want20:57
TheJuliaAHHH20:57
TheJuliawell, depending on what your doing, I'm sort of wondering if you even should be using nova in that case20:58
Brodeyvs just deploying to ironic directly? 20:58
TheJuliayou can alternatively attach vifs directly after the fact or separately, you just want to be mindful of the overall flow20:58
TheJuliaSort of depends on what happy path you want to take20:58
BrodeyNova would be nice to have as our direct interface given we want to be able to provision VMs alongside baremetal20:59
TheJuliayeah20:59
TheJuliaThat is generally why folks tend to do it20:59
TheJuliaat least when you have a blend of needs and cases21:00
BrodeySo, your recommendation is to create all 12 ports for the nova instance upfront when creating the instance and let Nova/Neutron try to attach the VIFs? Otherwise, create the ports and do the baremetal vif attach manually?21:00
Brodeybtw, we also tried neutron.add_all_ports option but that didn't seem to work for us21:01
TheJuliayeah, ideally, the flow which is actually used is nova will tell ironic what the vifs are, and then ironic will attempt to map the vifs to ports on the phyiscal node as part of the deployment/provisioning flow and not before21:01
JayFBrodey: which GPU cloud provider? I work for G-Research (OSS side) and we likely share a lot of problems :)21:02
TheJuliaheh21:02
JayFBrodey: While I'm not a network guru, happy to touch brains at some point :)21:02
* TheJulia was wondering until someone was going to jump on this discussion21:02
JayFNGS troubleshooting is just not my bag 😆21:02
* TheJulia still expects her neighbor to remark about burning ears next when she gets home21:02
TheJulia(although, different networking and use cases)21:03
BrodeyOk, sounds good. I'll try the first option (create all ports when creating the instance) and see what happens.21:03
BrodeyJay, I work for WhiteFiber, a pretty new GPU cloud. :) 21:03
TheJuliaBrodey: clearly you've brushed up against a bug, do you know where to file bugs?21:04
JayFBrodey: drop me a line at jay@gr-oss.io -- we have a whole group at GR-OSS that loves to sync up with folks solving similar problems, openstack or otherwise (you may have heard of armadaproject.io ?)21:05
BrodeyI do not. Do you have a 'how to file a bug' link handy? otherwise I can find one. :) 21:05
JayFbugs.launchpad.net21:05
JayFproject name is ironic21:05
Brodeyheh sounds good, appreciate it21:06
BrodeyJay, sounds good i'll do that21:06
TheJuliahttps://bugs.launchpad.net/ironic21:06
TheJuliaA more direct-ish link ;)21:06
JayFhttps://bugs.launchpad.net/ironic/+filebug MAXIMAL DIRECT LINK :p 21:06
TheJulialol21:06
JayFhonestly, I <3 that launchpad has useful urls like this21:06
JayFone of the benefit of using an older less wizbang tool21:07
TheJuliaheh21:07
* TheJulia gets text message from the wifey asking for me to stop for cat foot and wonders if it is just time to packup and head out21:07
JayFTheJulia: btw; I haven't documented this anywhere upstream yet; but I've asked cid to look at port fetching perf in the api and to look at maybe hooking NBM to sharding21:09
JayFBasically I learned today that at some scaled environments it's turned off because that query times out21:09
TheJuliak, eek21:09
JayFyeah, I figured I'd mention to you specifically since you were on the nodes speedup21:09
TheJuliayeah, harald had a change for that someplace along the way21:10
JayFI showed him the benchmark stuff and pointed him at the 2019 blogpost21:10
JayFoooh21:10
TheJulia... gawd 2019 was so long ago21:10
JayFcid: look at https://review.opendev.org/q/owner:hjensas@redhat.com+status:open for some relevant changes to that port/nbm speedup work21:11
JayFcid: looks like he has changes up for improving benchmarking and trying to setup filtering by conductor group21:11
JayFhttps://review.opendev.org/c/openstack/ironic/+/862292 in particular will be helpful21:12
JayFhook this up and shard to the node21:12
TheJuliahttps://review.opendev.org/c/openstack/ironic/+/864781 <-- the one I was thinking of21:13
TheJuliamerged in 202321:13
JayFthat means it's not in the downstream that reported the issue21:13
JayFnice21:13
opendevreviewJulia Kreger proposed openstack/ironic master: WIP: Patch configdrive metadata  https://review.opendev.org/c/openstack/ironic/+/94667721:16
TheJuliaspeaking of fun bugs21:16
TheJuliastill need to check/validate the pre-mastered ISO image path and write more tests21:17
TheJuliaJayF: fwiw, I don't think anyone sat down and benchmarked ports really after that, but yeah, any using any other pattern further away from the DB server makes port listing slow... just like it did with node listing back in the day21:18
JayFyep, it makes sense21:18
JayFI'm just glad to know we have performance to upgrade into21:18
TheJuliathe assocation_proxy stuff ends up doing two closely knit db queries and sqlalchemy merges the results before returning the to the caller21:19
TheJulia++21:19
TheJuliaspeaking of networking-baremetal, one of vasyl's chorts posted a neutron patch which we really need to get folks to look at21:19
TheJuliawhich addresses orphaned agent records21:20
TheJuliawell, enables it, adds a agent status call to the rpc method21:21
* TheJulia begins to back up to head to the store and then back up the mountain (because triple overtime last night wasn't enough...)21:27
JayFI gotta be honest, I'm too old for 3OT unless it's like a game 7 for the  cup or semis lol21:27
TheJuliaJayF: kind of yeah, I was starting to call it the game that never ends since we were like less than 2 minutes from a forth overtime21:28
TheJulia(wranglers v firebirds, so two strong offensive teams too)21:29
TheJuliasee y'all in the morning21:42
JayFo/21:43

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!