Tuesday, 2025-01-21

opendevreviewJulia Kreger proposed openstack/ironic master: OCI container adjacent artifact support  https://review.opendev.org/c/openstack/ironic/+/93789600:37
opendevreviewMerged openstack/ironic-python-agent master: Silence modprobe loading errors for IPMI drivers  https://review.opendev.org/c/openstack/ironic-python-agent/+/93704201:48
opendevreviewcid proposed openstack/ironic master: API/Testing: Inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921702:02
opendevreviewcid proposed openstack/ironic master: DB: migrate inspection rules' database  https://review.opendev.org/c/openstack/ironic/+/93931802:02
opendevreviewcid proposed openstack/ironic master: Apply Rules: inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921802:02
opendevreviewcid proposed openstack/ironic master: DB: migrate inspection rules' database  https://review.opendev.org/c/openstack/ironic/+/93931802:33
opendevreviewcid proposed openstack/ironic master: API/Testing: Inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921702:33
opendevreviewcid proposed openstack/ironic master: Apply Rules: inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921802:33
rpittaugood morning ironic! o/08:07
rpittauJayF: I'll check the metal3 errors, and apologies I'm a bit late with the documentation on the metal3 job, I'll try to get it up for review this week08:07
rpittauJayF: and thanks for fixing the pep8 error!08:08
rpittauI wonder if it's because of the switch to UEFI in metal3-dev-env https://github.com/metal3-io/metal3-dev-env/pull/132608:12
rpittauah! libvirt: QEMU Driver error : Failed to open file '/usr/share/OVMF/OVMF_VARS.fd': No such file or directory08:12
opendevreviewRiccardo Pittau proposed openstack/ironic master: Force legacy boot for metal3 integration job  https://review.opendev.org/c/openstack/ironic/+/93968708:21
rpittauJayF, TheJulia, cardoe, cid, iurygregory ^ this should fix the metal3 integration job until we run noble 100% in metal3-dev-env08:22
opendevreviewRiccardo Pittau proposed openstack/ironic master: [WIP] test fix for metal3 job UEFI boot on noble  https://review.opendev.org/c/openstack/ironic/+/93969408:43
opendevreviewRiccardo Pittau proposed openstack/ironic master: [WIP] test fix for metal3 job UEFI boot on noble  https://review.opendev.org/c/openstack/ironic/+/93969408:45
opendevreviewJakub Jelinek proposed openstack/ironic-python-agent stable/2024.2: Fix RAID volume name  https://review.opendev.org/c/openstack/ironic-python-agent/+/93970010:16
opendevreviewJakub Jelinek proposed openstack/ironic-python-agent stable/2024.2: Add a release note for 939340  https://review.opendev.org/c/openstack/ironic-python-agent/+/93970110:16
kubajjrpittau: I am sorry, I did it wrong. Will try to figure out how to merge them into one backport10:19
opendevreviewDerek Higgins proposed openstack/ironic master: Fix typo calling save_and_reraise_exception  https://review.opendev.org/c/openstack/ironic/+/93970410:53
* jssfr curses the Intel board10:53
jssfrI lost an hour trying to get it to speak NFS via the Web UI10:53
jssfrI should've gone for sushycli immediately, there it just works.10:53
lucadelmontehello, not sure if this is the right channel to ask some clarificatin about ironic node console integration with nova, i am having some issue to make it work, following the documentation i setup a node with ipmitool-socat console driver, and issuing the baremetal command i am able to get the socat connection working, when i issue the command from nova (openstack console url show --debug  --serial <SERVERID>) i re11:01
lucadelmonte"unexpected exception in api method: attributeerror: 'node' object has no attribute 'get_node_console", does someone have some ideas on how to fix it?11:01
opendevreviewcid proposed openstack/ironic master: DB: migrate inspection rules' database  https://review.opendev.org/c/openstack/ironic/+/93931811:53
opendevreviewcid proposed openstack/ironic master: Apply Rules: inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921811:53
opendevreviewcid proposed openstack/ironic master: API/Testing: Inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921711:53
opendevreviewcid proposed openstack/ironic master: DB: migrate inspection rules' database  https://review.opendev.org/c/openstack/ironic/+/93931811:58
opendevreviewcid proposed openstack/ironic master: Apply Rules: inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921811:58
opendevreviewcid proposed openstack/ironic master: API/Testing: Inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921711:58
iurygregoryrpittau, ack12:23
rpittaukubajj: no worries :)13:25
TheJuliaugh, metal3's job still uses legacy boot?!14:14
opendevreviewJulia Kreger proposed openstack/ironic master: Automatic zstd detection and decompression...  https://review.opendev.org/c/openstack/ironic/+/93890414:14
rpittauTheJulia: well... yeah :)14:15
rpittaubut recent change moved to uefi default and we should be able to switch that soon(TM)14:15
opendevreviewJulia Kreger proposed openstack/ironic master: Automatic zstd detection and decompression...  https://review.opendev.org/c/openstack/ironic/+/93890414:21
TheJuliarpittau: okay, cool14:44
TheJuliajust yeah, bios defaults != good at this point ;)14:44
* TheJulia watches ironic download fedora coreos quietly from an image registry14:44
TheJuliaConsole log from podman's machine-os runtime image :) https://usercontent.irccloud-cdn.com/file/josY3lSr/fcos-booted.png15:10
opendevreviewMuhammad Ahmad proposed openstack/ironic master: doc/source/admin fixes part-4  https://review.opendev.org/c/openstack/ironic/+/93946715:20
opendevreviewMerged openstack/ironic master: Force legacy boot for metal3 integration job  https://review.opendev.org/c/openstack/ironic/+/93968715:23
rpittaugood night! o/17:04
opendevreviewJulia Kreger proposed openstack/ironic master: ci: fix ipv6 CI job  https://review.opendev.org/c/openstack/ironic/+/93973217:07
TheJuliaI *think* that will fix our v6 CI job17:07
opendevreviewMerged openstack/ironic master: Migrate documentation from ironic-lib  https://review.opendev.org/c/openstack/ironic/+/93927517:29
*** eandersson0 is now known as eandersson17:51
*** edebeste6 is now known as edebeste17:51
keekzhi all, we're using nova+ironic and getting errors during deletes. i've come across https://review.opendev.org/c/openstack/nova/+/883411 which seems to be the issue i'm hitting, but this issue appears to have gotten stuck or abandoned along the way. hoping we can maybe get more traction on it or something17:54
masgharIn your change, Julia, the metal3-integration job is failing with an error 404 on this URL: https://github.com/kubernetes-sigs/cluster-api/releases/download/v1.9.4/clusterctl-linux-amd6418:04
JayFthat's what happens when a release is performed and it's not updated yet AIUI18:05
JayFin this case, probably not a metal3 release but a capi release18:05
masgharYeah, the link works for v1.9.318:07
masgharWe could recheck in a few hours, I guess?18:07
JayFUsually the window is more like 30-60 minutes AIUI18:12
JayFbut that's an educated guess moreso than actual fact :D 18:12
masgharAlright, that should be soon then18:12
TheJuliamasghar: QHIXH XHNfw?18:13
TheJuliaerr18:13
TheJuliamasghar: which change?18:13
masgharThis one: https://review.opendev.org/c/openstack/ironic/+/93973218:14
TheJuliaI'm not sure if I posted it before, or after rpittau's change18:15
TheJuliayay for instability18:16
masghar\o/18:16
TheJuliakeekz: It likely needs to be revised at some point, that being said the errors really shouldn't be fatal18:18
keekzthe nova instance goes in to 'error' and is stuck there until the user second delete is issued, so it breaks things like ansible or terraform since it's an error18:19
JayFI rebased that nova change to "bump" it since it was going to be reviewed at one point :) 18:20
keekzthanks Jay!18:22
TheJuliakeekz: I would engage on that change then explicitly note that so people can relate to your experience18:28
JayF++18:28
opendevreviewJulia Kreger proposed openstack/ironic master: ci: fix ipv6 CI job  https://review.opendev.org/c/openstack/ironic/+/93973218:37
opendevreviewJulia Kreger proposed openstack/ironic master: Fix hold/wait step logic in step validation  https://review.opendev.org/c/openstack/ironic/+/93940119:20
rm_work[m]<JayF> "So there's a new firmware update..." <- the current process was not via redfish but we are JUST starting to support that sooooo I'm gonna find out if I should tell them "just wait until redfish support is fully rolled out" lol20:10
JayFI think part of that statement was also 20:11
JayFI don't know if we've integrated agent-style updates into those flows yet20:11
JayFbut you can, as always, just make a step and run it at will :) you just forgo the nice api for it20:11
rm_work[m]<rpittau> "rm_work: you can find recent..." <- thanks, I looked at this before but I think I misunderstood part of it, when I had looked at the code I saw very specific firmware managers but this does look more generic on second pass so I'll see if I can make it work :)20:11
rm_work[m]JayF: yeah i think we're semi-ok with just having something that does these triggers, but it would be nice if it just happened on every normal clean actually I think? unless I'm missing something important. I'm not actually a baremetal expert... yet ;)20:12
rm_work[m]my first look at Ironic was when I was at Yahoo20:13
JayFit's not difficult to add a step to the default flow for automated cleaning if you're doing a custom hwm already20:13
JayFThat was a pretty nonstandard installation fwiw20:13
JayFwith a lot of things I would suggest not emulating :)20:13
rm_work[m]lol, noted20:14
rm_work[m]so wait, what is "via Firmware Interface" vs "via Management Interface", both are redfish?20:20
rm_work[m]our firmware guys are telling me we need to update some stuff in-band, this is considered... in-band or out-of-band? like NIC and Drive firmwares20:21
rm_work[m]which I see it doing, but...20:21
rm_work[m]I also showed them this page and they still told me to do something else, so I don't know if they are misunderstanding or I am XD20:22
JayFThis is a lossy version of the story20:23
JayFbut basically many Ironic base APIs are a bit imperative: setup some context, or some configuration, then things can work20:23
JayFbut not very declarative20:23
JayFfor things like metal3 to integrate, we need a more declarative API for it to adapt to20:23
JayFso we've been moving in that direction a while (even before metal3) 20:23
JayFso think of ManagementInterface as the "everything" tool, and FirmwareInterface as the "firmware" tool20:24
JayFusing the FW Interface lets you rely on things like e.g. asking our API what firmware version you're using20:24
JayFand allowing you to do it out of band (I assume, similar to RAID, we one day may integrate in-band with this as well)20:24
opendevreviewJulia Kreger proposed openstack/ironic-specs master: OCI Container Registry Image Source  https://review.opendev.org/c/openstack/ironic-specs/+/93361220:24
JayFso for your case, you may be better off using the more generic, blunt tool (agent-based steps)20:25
rm_work[m]wait, so is one new vs old, or are they both parallel?20:25
JayFFirmwareInterface is "new" 20:25
JayFbut the old way was essentially "define a step and run it"20:25
JayFwhich continues to work with no real end in sight20:25
JayFiurygregory: ^^ I am probably butchering this description20:25
rm_work[m]ok, and FWI is out-of-band?20:25
rm_work[m]is MI in-band or also out-of-band?20:26
JayFSo MI is out of bank20:26
JayFagent steps are kind weirdly in the middle20:26
JayFbecause agent is technically deploy interface20:26
rm_work[m]honestly I feel like a dummy here because I only have a minimal conceptual idea what those even mean lol20:26
rm_work[m](in this context)20:26
JayFmanagement interface is usually more like, what is my next boot device? am I UEFI or BIOS booting?20:26
JayFbasically think of it as:20:28
JayF- API driven firmware update that supports OOB (and can be used in cleaning as well)20:28
JayFvs20:28
JayF- Creating a custom agent step that runs when you want it to (that just happens to update firmware)20:28
jssfrTheJulia, I managed to make the cursed intel board boot off an nfs-hosted image today. We'll work on getting more nodes up and once we reach the next batch of nodes with that intel board, I'll look into patching Ironic to support automatic translation of http to nfs URLs. So don't be surprised if it takes a couple weeks.20:28
rm_work[m]ok, I will go over your answers with someone who actually understands the firmware process and maybe I can actually get a grasp on this :P thanks20:32
rm_work[m]not sure why they didn't think the existing thing wouldn't work without code modifications, maybe there was a misunderstanding and I said something that made them think that20:34
JayFI mean, if you have something that you can run on a machine noninteractively and upgrade firmware if needed20:37
JayFyou're only minor glue code into getting it done20:37
rm_work[m]yeah I just wasn't sure if code was needed AT ALL or if we just need to pass the right json blob somewhere, which is looks like ... MAYBE20:37
JayFlook at https://opendev.org/openstack/ironic-python-agent in the examples directory20:39
JayFI think we explicitly have an in-band firmware update example20:39
JayFplus google for "ironic python agent hardware manager" for more info on that20:39
JayFwe have good docs they are on IPA side not in Ironic docs tho20:39
JayFyeah, vendor device example20:40
TheJuliajssfr: ack, okay!21:26
opendevreviewJulia Kreger proposed openstack/ironic master: ci: fix ipv6 CI job  https://review.opendev.org/c/openstack/ironic/+/93973221:46
rm_work[m]JayF: ^^ rchow_geico is our actual firmware guy, so maybe he can explain our difficulties / differences from the upstream process and see if there's a good way for us to proceed, and if anything we're doing would actually be useful upstream or if it's all downstream garbage :D21:54
JayFI'm only going to help if I get to meet the talking gecko /s21:55
rm_work[m]hey I want to meet him too, me first :P21:56
rchow_geico[m]No but 15 minutes could save you 15% or more on car insurance21:56
TheJulialol21:56
TheJulia.... Definitely can't on my mobile command center.21:56
TheJulia(RV is larger than Geico's cutoff)21:57
rm_work[m]wonder if there are friends and family exceptions... <_<21:57
TheJuliaEh, I'm in the realm of specialty policy, since National General knows they need to order a flatbed up if I need a tow....21:58
* TheJulia hopefully forever avoids such a need.21:59
JayFhow the hell did I not see that answer coming21:59
TheJuliamagic?!22:00
TheJuliaso I suspect the ipv6 job issue is also neutron is slow at getting radvd up as well, so in test scenarios the job fails. Also tinycore linux doesn't work since it has no dhcpv6 client, so hopefully my fix will just "work"22:01
TheJuliaand then we make the v6 job voting22:01
rchow_geico[m]Thanks for the intro, i'll try to color the situation:... (full message at <https://matrix.org/oftc/media/v1/media/download/AaTYU5zwj-8pguQJcI1_9w8OMLC63Xm6f42xfWoF2x4BLKjo5RvxIrwcTbaWyRNrTe-80h01zlMEwH9DBgAshPZCeU0v3aBAAG1hdHJpeC5vcmcvQ1lMTUxqYUZnbGpTYmJ4VGNaVmtZWXJ6>)22:03
TheJulia#2 seems easy enough with a custom hardware manager, which depending on available steps it returns could involve itself to do it during deployment, or during something like cleaning22:06
rchow_geico[m]I saw some examples via the nvidia/marvell manager if that's the hardware manager you're referring to22:08
TheJuliaWell, that is integrated in because they proposed the code, but realistically you can create your own hardware manager which exposes available steps which can then be pulled in22:08
TheJuliaJayF is like the best advocate of hardware managers. :)22:09
rm_work[m]yeah that is the interface I was looking at extending, making my own manger there to do our custom steps, and generating images with the firmware blobs baked in rather than pulling them for consistency/verification22:14
rm_work[m]so basically you'd just say "step: do the thing" and it would ... do the thing22:14
TheJuliaYeah, that is sort of a preference item, up to you. I think like... cern puts them in a container and mounts the container up right before launching the agent22:15
JayFand I'd say in terms of "installed ironic doing things", most installations I know of are still using the in-band/step methods for firmware22:21
JayFobviously we don't want that to be true forever, but it's not like you'd be strange to go down that route22:21
JayFlook at https://opendev.org/openstack/ironic-python-agent in the examples directory -- the vendor device one -- that's an example of what you need to do with a lot of info inline, and a google for "hardware managers ironic python agent" should get you the full docs from the IPA repo22:22
rchow_geico[m]kk will review this with rm_work and see if this makes sense between the both of us...thanks!22:33
rm_work[m]o722:34
rm_work[m]yeah maybe possible with no-code? would love that22:35
rm_work[m]or is this an example of how you'd do it given a custom device manager for a vendor22:35
rm_work[m]yeah ok reviewed the code, i see what you are saying, this IS an example hardware manager22:36
JayFif you want to do interesting hardware things, you're going to get into the business of a custom hardware manager22:36
JayFunless you only buy homogenous 100% perfectly redfishy hardware :D 22:37
rm_work[m]that sounds fantastic, rchow_geico can we just switch to doing that? :D22:38
rm_work[m]I see there is also an example custom disk eraser, which is something else we likely want to do22:40
rm_work[m]great examples :D22:41
TheJuliaJust as a general recommendation, we generally advise not to replace the actual step which can erase disks which is included in ironic-python-agent's source code. There have been some hard learned lessons captured in that code. That being said, it may have some downsides depending on exact cases your trying to support22:42
TheJuliaOne thing, for example, one of the redhat folks complained internally that ironic erased the thumb drive he put into the back of a DPU which he ran cleaning on22:43
TheJulia... obviously, it presents as a block device, and is thus erased22:43
rm_work[m]ah I was thinking it would be an additional step -- that is the idea right? or are you saying we should probably just patch the existing one22:43
TheJuliaSort of depends on what you need to do and achieve, and what risk your attempting to manage at the end of the day22:43
rm_work[m]though I need to double-check if we REALLY need to do custom stuff, some things I think we can either get away with just doing what we get out of the box22:44
rm_work[m]or else ... punt on22:44
rm_work[m]yes, priorities, risk management, ROI22:44
TheJuliayup22:44
TheJuliaexactly22:44
TheJulia(and occassionally push some worthwhile patches upstream) :)22:45
rm_work[m]I would like to make that happen, yes22:45
JayFrm_work[m]: we have seen more than a couple of folks who essentially thought "I can erase things with simpler code to make it More Secure(tm)"22:45
JayFrm_work[m]: so the cautionary tale is given :)22:45
TheJuliaSpeaking of... my next door neighbor just asked for the OCI Container Registry client stuffs. Folks should review that quick! :)22:45
rm_work[m]lol who are you in a caravan with? :D22:46
TheJuliaJayF: Remember when that person came in after bricking all those disks?! :)22:46
TheJuliarm_work[m]: my next door neighbor is an old ironic contributor22:46
rm_work[m]nice22:46
JayFshould I tell him you called him old? /s22:47
rm_work[m]eugh, offtopic -- i got back to openstack finally and was excited to get to meet up with folks again, but heard we're pretty much done with IRL summits? at least anything larger than local/regional?22:47
TheJuliaJayF: *shrug* :)22:47
JayFI know many of us will be at OpenInfra Days NA colocated at SCALE22:48
JayFand SCALE is a great conference anyway22:48
* TheJulia puts on the board chair hat22:48
rm_work[m]well I missed the opportunity to go over to JayF's for BBQ when I was still in the area, now over on the east coast for a bit :/22:48
TheJuliaA side-effect of the pandemic is largely that in-person events have become harder to market from a massive sense of getting all community members into one place. Some of it is corporations just keeping tight travel budgets, coupled with the pandemic really changed people's views on travel and even exposure to others in terms of in a space and possibly getting sick.22:49
rm_work[m]yeah :/22:50
rm_work[m]prior I always just expected con-plague was a thing and I'd be sick for a few days after22:50
rm_work[m]¯\_(ツ)_/¯22:50
rm_work[m]was usually worth it22:50
TheJuliaAs such, we have focused more to the regional events where we're also trying to overlap with other events to enable higher value of attendance *and* also enable cross-pollination of ideas/communities.22:50
rm_work[m]yeah makes sense22:50
TheJuliaYeah, I know of contributors who outright refuse all travel now22:51
rm_work[m]will have to see if we can get buy-in internally to head to some of these22:51
TheJuliaI'm looking forward to SCaLE22:51
TheJuliabut for me it is just a drive, really22:51
rm_work[m]well, i had 14 flights in 2024 with ~60h in the air, I am still good with travel :P22:52
TheJuliaYeah, I didn't fly as much as I would have preferred, but it is what it is22:53
JayFI am trying for a light-travel year22:55
JayFpartially b/c it's likely I'll take some significant time off this summer22:55
TheJulia++22:55
opendevreviewcid proposed openstack/ironic master: API/Testing: Inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921723:41
opendevreviewcid proposed openstack/ironic master: Apply Rules: inspection rules migration  https://review.opendev.org/c/openstack/ironic/+/93921823:41
opendevreviewJulia Kreger proposed openstack/ironic master: ci: fix ipv6 CI job  https://review.opendev.org/c/openstack/ironic/+/93973223:45

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!