Tuesday, 2025-05-20

opendevreviewcid proposed openstack/ironic master: Fix IPA external inspection callback url override  https://review.opendev.org/c/openstack/ironic/+/94952100:03
iurygregoryok, inserted on slot 1, but the machine timeouts yay01:07
iurygregory Cleaning for node 11111111-2222-3333-4444-555555555555 failed. Timeout reached while cleaning the node. Please check if the ramdisk responsible for the cleaning is running on the node. Failed on step {}.01:07
iurygregoryyeah, I wish i had ramdisk logs at all =(01:08
iurygregorytomorrow will be a new day, I will try something else on iDRAC10, good night folks!01:28
cid\o01:46
opendevreviewJacob Anders proposed openstack/sushy master: Skipping UsbCd workaround on Supermicro ARS-111GL-NHR  https://review.opendev.org/c/openstack/sushy/+/94911602:06
opendevreviewMerged openstack/ironic master: Fix redfish driver URL parsing  https://review.opendev.org/c/openstack/ironic/+/94959704:35
rpittaugood morning ironic! o/06:49
ContinuityMorning Ironic08:05
opendevreviewElod Illes proposed openstack/ironic unmaintained/xena: [Stable Only] pin virtualbmc/sushy-tools/ironic-tempest-plugin to last released tag  https://review.opendev.org/c/openstack/ironic/+/94571608:50
abongaleGood Morning Ironic!08:58
dtantsuriurygregory: let me join the voices thanking you for chasing the iDRAC 10 issue :)09:17
dtantsuriurygregory: after doing https://github.com/dell/iDRAC-Redfish-Scripting/issues/324#issuecomment-2892526106, does the machine actually boot into the ISO if rebooted?09:18
dtantsurIf not, it's a valuable information to tell them IMO09:18
dtantsurAh, I should have read the scrollback, right? So, try booting it with their scripts only, not Ironic. If it does not boot, complain on the bug and let us escalate it.09:24
iurygregorydtantsur, the machine shows a screen that is trying to boot.. but after some time "power off" I'm going to re-install other firmware to test and try to manually boot an iso to install an OS just to see how things go.10:54
Sandzwerg[m]So I created a bug for the UEFI/MBR-partition thing: https://bugs.launchpad.net/ironic/+bug/211131911:08
Sandzwerg[m]Does it already make sense to notice dell of https://github.com/dell/iDRAC-Redfish-Scripting/issues/324 ? If yes then I'll open a ticket internally.11:10
iurygregorySandzwerg[m], you work at Dell? <eyes>11:20
Sandzwerg[m]No, Not at all. But we have a Dell representative at my place (also a HPE and Lenovo one) and can make internal tickets for stuff we want them to fix11:24
Sandzwerg[m]Like we get premium support and have meetings every two weeks. And if you say you can't buy their hardware because it breaks your automation that sometimes helps to fix things.11:25
Sandzwerg[m]Of course if they follow the redfish standard all should be fine11:25
iurygregoryIt would be good to reach out to Dell, the way I've found was by opening the issue 11:27
Sandzwerg[m]OK I'll open a issue so our contact is aware. We don't have idrac 10 yet but that would be a blocker11:29
iurygregorymuch appreciated Sandzwerg[m] o/11:30
iurygregorydtantsur, ok seems like the latest iDRAC firmware had some problems, i was unable to manually boot, did a rollback to  1.20.25.00  at least now it gets to the screen saying `Virtual CD Boot Requested by iDRAC`, will test via their script now11:58
dtantsur++12:28
opendevreviewRiccardo Pittau proposed openstack/bifrost master: Default ansible to version 10.x  https://review.opendev.org/c/openstack/bifrost/+/94824512:29
opendevreviewRiccardo Pittau proposed openstack/ironic master: [WIP] Run metal3 integration job using UEFI boot (default)  https://review.opendev.org/c/openstack/ironic/+/93969412:32
opendevreviewVerification of a change to openstack/ironic unmaintained/xena failed: [Stable Only] pin virtualbmc/sushy-tools/ironic-tempest-plugin to last released tag  https://review.opendev.org/c/openstack/ironic/+/94571612:38
opendevreviewElod Illes proposed openstack/ironic unmaintained/xena: [Stable Only] pin virtualbmc/sushy-tools/ironic-tempest-plugin to last released tag  https://review.opendev.org/c/openstack/ironic/+/94571612:40
iurygregorymachine boots when doing via their scripting.., testing with ironic again hardcoding the VirtualMedia/1 ...12:42
opendevreviewcid proposed openstack/ironic master: Add port/portgroup list conductor groups filter  https://review.opendev.org/c/openstack/ironic/+/86229212:45
iurygregoryhttps://dl.dell.com/content/manual13739887-overview-of-idrac10-redfish-enhancements.pdf?language=en-us13:06
TheJuliagood morning13:22
TheJuliaI've only read a few fragments of that doc reading, but..... some aspects of it just don't add up13:27
iurygregoryagree13:27
TheJuliaand obviously, it lacks what has been discovered13:31
TheJuliaJayF: rpittau: https://review.opendev.org/c/openstack/ironic/+/950192 was previously approved and had a typo fixed in the release note13:44
TheJuliaA quick re-review would be appreciated13:45
rpittaudone13:47
iurygregorytime to test again idrac10 with ironic and see if the script GetIdracLcSystemAttributesREDFISH.py will give me the same output 14:04
iurygregoryok the script reports Attribute Name: ServerBoot.1.FirstBootDevice, Current Value: VCD-DVD, the iso is inserted on slot 114:19
iurygregoryhttps://paste.opendev.org/show/bejGD6UuT87uOy4F0D0I/14:20
iurygregoryYAY Kernel Panic FTW14:22
iurygregorywondering if it's a bad image =( or some magic in the machine...14:24
queensly[m]I'm working with sushy and using the Redfish emulator from sushy-tools. I’m calling mgr_inst.datetime in Python, but it’s None. I already added DateTimeLocalOffset, and that works fine. Is there something else I need to add in the manager.py file?14:32
queensly[m]* I'm working with sushy and using the Redfish emulator from sushy-tools. I’m calling mgr_inst.datetime in Python, but it’s None. I already added DateTimeLocalOffset, and that works fine. Is there something else I need to add in the manager.py file?14:33
queensly[m]* I'm working with sushy and using the Redfish emulator from sushy-tools. I’m calling mgr\_inst.datetime in Python, but it’s None. I already added DateTimeLocalOffset, and that works fine. Is there something else I need to add in the manager.py file?14:33
opendevreviewVerification of a change to openstack/ironic master failed: CI: Reconfigure jobs to minimize tinyipa usage  https://review.opendev.org/c/openstack/ironic/+/95019215:13
opendevreviewMerged openstack/ironic-python-agent-builder master: Update pip version in dib source install  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/94997415:13
opendevreviewMerged openstack/ironic-python-agent-builder master: Build CS9 DIB IPA ramdisk with python 3.12  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/95015215:39
okamitok[m]Thanks Jay and Julia for the help over the last week.... (full message at <https://matrix.org/oftc/media/v1/media/download/AQjRLyN9iE8HCHAF-ocJsa6K1-8lVudXpwQ45heE9pP1rZIYXSAHcDhuzM3CWdBOGr3UReRqoxLhKiFIo92eCs5CeXNpPYfgAG1hdHJpeC5vcmcvTkJ3S0ltcFpORUhUcmRPaWxWTUJlSExm>)16:15
TheJuliaokamitok[m]: I suspect your nova-compute service is running with config drive disabled, if you explicitly set it to be enabled, then it should be populated on the config drive on the host. That is the best path to take and avoids reliance upon metadata services which have networking related quarks.16:31
TheJuliaokamitok[m]: regarding modifying post deploy, I'd recommend you just change the image to include the needful16:36
opendevreviewJulia Kreger proposed openstack/ironic master: DNM: CI Science - Expand the multinode job  https://review.opendev.org/c/openstack/ironic/+/95020616:49
opendevreviewVerification of a change to openstack/ironic master failed: CI: Reconfigure jobs to minimize tinyipa usage  https://review.opendev.org/c/openstack/ironic/+/95019217:16
opendevreviewJay Faulkner proposed openstack/networking-baremetal master: Remove explicitly use of eventlet  https://review.opendev.org/c/openstack/networking-baremetal/+/94798517:27
TheJulialooks like our devstack plugin is now broken17:56
TheJuliastart_neutron_api method from devstack is no longer found17:56
JayFhttps://github.com/openstack/devstack/commit/9e81048bbb3b3adbfb7bd5307af9bce79290308c17:58
TheJuliawhat is mind bending... start_neutron_api is not found18:01
JayFIt's not it's own process anymore.18:05
JayFit's gotta be metaprogramming somewhere to make those methods18:06
TheJuliaIt was a method at some point, but I don't see it in the history at this point18:09
TheJuliawell18:09
TheJuliastart_neutron is still there18:09
TheJuliaso...18:09
opendevreviewJulia Kreger proposed openstack/ironic master: ci/devstack: Remove start_neutron_api explict call  https://review.opendev.org/c/openstack/ironic/+/95045518:11
TheJulialooks like with the flow we now 503 on the network creation19:19
TheJuliawe're likley going to have to do heavier retooling19:19
* TheJulia sighs19:19
*** jcosmao is now known as Guest1642319:26
iurygregoryok, this is the weirdest thing i saw today, after the kernel panic the machine is unable to boot from the OS or from Virtual Media 19:29
iurygregorySystem if powered off -> HOST boot in progress -> Please wait while the system is initializing.. -> System is powered off19:29
iurygregoryLOL the iDRAC UI says there are no Disks omg19:32
iurygregoryhttps://paste.opendev.org/show/buc9su3tP5XaBmzw0y8D/19:32
* TheJulia blinks19:41
* TheJulia blinks some more19:41
* JayF 👀 on that neutron issue19:50
JayFso 19:58
JayFwe stop neutron-rpc-server (I'm unsure who "we" is yet) https://www.irccloud.com/pastebin/pLQZxkSW/19:58
JayFand it never gets restarted19:58
JayFthat is likely the root cause of the 50319:58
JayFand it doesn't look like failures; it looks like actual logic issues where we never even try to start it back 19:59
JayFTheJulia: ^ if you have anything to add, /me keeps digging19:59
TheJuliayeah, likely20:00
TheJuliaso start_neutron only restarts the API then20:00
TheJuliathat was similar behavior20:00
JayFstart_neutron is not the opposite of stop_neutron20:01
JayFand our code kinda assumes it does20:01
TheJuliayeah20:01
TheJuliawell, kind of20:01
JayFstart_neutron_service_and_check20:02
JayFI think is what we want20:02
JayFhttps://opendev.org/openstack/devstack/src/branch/master/lib/neutron#L61020:02
JayFstart_neutron only does agents20:02
JayFso maybe both20:02
* JayF sciences20:02
opendevreviewJay Faulkner proposed openstack/ironic master: Science: replace start_neutron_api with start_neutron_service_and_check  https://review.opendev.org/c/openstack/ironic/+/95046120:03
JayFstop_neutron doesn't do anything to the api, either20:05
JayFso we may not be restarting that process generally20:05
TheJuliawheeeeeeeeee20:05
TheJuliathe whole reason to cycle the config is because we were changing the config20:06
JayFwell, it's uwsgi20:06
JayFso I think we have to bump the whole uwsgi if we wanna restart one20:06
JayFbut I also wonder if what we do is enough, generally20:06
JayFI see no recent changes to stop_neutron20:10
JayFso I think my change may be enough 🤞20:10
TheJuliahopefully20:12
opendevreviewJulia Kreger proposed openstack/ironic master: WIP Patch configdrive metadata  https://review.opendev.org/c/openstack/ironic/+/94667720:20
TheJuliaI think that is retooled as desired20:22
JayFthat failed, it looks like neutron crashed failing to connect to ovsdb20:43
JayFI think there's a red herring somewhere? start_neutron_api doesn't exist in stable/2025.1 :| 20:53
JayF(devstack stable/2025.1)20:53
TheJuliayeah, I know20:56
opendevreviewMerged openstack/ironic unmaintained/xena: [Stable Only] pin virtualbmc/sushy-tools/ironic-tempest-plugin to last released tag  https://review.opendev.org/c/openstack/ironic/+/94571621:01
JayFneutron_plugin_configure_plugin_agent is gone21:01
JayFhmmm not if you have a plugin loaded tho21:02
opendevreviewJay Faulkner proposed openstack/ironic master: Science: replace start_neutron_api with start_neutron_service_and_check  https://review.opendev.org/c/openstack/ironic/+/95046121:03
JayFTheJulia: until recently; is_service_enabled neutron-api was basically unconditionally false21:08
JayFTheJulia: that's the change. That branch of code hasn't been in actual use in ages afaict21:09
opendevreviewJay Faulkner proposed openstack/ironic master: Science: replace start_neutron_api with start_neutron_service_and_check  https://review.opendev.org/c/openstack/ironic/+/95046121:10
* JayF just removes that whole block to see what happens21:10
JayFhttps://zuul.opendev.org/t/openstack/build/38e5e97dbf9344e6ba08fa083aff2724/log/job-output.txt#2549021:11
* JayF wonders if that whole block of code just never being called has some kind of side effect21:12
JayFheh now time to search git log -p in devstack for start_neutron_api to try and timestamp how long that's been dead code21:15
JayFuh holy crap our jobs may have been operating weirdly for a while21:16
JayFhttps://opendev.org/openstack/devstack/commit/a52041cd3f067156e478e355f5712a60e12ce64921:17
JayFstart_neutron_api in the lib/neutron module has been gone since Nov, 202221:18
JayFso that code has been not running for about 2.5 years 21:18
JayFand the recent devstack "fixes" to make neutron-api properly servicified exposed this21:18
JayFI think my fix is a good fix now, will see if it passes CI and if so will update commit message21:18
opendevreviewJay Faulkner proposed openstack/ironic master: Remove code which has been long-dead  https://review.opendev.org/c/openstack/ironic/+/95046121:23
JayFI went ahead and updated the commit message with all this context, so if/when it passes CI it can be landed21:23
JayFTheJulia: ^ I think I nailed the CI thing21:24
TheJuliawoot21:54
JayFI do wonder if those v6 tweaks need to be moved around21:55
TheJuliawouldn't surprise me21:55
TheJuliaOur devstack plugin could use a good cleaning21:55
JayF...is that possibly the problem you had getting v6 to run outside of an OVN job?21:55
JayFsince that seems to be enabling proxy for ndp which seems useful21:56
TheJuliano, the problem with v6 in non-ovn was a blend of edk2 firmware with dnsmasq21:56
JayFah that's right21:56
TheJuliaAnyway, I just spent the last hour and a half digging through customer logs21:57
JayFdid you find one at least half as straight as a 2x4 from lowes? /s 21:58
TheJulialol21:58
TheJuliaI suspect brand shiny new idrac10s with idrac-wsman on some machines21:58
TheJuliacrazy errors21:58
JayFjust point them at the working redfi.... oh21:58
TheJuliaheh21:59
TheJuliaRedfis.... oh :(21:59
JayFtell them to communicate to their vendor that it's highly recommended to not shuffle API endpoints for the hell of it21:59
TheJuliapretty much, I've explicitly requested a bunch of details21:59
JayFyou could've written an Ironic API client when some of our contributors were still in high school and still run it today21:59
TheJuliaso... time will tell21:59
JayFbut they gotta move stuff around for one hardware revision22:00
JayFit's disrespectful of the entire ecosystem and highly frustrating22:00
TheJuliayeah, $words22:00
TheJuliaAnyhow, I need to step away.22:00
JayFhave a good one o/22:00
JayFmy CI fix patch is already past the error point22:04
JayFany cores around overnight or late today who sees this; this is what need to be approved to fix the gate once it passes CI; please land it: https://review.opendev.org/c/openstack/ironic/+/95046122:04
TheJuliastevebaker[m]: ^22:07
stevebaker[m]done22:13
JayFjob is failing :( 22:15
JayFnot the same error though, so I assume it's one of our usual suspects22:15
TheJulialooks like dhcp no worky22:30
JayFactually no worky22:34
JayFor race condition no worky22:34
JayFI wonder if q-dhcp is now neutron-dhcp...22:34
* JayF waits for CI logs22:34
TheJuliaI think they posted for one of the jobs22:34
JayFonly job that's failing+voting is ovn+ipv622:35
JayFand that doesn't use neutron dhcp22:35
TheJuliayeah, logs will need to be dug through.22:36
JayFI'm looking real quick for any leads22:36
TheJuliak, I should have spoons in the morning22:37
JayFis_service_enabled q-dhcp + neutron-dhcp both pass22:37
JayFso that naming may not matter22:37
TheJuliaThat job is ovn anyhow22:37
JayFq-dhcp service seems to be OK22:39
JayFI'm looking at the multinode shard job22:39
JayFsince I know more about how that's shaped22:39
JayFOVN is mostly still a black box to me :/22:39
JayF2025-05-20T21:36:06.912Z|00003|reconnect|INFO|/var/run/openvswitch/db.sock: connection attempt failed (Address family not supported by protocol) in ovn-controller-vtep.log22:40
JayFbut I have no idea what a good one looks like22:40
JayFIt's also worth noting neutron has landed a lot of eventlet removal stuff22:41
JayFplaces where we were depending on specific ordering or timing may be less happy now22:41
TheJuliaYeah, there is a whole order of operations isuse here22:48
TheJuliawith devstack creating networks22:48
JayFthe port bindings are failing22:48
TheJuliaPerhaps we need to front load in the disabling22:48
JayF> May 20 21:40:59.886533 np0040838796 devstack@neutron-api.service[91224]: ERROR networking_generic_switch.generic_switch_mech [req-fb14966a-feae-469d-ad3d-c8b01d6ee3d6 req-282c99c0-dad7-4fea-8002-cf6a14e43b48 service ironic] Cannot bind port deaec2e4-5e49-4f7b-b4ab-70e6bcb21ef6 as device brbm is not configured. Check baremetal port link configuration.22:48
JayFI am unsure if that is meaningful or not in this context22:48
JayFthis is the ovn v6 job22:48
TheJuliathat seems like a rather unhappy error22:48
JayFhttps://e3fa69918ab3893f89a3-76ad47885070581f857a540cadaa6a6d.ssl.cf1.rackcdn.com/openstack/55cf2727b4c54f06b897353cf71ea0a3/controller/logs/screen-neutron-api.txt22:48
JayFand would follow the pattern, potentially, of a configuration not being picked up22:49
TheJuliathe knob deleted was to prevent neutron from doing initial network configurations... hmmm22:49
JayFI have a killer migraine, weather here is rain/stop/rain/stop/rain/stop which isn't great, I'm going to step away. I should have some time to dig again tomorrow... well, probably not until afternoon tbh but I'll try to make time in the morning :) 22:49
JayFoooh22:49
TheJuliayes, go, step away22:50
JayFI'm still here for another :10, going to try and figure it out22:50
JayFI don't see that switch having gone away?22:51
JayFthe primary effective changes, afaict, are s/q-/neutron-/ except in cases where you're explicitly enabling the q- version of a service22:52
JayFbut clearly some piece is missing22:53
TheJuliaNEUTRON_CREATE_INITIAL_NETWORKS=False22:53
JayFthat is still honored in devstack afaict22:53
JayFoh, you mean it's been disappeared22:54
JayFbut again; I have hard evidence that codepath is dead for 2.5+ years22:54
JayFso even if that's not awesome, it should *not* be the root cause of our issue imo22:55
TheJuliatrue22:56
TheJuliaGet some rest, I'll take a fresh look in the morning22:56
JayFI would posit if it's failing with that code removed, either 1) random failures like we tend to see or 2) something else in the devstack change chunk22:56
JayFI might recheck this once the -1 comes through just to see if it's reproducable22:57
TheJuliareasonable22:57
JayFOK, going to go find some headache medicine and !laptop :) o/22:57
TheJuliasounds like a plan22:57
* TheJulia checks the status of the nearby fire which is luckily heading away22:58

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!