Friday, 2023-03-24

opendevreview Iury Gregory Melo Ferreira proposed openstack/ironic-specs master: [WIP] Firmware Updates Interface  https://review.opendev.org/c/openstack/ironic-specs/+/878505 04:16
dtantsur two older backports if anyone has time: https://review.opendev.org/c/openstack/ironic/+/876178 and https://review.opendev.org/c/openstack/ironic/+/876174 10:08
kaloyank Hello Ironic o/ 11:29
iurygregory morning ironic 11:31
iurygregory dtantsur, looking now 11:31
kaloyank TheJulia, dtantsur, I want to implement the snapshot spec (https://specs.openstack.org/openstack/ironic-specs/specs/not-implemented/snapshot-support.html). I have the following questions: 11:51
kaloyank 1. You mentioned that the spec might be changed as it's a little out of date. I'd like to make some suggestions, how can I do that? 11:52
kaloyank 2. I'm currently running Yoga, will update to Antelope once it hits RDO but still it remains unknown to me how I will test my code. Should I set up a testing env? 11:53
dtantsur kaloyank: #2 you can use https://docs.openstack.org/ironic/latest/contributor/dev-quickstart.html#deploying-ironic-with-devstack or https://docs.openstack.org/bifrost/latest/install/index.html 11:54
kaloyank dtantsur: Thanks, I'll check it out 11:55
kaloyank also, as there are changes to be made to IPA, I presume that I'll have to rebuild the IPA, which is fine. How can I test that the IPA is doing the correct thing? 11:57
kaloyank Shall I use a VM that network boots or is there some other way? 11:57
dtantsur kaloyank: both devstack and bifrost allow creating a testing environment with VMs serving as fake bare metal machines. 11:59
kaloyank awesome, thanks 12:01
opendevreview Maksim Malchuk proposed openstack/bifrost master: Create the log file for the disk-image-create command  https://review.opendev.org/c/openstack/bifrost/+/822895 12:25
opendevreview Maksim Malchuk proposed openstack/bifrost master: Create the log file for the disk-image-create command  https://review.opendev.org/c/openstack/bifrost/+/822895 12:33
opendevreview Merged openstack/ironic stable/yoga: Do not move nodes to CLEAN FAILED with empty last_error  https://review.opendev.org/c/openstack/ironic/+/876178 14:08
opendevreview Merged openstack/ironic bugfix/21.2: Do not move nodes to CLEAN FAILED with empty last_error  https://review.opendev.org/c/openstack/ironic/+/876174 14:08
opendevreview Merged openstack/ironic master: Use main branch of metal3-dev-env to run metal3 integration job  https://review.opendev.org/c/openstack/ironic/+/877600 14:08
iurygregory dtantsur, quick question regarding rpc api impact, you don't think we would need it because we will reuse the current update_firmware clean step we have? 15:26
iurygregory I'm also a bit puzzled since you mentioned we need to settle on a new standard name for the clean step... 15:27
dtantsur iurygregory: clean steps don't require new RPCs, they all go through one generic call 15:33
dtantsur do_node_clean or how is it called? 15:33
iurygregory yeah 15:35
iurygregory only raid had specific ones 15:35
dtantsur yeah, because it has conductor-side validations IIRC 15:36
iurygregory gotcha 15:36
iurygregory do we plan to reuse the clean step update_firmware or would it be a new one? 15:37
iurygregory and I think one possible approach to get the current versions of FW available would be maybe via verify_steps... 15:37
dtantsur iurygregory: update_firmware is not even on the correct interface :) 15:42
iurygregory yeah, it lives in management I think 15:43
iurygregory so it would be a new clean step (I'm worried about the name, since we have update_firmware in management), will update things in the spec based on your answers o/ 15:44
iurygregory tks! 15:45
prometheanfire dtantsur: got stuff working, fat fingered a neutron config, does the 'no root device specified' when booting a node mean that I need to pass in root device hints? 18:12
prometheanfire nevermind, it looks like ipa is trying to use the internal url to talk back to ironic instead of the public endpoint 19:28
JayF That is intentional, iirc 19:31
JayF you can set an external url per node or in config 19:31
JayF if you have network segmentation such that the URL an IPA needs to hit is different than what's internal 19:31
prometheanfire happen to know which flag to set? trying to google isn't helping 19:31
prometheanfire ya, internal url is not https, so sits on its own non-routed vlan 19:32
prometheanfire (OSA deploy) 19:32
JayF https://docs.openstack.org/ironic/latest/admin/dhcp-less.html#deploying-outside-of-the-provisioning-network 19:32
JayF that is covering a specific case, but I think the abilities there might be helpful to you 19:33
prometheanfire ya, looks useful 19:33
JayF hope it's useful :D 19:33
prometheanfire ya, I think it's specifically the callback url 19:35
jrosser prometheanfire: you know i updated the OSA documentation for ironic very recently with a worked example? 19:39
prometheanfire I have been using the osa ironic docs :D 19:41
jrosser prometheanfire: including all the settings you need to make the callback work with an isolated network for ironic provisioning 19:41
prometheanfire ok, didn't see that before 19:41
prometheanfire just looked at latest os_ironic docs and still didn't see it 19:42
jrosser this https://docs.openstack.org/openstack-ansible-os_ironic/latest/configure-lxc-example.html 19:43
prometheanfire didn't mention the external callback url? 19:43
prometheanfire set the external callback url and it's still trying with the internal ip :| 19:44
jrosser look in the example config for `endpoint_override` 19:45
prometheanfire ok, see that now 19:45
jrosser in my example the haproxy endpoints are not reachable *at all* from the provisioning network so the override reflects that 19:46
jrosser you can set it to whatever you need 19:46
prometheanfire yep, that downloaded an image, thanks 20:01
JayF Thanks jrosser \o/ congrats prometheanfire 20:09
prometheanfire thanks 20:54
prometheanfire now I just have to wipe the node since I think the existing software raid is making the deploy to disk sad, that or add some hints 20:55
opendevreview Verification of a change to openstack/ironic master failed: Enables boot modes switching with Anaconda deploy for ilo driver  https://review.opendev.org/c/openstack/ironic/+/860821 21:04
prometheanfire ya, not sure why, but cleaning, quick, full or express are all not actually cleaning it seems, md127 still found and still messing things up 22:36
prometheanfire I tried adding {"interface": "raid","step": "delete_configuration"} in front of the erase_devices step but that wasn't allowed during a clean (agent aborted) 22:37
jrosser prometheanfire: is this a previous raid config you need to get rid of? 22:41
prometheanfire ya, the host had an OS installed 22:43
jrosser ok this was a massive PITA 22:43
prometheanfire I've tried manually cleaning 22:43
prometheanfire heh 22:43
jrosser there might be a better way(!), but.... 22:43
prometheanfire easier to erase the hard disks out of band? :P 22:44
jrosser disable the raid config `openstack baremetal node set --reset-raid-interface <NODE_NAME>` 22:44
jrosser specify a single disk `openstack baremetal node set <NODE_NAME> --property root_device='{"name": "/dev/sda"}'` 22:44
jrosser deploy an instance to the node, this will remove the raid signature but fail because of the cached partition table still showing the raid signature 22:45
jrosser deploy an instance again, this will now succeed with the single disk 22:45
prometheanfire cool, first phase deploying now 22:46
jrosser login to the node and `wipefs -a -f <partition>` for all the old partitions 22:46
JayF prometheanfire: your feedback is noted; I was worried this would be a side-effect of some of our raid support changes recently to less aggressively delete software raids 22:46
jrosser undo the root device `openstack baremetal node unset <NODE_NAME> --property root_device` 22:46
JayF prometheanfire: I'll make a note to look into that on Monday, unless jrosser specifically is enabling the code to not-erase-MD-raid in nodes 22:46
jrosser put back the raid config you actually want `openstack baremetal node set --raid-interface <whatever>` 22:47
prometheanfire should I remove the root_device property after the first phase?  iirc the hint shouldn't be needed and is generally discouraged 22:47
JayF 3/win 12 22:47
JayF whoops 22:47
prometheanfire eh, I can figure out how to add raid later, for now I just want to be able to ssh in :D 22:47
jrosser JayF: we also had a terrible time with software raid on nodes that had been previously used for something else that involved an md setup 22:47
JayF Yeah that is not supposed to happen 22:47
JayF if you don't have raid configured, and you don't have the option enabled to skip erasing the raid, it should be erased aiui 22:48
jrosser those steps i gave were what we came up with to rescue the situation to something usable 22:48
JayF and if we somehow changed that behavior I would state it was likely ... misguided 22:48
prometheanfire a clean option for all ssd type devices would wipe raid signatures I'd think 22:48
jrosser there might totally be something we missed to do that more easily of course 22:48
JayF I've made a note to review this on Monday. My EOD is in about 12 minutes and you don't want a week-old brain looking at this 22:48
prometheanfire or nvme, or whatever 22:48
prometheanfire yarp 22:48
prometheanfire thanks for all the help 22:48
JayF prometheanfire: can you do me a favor, and just give me the output of a node show --detail on the impacted node 22:48
JayF prometheanfire: assuming this is Zed Ironic/IPA? Or Antelope? 22:49
prometheanfire sure, ya, zed 22:49
JayF ack 22:49
prometheanfire detail not recognized as an argument? 22:49
jrosser relatedly we abandoned trying to use software raid because of https://storyboard.openstack.org/#!/story/2010479 22:50
prometheanfire openstack baremetal node show --detail, not using the ironicclient (if that's still even a thing) 22:50
JayF prometheanfire: just give me the node show, I don't know what /node/{}/detail translates to in the CLI right now 22:51
JayF jrosser: ack 22:52
JayF will look at that too 22:52
prometheanfire yarp 22:52
prometheanfire have to clean it up a little, also note that I did the reset-raid-interface and set root_device 22:53
prometheanfire JayF: https://paste.openstack.org/show/819334 22:56
prometheanfire also, ltns, how's it going? 22:56
prometheanfire hmm it's still seeing md127 on the second boot 22:58
jrosser prometheanfire: it’s too late here for me to go look but we had some cleaning step config too… 23:01
prometheanfire I'll poke at it over the weekend 23:02
prometheanfire jrosser: thanks 23:02
* prometheanfire is seeing what happens when I set root_device to /dev/md127 out of pure curiosity 23:02
jrosser I’ll see if I can dig something out tomorrow 23:02
opendevreview Iury Gregory Melo Ferreira proposed openstack/ironic-specs master: Firmware Interface  https://review.opendev.org/c/openstack/ironic-specs/+/878505 23:35
