Thursday, 2021-10-28

opendevreviewSteve Baker proposed openstack/ironic master: Add platform:rpm shim, grub packages to bindep  https://review.opendev.org/c/openstack/ironic/+/81538601:27
opendevreviewSteve Baker proposed openstack/ironic master: Write master grub config on startup  https://review.opendev.org/c/openstack/ironic/+/81558001:28
opendevreviewSteve Baker proposed openstack/ironic master: Capture [pxe]loader_file_paths for distros  https://review.opendev.org/c/openstack/ironic/+/81539201:28
opendevreviewSteve Baker proposed openstack/ironic master: Capture [pxe]loader_file_paths for distros  https://review.opendev.org/c/openstack/ironic/+/81539202:47
opendevreviewHanGuangyu proposed openstack/ironic master: Add description to the mod_wsgi part  https://review.opendev.org/c/openstack/ironic/+/81491605:19
arne_wiebalckGood morning, Ironic!06:35
jandershey arne_wiebalck o/06:38
dtantsurmorning folks06:55
arne_wiebalckhey janders dtantsur o/07:04
jandershey dtantsur o/07:05
rpittaugood morning ironic! o/07:19
rpittausecond breakfast, brb :)08:29
dtantsuris rpittau a hobbit? :)08:38
rpittau:D09:01
rpittaudtantsur: now I remember the main problem with tinycore aarch64, the only port available is for RPi09:22
dtantsurugh09:35
dtantsurdoes it matter for us?09:35
rpittauwell, it's like starting from zero09:45
rpittauthe port is an installer for raspberry pi, it's a bootloader that installs tinycore on RPi09:45
rpittauyou have kernel and everything but it's tied to the RPi cpus and components09:45
dtantsurokay, so in ARM world there is no such thing as "generic aarch64"?09:47
rpittaunot for tinycore AFAIK09:49
rpittauother distributions have something generic, like Fedora or Debian09:49
dtantsur....09:50
dtantsurshould we take a stab at an ipa-builder check job then?09:51
dtantsurI wonder what the RAM requirements will be to actually run such an image09:51
rpittauyou mean with DIB ? not sure about the size, probably similar to x86_64 ?09:54
dtantsuryeah09:56
rpittauI'll fire up a check job with DIB using debian, let's see how it goes09:59
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Build arm64 debian based ipa ramdisk  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81581510:03
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Build arm64 debian based ipa ramdisk  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81581510:10
opendevreviewMerged openstack/ironic-python-agent stable/xena: Fix error messages in burnin code  https://review.opendev.org/c/openstack/ironic-python-agent/+/81563010:40
*** dviroel|rover|afk is now known as dviroel|rover10:50
arne_wiebalckTheJulia: dtantsur: https://opendev.org/openstack/ironic-python-agent/blame/commit/be3882162e432b292af9e787661661a05fa7d11a/ironic_python_agent/extensions/image.py#L292 was introduced for 'iscsi' ... do we know if this is needed for 'direct'?12:00
arne_wiebalckWhat I do know is that it is not sufficient for iscsi, but before submitting a patch I would like to understand if this is an issue for direct as well.12:01
arne_wiebalck(not sufficient means: _rescan_device() is not sufficient to make sure the device nodes are there)12:02
opendevreviewMerged openstack/ironic-python-agent stable/xena: Respect global parameters when downloading a configdrive  https://review.opendev.org/c/openstack/ironic-python-agent/+/81544812:05
opendevreviewRiccardo Pittau proposed openstack/metalsmith master: Update pep8 test requirements  https://review.opendev.org/c/openstack/metalsmith/+/81454312:53
dtantsurarne_wiebalck: I'm not sure how this is related to iscsi?13:26
dtantsurI mean, it was "vital for iscsi" as TheJulia wrote in the commit message, but this code is used in all deploy methods13:27
dtantsur(and there is no more iscsi code in-tree)13:27
TheJuliaI suspect it would be advisable to keep it in general becasue the case we were hitting was we were not spotting a change or loading a change to the OS before taking the next step13:39
TheJuliaAnd... the Os doesn't always load changes if locks get involved13:40
arne_wiebalckdtantsur: this was my question since the initial commit said "needed for iscsi"13:51
arne_wiebalckdtantsur: the comment was removed, but the code stayed13:51
dtantsurneeded for iscsi, but added to the generic code13:51
dtantsurdo you think it should or should not be there?13:51
arne_wiebalckdtantsur: this is what I am trying to figure out :)13:52
arne_wiebalckon iscsi, it is not enough13:52
arne_wiebalckI am debugging this a the moment and have added "blockdev --rereadpt" which seems to work (tbc)13:52
arne_wiebalckbut when I read the comment, I was wondering if the scan is needed for direct at all13:53
dtantsurhard to tell 100%13:53
arne_wiebalckhttps://usercontent.irccloud-cdn.com/file/EMwhwvgp/couldnotinstallbootloader.png13:53
TheJuliait may be okay to remove... *but*  it is like the last line of defense for hte logic to make sure the information is present13:54
arne_wiebalckthis is the error rate I saw and how is stopped when I added the blockdev command13:54
dtantsurI'd not remove it unless we're 100% sure it's not needed13:54
arne_wiebalckthis is all on iscsi still13:54
arne_wiebalckdtantsur: ++13:54
arne_wiebalckmy patch will be to extend it actually13:54
arne_wiebalck_rescan_device()13:54
TheJuliaahh, okay13:54
TheJuliasounds good to me13:54
arne_wiebalckI will do some more tests, like trying to figure out if only iscsi, if blockdev is helping, if reproducable on direct, ...13:55
arne_wiebalckit is not fatal, but creates errors in our logs13:56
arne_wiebalck"I will be back" :)13:56
arne_wiebalck(this may take some days since it is a race)13:57
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Build arm64 debian based ipa ramdisk  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81581513:59
janderssee you on Monday Ironic o/14:14
TheJuliao/14:14
janders(we have a surprise state holiday on Friday - moved from August due to COVID)14:15
TheJuliaenjoyu14:15
TheJulias/yu/y/14:15
jandersthank you TheJulia14:16
TheJuliabfournie: you around?14:16
bfournieHi TheJulia14:25
TheJuliaYou mentioned someone reproduced the UEFI firmware memory loading issue observed on lenovo gear, was that with like a usb drive?14:27
bfournieTheJulia: trying to recall this... was this reproduction in-house do you know?14:29
TheJuliaI think so14:30
TheJuliaI'm going back and forth with jarrod johnson @ lenovo on a few different topics at the moment14:31
*** ricolin_ is now known as ricolin14:40
bfournieTheJulia: so I haven't found any more notes on Lenovo besides whats in 1990590 regarding the size of the ramdisk that iurygregory had added (comment 41) 14:48
TheJuliaheh, its the same hardware14:52
TheJuliamodel wise14:52
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Build arm64 debian based ipa ramdisk  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81581514:52
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Build arm64 debian based ipa ramdisk  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81581514:53
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Build arm64 debian based ipa ramdisk  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81581515:37
tzumainnhi! I've been testing metalsmith a bit recently, and I was wondering - if you deploy and specify a network, would it make sense for the created port to be created with the device_owner set to baremetal:none?15:47
tzumainnwithout that, it seems as if the networking-ansible driver won't work15:48
opendevreviewDmitry Tantsur proposed openstack/ironic-python-agent master: Always include the oslo_log log file in ramdisk logs  https://review.opendev.org/c/openstack/ironic-python-agent/+/81586115:58
arne_wiebalckI see the same mount /boot/efi error with direct, so _rescan_disk() is needed but not enough :)16:02
TheJuliaarne_wiebalck: good to know16:03
arne_wiebalckit is not failing all the time, it is a race again16:03
arne_wiebalckwith the special files showing up too late16:03
TheJuliatzumainn: not sure, I mean, metalsmith development was a few years back at this point and I'm not sure exactly what your trying to achieve with doing such.16:03
TheJuliaarne_wiebalck: oh jeeze :(16:04
arne_wiebalckI will try to confirm if 'blockdev --rereadpt' is enough to fix it16:04
arne_wiebalckI am worried it was just fixed by the additional time of the call ... 16:05
tzumainnTheJulia, well, the issue is that without that setting the networking-ansible driver doesn't work16:05
tzumainnyou end up with this message: "networking_ansible.ml2.mech_driver [req-39e37b77-eedf-4e04-be67-0f5521c42905 5a84b63362ce4bc8b7127fc05c762ec3 75073346\16:05
tzumainn211449fbba47ab6b0ddc56db - default default] Port 2153caef-19df-4160-951c-1328abae415d has device_owner:  and vnic_type: baremetal which is not supported \16:05
tzumainnby networking-ansible, ignoring."16:05
TheJuliawtf16:06
TheJuliaI'd fix networking-ansible, tbh16:06
TheJuliait *shouldn't* care about device owner16:06
tzumainnTheJulia, ah, okay - I noticed that ironic now sets the device owner to baremetal:none, so I was wondering if that was just the new standard16:06
TheJuliaarne_wiebalck: legitimate concern :(16:06
TheJuliatzumainn: no idea, I don't remember that change16:07
arne_wiebalckTheJulia: I will find out :)16:07
rpittaudtantsur: the debian arm64 ipa ramdisk is building -> https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81581516:07
rpittauneed to do one change to ipa-builder first16:07
dtantsur\o/16:08
dtantsurone comment inline16:09
rpittaugood, we won't need the new pkgs, just new pip16:09
rpittaudtantsur: ^16:09
dtantsurah, ok16:09
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Build arm64 debian based ipa ramdisk  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81581516:10
rpittaugoing to split now, see you tomorrow! o/16:10
dtantsurc u rpittau 16:13
opendevreviewRuby Loo proposed openstack/ironic stable/xena: Fix various issues in the anaconda deploy interface  https://review.opendev.org/c/openstack/ironic/+/81580816:16
opendevreviewRuby Loo proposed openstack/ironic stable/wallaby: Fix various issues in the anaconda deploy interface  https://review.opendev.org/c/openstack/ironic/+/81587016:16
dtantsurrloo: probably bugfix/18.1 as well if you don't mind16:17
rloodtantsur: sure, thx for mentioning that cuz it wasn't in my radar!16:17
opendevreviewRuby Loo proposed openstack/ironic bugfix/18.1: Fix various issues in the anaconda deploy interface  https://review.opendev.org/c/openstack/ironic/+/81587116:18
arne_wiebalckbye everyone o/16:19
TheJuliaiurygregory: you around?16:26
opendevreviewRuby Loo proposed openstack/ironic bugfix/18.1: Minor updates to anaconda doc  https://review.opendev.org/c/openstack/ironic/+/81587216:29
opendevreviewRuby Loo proposed openstack/ironic stable/wallaby: Minor updates to anaconda doc  https://review.opendev.org/c/openstack/ironic/+/81587316:29
iurygregoryTheJulia, yup (not working day) but I'm here =)16:31
rloothe backports for the anaconda fixes have conflicts cuz it updates docs that don't have those ^^ changes. Figured better to backport the doc changes too so they are the same.16:31
TheJuliaup for a really quick question?16:31
iurygregoryyup =)16:31
iurygregorygo ahead16:31
TheJuliaiurygregory: do you remember what drove us to do https://github.com/openstack/ironic-python-agent/commit/b6210be196fea271b2c49f89d3e1638517c1198c#diff-2ac3e3f56d389db15b980c3471cb49965be083d4221667e009a4916595ef8a40R24316:32
TheJuliawe *think* this is causing major issues on lenovo sr650 machines16:32
TheJuliait is basically the only uefi patch we never backported to 16.116:33
iurygregoryoh wow!16:33
iurygregorylet me check things 16:33
TheJuliaand 16.1 worked just fine16:33
dtantsuryou mean, grub-install plainly does not work on UEFI any more?16:34
TheJuliahmmm16:34
TheJuliadtantsur: well, that has been a thing for a little while now16:35
iurygregoryTheJulia, I think the idea was to remove any booty entry that was duplicated16:35
dtantsurah, you mean that line, not the whole patch #facepalm16:35
dtantsurwell, I can assure you that duplicated labels can cause a lot of issues16:35
iurygregorywe could have duplicated labels and this would remove16:35
opendevreviewMerged openstack/ironic master: Fix various issues in the anaconda deploy interface  https://review.opendev.org/c/openstack/ironic/+/81408716:35
opendevreviewDmitry Tantsur proposed openstack/ironic-python-agent master: Always include the oslo_log log file in ramdisk logs  https://review.opendev.org/c/openstack/ironic-python-agent/+/81586116:36
TheJuliaYeah, it is a conundrum16:36
TheJuliaerr, actually it did make it into 16.116:38
TheJuliafreaky16:38
TheJuliabut the code path utilization changed to leverage it and dedupe, since it was ironic1 previously16:38
iurygregoryouch D:16:39
TheJuliaI'm thinking of moving the delete before the add16:41
TheJuliasince which would re-establish the boot order eplicitly16:41
iurygregoryso we would change the regex we need to use I think16:50
iurygregorywe would remove any entry with *ironic*?16:50
TheJuliawell, we use ironic if we dont' have a properly formatted default16:51
dtantsurand any fixed path on a removable media (thank you shim)16:51
TheJuliaseparate patch ;)16:52
TheJuliaactually16:52
iurygregory"efibootmgr: ** Warning ** : Boot0004 has same label ironic" this was the warning we could get after adding the new entry16:52
TheJuliaany ideas on identifying removable media from the list?16:53
dtantsurbtw we're discussion it downstream with derekh and bfournie, and we think to engage the shim team16:54
dtantsurironic can work around it, but the same issue affects pretty much any CoreOS ISO (and maybe more)16:54
dtantsurif we want to fix it in ironic, we need to do it on *any* action: inspection, cleaning, deploy16:54
dtantsurotherwise the machine won't reboot after the 1st action16:54
iurygregorywow16:55
dtantsuryeah, so probably evaluate_hardware_support or something like that16:55
dtantsurin other news, bloody CoreOS IPA takes more than 10 minutes to even start IPA in a nested VM16:58
derekhI still don't see how ironic can deal with this, any BM server with a boot entry pointing to the CDROM will cause the instruction to boot from CDROM to fail, so we can't run IPA to clean it (unless using PXE)16:58
dtantsurderekh: if the record is there, we cannot do anything. if it's just been created on this very start-up, we can purge it.16:58
derekhAlso its described here https://storyboard.openstack.org/#!/story/2008763 16:58
derekhBut its coreos (in the case I'm looking at) what started up and created the entry, what purges it ? 17:00
dtantsurderekh: IPA when it starts (it won't help AI obviously)17:00
dtantsurI do agree that the real fix belongs somewhere between shim and coreos17:01
derekhok17:02
*** derekh is now known as derekh_afk17:04
* TheJulia gets totally derailed17:14
* dtantsur wonders if TheJulia is a Train17:15
TheJuliadtantsur: I've already kind of discussed it with valpertha, it is wired in assertion, but maybe it won't if there is no csv file?17:15
dtantsuryeah, this is why I'm saying that the fix may be in coreos17:15
TheJuliadtantsur: no, trying to get more solar panels is a major headache17:15
dtantsurone would expect the opposite, given the climate situation..17:16
TheJulialike... people don't listen and I know exactly what is required17:16
TheJuliawe're at lowering our expectations point and we'll do the rest of the electrical work ourselves point17:16
dtantsur:(17:16
dtantsurif somebody has time today, my CoreOS IPA work could really use https://review.opendev.org/c/openstack/ironic-python-agent/+/815861/ and https://review.opendev.org/c/openstack/ironic-python-agent/+/81565117:17
dtantsurgood evening folks o/17:18
dtantsurfor those in the USA: I'm out tomorrow afternoon and on Monday, so see you on Tuesday17:19
TheJulia:)17:19
NobodyCamTop of the morning Ironic folks.17:37
NobodyCamI heard a while back (week or so back) about a SkyLake PXE issue? is there a bug I can follow along at home???17:45
TheJuliabfournie: ^^ I'm drawing a blank on what the issue was actually leaning towards, do you remember?17:50
*** sshnaidm is now known as sshnaidm|afk17:50
-opendevstatus- NOTICE: mirror.bhs1.ovh.opendev.org filled its disk around 17:25 UTC. We have corrected this issue around 18:25 UTC and jobs that failed due to this mirror can be rechecked.18:44
opendevreviewJulia Kreger proposed openstack/ironic-python-agent master: Delete EFI boot entry duplicate labels first  https://review.opendev.org/c/openstack/ironic-python-agent/+/81589918:56
lmcgannhey, im trying to control a node via the serial console but Im having some issues. if I telnet to the console url given by 'console show' and provide the node, I can see the output of the node booting. However, I dont seem to be able to control the node. And prompts about login dont appear as they usually do in the web console. Am I misunderstanding something?19:33
TheJulialmcgann: so... console has to be directed to ttyS0 so could they not be on the image?20:33
lmcgannTheJulia: youre saying that I am able to read the output the bios makes but once the image boots, something in the image makes the OS not hook up to the right tty?20:42
TheJuliafor serial, yes20:42
cvstealthWhen an instance is deployed on a node that is using a port group does the interface mapping always have to be calculated as user-data on the instance? Poking around the configdrive and in the metadata service with an instance in this configuraton I didn't see where any port/ip information was being made available. 21:12
TheJuliacvstealth: I believe so, becaues it would come from the neutron ports. Nova has code internally to try and reconcile this as ports can still be in progress21:25
TheJuliait is supposed to be done at some point, but I'm not sure for bonded ports and how they get represented in neutron21:26
cvstealthTheJulia: In that case do you have to know what node your instance is going to land on before launching it?21:29
TheJuliacvstealth: not really, that is supposed to be resolved in placement21:31
TheJuliapart of the reason netorking-baremetal feeds information into neutron if I understand the purpose correctly21:32
*** dviroel|rover is now known as dviroel|rover|afk21:33
cvstealthIt looked like the configdrive contents were being generated prior to the deploy ramdisk being run then the networking-baremetal service was only inserting the neutron ports for the port group after the deploy phase.  It read as a chicken and egg condition where I can't generate the user-data with the bonding configuration without know the mac address of the node that the instance was going to 21:37
cvstealthland on.21:37
TheJuliafor some reason that does sound right and I guess that is likely a legitimate egg because there is no way to really determine what NIC will get attached upon deployment and we still update the information neutron has22:07
TheJuliaYou may just be in the uncanny valley of port binding with bonds. I'm wishing sam betts was still around these parts, he would know off the top of his head22:08

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!