Tuesday, 2021-12-21

*** pmannidi|AFK is now known as pmannidi00:16
opendevreviewSteve Baker proposed openstack/ironic-inspector master: Remove rootwrap rule for dnsmasq systemctl  https://review.opendev.org/c/openstack/ironic-inspector/+/82237302:16
*** pmannidi is now known as pmannidi|AFK06:42
arne_wiebalckGood morning, Ironic!07:02
jandershey arne_wiebalck o/07:07
holtgreweGood morning ;-) OK, so my hardware found a new way to annoy me. In UEFI, my Dell server is apparently storing information related to my previous CentOS installation. Now that I want to install RockyLinux, it does not have sufficient "boot option" slots available.07:10
holtgreweIs there a "clean UEFI boot options" step in ironic?07:10
arne_wiebalckhey janders holtgrewe o/07:16
holtgrewe\o arne_wiebalck janders 07:16
holtgrewehttps://bugzilla.redhat.com/show_bug.cgi?id=1680659 <-- omg, am I now close to Dell iDRAC bugs? *sigh*07:16
arne_wiebalckholtgrewe: I don't think so ... but I think there should be quite some before it runs out of space, no?07:16
holtgreweI have 16 slots filled ;-)07:17
arne_wiebalckthat does not sound like a lot07:17
arne_wiebalckI am pretty sure we have more than that07:18
holtgrewesounds like a power of 207:18
arne_wiebalckholtgrewe: how many slots do you use?07:18
holtgrewemaybe I should not use the ipmi driver but rather redfish or this dell thing07:18
arne_wiebalckI never tried to max it out ... let me check a standard server ...07:18
holtgrewe15x "Unavailable: CentOS Linux", 1x "PXE Device 1: NIC in Slot 4 Port 1 Partition 1", 1x "Unavailable: CentOS"07:19
holtgreweBut maybe it is my disk Frankenstein Rocky disk image ;-)07:19
arne_wiebalckhmm, first one uses only 807:21
arne_wiebalckneed to find one with more interfaces ...07:22
holtgrewearne_wiebalck: maybe let me check my uefi image first07:22
arne_wiebalckfound one with 13 now07:23
arne_wiebalckholtgrewe: ok07:23
arne_wiebalckholtgrewe: just checked the code, Ironic will remove duplicate entries07:30
holtgrewearne_wiebalck: do you also use Dell servers? if so, which driver? ipmi, redfish, or idrac?07:30
arne_wiebalckholtgrewe: no Dell servers07:30
holtgrewearne_wiebalck: so maybe my ironic configuration is problematic?07:30
arne_wiebalckholtgrewe: well, one other complication may be that s/w RAID does not use efibootmgr07:30
arne_wiebalckholtgrewe: this is still on top of a s/w RAID, right?07:31
holtgrewearne_wiebalck: yes, sounds like I'm (again) hitting corner cases07:31
arne_wiebalckholtgrewe: `efibootmgr -v` lists the same entry over and over again?07:31
holtgrewewell, that's how you see that you're either on the bleeding edge ... or doing something really really stupid07:31
holtgrewearne_wiebalck: I could not boot into the machine07:32
holtgrewestill trying to clear out the UEFI entries using Dell iDRAC/BIOS07:32
arne_wiebalckholtgrewe: I guess you can get the current list with iDRAC as well maybe?07:33
arne_wiebalckholtgrewe: with Redfish you can, I think07:33
holtgrewearne_wiebalck: yes, the 07:33
holtgrewehttps://snipboard.io/GXkdon.jpg07:34
arne_wiebalckholtgrewe: ugh :)07:34
holtgreweXML dump is <Attribute Name="UefiBootSeq">Unknown.Unknown.1-1, Unknown.Unknown.2-1, Unknown.Unknown.3-1, Unknown.Unknown.4-1, Unknown.Unknown.5-1, Unknown.Unknown.6-1, Unknown.Unknown.7-1, Unknown.Unknown.8-1, Unknown.Unknown.9-1, Unknown.Unknown.10-1, Unknown.Unknown.11-1, Unknown.Unknown.12-1, Unknown.Unknown.13-1, Unknown.Unknown.14-1, Unknown.Unknown.15-1, NIC.PxeDevice.1-1, 07:35
holtgreweUnknown.Unknown.17-1</Attribute>07:35
arne_wiebalckholtgrewe: I have never seen this before07:35
holtgrewearne_wiebalck: Yes, this summarises my experience with pretty much any BMC07:35
holtgreweI'm probably just holding all of them wrong07:36
arne_wiebalckarne_wiebalck: heh, don't get me started :-D07:36
holtgreweILO turned out to be pretty but also problematic07:36
arne_wiebalckarne_wiebalck: we have one type of BMC which needs to be reset otherwise the server will not reboot 07:36
holtgrewemaybe the one shipping with SuperMicro is better on the command line but the HTML interface is soooo ugly07:36
holtgreweand not really good to use07:37
holtgreweso in my hands, Dell is still the best of the mediocre07:37
arne_wiebalckthe interfaces' quality differs a lot07:37
arne_wiebalckand most of them are made to be used with a handful of servers, but not apt if you manage 1000s07:38
holtgreweAt least with Dell I know a couple of tricks such as dumping config as XML, updating, loading again.07:38
holtgrewearne_wiebalck: haha, Dell has something now where you can manage a dozen from one web interface07:38
holtgreweI guess that's useful ... for windows admins07:38
holtgreweAs for M1000e enclosures, you could bundle up to 8 into one admin interface. o_O07:39
arne_wiebalckyeah ... I would appreciate much more if they provided a nice API: we have one delivery which gives you a one-time iKVM link to the console07:39
arne_wiebalckthis is useful for integration with other tools07:40
holtgreweI think redfish is supposed to be that API.07:40
arne_wiebalckI have handled two deliveries with redfish and had various issues07:41
holtgreweAnd IPMI is a decent protocol, it's just suffering from (at least) -- (a) embrace and extend and (b) interesting BIOS behaviour.07:41
holtgrewe:-D07:41
arne_wiebalckit is supposed to be, yes, but it is not there07:41
arne_wiebalckthe standard, maybe, but the implementations I am not sure07:41
holtgreweI'd rather have them ship with an embedded Raspberry Pi and provide documentation to the sensors and actors.07:42
arne_wiebalckmind you, we have just moved the first redfish managed servers to prof07:42
arne_wiebalck*prod07:42
arne_wiebalckholtgrewe: right, but only if the APIs are the same on all hardware07:43
holtgrewearne_wiebalck: Is it possible to modify the uefi settings from within the booted OS?07:46
arne_wiebalckholtgrewe: you mean beyond setting the boot order?07:46
holtgrewearne_wiebalck: yes07:47
holtgreweYou mentioned removing duplicates.07:47
holtgrewesorry, let met google that myself...07:47
arne_wiebalckholtgrewe: Ironic does this manually, let me get you a link ...07:47
arne_wiebalckholtgrewe: https://opendev.org/openstack/ironic-python-agent/src/branch/master/ironic_python_agent/efi_utils.py#L22907:48
holtgrewearne_wiebalck: and that is not executed when using software raid?07:49
arne_wiebalckholtgrewe: I don't think so.07:50
arne_wiebalckholtgrewe: s/w RAID bypasses most of the UEFI management07:50
holtgrewe:-D07:50
arne_wiebalckholtgrewe: for historic reasons :-D07:51
* arne_wiebalck always wanted to say this.07:51
arne_wiebalckholtgrewe: it was mostly since s/w RAID was done/tested right before a release, and I was not confident enough to redo all with efibootmgr and test before the release.07:52
holtgreweheh07:52
arne_wiebalckholtgrewe: so, we agreed to leave it and clean up later07:52
arne_wiebalckholtgrewe: so, here we are ... 3 or 4 release later07:52
holtgreweSounds like a pragmatic decision. I would have done the same.07:52
arne_wiebalckholtgrewe: we should clean this up eventually, though, there is no reason s/w RAID does sth different07:53
holtgreweIs there a way to prevent the reboot while IPA is running? I still have a devuser setup so I can use efimgr manually.07:53
arne_wiebalckholtgrewe: two ways:07:54
arne_wiebalckholtgrewe: move the node to maintenance when it is in clean_wait, but before the IPA starts07:54
arne_wiebalckholtgrewe: enable fast_track on the conductor07:54
holtgrewearne_wiebalck: thanks, #1 is fine for me07:55
holtgrewearne_wiebalck: https://paste.openstack.org/show/811789/07:58
holtgrewewheee07:58
arne_wiebalckholtgrewe: heh, that looks indeed like a missing cleanup07:59
holtgrewearne_wiebalck: maybe the rocky installation did not write to UEFI...07:59
*** amoralej|off is now known as amoralej08:02
rpittaugood morning ironic! o/08:35
rpittauzigo: backports should work, at least for us, we follow the dib default apt sources config and it has bullseye-backports08:39
zigopriteau: dtantsur: I have uploaded lshw_02.19.git.2021.06.19.996aaad9c7-2~bpo11+1_amd64.changes, now it needs to clear the Debian backports NEW queue (ie: backports FTP masters need to approve the package). This may take some time, but hopefully not  too much, that queue being almost empty it should go fast.09:30
dtantsurthank you zigo!09:30
dmelladodtantsur: o/11:08
dmelladoI've got one more question, so11:08
dmelladoI can't seem to enroll the node, but I can get to redfish in its ip11:08
dmelladohow can I debug this?11:08
dmelladohttps://paste.openstack.org/show/811796/11:10
dmelladoI can seem to get to the redfish api directly11:11
dmelladohttps://paste.openstack.org/show/811797/11:11
dmelladocurl to redfish v111:13
dmelladohttps://paste.openstack.org/show/811798/11:13
dtantsurdmellado: "Resource temporarily unavailable" sounds like one of 2 things: 1) the conductor crashed, 2) the idrac hardware type is not enabled11:14
dtantsurdmellado: also, idrac != redfish, you're configuring WSMAN credentials11:15
dtantsurunless you need any advanced feature, maybe start with just IPMI?11:15
dmelladoYeah, I can take a look11:16
dmelladobut it does seem that conductor indeed failed11:16
dmelladoit seems that Dec 21 05:14:31 jumphost2.dfwt5g.lab ironic-conductor[991788]: 2021-12-21 05:14:31.382 991788 ERROR oslo_service.service ironic.common.exception.DriverLoadError: Driver, hardware type or interface ipxe could not be loaded. Reason: [Errno 13] Permission denied: '/httpboot/boot.ipxe'.11:16
dmelladoit's not there, so I'll reinstall11:16
dmelladoI'm learning lots, though xD11:17
dmelladohmmm seems that I'm missing /httpboot/boot.ipxe11:56
dmelladodtantsur: when would that be created?11:56
dtantsurdmellado: it's created by bifrost during installation11:57
dtantsur(unless you override http_boot_folder)11:57
dmelladoI have made a symlink on the http_boot_folder11:58
dmelladoso the folder itself doesn't get created but uses the symlinked one11:58
dmelladoI guess I'm getting too hacky xD11:59
dtantsurdmellado: yeah, then it could be permissions or selinux11:59
dtantsurif you need it in a different location, setting http_boot_folder is probably a better idea11:59
dmelladoI'll do that, thanks for all the tips!11:59
* dtantsur bbl11:59
*** amoralej is now known as amoralej|lunch13:40
*** amoralej|lunch is now known as amoralej14:19
dmelladodtantsur: last trouble I give you14:57
dmelladoI think I'm almost done, and in a buggy environment14:57
dmelladoxD14:57
dmelladoso, now I could enroll the node and so14:57
dmelladoand I'm just getting a 403 from nginx14:57
dmelladoas I had to move and play with the  folders and paths14:57
dmelladowhere do you set that up in ironic?14:57
holtgrewearne_wiebalck: for some reason the rocky linux installation via IAP does not create the UEFI entry (when using with software RAID)14:57
dmellados/ironic/bifrost14:57
holtgrewearne_wiebalck: could it be that this is a side effect of the "skip everything in case of software RAID" feature that you mentioned earlier?14:58
arne_wiebalckholtgrewe: not sure, there should still be an entry15:02
arne_wiebalckholtgrewe: there is however an issue with cs8 and grub2-install in that it fails due to lack of support for secure boot15:02
arne_wiebalckholtgrewe: another reason we need to move to efibootmgr ... next year :)15:03
arne_wiebalckhave a great break everyone, see you next year o/15:03
holtgrewearne_wiebalck: o/ have a nice break15:04
holtgrewethanks for everything!15:04
*** akahat|ruck is now known as akahat|dinner15:18
*** akahat|dinner is now known as akahat|ruck15:45
dtantsurdmellado: you really should check both nginx logs and selinux audit messages15:58
rpittaubye everyone, see you on thursday! or next year :)16:29
dtantsurrpittau: enjoy the break!16:30
rpittauthank you dtantsur, you too :)16:30
opendevreviewDmitry Tantsur proposed openstack/ironic-python-agent master: WIP Refactor: create image_download module  https://review.opendev.org/c/openstack/ironic-python-agent/+/82253616:58
NobodyCamGood Morning Ironic'ers17:05
dtantsurmorning NobodyCam 17:22
NobodyCamO/ Morning dtantsur 17:23
NobodyCamstaying warm out there17:23
dtantsurwhen heating works - yes :)17:24
*** sshnaidm is now known as sshnaidm|afk17:34
opendevreviewDmitry Tantsur proposed openstack/ironic master: [WIP] ImageCache: respect Cache-Control: no-store  https://review.opendev.org/c/openstack/ironic/+/82232918:01
*** amoralej is now known as amoralej|off18:02
opendevreviewDmitry Tantsur proposed openstack/ironic master: [WIP] ImageCache: respect Cache-Control: no-store  https://review.opendev.org/c/openstack/ironic/+/82232918:21
dtantsurgood night folks o/18:35
*** gmann is now known as gmann_afk19:51
stevebaker[m]good morning21:04
opendevreviewSteve Baker proposed openstack/ironic master: Use driver_internal_info methods for driver utils  https://review.opendev.org/c/openstack/ironic/+/81850521:52
opendevreviewSteve Baker proposed openstack/ironic master: Use driver_internal_info methods for drac driver  https://review.opendev.org/c/openstack/ironic/+/81850621:52
opendevreviewSteve Baker proposed openstack/ironic master: Use driver_internal_info methods for ilo driver  https://review.opendev.org/c/openstack/ironic/+/81850721:52
opendevreviewSteve Baker proposed openstack/ironic master: Use driver_internal_info methods for redfish driver  https://review.opendev.org/c/openstack/ironic/+/81850821:52
opendevreviewSteve Baker proposed openstack/ironic master: Use driver_internal_info methods for other drivers  https://review.opendev.org/c/openstack/ironic/+/81850921:52

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!