Friday, 2023-12-15

*** Kangie_ is now known as Kangie01:17
rpittaugood morning ironic! o/08:22
opendevreviewRiccardo Pittau proposed openstack/bifrost master: Update default ansible version  https://review.opendev.org/c/openstack/bifrost/+/90313608:53
dtantsurJayF: I wonder at which point the health of the requests library becomes an even more serious issue than eventlet..09:47
drannouHello ironic, linked to dtantsur  feedback, I made a simple usb devices inventory detection https://review.opendev.org/c/openstack/ironic-python-agent/+/902769 .  Except I limited the discovery to vendor, product and handle, do you think that I should put more informations ?10:05
dtantsurIt sounds like a good start to me. What is there more to discover?10:05
dtantsurdrannou: left a review, just minor things10:26
drannoumore to discover => we can store a lot of things, but not interesting in my POV10:30
dtantsuryeah, let's start from what you have10:31
dtantsur(I'm looking forward to learning what handle is :D)10:31
opendevreviewDmitry Tantsur proposed openstack/ironic master: Test redfish with reduced sushy-tools feature set  https://review.opendev.org/c/openstack/ironic/+/90336110:39
dtantsurrpittau: morning! looks like bifrost dhcp jobs have been broken since https://review.opendev.org/c/openstack/bifrost/+/875750 although I have no clue how that would cause it..10:59
opendevreviewDmitry Tantsur proposed openstack/bifrost master: DNM Revert "Support ansible-collections-openstack 2 and later"  https://review.opendev.org/c/openstack/bifrost/+/90369411:01
dtantsurtrying in the gate ^^11:01
opendevreviewMerged openstack/ironic-python-agent master: docs: improve rootpwd password generation command  https://review.opendev.org/c/openstack/ironic-python-agent/+/89938011:27
opendevreviewDmitry Tantsur proposed openstack/ironic-python-agent master: Reformat and update the section on injecting root credentials  https://review.opendev.org/c/openstack/ironic-python-agent/+/90375011:34
rpittaudtantsur: that's suspicious indeed, let's see how the revert goes11:36
iurygregorymorning Ironic12:01
opendevreviewMerged openstack/bifrost stable/2023.2: Fix key-order ansible errors  https://review.opendev.org/c/openstack/bifrost/+/90203912:41
rpittaudtantsur: I'm looking at the history of the dhcp jobs, are you sure that's the commit that broke the jobs? I see some success for about another week after the merge13:29
dtantsurrpittau: which dates are you looking at?13:29
dtantsur(I'm definitely not sure)13:29
rpittauI'm looking at master branch between end of february and beginning of march13:29
rpittauI see dhcjp jobs passing until march 8th13:30
dtantsurMarch 8th is when the patch merged13:30
dtantsur08.03.2023 21:2313:30
rpittaummm yeah13:32
rpittaucentos dhcp passed in the revert, but not jammy13:32
dtantsursome jobs fail because they run out of disk space13:33
rpittau OSError: [Errno 28] No space left on device13:33
rpittauyeah13:33
dtantsuryep13:33
dtantsurokay, so the migration to the new collection could be the cause... weird13:33
dtantsurcan it be that they're not passing something?13:33
rpittauis it bigger?13:33
dtantsurI'm more worried about the other failure13:34
dtantsurIIRC nodes fail to boot from network, dnsmasq rejects their requests13:34
rpittauoh I see13:34
rpittau@dt13:49
rpittauehm...13:49
rpittaudtantsur: from the logs I can't really see the difference, but I'm ok with the revert as it looks like dhcp works there13:51
*** dking is now known as Guest1046713:51
*** Guest10467 is now known as dking13:51
rpittauat least on centos9, but jammy has the disk space issue13:52
dkingGood morning. Does anybody know if there's anything in the cleaning steps which checks a device to see if it is a removable media, like a USB, before attempting to clean it?13:53
dkingWe had a server where somebody accidentally left a USB stick in it, and it failed on a custom clean step I have, but it looks like it was already cleaned before I got to it. So I'm assuming that we don't check that. I see checks for virtual media and other things, but I don't see anything for USB.13:56
dtantsurrpittau: I'm not a fan of proceeding with this revert. We'll get ourselves into other problems this way.14:01
dtantsurdking: yeah, we don't exclude any writeable devices by default.14:01
dkingdtantsur: Thank you. should we?14:02
rpittaudtantsur: ok, then I think we should collect at least the dnsmasq config to see if we can find anything14:02
dtantsurdking: I'm not sure. Can we even reliably distinguish removeable devices? Probably a good topic for the next IRC meeting.14:02
dkingdtantsur: Well, there is /sys/class/block/*/removable. If it's not just a simply crazy thought, maybe I'll bring it up at some point.14:05
opendevreviewRiccardo Pittau proposed openstack/bifrost master: [DNM] collect dnsmasq config  https://review.opendev.org/c/openstack/bifrost/+/90375514:08
opendevreviewRiccardo Pittau proposed openstack/bifrost master: Collect lshw output in json format  https://review.opendev.org/c/openstack/bifrost/+/89040814:38
TheJuliagood morning14:54
dtantsurmorning TheJulia 14:54
opendevreviewRiccardo Pittau proposed openstack/ironic-inspector master: [WIP] Handle LLDP parse Unicode error  https://review.opendev.org/c/openstack/ironic-inspector/+/90376014:56
* TheJulia really wishes we had published arm artifacts15:03
dtantsurTheJulia: we have IPA images, what are you looking for?15:08
TheJuliaoooh, we do, hiding under ironic-python-agent-builder15:09
dtantsuryep. because they're kinda experimental (i.e. don't run in the CI)15:09
opendevreviewRiccardo Pittau proposed openstack/bifrost master: [DNM] collect dnsmasq config  https://review.opendev.org/c/openstack/bifrost/+/90375516:04
opendevreviewRiccardo Pittau proposed openstack/ironic-inspector master: [WIP] Handle LLDP parse Unicode error  https://review.opendev.org/c/openstack/ironic-inspector/+/90376016:10
opendevreviewRiccardo Pittau proposed openstack/ironic-inspector master: [WIP] Handle LLDP parse Unicode error  https://review.opendev.org/c/openstack/ironic-inspector/+/90376016:12
opendevreviewRiccardo Pittau proposed openstack/ironic-inspector master: Handle LLDP parse Unicode error  https://review.opendev.org/c/openstack/ironic-inspector/+/90376016:20
rpittaubye everyone! have a great weekend! o/16:55
dtantsurhave a nice weekend folks16:59
JayFI'm taking some PTO for most of the day; I'm around for a bit right now if you need me but will be gone for the day in a bit.17:28
JayFdking: dtantsur: re: not wiping removable devices, that could be a useful feature for rpi support for Ironic, as well, since many "PXE network boot" implementations for rpi involve putting a network boot rom on a usb key17:29
JayFdking: dtantsur: although, as with all optional cleaning features, we'd want to default safe (clean) and opt-in to unsafe (not-clean)17:29
dkingJayF: That sounds good to me. It might not hurt, though, to consider which is less surprising. Are people more likely to use a removable drive and wonder why it wasn't being cleaned, or are they more likely to be surprised when their USB drive is wiped by default. I would go with the feature with least surprise.17:48
JayFI think for the average Ironic use case it'd be more like17:48
JayF"Holy $@%#! someone plugged a removable storage device into my server!?!?"17:48
JayF:)17:49
dkingI would tend to agree. I can't imagine that people are more likely to plug USB drives in and just expect them to get cleaned.17:49
dkingFor my local stuff, I'm skipping the cleaning but logging a warning.17:49
JayFSo maybe it's a ternary option?17:50
JayFremovable_device_action = [wipe,ignore,kaboom] and default to kaboom17:50
JayFso cleanfail if a removable device in, as a security guard17:50
JayFI will be -1 and borderline -2 to a change that just doesn't wipe a device hooked up because it's marked as removable17:50
dkingI'm fine with that. I like the fail early, fail loudly principle. Sometimes, I log errors like "This thing isn't expected, so I'm failing. If you wanted it, do X"17:52
JayFWe had a hardware manager back when I did OnMetal which basically, we had a piece of metadata in the node which dictated what sku it was17:53
JayFand we basically did an inventory check "this should have X ram sticks of Y capacity, these CPUs, these disks, etc etc" 17:53
JayFand if any of it was missing/not what was expected, we would fail cleaning and address it17:53
JayFyou find a lot of flakey hardware that way (I was surprised at the number of cases of "this ram stick shows up 5 out of 10 boots")17:54
dkingYeah, I think that's a good pattern. We aim for that also. When we get past some of the current milestones, I want to go back and add checks against the specs in NetBox.17:56
opendevreviewMerged openstack/bifrost master: Update default ansible version  https://review.opendev.org/c/openstack/bifrost/+/90313621:29
opendevreviewIury Gregory Melo Ferreira proposed openstack/ironic master: RedfishFirmwareInterface - Unit Tests & More logs  https://review.opendev.org/c/openstack/ironic/+/90337923:04

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!