Wednesday, 2022-04-20

rpittaugood morning ironic! o/06:42
arne_wiebalckGood morning rpittau and Ironic!06:45
rpittauhey arne_wiebalck :)06:46
dtantsurmorning folks08:02
opendevreviewRiccardo Pittau proposed openstack/ironic master: Fix names of two jobs  https://review.opendev.org/c/openstack/ironic/+/83866108:48
opendevreviewRiccardo Pittau proposed openstack/sushy master: Update jobs names  https://review.opendev.org/c/openstack/sushy/+/83866208:50
opendevreviewRiccardo Pittau proposed openstack/sushy master: Update jobs names  https://review.opendev.org/c/openstack/sushy/+/83866208:52
opendevreviewRiccardo Pittau proposed openstack/ironic master: Fix names of two jobs  https://review.opendev.org/c/openstack/ironic/+/83866108:53
opendevreviewRiccardo Pittau proposed openstack/sushy-tools master: Update jobs names  https://review.opendev.org/c/openstack/sushy-tools/+/83866408:56
opendevreviewRiccardo Pittau proposed openstack/ironic master: Fix names of two jobs  https://review.opendev.org/c/openstack/ironic/+/83866108:56
dmelladomorning all, forgot to turn on the irc09:26
opendevreviewRiccardo Pittau proposed openstack/sushy-tools master: Use python Zed tests  https://review.opendev.org/c/openstack/sushy-tools/+/83867409:44
dtantsurdmellado: I know this feeling. sometimes I forget to turn on slack and keep wondering why the morning is so calm09:53
dmelladodtantsur: heh, totally09:55
dmelladoeven so 'turn on' is just loading the glowing-bear website09:55
dmelladoxD09:56
opendevreviewIury Gregory Melo Ferreira proposed openstack/ironic-python-agent-builder stable/wallaby: Add multipath-io element  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/83854910:47
iurygregorygood morning10:48
opendevreviewIury Gregory Melo Ferreira proposed openstack/ironic-python-agent-builder stable/yoga: Add multipath-io element  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/83869010:48
opendevreviewIury Gregory Melo Ferreira proposed openstack/ironic-python-agent-builder stable/xena: Add multipath-io element  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/83869110:48
iurygregoryHUH?! Could not perform action: Branch refs/heads/bugfix/8.3 does not exist. O.o10:51
dtantsurwtf11:16
iurygregoryyeah, I had this reaction also...11:28
iurygregoryit used to be possible to do cherry-pick via UI to bugfix branches right? (Or I'm going crazy...)11:28
iurygregoryoh god more meetings in my calendar <O> /me needs more coffee today11:30
dtantsuriurygregory: it was definitely possible a while ago11:37
iurygregoryI'm going to try to cherry-pick manually just to see...11:37
iurygregoryok, I was sleepy....11:44
* iurygregory shouldn't open gerrit at 7am 11:48
dtantsurdefinitely11:50
dtantsurand avoid slack11:50
iurygregoryoh, this one I couldn't a lot of message while I was off :D11:51
iurygregoryand very urgent ones..11:52
opendevreviewAija Jauntēva proposed openstack/sushy-oem-idrac stable/yoga: Revert "Update .gitreview after project rename"  https://review.opendev.org/c/openstack/sushy-oem-idrac/+/83869212:11
opendevreviewAija Jauntēva proposed openstack/sushy-oem-idrac stable/xena: Revert "Update .gitreview after project rename"  https://review.opendev.org/c/openstack/sushy-oem-idrac/+/83869312:12
ftarasenkohi, Ironic! Can someone point me to bug with idrac when I get next error while cleaning (raid creation) - Node is in manageable state; should be in one of wait call-back, deploying. Details https://paste.openstack.org/show/bAUHcgWjjjaDQzl6CyiR/12:17
opendevreviewIury Gregory Melo Ferreira proposed openstack/ironic-python-agent master: Multipath Hardware path handling  https://review.opendev.org/c/openstack/ironic-python-agent/+/83703912:17
dtantsurajya: do you remember the error ^^^? I *think* it rings some bells, there was something about cleaning reporting.. or maybe it was in iRMC?12:27
dmelladoiurygregory: gerrit at 7am is deff a terrible idea xD12:31
dmelladothere should be a -1 not_enough_coffee_yet12:32
ajyadtantsur don't recall seeing this before.12:32
dtantsurftarasenko: do you have https://opendev.org/openstack/ironic/commit/5e4a617b46c713aba762cc74cc0ea0c64c84a0c3?12:32
iurygregorydmellado, yeah!12:33
dtantsurthis commit is from year 2020, but who knows12:34
dmelladowell, that's a really bad year12:34
dmelladodrop it12:34
dmelladodtantsur: 12:34
dmelladoxD12:34
dtantsurtrue :)12:34
ftarasenkodtantsur: yep, my W has this commit.12:34
dtantsurftarasenko: could you grab more logs from this node? from the beginning of cleaning?12:35
ftarasenkodtantsur: Sure. https://paste.openstack.org/show/bSHcKxSmuW4KmMSF4HXI/ 12:41
ftarasenkodtantsur: Do you need debug print? It's much bigger, need to share whole file. 12:43
dtantsurftarasenko: I'm looking at what I have now, but I may need debug logs indeed12:43
dtantsurthe order of actions is absurd12:43
ajyaftarasenko: so it's indeed cleaning and in logs it says it can't continue deploying? Somehow jumps from cleaning to deploy code?12:48
ajyacould be some recent-ish changes around async tasks that causes this12:49
dtantsurajya: it seems that the continue_polling flag is ignored or not set properly, and the node finishes cleaning on the next heartbeat12:49
ftarasenkodtantsur: ajya: I see that cleaning is finished and have raid ready. but for some reason it says that it wants to start cleaning again. IDRAC is latest one,  5.10.10.00 , I can downgrade to check if it's connected.12:51
ftarasenkoAlso I have log file with debug output to share. Is google drive ok to share file? 12:52
TheJulia good morning12:53
ajyaftarasenko: does it happen with all nodes/all the time? Or from time to time? At the moment, I don't think iDRAC version impacts this. Suspecting something to do with idrac driver cleaning steps.12:53
ftarasenkoajya: it happens from time to time. For example, today I had one node cleaned from the first time, one from second attempt, two nodes from third, and last one still is not cleaned. 12:56
ajyathanks, I could try to reproduce this on my systems unless dtantsur sees why this is happening12:57
TheJuliadtantsur: I think irmc, but I think I spotted something in storyboard at one point that might have been exactly this, but we got no details in the bug report12:58
ftarasenkoajya: tnx. it's R350 with integrated H355 controller. I just try to build RAID with 2 drives.12:59
TheJuliadtantsur: the irmc folks are also the only ones that did storage integration with fiberchannel12:59
dtantsur*nod*13:00
dtantsurgood morning TheJulia 13:00
iurygregorygood morning TheJulia 13:00
rpittaugood morning TheJulia :)13:00
dtantsurajya, ftarasenko, the fact that it only happens sometimes, can be explained by a race between heartbeats and the background polling in the driver13:00
ajyadtantsur: yup and it seems often enough that it should have been seen previously, could there be something that changed here creating this race on Ironic side?13:05
dtantsurpossibly?13:06
ajyamaybe something also on idrac steps, have to check13:06
TheJuliadtantsur: truthfully active/passive setups seem quite rare to me at this point, but it can be policy enforced for purposes of A or B storage fabric failover as well. Out of all of the SANs I've managed in my career, only two did hard blocking active/passive by default but some others it could be configured in for A/B fabric scenarios13:07
ajyaftarasenko: to confirm - it's W release and for raid interface it is idrac-wsman and not idrac-redfish?13:07
ftarasenkoajya: that's correct. idrac-wsman and W. Should I retry with idrac-redfish?13:08
ajyaftarasenko: not needed, I'll check that13:13
TheJuliaAnyone having issues with github?14:30
TheJulianvmd, just loaded on my 4th try14:30
TheJuliaiurygregory: any estimate on when any sort of summary will go up, trying to create some downstream stories to track work for steve, harald, and I14:33
TheJuliai guess I can just use the etherpad14:34
iurygregoryTheJulia, I should have sent yesterday.. but the customer issue took my whole day basically, I wrote part of it, let me just try to get it done before going for lunch o/14:36
TheJuliano worries, thanks14:42
dtantsuriurygregory: hey, when is our next bugfix release due?15:29
iurygregorydtantsur, interesting that you asked me after I was looking at an email about downstream releases =)15:31
dtantsuriurygregory: it's absolutely not a coincidence :)15:31
iurygregorydtantsur, I was thinking first week of May after the email...15:32
dtantsurwe shouldn't plan upstream business around downstream targets.. but if they can line up, that will be great15:32
iurygregoryyeah15:32
iurygregoryI will try to see if it will be possible to sync things =)15:33
iurygregoryso we don't have to break our heads :D15:33
dtantsurcc rpittau 15:33
rpittausome unexpected events created an interesting alignment15:44
rpittauwe may be able to avoid some headaches15:46
iurygregorywe would need before FF downstream right?15:46
rpittauyep15:46
dtantsuror slightly after, if we're ready to do the dirty thing *innocently whistles*15:47
rpittau:(15:47
iurygregoryOMG15:47
iurygregoryXD15:47
dtantsursee you tomorrow o/16:03
romerHello Everyone ! 16:04
rpittaubye! o/17:02
opendevreviewIury Gregory Melo Ferreira proposed openstack/ironic-python-agent stable/wallaby: Multipath Hardware path handling  https://review.opendev.org/c/openstack/ironic-python-agent/+/83778417:29
TheJuliawhere oh where did I put my brain...17:30
admiyoSo, as I learn a little bit about Kolla and Ironic etc, I found it worthwhile to take notes. For any Ironics that are wondering about the docker side of things, maybe this will be useful: http://adam.younglogic.com/2022/04/how-do-you-keep-the-kolla-playing/19:38
admiyoWhat do we call people that work on Ironic?  I was a Keystoner back in the day....19:39
JayFI think we're all a bunch of Bare Metal bears, right?19:39
* rm_work waves at JayF 19:43
JayFrm_work: o/ 19:45
JayFrm_work: they haven't taken away my keys to the joint, wanna go hide out and goof off in the secret cores only party room? /s19:46
rm_worklol19:47
TheJuliaNobodyCam: please queue up the bar tender, we'll need tasty beverages :)19:56
JayF**coors only party room /s19:56
NobodyCamwait what19:56
TheJulialol19:57
* TheJulia hates edk219:57
NobodyCamGood Afternoon Ironic folks19:57
admiyoI have to figure out how to get a  custom built ipxe.efi with the right drivers into the ipxe container. For aarch64. 19:58
TheJuliaadmiyo: yay!19:58
admiyoYeah, I got bifrost to work19:59
NobodyCam\o/19:59
admiyoI did it cutting all the corners and compiling all the tech debt19:59
TheJuliaadmiyo: generally, I've use "Ironicans" or "Followers of Irony"20:00
TheJuliaWhich reminds me actually20:00
TheJuliaI'm thinking of pixie boots pins20:00
admiyoBase on Mo Duffy's comic?20:00
NobodyCam:take_my_money: 20:01
JayFadmiyo: TheJulia: Fun story, the word for a group of us is a "Jagged Little Pill"20:01
TheJuliaadmiyo: Mo Duffy's comic?20:02
admiyohttps://i1.wp.com/duffy.fedorapeople.org/blog/drawings/summit2010comics/colonelkernel.png20:02
TheJuliaadmiyo: oh my20:02
JayFat least they didn't have to use a ramfs in the boot sequence20:03
TheJuliaoh jeeze20:04
TheJuliaso... one of my coworkers hands me a partition image which is not working, and I want to reproduce the boot20:04
TheJuliaso I realize... no deployment. Cool, I can fix the image, I know the magic20:04
TheJuliaand I go to do that20:04
TheJuliaand edk2 doesn't see the esp partition20:04
TheJuliawell20:05
TheJuliaIt sees it, but says there are zero files20:05
TheJuliaso of course, it just won't boot now20:05
admiyoMaybe there really are 0 files in there. 20:05
TheJuliatake base file before the qcow, and magically there does happen to be files in it when I loopback mount it20:06
JayFTheJulia: I was helping a guy in #gentoo yesterday who had a laptop with a self-encrypting drive that basically was randomly encrypting and decrypting itself (so basically once every "N" boots it stayed locked, and his sda appeared to the OS as a 32k device with no partitions)20:06
TheJuliaJayF: yeouch20:06
JayFTheJulia: I'm like "I know things about disk wiping, and you need to throw that disk off a cliff" 20:07
JayFlol20:07
JayFTheJulia: loopback mount like, qemu-nbd? 20:07
TheJulialoopback raw file20:07
TheJuliawith gpt partitions20:07
JayFThat's very strange.20:07
TheJuliawhich EDK2 sees!20:07
JayFSo you said this is a partition image, not a whole disk image20:08
JayFyes?20:08
JayFSo why does it have partitions?20:08
TheJuliawell, I converted it to be a whole disk image20:08
TheJuliaand no joy20:08
* TheJulia mounts the whole disk image up, mounts hte filesystems, and extracts the files to figure out the headache the harder way20:09
TheJuliawhich might just be easier20:09
TheJuliaoh joy20:11
admiyoAnyw, to make the AARCH64 ipxe process work, I had to clone https://github.com/ipxe/ipxe.git20:15
admiyo cd ipxe/src/20:15
admiyo make bin-arm64-efi/snponly.efi  ARCH=arm6420:15
admiyosudo  cp src/bin-arm64-efi/snponly.efi /var/lib/tftpboot/ipxe.efi20:15
TheJuliaI feel like I did that ages ago20:16
admiyoI am thinking that the right solution is to get a binary posted somewhere, and always download it20:16
admiyothe dnsmasq config should switch on the arch flag and return a different binary.  Not sure how to properly name things.  And...I don't think that snponly is the way to go, though it worked for me20:17
TheJulia++ I would save it someplace and just re-use, tbh20:18
TheJuliaadmiyo: are you doing only arm or arm and x86_64?20:19
admiyoRIght now only arm20:22
admiyoBut I want the solution in upstream20:22
admiyoEven the most recent package from Ubuntu does not have a functional ipxe.efi20:22
admiyoand it does not make sense to limit the ironic server to boot only the devices of the same architecture20:23
admiyoFor Kolla, the ipxe files are mounted from the directory where they are installed via the .deb file.20:24
admiyoAre there other cases of binaries being pulled in by openstack projects that are not in packages for the underlying distros?20:32
opendevreviewMerged openstack/sushy-oem-idrac stable/yoga: Revert "Update .gitreview after project rename"  https://review.opendev.org/c/openstack/sushy-oem-idrac/+/83869220:49
TheJuliaI feel like... for arm specifically, a lot of folks have just kind of had to go down the painful path in the past20:51
TheJuliaand yeah, we need to collect all of that and fix it20:51
opendevreviewMerged openstack/sushy-oem-idrac stable/xena: Revert "Update .gitreview after project rename"  https://review.opendev.org/c/openstack/sushy-oem-idrac/+/83869320:52
admiyoSo...I could try and get the snp and snponly versions of the ipxe.efi built in the Debian and Ubuntu packages, and then use the unstable versions until they are marked stable.  User could fetch the non-local-arch version of the packages, unpackathem, and use those files.  That seems like the long path.  I'm just wondering what the shorter but acceptable path would be.21:37
rpiosodtantsur: Thank you for your message about DHCP-less. I believe I am all set for now.21:38

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!