Thursday, 2022-06-23

cboucharTheJulia, stevebaker:  I too will checkout tomorrow.  Thank you for responding.00:02
opendevreviewVerification of a change to openstack/ironic master failed: Remove netboot jobs from the gate  https://review.opendev.org/c/openstack/ironic/+/84393700:06
*** rcastillo_ is now known as rcastill00:10
opendevreviewJacob Anders proposed openstack/ironic master: [WIP] Prevent clear_job_queue failures on older iDRACs  https://review.opendev.org/c/openstack/ironic/+/84685900:57
TheJuliastevebaker[m]: https://cloud.centos.org/centos/9-stream/x86_64/images/ ~780mb, 804 mb, 834mb. The images are growing... (╯°□°)╯︵ ┻━┻01:22
stevebaker[m]Ahhhhh01:23
TheJuliaWe need to allocate more RAM I guess :(01:24
stevebaker[m]or get it to stream to disk?01:29
TheJuliastevebaker[m]: VM booting, so actual virtual machines03:16
TheJuliaat *least* on the UEFI one most likely03:16
TheJuliaour IPA image has only grown a little bit, I'm guessing it is excess firmware03:17
stevebaker[m]TheJulia: oh. If more memory is a constraint we could consider building & publishing a custom test image04:25
TheJuliaWe likely need to consider it… I’m wondering why both are now timing out with ash as well04:26
TheJuliaSSH04:26
rpittaugood morning ironic! o/06:49
jandershey rpittau o/07:10
rpittauhey janders :)07:10
opendevreviewMerged openstack/ironic-python-agent bugfix/8.6: Fix discovering WWN/serial for devicemapper devices  https://review.opendev.org/c/openstack/ironic-python-agent/+/84715507:22
opendevreviewVerification of a change to openstack/ironic master failed: [Minor] Fix misspellings of "insufficient"  https://review.opendev.org/c/openstack/ironic/+/84578907:32
opendevreviewVerification of a change to openstack/ironic master failed: Remove netboot jobs from the gate  https://review.opendev.org/c/openstack/ironic/+/84393707:33
dtantsurmorning folks. the CI is still in a disastrous state, isn't it?07:41
rpittaudtantsur: well, some random failures in ironic CI, the disastrous part seems to be ironic-python-agent CI, although I saw some patches merge there08:03
rpittauso I wonder if the issue we saw was transient, I just issued a recheck to another patch that was impacted08:05
opendevreviewMerged openstack/ironic-python-agent stable/xena: Fix discovering WWN/serial for devicemapper devices  https://review.opendev.org/c/openstack/ironic-python-agent/+/84715708:46
snandaHi Ironic !08:54
opendevreviewMerged openstack/ironic-python-agent stable/wallaby: Fix discovering WWN/serial for devicemapper devices  https://review.opendev.org/c/openstack/ironic-python-agent/+/84715908:54
snandaCan we please review this one ? Already added priority tag. https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/845577/308:54
opendevreviewMerged openstack/ironic-python-agent bugfix/8.1: Fix discovering WWN/serial for devicemapper devices  https://review.opendev.org/c/openstack/ironic-python-agent/+/84716008:54
dtantsursnanda: just to keep the file a bit more manageable, would it be possible to move iDRAC tests to a new file?09:01
opendevreviewMerged openstack/ironic-python-agent stable/yoga: Fix discovering WWN/serial for devicemapper devices  https://review.opendev.org/c/openstack/ironic-python-agent/+/84715609:05
snandadtantsur: Is it really necessary to create a new file? The other iDRAC deployment scenarios are also present in the test_basic_ops file.09:18
opendevreviewMerged openstack/bifrost master: Exclude ansible collection >1.8.0,<2.0.0  https://review.opendev.org/c/openstack/bifrost/+/84328110:16
dtantsursnanda: ah, there is prior art, hmm10:22
dtantsurokay then10:22
dtantsurwe need to rethink how we compose these tests anyway10:23
cboucharstevebaker: your selinux fix looks like it will do what I need. TY11:05
iurygregorygood morning Ironic11:36
*** rcastill is now known as rcastillo13:02
opendevreviewMerged openstack/ironic-python-agent master: Drop support for Python 3.6 and 3.7  https://review.opendev.org/c/openstack/ironic-python-agent/+/84400415:19
TheJuliaoh, I see what is going on with metalsmith jobs15:45
TheJuliahmm. curious15:56
rpittaugood night! o/16:28
opendevreviewJulia Kreger proposed openstack/metalsmith master: CI: Extend timeouts for RAX  https://review.opendev.org/c/openstack/metalsmith/+/84738016:32
iurygregoryI wish extending timeouts would fix ipa-tempest-bios-ipmi-direct-src XD16:36
TheJuliaiurygregory: what is going on there?16:43
iurygregorymost of the failures seem to be related to IPA losing connection; I'm adding my findings in the Whiteboard16:44
iurygregoryhttps://etherpad.opendev.org/p/IronicWhiteBoard#L5716:45
TheJuliaiurygregory: so... I'm suspecting we're booting back into the ramdisk when we think we're on the instance workload16:52
TheJuliaiurygregory: the hint is that a login as the cirros user is being attempted16:53
TheJuliaiurygregory: agent token appears to be a red herring. It gets logged after the conductor has wiped the token from memory if memory serves16:58
TheJuliawell, if my memory serves16:58
opendevreviewVerification of a change to openstack/ironic master failed: Use bifrost on centos9 integration job  https://review.opendev.org/c/openstack/ironic/+/84615517:01
iurygregoryhummm 17:05
iurygregoryI'm wondering what changed in the workflow for this to happen .-.17:05
TheJuliaeeek17:05
TheJuliaalmost 5 minutes before IPA actually starts17:05
TheJulialooks like averaging 247 seconds into boot17:06
iurygregorywe only use cirros in the tinyipa scenario right? when we are using DIB the OS is CentOS9 Stream17:10
TheJuliaI'm not sure I understand what you mean17:18
opendevreviewDmitry Tantsur proposed openstack/ironic master: Do not require deploy_kernel/ramdisk with the ramdisk deploy  https://review.opendev.org/c/openstack/ironic/+/84738817:25
TheJulia++17:27
dtantsurhmmmmm, this has interesting implications for cleaning, doesn't it...17:28
iurygregorynvm, I was looking at two different logs in different jobs...17:28
iurygregorytime to close some tabs17:29
iurygregorywhy am I having a deja vu feeling about "Do not require deploy_kernel/ramdisk" .-.17:30
TheJuliaiurygregory: so it is pxe boot retrying17:30
TheJuliadtantsur: it does... actually17:31
dtantsuriurygregory: ikr?17:31
TheJuliaiurygregory: so, I need to double check the code17:31
dtantsurTheJulia: yep. I guess I can only do it when cleaning is disabled.17:31
TheJuliabut the agent boots, gets a token, pxe retry timer strikes because the thing is crawling to begin with17:31
TheJuliathings reboot, can't get a token17:31
TheJuliabecause token is already allocated17:31
iurygregorydtantsur, I will take care of backporting since you will be out17:31
TheJuliabasically, a fun race I think17:32
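[Editor's sketch] The race described above (agent boots and gets a token, the PXE retry timer fires anyway, the rebooted agent cannot get a token) suggests a guard along the following lines. This is a minimal illustration only: `Node`, `agent_token`, `seconds_since_pxe_start`, and `should_retry_pxe` are hypothetical names invented here, not Ironic's actual model or API, and the real fix may be structured quite differently.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    """Hypothetical stand-in for a node record; not Ironic's real model."""
    name: str
    agent_token: Optional[str] = None     # set once the ramdisk agent checks in
    seconds_since_pxe_start: float = 0.0  # time elapsed since the boot began


def should_retry_pxe(node: Node, retry_timeout: float) -> bool:
    """Retry PXE only if the boot timed out AND no agent has checked in."""
    if node.agent_token is not None:
        # The agent already booted and holds a token. Rebooting it now would
        # leave the fresh boot unable to obtain a token of its own.
        return False
    return node.seconds_since_pxe_start > retry_timeout


slow_but_alive = Node("node-a", agent_token="abc", seconds_since_pxe_start=900)
never_booted = Node("node-b", seconds_since_pxe_start=900)
print(should_retry_pxe(slow_but_alive, 600))  # token present, so no retry
print(should_retry_pxe(never_booted, 600))    # timed out with no token, so retry
```

The point of the guard is that a slow boot which has already produced a token is treated as progress, so the timer alone no longer triggers a reboot.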
dtantsuriurygregory: thanks! I need to fix this cleaning issue, working on it now17:35
iurygregoryack17:35
dtantsurFYI this is a release blocker for us :(17:36
iurygregoryoh wow17:37
* iurygregory checks downstream channel17:38
opendevreviewDmitry Tantsur proposed openstack/ironic master: No deploy_kernel/ramdisk with the ramdisk deploy and no cleaning  https://review.opendev.org/c/openstack/ironic/+/84738817:38
dtantsurnow testing17:38
opendevreviewDmitry Tantsur proposed openstack/ironic master: No deploy_kernel/ramdisk with the ramdisk deploy and no cleaning  https://review.opendev.org/c/openstack/ironic/+/84738817:49
TheJuliai propose we include ovh in with the rax filter17:50
iurygregoryTheJulia, from the failures I saw based on cloud providers17:51
iurygregoryFailures17:51
iurygregory- inmotion : iad3 4x17:51
iurygregory- ovh : GRA1 4x / BHS1 6x17:51
iurygregory- iweb : mtl01 x1017:51
iurygregoryin the filter to increase the timeout and/or the one to switch to tinyipa?17:53
TheJuliafunky17:53
TheJuliayeah...17:53
TheJuliait feels evil17:53
TheJuliabut... 17:53
TheJuliaanyway, need to write a test and I'll have a fix ready17:53
dtantsurokay, the 3rd patchset passed local testing.17:53
dtantsursee you tomorrow folks o/17:54
iurygregoryevil things ++17:54
iurygregorydtantsur, bye o/17:54
opendevreviewJulia Kreger proposed openstack/ironic master: Prevent pxe retry when agent token exists  https://review.opendev.org/c/openstack/ironic/+/84739218:06
NobodyCamgood morning Ironic folks! Happy almost Friday18:13
NobodyCamanyone happen to have insight into why https://review.opendev.org/c/openstack/ironic/+/840986 didn't merge? rebase maybe?18:14
opendevreviewJulia Kreger proposed openstack/ironic-python-agent master: CI: Use 2 cpus for IPA jobs  https://review.opendev.org/c/openstack/ironic-python-agent/+/84739518:19
opendevreviewJulia Kreger proposed openstack/ironic-python-agent master: CI: Use 2 cpus for IPA jobs  https://review.opendev.org/c/openstack/ironic-python-agent/+/84739518:19
TheJuliaso yeah, we want to treat iweb like we do rax18:21
TheJuliaNobodyCam: lots of CI problems it seems18:21
TheJulia:(18:21
TheJuliaactively working some of it18:21
iurygregoryNobodyCam, yup! you will need to rebase to remove the dependency on the patch in the chain that was abandoned 18:23
NobodyCam+++18:24
opendevreviewJulia Kreger proposed openstack/ironic master: CI: Add iweb to the use tinyipa on list  https://review.opendev.org/c/openstack/ironic/+/84739618:24
TheJuliaiurygregory: ^^^18:24
TheJuliathat should clean up ~40%18:24
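[Editor's sketch] The "~40%" figure lines up with the per-provider failure tally posted earlier at 17:51, assuming that tally is what the estimate is based on (the log does not say so explicitly). A quick check of the arithmetic, with the counts copied from those lines:

```python
# Failure counts per provider as listed at 17:51 in this log.
failures = {
    "inmotion": 4,   # iad3 4x
    "ovh": 4 + 6,    # GRA1 4x / BHS1 6x
    "iweb": 10,      # mtl01 x10
}

total = sum(failures.values())
share = {provider: count / total for provider, count in failures.items()}

for provider, fraction in share.items():
    print(f"{provider}: {failures[provider]} of {total} ({fraction:.1%})")
```

With these numbers iweb accounts for 10 of 24 listed failures, a little over 40%, and ovh accounts for the same share.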
iurygregoryTheJulia, .. related to " Use 2 cpus for IPA jobs " ironic-ipa-base uses ironic-base that has  IRONIC_VM_SPECS_CPU: 218:25
iurygregoryhttps://opendev.org/openstack/ironic/src/branch/master/zuul.d/ironic-jobs.yaml#L5318:26
iurygregoryso I'm like "WHAT?!"18:26
iurygregory+2 to the iweb18:26
*** rcastillo_ is now known as rcastillo18:28
TheJuliaoh, doh18:30
TheJulianevermind then on that change18:31
TheJuliaworkflowing the iweb change18:32
TheJuliaiurygregory: https://review.opendev.org/c/openstack/metalsmith/+/847380/ is passing, if you wouldn't mind so we can get steve's stuff moving forward too18:33
TheJuliawell, at least super close to passing on one last job18:33
* TheJulia wonders if we could convince nobodycam to be the until 5pm emergency reviewer18:34
TheJuliastevebaker[m]: https://review.opendev.org/c/openstack/metalsmith/+/847380/18:34
NobodyCammaybe18:34
TheJuliasweet, metalsmith's last job appears to be passing into logs now18:36
NobodyCamHowever I am only able to give it a +1 18:37
iurygregoryTheJulia, +W18:38
TheJuliaiurygregory: awesome, thanks18:38
TheJulianow, let's see if CI clears up in the next few hours18:38
iurygregoryyeah18:38
iurygregoryfingers crossed18:38
iurygregorygoing to the gym, bbl o/19:06
opendevreviewMerged openstack/metalsmith master: CI: Extend timeouts for RAX  https://review.opendev.org/c/openstack/metalsmith/+/84738020:13
opendevreviewJulia Kreger proposed openstack/ironic master: CI: Add iweb to the use tinyipa on list  https://review.opendev.org/c/openstack/ironic/+/84739621:04
opendevreviewJulia Kreger proposed openstack/ironic master: CI: Add iweb to the use tinyipa on list  https://review.opendev.org/c/openstack/ironic/+/84739621:14
opendevreviewJulia Kreger proposed openstack/ironic master: CI: Default to TinyIPA when nested virt is not possible  https://review.opendev.org/c/openstack/ironic/+/84741221:25
iurygregoryback o/21:39
iurygregoryTheJulia, nice one re - "! $(grep -q vmx /proc/cpuinfo)" =) 21:40
TheJuliaold sysadmin tricks21:40
TheJuliaunix whitebeard21:43
JayF`grep -q` is highly, highly underutilized21:54
JayFTheJulia: that could be written as $(grep -qv vmx /proc/cpuinfo) https://gist.github.com/jayofdoom/ebaa4f3c1406cc05dbe5af411b05346e (I was nerd sniped by the potential of playing bash golf)21:56
JayFand it made me wonder if grep -qv worked as expected (it does)21:56
TheJulianice!21:56
TheJuliaI was thinking it was "not enough flags" on grep21:56
TheJuliabut then just went "oh, it works", onward!21:57
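[Editor's sketch] The two grep forms discussed above are worth double-checking before swapping one for the other. `grep -q PATTERN` exits 0 when some line matches, so negating it means "no line contains vmx". `grep -qv PATTERN` exits 0 when some line does not match, which holds for almost any multi-line file. The Python below only models those two exit-status rules on made-up cpuinfo text; the function names and sample strings are illustrative assumptions, not anything taken from the patch or the gist.

```python
def grep_q(text: str, needle: str) -> bool:
    """Model of `grep -q needle`: success if ANY line contains needle."""
    return any(needle in line for line in text.splitlines())


def grep_qv(text: str, needle: str) -> bool:
    """Model of `grep -qv needle`: success if ANY line lacks needle."""
    return any(needle not in line for line in text.splitlines())


def nested_virt_missing_q(cpuinfo: str) -> bool:
    # Corresponds to: ! $(grep -q vmx /proc/cpuinfo)
    return not grep_q(cpuinfo, "vmx")


def nested_virt_missing_qv(cpuinfo: str) -> bool:
    # Corresponds to: $(grep -qv vmx /proc/cpuinfo)
    return grep_qv(cpuinfo, "vmx")


# Made-up, heavily shortened cpuinfo samples.
WITH_VMX = "processor\t: 0\nflags\t\t: fpu vme vmx sse\n"
WITHOUT_VMX = "processor\t: 0\nflags\t\t: fpu vme sse\n"

print(nested_virt_missing_q(WITH_VMX), nested_virt_missing_qv(WITH_VMX))
print(nested_virt_missing_q(WITHOUT_VMX), nested_virt_missing_qv(WITHOUT_VMX))
```

On the sample with vmx present, the negated `-q` form reports that nested virt is available, while the `-qv` form still reports it as missing because the `processor` line lacks vmx. Under this model the two are equivalent only when every line of the input contains the pattern.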
TheJuliaoh joy, centos9 ramdisk is >400megs22:27
iurygregory(╥﹏╥)22:30
TheJulia780mb uncompressed22:30
* TheJulia will interpret that as crying22:30
iurygregoryyes22:30
TheJulia:)22:31
JayFI remember back before we had image streaming22:32
JayFand for agent deploys, your disk image*2 + total size of ramdisk was how much ram you needed for deploy22:33
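[Editor's sketch] The pre-streaming rule of thumb above can be written as a one-line estimate. The function name is made up for illustration, and plugging in the ~834 MB image and 780 MB uncompressed ramdisk figures mentioned elsewhere in this log is only an example, not a measurement anyone reported.

```python
def pre_streaming_ram_needed_mb(disk_image_mb: float, ramdisk_mb: float) -> float:
    """Rule of thumb from the chat: disk image * 2 + total ramdisk size."""
    return 2 * disk_image_mb + ramdisk_mb


# Illustrative inputs only: an 834 MB image and a 780 MB uncompressed ramdisk.
estimate = pre_streaming_ram_needed_mb(disk_image_mb=834, ramdisk_mb=780)
print(f"approx. RAM needed without image streaming: {estimate} MB")
```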
TheJulia21M libjavascriptcoregtk-4.0.so.18.20.5 <-- do we *really* need javascript....22:33
TheJuliafor gtk at that....22:33
iurygregorywow I wouldn't expect we need js22:34
TheJuliagah, the largest firmware offenders are network cards22:34
jandersgood morning Ironic o/22:34
* JayF in before "sshd needs libXlol for x forwarding, which depends on gtk3-banana which needs javacripe-for-gtk"22:35
TheJulia"smartnics"22:35
iurygregorymorning janders o/22:35
* TheJulia wonders if she can pay bills by making swords... and feels like gtk3-banana could be a useful fuel22:35
JayFTheJulia: I'm using one of those atlantic nics which have GIGANTIC firmwares. I had to install it with a USB network adapter because install livecds wouldn't package the firmware22:35
TheJuliaJayF: so... umm... does your network card run linux too?22:36
* TheJulia wonders what needs cracklib22:36
JayFTheJulia: it's not like one of those mellanox ones that does all the magic which makes it even sadder22:37
JayFTheJulia: afaict it's basically like a winmodem except it's a nic22:37
JayFor really, it probably executes that code on-chip, but like, it's loaded at boot22:37
JayFIDK, it's just obnoxious and makes it super difficult for things like our ramdisk to truly be minimal22:38
JayFtime to make our own ironic ramdisk distro, where it somehow detects and pulls network firmware over the network using like, ipxe-snp or something like that22:38
JayFI'm pretty sure that's gtk-bananas22:38
TheJuliaso!22:38
TheJuliatrammel hudson at lower layer labs has an awesome demo22:39
TheJuliain... UEFI22:39
TheJuliano drivers, no firmware22:39
TheJuliawell22:39
TheJuliawe would be able to literally rm -r -f all of the firmware files22:39
TheJuliaand then... if needed kexec into our full ramdisk22:39
JayFDo you have a link to anything written about that?22:40
JayFMy only worry with a pattern like that would be the edges ... e.g. vlan/bonding support and the like22:40
JayFand yet another set of edges to work around22:40
* TheJulia looks for the tweets22:40
JayFTheJulia: oh, I did some interesting science today22:41
JayFguess what github copilot writes if you feed it `def erase_block_device`22:41
JayFI was afraid it'd spit back our code at us, but instead it's the most cursed implementation ever22:41
TheJuliahmm, I'll need to message him, looks like he nuked the tweets22:41
TheJuliaJayF: how... cursed?22:41
JayFhttps://gist.github.com/jayofdoom/db94a89d96d5f781e134532f371ded7222:42
* TheJulia twitches22:42
JayFthat's 100% safe, not dangerous at all22:42
JayFand certainly clears off all your [adjusts glasses] megabyte of information22:42
TheJuliaye olde issue of pam requiring cracklib through pw quality... and cockpit requiring it too22:45
TheJuliayeah, we get webkit now22:50
TheJuliarpm -e webkit2gtk3-jsc libproxy-webkitgtk4    saves like 26MB22:51
TheJuliadoes... ipa... *really* need polkit?!?22:53
JayFAlmost certainly not22:53
JayFWe don't even create nonroot users in the ramdisk iirc22:53
TheJulia30+ mb more22:54
TheJuliarpm -e polkit polkit-libs PackageKit polkit-pkla-compat <--- now down to 155MB of libs22:55
opendevreviewJulia Kreger proposed openstack/ironic-python-agent-builder master: Remove some extra Centos Packages  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/84741723:08
TheJuliaugh, v6 is failing again23:09
opendevreviewMerged openstack/metalsmith master: metalsmith list handle missing port  https://review.opendev.org/c/openstack/metalsmith/+/84606723:50
