Friday, 2024-12-06

rpittaugood morning ironic! happy friday! o/07:52
*** dking is now known as Guest223310:20
iurygregorygood morning!11:30
*** mko is now known as Guest224312:05
opendevreviewTakashi Kajinami proposed openstack/ironic-python-agent master: Replace crypt module  https://review.opendev.org/c/openstack/ironic-python-agent/+/93717512:28
opendevreviewMerged openstack/ironic master: clean up lints for automated_steps doc plugin  https://review.opendev.org/c/openstack/ironic/+/93707013:50
TheJuliagood morning14:21
opendevreviewDoug Goldstein proposed openstack/ironic master: The i18n function is used but not imported  https://review.opendev.org/c/openstack/ironic/+/93725514:45
cardoeAm I crazy or is ^ broken and should be backported?14:45
opendevreviewMerged openstack/bifrost master: Use python version to set DEFAULT_PIP_ANSIBLE  https://review.opendev.org/c/openstack/bifrost/+/93679714:48
rpittaucardoe: mmm that looks broken indeed and it's not the only place apparently14:52
cardoerpittau: since you're on... may I ask your opinion on which version of Python to target? 3.6/3.7 or 3.9? ala https://review.opendev.org/c/openstack/sushy/+/93491614:55
rpittaucardoe: 3.9 I doubt we'll backport that :)14:58
rpittaualthough dtantsur mentioned the oldest supported for the oldest supported branch, but I don't get why since we're not backporting that change (hopefully)14:58
cardoeWell I think his concern is future changes being applied and needing to be backported and using a style that's 3.9 but then needing to be backported to somewhere older.14:58
rpittauyeah, I got to that right when I was answering his comment :)14:59
rpittauso yep 3.714:59
opendevreviewMerged openstack/ironic stable/2024.2: Update Node Cache after Successful Clean/Service  https://review.opendev.org/c/openstack/ironic/+/93710715:00
-opendevstatus- NOTICE: Gerrit on review.opendev.org is being upgraded to version 3.10 and will be offline. We have allocated an hour for the outage window lasting until 1700 UTC.15:02
rpittaulooks like gerrit wants us to start the weekend early :)15:03
-opendevstatus- NOTICE: Gerrit on review.opendev.org is being upgraded to version 3.10 and will be offline starting at 1600 UTC. We have allocated an hour for the outage window lasting until 1700 UTC.15:05
iurygregory weekend early ++15:21
iurygregorytime for me to organize more things before traveling :D 15:21
priteauHello. Is there a known issue with IPA master branch at the moment? We are hitting this error in Kayobe CI:15:29
priteautraps: ironic-python-a[495] trap invalid opcode ip:7efee73f58a2 sp:7ffcb6eaf7a8 error:0 in _rust.abi3.so[7efee7112000+631000]15:29
priteauIllegal instruction15:29
priteauHum, we are still using TinyIPA in these CI jobs15:30
TheJuliaheh15:32
TheJuliaOhhh, I wonder what the invalid opcode is15:33
TheJuliaIllegal instruction, fun15:33
rpittaubye everyone, have a great weekend! o/15:36
TheJuliao/15:36
priteauThere's no change in IPA-builder though, so either it's a change in TinyLinux 15 itself, or a change in the virtualisation layer (but it affects both our rocky-9 and ubuntu-jammy jobs)15:41
TheJuliawe do *some* compilation in CI15:42
TheJuliait could be a default change in newer ubuntu which doesn't compile in older calls15:43
TheJulia.... cleanest thing to do would be to ensure the VM CPU flags are up to date15:43
TheJuliathis is on VMs your creating?15:43
priteauyes, with openstack/tenks15:43
TheJuliaokay15:43
* TheJulia pulls up tenks to see what it does with the xml15:44
priteauWe can check next week if it resolves itself. The issue only appeared today.15:46
TheJuliapriteau: do you have a job link handy?15:48
priteauhttps://zuul.opendev.org/t/openstack/build/59488a4ffd76437e90045a03d9f9b65b15:48
priteauVM console is at https://2733627e1a0348401721-fb543e4b225e8fdb51d174f0fee213a6.ssl.cf1.rackcdn.com/909686/47/check/kayobe-overcloud-ubuntu-noble/59488a4/primary/kolla/tenks/tk0-console.txt15:48
priteauAnd a rocky-9 one: https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_824/936784/1/gate/kayobe-overcloud-rocky9/8243b42/primary/kolla/tenks/tk0-console.txt15:52
priteauBut maybe don't waste time on this quite yet, in case it's a temporary glitch with the latest build.15:54
* TheJulia pulls up the libvirt manual15:54
-opendevstatus- NOTICE: Gerrit on review.opendev.org is being upgraded to version 3.10 and will be offline momentarily. We have allocated an hour for the outage window lasting until 1700 UTC.16:01
TheJuliapriteau: I'm worried tenks is sending you down a bad path16:02
TheJuliaspecifically, per the logs, it appears to create bios booting (which is kind of insane these days) VMs of the "pc" type16:03
priteauWe switched to uefi by default some weeks ago, I will check if we override this16:03
TheJuliayour machine type should also likely be at least pc-q35 also16:04
TheJulia... In our CI, we have pc-q35-8.2 (https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_f15/896570/12/check/ironic-tempest-uefi-redfish-https/f15fb2e/controller/logs/libvirt/qemu/node-0.xml)16:04
priteauThanks. I will check all this next week.16:04
TheJuliawe're then also enabling the cpu host-model to be passed through16:04
TheJulia<cpu mode="host-model" check="partial"/>16:04
priteauIs this relevant even when using pure qemu (no kvm)16:05
TheJuliabasically, your getting a VM with a a default processor which has a limited instruction set16:05
TheJuliaIt is when you have OSes changing/evolving forcing minimum revisions for compile time artifacts to use newer CPU flags16:05
TheJuliafor example, centos stream 9, required x86_64 v2. stream 10 requires v316:06
TheJuliav3 == haswell processors (from 11 years ago)16:06
TheJuliathe VM you have defined... If I'm understanding libvirt properly is x86_64-v116:06
TheJuliaso if the host OS compiles v2 or v3 flags, yeah, your going to get invalid opcode errors on v116:07
priteauWe're inside TinyLinux here though. But your comment is relevant for our attempt to switch to c9s IPA16:09
TheJuliapriteau: ultimately, it all boils down to whatever the "virsh capabiities" command splits out on the host16:09
TheJuliaand possibly nested virt or not16:10
priteauThanks for the pointers. I will investigate further next week.16:10
TheJuliaeven in tinylinux, we're building on noble as the base build host at this point for latest artifacts16:10
TheJuliaso noble may have it's own spin16:10
priteauI see16:11
TheJuliayeah, its a bit of an onion16:11
priteauThank you for checking :)16:15
JayFI'll note that there are machines in opendev CI cluster that do not support x86-64-v316:32
JayFAnd if you were running an IPA image based on an OS that requires V3, that's exactly the error you'd get16:32
TheJuliaYeah, i was just thinking about that in the shower16:38
TheJuliaif the compilers lean towards v3/v4 automatically as tuning, there is more risk, but I suspect we would have already seen that for ironic's CI...16:38
JayFI doubt that GCC will ever take that automatically, but as you noted above, some distributions have changed their defaults.16:46
JayFMy project to give us a gentoo-based CI ram disk is using the non-optimized binaries from them, so it won't be an issue on that. I'd also be surprised if tiny core was willing to make their binaries incompatible with old hardware. I'd be more worried about Ubuntu or centos.16:47
* JayF notes he is on mobile today and on a sick day since his back is refusing to operate properly16:47
TheJuliaJayF: the invalid opcode was in ironic-python-agent's runtime16:49
TheJuliaso... something which was installed to support it16:49
TheJuliaJayF: Find a ghostbusters movie and watch it from a linear surface16:50
TheJuliaor any other preferred movie16:50
TheJuliaGhostbusters is only on my mind  because I just used "don't cross the streams"16:50
JayFTheJulia: that looks like a cryptography library was installed that was built with V317:04
JayFI have a friend who works on that library, I'm asking if they changed anything17:06
cardoeTheJulia: is x86_64 v3 really haswell? I know Intel v3 was Haswell. I didn't think the GCC ABI standardization followed Intel.17:28
opendevreviewDoug Goldstein proposed openstack/ironic master: apply line length rules to the doc directory  https://review.opendev.org/c/openstack/ironic/+/93726917:30
opendevreviewDoug Goldstein proposed openstack/ironic master: change ambiguous variable name  https://review.opendev.org/c/openstack/ironic/+/93727017:30
opendevreviewDoug Goldstein proposed openstack/ironic master: move imports to top of file for lints  https://review.opendev.org/c/openstack/ironic/+/93727117:30
opendevreviewDoug Goldstein proposed openstack/ironic master: enable ruff in pre-commit with some initial lints  https://review.opendev.org/c/openstack/ironic/+/93727217:30
TheJuliacardoe: On the intel side, I think v3 starts mapping to the haswell line17:32
TheJuliaI think of it much more in processor lines17:32
JayFThere's a very good page on the Gentoo wiki that maps them to consumer processors.17:33
TheJuliaI think the modeling follows features17:33
JayFOh for sure, but that makes it even more confusing when you're trying to map it to actual processor models17:34
TheJuliayeah17:35
TheJuliaI gave up trying to keep any portion of the tree in my head mapping wise17:36
cardoeyeah it doesn't map.17:37
cardoex86-64-v1 = AMD K8 and Intel Prescott  17:37
cardoeaka the O.G. 64-bit stuff. AMD64 and EM64T17:38
TheJuliayup17:38
TheJuliaWhich was a *very* long time ago17:38
cardoex86-64-v2 = AMD Bulldozer / Intel Nehalem. Which for Intel is the architecture before Sandy Bridge which Intel retconned as Intel Core v117:39
cardoex86-64-v3 = AMD Excavator / Intel Haswell. Which happens to be what Intel calls v3. QEMU targets this now for emulation. But older QEMU for Xen still targets x86-64-v1, which could be the issue with some of the CI builders.17:41
cardoex86-64-v4 = AMD Zen 4 / Intel Skylake17:41
cardoeSo all around old stuff. I suspect you're hitting the old QEMU issue, JayF.17:41
JayFI'm personally not hitting anything, I'm just responding to the CI log posted above.17:43
dtantsurside note: any RHEL/CS 9 derivative will require v317:46
TheJuliadtantsur: I thought it was v2 for CS9 and v3 for CS1017:47
dtantsurah, Julia already said that17:47
dtantsurTheJulia: I may be wrong. I remembered it was v3 but I have memory of a goldfish.17:47
TheJuliamaybe more like a pack of goldfish who speak :)17:47
TheJuliaI know there are some dicscussions around CS10 suddenly having v3 as a req and how only OpenMetal and Rackspace Flex offers defaults which end up matching17:48
* dtantsur a pack of goldfish in a hoodie17:48
TheJuliaWhich is going to create pain in different areas17:48
TheJuliadtantsur: heh17:49
dtantsurI see, yeah. The transition to v2 did require updating our virtual machine settings17:49
TheJuliaYeah, and the CS10 discussion is why I'm slightly more aware of this right now17:49
TheJuliaUltimately, in the short term, we might need to see if we can tune the builds for python attributes for tinycore... or just use a binary build leaning towards v2 until other operators can do the needful to document/update so VMs can get launched with newer flags17:50
TheJuliaThis is very much a feedback loop challenge17:50
TheJuliaand we can't let ourselves fall into traps of past consistency17:51
dtantsurwdym "traps of past consistency"?17:51
TheJuliaso if we build artifacts which are geared for just v2, then that will work, but we're also avoiding moving forward17:52
TheJuliaso we sort of trap ourselves in that past consistent state rooted in ages past17:52
TheJuliadoes that make sense?17:52
dtantsurYeah, but I don't think it's on us whether we build stuff for v2 or v3?17:53
dtantsurLike, if you use IPA-builder with CS10, you will get a v3 build17:53
TheJuliaIntel Nehalem was released in November 2008....17:53
TheJuliayup17:53
TheJuliaAnd there *is* a duality, we're going to have to navigate it more and more17:54
dtantsurOr do you mean migrating our official IPA builds to CS 10?17:54
dtantsurI guess it will be dictated on one hand by the support of CS 9, on the other - how many drivers will be removed in CS 10, on the third (WUT, how many hands to I have) - which hardware will only be supported in CS 1017:54
TheJuliaWe will want to eventually, but we're functionally blocked with the state of opendev's ci proiders17:54
TheJuliaunless we can lock our jobs to specific clouds17:54
dtantsurand that as well17:54
dtantsur(I think we essentially understand each other, just cannot agree on the exact words to express that, lol)17:55
TheJuliayeah, definitely17:55
TheJuliaI kind of see this like bios boot versus uefi boot17:55
TheJuliaeventually, everyone needs to be uefi17:55
* dtantsur +217:56
TheJuliathe road for some to get there is going to be different for everyone17:56
TheJulia.... if they are not already17:56
dtantsurYet, I still see folks resisting UEFI, sometimes for kinda valid reasons, most commonly - out of habit17:56
TheJuliaoh wow17:56
TheJuliado they not understand they are limiting all IO to the first 4GB?17:56
JayFOne thing worth consideration is whether or not our customers, meaning openstack ironic users, are all new enough hardware. That should be what drives how quickly we move forward, not the decisions of some distributions IMO17:56
* TheJulia is semi-mindblown17:57
JayFFor instance, if cardoe is using ironic plus IPA, we know for a fact that Rackspace has some machines that don't meet the spec17:57
dtantsurTheJulia: I did not ask, to be honest (mostly because TIL)17:57
dtantsurWell, it is 0 days since somebody told me "we need IPMI because of old hardware" (and 1 days since the previous instance)17:58
JayFExactly 😂😂😭17:58
TheJuliaFrom my point of view, I get it takes time to move hardware, I get people might want to keep much older hardware in production for longer if the tax advantage is there and they still have replacement parts. Distributions are trying to make their choices. Even tools are as well. The challenge is also some clouds out there which default to much older defaults.18:00
TheJuliaLike... I know one provider is super high end hardware within the last ?2? years.... yet they don't expose v3 vms by default.18:01
TheJuliaso there is that lagging state is part of the feedback loop challnege18:01
TheJulia... actually that gear is likely 3 years old now18:01
cardoeI mean I'm not using those old gross machines, JayF 18:01
TheJuliaheh18:02
TheJuliacardoe: I already said flex was definitely v3 friendly18:02
TheJulia:)18:02
cardoeYeah basically anything using paid Xen will probably not be v3 friendly.18:02
TheJuliaexactly18:02
TheJuliaThis is very similar to getting folks to using UEFI VMs by default18:03
TheJuliadtantsur: I've started pushing back on "want to use old hardware" asks with the reality that can burn resources, and at the end of the day it is a business question of support.18:04
TheJuliaDistributions actually setting some of these guardrails in place actually also benefits projects like ours18:05
dtantsurI'm pretty lucky here: most of my customers want redfish virtual media which puts limits on how old the hardware might be18:05
TheJulia"no, your not using that super old hardware with a pile of CPU security issues"18:05
TheJuliayeah18:05
dtantsurtrue18:05
TheJuliaAnyway, I should like... try to focus on/think about code today18:06
TheJuliaOnce I start my second cup of coffee18:06
TheJulia:)18:06
dtantsur++18:06
dtantsurI got two cups of coffee in the afternoon and now speed-running though the ethics and compliance training :D18:06
TheJuliale sigh18:06
TheJuliaThat is likely going to be next week for me18:07
dtantsurIt's not bad this time around. I think I'll be done in under 1.5 hours (maybe even less).18:07
TheJuliaLast year I think mine ran 4 hours total18:07
TheJuliabecause the state I live in requires specific amounts of time per year18:07
dtantsurThanks god, they cut down on videos18:07
dtantsurAhhh18:08
TheJuliaso I end up with like the management ethics course by default18:08
dtantsurouch :(18:08
TheJuliayeah18:08
TheJuliait is what it is18:08
dtantsurI have a constructive suggestion for them: do 1.5 hours of training, fill the rest with cat videos18:08
TheJuliahehehehe18:08
* dtantsur is capable of watching 2.5 hours of cat videos without any issues18:08
TheJuliaI have an orange cat, he alone is sufficient amusement18:09
dtantsurlucky you!18:09
TheJuliaexcept when he wants to walk across or sit on the laptop18:09
dtantsurcat do what cat must18:10
TheJuliaindeed!18:10
* TheJulia sips coffee18:10
dtantsurFYI folks. Our entire team will have an onsite meetup next week. Expect delayed responses and people being in and out of IRC.18:25
dtantsur(in fact, I won't have access to my IRC bouncer and will need to invent something)18:27
cardoedtantsur: you see my comments about Python 3.6 vs 3.7?18:30
dtantsurcardoe: where?18:30
cardoehttps://review.opendev.org/c/openstack/sushy/+/93491618:31
* TheJulia whispers "irccloud...."18:33
TheJulia:)18:34
dtantsurTheJulia: yeah, I'll probably set it up over the weekend.18:34
TheJuliaHave a great weekend and travel safely!18:34
* TheJulia tries to figure out how she broke image artifact checksum verification suddenly18:35
dtantsurcardoe: responded18:37
dtantsurTheJulia: thanks!18:37
dtantsurSee you folks o/18:37
TheJuliaerr, its the mocking, not my code18:40
TheJuliadoh18:40
cardoeI replied to your reply18:44
iurygregorybye everyone, see you in two weeks o/19:36
TheJuliawoot, in a good palce to make more progress on oci image registry support next week19:54
TheJuliamy substrate/flow changes now have no failing unit tests19:55
TheJulianow to just begin to bolt in the client stuffs19:55
cardoeSo am I able to define extra verifying steps?19:56
cardoeWhen adding a node?19:56
cardoeI'm REALLY REALLY wanting to figure out how to get rid of our custom node ingestion and do it all within Ironic steps/hooks/etc.19:56
cardoeI wanna try to then document some examples for folks.19:56
TheJuliaDunno20:05
TheJuliaI honestly don't remember the verifying steps stuff and why20:05
TheJuliabut I suspect a driver could override/inject at a minimum20:05
dxterslabhey guys! do you know how openstack handles source nat for baremetal devices? is this done with ovn and integrating the ovn gw chassis with the physical network. (assume no gpu)20:57
dxterslabi mean assume no dpu20:57
JayFBtw, cryptography did ship a new wheel that depends on a newer glibc recently, so it is possible that's an actual break.21:28
cardoeJayF: you happen to peek at ruff and see any of the other flake8 modules you think would be good for us to use?21:36
opendevreviewDoug Goldstein proposed openstack/sushy master: enable flake8 logging checks in ruff  https://review.opendev.org/c/openstack/sushy/+/93706821:39
opendevreviewDoug Goldstein proposed openstack/sushy master: enable pyupgrade via ruff to Python 3.7  https://review.opendev.org/c/openstack/sushy/+/93491621:39
TheJuliapity dxterslab didn't hang out21:40
TheJuliacould have answered that21:40
TheJuliaJayF: hmm. Quite possible21:40
cardoeYeah I was actually wanting to discuss that with dxterslab21:48

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!