Thursday, 2024-02-08

*** dmellado7452 is now known as dmellado74500:15
opendevreviewJay Faulkner proposed openstack/ironic-tempest-plugin master: Basic API tests for sharding  https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/90674900:40
JayF^ should be ready for review, I'll #ironic-week-prio it tomorrow once it V+1s, but if you wanna sneak preview review it I won't mind00:41
TheJuliaCool, I’ll take a look tomorrow. Thanks Jay!00:41
KangieHi all, having finally worked around the... "fun" nature of my employer's network and managed to get bifrost-cli setup from repos in a VM, what's going to be the best way to do a test virtual deployment? Ideally I'd like to run ironic and a few test VMs on the same host on an internal network to get familiar with it before proceeding on prod hardware. Is it sane to set the06:15
Kangieprovisioning network as an internal net and use DHCP/PXE? The libvirt guide cautions against it however I suspect that has to do more with managing bare metal servers at the same time than any real limitation06:15
KangieOtherwise a few VMs and a sushy emulator seems reasonable enough if I'm reading my docs right?06:16
rpittaugood morning ironic! o/08:44
dtantsurKangie: there is a testenv mode that can be used in complete isolation from your production environment. I'd avoid trying to use the same bifrost to do both: it's possible, but the configuration may be overwhelming at first.09:13
iurygregorygood morning11:16
opendevreviewDmitry Tantsur proposed openstack/ironic master: Online migration for inspect_interface inspector->agent  https://review.opendev.org/c/openstack/ironic/+/90739812:24
dtantsurTheJulia, JayF ^^^12:24
opendevreviewDmitry Tantsur proposed openstack/ironic master: [WIP] Add inspection PXE filter  https://review.opendev.org/c/openstack/ironic/+/90799113:03
dtantsurTheJulia, hjensas ^^ without purging with a doc addition13:04
hjensasdtantsur: thank, I like the "leave the purging to whatever is managing the service approach" :)13:51
TheJuliagood morning13:59
opendevreviewMerged openstack/ironic-python-agent master: Trivial: avoid deprecated utcnow  https://review.opendev.org/c/openstack/ironic-python-agent/+/90729814:43
iurygregoryhttps://nvd.nist.gov/vuln/detail/CVE-2023-40547 Shim CVE 14:45
TheJuliaYeah, when downloading using http14:59
JayFI saw that. It's nice when the security issues that might impact your users are juuuuuust over the horizon of scoped into stuff we care about lol16:04
TheJuliadtantsur: do you remember how tight the https cert validation is ?16:23
dtantsurDefine "tight" please16:24
TheJuliaspecifically we've run into issues with clocks recently where it seems a firmware timezone is mucking with the time16:26
dtantsuryeah, we've run into that too recently16:28
TheJuliaon hpe gear16:28
TheJulia?16:28
dtantsurnot sure any more, sorry. possibly.16:28
dtantsurTheJulia: https://opendev.org/openstack/ironic-python-agent/src/branch/master/ironic_python_agent/config.py#L77-L8016:29
TheJuliaack, thanks16:29
dtantsurwe may bump that to 24 hours if we keep seeing timezone problems16:29
clarkbI think there is still a dos in the content length handling in shim, but if you're talking to a nefarious http server you have bigger problems. If you set a terabyte large content-length value you'll clear the new check around buf sizes (because a terabyte is larger than whatever you read) then fail to allocate a buffer for it.16:34
clarkbbut also its been many years since I dealt with C so my reading may be wrong16:35
JayFFor ironic cases, if you're close enough to own the http server, you're likely close enough to screw with PXE as well16:35
JayFso this only really is something that's impactful for UEFI HTTP boot + vmedia, I think (?) 16:35
dtantsurThe vmedia case have everything inside the ISO16:36
JayFyeah, well I also realized I'm thinking only about provisioning actions cases16:37
opendevreviewDmitry Tantsur proposed openstack/ironic master: [WIP] Add inspection PXE filter  https://review.opendev.org/c/openstack/ironic/+/90799116:37
JayFand there may be something in this category around deployment on disk16:37
JayFbut we use grub shim from locally installed package on OS, yeah?16:37
JayFso that is not an issue16:37
opendevreviewJulia Kreger proposed openstack/ironic master: docs: add additional content for host clock skew  https://review.opendev.org/c/openstack/ironic/+/90851116:52
TheJuliadtantsur: ^16:52
TheJuliaso, httpboot in general, since there are two modes, the vulnerability is when your downloading grub as part of the pure network boot path where basically shim expects to be able to grab grub at $CWD/grubx64.efi16:54
TheJuliathe key is you can still feed a stock shim with a signed artifact16:55
TheJuliafrom disk, doesn't apply of course16:55
rpittaugood night! o/17:30
* TheJulia looks at https://docs.openstack.org/kolla-ansible/yoga/reference/bare-metal/ironic-guide.html and twitches from reference to pxelinux.017:54
opendevreviewJulia Kreger proposed openstack/ironic master: docs: augment admin troubleshooting docs for system scope context  https://review.opendev.org/c/openstack/ironic/+/90820318:21
iurygregoryTheJulia, the most recently clock skew issue was in a Dell Poweredge R650 18:33
TheJuliafunky18:38
TheJuliaso, likely we do just need to extend the skew18:38
JayFmaybe find a way, if properly configured, to have IPA to set the BMC clock on boot, perhaps?18:39
TheJuliadtantsur: so, out of curiosity, could a node get stuck in inspectfail ? I'm thinking not really, but it is a fear screaming in the back of my head18:40
TheJuliaJayF: BMC clock is not the issue, although most people mistake it for the host clock18:40
dtantsurTheJulia: should not be possible, you can always "manage" or whatever it18:41
TheJuliaYeah, just worried we're requiring humans to know to get it out of that state, but... yeah18:41
dtantsurDoesn't it apply to all fail states? I might be missing some context18:41
TheJuliain terms of the online upgrade, but I guess it is fine, really18:42
TheJuliaI'm likley just over thinking18:42
dtantsurIt's a good topic to discuss. Potentially, tear down may fail with the new interface. BUT! I'm now seriously wondering if we have any tear down at all :D18:43
* dtantsur should go packing18:46
TheJuliapacking?!?18:50
dtantsurleaving for vacation on Saturday18:51
TheJulia\o/ vacation18:53
TheJulia.... I need a vacation and it is just Feburary 8th....18:53
JayFhttps://zuul.opendev.org/t/openstack/build/bf6c998654024902b5281a0d745ea72521:32
JayFI set max_microversion on my change21:32
JayFbut yet still > Version 1.89 was requested but the minor version is not supported by this service. The supported version range is: [1.1, 1.87].21:32
JayFin 2023.2 jobs (and I suspect zed jobs are failing for similar reasons)21:33
* JayF trying to figure out what he's missing21:33
JayFs/max_/min_/21:34
JayFhmm21:40
JayFdo those jobs need max_microversion set in config?21:40
JayFI think tempest is thinking the max microversion for 2023.2 is 1.87 when in reality it should be 1.89?21:41
TheJuliaafaik, no21:42
iurygregoryJayF, release mappings is correct for 2023.2?21:42
JayFThere's def. something weird going on21:42
JayFThat's exactly what I'm looking for21:42
iurygregorybecause I only see 2023.1 there21:42
iurygregoryhttps://github.com/openstack/ironic/blob/master/ironic/common/release_mappings.py21:42
JayFyikes, that's a big miss21:43
iurygregorymaybe is related, not sure21:44
JayFthat file is missing a LOT of information21:44
JayFthat's for damn sure21:44
iurygregoryI will try to look tomorrow21:44
JayF23.0.0 was our 2023.2 release21:44
JayFand is not in that file at all21:44
iurygregoryneed to go grab dinner with the family =)21:44
JayFI mean, I'm looking now21:44
JayFso hopefully you'll have a review or it'll be fixed 21:45
TheJuliaso why 1.89 ?21:45
JayFsharding.21:45
iurygregorysharding was before 1.89 no?21:45
JayFhttps://docs.openstack.org/api-ref/baremetal/#id236 says 1.82 (!?)21:45
TheJuliayeah, should be 1.82 I think21:46
iurygregoryyeah21:46
iurygregoryhttps://github.com/openstack/ironic/commit/8e34d622aff72d7dd286add31e3d7cd366629bc2#diff-5e4dd7b8baeac0d222e2f6d734549d37b1487f67f5f1dd339101de3df3d5ebbb21:46
JayFyeah it's 8221:46
JayFwe still are missing some mappings21:46
JayFand that should be fixed21:46
JayFbut is a different break21:46
iurygregorycorrect21:46
TheJuliasign, https://github.com/openstack/ironic/blob/master/ironic/common/release_mappings.py#L657 needs to also have 2023.221:46
JayFyeah, that's what we were talking about21:46
JayF2023.2 is missing from that file, we have releases missing21:46
JayFgoing to fix my tempest change then push a fix for that21:47
TheJuliayeah, but that should be entirely disjointed from api microversion behavior21:47
TheJuliaThat is upgrade aliasing, really21:47
TheJuliaA version released only knows master, that is to resolve the list in case of human21:47
JayFyeah the microversion stuff is 100% I have 1.89 in my tempest, I need 1.8221:47
JayFthis is another related issue found while hunting for the root cause here21:48
opendevreviewJay Faulkner proposed openstack/ironic-tempest-plugin master: Basic API tests for sharding  https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/90674921:48
TheJuliaoh, you need api_microversion as well as min_microversion, I think21:54
JayFthat is not true if other examples are to be believed21:54
JayFit looked like it's doing the right thing, just was requesting a too-new version in 2023.2 case21:54
JayFbecause I told it a too-new version21:55
JayFso even if that's not right I wanna see the output and understand where the other would come into play21:55
opendevreviewJay Faulkner proposed openstack/ironic master: Fix release mappings for 2023.2  https://review.opendev.org/c/openstack/ironic/+/90853021:55
JayFI made ^ that by looking at a diff of object and api directories between 22.1 and 23.021:56
TheJuliahttps://github.com/openstack/ironic-tempest-plugin/blob/master/ironic_tempest_plugin/tests/scenario/ironic_standalone/test_basic_ops.py most of these use api_microversion, not min_microversion, fwiw21:56
JayFso hopefully it should be right21:56
JayFscenario != api tests21:56
JayFscenario tests have a different method for skipping afaict21:56
TheJuliaahh21:58
TheJulialooks like the mapping on rpc is wrong21:58
opendevreviewJay Faulkner proposed openstack/ironic-tempest-plugin master: Basic API tests for sharding  https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/90674921:58
TheJuliathe branch has 1.5821:58
JayFhow does the branch get a newer RPC version than master?21:59
JayFoh, I get what you're saying21:59
JayFI read that as "release mappings in branch has..." and you meant "actual rpc version on branch is..."22:00
TheJuliayes22:00
opendevreviewJay Faulkner proposed openstack/ironic master: Fix release mappings for 2023.2  https://review.opendev.org/c/openstack/ironic/+/90853022:00
opendevreviewJay Faulkner proposed openstack/ironic-tempest-plugin master: Basic API tests for sharding  https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/90674922:48
NobodyCamgood afternoon Ironic folks22:56
NobodyCamis there a reason we block setting owner with node create?22:56
NobodyCam`openstack baremetal node create --driver ipmi --owner None --name cjk-test-deleteMe`22:57
NobodyCam`The attribute(s) "owner" are invalid; they are not needed to create nodes.`22:57
JayFWhat version?23:00
JayFfor both cli and api23:00
NobodyCamantelope23:00
ashinclouds[m]Use a system scoped account23:00
NobodyCamand I'll bet I'm really old on cli... /me checks23:00
JayFI would use latest cli for sure23:01
JayFeven on older ironic23:01
NobodyCam```ckrelle$ openstack --version23:01
NobodyCamopenstack 5.5.0```23:01
NobodyCamlatest broke our ussuri regions :(23:01
NobodyCam...broken in our...23:02
TheJuliaWhat version of python-ironiccclient23:02
TheJuliaNobodyCam: are you attempting to create a node with None as an owner to avoid the auto-map to your user's project ?23:09
TheJuliaAssuming your using a project admin and not a system scoped admin23:09
TheJuliaoh wow, i think i found it23:20
TheJuliayup, we have an explicit list23:21
TheJuliaand the way osc stuff is tested, we never hit it23:22
NobodyCamhttps://www.irccloud.com/pastebin/iZ2xDBPu/23:24
JayFTheJulia: what's the issue?23:24
NobodyCamand yep having user add nodes for a different project23:25
NobodyCamyes they are admin23:25
TheJuliarofl, one moment!23:26
TheJuliapatch inbound23:26
opendevreviewJulia Kreger proposed openstack/python-ironicclient master: Fix on-creation ability  https://review.opendev.org/c/openstack/python-ironicclient/+/90853423:29
JayFTheJulia: Should we filter creation fields at client level at all?23:29
JayFTheJulia: is there a reason we shouldn't just let the API complain?23:30
JayFI have some valid answers to this question; I'm curious which one we're going with :D 23:30
TheJuliaI dunno.... I just found it and kept laughing23:30
JayFI am going to +2 this23:30
JayFbut I think I'd also be +2 to something that just ... didn't check that list23:30
TheJuliathe last four + firmware_interface were missing23:31
JayFeveryone of of those represents a massive failure in software development, review, and processes.23:31
JayFexcept for the shard one, that one is okay23:31
JayF:P 23:31
JayF(that is entirely a joke to be clear)23:31
TheJulia*we* do validation on the api side, and we even have some testing on that, so I don't see why the API shouldn't be the one23:31
TheJuliaI just dunno why every single creation has extra logic wrapped around it in the OSC commands23:32
TheJuliamaybe it is some convention we inherented that we were sort of blind to unless we tried on create manually23:32
TheJuliathe frustrating thing is how the test style works, we never actually hit where the list is validated23:32
NobodyCamThank you! :bow23:33
TheJuliaNobodyCam: by all means pull it down and give it a spin23:34
TheJuliaI don't have a running deploy locally, but that is *exactly* where the error is coming from23:34
NobodyCam+++23:34
TheJuliaJayF: funny thing is, you suggested earlier in the week, we might want to set some owners on node create23:34
TheJuliaNobodyCam: but seriously, why are you setting it to None?!23:34
* JayF all the sudden wonders what kinda havoc a project named "None" could cause in OpenStack23:35
TheJuliaMuahahahahahaha23:35
NobodyCam(matching Ussuri behavior) until we can get things updated else where23:35
NobodyCamLoL23:36
TheJuliaso, did you by chance read the troubleshooting.rst file I posted?23:36
NobodyCam:( no23:36
* NobodyCam looks23:36
TheJuliaone moment23:36
TheJuliahttps://review.opendev.org/c/openstack/ironic/+/90820323:37
TheJuliaso, yes, it is auto-magically record your creating project ID most likely23:37
Kangiedtantsur: I've looked at testenv - does it "just work" in a 'separate VMs' configuration (i.e. I'm deploying directly into a QEMU/KVM VM on infrastructure that I'd prefer to use for the task, not onto the bare metal running the hypervisor)? The docs don't really say but I gather maybe nested virtualisation is how it's supposed to work? I don't mind learning some "fun" extra23:37
Kangieconfig bits if configuring a few VMs attched to a virtual provisioning network with DHCP is straightforward and closer to what I'll need to do in a real env23:37
JayF> AssertionError: Versions (23, 0) and (2023, 2) are not sequential23:37
TheJuliaSetting it to a string None is entirely different, if you enforce only new policies, it will disappear23:37
JayF*sigh*23:37
NobodyCamanything that has `Don't Panic` in it can't be bad23:37
JayFKangie: you know ansible?23:38
JayFKangie: it's all ansible.23:38
KangieI will by the time I'm done with this...23:38
JayFheh, good answer23:38
KangieI've already reverse engineered a bit to get around restrictive firewalls23:38
TheJuliaJayF: the assertionerror makes me want to make a margharita and exercise23:39
JayFYeah, basically we use a tool called sushy-tools or VirtualBMC to emulate BMCs23:39
JayFredfish and IPMI respectively23:39
KangieI'm leaning towards sushy and targeting redfish - all our hardware should support it. I hope :D23:39
JayFyeah, so if you're looking for a testenv like that, you'd deploy sushy-tools somewhere (probably network-location close to the hypervisors if it were me)23:39
TheJuliaNobodyCam: anyway, if your not enforcing the new policy, then the field shouldn't *really* have any impact23:40
JayFand then use the existing testenv setup as a template to create "fake nodes" for sushy-tools to talk with and emulated redfish BMCs for Ironic to talk to mapped to them23:40
JayFKangie: I will close this by saying: we tell people not to run sushy-tools in production. If you let sushy-tools mess around with hypervisors in your prod environment, and it breaks a thing, we'll :( and have empathy, but you should be VERY CERTAIN you're comfortable with the changes it'll do before colocating it on a hypervisor with anything23:41
NobodyCamTheJulia: yea I used the set command to fix a couple node, then checked node create help and saw owner so told the enroll folks to use owner in there create and it ran into the issue you have provided a patch for23:41
TheJuliaNobodyCam: ahh!23:41
NobodyCamit a project visibility for scheduling services above us23:41
TheJuliaAhh23:42
TheJuliaso hmm23:42
NobodyCamin our newly deployed antelope region23:42
TheJuliaNobodyCam: so for your RBAC context, let me get some code for you to be aware of23:42
TheJuliaohhh ahhhhhh23:42
NobodyCamwe hit the glance rbac restriction23:43
TheJuliaNobodyCam: https://github.com/openstack/ironic/blob/master/ironic/api/controllers/v1/node.py#L2812-L283123:43
TheJuliaowned_node in that context is if your allowed to create a node23:43
TheJuliadepending on your workflow, you might need a custom policy as it applies to that entire path23:44
TheJuliaOr, you just create in one project and move depending on your workflow23:45
NobodyCam+++23:45
NobodyCamThank you23:46
KangieRight. Thanks JayF. Another reminder that it would be ideal to have actual development environment hardware.23:47
KangieI guess the alternative is nested virtualisation.23:47
JayFThat is what 99.999% of us do for our dev environments23:48
JayFit's not like you put actual workloads in the machines you boot23:48
JayFby default, we use an OS called "cirros" which is a linux distro basically designed for what we use it for (yep, it booted, done)23:48
TheJulia(I'll note, the power bills for rackmount servers can also be cost prohibitive....)23:51
TheJuliaSays the crazy lady with like 16kW of solar23:51
NobodyCam:p23:57
NobodyCamBitcode mining at TheJulia house23:57
NobodyCamhehehehehe23:57
NobodyCams/bitcode/bitcoin/23:57
KangieTheJulia: I'm in the process of building node management for a new HPC cluster...23:58
Kangie(and retrofitting it to part of our existing cluster)23:59
KangieI also have a 42U rack in my garage :D23:59
JayFKangie is another person I dragged over here from the gentoo community with promises of "open source clouds" and "hardware provisioning"23:59
TheJuliaKangie: awesome!23:59

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!