Tuesday, 2023-02-21

JayFbut I will say, if we have parent/child nodes and don't use chassis I gotta question why we don't rip it out00:00
TheJuliaFor modify, we could call it “transmutate”00:00
JayFalso chassis can be about relationships to nodes... e.g. chassis could contain actual-node and dpu-node00:00
TheJuliaVery very true00:01
JayFTheJulia: /me pokes the sleeping "zapping" verb00:01
JayFI'm not saying it's the answer to use chassis; I'm saying we need to know why we can't use it, and enumerate that in the spec00:01
TheJuliaI think we poured concrete on that verb and left it in Seattle00:01
JayFI found it when I moved here00:01
JayFand if that's not the use case for chassis, we need to find one or nuke it (even if we can't API-nuke it, we can effectively deprecate/remove it)00:01
TheJuliaIn the immortal name of starbuck, Frak!00:02
JayFwe can only kick the can down the curb so many times before we run out of curb00:02
TheJuliaI… do think we should likely nuke chassis. I don’t think I’ve ever chatted with an operator that uses it.00:02
TheJuliaLikely time for a poll00:03
JayFwhy not get to the end of the dpu stuff00:03
JayFmake the case or anti-case for using chassis for it there00:03
JayFand that can be the platform we jump off of for whatever is next00:03
TheJuliaWell, we should get a data point of existing use to help guide the decision00:04
JayFI'm just saying, if we're getting towards actual-composable-hardware and are still punting it away from chassis, it's clear chassis is the wrong model and we need to fix it00:04
TheJuliaWell, compositor is not the dpu case at all00:04
JayFyou didn't read all the dpu abstracts I did then00:04
JayFlol00:04
TheJuliacomposible is you ask a central controller to give you some bare metal in a shape/form00:04
JayFyep, I think I read something about using one of these to like, supply storage to a server as native (?) 00:05
* JayF is worried he might be mixing multiple things00:05
TheJuliaYou can absolutely do that with DPUs00:05
JayFthat's a form of composable hardware00:05
TheJuliaBut you have to configure the dpu to pass it through00:05
JayFwho are you?00:06
JayFare we you?00:06
TheJuliaWhich is a bit separate00:06
JayFthat's sorta the space we're exploring now, yeah?00:06
TheJuliaEh… not actual content/running state configuration since it is dynamic00:06
TheJuliaSay you attach/detach networks in neutron, the card would be on the message bus00:06
JayFlets take it to the spec, you're eod o/00:08
TheJuliaO/00:08
stevebaker[m]TheJulia: the docs are a bit vague https://docs.openstack.org/openstacksdk/latest/user/guides/stats.html, here is a paste of /metrics for my fake workload app  https://paste.openstack.org/show/bhe4cxcCX4A8ZI53VtNO/00:12
TheJuliaThe node action stuff in that is confusing00:31
stevebaker[m]In hindsight, these metrics are not useful to me, I'll continue publishing my own03:19
samuelkunkel[m]<TheJulia> "Say you attach/detach networks..." <- Thats exactly what I will try with mine in the lab once I will find time for it… the guys next to me (which are doing neutron stuff) basically want to run an ovn-controller (+vswitchd / ovsdb) on the dpu itself. Basically terminating the vxlan / geneve tunnel on the dpu and just passing a port of that virtual network to the host system06:01
samuelkunkel[m]Just goes a step more into the direction of treating a baremetal Node more like a VM. Also without something alike you are very limited if you are running ovs / ovn in your cloud for the customer networks06:03
opendevreviewrenliang proposed openstack/ironic master: Fix expired links  https://review.opendev.org/c/openstack/ironic/+/87389607:52
opendevreviewrenliang proposed openstack/ironic master: Fix expired links  https://review.opendev.org/c/openstack/ironic/+/87389607:53
rpittaugood morning ironic! o/08:27
rpittauJasonF, fungi, TheJulia, re virtualpdu: I forwarded the mail/thread to you, the last email has what we need08:35
opendevreviewMerged openstack/ironic-inspector stable/zed: [zed-only] Fix functional tests  https://review.opendev.org/c/openstack/ironic-inspector/+/87448809:20
kubajjMorning everyone09:56
opendevreviewRiccardo Pittau proposed openstack/ironic master: [WIP] [PoC] A metal3 CI job  https://review.opendev.org/c/openstack/ironic/+/86387310:04
jandershey Ironicers o/10:04
opendevreviewMichele Costa proposed openstack/sushy-tools master: Ignore FileNotFoundError when ejecting virtual media image  https://review.opendev.org/c/openstack/sushy-tools/+/87456011:52
opendevreviewMark Goddard proposed openstack/networking-generic-switch master: Add Fake device type  https://review.opendev.org/c/openstack/networking-generic-switch/+/87309812:48
TheJuliarpittau: if you can dump out the headers, that would be good, I think phillip is referring to https://github.com/internap/virtualpdu14:09
TheJuliaMathieu remembers opendev :)14:10
opendevreviewMichele Costa proposed openstack/sushy-tools master: Ignore FileNotFoundError when ejecting virtual media image  https://review.opendev.org/c/openstack/sushy-tools/+/87456014:15
rpittauTheJulia: I have the original mail with the headers, let me find a good place to share that14:31
TheJuliamgoddard: o/ w/r/t 873098, pity you can't really cleanly inject random sleeps into that as well :)14:35
TheJuliamgoddard: well, into _send_commands_to_device you could put a random sleep14:35
TheJuliamgoddard: oh, and https://review.opendev.org/c/openstack/networking-generic-switch/+/743283 appears to need to be rebased :(14:37
rpittauTheJulia, JasonF, fungi, share a google docs with you, please let me know if that's enough14:37
mgoddardTheJulia: I'm open to adding a configurable sleep as a follow up. Being super fast was actually a positive for testing and iterating on the batching feature though14:39
TheJuliamgoddard: ahh, that makes sense14:40
TheJuliacool cool14:40
mgoddardinitially I was using a real device for testing, but lost far too much time14:40
TheJulias/time/sanity/ ? :)14:40
mgoddardI only have time left to lose :D14:40
TheJuliamgoddard: at a high level, is the batching putting entries in etcd, then grabbing them out based upon who can get the lock?14:42
TheJuliadoing the needful/etc14:43
TheJuliamgoddard: nvmd, looks like it kind of does14:47
rpittaubifrost CI is broken, I need another coffee14:48
mgoddardTheJulia: yes - port/network update requests push changes to an input key, kick off a task processing thread, then wait on the result key14:48
TheJulia.. this really pushes me to think that we need to finish the neutron callback stuffs14:49
mgoddardTheJulia: the task processing thread tries to get a lock while there is work in the input queue, then processes items, and puts results on the results queue14:49
mgoddardoriginal task picks up the result14:49
mgoddardthere are a few areas for potential improvement in ironic networking14:50
TheJuliaI guess in theory that might handle multiple neutron services getting requests for the same switch14:50
mgoddardright14:50
mgoddarddid the neutron callback work ever start?14:51
TheJuliayes, but it is a noop in ironic's api afaik14:53
TheJuliaissue is how to convey across the fact the thing has been released14:54
TheJuliashort of some sort of global data structure14:54
TheJuliain the conductor14:54
* TheJulia thinks that is more up to the implementer to figure out14:54
*** JasonF is now known as JayF15:15
opendevreviewRiccardo Pittau proposed openstack/bifrost master: Update shim-signed name for ubuntu jammy  https://review.opendev.org/c/openstack/bifrost/+/87465015:43
rpittauthis ^ should fix bifrost CI15:44
dtantsurrpittau: does it also work with previous versions and Debian itself?15:47
dtantsur(I guess I could wait for the CI to tell us..)15:48
rpittaudtantsur: I realized we're still running focal and looking for a way to change that for jammy only15:49
rpittauand yeah, same thing for debian15:49
TheJuliaThis is... the ?3rd? time they've changed the file name?15:49
dtantsurrpittau: the required_defaults mechanism allows is, although I'd probably just inline a condition in the defaults15:49
rpittaulove unannounced breaking changes15:50
dtantsurwho said backward compatibility?15:50
TheJuliaMight actually be the 4th15:50
TheJuliaheh15:50
TheJuliadtantsur: I was thinking the same thing15:50
rpittauwell let's be happy that the package name is still the same at least15:52
JayFIury and I are doing bifrost discussion and demos at my office hours in ~7 minutes @ youtube.com/jayofdoom15:53
opendevreviewRiccardo Pittau proposed openstack/bifrost master: Update shim-signed name for ubuntu jammy  https://review.opendev.org/c/openstack/bifrost/+/87465016:02
rpittau^ this should work for all the distributions we support, hopefully16:02
opendevreviewJulia Kreger proposed openstack/ironic-prometheus-exporter master: WIP Support extraction of ironic internal metrics  https://review.opendev.org/c/openstack/ironic-prometheus-exporter/+/86950916:22
espenflDear everyone. I have been working a bit to get serial to work consistently for us (our configured servers sends ttyS1 over SOL so we need serial to go there). Have it all working up until the restart after the deploy ramdisk have copied the image (meaning the ipxe boot script also works over serial). But we have not yet found an easy way to pass kernel parameters to DIB16:29
espenflwhen building an image. Specifically for instance things related to the console. I think this is needed to make sure the deployed node also gives us serial, not just serial during deployment. Would be very happy for any comments or suggestions here. The element enable-serial-console uses getty etc. and does not give us serial on boot. Thanks in advance.16:29
opendevreviewJulia Kreger proposed openstack/ironic-inspector master: Use UTC for the timezone in functional tests  https://review.opendev.org/c/openstack/ironic-inspector/+/87466116:39
TheJuliaI could have sworn I've fixed ^ before16:43
TheJuliaanyway, with that, plus the already merged db test, I'm able to run python-ironic-inspector-client's functional test suite locally. tl;dr zed ironic-inspector will need to be released before the CI job works16:45
TheJuliaespenfl: so basically you need to edit the bootloader config on the image16:46
TheJuliaespenfl: you need something like https://github.com/openstack/diskimage-builder/blob/b4f768117f8805487799829da84883266e5575f2/diskimage_builder/elements/iscsi-boot/finalise.d/51-open-iscsi-config#L14  I think it would be good if you edited the grub element to support adding an argument to the grub command line16:48
opendevreviewMerged openstack/sushy-tools master: Ignore FileNotFoundError when ejecting virtual media image  https://review.opendev.org/c/openstack/sushy-tools/+/87456016:51
dtantsuriurygregory: nice demo, great job!16:51
JayFThat was excellent iurygregory, and thank you dtantsur for chiming in and being active in chat16:51
dtantsursure thing, I hope I didn't annoy you too much :-P16:52
opendevreviewRiccardo Pittau proposed openstack/bifrost master: Update shim-signed name for ubuntu jammy  https://review.opendev.org/c/openstack/bifrost/+/87465016:55
JayFdtantsur: quite the opposite. Annoying is a white sheet of paper for chat and a 0 viewers number staring back at me  :D16:56
dtantsurright :)16:57
rpittaugood night! o/17:00
iurygregorydtantsur, tks!17:02
fungirpittau: TheJulia: JayF: clarkb and i are travelling for meetings all week and ianw is otherwise unavailable, but i'm going to put it on the agenda for next week's opendev meeting to come up with a plan17:11
JayFthank you o/17:12
fungirough plan that i'll propose though is that we'd announce the intent to hand over control of the repository based on preliminary agreement from one of the people who currently has maintainership, and make sure we cc all the maintainer addresses just to give them every chance to object, then if we hear nothing before the scheduled handover we move forward with it17:25
TheJuliafungi: ack, thanks17:25
espenflTheJulia: Awesome. Thanks for pointing me in the right direction. So we basically just make a new element based on for instance the bootloader and use that with whatever we need. Right?17:26
TheJuliaespenfl: I'd add it to the grub element personally and upload it back up to gerrit, you have a couple folks in this channel with core rights on diskimage-builder17:26
opendevreviewDmitry Tantsur proposed openstack/ironic master: Restructure the inspector module in preparation for its expansion  https://review.opendev.org/c/openstack/ironic/+/87467717:43
espenflTheJulia: Happy to do so. Also opened this which somewhat on the same topic: https://bugs.launchpad.net/diskimage-builder/+bug/2007836.18:03
TheJuliaespenfl: ohhh, good find18:21
TheJuliaespenfl: I'd just propose a change to add the capability. ianw added that line 5 years ago, so I suspect nobody has brought up that case18:22
espenflOkey, I will try to look over the serial things so that it is more consistent when looking into it and submit a suggestion based on that. I also saw similar things here and there.18:27
TheJuliadiskimage-builder is much more "what works for people" driven18:33
opendevreviewJulia Kreger proposed openstack/ironic-specs master: WIP: Framework for DPU management/orchustration  https://review.opendev.org/c/openstack/ironic-specs/+/87418919:16
TheJuliaJayF: added words w/r/t chassis.19:16
stevebaker[m]good morning19:23
JayFTheJulia: a marked improvement; although admittedly I don't know enough about the hardware to have a strong opinion :/19:25
NobodyCamGood Morning Ironic Folks!19:28
JayFo/19:28
NobodyCamhey hey JayF 0/19:29
TheJuliao/ NobodyCam 19:32
TheJuliaNobodyCam: you should go read https://review.opendev.org/c/openstack/ironic-specs/+/87418919:33
NobodyCam0/ TheJulia 19:33
TheJuliaoh, you already did!19:33
NobodyCamhas it changed sense I added my +1?19:33
NobodyCam;p19:34
NobodyCamhehehehe19:34
JayFmy +2 will cost a 4090; dm for address /s 19:34
NobodyCamLoL19:35
JayFfor, uh, important hardware testing19:35
TheJuliaKSP2 drops in just a few days19:35
TheJuliaand apparently, it will demand I get a new machine.19:35
NobodyCamhehehehehe19:35
TheJuliaIn regards to modify steps, or maybe I should just call it ?service steps?, is is does basically enable with our plugin model that someone *could* create a snapshot overlay module19:39
TheJuliawhich could do things like "snapshot my stuff"19:39
JayFI liked the idea of calling the states servicing/service wait19:39
TheJuliaThe idea is sort of growing on me19:41
TheJuliaand in part because preventitive maintence or problem identification is still inherently service19:41
TheJulia... Not that I'm going to point the thermal camera at the physical server19:41
opendevreviewJulia Kreger proposed openstack/ironic-prometheus-exporter master: Support extraction of ironic internal metrics  https://review.opendev.org/c/openstack/ironic-prometheus-exporter/+/86950919:47
TheJuliaiurygregory: I finally figured out I swapped the conditional. Doh!19:47
iurygregoryTheJulia, yay!20:19
TheJuliaiurygregory: woots, passes, and the curl has combined metrics21:32
TheJuliaI do really like it grabs everything21:38
iurygregoryTheJulia, Nice! tomorrow I will take a look o/22:42
TheJuliao/ vanou 23:00
vanouHi TheJulia o/23:00
TheJuliaHow are you doing today?23:01
vanouGood. I had nice sleep :)23:01
TheJuliaAlways good! :)23:01
vanouI'll work on internal task23:02
vanouAfter this review23:02
vanouThanks for review on that backport patch. Let's make sure what the backport means23:03
vanouWhat point is unclear for you?23:03
TheJuliaWhat I'm struggling to understand is how the first patch is needed for the second patch. I don't see where it is used23:04
TheJuliaso I'm wondering if I'm just understanding something, or if there is context which is not making it through23:04
vanouAh. OK. I explain23:04
vanouWhen I make submit first draft on first patch, function of version checking and storing in DB will be used on second patch. However after discussion with you and JayF, such functionality is not needed on second patch because second patch uses try-fallback approach.23:07
vanouSo there is no functionality/code in first patch which second patch needs to work.23:09
TheJuliaso we could just detach the second patch from the first?23:09
vanouYes.23:10
vanouWhen we dertermine to do try-fallback approach on second patch, we could decouple these 2 patches.23:11
vanouThis is context on these 2 patches.23:14
TheJuliaokay, then yeah, decoupling makes sense in my mind23:14
TheJuliathe latter patch just makes sense, the first patch.... *really* that is just a verify step which can be considered a feature23:15
TheJuliaI did a rebase of the second patch locally, and I'll share the diff I got, all which I believe is not needed by the second patch.23:15
TheJuliavanou: https://paste.openstack.org/show/bAa8YyvMmdmFwXhAJK9n/23:16
vanouYes. First patch just provides verify step to check http/https connection and, if fail, warns that failure.23:17
vanouThanks for diff. Yes code of first patch is not needed by second patch23:18
vanouHowever to tell operator the version of bmc firmware doesn't support https, first patch is helpful I think23:19
vanou^sorry s/https/http23:20
vanouIf you have another things you want to make sure, please tell me.23:28

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!