Friday, 2023-09-29

JayFTheJulia: it's not an execute00:39
JayFTheJulia: it's doing an open on a file in /sys to see if it's ro, similar kinda pattern in the other one00:39
TheJuliadoh!00:40
TheJuliathat is not *that* bad, but yeah00:40
JayFwhen I say scary I just mean like, the dread when I saw it erroring *trying to erase /dev/sda* because my *real /dev/sda* is r/o00:45
JayFnothing is that scary once you know what's really going on :D00:45
dtantsurTheJulia, yeah, I know, that's great progress (re VirtualMedia in Systems). Now we only need to implement that..07:34
opendevreviewVerification of a change to openstack/ironic stable/2023.1 failed: redfish_address - wrap_ipv6 address  https://review.opendev.org/c/openstack/ironic/+/89579509:25
opendevreviewMerged openstack/ironic-inspector stable/zed: Handle bracketed IPv6 redfish_address  https://review.opendev.org/c/openstack/ironic-inspector/+/89632409:39
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Enable the new inspection implementation  https://review.opendev.org/c/openstack/bifrost/+/89692510:38
dtantsurfolks, looking for bifrost reviews: https://review.opendev.org/c/openstack/bifrost/+/896143 and https://review.opendev.org/c/openstack/bifrost/+/89169310:40
dtantsurboth are pretty important and pretty simple10:41
opendevreviewDmitry Tantsur proposed openstack/bifrost master: [PoC] Life without Inspector  https://review.opendev.org/c/openstack/bifrost/+/88793410:47
alaysdhi everyone10:53
alaysdis there a way we can use IRONIC / Bifrost to PXE boot an ESXi VM without OS ?10:53
alaysdI donot have access to any BMC, I want to PXE boot it, is it possible?10:53
dtantsuralaysd, hi. you can probably make it work by manually configuring https://docs.openstack.org/ironic/latest/admin/agent-power.html10:54
dtantsurIt's not going to be a straightforward out-of-box experience though.10:54
alaysdokay, I shall check10:55
alaysdBut do we need "driver" to PXE booting a ESXi VM? Can we somehow bypass that? 10:56
alaysdfor physical nodes I do have IPMI so that option seems to work and can sucessfully install OS 10:56
dtantsurplease check the link, it mentions the manual-management driver10:57
alaysdok, thanks 10:57
alaysdbut how is the device discovered by Bifrost machine?11:12
alaysddoes my bifrost server need to be in the same subnet as my ESXi VM? Also, what ACL or next-server configurations do I need to set on Subnet so that PXE boot happens from bifrost ?11:13
alaysdDtantsur, I was just seeing your video Bifrost 101 11:14
dtantsurBifrost provides a DHCP/PXE server, which has to be on the same L2 network as the VM11:14
dtantsurEssentially, everything is similar to a bare-metal node, just without power control11:15
dtantsur(you may also need to enable DHCP traffic in your hypervisor, it's often blocked)11:15
alaysdI guess the DHCP traffic is enabled as the VM is trying to look for PXE server and is erroring out saying "No DHCP or proxyDHCP offers were received"11:17
dtantsur"No offers received" does not sound like enabled to me11:18
dtantsurIt may be called something like promiscuous mode for the vNIC11:18
alaysdare you asking if the VMs can take DHCP IPs?11:20
dtantsurI'm asking if DHCP traffic is allowed to the VMs11:20
dtantsurAt least some hypervisors block it by default11:20
dtantsurIf it is, you need to use tcpdump to see why DHCP requests or offers are not received by the other side11:21
alaysdI m sorry to sound dumb, very new to OpenStack 11:22
alaysddo I need to take the tcpdump on the ESXi VM ?11:23
alaysd@dtantsur, please help :( 11:28
dtantsuralaysd, probably start with tcpdump on the bifrost node. And check the networking settings for the thing I mentioned.11:37
dtantsurI'm trying to help, but you're doing something very non-standard with, as you say yourself, lacking knowledge. That won't be easy. You'll need to do a lot of experimenting and researching.11:37
iurygregorygood morning Ironic11:38
mallikdtantsur, ping11:51
dtantsuro/11:51
mallikdtantsur, wanted to check quickly about inspector in iornic. I have setup master devstack around 2weeks back. I tried doing inband inspection using inspector in ironic. .11:54
mallikdtantsur, I am seeing this error   Failed to start inspection: ['internal', 'public'] endpoint for         |11:54
mallik|                        | baremetal-introspection service in RegionOne region not found 11:54
Nisha_Agarwaldtantsur, is there any documnetation for latest inspector usage in ironic?11:55
Nisha_AgarwalDo we need to create inspector endpoint seperately even if it is merged with ironic?11:57
dtantsurare you two asking the same question or just two different questions? :)11:57
Nisha_Agarwalsame11:57
Nisha_Agarwal:)11:57
dtantsurright11:57
dtantsurDevStack has not been changed to use the new inspection approach.11:57
dtantsurYou won't need to crate an inspector endpoint because there will be no inspector. But our devstack plugin still uses inspector.11:58
Nisha_Agarwalok so if we have to use inband inspection in latest ironic how do we proceed?11:58
dtantsurlike before. nothing has changed so far.11:58
Nisha_Agarwalso inspector service has to be brought up?11:59
iurygregoryyup ^ 11:59
Nisha_Agarwalok if we have to use latest inspection code merged in ironic, how can we do that?12:01
dtantsurWhy are you trying to do it? We'll provide documentation, devstack changes, etc in time.12:01
Nisha_Agarwal:)12:01
mallikdtantsur, is the new inspection not targeted to be completed in bobcat release?12:04
dtantsurIt was not targeted and has not been completed.12:04
mallikdtantsur, ok thanks.12:04
dtantsurIt's ready for experiments already, but that's it. The big opening is expected in Caracal.12:04
mallikok12:05
alaysdhow to know the PXE server from which it has booted?12:34
alaysddtantsur please12:40
alaysd:(((12:40
alaysdI know its non standard but if possible and you have some time then please tell me 12:40
alaysdi want to know atleast the PXE boot happened then it is from the server I want12:42
dtantsuralaysd, I can give you some directions, but I cannot debug your environment for you. You can use tcpdumps on DHCP ports, you can check the logs of the appropriate services (dnsmasq, for example), you can look at the VM's virtual console during boot.12:49
TheJuliaGood morning12:50
* TheJulia sips coffee and reads email12:50
dtantsurmorning TheJulia12:52
dtantsurTGIF!12:52
TheJuliaindeed!12:54
TheJuliaI think today, I'm going to hack on sushy-tools12:56
dtantsurEnjoy!12:56
TheJuliaplus a customer meeting which I really just need not attend12:57
dtantsurI hope to finish Outreachy business this week12:57
dtantsurheh12:57
TheJuliadtantsur: w/r/t virutalmedia in systems, would it be helpful to at least raise visibility of the link in sushy? I was hacking on sushy yesterday for uri booting12:59
TheJuliaand added BootProgress12:59
TheJuliaI'm not sure the on system options object is... well defined based upon what I glanced at yesterday12:59
dtantsurit would be helpful, I just never had time to do it13:10
TheJuliaack, I'll see what I can do13:11
* dtantsur is going through the history of our Outreachy participation and that is making him upset13:14
TheJulia:(13:16
TheJuliaerr, maybe worth waiting. Dell only has the config object with a single field, HPE has not implemented v1.1813:38
opendevreviewDmitry Tantsur proposed openstack/ironic master: Do not store ramdisk logs as part of the inventory  https://review.opendev.org/c/openstack/ironic/+/89694113:42
TheJuliaIf possible, I'd appreciate a few quick reviews on metalsmith branches so I can un-wedge ipa. Specifically https://review.opendev.org/q/Id258b294972c2f5ea816cb274773b8947f92e11013:43
drannou1TheJulia : Thx for the Rescue info, I'm building my own IPA, debian based, and discover that the "wheel" group was not created and so no 'rescue' user was created. When you say that the DHCP should be automatically trigger, how is it done ? neutron should make a shut/noshut on TOR config ? or something trigger by the IPA ?14:05
TheJuliadrannou1: I believe the basis is on on-exit script to be triggered upon IPA's exit14:20
TheJuliaerr, an on-exit14:20
TheJuliaI'm in the middle of a email, I can look when I'm done14:20
TheJuliaokay! chairy email sent14:24
TheJuliadrannou1: you figured out the issue with the user... hmm... what is the dhcp path taken14:31
TheJuliaI think you may have found a bug!14:38
TheJuliaso, the way it is *supposed* to work, is that upon exit, dhcp is supposed to be re-triggered14:38
TheJuliaWe did that in the days of original coreos, we do that in tinyipa, I don't see that happening with the DIB based ipa images14:39
TheJulia... I'm wondering if everyone has been testing them on the same flat network, so it just works14:40
TheJuliadrannou1: so, for my context, I'm guessing your using the "neutron" network_interface, static l2 network, different l3 networks between your tenant workload and your rescue network where the ramdisk boots? I'm guessing no ML2 switch port driver as well?14:46
TheJuliadrannou1: for building, are you using ironic-python-agent-builder at all?14:47
drannou1TheJulia: We are using our own neutron driver implementation, which vlan based. So the "customer" is using his own vlan, and we are forcing a specific rescue vlan, which host the TFTP server (so a "classical" infrastructure I would say)15:08
drannou1For the build we are using    ironic-python-agent-builder  --release bullseye debian15:09
TheJuliadrannou1: Interesting, When we tear down the vif, we expect the port to go down, I guess line carrier is held up in your case?15:09
TheJuliathe line carrier dropping *would* normally force dhcp to re-trigger15:10
drannou1Yeah that's what we where thinking about, but We don't see neutron asking for a "down"15:10
TheJuliaso we did it in the neutron ml2 plugin if memory serves15:10
drannou1give me a few tens of minutes (booting a server is sooooo long), I'm testing it15:13
TheJuliano worries15:13
TheJuliaso looking at networking-generic-switch, we shutdown the port from the unbind operation of the prior vif when we move it to the rescue network15:14
TheJuliathen we re-setup the port15:14
drannou1we don't saw that on the other drivers, but may be we missed it, we are checking15:16
TheJuliaYeah, it is sort of super well hidden deep inside the base driver code in networking-generic-switch, and afaik some of the vendor SDN stuffs out there work the same way15:23
drannou1ok so Here we are : We manually made the down up on the switch (shut/noshut), I see in the host VNC the link going down and UP, but no DHCP renew append15:24
drannou1of course if I manually restart dhclient, it will make the DHCP request15:26
TheJuliaI guess there is not something like NetworkManager running....15:27
TheJuliahmmm15:27
drannou1but I fill that it would be better if IPA force it15:27
TheJuliaSo I guess the path forward is *likely* code to explicitly bounce dhcp by default if visible like we do with tinycore testing15:28
TheJuliawe need something here https://github.com/openstack/ironic-python-agent-builder/blob/master/dib/ironic-python-agent-ramdisk/static/usr/local/bin/ironic-python-agent-create-rescue-user.sh15:29
drannou1yes there or directly in the rescue driver15:30
drannou1to better separe user creation and restart of the network15:30
TheJuliayou'd basically need a similar pattern, on ramdisk exit do the needful15:30
TheJuliaIPA doesn't explicitly trigger dhcp, it expects the OS to do the needful because we don't want IPA's API to ever possibly touch an untrusted network, but since we made that decision, we've implemented additional controls so we might be willing to accept a patch to toggle dhcp before completing teardown of the agent15:32
drannou1going back on the switch side, it seems that this 'down/up' is optional: https://opendev.org/openstack/networking-generic-switch/src/branch/master/networking_generic_switch/devices/__init__.py#L43 ex : https://opendev.org/openstack/networking-generic-switch/src/branch/master/networking_generic_switch/devices/netmiko_devices/__init__.py#L283 15:34
drannou1Yeah but I also agree that if there is this "down/up" on the host POV, it should trigger back the DHCP. so may be a DIB wrong config that need to be check15:36
TheJuliazigo: have you ever tried rescue, out of curiosity?15:36
TheJuliaI do agree, there are many variables, unfortunately15:37
iurygregoryfacepaml, I just noticed the api-ref is not showing information about v1/nodes/{node_ident}/firmware ...16:59
TheJuliait happens17:00
TheJuliaapi-ref is not the easiest thing17:00
* TheJulia takes the corgi overlord for a walk he is demanding17:00
opendevreviewIury Gregory Melo Ferreira proposed openstack/ironic master: api-ref for nodes/{node_ident}/firmware  https://review.opendev.org/c/openstack/ironic/+/89695817:51
TheJuliasushy-tools makes my head hurt18:10
*** awb_ is now known as awb18:20
JayFDo not expect me to be around today; got in a small fender bender during my morning volunteering. I am physically OK but dealing with the logistical fallout.18:40
TheJuliaugh, ack18:45
iurygregoryJayF, take care19:48
JayFYeah I'm physically OK; truck damage appeared to be superficial but it's got a shimmy in the front end so something probably got bent19:55
JayFthank you for the concern 19:56
opendevreviewJulia Kreger proposed openstack/sushy-tools master: WIP: Support testing for HttpBootUri  https://review.opendev.org/c/openstack/sushy-tools/+/89696321:04
opendevreviewMerged openstack/metalsmith stable/yoga: Stable-Only: CI: Force install an appropriate openstacsdk version  https://review.opendev.org/c/openstack/metalsmith/+/89676322:47
opendevreviewMerged openstack/metalsmith stable/xena: Stable-Only: CI: Force install an appropriate openstacsdk version  https://review.opendev.org/c/openstack/metalsmith/+/89676522:47
opendevreviewMerged openstack/metalsmith stable/wallaby: Stable-Only: CI: Force install an appropriate openstacsdk version  https://review.opendev.org/c/openstack/metalsmith/+/89676622:47

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!