Wednesday, 2023-04-05

adam_metal3Hi Ironic05:18
rpittaugood morning ironic! o/07:33
adam_metal3rpittau, good morning 07:37
rpittauhey adam_metal3 :)07:37
dtantsurTheJulia: hi, since you probably have opinions about inspector, could you check https://review.opendev.org/c/openstack/ironic-specs/+/878001 please? Has 2x +2 already, really close to landing.07:56
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: Move ubuntu jobs to jammy  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/87953809:40
hjensasdtantsur: I made a comment regarding DHCP filter, even with direct DB access is there not a risk that the frequency we would need to read from the DB would become an issue in large deployments?09:53
dtantsurhjensas: these are valid concerns, I'm just not sure I have better ideas..10:45
dtantsurwe could, of course, make the new script/service expose JSON RPC10:46
dtantsur... which will be pretty close to having our own DHCP agent :D10:50
opendevreviewMerged openstack/ironic stable/zed: Add error logging on lookup failures in the API  https://review.opendev.org/c/openstack/ironic/+/87947010:53
adam_metal3Sandzwerg[m], hi, would you be available to send the network_data.json with the working bond config ?11:01
hjensasyeah, RPC or a rest api that ironic could post events to.11:02
hjensasI had a look at the multilevel dhcp providers idea I mentioned on one of the reviews, it gets complicated because there is code to get the IP address via the DHCP provider. AFAICT getting the IP is only required for ELILO uefi boot, which is deprecated ... makes me want to rip code out.11:04
dtantsurhjensas: I think elilo support is gone already11:07
hjensasyes, elilo support is gone. There is a reno, but the code added in dhcp provider is still there - and we seem to link grub2 pxe configs under both IP address and MAC address. (But grub2 should be happy with just the MAC ...)11:09
hjensasdtantsur: I'm thinking instead of our own DHCP agent over dnsmasq, it would be better to look at kea-dhcp (or other options). We could even opt not to use the neutron DHCP provider, just configure kea-dhcp with static IPs from neutron vifs.11:11
hjensasdtantsur: the kea-dhcp option also has the benefit of HA/load-balancing capabilities. What are Metal3's thoughts on HA/load-balancing DHCP: not a requirement, or something you would like to see at some point?11:16
dtantsurhjensas: we haven't thought about that (which does not mean a hard no)11:17
dtantsurwe need to solve the ironic HA/load-balancing problem first :)11:17
dtantsurand that involves 1) doing something about inspector, 2) doing something about cross-conductor PXE configuration11:18
hjensasack, oh another question - discovery is a usecase you want to support in Metal3?11:32
iurygregorymorning ironic11:44
Sandzwerg[m]adam_metal3: does that help? https://etherpad.opendev.org/p/gtFooG_MOO9tB79DUb6H11:50
dtantsurhjensas: possibly. not on the roadmap now, but there have been requests upstream11:52
adam_metal3Sandzwerg[m], thanks I will check!!!11:52
iurygregoryI'm still puzzled that the grenade job didn't fail on my patch12:38
opendevreviewMaksim Malchuk proposed openstack/bifrost master: Remove extra symbols accidentally added  https://review.opendev.org/c/openstack/bifrost/+/87954712:43
opendevreviewMaksim Malchuk proposed openstack/bifrost stable/zed: Feature: Enable user_data's file generation  https://review.opendev.org/c/openstack/bifrost/+/87955012:48
opendevreviewMaksim Malchuk proposed openstack/bifrost stable/yoga: Feature: Enable user_data's file generation  https://review.opendev.org/c/openstack/bifrost/+/87955112:49
opendevreviewMaksim Malchuk proposed openstack/bifrost stable/xena: Feature: Enable user_data's file generation  https://review.opendev.org/c/openstack/bifrost/+/87955212:49
iurygregorybackport of features? .-. 12:50
dtantsurespecially to xena..13:02
TheJulia… on video at13:03
TheJuliaerr13:05
TheJuliasilly autocorrect13:05
TheJuliai typed bifrost, hit enter, it changed it13:05
* TheJulia goes and makes coffee13:05
opendevreviewMerged openstack/ironic master: Fix requests calls with timeouts  https://review.opendev.org/c/openstack/ironic/+/87949713:13
TheJuliagood morning everyone!13:15
TheJuliahjensas: dtantsur: the mac address support was added to be able to eventually remove the by ip behavior which is now the far flung fallback of grub2 after machine hardware uuid (the internal one) and the interface mac address13:19
dtantsurmorning TheJulia, ack13:19
hjensasTheJulia: good morning! yes, that is my conclusion on reading grub2 docs as well. 13:21
TheJulia... at one point it didn't support the mac address which seems weird13:21
TheJuliabut they added that in13:21
TheJulia(Thank you universe!)13:21
opendevreviewMaksim Malchuk proposed openstack/bifrost master: Remove extra symbols accidentally added  https://review.opendev.org/c/openstack/bifrost/+/87954713:50
opendevreviewMaksim Malchuk proposed openstack/bifrost stable/xena: Feature: Enable user_data's file generation  https://review.opendev.org/c/openstack/bifrost/+/87955213:55
opendevreviewMaksim Malchuk proposed openstack/bifrost stable/yoga: Feature: Enable user_data's file generation  https://review.opendev.org/c/openstack/bifrost/+/87955113:57
opendevreviewMaksim Malchuk proposed openstack/bifrost stable/zed: Feature: Enable user_data's file generation  https://review.opendev.org/c/openstack/bifrost/+/87955013:59
opendevreviewMaksim Malchuk proposed openstack/bifrost stable/xena: Feature: Enable user_data's file generation  https://review.opendev.org/c/openstack/bifrost/+/87955214:00
opendevreviewMaksim Malchuk proposed openstack/bifrost stable/yoga: Feature: Enable user_data's file generation  https://review.opendev.org/c/openstack/bifrost/+/87955114:00
opendevreviewMaksim Malchuk proposed openstack/bifrost stable/zed: Feature: Enable user_data's file generation  https://review.opendev.org/c/openstack/bifrost/+/87955014:01
mnasiadkaGood afternoon Ironic14:04
mnasiadkaHave sort of an issue with idrac-redfish node enrollment14:04
mnasiadkaERROR ironic.conductor.verify [req-bd409ce5-20f1-44f5-928b-edaf69fef7a9 bifrost_user - - - -] Failed to get power state for node 55ed12a1-c878-4d33-9653-c751d8309baf. Error: Redfish exception occurred. Error: 'NoneType' object has no attribute 'encode': ironic.common.exception.RedfishError: Redfish exception occurred. Error: 'NoneType' object has no attribute 'encode'14:05
mnasiadkaTried searching for storyboard bugs, but it seems I'm not really good with it ;-)14:05
iurygregoryfirst time I see this one14:05
mnasiadkahmm, let me check, I think I haven't set redfish_user and redfish_password, just blindly set drac_user and drac_password (but docs say it's only for wsman)14:09
*** dking is now known as Guest1006214:19
mnasiadkaok, that fixed it14:31
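The fix above came down to which credentials were present in the node's driver_info. A minimal sketch of a pre-enrollment check follows. It assumes the Redfish credential keys are spelled `redfish_username` and `redfish_password` (the chat abbreviates them, so check the driver docs for your release). The helper `missing_redfish_credentials` and the sample values are hypothetical and are not part of ironic:

```python
# Assumed key names for the Redfish credentials; verify against the driver docs.
REQUIRED_KEYS = ("redfish_username", "redfish_password")


def missing_redfish_credentials(driver_info):
    """Return the assumed-required keys that are absent or empty."""
    return [key for key in REQUIRED_KEYS if not driver_info.get(key)]


# Illustrative node with only WSMAN-style credentials set, as in the report above.
node_driver_info = {"drac_username": "root", "drac_password": "example"}

missing = missing_redfish_credentials(node_driver_info)
if missing:
    print("missing driver_info keys:", ", ".join(missing))
```

Running a check like this before enrollment would surface the missing keys by name instead of a NoneType error during verification.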
opendevreviewMaksim Malchuk proposed openstack/bifrost master: Remove extra symbols accidentally added  https://review.opendev.org/c/openstack/bifrost/+/87954714:36
dtantsurI actually wonder at which point we can deprecate WSMAN support given that Redfish is the bright future.14:47
dtantsurwdyt TheJulia?14:47
TheJuliahuh whaaaattt14:54
* TheJulia blinks because her brain has been deep in ironic-grenade14:54
TheJuliadtantsur: .... maybe? a slow process for sure since even fairly recently people expressed trying to use ?idrac6? in production14:55
TheJuliaor maybe it was idrac714:56
dtantsurle sigh14:58
opendevreviewJay Faulkner proposed openstack/ironic-specs master: Add 2023.2 Workitems discussed at Ironic PTG  https://review.opendev.org/c/openstack/ironic-specs/+/87938115:16
rpittaugood night! o/15:59
dtantsurTheJulia, hjensas, a pretty raw thought dump on PXE/DHCP and x-conductor: https://etherpad.opendev.org/p/ironic-multiconductor-static-dhcp16:00
dtantsurwanted to get it out of my head before I leave for the holidays16:00
TheJuliamy gut feeling is we've got a race condition which we're kind of skirting around with recovery on failure16:11
TheJuliabut I think that would require a similar brain dump to handle16:11
TheJuliawe could potentially do a dhcp service, but then we're re-inventing neutron :\16:11
TheJuliaand the idea I had was we would trigger the neutron update, cross-conductor wouldn't handle static dhcp updates...16:13
TheJuliahmm16:13
dtantsurRe-invent all your neutrons \o/16:15
dtantsurI wish neutron was more (de-)composable16:16
dtantsuranyway, my brain is shrimp already, switching to the easter break mode. see you next Wednesday!16:16
TheJuliahave a great break16:17
TheJuliaI think I've identified what is going on with some of the grenade jobs16:17
TheJuliay'all won't like it16:17
dtantsursounds promising...16:17
TheJuliatl;dr looks like the vm networking is coming online late16:17
TheJuliathere could also be a dnsmasq crash red-herring involved16:18
TheJulialike splash screen not reached until 88 seconds after vm starts16:19
TheJulianetwork up 21 seconds in, network back up 83 seconds in16:19
JayFhow does that present so weirdly w/r/t the flavor stuff?16:27
TheJuliaso flavor errors are a result of our use and the case we were digging into yesterday remains a mystery16:28
TheJuliain this specific case, the vm fired up16:28
opendevreviewMohammed Boukhalfa proposed openstack/sushy-tools master: Add fake_ipa inspection, lookup and heartbeater to fake system  https://review.opendev.org/c/openstack/sushy-tools/+/87536616:28
TheJuliaeverything worked until the fip couldn't be pinged16:28
JayFso this is the "vip won't ping" issue dtantsur was originally looking at16:28
TheJuliayup16:28
TheJuliaI think we can extend the timeout for grenade and it should make that issue just go away16:29
JayFis the answer to sleep harder? serious question16:29
JayF+++++16:29
TheJuliabasically eys16:29
TheJuliayes16:29
TheJuliainstead of sleep to cuddle cats for a moment, maybe take a quick cat nap16:29
TheJuliaoh sigh16:33
TheJuliait is a static setting in grenade16:33
TheJuliahttps://github.com/openstack/grenade/blob/3f9fe2e8fc1fccf0324538274e3b07b3e90b96b9/projects/60_nova/resources.sh#L13716:33
JayFonly until we make it non-static at least16:34
TheJuliawe upgdated our version of cirros last cycle didn't we16:34
TheJuliaupdated16:34
JayFIDR16:34
TheJulianeutron folks looked at some average jobs and indicated they see it take 35 seconds at present16:35
TheJuliaso a bit longer is not... unreasonable 16:35
clarkbI think the cirros update got reverted, but that may be worth double checking16:35
TheJulia... this is also on rax right now16:36
TheJuliaI wonder if we're seeing just rax hosts fail grenade16:36
TheJuliaall emulated VM workloads16:37
TheJuliayeah, this is 5.2, so oldish16:38
TheJuliaso inmotion and rax16:44
TheJuliaand ovh16:45
TheJuliamight just be luck of the draw as it were16:45
clarkbinmotion rax and ovh doesn't leave much else if that is where you see failures16:45
TheJuliayeah16:45
TheJuliatime for a grenade change16:45
clarkbother than those three you've got vexxhost which only does special labels now and arm64 clouds16:45
TheJuliaso interestingly enough, we likely need to check our vm networking settings again for spanning tree16:54
opendevreviewJulia Kreger proposed openstack/ironic master: Set ironic-grenade to wait 120 seconds  https://review.opendev.org/c/openstack/ironic/+/87967516:59
TheJuliaokay, now we wait()16:59
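The "wait longer" approach discussed above can be sketched as a poll-until-deadline loop. This is not grenade's code, which is a shell script; `wait_for`, `FakeClock`, and the 60 second comparison value are hypothetical and chosen only for illustration. The 88 second figure is the late network bring-up mentioned earlier in the log:

```python
import time


def wait_for(probe, timeout=120.0, interval=1.0,
             clock=time.monotonic, sleep=time.sleep):
    """Call probe() until it returns True or `timeout` seconds have elapsed."""
    deadline = clock() + timeout
    while True:
        if probe():
            return True
        if clock() >= deadline:
            return False
        sleep(interval)


class FakeClock:
    """Simulated clock so the example runs instantly."""

    def __init__(self):
        self.now = 0.0

    def time(self):
        return self.now

    def sleep(self, seconds):
        self.now += seconds


def simulate(timeout, network_up_at=88.0):
    """Pretend the VM only answers pings `network_up_at` seconds after boot."""
    fake = FakeClock()
    return wait_for(lambda: fake.now >= network_up_at, timeout=timeout,
                    clock=fake.time, sleep=fake.sleep)


print(simulate(timeout=60.0))
print(simulate(timeout=120.0))
```

With the simulated 88 second bring-up, the shorter timeout gives up before the network returns, while a 120 second wait succeeds.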
*** Peter is now known as Guest1007817:33
opendevreviewMerged openstack/ironic-lib master: Exclude all files starting with . from flake8 tests  https://review.opendev.org/c/openstack/ironic-lib/+/87840217:37
opendevreviewMerged openstack/ironic stable/2023.1: Add error logging on lookup failures in the API  https://review.opendev.org/c/openstack/ironic/+/87940718:21
opendevreviewMerged openstack/ironic bugfix/21.2: Add error logging on lookup failures in the API  https://review.opendev.org/c/openstack/ironic/+/87940918:21
opendevreviewMerged openstack/ironic bugfix/21.3: Add error logging on lookup failures in the API  https://review.opendev.org/c/openstack/ironic/+/87940818:27
opendevreviewMerged openstack/ironic master: Remove outdated API version information from the enrollment docs  https://review.opendev.org/c/openstack/ironic/+/87942818:27
opendevreviewMerged openstack/ironic master: On rpc service stop, wait for node reservation release  https://review.opendev.org/c/openstack/ironic/+/87283418:48
opendevreviewVerification of a change to openstack/ironic master failed: Always fall back from hard linking to copying files  https://review.opendev.org/c/openstack/ironic/+/87916118:48
opendevreviewMerged openstack/ironic-python-agent-builder master: Add checksum generation support  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/87800918:58
opendevreviewMerged openstack/ironic-python-agent-builder master: Remove ussuri CI jobs  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/87935719:12
opendevreviewVerification of a change to openstack/ironic-python-agent-builder master failed: Always generate checksum files on image builds  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/87829719:12
opendevreviewMerged openstack/ironic master: Exclude all files starting with . from flake8 tests  https://review.opendev.org/c/openstack/ironic/+/87878819:20
opendevreviewMerged openstack/ironic-python-agent master: Report system firmware information in the inventory  https://review.opendev.org/c/openstack/ironic-python-agent/+/87904919:37
opendevreviewVerification of a change to openstack/ironic stable/wallaby failed: Suppress Bandit B509 in snmp.py  https://review.opendev.org/c/openstack/ironic/+/86084719:45
opendevreviewMerged openstack/ironic master: Do not log into the template1 database during test-setup  https://review.opendev.org/c/openstack/ironic/+/87914820:17
opendevreviewJulia Kreger proposed openstack/ironic master: WIP: Add hold steps  https://review.opendev.org/c/openstack/ironic/+/87906021:26
opendevreviewJulia Kreger proposed openstack/ironic master: WIP: Add hold steps  https://review.opendev.org/c/openstack/ironic/+/87906021:59
TheJulianew state diagram on that change :)21:59
* TheJulia calls it a day22:03
JayFiurygregory: maybe we sync up earlier tomorrow? I'm having trouble getting even a basic testenv up for bifrost on stable/2023.122:20
JayFangry libvirt errors in `bifrost-cli testenv`22:20
JayF> fatal: [127.0.0.1]: FAILED! => {"changed": false, "msg": "internal error: Network is already in use by interface enp1s0"}22:20
iurygregoryJayF, sure, at what time are you thinking?22:24
JayFwell, I just gotta get this actually working 22:24
iurygregoryhumm networking already in use...22:24
JayFI looked at virsh net-list --all22:25
JayFonly one there is default, autostart enabled22:25
iurygregoryI seem to remember something like this22:25
JayFI disabled autostart, rebooted22:25
JayFsame err22:25
iurygregorydo you have any virbr0 available already?22:25
JayF(note: this is a VM I'm trying to install on)22:25
JayFonly a single interface, it's the NAT network from the parent hypervisors' libvirt22:26
iurygregorygotcha, never tested this approach (but it's like our CI I would say)22:26
JayFyeah, my hunch is a package or some presetup is missing that's assumed22:26
JayFbut missing in the testenv instructions22:27
JayF(or bitrotted over the life of ubu 22.04)22:27
iurygregory./bifrost-cli testenv22:28
iurygregory./bifrost-cli install --testenv22:28
JayFtestenv will not succeed22:28
JayFthat's what's erroring22:28
iurygregoryoh wow22:28
iurygregoryO.o22:28
JayFlet me get you a bigger log with this reproduction22:28
JayFunless I fixed it :P22:28
iurygregoryare you trying master?22:28
iurygregorynever tested in ubuntu, but we do this in our CI...22:29
JayFhttps://gist.github.com/jayofdoom/98321f655f303a2127eb8587c7d2f4be22:29
JayFI'm on stable/2023.122:29
JayFI can  try master if you think that's a better idea, but was hoping to demo with latest stable 22:29
iurygregoryI would use stable/x version22:30
JayFyeah, that's the bifrost I have checked out22:30
JayFstable/2023.122:30
iurygregoryensure libvirt network is running .-.22:30
iurygregorydo you have qemu/kvm installed  in the vm?22:30
JayFiurygregory: see the added comment on the gist22:31
iurygregorymaybe you need to upgrade packages22:31
* iurygregory press f522:31
JayFrunning a dist-upgrade22:31
JayFI'll note it installed/uninstalled half the world the first testenv run22:32
JayFstill a lot of updates22:32
iurygregorynormally it's good to update packages before installing 22:32
iurygregoryI always do that22:32
iurygregorylet me check the task in our code22:32
JayFI'm happy to change distro if you think it works better; but I believe we use ubu here so I'd prefer to do that22:33
JayFdist-upgraded, restarted, trying again22:33
JayFsame error :| 22:35
JayFgoing to try master, then maybe zed22:35
iurygregoryhttps://opendev.org/openstack/bifrost/src/branch/master/playbooks/roles/bifrost-create-vm-nodes/README.md22:38
JayFdtantsur: https://gist.github.com/jayofdoom/98321f655f303a2127eb8587c7d2f4be if you have any ideas on this, please lmk. I'm trying to get a bifrost running on ubuntu 22.0422:38
JayFwindow bare22:38
JayFoops22:38
JayFlooking at link22:38
iurygregorycan you check if any of the packages mentioned are installed?22:38
JayFso basically we need qemu-kvm but it's not installed by testenv22:38
iurygregoryso we do have a bug22:38
iurygregory:D22:38
iurygregorybecause I think it should install22:38
JayF> Note, selecting 'qemu-system-x86' instead of 'qemu-kvm'22:39
iurygregoryhttps://opendev.org/openstack/bifrost/src/branch/master/playbooks/roles/bifrost-create-vm-nodes/defaults/main.yml#L2822:39
JayFwhich was already installed22:39
JayFbingo iurygregory 22:39
JayFI'm on a default libvirt network22:39
JayFso it's conflicting with my local network22:39
JayFthat's exactly the problem22:39
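The conflict described above is an address-range overlap between the outer hypervisor's network and the one the test environment creates inside the VM. A minimal sketch of checking for that follows. It assumes both sides use 192.168.122.0/24, which is libvirt's usual default NAT range; the chat does not quote the value in the linked defaults file, so treat the second range as an assumption. `networks_conflict` is a hypothetical helper:

```python
import ipaddress


def networks_conflict(cidr_a, cidr_b):
    """Return True if the two CIDR ranges share any addresses."""
    return ipaddress.ip_network(cidr_a).overlaps(ipaddress.ip_network(cidr_b))


# Range seen on the VM's only interface (NAT network of the parent hypervisor).
outer_network = "192.168.122.0/24"
# Assumed range for the network the test environment wants to create.
testenv_network = "192.168.122.0/24"

if networks_conflict(outer_network, testenv_network):
    print("ranges overlap: move one of the networks to a different subnet")
```

Moving either side to a non-overlapping range, as was done here by changing the hypervisor's network, makes the check pass.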
JayFI suspect it's easier to change it on the hyp and give me a clear demo lol22:40
iurygregoryprobably22:40
iurygregoryhttps://docs.openstack.org/bifrost/latest/install/index.html#quick-start-with-bifrost-cli22:40
iurygregorymaybe you can test with a specific network-interface and a dhcp-pool 22:41
JayFI'm resetting my hyp to non-default settings22:42
JayFso I can have a less-messed-with VM to demo in22:42
iurygregoryfingers crossed22:43
JayFI'm rebuilding the whole VM to fix the network22:44
JayFso you'll have to get the update in the morn I think :D22:44
iurygregorynp o/22:45
JayFwhoever it was that told me that virt-manager can connect to an external hypervisor is a wonderful person :D22:48
JayFiurygregory: GREAT SUCCESS23:03
JayFdtantsur: ignore previous message23:03
JayFiurygregory: nice find, thank you23:03
opendevreviewJay Faulkner proposed openstack/bifrost master: Add warning about default libvirt networks  https://review.opendev.org/c/openstack/bifrost/+/87971123:06
iurygregoryJayF, awesome!23:12
iurygregory+W since it was a doc change 23:14
iurygregoryjust checked the build result23:14
opendevreviewMerged openstack/bifrost master: Add warning about default libvirt networks  https://review.opendev.org/c/openstack/bifrost/+/87971123:40
