Friday, 2024-05-03

jandersBifrost question: what's the purpose of baremetal vs target in https://opendev.org/openstack/bifrost/src/branch/master/playbooks/inventory/group_vars ?01:28
jandersand more specifically, what is the origin of the image download URL passed to the IPA?01:28
janderscontext: I installed bifrost with --network-interface set to eth1 and it still tried to use virbr0's 192.168.122.1 IP address, I suspect it's taking it from network_interface but since it's set in multiple places I do not know which one is causing the problem01:29
jandersknowing this should help track down why --network-interface install option doesn't seem to take care of this01:30
ikoliveiraODE_UUID:04:08
ikoliveira<ikoliveira>  Received heartbeat for node 878c3113-0035-5033-9f99-46520b89b56d with callback URL https://10.7.0.175:9999. This is not expected, and the heartbeat will not be processed.#033[00m04:08
ikoliveira<ikoliveira> May  3 04:01:14 lsd-srv-238 ironic[665280]: 2024-05-03 04:01:14.041 665280 DEBUG ironic.api.method [req-8fcc726a-6850-44d7-a0a5-69d21354f999 req-41ceeb1c-36c8-48ae-9b9e-f9473cf84c27 - - - - - -] Client-side error: Detected change in ramdisk provided "callback_url" format_exception /opt/stack/bifrost/lib/python3.10/site-packages/ironic/api/method.py:124#033[00m04:08
ikoliveira<ikoliveira> May  3 04:01:14 lsd-srv-238 ironic[665280]: 2024-05-03 04:01:14.043 665280 INFO eventlet.wsgi.server [req-8fcc726a-6850-44d7-a0a5-69d21354f999 req-41ceeb1c-36c8-48ae-9b9e-f9473cf84c27 - - - - - -] 10.7.0.175 "POST /v1/heartbeat/878c3113-0035-5033-9f99-46520b89b56d HTTP/1.1" status: 400  len: 483 time: 0.0156338#033[00m04:08
ikoliveira<ikoliveira> Can someone assist me in resolving this issue? I'm uncertain as to why it's occurring. The node's status after deployment shows as "active," yet the boot process doesn't occur.04:08
opendevreviewJacob Anders proposed openstack/bifrost master: Make boot_interface customisable  https://review.opendev.org/c/openstack/bifrost/+/91792706:21
rpittaugood morning ironic! happy friday! o/07:10
opendevreviewRiccardo Pittau proposed openstack/bifrost master: Update supported Python versions  https://review.opendev.org/c/openstack/bifrost/+/91802908:18
*** mmalchuk_ is now known as mmalchuk08:40
opendevreviewAdam Rozman proposed openstack/ironic-python-agent master: add quiet cleanup option  https://review.opendev.org/c/openstack/ironic-python-agent/+/91803911:23
opendevreviewAdam Rozman proposed openstack/ironic-python-agent master: add quiet cleanup option  https://review.opendev.org/c/openstack/ironic-python-agent/+/91582511:26
jandersI'm looking for second +2 for https://review.opendev.org/c/openstack/bifrost/+/917927 if anyone has time for a quick review. Thanks in advance!11:33
iurygregorygood morning11:40
iurygregoryjanders, sure I will add to my list to look at it11:41
*** lucasagomes_ is now known as lucasagomes12:08
jandersTY iurygregory12:16
dtantsurTheJulia, hey, has anyone in your team reported or heard reports of memory leaks in Inspector?12:25
TheJuliadtantsur: we had someone mention it in irc 2 or 3 years ago. I remember then we looked at the code and what they indicated before leaving irc and thought it might be something with database interaction, but I think that is about where the search ended.13:04
dtantsurI see, I see.. we have a very "nice" graph in https://issues.redhat.com/browse/OCPBUGS-3226113:06
TheJuliahmm13:15
TheJuliain the past, we knew it to be introspection data in process such that it looked like it was from the db, is that with sqlalchemy 2.0 or pre 2.0?13:15
dtantsurSorry, did not parse.13:16
dtantsurI don't think this environment has constant (re-)inspection, so I'm more suspicious of the sync loop.13:16
TheJuliagiven the progression, that aligns with what I am thinking13:17
dtantsurInterestingly, I don't see anything suspicious on my local environment.13:19
TheJuliacrazy thought, since the incline is different (drastically) from the original report. What if it is webserver responses... and that is all the process monitoring13:38
dtantsurhmmm13:39
opendevreviewBaptiste Jonglez proposed openstack/networking-generic-switch master: Introduce NGS agent design  https://review.opendev.org/c/openstack/networking-generic-switch/+/89704713:51
TheJuliaLooking at items in BZ, we've got https://bugs.launchpad.net/ironic/+bug/1595625 and I'm not sure it makes sense to keep open at this point in time. Thoughts?14:15
rpittauTheJulia: I think we can close it14:28
TheJuliaI wonder if https://bugs.launchpad.net/ironic/+bug/1686675 is still the case14:34
JayFIf we're still setting that environment variable, the bug that works around was fixed literally years ago14:38
TheJuliaI think we removed it....14:39
TheJuliaI've not checked yet14:39
JayFSame, I am going to be very scattered in my availability today and I'm not in front of a desktop right now14:41
JayFI'd suggest 1595625 would likely be superceded by runbooks14:47
JayFand in a world where those exist, we'd likely implement it from a different perspective14:47
TheJuliaI'd just close it out at this point because at it's age, the possibility of doing so was *always* "just ask for the step you want"14:47
TheJulia... I have tons of folks that like to just run the step they want for manual cleaning14:48
JayFI think when I read that (is that my RFE? it sounds like something I'd suggest) I was thinking 'automated_clean' as an alias step anywhere we take a step14:48
JayFthat subs the automated steps14:48
JayFbut that makes much less sense now than it did in 201614:48
TheJuliayeah14:48
JayFI think our use case was, for onmetal14:49
JayFwanting to ensure the node went through cleaning after some kinda maintenance was done14:49
JayFmight have even been before we did a clean from manageable -> available14:49
TheJuliareally, there is no reason one can't create a step which stubs it anyway in a hardware manager14:49
JayFI wontfix'd it with a comment saying if you want this, reopen iwth a use case14:51
TheJuliahttps://github.com/openstack/ironic/blob/master/ironic/cmd/__init__.py#L2214:51
TheJuliacool cool14:52
TheJuliaThanks JayF14:52
JayFzap that from orbit14:52
JayFTheJulia: I'm erradicating that before I step out14:53
opendevreviewJay Faulkner proposed openstack/ironic master: Remove eventlet workarounds  https://review.opendev.org/c/openstack/ironic/+/91808214:57
JayFbeaker on the burner; bbl14:58
opendevreviewJulia Kreger proposed openstack/ironic-python-agent master: Remove eventlet workaround  https://review.opendev.org/c/openstack/ironic-python-agent/+/91808715:19
opendevreviewMerged openstack/ironic-python-agent stable/2023.2: Unmount config drives  https://review.opendev.org/c/openstack/ironic-python-agent/+/91773415:20
opendevreviewMerged openstack/ironic-python-agent stable/2024.1: Unmount config drives  https://review.opendev.org/c/openstack/ironic-python-agent/+/91773315:27
rpittaubye everyone, have a great weekend! o/15:33
Sandzwerg[m]hmmm, so https://docs.openstack.org/ironic/latest/user/deploy.html#capabilities mentions that it's possible to set the root-device hint per instance. But at which step? When I try to set this during the deployment (before IPA booted) I get an error that the node can not be updated while it's in a state transition, but after the deployment a root device hint would not make any sense. I tried to pass this as property during the15:35
Sandzwerg[m]server create but that had no effect. And the linked root device hint page doesn't mention this possibility at all.15:35
Sandzwerg[m]hmm setting it before the deployment sets the property but feels strange. Let's see if it has an effect.15:38
opendevreviewJulia Kreger proposed openstack/ironic-python-agent stable/2023.1: ci: stable-only: explicitly pin centos for builds  https://review.opendev.org/c/openstack/ironic-python-agent/+/91790315:43
opendevreviewJulia Kreger proposed openstack/ironic-python-agent stable/2023.1: Unmount config drives  https://review.opendev.org/c/openstack/ironic-python-agent/+/91785215:43
TheJuliahmmmmm15:48
TheJuliaSandzwerg[m]: looks like it is actually a thing, but it might vary based upon how the data goes in, so if you can get the details of your instance_info field that help wrap our brains around it15:55
TheJuliait might be a bug in how data is handled from nova and hten populated into ironic where it was just not "front of mind" then15:56
Sandzwerg[m]<Sandzwerg[m]> "hmm setting it before the..." <- OK that works, but has the issue that I'm not sure on which node I end up. I'm looking for a way to "modify" the root device hint for one customer. I think I need to look into owner/lessee again15:57
Sandzwerg[m]TheJulia: hmkay, so I could open a bug with nova?15:57
TheJuliaSandzwerg[m]: they would just ignore it15:57
TheJuliaWe would need to see what and how exactly the data gets posted into the field15:58
Sandzwerg[m]hmk15:59
TheJuliaSandzwerg[m]: how did you create it on a flavor?16:00
Sandzwerg[m]on a flavor?16:00
TheJuliayeah16:00
TheJuliaIs that not what your trying to do, have a customer specific flavor?16:00
Sandzwerg[m]that might be another way. Right now we have a generic flavor because the hardware is the same and used by multiple customers. But one wants his OS to end up one the (bigger) second disk instead of the smaller first one, But I don't want to change this for all nodes because it would impact the other customers as well.16:02
Sandzwerg[m]And we have the root device hint on node level, not on the flavor. I was not aware that one could set it on the flavor16:02
TheJuliaI'm *not* sure it would work, but basically instance_info data gets posted out to instance_info16:03
TheJuliahttps://github.com/openstack/ironic/blob/master/ironic/drivers/modules/deploy_utils.py#L150516:04
TheJuliahttps://www.irccloud.com/pastebin/lLQvA0n0/16:05
TheJuliaSandzwerg[m]: example instance_info field from our tempest test, I'm not sure how much the flavor data gets dropped in16:06
Sandzwerg[m]yeah I now used "openstack baremetal node set $NODE_UUID     --instance-info root_device='{"size":"> 400"}'" before deploying to that specific node16:06
Sandzwerg[m]Hmm I need to check that next week, that might work as well. My other thought was to use owner/lessee and assign them some nodes for which I could then switch the hint on node level, assuming other customers wouldn't end up on these nodes and they only end up on these nodes16:07
Sandzwerg[m]But using a custom flavor sounds nice. Great Idea16:08
TheJuliaenjoy!16:08
Sandzwerg[m]Thanks, need to drop, have a nice weekend \o/16:11
TheJuliayou too16:13
TheJuliahmm, more changes likely needed, time will tell16:32
TheJulia(regarding 2023.116:32
TheJulia)16:32
opendevreviewMerged openstack/ironic-lib master: destroy_disk_metadata: support 4096 sector size  https://review.opendev.org/c/openstack/ironic-lib/+/91793716:33
JayFI'd also note, generally, you'll be disappointed often if you look at Ironic docs for what we can do in some use cases (standalone / metal3 / non-nova openstack) versus what gets exposed via nova driver, it's a bit of a lowest common denominator17:10
opendevreviewMerged openstack/ironic master: Remove eventlet workarounds  https://review.opendev.org/c/openstack/ironic/+/91808217:53
opendevreviewJulia Kreger proposed openstack/ironic stable/2023.1: ci: stable-only: explicitly pin centos build  https://review.opendev.org/c/openstack/ironic/+/91811817:56
opendevreviewJulia Kreger proposed openstack/ironic-python-agent stable/2023.1: Unmount config drives  https://review.opendev.org/c/openstack/ironic-python-agent/+/91785217:57
opendevreviewJulia Kreger proposed openstack/ironic-python-agent stable/2023.1: Unmount config drives  https://review.opendev.org/c/openstack/ironic-python-agent/+/91785217:57
TheJuliaI guess lets see if that clears the issue up18:05
opendevreviewJulia Kreger proposed openstack/ironic-tempest-plugin master: Additional test state fixes  https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/91800118:30
iurygregoryany DIB experts around? 20:06
iurygregoryikoliveira is having some trouble after deploying a bm node, ironic shows active but the node seems to hang during boot , I think it might be a problem with the image used in the node20:06
TheJuliao/20:21
TheJuliaikoliveira: Hi!20:21
TheJuliaikoliveira: where is the node hanging in the boot process?20:22
ikoliveiraTheJulia, hi! The node loads BIOS and starts boot process, but doesnt starts the operating system after this. Only shows an black screen with an underscore blinking20:25
TheJuliaikoliveira: so, odds are your bootloader has no console= setting20:26
TheJuliayou may need to modify the dib element20:26
TheJuliauhhhh20:26
TheJuliado you see the bootloader screen?20:28
iurygregoryit doesn't even reach the bootloader i think when he was showing to me 20:30
ikoliveiraTheJulia, just like iurygregory said, it doesnt appears to me20:30
iurygregory"super micro" logo loading... black screen 20:31
TheJuliahmmm20:31
iurygregoryI have the feeling the image is broken or something... I told him to try building a centos8 stream to test, using $ disk-image-create centos bootloader vm -o centos8-wholedisk20:32
TheJuliatry rebuilding your image with "DIB_BOOTLOADER_VIRTUAL_TERMINAL=tty0" as a environment setting. If you can share the command line that would be helpful.20:32
TheJuliaat this point, don't bother with centos820:32
TheJuliaokay, so....20:32
ikoliveirahmmm20:32
TheJuliaI'd likely do something like "env DIB_BOOTLOADER_VIRTUAL_TERMINAL=tty0 DIB_RELEASE=9-stream disk-image-create centos block-device-efi vm bootloader -o centos9-wholedisk"20:33
iurygregoryhe is doing legacy boot, since the machine is old if I recall20:34
TheJuliaoh20:34
TheJuliauhhhh20:34
iurygregoryyeah =X20:34
TheJuliablock-device-mbr then :)20:34
iurygregoryuniversity doesn't have a lot of money for new hardware 20:34
ikoliveiranice, gonna try those ideas. tks a lot!!!!!!20:36
opendevreviewMerged openstack/bifrost master: Make boot_interface customisable  https://review.opendev.org/c/openstack/bifrost/+/91792720:48
TheJuliadtantsur: any thoughts on the json_rpc code in ironic lib and allowing "session" to be able to be client input, thinking for mercury to be able to have a separate configuration section because we're talking about a service "another team" owns which means it could have entirely different parameters23:26
TheJuliaserver wise, should be "good as-is"23:28

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!