Tuesday, 2022-07-19

cboucharTheJulia:  TY00:00
*** tkajinam is now known as Guest543507:14
*** Guest5435 is now known as tkajinam07:36
rpittaugood morning ironic! o/08:16
smachiHello ironic08:58
smachiResolved the conflicts for the patch https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/845356 and also received +2 in code review09:02
smachiwaiting for it to be merged09:02
timeuDid anybody use Packer for building wholedisk images for baremetal deployments? I have been using disk-image-builder but was wondering if by using Packer I could make it less dependent on the host (build the image in OpenStack VMs instead)09:26
ftarasenkotimeu: I found no option to create EFI partitions with packer, have you solved this issue or are you not affected?09:30
timeuyes that was what I was wondering too. I haven't really started to play around but after some research the EFI partition setup is a bit of a pain in Packer09:31
ftarasenkoI use VM in OpenStack with gitlab runner and disk-image-builder installed to create images.)  09:35
timeuftarasenko: Good to know. Maybe there is a way to integrate disk-image-builder into packer. 09:36
jandershey rpittau timeu ftarasenko and Ironic o/10:47
iurygregorygood morning Ironic o/11:01
opendevreviewUemit Seren proposed openstack/sushy master: Revert workaround for weak Etags  https://review.opendev.org/c/openstack/sushy/+/85034512:42
iurygregory◕_◕12:45
TheJuliatimeu: would that patch above happen to be you?12:47
TheJuliaI'm going to guess yes, thanks /whois12:50
TheJulia:)12:50
timeuyeah ;-)12:51
timeumy handle is my first name reverse ;-)12:52
TheJuliaAhh!12:52
timeuso I tested it. Seems like the newest XCC firmware fixes the issue that my original patch worked around by stripping the W/ prefix 12:52
timeuso now the etag can be returned as is and it works. 12:52
TheJuliaI guess I'm wondering what was the behavior that was being observed with the weak etag workaround? I guess someone needs to go hunt down the original why it was done as well12:52
timeuI verified that it was still an issue with the older firmware 12:53
timeuI did the patch originally 12:53
timeuin the older Lenovo firmware I got a weak etag and if I passed it as is, the Lenovo box complained about it 12:53
timeuso I stripped it 12:53
TheJuliaahh okay12:53
timeuwhich fixed the issue. Seems that the recent Lenovo firmware fixed this, so they now accept the etag that they send us in the header 12:54
TheJulianice12:54
* TheJulia wonders if we should check the DMTF docs w/r/t etags just to be on the safe side12:54
timeuAccording to https://storyboard.openstack.org/#!/story/2009680#comment-192175 both the HTTP standard as well as the redfish implementation require strong Etags12:56
timeuLenovo for some reason doesn't adhere to it. So this PR should not break any vendors that do 12:56
timeubecause they will send a strong Etag 12:56
timeuand we just return it 12:56
iurygregoryI think we had a problem with weak etag in another hardware... not only Lenovo12:59
iurygregorygoing to double check with others to be sure about that12:59
TheJuliaiurygregory: your memory is the same as mine13:01
TheJuliaI seem to think there were etags issues on a particularly popular whitebox hardware manufacturer 13:01
iurygregoryshould we say yay?13:02
timeuiurygregory: would be great if you could test the change on that hardware13:03
timeuat least sending both etags breaks the virtual media function on the Lenovo one 13:03
iurygregoryinteresting13:04
TheJuliaiurygregory: https://www.youtube.com/watch?v=XJYmyYzuTa813:04
TheJuliaI mean, you can stop after the first 20 seconds :)13:04
iurygregorytimeu, thanks for the heads up! this would explain the new BZ janders was working on *I think* (since it was a Lenovo machine)13:05
iurygregoryTheJulia, perfect!13:05
timeuiurygregory: do you have a link or this is a non-public BZ ?13:06
iurygregorytimeu, let me see if I can find and check if is public or not give me some minutes =)13:07
timeuthanks !13:07
timeuI think arne_wiebalck should probably also test my patch with the SuperMicro nodes as I am reverting his patch more or less but I think the SuperMicro behave fine ;-)13:09
mraineriWith regards to ETags and Redfish, the general guidance I've given clients is to treat the ETag value as an opaque string regardless of it being strong or weak13:14
mraineriJust copy it as-is into the If-Match header, into some database of cached resources to monitor when things change, etc13:15
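The opaque-string guidance above can be sketched roughly as follows. This is a minimal illustration, not sushy's actual code: `if_match_headers` and `strip_weak_prefix` are hypothetical helper names, and the ETag value is made up. The first helper echoes back whatever ETag the service returned; the second shows the kind of `W/` stripping that the older workaround relied on.

```python
def if_match_headers(etag):
    """Build conditional-request headers that echo the ETag unchanged."""
    if not etag:
        # No ETag was returned, so send no precondition at all.
        return {}
    return {"If-Match": etag}


def strip_weak_prefix(etag):
    """Drop a leading W/ weak-validator marker, if present (old workaround)."""
    if etag.startswith("W/"):
        return etag[2:]
    return etag


weak = 'W/"3d7b8a7d"'  # made-up example value
print(if_match_headers(weak))    # opaque pass-through
print(strip_weak_prefix(weak))   # what the workaround would have sent
```

Treating the value as opaque keeps the client out of the business of judging strong versus weak validators; the stripping helper is only shown for contrast with the behaviour being reverted.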
iurygregorytimeu, https://bugzilla.redhat.com/show_bug.cgi?id=2103710 13:34
opendevreviewRiccardo Pittau proposed openstack/networking-baremetal stable/wallaby: Set agent_type in tests  https://review.opendev.org/c/openstack/networking-baremetal/+/85004913:54
iurygregoryReminder - Revise Ironic Vision meeting in 1hr14:00
iurygregoryhttps://etherpad.opendev.org/p/revise-ironic-vision14:00
opendevreviewMerged openstack/ironic-tempest-plugin master: Set minimal tempest version to 27.0.0  https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/84528514:02
timeuiurygregory: Thanks. that's actually the error message I got until I created the PR. But this requires a relatively recent XCC (Lenovo's BMC) firmware 14:14
iurygregorytimeu, by any chance do you remember the version you had when the workaround was required?14:24
iurygregoryand which version you have now14:24
timeuso for the SD530 model (Type7X21) the current XCC version that doesn't need the workaround is 5.20 (Build ID: TEI3C8M). The one that still needed the workaround was at 4.80 (Build ID: TEI3C0K)14:26
timeulooking at the changelog between those two versions I couldn't find anything that would suggest that this was changed though14:28
timeuThere are version 5.00 and 4.83 in between 14:29
opendevreviewAija Jauntēva proposed openstack/ironic stable/xena: Fix Redfish RAID to update raid_config  https://review.opendev.org/c/openstack/ironic/+/85035614:43
iurygregoryJayF, rloo https://etherpad.opendev.org/p/revise-ironic-vision15:01
iurygregorywe started the meeting feel free to join15:03
rlooiurygregory: last minute meeting... will try to get there later.15:04
iurygregoryrloo, ack =)15:04
JayFiurygregory: I'm in a meeting now, can join ~83015:19
JayF(11 minutes)15:19
iurygregoryJayF, ack 15:19
iurygregoryJayF, basically now we are sharing where we see ironic in 202615:29
iurygregorythis is the etherpad https://etherpad.opendev.org/p/revise-ironic-vision15:29
* JayF joins meeting15:30
* JayF hears "no need for IPA at all"15:30
JayFlol15:30
iurygregoryLOL15:59
TheJulianoooooooooooo!15:59
rpittaucut short15:59
iurygregoryWTF GOOGLE15:59
rloo59 min only... we can count...15:59
iurygregoryhttps://meet.google.com/bvo-rxhi-dka15:59
iurygregory-.-'15:59
rpittauno no, makes sense, it's my punishment for being distracted!15:59
iurygregoryI didn't know they would kill the meeting if you schedule for 1hr16:00
iurygregorymy mistake16:00
iurygregory-.-'16:00
JayFthe message that popped said16:00
TheJulianext meeting, thanks everyone!16:01
JayF1hr limit on free meetings16:01
rpittauI'll add something to the etherpad16:01
iurygregoryJayF, ohhh16:01
JayFso I don't think it was a scheduling problem, it's just free services being limited16:01
iurygregorythat's what happens when I use my personal account LOL16:01
iurygregorynext time I will try to use bluejeans probably... or my redhat email16:01
iurygregorysorry for that everyone, please update things in the etherpad we can schedule another meeting o/16:02
rpittauiurygregory: or discord :P16:05
iurygregory++16:06
rpittaualright, time to go seek refuge in the fridge, see you tomorrow, if I'm not melted o/16:07
JayFdiscord for work is something I've managed to avoid so far16:07
timeuiurygregory: I tested my patch with a real deployment and it works for the SD530 with XCC 5.20. Also I noticed that the IPA that comes with OSP16.2 now works with virtual media. Not sure if the size is < 500 or the XCC 5.20 fixed the limitation 16:29
timeuwell I celebrated too early. I ran into a different error now at the end of the deployment when changing the boot device to disk16:35
timeudid anybody see this error message before: "The value 'Continuous' for the property BootSourceOverrideEnabled is not in the list of acceptable values."16:42
iurygregoryhummmm 16:43
* iurygregory checks if there was any BZ like that16:43
timeuI will create a story and dig into it via direct sushy calls. Will try to find out what the acceptable values are. I also get a SQLalchemy exception when sushy throws this error btw16:44
timeuI get following ironic error: 2022-07-19 16:31:22.782 7 ERROR ironic.drivers.modules.agent_base_vendor [req-8d98ffa2-d3d8-407f-9dbb-c813212aba44 - - - - -] Failed to change the boot device to disk when deploying node d77e4414-6a9b-41c5-9ecd-32b16:46
timeu1920c73b2. Error: (builtins.TypeError) Object of type 'BootSource' is not JSON serializable16:46
iurygregoryI thought Ironic would set it to Once since it is a virtual media deployment16:48
timeuthis is OSP16.2 so Train release not sure if this is also the case with that release16:49
iurygregorynot JSON serializable doesn't sound good lol16:49
timeuyeah I guess the error message that the lenovo node returns might be malformed16:51
iurygregoryI would try running the command via curl for the BMC to see16:51
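A direct request of the kind suggested above could look roughly like this in Python rather than curl. This is only a sketch under assumptions: the BMC address, the system path `/redfish/v1/Systems/1`, and the `Hdd` target are placeholders, `build_boot_override_request` is a hypothetical helper, authentication is omitted, and the request is only constructed here, not sent.

```python
import json
import urllib.request


def build_boot_override_request(base_url, system_path, enabled, target):
    """Construct (but do not send) a Redfish PATCH that sets the boot override."""
    body = {
        "Boot": {
            "BootSourceOverrideEnabled": enabled,
            "BootSourceOverrideTarget": target,
        }
    }
    return urllib.request.Request(
        base_url.rstrip("/") + system_path,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PATCH",
    )


# Placeholder address and system path; adjust for the real BMC.
req = build_boot_override_request(
    "https://bmc.example.com", "/redfish/v1/Systems/1", "Continuous", "Hdd"
)
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Sending the same body with `Once` instead of `Continuous` would show whether the BMC rejects only the one value.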
timeuyeah will do I think the DB error happens in this call: 2022-07-19 16:31:26.029 7 ERROR oslo_db.sqlalchemy.exc_filters   File "/usr/lib/python3.6/site-packages/ironic/drivers/modules/deploy_utils.py", line 555, in set_failed_state16:53
timeu2022-07-19 16:31:26.029 7 ERROR oslo_db.sqlalchemy.exc_filters     manager_utils.deploying_error_handler(task, msg, msg, clean_up=False)16:53
timeunot sure if I should open a story also for ironic 16:54
iurygregoryfeel free to open one under ironic =)17:19
opendevreviewMerged openstack/networking-baremetal stable/wallaby: Set agent_type in tests  https://review.opendev.org/c/openstack/networking-baremetal/+/85004917:42
JayFWhat is OSP?17:52
TheJuliaOSP == short hand for RH OSP or Open Stack Platform18:12
TheJuliatimeu: have definitely seen the continuous override thing before, that originated in a miscommunication along the way in one of the ?2019? dmtf redfish revisions18:14
TheJuliaahh, so we're trying to handle the error and the response... is so unexpected we're actually raising a different exception. That is awesome18:15
timeuAccording to this documentation: https://sysmgt.lenovofiles.com/help/index.jsp?topic=%2Fcom.lenovo.systems.management.xcc.restapi.doc%2Fupdate_next_onetime_bootconfig_patch.html and also querying the XCC itself Continuous is not allowed:   'BootSourceOverrideEnabled@Redfish.AllowableValues': ['Once', 'Disabled']},18:27
timeuso I think it's rather an issue of ironic which should probably check if Continuous is supported 18:27
TheJuliaugh, that change rearing its head again three years later18:28
timeuwell we are eagerly waiting for the OSP 17.x release so we can move off Train to Wallaby at least18:29
TheJuliawell, I don't think that specifically is going to be in there18:29
timeuI guess 90% of the issues are fixed anyways in OSP 17.x18:29
timeuthe fix ? 18:29
TheJuliawell, there really is not a fix for the lenovo issue you're hitting18:30
TheJuliaexcept maybe they fix their firmware18:30
TheJuliagive me a few and I can pivot and hunt down the docs18:30
timeusure, interestingly it worked with the older firmware. I will check again if the Continuous string is part of the allowed values in the older firmware18:30
TheJulialikely is18:31
TheJuliathere was this version where there was a suggestion on eway, and a couple vendors went to removing the value ability18:31
TheJuliaand yeah :(18:31
timeuaha it's also not in the old firmware:   'BootSourceOverrideEnabled@Redfish.AllowableValues': ['Once', 'Disabled'] so it means that they started to check it 18:34
TheJuliaheh18:34
timeuin the newer firmware 18:34
TheJuliai bet it still worked though18:34
TheJuliaI guess we're going to need a story. Maybe there is a path there...18:35
timeusushy or ironic story ? 18:37
TheJuliasushy most likely18:37
timeusushy story I created https://storyboard.openstack.org/#!/story/201016118:37
TheJuliak18:37
iurygregoryadded ironic-triaged 18:38
timeuone approach would be to check if the BootSourceOverrideEnabled value passed to sushy is in the AllowableValues and if not fall back to Once18:38
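The fallback idea above could be sketched roughly like this. It is not sushy code: `pick_override_enabled` is a hypothetical helper, and the `boot_attrs` dict simply mimics the `Boot` attributes quoted from the XCC earlier in the log.

```python
ALLOWABLE_KEY = "BootSourceOverrideEnabled@Redfish.AllowableValues"


def pick_override_enabled(requested, boot_attrs, fallback="Once"):
    """Return the requested value if the BMC allows it, else the fallback."""
    allowed = boot_attrs.get(ALLOWABLE_KEY)
    if allowed is None or requested in allowed:
        # The BMC advertises no restriction, or the value is accepted.
        return requested
    if fallback in allowed:
        return fallback
    raise ValueError(
        "neither %r nor %r is in the allowable values %r"
        % (requested, fallback, allowed)
    )


# Allowable values as reported by the XCC in this discussion.
xcc_boot = {ALLOWABLE_KEY: ["Once", "Disabled"]}
print(pick_override_enabled("Continuous", xcc_boot))
print(pick_override_enabled("Disabled", xcc_boot))
```

Passing the request through unchanged when no allowable-values annotation is present is a design choice in this sketch, so that BMCs which do not advertise the list keep their current behaviour.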
iurygregorysounds reasonable I would say18:39
timeuI can come up with a PR draft if you like 18:39
iurygregoryhappy to review it o/18:39
timeuok will have a stab at it tomorrow 18:39
TheJuliathat would be much appreciated18:40
TheJuliaI've got the session cache issue thing I'm kind of trying to figure out what is going on there :(18:41
JayFTheJulia: can I help?18:55
JayFTheJulia: If you need someone to quack at I can :)18:55
TheJuliaJayF: based upon the email thread, I think https://review.opendev.org/c/openstack/ironic/+/850259 might do the trick... Got logs like a half hour ago and haven't looked yet19:07
JayFack19:10
TheJuliawhat I see in the logs will govern the storyboard entry19:11
TheJuliaewww19:27
TheJuliasushy thinks it was able to refresh the session19:27
JayFTheJulia: want me to fix rpittau's issues with https://review.opendev.org/c/openstack/ironic-python-agent/+/839084 so it can land?20:04
TheJuliaJayF: by all means, wrapping my head around this sushy issue :\20:05
opendevreviewJay Faulkner proposed openstack/ironic-python-agent master: Guard shared device/cluster filesystems  https://review.opendev.org/c/openstack/ironic-python-agent/+/83908420:24
JayFTheJulia: rpittau: ^ fyi, minimal changes but I did reword a couple of sentences instead of directly applying the review feedback20:25
TheJuliathanks20:31
TheJuliaso looking at this sushy issue, it *looks* like reauth just never occurs properly20:32
TheJuliabut that should fail20:32
TheJulia*or* it just works. :\20:32
opendevreviewJulia Kreger proposed openstack/sushy master: WIP: Add more debugging to session auth process  https://review.opendev.org/c/openstack/sushy/+/85042520:54
opendevreviewUemit Seren proposed openstack/sushy master: Revert workaround for weak Etags  https://review.opendev.org/c/openstack/sushy/+/85034521:22
