Friday, 2021-09-17

opendevreviewJulia Kreger proposed openstack/ironic master: CI: set Neutron OVS of_inactivity_probe for CI load  https://review.opendev.org/c/openstack/ironic/+/80942600:10
jandersgood morning Ironic o/00:21
*** pmannidi is now known as pmannidi|AFK00:59
opendevreviewMerged openstack/ironic stable/train: Fix idrac-wsman set_power_state to wait on HW  https://review.opendev.org/c/openstack/ironic/+/80898202:12
opendevreviewMerged openstack/ironic stable/train: Update the clear job id's constant  https://review.opendev.org/c/openstack/ironic/+/79866802:41
stevebakerTheJulia: you can ignore my request to look at those logs. It turns out the nodes are still running in bios mode. I'll probably need to pick your brains on Monday05:31
arne_wiebalckGood morning, Ironic!06:37
jandersgood morning arne_wiebalck06:53
arne_wiebalckhey janders o/06:53
iurygregorygood morning janders arne_wiebalck and Ironic o/06:55
iurygregoryTGIF06:55
jandershey iurygregory o/06:55
* janders had to google TGIF :)06:56
* iurygregory goes to grab more coffee06:56
iurygregoryjanders, really? :O06:56
jandersthe IF part was pretty obvious but wasn't sure about the TG :)06:56
iurygregorygot it =)06:57
arne_wiebalckheyh iurygregory o/06:58
opendevreviewVerification of a change to openstack/ironic bugfix/18.1 failed: Fix driver task pattern to reduce periodic db load  https://review.opendev.org/c/openstack/ironic/+/80937607:07
opendevreviewAija Jauntēva proposed openstack/ironic stable/wallaby: Add better error messages for invalid conf molds  https://review.opendev.org/c/openstack/ironic/+/80950807:54
opendevreviewAija Jauntēva proposed openstack/ironic stable/wallaby: Fix iDRAC import configuration missing task handling  https://review.opendev.org/c/openstack/ironic/+/80950907:54
opendevreviewAija Jauntēva proposed openstack/ironic stable/wallaby: Fix iDRAC import configuration job with errors  https://review.opendev.org/c/openstack/ironic/+/80955107:54
opendevreviewAija Jauntēva proposed openstack/ironic stable/wallaby: Redfish RAID: Use identity instead of durable_name  https://review.opendev.org/c/openstack/ironic/+/80955207:55
opendevreviewVerification of a change to openstack/ironic master failed: Support HttpHeaders in create_subscription  https://review.opendev.org/c/openstack/ironic/+/80685908:01
opendevreviewVerification of a change to openstack/ironic master failed: Support HttpHeaders in create_subscription  https://review.opendev.org/c/openstack/ironic/+/80685908:34
opendevreviewTadeas Kot proposed openstack/ironic-inspector master: Add support for state selector in the list introspection  https://review.opendev.org/c/openstack/ironic-inspector/+/80757809:15
janderssee you next week Ironic o/09:28
jandershave a great weekend everyone09:28
iurygregorybye janders o/09:35
TheJuliagood morning13:08
iurygregorygood morning TheJulia 13:25
opendevreviewOpenStack Release Bot proposed openstack/ironic-python-agent-builder stable/xena: Update .gitreview for stable/xena  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/80967113:28
opendevreviewOpenStack Release Bot proposed openstack/ironic-python-agent-builder stable/xena: Update TOX_CONSTRAINTS_FILE for stable/xena  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/80967213:28
opendevreviewOpenStack Release Bot proposed openstack/ironic-python-agent-builder master: Update master for stable/xena  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/80967313:29
opendevreviewOpenStack Release Bot proposed openstack/networking-baremetal stable/xena: Update .gitreview for stable/xena  https://review.opendev.org/c/openstack/networking-baremetal/+/80967413:29
opendevreviewOpenStack Release Bot proposed openstack/networking-baremetal stable/xena: Update TOX_CONSTRAINTS_FILE for stable/xena  https://review.opendev.org/c/openstack/networking-baremetal/+/80967513:29
opendevreviewOpenStack Release Bot proposed openstack/networking-baremetal master: Update master for stable/xena  https://review.opendev.org/c/openstack/networking-baremetal/+/80967613:29
TheJuliastevebaker: ahh, yeah, they are still in bios mode (i looked anywa13:29
opendevreviewOpenStack Release Bot proposed openstack/networking-baremetal master: Add Python3 yoga unit tests  https://review.opendev.org/c/openstack/networking-baremetal/+/80967713:29
TheJuliaanyway13:29
TheJulia)13:29
TheJuliastevebaker: so yeah, if the xml files were never updated (since virtualbmc can't know how to swap wholesale configs back/forth for libvirt), then the would still be in bios mode and overriding everything would still cause things to boot. Coupled with libvirt binaries still speaking ipxe, the machine would have still booted.13:34
TheJuliaThere is likely an error in ipa in that it seems to not have caught the libvirt failure, but we were already down a funky rabbit hole  too13:35
TheJuliaStderr: "Installing for i386-pc platform.\ngrub2-install: error: install device isn't specified.\n".:13:35
opendevreviewOpenStack Release Bot proposed openstack/networking-generic-switch stable/xena: Update .gitreview for stable/xena  https://review.opendev.org/c/openstack/networking-generic-switch/+/80969613:48
opendevreviewOpenStack Release Bot proposed openstack/networking-generic-switch stable/xena: Update TOX_CONSTRAINTS_FILE for stable/xena  https://review.opendev.org/c/openstack/networking-generic-switch/+/80969713:48
opendevreviewOpenStack Release Bot proposed openstack/networking-generic-switch master: Update master for stable/xena  https://review.opendev.org/c/openstack/networking-generic-switch/+/80969813:48
opendevreviewOpenStack Release Bot proposed openstack/networking-generic-switch master: Add Python3 yoga unit tests  https://review.opendev.org/c/openstack/networking-generic-switch/+/80969913:48
iurygregoryarne_wiebalck, https://github.com/metal3-io/ironic-hardware-inventory-recorder-image/pull/12 \o/13:59
iurygregoryit didn't add the label before because prow was down13:59
arne_wiebalckiurygregory: \o/14:00
arne_wiebalckwhat is prow?14:00
iurygregoryarne_wiebalck, its a k8s based CI/CD14:01
iurygregoryhttps://github.com/kubernetes/test-infra/tree/master/prow14:01
arne_wiebalckOh, ok ... thanks! 14:03
TheJuliaAny other reviews on https://review.opendev.org/c/openstack/ironic/+/803855 :)14:28
opendevreviewAija Jauntēva proposed openstack/ironic master: Fix idrac-wsman having Completed with Errors jobs  https://review.opendev.org/c/openstack/ironic/+/80974714:31
opendevreviewJulia Kreger proposed openstack/ironic-python-agent master: Fail when grub fails with current boot mode of BIOS  https://review.opendev.org/c/openstack/ironic-python-agent/+/80975414:39
NobodyCamGood Morning Ironic'ers... And OFC ....15:28
NobodyCamTGIF15:28
TheJuliagood morning NobodyCam 15:30
NobodyCamGood Morning15:30
NobodyCamTheJulia: 15:31
NobodyCamhehehe 15:31
* TheJulia raises an eyebrow15:31
TheJuliaSo I have this idea for something to superceed blendtec infomercials15:31
TheJulia"will it melt"15:31
NobodyCamdiving in to the Joys of SNMP driver15:32
TheJuliaoh my15:32
NobodyCamI like it, several levels heat,15:32
NobodyCam*of HEAT.15:32
arne_wiebalckGood morning TheJulia and NobodyCam o/15:39
NobodyCamGood Morning arne_wiebalck 15:39
sam_zmornin' ironic folks15:41
* arne_wiebalck is dizzy from connecting to consoles for hours to fix nodes as part of the migration from virtual to physical instances ... older hardware can be challenging :)15:43
arne_wiebalckGood morning sam_z o/15:43
NobodyCammorning sam_z 15:44
opendevreviewJulia Kreger proposed openstack/ironic master: CI: set Neutron OVS of_inactivity_probe for CI load  https://review.opendev.org/c/openstack/ironic/+/80942615:47
* TheJulia crosses fingers....15:47
TheJuliaonly other option is to start hacking on devstcak15:50
arne_wiebalckbye everyone o/15:57
NobodyCamnight arne_wiebalck 15:58
sam_zuh oh devstack16:20
NobodyCamso here is question; say someone has multiple resource-classes of hardware that use the SNMP driver, and ONE of those classes needs additional time between deployment SNMP power off (not IPA shutdown) and SNMP power on. 16:33
TheJuliaNobodyCam: a special trait woudl make sense and then a deploy step that I guess acted if there step was present16:44
NobodyCamoh I was thinking deploy steps wouldn't be able to effect the actions beyond IPA's16:45
TheJulianope, they are logic code executed on the conductor16:45
TheJuliaNobodyCam: dmitry put together a custom-agent recently which is entirely conductor side if memory serves16:48
TheJuliablarg17:08
sam_zheyo, i'm poking around in the API code, and i noticed that sometimes, controller methods are decorated with @pecan.expose and sometimes they're decorated with @ironic.api.methods.expose-- why is that?17:18
opendevreviewJulia Kreger proposed openstack/ironic master: CI: set Neutron OVS of_inactivity_probe for CI load  https://review.opendev.org/c/openstack/ironic/+/80942617:39
TheJuliasam_z: oh... umm... umm... they should all be @ironic.api.methods.expose. We went through an effort to nuke pecan a while back17:42
TheJuliabut I guess... we missed something or it got re-introduced17:43
sam_zoh lol. it still lives on in some corners of the controller code17:46
sam_zwhat additional functionality does the @ironic.api.methods.expose decorator provide over the @pecan.expose decorator?17:48
sam_z"$ grep -R "@pecan.expose" . | wc -l" tells me there's 9 instances of @pecan.expose in the ironic repo btw17:50
TheJuliasam_z: got a link to an example real quick?18:14
* TheJulia is currently deep in the neutron code trying to wrap her head around why it blows up on startup if there is a transient appearing failure18:14
sam_zhttps://opendev.org/openstack/ironic/src/branch/master/ironic/api/controllers/v1/__init__.py#L25418:15
TheJuliahuh18:19
sam_zhere's a list of where the 9 pecan.expose calls are (according to grep): https://paste.opendev.org/show/809416/18:20
sam_zline numbers may be slightly off actually since i was messing with the code lol18:20
TheJuliahmm, there must be a reason18:21
* TheJulia makes mental note to chat with stevebaker next week18:22
sam_zTheJulia: all instances of pecan.expose are decorating either _lookup() or _route() methods apparently: https://paste.opendev.org/show/809417/18:29
TheJuliathat seems intentional18:30
sam_zthere are 9 instances of @pecan.expose, 6 instances of _lookup(), and 3 instances of _route() in the codebase so there's probably a reason18:30
sam_zwhat that reason is, i have no idea because i'm cracking open the pecan documentation for the first time today B)18:31
TheJuliasam_z: I am fairly sure steve will know18:31
sam_zwow that's a lot of messages!18:53
TheJuliastevebaker: so... seems our neutron issues are all tied back to the initialization of the firewall. I don't grok the inner depths of that code very well, it seems that the code basically bails immediately upon failure because OVS reports an error about the openflow datapath not being valid... and I don't know where that is coming from but its a new transactional command to OVS so what might be happening is <200 ms of 18:55
TheJuliacross thread lag *or* the integration bridge just not being fully fully up yet in the agent.  18:55
TheJuliastevebaker: I suddenly wonder if ovs expects cross-cpu thread cache to actually be coherent18:55
TheJuliastevebaker: because it is explicitly impossible in VMs...18:56
* TheJulia only knows this because she once had a database which *did* use it and when it got re-scheduled on to another CPU on a VM the cache was gone, so the VMs always got only one CPU which of course irked the DBA.... but all we could really tell the DBA was "complain to the vendor, their single threaded database!"18:59
opendevreviewMerged openstack/ironic stable/ussuri: Fix idrac-wsman set_power_state to wait on HW  https://review.opendev.org/c/openstack/ironic/+/80898119:05
TheJuliasweet19:32
opendevreviewJulia Kreger proposed openstack/ironic master: DNM: Disable Neutron firewall  https://review.opendev.org/c/openstack/ironic/+/80989519:37
TheJuliastevebaker: I'm going to try and discuss ^^^ with neutron folks next week, because I don't think neutron should die on startup.19:37
NobodyCamany interest in a patch that adds a configurable delay to snmp power on and off actions?21:47
TheJuliacould be useful22:16
TheJuliashe is hitting post event drop... hardcore22:18
TheJuliaerr, whoops22:18
opendevreviewVerification of a change to openstack/ironic master failed: Support HttpHeaders in create_subscription  https://review.opendev.org/c/openstack/ironic/+/80685922:28
opendevreviewChris Krelle proposed openstack/ironic master: add snmp power action delay  https://review.opendev.org/c/openstack/ironic/+/80990222:32
NobodyCamthere are no release notes or other things just 7 lines at this point. 22:47
opendevreviewVerification of a change to openstack/ironic master failed: Support HttpHeaders in create_subscription  https://review.opendev.org/c/openstack/ironic/+/80685922:50
NobodyCamAre ironic standalone test failing?23:29
TheJuliaNobodyCam: not about an hour ago23:33
NobodyCam:) I saw two patches fail on the same test23:34
TheJuliaO.o23:34
TheJulialink?23:34
NobodyCamhttps://zuul.opendev.org/t/openstack/build/1d2adc777778417bafcafa534d5187b023:35
NobodyCamlooks like devstack23:35
TheJuliait is neutron most likely23:36
TheJuliawhy I'm chasing down the path of https://review.opendev.org/c/openstack/ironic/+/80989523:36
NobodyCamhttps://zuul.opendev.org/t/openstack/build/2b270ae477d6454ca7df85d52e6be4e823:36
NobodyCam+++23:37
TheJuliayup https://195efc623dcc108061a1-e3638d6f93b7f404b738b516d37f0f7d.ssl.cf5.rackcdn.com/806859/4/check/ironic-standalone/1d2adc7/controller/logs/screen-q-agt.txt23:37
TheJuliavery end of the file23:37
TheJuliasame23:39
TheJulialiterally starts, gets down into the firewall startup and blows up23:39
NobodyCam:)23:39
TheJuliawhy in the world it doesn't keep retrying, I don't understand23:40
TheJuliaI guess the model of "systemd will fix it" maybe, but... that is the wrong model to really take with a critical path item23:40
NobodyCamI've seen cases where systems didn't detect an app had crashed; or really mostly crashed23:45
TheJuliaand systemd gives up eventually23:45
NobodyCamsystems/systemd/23:45
TheJuliawhat I don't really get is just how it fails, but I *suspect* if I start a day with no interruptions, I could likely trace it down23:46
TheJuliaThere is also just zero logging really around it all23:46

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!