Wednesday, 2021-11-03

*** pmannidi|AFK is now known as pmannidi00:33
hgySorry to bother you, I would like to ask whether Ironic counts the resources of bare metal nodes which are managed by  ironic into the total available resources02:44
hgySorry, I know this question is unclear02:50
hgyThe reason is I got “exceed quote for cores and ram”02:52
hgyWhen I deploy a bare metal node02:52
hgyOh, sorry, I find the cause, I didn't modify the default quote05:39
hgyI will try it05:39
*** redrobot2 is now known as redrobot05:58
arne_wiebalckGood morning, Ironic!07:09
arne_wiebalckhgy: The quota is not managed nor checked by Ironic, so it follows the same rules as for virtual machines.07:10
arne_wiebalckNobodyCam: Good morning! Sorry, I left right after that comment. I will try to have a look today!07:12
iurygregorygood morning arne_wiebalck and Ironic o/07:25
MikeCTZAmorning arne_wiebalck and all ironic07:27
arne_wiebalckGood morning iurygregory and MikeCTZA o/07:31
MikeCTZAopenstack hypervisor stats show should show me non 0 values based on the baremetal node that I have in openstack right? and likewise hypervisor show should match (assuming just 1 node), mine are all zero07:37
MikeCTZAI'm having 1 more go at trying to get a box working before our brainstorming meeting in a few hours time07:40
MikeCTZA(an internal team meeting that is)07:40
arne_wiebalckMikeCTZA: I barely use "openstack hypervisor ..."08:01
arne_wiebalckMikeCTZA: let me check what I see ...08:01
arne_wiebalckMikeCTZA: so, on an active hypervisor, the resources are used (vcpus 0, vcpus_used 64)08:12
arne_wiebalckMikeCTZA: on an available hypervisor, the resources are not used but also not there (vpus 0,  vcpus_used 0)08:16
MikeCTZAOK so I need to look a bit more at nova scheduler then as when I try deploy a box I get No valid host was found, been working through the docs on what to check but not found anything that solves my issue as yet08:23
rpittaugood morning ironic! o/08:34
jandershey arne_wiebalck iurygregory MikeCTZA rpittau and Ironic o/08:37
iurygregoryhey janders rpittau o/08:37
rpittauhey janders iurygregory :)08:39
dtantsurmorning ironic09:14
rpittauhey dtantsur :)09:14
arne_wiebalckMikeCTZA: this is on which release?09:14
arne_wiebalckMikeCTZA: first thing I would check for "no valid host" is placement09:14
arne_wiebalckhey janders rpittau and dtantsur o/09:14
rpittauhey arne_wiebalck :)09:14
zigoHi there! Can someone have a look at this Debian bug and help me fix?09:37
zigohttps://bugs.debian.org/cgi-bin/bugreport.cgi?bug=99754509:37
dtantsurhi zigo! stendulker may be able to09:41
dtantsurI wonder if it's related to a combination of newer sushy and older proliantutils?09:41
zigodtantsur: I can try to upgrade both, sure! :)09:43
stendulkeryes, it needs to use newer sushy09:46
zigostendulker: dtantsur: I'm already on sushy 3.12.010:15
zigoAnd it's also latest proliantutils 2.12.0 ...10:15
opendevreviewMerged openstack/ironic-python-agent master: Delete EFI boot entry duplicate labels first  https://review.opendev.org/c/openstack/ironic-python-agent/+/81589910:19
opendevreviewVerification of a change to openstack/ironic-python-agent master failed: Add blockdev to re-read the partition table  https://review.opendev.org/c/openstack/ironic-python-agent/+/81619210:19
dtantsurarne_wiebalck: rebase required, I suspect ^^10:19
*** pmannidi is now known as pmannidi|AFK10:34
stendulkerzigo: It looks like some of more calls needs to be fixed in proliantutils 10:41
arne_wiebalckdtantsur: thanks, I will have a look!10:54
*** dviroel|out is now known as dviroel|rover11:21
janderssee you tomorrow Ironic o/11:30
opendevreviewArne Wiebalck proposed openstack/ironic-python-agent master: Add blockdev to re-read the partition table  https://review.opendev.org/c/openstack/ironic-python-agent/+/81619211:51
arne_wiebalckdtantsur: rpittau ^^ (I rebased the patch)11:51
dtantsurthanks!11:52
* dtantsur is not looking forward towards his rebase after these 2 patches11:52
arne_wiebalckdtantsur: yeah, I was hoping to get mine in before your 800(?) line patch :-D11:52
dtantsurwhich pretty much just moves the code around :)11:53
dtantsurbut yeah11:53
arne_wiebalckwould it be a lot easier if we stop mine and merge yours first?11:54
arne_wiebalckmine is a one-liner, it is mostly the tests11:54
arne_wiebalckbut yours was first clearly :)11:55
rpittauI'll hold approving11:57
dtantsurarne_wiebalck: yours will need a backport, so it should probably go first11:59
dtantsurrpittau: ^^^11:59
dtantsurI think I'll survive a rebase :)11:59
rpittauack11:59
dtantsurarne_wiebalck: meanwhile, could you check https://review.opendev.org/c/openstack/ironic-python-agent/+/815629? will make rebasing slightly easier12:07
opendevreviewMerged openstack/metalsmith master: Update pep8 test requirements  https://review.opendev.org/c/openstack/metalsmith/+/81454312:55
timeuMikeCTZA: regarding your "no valid host", the troubleshooting guide has some infos: https://docs.openstack.org/ironic/queens/admin/troubleshooting.html. Best is to check the nova scheduler log and check what the various filters return. Are you using traits or resource_class for your flavors/nodes ? 12:57
opendevreviewVerification of a change to openstack/ironic-python-agent-builder master failed: Bump pip for tinyipa to 21.3.1  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81489413:27
opendevreviewMerged openstack/ironic-python-agent master: Stop requiring mocking of utils.execute if ironic-lib execute is mocked  https://review.opendev.org/c/openstack/ironic-python-agent/+/81562914:19
TheJuliaarne_wiebalck: when you changed the conductor group for a nova compute node, what exactly happened?14:42
opendevreviewAija Jauntēva proposed openstack/ironic master: Fix idrac-wsman deploy with existing non-BIOS jobs  https://review.opendev.org/c/openstack/ironic/+/81652114:46
arne_wiebalckTheJulia: I have to run to a meeting in a moment, but I think the resource provider was (soft)deleted and could not be recreated by the new responsible.14:58
TheJuliaresource provider soft deleted because it just completely disappeared right?14:59
TheJuliaat least, to nova's handling of partition keys14:59
TheJuliaarne_wiebalck: I'm going to add you on a nova change I'm working on. Closely related issues15:00
dtantsursee you tomorrow folks o/16:00
rpittaubye dtantsur o/16:13
opendevreviewAija Jauntēva proposed openstack/ironic master: Fix idrac-wsman deploy with existing non-BIOS jobs  https://review.opendev.org/c/openstack/ironic/+/81652116:13
opendevreviewMerged openstack/ironic-python-agent-builder master: Bump pip for tinyipa to 21.3.1  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/81489416:38
rpittaugood night! o/16:59
TheJuliao/17:03
arne_wiebalckTheJulia: Meeting done :)17:14
arne_wiebalckTheJulia: So, the issue was that Nova (not placement) soft-deleted the compute node and when it appeared on another nova-compute the insertion with the same UUID fails.17:14
arne_wiebalck*soft-deleted the node/resource provider17:15
arne_wiebalckFWIU, the initial registration is done by nova and in nova as well as in placement.17:15
arne_wiebalckThe resource tracker only updates the resource of this resource provider.17:16
TheJuliaThat seems like an area patched with-in the last couple of cycles17:16
arne_wiebalckTheJulia: this is very possible17:16
arne_wiebalckTheJulia: Our nova is on Stein.17:16
arne_wiebalckTherefore my hesitation to make definitive statements on this, it may have been all fixed already.17:17
arne_wiebalckbye everyone, see you tomorrow o/17:22
opendevreviewJulia Kreger proposed openstack/ironic-python-agent master: TBD: Add Lenovo SR650 issue note  https://review.opendev.org/c/openstack/ironic-python-agent/+/81654517:35
TheJuliaNobodyCam: what if we had a "standby" state18:53
TheJuliawhich may power things down, or not?18:53
TheJuliaor do other "magic"18:53
NobodyCamOh so a state beyond active18:55
NobodyCamAnd good morning18:55
NobodyCamHow would the system know to “wake” up is powered down or a sleep18:56
TheJuliaI think it would need to be an explicit case18:56
TheJuliaso the model I know exists in some shops is you have two clusters18:57
TheJuliaLike A side and B side18:57
TheJuliaB is secondary, but needs to be "on"18:57
TheJuliabut not used18:57
TheJuliaA, until it is explicitly failed, takes the workload18:57
NobodyCamThat’s interesting. So a pool of cold standbys19:00
TheJuliawell19:01
TheJuliawarm/luke warm19:01
NobodyCamIf they are sleeping or powered down. My concern would how are they spun up on demand19:02
TheJuliaI think $higher level thing19:04
TheJuliaerr, there should have been underscores19:05
TheJuliasomething has to make the decision to fail a cluster19:05
TheJuliaand that act may be a set of additional work, where it is not really a reprovision, just a state update19:05
TheJuliaand maybe bios reconfiguration19:05
NobodyCamHumm. I see where that could be useful.19:07
TheJulia"reconfigure"19:08
TheJuliaalmost19:08
NobodyCamI would okay with a standby / state. Concern is would we need to boot ipa to make the bios updates and the. Reboot back into the image19:08
NobodyCamThat could add 10 minutes to power on19:09
TheJuliaYeah, I *think* reconfigure may be a totally valid thing there19:09
TheJuliabut the case I'm thinking is largely OOB bios settings and reboot19:09
TheJuliaIf we were to do standby, reconfigure could just make sense at the same time19:16
TheJuliasince we're wiring in the same places19:16
TheJuliabut... how to know what to do then...,19:16
stevebaker[m]morning19:25
TheJuliagood morning stevebaker[m]19:27
*** dviroel|rover is now known as dviroel|rover|afk19:45
opendevreviewJulia Kreger proposed openstack/ironic-specs master: WIP: Enable improved power/state management  https://review.opendev.org/c/openstack/ironic-specs/+/81655119:49
TheJuliastevebaker[m]: NobodyCam: tzumainn: zer0c00l: rloo: eandersson: arne_wiebalck: thoughts/insight might be useful on https://review.opendev.org/c/openstack/ironic-specs/+/816551 19:54
stevebaker[m]looking19:54
tzumainnTheJulia, at a first glance it seems really cool; I have to admit that I don't really understand what 'standby' would mean for a node20:55
tzumainnI'm going to point the spec out to the MOC people to see if they have thoughts20:55
*** dviroel|rover|afk is now known as dviroel|rover21:29
*** dviroel|rover is now known as dviroel|rover|out22:18
TheJuliatzumainn: thinking like a standby configuration or state, like on reserve22:19
TheJuliaPlease don’t reuse this machine to mine bitcoin22:19
TheJuliaKeep it in low power mode22:20
jandersgood morning Ironic o/23:43

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!