Monday, 2023-06-19

rpittaugood morning ironic! o/07:15
yde_hi, i'm trying to build an ipa image using the ipa-builder, i just dont get how to tell the ipa-builder how to use my local IPA repo instead of https://opendev.org/openstack/ironic-python-agent08:03
yde_any clue ? thanks a lot08:03
rpittauyde_: you should be able to override ipa_source_path08:39
yde_rpittau: ok but how to pass it to the builder ? through env vars ? through extra-args ?08:41
yde_rpittau: i've seen this env var DIB_REPOLOCATION_ironic_python_agent08:43
rpittauyde_: that's defined by ipa_source_path in the ansible role08:44
rpittauyde_: there's also an example here https://docs.openstack.org/ironic-python-agent-builder/latest/contributor/index.html08:45
rpittausorry, I meant override DIB_REPOLOCATION_ironic_python_agent :)08:45
yde_rpittau: thanks a lot08:47
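
The override discussed above can be sketched as follows. This is a minimal illustration, assuming the builder is launched as a subprocess and inherits its environment; only the variable name DIB_REPOLOCATION_ironic_python_agent comes from the conversation. The helper name ipa_build_env and the example path are hypothetical.

```python
import os


def ipa_build_env(local_repo, base_env=None):
    """Return a copy of the environment with the IPA source location overridden.

    `ipa_build_env` is a hypothetical helper, not part of any OpenStack tool.
    """
    env = dict(os.environ if base_env is None else base_env)
    # Variable named in the discussion; it points the image build at a
    # local checkout instead of the upstream opendev.org repository.
    env["DIB_REPOLOCATION_ironic_python_agent"] = os.path.abspath(local_repo)
    return env


# Example path is an assumption; substitute your own checkout.
env = ipa_build_env("/opt/stack/ironic-python-agent", base_env={"PATH": "/usr/bin"})
print(env["DIB_REPOLOCATION_ironic_python_agent"])
```

The resulting mapping could then be passed as env= to subprocess.run together with whatever builder command line you normally use; the contributor documentation linked above has the exact invocation.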
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: Remove outdated install pyyaml with pip2  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/88637908:48
Nisha_Agarwalrpittau, ironic weekly meeting is there today, right?13:09
rpittaummm she quit13:28
rpittauanyway, should be13:28
opendevreviewBaptiste Jonglez proposed openstack/networking-generic-switch master: Fix batching error due to outdated etcd3gw  https://review.opendev.org/c/openstack/networking-generic-switch/+/88640413:57
opendevreviewBaptiste Jonglez proposed openstack/networking-generic-switch master: Honor ngs_save_configuration setting when using batch commands  https://review.opendev.org/c/openstack/networking-generic-switch/+/88640513:58
rpittauJayF: I'm starting to have doubts about the meeting today, is it happening?14:53
JayFin 7 minutes14:53
rpittauack thanks14:53
JayFnot sure how useful my tired self will be beyond that today :D 14:55
JayFbut going to do that for sure14:55
JayF#startmeeting ironic15:00
opendevmeetMeeting started Mon Jun 19 15:00:36 2023 UTC and is due to finish in 60 minutes.  The chair is JayF. Information about MeetBot at http://wiki.debian.org/MeetBot.15:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.15:00
opendevmeetThe meeting name has been set to 'ironic'15:00
JayFGoooood morning Ironic, who all is here for our meeting today15:00
rpittauo/15:00
matfechnero/15:00
masgharo/15:01
JayFI would expect lower attendance than usual as some contributors may still be on holiday/travelling home from the summit.15:01
JayF#topic Announcements/Reminder15:01
JayF#note Standing reminder to review patches tagged ironic-week-prio and to hashtag any patches ready for review with ironic-week-prio: https://tinyurl.com/ironic-weekly-prio-dash15:01
JayF#note Please ensure to fully investigate CI failures, and especially make note of TIMED_OUT failures in unit tests, and do not spam recheck as we are trying to work the issues out.15:02
JayF#topic Review Action items from previous meetings15:02
JayFlast meeting was June 5, and there were no actions. Moving on.15:02
JayF#topic Review Ironic CI Status15:02
JayFdoes anyone have a meaningful update on the unit test pain?15:03
JayFI lost the thread on that at summit15:03
rpittauwell my patches with the timeout setting are just waiting for a +W :)15:03
rpittauthey could maybe give us some more hints15:03
JayFcan you link that here? I'll land it15:03
rpittauyep, it's a chain that starts from here https://review.opendev.org/c/openstack/ironic/+/88537215:05
JayFack 15:05
JayFI will ensure that lands after the meeting15:05
rpittauthanks!15:05
JayF#topic Review ongoing 2023.2 Workstreams15:05
JayF#link https://etherpad.opendev.org/p/IronicWorkstreams2023.2 15:05
JayFdoes not look like significant progress during summit week, as expected15:05
JayFAny comments on ongoing 2023.2 work before I move on15:06
JayF#topic Open Discussion15:07
JayFlooks like bug 2021995 is on the agenda15:07
JayFNisha_Agarwal: I presume that was you? What's up!15:07
JayF#link https://bugs.launchpad.net/ironic/+bug/202199515:08
Nisha_AgarwalJayF, Hi15:08
Nisha_AgarwalYes15:08
Nisha_Agarwalwe wanted to have clarity on the bug...how the bug is reproduced15:08
Nisha_Agarwaland which hardware it is seen15:08
JayFIt's iLO Gen 10 as mentioned in the bug; I don't know the specific SKU but you can ask there and Scott or someone else should be able to get it15:09
Nisha_Agarwalas of now if it is Gen10 or gen10+ then proliantutils bank on sushy for connecting to hardware for redfish calls15:09
JayFas far as reproduction, we haven't found a tight reproducer15:09
JayFI'd prefer we keep most of the commentary in the bug though; the impacted parties are not in the Ironic meeting, and we (me/you/the impacted bug filers) are all in different timezones15:10
JayFso it'd be easier to coordinate there15:10
Nisha_Agarwalas far as i understand, during power_state_sync ironic is trying to power on while for some reason node is getting rebooted/power off15:10
JayFso the hardware is power looping in this case15:10
Nisha_Agarwaland probably ilo is not able to understand which one to honor15:11
JayFit's set to pxe and is failing15:11
Nisha_Agarwalmay be a race condition in ilo15:11
JayFthat's exactly what I'm wondering15:11
JayFand like, I don't expect the power status loop to get a sane value out of a node with flapping power statuses15:11
Nisha_Agarwalbut to connect to hardware team we need to reproduce the issue so that we could tell them the exact reproducer15:11
JayFbut we should ensure it doesn't get stuck or left in a bad state15:11
Nisha_Agarwalthere was a bug in gen9 for blade enclosures15:12
Nisha_Agarwalaround 6-7 years back15:12
JayFI believe you may be able to reproduce the circumstances by: 1) setting node to pxe 2) rebooting node into environment with no pxe boot 3) see power status flap15:12
Nisha_Agarwalwhere i guess it was fixed for something similar in ilo(and a workaround was provided in proliantutils for gen9)15:12
JayFbut I am remote to the hardware as well, so that's as far as my insight goes15:12
JayFif you have specific advice for trying to repro the bug, if we get that into the bug text I can make sure Scott sees it.15:12
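
When trying the reproduction steps above, it may help to have a rough way to tell whether the observed power states are actually flapping. The following is only a sketch, assuming the states have already been collected as a list of strings by some polling mechanism; the function names, the threshold, and the sample values are illustrative and are not taken from Ironic or proliantutils.

```python
def count_power_flaps(samples):
    """Count how many times consecutive power-state samples differ."""
    return sum(1 for prev, cur in zip(samples, samples[1:]) if prev != cur)


def is_flapping(samples, threshold=3):
    """Treat a node as flapping once it changes state `threshold` times or more.

    The threshold is an arbitrary assumption for illustration.
    """
    return count_power_flaps(samples) >= threshold


# Illustrative samples, e.g. collected once per polling interval.
observed = ["power on", "power off", "power on", "power off", "power on"]
steady = ["power on"] * 5

print(count_power_flaps(observed), is_flapping(observed))
print(count_power_flaps(steady), is_flapping(steady))
```

A count like this could be attached to the bug report alongside the polling interval, so the hardware team sees how quickly the state was changing.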
Nisha_AgarwalI tried to reach Aaron/Scott on the launchpad but didnt get any response so far15:13
JayFScott was at Summit15:13
Nisha_Agarwaleven on IRC i couldnt get him15:13
Nisha_AgarwalOhk15:13
JayFhe's not usually on IRC, the bug is the best way to communicate15:13
Nisha_AgarwalOk15:13
JayFand we work directly together so if I see a comment has sat for a few days, I'll pass it on :)15:13
Nisha_AgarwalThanks Jayf,15:14
Nisha_AgarwalThat will help15:14
JayFabsolutely :) Thanks for taking a look!15:14
JayFIs there anything else for the meeting? More on this bug? Another open discussion topic?15:14
Nisha_Agarwali will try the reproducer you have given and see if we can hit the issue15:14
Nisha_Agarwalfor now this topic is doe15:14
Nisha_Agarwaldone*15:14
Nisha_AgarwalThank you15:14
JayFGoing to give a minute for anyone to come forward with addl. open discussion topics15:15
JayFOne thing to note: I'll be looking in a day or two to email a short summary of the summit PTG sessions; there are some notes in 15:15
JayF#link https://etherpad.opendev.org/ironic-openinfra-202315:15
rpittaunice, was going to ask about that :)15:15
JayFbut mainly just time spent thinking about what Ironic might look like in 5/10 years15:15
JayFand some design discussion for NGS15:16
JayFI think is mostly all that happened15:16
JayFI'm going to call it, thank you for those who've been here o/15:16
JayF#endmeeting15:16
opendevmeetMeeting ended Mon Jun 19 15:16:51 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)15:16
opendevmeetMinutes:        https://meetings.opendev.org/meetings/ironic/2023/ironic.2023-06-19-15.00.html15:16
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/ironic/2023/ironic.2023-06-19-15.00.txt15:16
opendevmeetLog:            https://meetings.opendev.org/meetings/ironic/2023/ironic.2023-06-19-15.00.log.html15:16
JayFrpittau: landed that chain, assuming zuul cooperates15:45
JayFrpittau: anytime you have something like that pending review, and I don't have a +2 on it, and it's universally helpful, feel free to link to me directly ... especially in cases like last week where I'm not closely tuned into IRC/gerrit15:45
rpittauJayF: thanks, will do :)15:51
rpittauthe idea with that is that since we can control the timeout we should be able to see what fails in a reasonable time15:52
JayF++15:52
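
The idea behind a controllable per-test timeout can be illustrated with a small standard-library sketch: instead of the whole job hitting TIMED_OUT, a single overrunning test raises an error that names it. This is not how the patches in the chain do it (they adjust the tox configuration); run_with_timeout and the example timings are hypothetical.

```python
import threading
import time


def run_with_timeout(name, func, timeout):
    """Run func in a worker thread and fail with a named error if it overruns.

    Hypothetical helper for illustration only. The worker is a daemon
    thread, so an overrunning call is abandoned rather than killed.
    """
    result = {}

    def worker():
        try:
            result["value"] = func()
        except BaseException as exc:  # report the test's own failure as-is
            result["error"] = exc

    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    thread.join(timeout)
    if thread.is_alive():
        # The failure now identifies which test was slow.
        raise TimeoutError(f"{name} exceeded {timeout}s")
    if "error" in result:
        raise result["error"]
    return result["value"]


print(run_with_timeout("fast_test", lambda: 42, timeout=1.0))

try:
    run_with_timeout("slow_test", lambda: time.sleep(0.5), timeout=0.05)
except TimeoutError as exc:
    print(f"failed: {exc}")
```

The point of the design is diagnostic: a short, explicit limit turns an opaque job-level timeout into a failure attached to one test name.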
opendevreviewVerification of a change to openstack/ironic master failed: Add test timeout to tox config  https://review.opendev.org/c/openstack/ironic/+/88537216:02
opendevreviewVerification of a change to openstack/ironic master failed: Use tox env variables in coverage tests  https://review.opendev.org/c/openstack/ironic/+/88550716:02
opendevreviewVerification of a change to openstack/ironic master failed: Allow setting migrations timeout value from tox  https://review.opendev.org/c/openstack/ironic/+/88583716:02
rpittauJayF: one more thing if you have a moment https://review.opendev.org/c/openstack/ironic/+/86905216:02
JayF+2a16:02
rpittaummm the timeout patch failed not for timeout this time16:03
rpittauweird16:03
rpittauI'll have a look tomorrow16:04
rpittaugood night! o/16:04
opendevreviewVerification of a change to openstack/ironic master failed: Remove instance image after deployment  https://review.opendev.org/c/openstack/ironic/+/87868016:10
zorunJayF, TheJulia: following our NGS design discussions at the summit, I opened a RFE bug to discuss it further: https://bugs.launchpad.net/networking-generic-switch/+bug/202438517:07
zorunfeel free to chime in on the HA & hash-ring topic, this is the part I'm least knowledgeable about17:08
JayFI'd suggest posting it to the list with some context17:08
JayFto see if you can catch more interested parties17:09
JayFI know many of the folks at the summit interested do not often frequent irc17:09
zorunright, good idea17:09
opendevreviewVerification of a change to openstack/ironic master failed: Use jammy for base jobs  https://review.opendev.org/c/openstack/ironic/+/86905217:14
iurygregorygood morning Ironic17:24
* iurygregory is alive but still a bit tired of the travel...17:24
JayFGlad you managed to find your way home; looked like on socials you had a bit of a headache17:25
iurygregoryyeah =(17:35
iurygregoryif anyone has time today https://review.opendev.org/c/openstack/ironic/+/883062 https://review.opendev.org/c/openstack/ironic/+/885276/ I would appreciate o/19:16
JayFah, firmware interface19:19
JayFI will review but would like someone who cares more about the api design, e.g. dmitry, to be one of the cores signing off in the end if possible19:19
JayFjust stating I'm reviewing for code correctness, less so for api correctness and friendliness :D 19:19
iurygregoryack19:19
iurygregoryno worries19:19
iurygregoryCI still in a bad shape right?19:20
* iurygregory checks weekly meeting logs19:20
JayFI think we're all basically a week outta date around that19:20
JayFrpittau's change to add timeouts was approved by me, but it failed CI19:21
JayFin what looked like a suspicious failure in both standalone jobs19:21
JayF(a -maybe- net new issue? oh boy!)19:21
iurygregoryI just saw this patch https://review.opendev.org/c/openstack/ironic/+/885276/19:22
JayFyeah even our failures are kinda probabilistic19:22
iurygregoryyup =(19:23
iurygregoryjust me or zuul is really slow today?19:23
iurygregoryfrom the results in this patch I think TheJulia had some patch to change something related to ConnectionEvents..19:24
JayFgiven the failures in our tests that we were tracking before were sqlite19:26
JayFer, I mean19:26
JayFwere mysql/postgres migration tests19:26
JayFit's hard to imagine that sqlite change being impactful19:26
JayFbut I'd believe a lot of stuff at this point lol19:26
iurygregoryyup!19:27
JayFhttps://github.com/openstack/ironic/commit/75b881bd31952b8f858b4708fb83f7d2ca7c42cc might be what you were thinking of, yeah?19:27
JayFor at least one of the changes in there19:28
iurygregoryhttps://review.opendev.org/c/openstack/ironic/+/88579719:29
iurygregorythis one19:29
JayFthat's just fixing up an error in sql2.0 unit tests19:29
JayFthat don't impact <2.019:29
JayFjust a relocated thing19:29
iurygregoryyeah, this would fix what we see in https://review.opendev.org/c/openstack/ironic/+/885276/ I think19:30
iurygregorybut doesn't solve the weird failures we see in our CI .-.19:30
TheJuliaWe could cast unit tests with different parameters most likely19:31
JayFthe best path forward for CI, I think, is getting rpittau's timeout changes in19:31
TheJuliaWell, launch. There is a special code path19:31
JayFso we change TIMED_OUT failures to actually troubleshootable errors19:31
JayFTheJulia: I don't understand19:31
TheJuliaSo if it is the sqlalchemy changes from like early may, we can change it to isolate the code path since unit tests get launched in a different way19:32
* JayF wonders if https://github.com/openstack/ironic/commit/75b881bd31952b8f858b4708fb83f7d2ca7c42cc#diff-caf8ada388bea74c1068bdaa9b41e7243bd2aef6a37697db86790bc9e2fc8c10R30 is hooked up in a way to force sequential execution19:33
JayFor if it's susceptible to an error where the connection goes away before it can execute19:33
JayFI can imagine bugs which would cause our behavior in something like this19:33
TheJuliaSo there is a whole separate db launch path in the db tests code because of the pre population of schema19:53
JayFwe likely don't use that path for migration testing then, right?19:53
JayFsince we don't want a schema there?19:54
TheJuliaI just checked into my last hotel, flying back early tomorrow19:54
TheJuliaOh, that is a super good question19:54
JayFit would finally explain the "why is this different" problem, if that's the case19:54
JayFI *must* be missing something20:02
JayFhttps://github.com/openstack/ironic/blob/master/ironic/tests/unit/db/sqlalchemy/test_migrations.py#L134620:02
JayFthat self.engine looks like it's a MagicMock()20:02
JayFbut there are some that inherit from it20:04
JayFand must inject an engine20:04
TheJuliaAs well as migration_api20:06
