Friday, 2023-07-07

opendevreviewVerification of a change to openstack/ironic master failed: Add DB API for Firmware and Object  https://review.opendev.org/c/openstack/ironic/+/88306201:56
opendevreviewMerged openstack/ironic master: Add hold steps  https://review.opendev.org/c/openstack/ironic/+/87906004:28
opendevreviewVerification of a change to openstack/ironic master failed: Utilize the JSON-RPC port  https://review.opendev.org/c/openstack/ironic/+/87921504:28
opendevreviewVerification of a change to openstack/bifrost master failed: Fix key-order[task] linter warnings  https://review.opendev.org/c/openstack/bifrost/+/87485405:53
opendevreviewMerged openstack/bifrost stable/2023.1: chore: allow ironic-inspector to work with IPv6 disabled  https://review.opendev.org/c/openstack/bifrost/+/87749406:08
rpittaugood morning ironic! Happy Friday! o/06:55
opendevreviewVerification of a change to openstack/ironic master failed: Remove python 3.6 mock hack  https://review.opendev.org/c/openstack/ironic/+/88702307:03
dtantsurJayF, rpittau, any plans to release python-ironicclient?08:40
opendevreviewDmitry Tantsur proposed openstack/bifrost master: [PoC] Life without Inspector  https://review.opendev.org/c/openstack/bifrost/+/88793408:42
opendevreviewDmitry Tantsur proposed openstack/bifrost master: [PoC] Life without Inspector  https://review.opendev.org/c/openstack/bifrost/+/88793408:43
rpittaudtantsur: I think the release was blocked by JayF because we were waiting for a patch to go in09:02
rpittaunvm it was merged yesterday https://review.opendev.org/c/openstack/releases/+/88749709:04
dtantsurah, hmm, apparently I cannot read09:14
iurygregorygood morning Ironic11:43
opendevreviewDmitry Tantsur proposed openstack/bifrost master: [PoC] Life without Inspector  https://review.opendev.org/c/openstack/bifrost/+/88793412:07
opendevreviewDavid Hill proposed openstack/ironic stable/train: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88795413:17
TheJuliagood morning13:28
dtantsurmorning TheJulia 13:28
opendevreviewDmitry Tantsur proposed openstack/bifrost master: [PoC] Life without Inspector  https://review.opendev.org/c/openstack/bifrost/+/88793413:41
iurygregorygood morning TheJulia 13:54
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Build tinyipa with tinycore 14.x  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/88775414:17
TheJuliaoh osc format, you make my head hurt14:30
opendevreviewMerged openstack/ironic master: Remove python 3.6 mock hack  https://review.opendev.org/c/openstack/ironic/+/88702314:41
opendevreviewDmitry Tantsur proposed openstack/bifrost master: [PoC] Life without Inspector  https://review.opendev.org/c/openstack/bifrost/+/88793414:58
iurygregoryI'm giving up in trying to understand what is happening in the standalone jobs .-. 15:04
iurygregorynext week I will be out, so not much time to spend on it .-.15:05
TheJuliarevert jammy?15:05
iurygregorywas thinking about this tbh15:05
TheJuliait had to have passed at some point... why does it continue to fail though is the question15:05
TheJulia... i mean aside from DHCP hating us15:05
iurygregoryyeah15:06
iurygregorydidn15:06
iurygregoryops15:06
iurygregorydidn't we had some similar issue in the past?15:06
iurygregoryand we decided to pin to an old version or something?15:06
opendevreviewJulia Kreger proposed openstack/python-ironicclient master: WIP: Parent_node support  https://review.opendev.org/c/openstack/python-ironicclient/+/88753515:06
iurygregoryor I have a lot of things in my mind lol15:06
TheJuliawell, we had issues with ipxe in general failing15:07
TheJuliabut it looks like things are partially working there15:07
TheJuliawe've had to pin ipxe15:07
TheJuliaI guess we just need to reproduce, really15:07
rpittauwe have another job not running on jammy already, but it was constantly failing -> ironic-tempest-wholedisk-bios-snmp-pxe15:10
rpittauthe standalone job passed most of the times in the patch and I was not able to reproduce the issue locally15:10
iurygregoryhttps://review.opendev.org/c/openstack/ironic/+/869052 looking at it I see similar failures for the standalone .-.15:14
iurygregorythe same response timeout etc 15:15
iurygregory.-.15:15
TheJuliaso... I guess my challenge at the moment is you guys sort of said opposite things, at least that is how my brain interpretted it15:15
rpittauyeah, but it passed most of the times15:15
TheJuliaAnd we have a ton of variables at play, is there a variable we know we can unwind in this?15:16
TheJulia... (realistically, aside from jammy would be nice, but also being realistic I know that is the huge variable at the moment unless we can reproduce it locally15:16
iurygregoryand looking at the history seems like it much worse after we switched to jammy .-.15:17
iurygregoryhttps://zuul.opendev.org/t/openstack/builds?job_name=ironic-standalone-redfish&project=openstack%2Fironic&branch=master&skip=0 15:17
rpittauagain, I was not able to reproduce the problem when I tried locally on jammy, but if you think moving back to focal for standalone would improve the situation, I have nothing against it15:18
iurygregoryon my mind switching to focal would be the last thing to try15:19
rpittauI doubt I will have the time to troubleshoot further next week anyway15:19
TheJuliaiurygregory: why would it be the last thing to try?15:20
TheJuliaif it does suddenly start behaving as expected, that is a solid data point15:20
TheJuliawe can then print t-shirts, something along the lines of "jammy ipxe == pain"15:20
iurygregoryperfect!15:21
rpittauI don't exclude jammy is the problem, considering that one other job fails 100% of the times15:21
TheJuliawhich job is that?15:21
rpittauironic-tempest-wholedisk-bios-snmp-pxe15:21
iurygregorylol https://askubuntu.com/questions/1420601/dns-not-working-on-22-04-server15:21
iurygregoryI just googled jammy ipxe and this was the first thing on google for me15:22
TheJuliahttps://zuul.opendev.org/t/openstack/builds?job_name=ironic-tempest-wholedisk-bios-snmp-pxe15:22
rpittauTheJulia: that job is pinned to focal now cause it was never passing in the jammy patch15:23
rpittausorry, I need to leave now, if you decide to pin standalone to focal I will approve it15:23
rpittauor we can continue talking on Monday15:23
rpittauhave a great weekend o/15:23
TheJuliahave a wonderful weekend rpittau 15:24
iurygregoryhave a great weekend rpittau 15:24
TheJulialets pin the job to focal, and then start a single patch to work the effort to unpin the three jobs15:26
TheJuliaand use that for troubleshooting/diagnosis15:26
iurygregoryok15:26
iurygregorywant me to push the patch to pin? 15:27
TheJuliaif you wouldn't mind doing so, I'd appreciate it15:29
iurygregoryfinishing to write the commit message 15:30
opendevreviewIury Gregory Melo Ferreira proposed openstack/ironic master: Move standalone jobs to focal  https://review.opendev.org/c/openstack/ironic/+/88797115:31
opendevreviewIury Gregory Melo Ferreira proposed openstack/ironic master: [DEBUG] Moving back to Jammy  https://review.opendev.org/c/openstack/ironic/+/88797215:34
iurygregoryand the debug patch =) 15:34
opendevreviewJulia Kreger proposed openstack/ironic master: DNM Enable OVN  https://review.opendev.org/c/openstack/ironic/+/88508715:35
TheJuliaso I didn't realize the job was still on focal, that might just sort of work now15:38
* TheJulia crosses fingers and waits15:38
iurygregoryTheJulia, seems like your patch had a merge conflict https://review.opendev.org/c/openstack/ironic/+/879215 15:38
TheJuliayeah, hold steps15:39
* TheJulia fixes15:39
dtantsurmm, interesting https://review.opendev.org/c/openstack/governance/+/88796615:41
TheJuliaI'm very concerned by the fact we would need to opt-in15:42
TheJuliato keep a branch, that is15:42
dtantsurI guess that's similar to our past "if there is no vendor behind it, it's not going to last" discussion15:47
opendevreviewJulia Kreger proposed openstack/ironic master: Utilize the JSON-RPC port  https://review.opendev.org/c/openstack/ironic/+/87921515:48
TheJuliayeah15:48
TheJuliaChallenge is there seems to have been a classical disassociation and automatic assumption15:48
TheJuliafixed the merge conflict on the release mapping changes that happened in other patches15:50
TheJuliaalso fixed the reno, since I missed the version update last time I had to rebase it15:50
iurygregoryack, I will check after lunch15:51
TheJuliaiurygregory: it just occured to me what is happening in https://zuul.opendev.org/t/openstack/build/5a4438deadf94548b4df9d944b579493/log/controller/logs/ironic-bm-logs/node-1_console_log.txt15:51
TheJuliawe're not actually changing over to the other loader15:52
TheJuliawell, to the config pointer from ipxe15:52
iurygregoryhumm15:52
TheJuliaThat seems like a hard break in ipxe behavior15:52
iurygregory"perfect"15:52
TheJuliabut then there are other questions, but it could be just different builds15:52
iurygregoryI saw another different error, the one where we just timeout when trying to start and can't download NBP15:53
* iurygregory brb lunch15:54
TheJulia... why wouldn't it16:00
dtantsurmmm, snponly.efi just crashes? lovely..16:07
TheJuliaerm... Jul 06 18:55:19.592727 np0034598195 neutron-server[60220]: DEBUG neutron_lib.api.validators [req-76cdd891-8a79-4869-bb7d-9c6ee1e987c6 req-eca8ce79-251a-4289-a997-9445f57a1479 service ironic] tag:!ipxe,67 is not in valid_values {{(pid=60220) validate_values /usr/local/lib/python3.10/dist-packages/neutron_lib/api/validators/__init__.py:165}}16:11
TheJuliaI'm not sure it is crashing16:11
opendevreviewDmitry Tantsur proposed openstack/bifrost master: [PoC] Life without Inspector  https://review.opendev.org/c/openstack/bifrost/+/88793416:12
dtantsurTheJulia: ouch! are we on OVN now?16:12
TheJulianope16:12
TheJuliahttps://github.com/openstack/neutron-lib/blob/master/neutron_lib/api/validators/__init__.py#L166C41-L166C4116:13
dtantsurI wonder if valid_values got changed16:16
dtantsuranyway, see you next week o/16:21
TheJuliaI'm not sure it is filtering it out16:21
TheJuliaso the port has both a v4 and a v6 address at the same time16:23
TheJulia... I wonder if that confuses dnsmasq16:23
TheJuliasince we end up trying to do both parameters16:23
TheJuliai think we need a tcpdump capture16:45
TheJuliaor to precisely reproduce16:45
TheJuliaor just intercept a ci node16:45
iurygregoryhummm16:48
iurygregorywe can probably compare how it was in focal (if we had v4 and v6)16:49
TheJulia++16:51
TheJuliaI've asked for the next standalone job build failure to get held for us16:51
TheJuliait looks like it is using the v4 options, given it logs them separately16:53
iurygregoryperfect the standalone failed in focal also 16:57
JayFTheJulia: dtantsur: I am in full support of that TC res, and helped Kristi edit it some. If there is feedback from Ironic-in-general I'm happy to share it, but you can trust that I won't support something that'd restrict our ability to support releases (especially since we have our own (broken, lol) CI infra)16:57
iurygregoryhttps://zuul.opendev.org/t/openstack/build/31cdb311f65e48629cc3ead0ea00e649/log/controller/logs/ironic-bm-logs/node-1_console_log.txt16:58
JayFthat being said I'm very close to turning into a pumpkin for my trip to london16:58
opendevreviewDavid Hill proposed openstack/ironic stable/train: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88795417:00
TheJuliaJayF: I'm more worried that it eradicates the ability for those loosely engaging to share patches without becoming fully engaged in upstream development, which is a hard lift for some of the ops folks out there that are supporting older releases. in other words, we end up heading down a path of unwinding what they sought because extra assumptions got piled in past the original simple ask of ops in Sydney from the 2018 17:01
TheJuliachange17:01
JayFTheJulia: I'll note that 1) those operators don't really exist in practice, or at least have not moved the needle in contribution and 2) that a supported branch requires someone committing to support it, not just someone willing to toss patches up occassionally17:02
JayFso I think we might just disagree here; I'd rather us take the bitter pill of saying "we don't have enough folks to support this" than misleading operators into thinking something is supported when, in reality, it isn't17:02
TheJuliaI think some of the folks in the room were disagreeing to the first perception, to be entirely honest. Granted, most of the time it is the upstream developers trying to be nice people too, so it just makes it cloudy in the fog sort of way17:03
TheJuliaand the latter is obviously an issue, even thought hat is not what the operators were explicitly asking for upfront17:03
TheJuliawe, as a community, expanded the requirements17:03
JayFI think it'll be incredibly difficult to get contributors -- including me -- to agree to a system where we leave something open and hope that some kind people will toss patches into it17:04
TheJuliayeah17:05
JayFbecause then someone runs a half-broken Ironic $oldVersion and we get hit with the bad PR17:05
TheJuliawe inherently want it working too, by default17:05
JayFnobody is going to draw a line when theregister.com publishes "Ironic breached at $megaCorp, 10k servers pwned"17:05
JayFfrom $megaCorp using an open branch that we said was in '$anyWord maintenance"17:05
TheJuliaagree completely17:06
JayFthat's mainly where the buck stops for me; watering down branch support requirements to get more contributors is a nice idea; but we can't ship low quality software without it damaging us17:06
JayFso instead, flip the default to retire-first and allow people to opt in to longer maintenance, and have that maintenance guaranteed by a person willing to say that they'll maintain the branch17:07
TheJuliaI agree, the challenge though, is will we get fixes like dave hill's change ever if there is not a path17:07
* TheJulia tries to convince dave hill to maybe change it a little more17:07
JayFI don't think we're doing people operating train that much of a favor by enabling them to continue to run 3 year old software :/ 17:08
JayFand I've worked places where the ... generosity of EM support was used as a method to keep delaying upgrade projects17:08
JayFthat's ^ not really playing into my TC-resolution-appoving logic, but I do think it's the truth17:09
opendevreviewDavid Hill proposed openstack/ironic stable/train: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88795417:09
TheJuliathat is really just going to create more work for developers who work for firms who support those versions, unfortunately. At the same time, it is best for upstream to not feel encumbered by it.17:14
JayFI'll note I worked places that would sometimes do that stable branch work on their own17:15
JayFbecause backporting the patches into our set of patches was easier than rebasing everything we had on top17:15
JayFI sure hope nobody has that many patches at this point, though17:15
TheJuliayeah, nobody wants to be that deep down a rabbit hole17:18
* TheJulia tries to convince dave to add a unit test17:20
iurygregorylooking at a failed job and green one from standalone, they have same message in the neutron server "tag:!ipxe,67 is not in valid_values"17:30
TheJuliaI'm thinking they might just be red herrings17:31
TheJuliabecause if they were disregarded we wouldn't get anywhere17:31
iurygregoryyup, agree17:32
opendevreviewJulia Kreger proposed openstack/ironic master: DNM Enable OVN  https://review.opendev.org/c/openstack/ironic/+/88508717:35
TheJuliaI'm seeing if we can recheck https://review.opendev.org/c/openstack/ironic/+/887271 and get the ironic-standalone node held if it fails17:37
TheJuliasince obviously we failed in a different way it seems17:38
iurygregoryack17:39
iurygregoryoh nvm seems like in focal is working, I was looking at the debug patch <facepalm>18:03
iurygregoryhttps://review.opendev.org/c/openstack/ironic/+/88797118:03
opendevreviewJulia Kreger proposed openstack/ironic master: Fix db migration tests for sqlalchemy 2.0  https://review.opendev.org/c/openstack/ironic/+/88743218:15
opendevreviewJulia Kreger proposed openstack/ironic master: Add job to test with SQLAlchemy master (2.x)  https://review.opendev.org/c/openstack/ironic/+/88602018:16
opendevreviewDavid Hill proposed openstack/ironic stable/train: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88795418:31
TheJuliafiling under: maybe it was eventlet all along: https://paste.opendev.org/show/bLg4eqoQnZBS1PphUu63/18:32
TheJuliaJayF: ^18:32
opendevreviewDavid Hill proposed openstack/ironic stable/train: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88795418:33
TheJuliacould I get another review on https://review.opendev.org/c/openstack/ironic-python-agent/+/882367 ?18:40
opendevreviewDavid Hill proposed openstack/ironic stable/train: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88795418:52
JayFTheJulia: what does that mean, though18:57
JayFI'm terms of repair action 18:57
JayF**in18:57
opendevreviewJulia Kreger proposed openstack/ironic master: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88799018:58
TheJuliadunno18:58
TheJuliawalking to lunch for a little bit18:58
JayFI'm sus that excluding os is finally coming home to roost https://github.com/openstack/ironic/blob/master/ironic/cmd/__init__.py#L2619:05
JayFI'll look at open bugs and such but won't really have time to go deep.19:06
JayF(today)19:06
ashinclouds[m]Eh, that shouldn’t cause unit test issues… at least I don’t think19:06
* ashinclouds[m] awaits her salad19:06
opendevreviewDavid Hill proposed openstack/ironic stable/train: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88795419:09
opendevreviewDavid Hill proposed openstack/ironic master: Make sure there's no stale path_tmp  https://review.opendev.org/c/openstack/ironic/+/88799119:14
iurygregoryTheJulia, when you have time https://review.opendev.org/c/openstack/ironic/+/887971 o/19:33
TheJuliaack, still hoping to snag us a CI node to diagnose before that merges19:39
TheJulia:)19:39
iurygregorygotcha =)19:47
opendevreviewJulia Kreger proposed openstack/ironic master: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88799019:58
JayFhttps://github.com/eventlet/eventlet/issues/79819:59
JayFinteresting one from gibi19:59
JayFnot sure it's ours but worth a glance19:59
TheJuliainteresting20:00
TheJuliawell20:00
TheJuliahmmm20:00
JayFheh it showing up in standalone job more often is interesting20:00
JayFbecause if nova is blcoking bad eventlets in requriements20:01
JayFwould that save us from a bad eventlet in tempest but not in standalone/unit tests?20:01
JayFuh, yeaaahhh20:01
JayFthat might check20:01
JayFscience20:02
opendevreviewJay Faulkner proposed openstack/ironic master: Limit to eventlet 0.30.1 and newer  https://review.opendev.org/c/openstack/ironic/+/88799620:02
JayFwe allow a lot older eventlet than nova20:03
JayFI'll poke at that from the airport tomorrow20:03
JayFfwiw, gibi's reported bug is .33.1 so these are opposed ideas (if that bug impacts us, we are already on newer eventlet)20:04
* JayF is going to be mostly out of IRC for the rest of the day, but will be around some tomorrow killing time in the airport20:05
iurygregorysafe travels JayF =)20:15
opendevreviewJulia Kreger proposed openstack/ironic master: Cleanup if images.fetch fails  https://review.opendev.org/c/openstack/ironic/+/88799020:24
opendevreviewVerification of a change to openstack/ironic-python-agent master failed: Allow md5 to be disabled from the conductor  https://review.opendev.org/c/openstack/ironic-python-agent/+/88236720:40
JayFfyi standalone failed on 887996 and freeze shows eventlet==0.33.320:58
opendevreviewJay Faulkner proposed openstack/ironic master: DNM: Eventlet science  https://review.opendev.org/c/openstack/ironic/+/88799621:05
TheJuliait feels super close with jammy and ovn21:06
JayFit'll be great when you get that working21:15
JayFI know you've been poking it for ages21:16
TheJuliait looks like either it doesn't see the packets, or I've got the bind wrong, or it is not setting up dhcp21:17
TheJuliaI won' thave new clues until next week since I've messaged one of the ovn folks21:17
* TheJulia lets CI spin and hopefully we'll have a node isolated soon to dig into21:19
iurygregorybye everyone, see you on July 17 o/21:56
opendevreviewVerification of a change to openstack/ironic master failed: Fix the HTTP code of the BadRequest exception  https://review.opendev.org/c/openstack/ironic/+/88727123:12

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!