Friday, 2023-11-17

opendevreviewJulia Kreger proposed openstack/ironic master: WIP: Add HTTP versions of network boot interfaces  https://review.opendev.org/c/openstack/ironic/+/900965 00:17
opendevreviewJulia Kreger proposed openstack/ironic master: DNM: CI test for httpboot jobs  https://review.opendev.org/c/openstack/ironic/+/901182 00:17
opendevreviewJulia Kreger proposed openstack/ironic-tempest-plugin master: Add snmp variant of ramdisk iso boot test  https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/901213 01:06
TheJuliathe first http boot in CI https://a37003535f40ac4a096b-3fa6c7b03298e65764ba8baf4a8a5f26.ssl.cf2.rackcdn.com/901182/3/check/ironic-tempest-bfv/b6bba8f/controller/logs/ironic-bm-logs/node-0_console_2023-11-17-01%3A10%3A05_log.txt 01:24
opendevreviewJulia Kreger proposed openstack/ironic-tempest-plugin master: Add snmp variant of ramdisk iso boot test  https://review.opendev.org/c/openstack/ironic-tempest-plugin/+/901213 01:50
opendevreviewJulia Kreger proposed openstack/sushy-tools master: Simplify UEFI logic and change the UefiHttp flow  https://review.opendev.org/c/openstack/sushy-tools/+/901208 02:00
TheJuliahttps://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_3a9/901182/3/check/ironic-tempest-ipa-partition-uefi-pxe-grub2/3a92312/controller/logs/ironic-bm-logs/node-0_console_2023-11-17-01%3A13%3A00_log.txt <-- second, grub. *almost* there. just needs the extra grubby things deposited02:01
timeuHi all, I noticed a difference in behavior when deploying CentOS 7 vs RHEL8 on a baremetal node using a whole disk image (with EFI partition): In the CentOS 7 case the root partition is not automatically grown to the size of the disk (it stays at 4GB = size of the image) while in the RHEL8 case the root partition is grown to the size of the disk. Is this a known issue?08:02
timeuboth deployments are done with the same IPA image (ussuri). Ironic version is the one from OSP16.208:05
dtantsurtimeu: it could be a difference in cloud-init versions or configuration. That's what grows your partitions. It's also possible that in C7 the root partition is not the last on the disk.08:39
timeu@dtantsur: thanks for the info. I was wondering what is responsible for growing the partition. As far as I can tell the partition schema is the same for both OS (EFI, BIOS boot, root, 64 MB linux fs at the end). 08:51
timeuso ironic just writes the image to the disk and the cloud-init of the image will take care of growing the partition 08:51
dtantsurcorrect09:03
dtantsurTheJulia: really nice! (re HTTP boot). Is it redfish or DHCP?09:10
iurygregorymorning Ironic11:25
TheJuliadtantsur: both the logs are dhcp based. Redfish is being a little problematic in that I added a sanity check into the driver code to generate an exception and fail validation if the hardware doesn't support it which seemed reasonable to me. And then I had a bad conditional that was not quite working causing the template to not work. I can't run any of the unit tests locally for sushy-tools since I'm on debian and it requires the libvirt module to be compiled :\13:44
TheJuliaand it is not like there really are many unit tests in sushy-tools13:52
-opendevstatus- NOTICE: Gerrit will be unavailable for a short time starting at 15:30 UTC as it is upgraded to the 3.8 release. https://lists.opendev.org/archives/list/service-announce@lists.opendev.org/thread/XT26HFG2FOZL3UHZVLXCCANDZ3TJZM7Q/ 14:06
timeu@dtantsur: FYI: using the latest centos7 cloud image seems to have fixed the issue14:22
dtantsurgood!14:36
TheJuliasigh.... 15:33
opendevreviewJulia Kreger proposed openstack/sushy-tools master: Simplify UEFI logic and change the UefiHttp flow  https://review.opendev.org/c/openstack/sushy-tools/+/901208 15:43
-opendevstatus- NOTICE: The Gerrit upgrade is complete, however we have Zuul offline in parallel for a schema migration, so any events occurring during this time will be lost (requiring a recheck or similar to trigger jobs once it returns to service); we'll update again once this is complete.16:34
-opendevstatus- NOTICE: Zuul is fully back in service now, but any events occurring prior to 17:05 UTC may need a recheck to trigger jobs.17:13
JayFTheJulia: I am hunting thru email history for the emails about the ARM CI project proposed for a university group last cycle17:28
JayFTheJulia: trying to find it to re-propose it for the New Mexico State thing posted to the list this week17:28
dtantsurJFYI I think https://github.com/dtantsur/ironic-operator has reached the state where an interested person could start playing with it. Beware: rough edges everywhere, nearly untested.17:28
dtantsurHave a nice weekend o/17:28
JayFdtantsur: is that for a single process/sqlite17:28
JayFor a full throttle ironic?17:28
JayFhave a nice one o/ see you in a week (I'm gone all next week)17:28
dtantsurJayF: single process, supports sqlite and mariadb17:28
JayFack17:29
dtantsurah, nice, enjoy!17:29
iurygregorySo, I was looking at the idea of caching https://opendev.org/openstack/ironic-python-agent/src/branch/master/ironic_python_agent/hardware.py#L237 to avoid spending a lot of time when the hardware has a lot of devices. After looking at our code base for a while, my idea is to include the information we get in _get_multipath_parent_device when it's called in list_all_block_devices https://opendev.org/openstack/ironic-python-agent/src/branch/master/ironic_python_agent/hardware.py#L566 into the BlockDevice. Does this make sense or am I on the wrong path? If anyone has ideas please let me know =)17:29
dtantsuractually, it has code to support 3 ironic instances (1 per control plane node), but that probably does not work correctly, especially with sqlite17:29
dtantsuro/17:29
dtantsurs/sqlite/dnsmasq/ in the previous message (of course, it does not work with sqlite, but that's expected)17:30
JayFI've mentioned this a few places, going to say it here: it's my last day online working until Nov 27 (a week from Monday). If you need something from me speak soon :)17:39
JayFiurygregory: I wonder if that would cause the fault tolerance to be less-tolerant of faults17:40
JayFiurygregory: e.g. if a failure in one path occurs between "when the cache was pulled" and "when we use it", will it be problematic?17:40
iurygregoryJayF, from the previous discussion we had (in Oct) I don't think this would be a problem17:42
JayFack17:42
JayFI don't know much about multipath in general17:42
JayFthat was just the thought I had :)17:42
iurygregoryI had the link handy if you want to check the irc conversation we had17:42
TheJuliaJayF: ack, so fwiw, the week before christmas, I'll be in NM ~30 minutes from UNM17:43
* JayF wonders if NM means something different than New Mexico17:43
JayFoooooh17:43
JayFI just grasped the relevance17:43
JayFsorry, had to pick the context up off the ground lol17:43
* TheJulia is always happy to go have a cup of coffee in new mexico with people17:43
TheJuliaiurygregory: trying to grok/think through it17:45
TheJuliaiurygregory: so, if you mean by that, so we return the md device *instead* of the base devices, sure.17:46
iurygregorymaybe, ok I've found something strange when looking at the logs and our code .-.18:21
iurygregory"WARNING root [-] We have identified a multipath device sdei, this is being ignored in favor of (35000c500d706f623) and its related child devices"18:21
iurygregory(35000c500d706f623) shouldn't be dm-46? .-.  https://paste.opendev.org/show/bxyDYugFagTGnWNgSYoU/ 18:22
iurygregorymy idea is that we could have some mapping when we execute the multipath -ll <device> and we could have {'sdei': 'dm-46', 'sdbc': 'dm-46', 'sdho': 'dm-46', 'sdku': 'dm-46', ....}, and we wouldn't need to check 3 devices (this would improve the time), not even sure if it would make sense ...18:26
iurygregoryif we don't have the information for the device we normally execute _get_multipath_parent_device (and when doing so we save this information and keep it) so when we go to verify sdbc we won't execute _get_multipath_parent_device18:27
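[editor's note] The mapping idea above can be sketched roughly as follows. This is only an illustration, not the ironic-python-agent implementation: `MultipathParentCache` and the injected `lookup` callable are hypothetical names, and `lookup` merely stands in for whatever a single `multipath -ll <device>` call would yield (the parent dm device plus all of its path devices).

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical result of one lookup: (parent dm device, all path devices),
# or None when the device is not part of a multipath map.
LookupResult = Optional[Tuple[str, List[str]]]


class MultipathParentCache:
    """Hypothetical helper that remembers the parent of every known path."""

    def __init__(self, lookup: Callable[[str], LookupResult]):
        self._lookup = lookup  # assumed stand-in for `multipath -ll <device>`
        self._parents: Dict[str, Optional[str]] = {}
        self.lookups = 0  # counts how often the expensive call was made

    def parent_of(self, device: str) -> Optional[str]:
        # Serve sibling paths (and repeated queries) from the cache.
        if device in self._parents:
            return self._parents[device]
        self.lookups += 1
        result = self._lookup(device)
        if result is None:
            # Remember negative answers too, so plain disks are checked once.
            self._parents[device] = None
            return None
        parent, paths = result
        # One lookup fills in every path that belongs to the same dm device.
        for path in paths:
            self._parents[path] = parent
        self._parents[device] = parent
        return parent
```

With a cache like this, asking about sdei would also record sdbc, sdho and sdku, so later checks of those devices skip the external call. JayF's concern still applies: entries are never refreshed, so a path that fails after being cached keeps its old answer.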
TheJuliaso I guess the thing is we should be ignoring the underlying devices completely19:18
TheJuliaif there is a dm-device19:18
TheJuliaI guess this might just be easier to mentally model with code19:23
TheJulia15T device 8|19:23
iurygregoryyeah, in case we find one we could ignore the others 19:31
* iurygregory trying to get a parser for the output that would work 19:32
iurygregoryTheJulia, we probably need to update our current code, to provide the right device (not the numbers right?)19:33
iurygregorywondering if output differs based on the fiber channel etc 19:33
iurygregorysince I know the current code we have worked for a Customer when we had an escalation19:34
opendevreviewMerged openstack/bifrost master: ironic: Perform online data migrations with localhost DB  https://review.opendev.org/c/openstack/bifrost/+/895948 20:57
opendevreviewJulia Kreger proposed openstack/sushy-tools master: Simplify UEFI logic and change the UefiHttp flow  https://review.opendev.org/c/openstack/sushy-tools/+/901208 21:51
TheJuliaiurygregory: what do you mean device numbers?21:51
TheJuliaiurygregory: sorry, had my 1-on-1 and got distracted21:52
iurygregoryTheJulia, currently our code thinks the output is always in this format https://opendev.org/openstack/ironic-python-agent/src/branch/master/ironic_python_agent/hardware.py#L271-L280 21:53
iurygregoryin the bug I'm working on we are returning "(35000c500d706f623)" instead of "dm-46"21:53
iurygregoryhttps://paste.opendev.org/show/bxyDYugFagTGnWNgSYoU/ 21:54
TheJuliaoh!21:54
TheJuliayeah21:54
TheJuliait would be21:54
iurygregoryso we can't always think the second element of the split is the right thing21:54
TheJuliaewwwwww21:54
iurygregory=(21:54
TheJuliaugh21:54
iurygregorymy idea so far is to just check the split and grab the one that startswith "dm"21:55
iurygregorynot sure if it is the right approach...21:55
iurygregorybut it works on my test here using the output from the bug XD21:56
iurygregoryI will finish things and push the patches on monday 22:00
iurygregorygoing to turn off the laptop o/ have a great weekend 22:01
TheJuliaiurygregory: that is likely a good approach22:01
iurygregoryack o/22:02
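[editor's note] A minimal sketch of the approach iurygregory describes above: scan the split header line for the dm token instead of trusting a fixed position. `find_dm_device` is a hypothetical name, not the function in ironic-python-agent, and the sample header lines in the usage note are illustrative rather than copied from the bug. The sketch matches on the stricter prefix "dm-" rather than "dm", which is an assumption made to lower the chance of matching an unrelated token.

```python
from typing import Optional


def find_dm_device(header_line: str) -> Optional[str]:
    """Return the first whitespace-separated token that looks like dm-N."""
    for token in header_line.split():
        # Position-independent: works whether or not an alias precedes
        # the WWID, which is what shifts the dm token to another index.
        if token.startswith('dm-'):
            return token
    # No dm token found; the caller decides how to handle that.
    return None
```

For example, `find_dm_device('mpatha (35000c500d706f623) dm-46 VENDOR,PRODUCT')` and `find_dm_device('35000c500d706f623 dm-46 VENDOR,PRODUCT')` both yield 'dm-46', whereas always taking the second element of the split would return "(35000c500d706f623)" for the first line.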
opendevreviewJulia Kreger proposed openstack/ironic master: WIP: Add HTTP versions of network boot interfaces  https://review.opendev.org/c/openstack/ironic/+/900965 22:03