Thursday, 2021-08-26

iurygregory_good morning Ironic o/06:28
*** iurygregory_ is now known as iurygregory06:28
*** rpittau|afk is now known as rpittau07:24
rpittaugood morning ironic! o/07:24
iurygregorygood morning rpittau o/07:25
rpittauheu iurygregory :)07:25
rpittauor even hey07:25
iurygregoryheu sounds nice also :D07:26
dtantsurmorning ironic07:53
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Use native Centos 8 Stream for dib images  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/80595708:25
opendevreviewMark Goddard proposed openstack/bifrost stable/wallaby: Use stable/wallaby branch to install dependencies  https://review.opendev.org/c/openstack/bifrost/+/80610408:27
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Add uWSGI role and use systemd instead of emperor mode  https://review.opendev.org/c/openstack/bifrost/+/80610608:31
iurygregorygood morning dtantsur 08:32
opendevreviewAija Jauntēva proposed openstack/ironic master: Redfish RAID: Use identity instead of durable_name  https://review.opendev.org/c/openstack/ironic/+/80586608:43
opendevreviewAija Jauntēva proposed openstack/ironic master: Add RAID mode conversion for idrac-redfish  https://review.opendev.org/c/openstack/ironic/+/80411208:49
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Keystone: consolidate uWSGI config, drop non-existing plugin  https://review.opendev.org/c/openstack/bifrost/+/80612109:09
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Keystone: consolidate uWSGI config, drop non-existing plugin  https://review.opendev.org/c/openstack/bifrost/+/80612109:11
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Add uWSGI role and use systemd instead of emperor mode  https://review.opendev.org/c/openstack/bifrost/+/80610609:13
iurygregorytimeu, jungleboyj hey o/ since you are Lenovo experts, have you ever tested redfish virtual media with Lenovo SR650 (ThinkSystem) - Type 7X06 ? we are having some weird issues because the iso generated by ironic causes a Kernel Panic https://paste.opendev.org/show/808333/ , I'm wondering if there is something specific that we need to set in the XClarity controller (any required configuration or something). Thanks!09:14
opendevreviewMichal Nasiadka proposed openstack/bifrost master: Allow configuring enabled raid interfaces  https://review.opendev.org/c/openstack/bifrost/+/80612309:24
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Keystone: deprecate a separate admin service  https://review.opendev.org/c/openstack/bifrost/+/80612509:31
*** sshnaidm|afk is now known as sshnaidm09:38
timeuiurygregory09:41
timeuyes 09:41
timeuhad the same issue, was due to the size of the IPA09:41
iurygregoryhummm09:41
timeuwhich caused an error when loading the ramdisk09:41
timeuand ended up in a kernel panic09:42
timeubecause it couldn't load the root filesystem09:42
timeuwe had to swithc tot he upstream victoria IPA images that are smaller09:42
iurygregoryWOW09:42
timeuhowever we tested this on a SR530 09:42
timeubut I believe it might be the same issue09:42
timeuthe redhat upstream IPA images are too big as it seems09:42
iurygregoryyeah .-.09:42
timeusec I have a screenshot of the console 09:43
dtantsuroookay, now we have an example of something that will get worse if we switch to Stream..09:43
iurygregory:D09:43
* iurygregory wondering if there is a possible workaround =X09:43
* iurygregory will cry trying to solve the Lenovo bug now09:44
timeuwe tested this on the latest XCC/UEFI firmware 09:44
timeuwe get the error " can't allocate initrd" 09:44
iurygregoryI think they are using the latest XCC09:44
iurygregory6.6 or something09:44
timeuand then a kernel panic with VFS unable to mount root fs on unknown-block09:44
timeuthey should try the https://tarballs.opendev.org/openstack/ironic-python-agent/dib/files/ipa-centos8-stable-ussuri.kernel 09:45
iurygregorytime to buy more beer (I think I will need)09:45
timeuussuri centos8 ipa images and see if that fixes their issue ;-)09:45
timeuhehe yeah or vodka ;-)09:45
iurygregory++09:45
iurygregoryvodka is always good09:45
dtantsurmaybe we should indeed publish both Debian (smaller) images and Stream (larger)?09:46
iurygregorythis would probably solve for upstream09:46
timeuour 200 lenovo nodes would appreciate this :-p09:46
timeuI documented some of the Lenovo issues here btw: https://storyboard.openstack.org/#!/story/200885209:46
dtantsurrpittau: ^^09:46
iurygregoryoh nice!09:47
timeueventually once we have played around more with ironic, I can make a PR for the docs regarding the caveats with the Lenovo servers (we have SR530,SR630,SR650,SD950,SR850, SR670) so almost their entire fleet to test ;-)09:49
dtantsurwould be awesome09:50
iurygregory++09:50
rpittaummmm I guess we'll have to tweak a lot the centos8-stream images, they're currently ~200MB bigger (not compressed)09:59
* iurygregory brain stopped working09:59
rpittautimeu: do you happen to know the size limit of the ISO ?10:01
iurygregoryI know the iso downstream we generated was 466MB10:02
rpittauso we're probably around 400 MB 10:03
iurygregoryusing wallaby10:03
rpittaudtantsur: I agree publishing both would help, and we already have debian jobs voting, so it doesn't change too much, it's just a matter of adding the publishing job10:04
rpittauand of course in the latest patches the debian jobs are failing :D10:06
iurygregoryrpittau, round 3 ? :D10:06
rpittauheh round 3 not finished yet10:07
opendevreviewMerged openstack/bifrost stable/wallaby: Use stable/wallaby branch to install dependencies  https://review.opendev.org/c/openstack/bifrost/+/80610410:32
jeevaokay i get that i need a sperate PXE boot VLAN for IRONIC, but what must the "production" VLAN be in correlation to openstack ?10:59
jeevabecause kolla-ansible only wants these 3 network settings:11:00
jeevaironic_dnsmasq_dhcp_range: "192.168.5.100,192.168.5.110"11:01
jeevaironic_cleaning_network: "public1"11:01
jeevaironic_dnsmasq_default_gateway: 192.168.5.111:01
jeevalike the same network as my mellanox_openstack_vm-to-vm ?11:03
dtantsurmgoddard: ^^11:12
opendevreviewMichal Nasiadka proposed openstack/bifrost master: Allow configuring enabled raid interfaces  https://review.opendev.org/c/openstack/bifrost/+/80612311:12
mgoddardjeeva: what do you mean by a production VLAN?11:13
jeevatrying to wrap my head around creating an ironic setup11:24
jeevathe network part11:24
jeevait takes about a cleaning network & PXE boot network, but how does the IPMI network comes into play ?11:26
mgoddardjeeva: kolla doesn't care about the IPMI network, the controllers just need to be able to access it11:28
jeevaok cool11:29
jeevathanks11:29
rpittauyep, debian is broken11:44
jeevadoes the OS controllers need to be exposed on the same PXE-VLAN ? or just the cleaning network11:44
dtantsuriurygregory: do you maybe have a second for https://review.opendev.org/c/openstack/sushy/+/805367 ?11:45
dtantsurapparently, there was a wrong deadline for non-client libraries, and we now have a chance of sneaking more stuff in11:45
rpittauoh cool11:45
iurygregorydtantsur, sure! 11:45
dtantsurI've asked to hold https://review.opendev.org/c/openstack/releases/+/806115 until this merges11:45
iurygregorytks dtantsur 11:51
opendevreviewMichal Nasiadka proposed openstack/bifrost master: Add support for being dhcp relay target  https://review.opendev.org/c/openstack/bifrost/+/80448212:00
opendevreviewDmitry Tantsur proposed openstack/bifrost master: CI: copy bifrost logs to the log directory  https://review.opendev.org/c/openstack/bifrost/+/80615512:07
opendevreviewDmitry Tantsur proposed openstack/bifrost master: CI: collect keystone information  https://review.opendev.org/c/openstack/bifrost/+/80615612:07
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Keystone: deprecate a separate admin service  https://review.opendev.org/c/openstack/bifrost/+/80612512:07
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Keystone: deprecate the separate admin service  https://review.opendev.org/c/openstack/bifrost/+/80612512:07
opendevreviewMichal Nasiadka proposed openstack/bifrost master: Add support for being dhcp relay target  https://review.opendev.org/c/openstack/bifrost/+/80448212:32
opendevreviewMerged openstack/sushy master: Support credentials for VirtualMedia URI  https://review.opendev.org/c/openstack/sushy/+/80536712:46
opendevreviewDmitry Tantsur proposed openstack/bifrost master: CI: copy bifrost logs to the log directory  https://review.opendev.org/c/openstack/bifrost/+/80615513:00
opendevreviewDmitry Tantsur proposed openstack/bifrost master: CI: collect keystone information  https://review.opendev.org/c/openstack/bifrost/+/80615613:00
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Keystone: deprecate the separate admin service  https://review.opendev.org/c/openstack/bifrost/+/80612513:01
viks__hi, what is the reason for the below warning in ironic-conductor.log:13:09
viks__WARNING ironic.common.nova [req-3d717ae0-f11a-4763-acc4-606d0d9bd7aa - - - - -] Could not connect to Nova to send a power notification, please check configuration. An auth plugin is required to determine endpoint URL: MissingAuthPlugin: An auth plugin is required to determine endpoint URL13:09
viks__i have `[service_catelog]` section set in `ironic.conf`13:10
timeurpittau: I think it's around 500 MB. At least the rhel train IPA images where around 600 MB which was too big. 13:22
iurygregoryenrr =(13:22
rpittaummm13:22
timeuok I checked the train rhel image is 538MB which didn't work13:23
iurygregoryI think it's probably less considering https://paste.opendev.org/show/808347/13:23
timeuso 538 MB for the rhel one and 338 MB for the ussuri upstream one 13:25
dtantsurinteresting, I think the RDO upstream images are smaller13:26
dtantsurthe master version is clearly under 500M: https://images.rdoproject.org/centos8/master/rdo_trunk/current-tripleo/13:26
dtantsureven smaller for train: https://images.rdoproject.org/train/rdo_trunk/current-tripleo/13:27
dtantsurI wonder why RHEL ended up with 500+, it should be more or less the same as RDO13:27
dtantsurviks__: it's a different section, probably [nova]13:27
viks__dtantsur: ok.. thanks13:28
dtantsuriurygregory: I'm pondering how we can have a two-stage loader for IPA. sounds doable, can be annoying in practice, of course13:28
iurygregorydtantsur, yeah it would be interesting13:29
dtantsurtimeu: this initramfs size problem, was it with iPXE or virtual media?13:38
timeudtantsur: only with virtual media. iPXE worked fine with the bigger IPA image14:05
dtantsurack, this is useful14:05
timeuboth with UEFI btw14:06
iurygregorygood to know :D14:06
timeumaybe the virtual media slot of the Lenovo machines silently truncates the iso 14:07
timeubut in the Lenovo BMC web interface it detects the correct size of the iso 14:07
iurygregoryredfish + iPXE? or did you use the xclarity driver from ironic?14:07
iurygregoryor even ipmi =)14:08
timeuvirtual media was with redfish 14:08
timeuiPXE I tried with both I think 14:08
timeuipmi and redfish 14:08
iurygregorynice!14:09
dtantsurokay, at least iPXE works14:09
* iurygregory now can at least provide some good news in the bz \o/14:10
timeuis the bz public ? 14:10
iurygregorynope =(14:11
timeuso redfish and ipmi + iPXE definately works with Lenovo machines. Virtual media has the image size limitation14:12
opendevreviewDmitry Tantsur proposed openstack/bifrost master: CI: copy bifrost logs to the log directory  https://review.opendev.org/c/openstack/bifrost/+/80615514:13
iurygregorymy brain started working fine again after reading this hehe :D14:13
timeusorry for the confusing sentence :-p14:14
opendevreviewDmitry Tantsur proposed openstack/bifrost master: CI: collect keystone information  https://review.opendev.org/c/openstack/bifrost/+/80615614:15
opendevreviewDmitry Tantsur proposed openstack/bifrost master: Keystone: deprecate the separate admin service  https://review.opendev.org/c/openstack/bifrost/+/80612514:15
dtantsurtoo many rebases..14:15
rpittauseems like the debian image is broken on DIB, some repo issue, hopefullt this https://review.opendev.org/c/openstack/diskimage-builder/+/806188 should fix it14:17
dtantsursigh14:17
rpittauthey dont test debian stable in the CI by default14:18
rpittauso anything can happen \o/14:18
TheJuliatimeu: say if a storyboard item was noting that there are not usable virtual media slots due to a size limitation inside the BMC, then I woudl happilly send that over to some folks I know at lenovo so maybe it could get put on their engineering backlog to fix...e ventually14:20
TheJuliaeventually14:20
timeuTheJulia: sorry I am not sure I can follow. The Lenovo nodes have different kinds of virtual media slots. If you use an old firmware the 2 (RODC1 and RODC2) are listed first and they don't support mounting via HTTP(s) and thus virtual media deployment via ironic fails. Upgrading the firmware changes the sort order and the EXTx slots come first. Those support mounting ISOs14:27
timeuvia http(s), however they seem to only support mounting ISOs up to ~ 400 MB (there is no error, but deployment fails). Should I update the storyboard to make this clearer ? 14:27
TheJuliatimeu: Yes, update the storyboard and I'll send it over to some folks at Lenovo14:27
dtantsurtimeu: unrelated, but do RODC* advertise supporting CD type? otherwise I'm not sure why ironic picks them.14:28
TheJuliaspecifically the size limitation being key :)14:28
TheJuliadtantsur: they may support access via something like CIFS14:28
timeuThe MediaTypes for all virtual media slots seems to be the same ("CD, DVD, Floppy, USBStick"). There is actually an issue on the Lenovo github org: https://github.com/lenovo/python-redfish-lenovo/issues/21 where I brought this up14:29
dtantsurahhh. maybe we should check TranfserProtocol as well14:29
dtantsur"TransferProtocolType" and "TransferMethod" properties are not supported yet, but per your voices, development team plan to support them on future release". sigh.14:30
TheJuliayeaaaah14:30
dtantsurmaybe we need an explicit exception for vendor == 'lenovo' and 'RODC1 in name?14:30
dtantsur'RODC' in name14:30
iurygregoryso we can only choose 'RODC' devices?14:32
iurygregorywhen using Lenovo?14:32
timeuRODC won't work14:34
timeuso we need to exclude them 14:34
timeufor Lenovo14:34
iurygregoryoh ok =)14:34
iurygregory10 slots for vmedia (I'm wondering why so many hehehe)14:34
TheJuliaLikely different customer use cases14:35
timeuTheJulia I updated the storyboard ("Update2") and tried to make it a bit clearer. Is this ok ? 14:36
TheJuliatimeu: much appreciated14:37
dtantsurTheJulia: maybe you can check my memory: do I recall it right that keystone no longer requires a separate admin API on port 35357? (re https://review.opendev.org/c/openstack/bifrost/+/806125)14:52
TheJuliadtantsur: I honestly don't remember14:53
dtantsurack, no worries14:53
dtantsurI can throw a quick patch disabling it to see what breaks14:53
opendevreviewDmitry Tantsur proposed openstack/bifrost master: DNM remove the keystone admin endpont  https://review.opendev.org/c/openstack/bifrost/+/80620714:54
opendevreviewFelix Huettner proposed openstack/python-ironicclient master: Validate path when passing a config-drive  https://review.opendev.org/c/openstack/python-ironicclient/+/80620915:31
opendevreviewRiccardo Pittau proposed openstack/ironic-python-agent-builder master: [WIP] Use native Centos 8 Stream for dib images  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/80595715:51
opendevreviewAija Jauntēva proposed openstack/ironic master: Add clear foreign config to idrac-redfish RAID  https://review.opendev.org/c/openstack/ironic/+/80484815:55
rpittauso I tested it locally and https://review.opendev.org/c/openstack/diskimage-builder/+/806188 fixes debian-minimal in ipa-builder (and dib in general)16:00
rpittausee ya tomorrow! o/16:02
*** rpittau is now known as rpittau|afk16:02
TheJuliaiurygregory: timeu: w/r/t lenovo gear, the word I'm getting back is that we do need to exclude RODC, it sounds like they are for updating the BMC....16:26
iurygregoryyeah, but the limitation about the size happens in the other vmedia slots (I think)16:27
TheJuliaiurygregory: that also sounds like a known issue16:29
iurygregory:D16:29
TheJuliaRemoteX are apparently web browser injected16:30
TheJuliatrying to clarify size stuff16:31
iurygregoryand probably there is no workaround for the size (I think)16:32
TheJuliastripping every ounce of firmware out might do it16:35
TheJuliabut at what cost to the telecoms of the wordl16:35
TheJuliawordl16:35
TheJuliagah16:35
TheJuliaworld16:35
iurygregoryI know they were using BMC 6.00 / UEFI 2.9016:37
iurygregoryI saw 6.60 on the lenovo page, so I asked them to update and try again, but the problem was the same16:38
TheJuliaiurygregory: so it seems like it is an outstanding question on the size limitation16:43
dtantsurI tried imagining how a 2-stage IPA would work.. but that's not trivial.16:43
dtantsurIPA itself is not the largest thing we ship, splitting it away doesn't bring us much.16:43
dtantsurWe can of course revive the idea of a container...16:43
dtantsurPretty much the same thing I did for CoreOS IPA, but without CoreOS :)16:43
TheJuliathe conundrum is the firmware assets really16:47
TheJuliathat stuff is already compressed, it is huge16:47
dtantsurthere is enough heavy stuff without them too16:47
TheJuliaand generally not really needed except in the fewer less common cases, but yeah16:48
TheJuliaYeah, some stuff that we can't cleanly rip out because of odd dependency chains that seem nonsensical16:48
dtantsurif we use a container with IPA, we can do without Python even16:48
dtantsur(in the 1st stage, I mean)16:48
TheJuliayeah16:48
dtantsurthis is the container we build: https://github.com/openshift/ironic-agent-image16:49
dtantsurthe last image I built was around 450 MiB16:50
opendevreviewDmitry Tantsur proposed openstack/sushy-tools master: [WIP] Support uploading certificates for virtual media  https://review.opendev.org/c/openstack/sushy-tools/+/80625317:19
dtantsursee you tomorrow folks17:19
sam_z1hey i noticed the list of power states contains "power_off"/"reboot" but also "soft_power_off"/"soft_reboot"-- what's the difference between the two?19:18
*** sshnaidm is now known as sshnaidm|afk19:32
TheJuliasam_z1: power off/reboot/power on are absolute actions20:03
TheJuliasoft actions are "hi, will you please kindly power off or power on or reboot20:03
sam_z1oh so as a node power state "soft power off" would be "someone asked me to shut myself off"?20:09
TheJuliasam_z1: exactly20:13
sam_z1cool cool!20:15
sam_z1ty!20:15
jandersgood morning Ironic o/22:41
jandersback to the living after pfizer-122:49
stevebakerjanders: welcome back23:11
jandersthank you stevebaker :)23:12

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!